
Module 4

Bayesian learning is a probabilistic approach to machine learning that uses Bayes' Theorem to
update the probability of hypotheses as new data is observed, combining prior knowledge and
evidence to make informed predictions and decisions.

It treats learning as a process of probabilistic inference: the goal is to estimate the posterior distribution over hypotheses given the observed data, combining prior beliefs with the likelihood of the data under those hypotheses.

 Bayes' Theorem: Central to Bayesian learning, it mathematically relates the prior probability of a hypothesis P(h), the likelihood of the observed data given the hypothesis P(D|h), and the posterior probability P(h|D), the updated belief about the hypothesis after seeing the data:

P(h|D) = P(D|h) · P(h) / P(D)

 Prior Probability: Represents initial beliefs about hypotheses before seeing data.

 Likelihood: Probability of observing the data assuming a particular hypothesis is true.

 Posterior Probability: Updated probability of the hypothesis after considering the data.

 Hypothesis Space: The set of all possible models or explanations considered.

How Bayesian Learning Works:

 Starts with prior beliefs about hypotheses.

 Observes data and calculates how likely this data is under each hypothesis.

 Updates beliefs to form posterior probabilities.

 Uses posterior to make predictions or decisions.

 Allows combining prior knowledge with new evidence flexibly.
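
A minimal sketch of these steps in Python, assuming a toy discrete hypothesis space (the candidate hypotheses and observations below are hypothetical, purely for illustration):

```python
# Bayesian updating over a small discrete hypothesis space.
# Hypotheses: three candidate biases of a coin (hypothetical example).
hypotheses = [0.3, 0.5, 0.8]                              # P(heads) under each hypothesis
prior = {h: 1 / len(hypotheses) for h in hypotheses}      # uniform prior beliefs

def likelihood(h, outcome):
    """Probability of one observed flip under hypothesis h."""
    return h if outcome == "H" else 1 - h

data = ["H", "H", "T", "H"]                               # observed evidence

posterior = dict(prior)
for outcome in data:
    # multiply the current belief by the likelihood of the new observation
    posterior = {h: posterior[h] * likelihood(h, outcome) for h in hypotheses}
    # normalize so the probabilities sum to 1
    total = sum(posterior.values())
    posterior = {h: p / total for h, p in posterior.items()}

print(posterior)   # updated beliefs; predictions can use the full posterior
```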

Advantages:

 Incorporates prior knowledge explicitly.

 Provides a probabilistic framework that quantifies uncertainty.

 Can combine predictions from multiple hypotheses weighted by their probabilities.

 Offers a principled approach to decision-making under uncertainty.

 Robust to overfitting compared to some other methods because it considers distributions over parameters rather than point estimates.

Example

 Suppose a drug test is 98% accurate:

 If a person uses the drug, the test correctly detects it 98% of the time (true positive
rate).

 If a person does not use the drug, the test correctly shows negative 98% of the time
(true negative rate).

 The prevalence of drug use in the population is 0.5% (0.005 probability).

 A person tests positive. What is the probability that this person actually uses the drug?

Solution

Step 1: Define the events

 A: Person uses the drug.

 B: Test result is positive.

Step 2: Identify the known probabilities

 P(A) = 0.005 (0.5% of people use the drug)

 P(¬A) = 1 - P(A) = 0.995 (person does not use the drug)

 P(B|A) = 0.98 (test is positive if person uses the drug)

 P(B|¬A) = 0.02 (false positive rate: test is positive even if person does not use the drug)

Step 3: Calculate the total probability of testing positive, P(B)

This includes both true positives and false positives:

P(B) = P(B|A) · P(A) + P(B|¬A) · P(¬A) = 0.98 × 0.005 + 0.02 × 0.995 = 0.0049 + 0.0199 = 0.0248

Step 4: Apply Bayes' Theorem to find P(A|B)

P(A|B) = P(B|A) · P(A) / P(B) = 0.0049 / 0.0248 ≈ 0.198

So despite the test being 98% accurate, a person who tests positive has only about a 19.8% chance of actually using the drug, because drug use is rare in the population.
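
The same calculation as a quick Python check:

```python
# Bayes' Theorem for the drug-test example
p_use = 0.005               # P(A): prevalence of drug use
p_pos_given_use = 0.98      # P(B|A): true positive rate
p_pos_given_clean = 0.02    # P(B|¬A): false positive rate

# Total probability of a positive test (true positives + false positives)
p_pos = p_pos_given_use * p_use + p_pos_given_clean * (1 - p_use)

# Posterior probability of drug use given a positive test
p_use_given_pos = p_pos_given_use * p_use / p_pos

print(f"P(B)   = {p_pos:.4f}")            # 0.0248
print(f"P(A|B) = {p_use_given_pos:.4f}")  # ≈ 0.1976
```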


Reasons for learning Bayesian algorithms

1. To calculate explicit probabilities for hypotheses
2. To provide a useful perspective for understanding learning algorithms that do not explicitly manipulate probabilities
3. To understand why algorithms that minimize the mean squared error (such as neural network training) can be shown to output maximum likelihood hypotheses

Key features

 Incremental updating of hypothesis probabilities: Each observed training example can increase or decrease the estimated probability that a hypothesis is correct, allowing more flexible learning than methods that discard hypotheses after a single inconsistency.

 Incorporation of prior knowledge: Bayesian learning combines prior probabilities (previous knowledge or beliefs about hypotheses) with observed data to compute the posterior probability of hypotheses.

 Probabilistic predictions: It can handle hypotheses that make probabilistic, rather than deterministic, predictions about data.

 Combining multiple hypotheses: New instances can be classified by aggregating predictions from multiple hypotheses weighted by their posterior probabilities, rather than relying on a single best hypothesis.

 Optimal decision-making benchmark: Even when computationally expensive or intractable, Bayesian methods provide a theoretical standard of optimal decision-making against which other practical algorithms can be measured.

 Handling uncertainty and sparse data: Bayesian learning explicitly models uncertainty and is particularly effective when data is limited or noisy, as it integrates prior knowledge and evidence probabilistically.

 Computational considerations: Bayesian methods may require significant computational resources and prior probability estimates, which can be challenging but sometimes reducible in special cases.

Limitations

1. Typically requires initial knowledge of many probabilities

2. Significant computational cost is required to determine the Bayes optimal hypothesis

The Brute Force MAP (Maximum A Posteriori) Learning algorithm is a straightforward Bayesian
concept learning method that finds the most probable hypothesis given training data by exhaustively
evaluating all hypotheses in a finite hypothesis space.

Key assumptions:

 Training data is noise-free (no mislabelling).

 Target concept is contained in the hypothesis space.

 Prior probabilities are uniform (no hypothesis favoured a priori).
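
Under these assumptions, the posterior takes a particularly simple form (the standard result for this setting): every hypothesis consistent with the training data receives the same posterior probability, and every inconsistent hypothesis receives zero:

P(h|D) = 1 / |VS_H,D| if h is consistent with D, and P(h|D) = 0 otherwise

where VS_H,D, the version space, is the subset of hypotheses in H that are consistent with D. With uniform priors and noise-free data, every consistent hypothesis is therefore a MAP hypothesis.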


The MAP hypothesis (Maximum A Posteriori hypothesis) is the hypothesis that maximizes
the posterior probability given the observed data. In other words, it is the most probable hypothesis
after combining both the likelihood of the data under the hypothesis and the prior belief about the
hypothesis.

Mathematically, it is defined as:

h_MAP = argmax over h in H of P(h|D)
      = argmax over h in H of P(D|h) · P(h) / P(D)
      = argmax over h in H of P(D|h) · P(h)    (P(D) is constant across hypotheses)

where:

 h is a hypothesis,

 D is the observed data,

 P(D∣h) is the likelihood of the data given hypothesis h

 P(h) is the prior probability of hypothesis h

 P(h∣D) is the posterior probability of h given the data.


The MAP hypothesis is the best guess for the true hypothesis after considering both the observed
data and prior beliefs.
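
A minimal brute-force MAP sketch in Python; the hypothesis space, prior, and likelihood used here are hypothetical placeholders, purely to illustrate the exhaustive evaluation:

```python
import math

def brute_force_map(hypotheses, prior, likelihood, data):
    """Return the MAP hypothesis by evaluating every h in a finite hypothesis space.

    hypotheses : iterable of candidate hypotheses
    prior      : function h -> P(h)
    likelihood : function (data, h) -> P(D|h)
    data       : the observed training data D
    """
    best_h, best_score = None, -math.inf
    for h in hypotheses:
        # P(h|D) is proportional to P(D|h) * P(h); P(D) is the same for every h
        score = likelihood(data, h) * prior(h)
        if score > best_score:
            best_h, best_score = h, score
    return best_h

# Hypothetical usage: which coin bias best explains 8 heads out of 10 flips?
hypotheses = [0.2, 0.5, 0.8]
prior = lambda h: 1 / 3                                          # uniform prior
likelihood = lambda d, h: h ** d["heads"] * (1 - h) ** d["tails"]
print(brute_force_map(hypotheses, prior, likelihood, {"heads": 8, "tails": 2}))
```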

NAÏVE-BAYES CLASSIFIER

The Naive Bayes classifier is a simple and popular supervised machine learning algorithm used for classification tasks, such as text classification or spam detection. It is based on Bayes' Theorem and assumes that all features (predictors) are conditionally independent given the class label; this is called the naive independence assumption.

The predicted class is the one with the highest posterior probability:

ŷ = argmax over C_k of P(C_k) · P(x_1|C_k) · P(x_2|C_k) · ... · P(x_n|C_k)

where:

 P(C_k) is the prior probability of class C_k.

 P(x_i|C_k) is the likelihood of feature x_i given class C_k.

 The denominator P(x_1, x_2, ..., x_n) is constant for all classes and is therefore omitted in classification.

Numerical example below
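
As an illustration, a minimal sketch in Python on a hypothetical two-feature dataset (the data and counts are invented for demonstration):

```python
# Naive Bayes on a tiny hypothetical dataset: predict Play from Outlook and Wind.
data = [
    ("Sunny", "Weak",   "No"),
    ("Sunny", "Strong", "No"),
    ("Rain",  "Weak",   "Yes"),
    ("Rain",  "Strong", "No"),
    ("Sunny", "Weak",   "Yes"),
    ("Rain",  "Weak",   "Yes"),
]

def naive_bayes_predict(outlook, wind):
    scores = {}
    for c in ("Yes", "No"):
        rows = [r for r in data if r[2] == c]
        prior = len(rows) / len(data)                                 # P(C_k)
        p_outlook = sum(r[0] == outlook for r in rows) / len(rows)   # P(x1|C_k)
        p_wind = sum(r[1] == wind for r in rows) / len(rows)         # P(x2|C_k)
        scores[c] = prior * p_outlook * p_wind                       # naive independence
    return max(scores, key=scores.get), scores

# For Outlook=Sunny, Wind=Weak: score(Yes)=0.5*1/3*1≈0.167, score(No)=0.5*2/3*1/3≈0.111
print(naive_bayes_predict("Sunny", "Weak"))
```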


BBN(Bayesian belief network)

A Bayesian Belief Network (BBN) is like a smart map that shows how different things (variables) are
connected and influence each other, using probabilities. Imagine you want to understand how
weather, traffic, and being late to work are related. A BBN draws arrows from one factor to another
to show which causes which, and uses numbers (probabilities) to express how likely things are given
other things.

In simple terms:

 It’s a diagram made of nodes and arrows; each node is a variable (like "Rain" or "Traffic
Jam").

 The arrows show cause-and-effect relationships (e.g., rain can cause traffic jams).

 Each node has a table that tells you the chance of that variable happening given its causes.

 When you get new information (like it’s raining), the network updates the chances of related
events (like traffic jams or being late).

This helps in making decisions or predictions when things are uncertain by combining what you
know with new evidence.

Mathematical definition

The joint probability distribution over all variables X_1, X_2, ..., X_n in a Bayesian Belief Network (BBN) can be expressed as the product of the conditional probabilities of each variable given its parents in the network:

P(X_1, X_2, ..., X_n) = P(X_1|Parents(X_1)) · P(X_2|Parents(X_2)) · ... · P(X_n|Parents(X_n))

Why this split helps:

 Instead of calculating the probability of every possible combination of all variables together
(which grows exponentially and becomes infeasible for many variables),

 The Bayesian network breaks down the joint probability into smaller, manageable pieces.

 Each variable depends only on its parent variables (the nodes with arrows pointing to it).

 By multiplying these conditional probabilities, you get the overall joint probability.
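
A minimal sketch of this factorization in Python, using the Rain → Traffic Jam → Late-to-work chain from the earlier description (the network structure and CPT values are assumed here purely for illustration):

```python
# Joint probability via the BBN factorization:
# P(Rain, Traffic, Late) = P(Rain) * P(Traffic | Rain) * P(Late | Traffic)
# Assumed structure: Rain -> Traffic Jam -> Late to Work; CPT values are made up.
p_rain = {True: 0.2, False: 0.8}                        # P(Rain)
p_traffic_given_rain = {True: 0.7, False: 0.1}          # P(Traffic=True | Rain)
p_late_given_traffic = {True: 0.6, False: 0.05}         # P(Late=True | Traffic)

def joint(rain, traffic, late):
    pt = p_traffic_given_rain[rain] if traffic else 1 - p_traffic_given_rain[rain]
    pl = p_late_given_traffic[traffic] if late else 1 - p_late_given_traffic[traffic]
    return p_rain[rain] * pt * pl

# Probability that it rains, there is a traffic jam, and you are late:
print(joint(rain=True, traffic=True, late=True))        # 0.2 * 0.7 * 0.6 = 0.084

# Updating with evidence by enumeration: P(Traffic=True | Rain=True)
num = sum(joint(True, True, late) for late in (True, False))
den = sum(joint(True, t, late) for t in (True, False) for late in (True, False))
print(num / den)                                        # 0.7
```
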
Gradient ascent training of Bayesian networks is an optimization method that updates the
conditional probabilities in the network by following the gradient of the log-likelihood of observed
data, improving the network’s fit to data iteratively.

 Bayesian network with a fixed structure (nodes and edges).

 Each node has a conditional probability table (CPT), where each entry w_ijk represents the probability that variable Y_i takes its j-th value y_ij given that its parents U_i take the k-th combination of values u_ik.

 The goal is to find the set of w_ijk values that maximizes the probability of the observed training data D, i.e., maximize P(D|h), where h represents the hypothesis defined by the CPTs.
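
As a sketch (following the usual derivation for this setting; the notation may differ slightly from the course notes), gradient ascent moves each CPT entry in the direction of the gradient of ln P(D|h) and then renormalizes:

w_ijk ← w_ijk + η · Σ over d in D of [ P(Y_i = y_ij, U_i = u_ik | d) / w_ijk ]

where η is a small learning rate and P(Y_i = y_ij, U_i = u_ik | d) is computed by inference in the current network for each training example d. After each step, the entries w_ijk for a fixed i and k are renormalized so that they sum to 1 and remain valid probabilities.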
Derivation
Example
