Bayesian_theory_daniel_restrepo
For example, if you know you have a fair coin, the probability of heads is 0.5; but if you suspect you have a loaded coin, you can estimate that probability by modelling the tosses as a Bernoulli process.
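A minimal sketch of this idea in Python (the bias true_p and the Beta prior below are assumptions invented for illustration): simulate Bernoulli tosses and estimate the head probability, both by maximum likelihood and, in the Bayesian spirit, with a prior.

import random

# A minimal sketch (all numbers are invented): estimating the head
# probability p of a possibly loaded coin from Bernoulli trials.
random.seed(0)
true_p = 0.7                      # the unknown bias we try to recover
tosses = [1 if random.random() < true_p else 0 for _ in range(100)]

# Maximum-likelihood estimate: the fraction of heads.
p_ml = sum(tosses) / len(tosses)

# Bayesian estimate: with a Beta(a, b) prior on p, the posterior after
# h heads and t tails is Beta(a + h, b + t); its mean is computed below.
a, b = 1.0, 1.0                   # uniform prior (an assumption)
h, t = sum(tosses), len(tosses) - sum(tosses)
p_bayes = (a + h) / (a + b + h + t)

print(f"ML estimate: {p_ml:.3f}, posterior mean: {p_bayes:.3f}")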
The running example analyses past transactions: a bank wants to identify good (low-risk) and bad (high-risk) customers from their bank accounts. The customers' yearly income and savings are the observed variables, and they are fundamental to building the classification model.
Bayes’ rule
Bayes' rule combines four concepts: the posterior, the prior (from historical data), the likelihood (a conditional probability, an IF-THEN style rule), and the evidence. For the joint probability to be well defined, the classes must be mutually exclusive and exhaustive.
• Prior: P(C=1) is called the prior probability that C takes the value 1. It depends on the situation and encodes our knowledge before seeing the observation.
• Likelihood: p(x|C) is the conditional probability that an event belonging to class C has the associated observation value x.
• Evidence: p(x) is the marginal probability that an observation x is seen, regardless of whether it is a positive or a negative example.
• Posterior: combining the prior and what the data tell us through Bayes' rule gives the posterior probability, P(C|x) = p(x|C) P(C) / p(x).
The Bayes' classifier chooses the class with the highest posterior probability: after calculating the posterior probability of each class, pick the one for which it is largest, as in the sketch below.
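A minimal sketch of Bayes' rule and this decision rule on the bank example; the prior and the likelihood values are invented numbers standing in for what would be estimated from the historical data.

# Minimal sketch: Bayes' classifier for the bank example, with classes
# C=0 (low-risk) and C=1 (high-risk). All numbers are invented.
prior = {0: 0.7, 1: 0.3}           # P(C), e.g. from past customers

# Likelihood P(x|C) of a discretised observation x, here whether the
# customer's savings are "high" or "low" (an assumed encoding).
likelihood = {
    ("high", 0): 0.8, ("high", 1): 0.3,
    ("low", 0): 0.2,  ("low", 1): 0.7,
}

def posterior(x):
    # Evidence P(x): the marginal over both classes.
    evidence = sum(likelihood[(x, c)] * prior[c] for c in prior)
    # Bayes' rule: P(C|x) = P(x|C) P(C) / P(x).
    return {c: likelihood[(x, c)] * prior[c] / evidence for c in prior}

post = posterior("low")
chosen = max(post, key=post.get)   # class with the highest posterior
print(post, "-> choose class", chosen)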
Define the action αi as the decision to assign the input to class Ci, and λik as the loss incurred for taking action αi when the input actually belongs to Ck. The expected risk of an action is R(αi|x) = Σk λik P(Ck|x), and the chosen action is the one with minimum risk. The loss λik thus depends on two things: the action taken (i) and the true class (k). For the bank example, with α0 = give the loan and α1 = deny it:
λik           C0 (low risk)    C1 (high risk)
α0 (give)           0                 1
α1 (deny)           1                 0
With this 0/1 loss the risk of choosing class Ci is R(αi|x) = 1 − P(Ci|x), so the minimum-risk action satisfies

1 − P(Ci|x) = min over k of (1 − P(Ck|x)), equivalently P(Ci|x) = max over k of P(Ck|x).
Adding a reject option as an extra action αK+1, the loss becomes

λik = 0 if i = k (correct decision), λ if i = K+1 (reject), 1 otherwise,

where 0 < λ < 1 is the cost of rejecting.
Then compare the risks of all actions, including the reject action, and take the one with minimum risk. Since the risk of choosing Ci is 1 − P(Ci|x) and the risk of rejecting is λ, the rule becomes:

choose Ci if P(Ci|x) > P(Ck|x) for all k ≠ i and P(Ci|x) > 1 − λ; reject otherwise.
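A minimal sketch of this decision rule with a reject option; the reject cost lam and the posterior values are invented.

# Minimal sketch: minimum-risk classification with a reject option.
# The reject cost lam and the posteriors are invented.
lam = 0.25                          # cost of rejecting, 0 < lam < 1

def decide(post):                   # post maps class k -> P(Ck|x)
    c_best = max(post, key=post.get)
    # Risk of choosing c_best is 1 - P(c_best|x); rejecting costs lam.
    if 1.0 - post[c_best] < lam:
        return c_best
    return "reject"

print(decide({0: 0.9, 1: 0.1}))     # confident enough -> class 0
print(decide({0: 0.55, 1: 0.45}))   # risk 0.45 >= lam -> reject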
In some cases wrong decisions (misclassifications) may have a very high cost; it is then better to reject the input and pass it to a more complex (for example, manual) decision system.
Discriminant functions
The aim is to establish a model able to discriminate between classes. Classification can also be seen as implementing a set of discriminant functions gi(x), i = 1, …, K, where we choose Ci if gi(x) = max over k of gk(x); these functions partition the feature space into decision regions separated by decision boundaries. (Figure 1)
Figure 1: example of decision regions and decision boundaries. (Alpaydin, 2004)
For two classes, instead of two separate discriminants we can define a single one, g(x) = g1(x) − g2(x), and

choose C1 if g(x) > 0, C2 otherwise.
This is the two-class learning problem, where the positive examples can be taken as C1 and the negative examples as C2.
Classification system
• Dichotomizer: K = 2 classes.
• Polychotomizer: K > 2 classes.
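A minimal sketch of both kinds of classifier; the linear weights below are assumptions (in practice they would be learned from data).

# Minimal sketch of a dichotomizer and a polychotomizer built from
# linear discriminants; all weights are invented.

def g(w, w0, x):                    # linear discriminant g(x) = w.x + w0
    return sum(wi * xi for wi, xi in zip(w, x)) + w0

# Dichotomizer (K = 2): a single g(x) = g1(x) - g2(x); its sign decides.
def dichotomize(x):
    gx = g([1.0, -2.0], 0.5, x)     # assumed weights for g1 - g2
    return "C1" if gx > 0 else "C2"

# Polychotomizer (K > 2): choose the class whose gi(x) is largest.
gs = {"C1": ([1.0, 0.0], 0.0),
      "C2": ([0.0, 1.0], 0.1),
      "C3": ([-1.0, -1.0], 0.2)}

def polychotomize(x):
    return max(gs, key=lambda c: g(gs[c][0], gs[c][1], x))

print(dichotomize([2.0, 0.5]), polychotomize([2.0, 0.5]))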
Utility Theory
Utility theory generalizes the expected-risk approach: we define the utility Uik of taking action αi when the state is Sk, compute the expected utility EU(αi|x) = Σk Uik P(Sk|x), and choose the action that maximizes it (with Uik = −λik this is the same as minimizing expected risk). Utility theory is concerned with making rational decisions when we are uncertain about the state.
Note that maximizing expected utility is just one possibility; one may define other
types of rational behaviour, for example, minimizing worst possible loss.
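A minimal sketch of expected-utility maximization on the loan decision; the utility table and the posterior are invented numbers.

# Minimal sketch: choose the action that maximizes expected utility
# EU(a|x) = sum_k U[a][k] * P(Sk|x). All numbers are invented.
U = {"give": {0: 10.0, 1: -50.0},   # utility of granting the loan
     "deny": {0: -5.0, 1: 0.0}}     # utility of denying it

post = {0: 0.8, 1: 0.2}             # P(state|x), an assumed posterior

eu = {a: sum(U[a][k] * post[k] for k in post) for a in U}
print(eu, "-> take action", max(eu, key=eu.get))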
Value of information
It is also relevant to evaluate the quality of information: a new piece of information (for example, an extra observed variable) is good only to the extent that knowing it can change the chosen action, and its value can be measured as the resulting increase in expected utility.
Bayesian networks
This method, also called a belief network or probabilistic network, is one of the most used methods at the moment:
• Graphical model.
• Representing interaction between variables visually.
• Composed of nodes and arcs between the nodes.
• Each node corresponds to a random variable, X, and carries the probability of that variable's values.
• A directed arc from X to Y means that X has a direct influence on Y.
• The graph is a directed acyclic graph (DAG): there are no cycles.
• The nodes and the arcs define the structure of the network.
For the model given, the independencies between C, R, and S are encoded explicitly in the graph. This is part of the advantage of Bayesian networks: they make independencies explicit and allow breaking inference down into calculations over small groups of variables, as in the sketch below.
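Since the figure with the model is not reproduced in these notes, here is a minimal sketch that assumes the classic structure cloudy (C) pointing to sprinkler (S) and rain (R), which both point to wet grass (W); every probability below is invented. It shows how the DAG factors the joint into small pieces and supports inference by enumeration.

# Minimal sketch of a Bayesian network (assumed structure, invented
# probabilities): C -> S, C -> R, and (S, R) -> W.
P_C = 0.5                                    # P(C = true)
P_S = {True: 0.1, False: 0.5}                # P(S = true | C)
P_R = {True: 0.8, False: 0.2}                # P(R = true | C)
P_W = {(True, True): 0.99, (True, False): 0.9,
       (False, True): 0.9, (False, False): 0.0}   # P(W = true | S, R)

def joint(c, s, r, w):
    # Chain rule along the DAG: P(C,S,R,W) = P(C) P(S|C) P(R|C) P(W|S,R)
    p = P_C if c else 1 - P_C
    p *= P_S[c] if s else 1 - P_S[c]
    p *= P_R[c] if r else 1 - P_R[c]
    p *= P_W[(s, r)] if w else 1 - P_W[(s, r)]
    return p

# Inference by enumeration: P(R = true | W = true).
bools = (True, False)
num = sum(joint(c, s, True, True) for c in bools for s in bools)
den = sum(joint(c, s, r, True) for c in bools for s in bools for r in bools)
print(f"P(R | W) = {num / den:.3f}")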
Junction Tree
An algorithm that converts a given directed acyclic graph into a tree by clustering variables, so that belief propagation can be performed efficiently.
One of the best advantages of using Bayesian networks is that we do not need to designate explicitly certain variables as input and certain others as output. The values of any set of variables can be set through evidence, the probabilities of any other set of variables can be inferred, and the difference between unsupervised and supervised learning becomes blurred.
Bayesian networks, classification
[Figure: a simple two-node network over variables A and B]
Influence diagrams
Association Rules
An association rule is an implication of the form X → Y.
There are two measures: support shows the statistical significance of the rule, whereas confidence shows its strength. Support(X → Y) is P(X, Y), the fraction of transactions containing both X and Y; confidence(X → Y) is P(Y|X) = P(X, Y) / P(X).
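A minimal sketch computing both measures over a toy list of market-basket transactions (the items are invented).

# Minimal sketch: support and confidence of a rule X -> Y over a toy
# list of transactions, each represented as a set of items.
transactions = [
    {"milk", "bread"}, {"milk", "bread", "butter"},
    {"bread"}, {"milk", "butter"}, {"milk", "bread"},
]

def support(itemset):
    # P(itemset): fraction of transactions containing all of its items.
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(x, y):
    # P(Y|X) = P(X, Y) / P(X): the strength of the rule X -> Y.
    return support(x | y) / support(x)

print("support(milk, bread):", support({"milk", "bread"}))
print("confidence(milk -> bread):", confidence({"milk"}, {"bread"}))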
References
Alpaydin, E. Introduction to Machine Learning. The MIT Press, October 2004. ISBN 0-262-01211-1.
Seeger, M. Bayesian Modelling for Data Analysis and Learning from Data. Max Planck Institute for Biological Cybernetics, March 18, 2006. Notes providing clarifying remarks and definitions complementing the course Bayesian Modelling for Data Analysis and Learning from Data, held at IK 2006. http://www.kyb.tuebingen.mpg.de/bs/people/seeger/papers/handout.pdf