learningtheory-bns

The document discusses the concept of VC dimension in machine learning, explaining how it measures the capacity of hypothesis spaces to classify points accurately. It also introduces Bayesian networks, highlighting their ability to represent complex probability distributions and exploit conditional independencies. The document covers various examples and applications of these concepts in statistical AI, including decision trees, neural networks, and real-world applications like diagnosis and anomaly detection.

VC Dimension

Machine Learning – 10701/15781


Carlos Guestrin
Carnegie Mellon University

October 29th, 2007


©2005-2007 Carlos Guestrin

What about continuous hypothesis spaces?

• Continuous hypothesis space:
  • |H| = ∞
  • Infinite variance???
• As with decision trees, only care about the maximum number of points that can be classified exactly!

How many points can a linear boundary classify exactly? (1-D)

How many points can a linear boundary classify exactly? (2-D)
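
The worked answers to these shattering questions are drawn on the board and did not survive extraction. As a rough companion, here is a small brute-force check (not from the lecture; the function names and point placements are mine). It samples random linear boundaries and tests whether every labeling of a point set can be realized. A printed True is conclusive; a False is only conclusive when some labeling is provably unrealizable, as with the XOR labeling of four points below.

```python
import itertools
import numpy as np

def can_shatter(points, predict, hypotheses):
    """Brute-force shattering check: can the hypothesis class realize every
    +/-1 labeling of `points`?  `predict(h, x)` is the label h assigns to x."""
    points = np.asarray(points, dtype=float)
    achievable = {tuple(predict(h, x) for x in points) for h in hypotheses}
    return all(lab in achievable
               for lab in itertools.product((-1, +1), repeat=len(points)))

# Linear boundaries in 2-D: h = (w, b), label = sign(w.x + b).
rng = np.random.default_rng(0)
linear_hyps = [(rng.normal(size=2), rng.normal()) for _ in range(20000)]
lin_predict = lambda h, x: int(np.sign(h[0] @ x + h[1]) or 1)   # treat 0 as +1

print(can_shatter([(0, 0), (1, 0), (0, 1)], lin_predict, linear_hyps))          # True: 3 points shattered
print(can_shatter([(0, 0), (1, 0), (0, 1), (1, 1)], lin_predict, linear_hyps))  # False: XOR labeling is impossible
```

This matches the VC(H) = d+1 = 3 result for 2-D linear classifiers stated a few slides later.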

How many points can a linear boundary classify exactly? (d-D)

PAC bound using VC dimension

• The maximum number of training points that can be classified exactly is the VC dimension!!!
• Measures relevant size of hypothesis space, as with decision trees with k leaves

Shattering a set of points
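
The definition itself is handwritten on the original slide; the standard statement is that a hypothesis space H shatters a finite set of points if it can realize every possible labeling of them:

$$\mathcal{H} \text{ shatters } \{x_1,\dots,x_m\} \;\iff\; \forall\,(y_1,\dots,y_m) \in \{-1,+1\}^m \;\;\exists\, h \in \mathcal{H}: \; h(x_i) = y_i \text{ for all } i.$$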

VC dimension

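As above, the slide's working is on the board; the standard definition this title refers to is

$$VC(\mathcal{H}) \;=\; \max\bigl\{\, m : \text{some set of } m \text{ points is shattered by } \mathcal{H} \,\bigr\},$$

with $VC(\mathcal{H}) = \infty$ if arbitrarily large sets can be shattered.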

PAC bound using VC dimension

• The maximum number of training points that can be classified exactly is the VC dimension!!!
• Measures relevant size of hypothesis space, as with decision trees with k leaves
• Bound for infinite-dimensional hypothesis spaces:
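
The bound itself appears as an equation image on the slide and is missing from the extracted text; a commonly stated form (the exact constants vary by source) is that, with probability at least $1-\delta$ over $m$ training examples,

$$\mathrm{error}_{\mathrm{true}}(h) \;\le\; \mathrm{error}_{\mathrm{train}}(h) + \sqrt{\frac{VC(\mathcal{H})\left(\ln\frac{2m}{VC(\mathcal{H})}+1\right) + \ln\frac{4}{\delta}}{m}}.$$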

Examples of VC dimension

• Linear classifiers:
  • VC(H) = d+1, for d features plus constant term b
• Neural networks:
  • VC(H) = #parameters
  • Local minima mean NNs will probably not find the best parameters
• 1-Nearest neighbor?

Another VC dim. example – What can we shatter?

• What’s the VC dim. of decision stumps in 2d?

Another VC dim. example – What can’t we shatter?

• What’s the VC dim. of decision stumps in 2d? (A brute-force check follows below.)
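
To explore this question numerically, the brute-force checker from the 2-D linear-boundary sketch above can be reused with a grid of decision stumps (again, the names and the specific point placements are mine, not the lecture's):

```python
import numpy as np  # plus can_shatter from the earlier sketch

# Decision stumps in 2-D: pick a coordinate, a threshold, and a sign.
stump_hyps = [(axis, thr, sign)
              for axis in (0, 1)
              for thr in np.linspace(-0.5, 2.5, 61)
              for sign in (-1, +1)]
stump_predict = lambda h, x: h[2] * (1 if x[h[0]] > h[1] else -1)

three = [(0, 0), (1, 1), (2, 0)]   # the point in the middle along x is extreme along y
print(can_shatter(three, stump_predict, stump_hyps))                 # True
print(can_shatter(three + [(1.5, 0.5)], stump_predict, stump_hyps))  # False for this 4th point
```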

What you need to know

• Finite hypothesis space
  • Derive results
    • Counting the number of hypotheses
    • Mistakes on training data
• Complexity of the classifier depends on the number of points that can be classified exactly
  • Finite case – decision trees
  • Infinite case – VC dimension
• Bias-variance tradeoff in learning theory
• Remember: will your algorithm find the best classifier?

Bayesian Networks – Representation

Machine Learning – 10701/15781

Carlos Guestrin
Carnegie Mellon University

October 29th, 2007

©2005-2007 Carlos Guestrin

Handwriting recognition

Character recognition, e.g., kernel SVMs

[Figure: examples of handwritten characters to be classified]

Webpage classification

Company home page
vs
Personal home page
vs
University home page
vs

Handwriting recognition 2

Webpage classification 2


Today – Bayesian networks

• One of the most exciting advancements in statistical AI in the last 10-15 years
• Generalizes naïve Bayes and logistic regression classifiers
• Compact representation for exponentially-large probability distributions
• Exploit conditional independencies

Causal structure

• Suppose we know the following:
  • The flu causes sinus inflammation
  • Allergies cause sinus inflammation
  • Sinus inflammation causes a runny nose
  • Sinus inflammation causes headaches
• How are these connected?

Possible queries

[Figure: BN with Flu, Allergy → Sinus → Headache, Nose]

• Inference
• Most probable explanation
• Active data collection

Car starts BN

• 18 binary attributes
• Inference
  • P(BatteryAge | Starts=f)
  • 2^16 terms (the naive sum is over the 16 remaining variables), why so fast?
• Not impressed?
  • HailFinder BN – more than 3^54 = 58149737003040059690390169 terms

Factored joint distribution – Preview

[Figure: BN with Flu, Allergy → Sinus → Headache, Nose]

Number of parameters

[Figure: BN with Flu, Allergy → Sinus → Headache, Nose]

Key: Independence assumptions

[Figure: BN with Flu, Allergy → Sinus → Headache, Nose]

Knowing sinus separates the variables from each other

(Marginal) Independence

• Flu and Allergy are (marginally) independent

[Table to fill in: marginals over Flu = t/f and Allergy = t/f]

• More generally:

[Table to fill in: joint over Flu ∈ {t, f} × Allergy ∈ {t, f}]

Marginally independent random variables

• Sets of variables X, Y
• X is independent of Y if
  P ⊨ (X = x ⊥ Y = y), ∀ x ∈ Val(X), y ∈ Val(Y)
• Shorthand:
  • Marginal independence: P ⊨ (X ⊥ Y)
• Proposition: P satisfies (X ⊥ Y) if and only if
  P(X,Y) = P(X) P(Y) (a numeric illustration follows below)
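
As a concrete illustration of the proposition (the numbers below are made up for the example, not taken from the lecture):

```python
import numpy as np

# A hypothetical joint P(Flu, Allergy); rows: Flu = t/f, columns: Allergy = t/f.
P = np.array([[0.02, 0.08],
              [0.18, 0.72]])
P_flu = P.sum(axis=1)        # marginal P(Flu)
P_allergy = P.sum(axis=0)    # marginal P(Allergy)

# Independence holds exactly when the joint equals the outer product of the marginals.
print(np.allclose(P, np.outer(P_flu, P_allergy)))   # True
```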

Conditional independence

• Flu and Headache are not (marginally) independent
• Flu and Headache are independent given Sinus infection
• More generally:

Conditionally independent random variables

• Sets of variables X, Y, Z
• X is independent of Y given Z if
  P ⊨ (X = x ⊥ Y = y | Z = z), ∀ x ∈ Val(X), y ∈ Val(Y), z ∈ Val(Z)
• Shorthand:
  • Conditional independence: P ⊨ (X ⊥ Y | Z)
  • For P ⊨ (X ⊥ Y | ∅), write P ⊨ (X ⊥ Y)
• Proposition: P satisfies (X ⊥ Y | Z) if and only if
  P(X,Y|Z) = P(X|Z) P(Y|Z)
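
An equivalent reading of the proposition (assuming $P(Y=y \mid Z=z) > 0$) is that, once Z is known, also observing Y does not change the distribution of X:

$$P(X \mid Y, Z) \;=\; \frac{P(X, Y \mid Z)}{P(Y \mid Z)} \;=\; \frac{P(X \mid Z)\,P(Y \mid Z)}{P(Y \mid Z)} \;=\; P(X \mid Z).$$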

Properties of independence

• Symmetry:
  • (X ⊥ Y | Z) ⇒ (Y ⊥ X | Z)
• Decomposition:
  • (X ⊥ Y,W | Z) ⇒ (X ⊥ Y | Z)
• Weak union:
  • (X ⊥ Y,W | Z) ⇒ (X ⊥ Y | Z,W)
• Contraction:
  • (X ⊥ W | Y,Z) & (X ⊥ Y | Z) ⇒ (X ⊥ Y,W | Z)
• Intersection:
  • (X ⊥ Y | W,Z) & (X ⊥ W | Y,Z) ⇒ (X ⊥ Y,W | Z)
  • Only for positive distributions!
  • P(α) > 0, ∀α, α ≠ ∅

The independence assumption

[Figure: BN with Flu, Allergy → Sinus → Headache, Nose]

Local Markov Assumption: A variable X is independent of its non-descendants given its parents

Explaining away

Local Markov Assumption: A variable X is independent of its non-descendants given its parents

[Figure: BN with Flu, Allergy → Sinus → Headache, Nose]

Naïve Bayes revisited

Local Markov Assumption: A variable X is independent of its non-descendants given its parents
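
The figure on this slide is missing from the extracted text; it presumably shows the naïve Bayes network, in which the class variable is the single parent of every feature. Under the local Markov assumption the features are conditionally independent given the class, so the joint factors as

$$P(C, X_1, \dots, X_n) \;=\; P(C) \prod_{i=1}^{n} P(X_i \mid C).$$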

What about probabilities?

Conditional probability tables (CPTs)

[Figure: BN with Flu, Allergy → Sinus → Headache, Nose, annotated with one CPT per node]

Joint distribution

[Figure: BN with Flu, Allergy → Sinus → Headache, Nose]

Why can we decompose? Markov Assumption!

The chain rule of probabilities

• P(A,B) = P(A) P(B|A)   [e.g., the two-node network Flu → Sinus]
• More generally:
  • P(X1,…,Xn) = P(X1) · P(X2|X1) · … · P(Xn|X1,…,Xn-1)

Chain rule & Joint distribution

Local Markov Assumption: A variable X is independent of its non-descendants given its parents

[Figure: BN with Flu, Allergy → Sinus → Headache, Nose]
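
The worked factorization is handwritten on the slide; under the variable ordering Flu (F), Allergy (A), Sinus (S), Headache (H), Nose (N) it proceeds as

$$
\begin{aligned}
P(F,A,S,H,N) &= P(F)\,P(A\mid F)\,P(S\mid F,A)\,P(H\mid F,A,S)\,P(N\mid F,A,S,H) && \text{(chain rule)}\\
             &= P(F)\,P(A)\,P(S\mid F,A)\,P(H\mid S)\,P(N\mid S) && \text{(local Markov assumption)}
\end{aligned}
$$

using $A \perp F$, $H \perp \{F,A\} \mid S$, and $N \perp \{F,A,H\} \mid S$.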

Two (trivial) special cases

• Edgeless graph
• Fully-connected graph
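
(A note added here, since the slide works these out on the board.) With no edges, every variable is parentless, so the joint factors as $P(X_1,\dots,X_n) = \prod_i P(X_i)$, i.e., full independence. With a fully-connected DAG, each variable's parents are all of its predecessors, so the factorization is just the chain rule and nothing is saved ($2^n - 1$ parameters for $n$ binary variables).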

The Representation Theorem – Joint Distribution to BN

BN: Encodes independence assumptions

• If the conditional independencies in the BN are a subset of the conditional independencies in P, then we obtain the joint probability distribution (it factorizes over the BN).

Real Bayesian networks applications

• Diagnosis of lymph node disease
• Speech recognition
• Microsoft Office and Windows
  • http://www.research.microsoft.com/research/dtg/
• Study of the human genome
• Robot mapping
• Robots to identify meteorites to study
• Modeling fMRI data
• Anomaly detection
• Fault diagnosis
• Modeling sensor network data

A general Bayes net

• Set of random variables
• Directed acyclic graph
  • Encodes independence assumptions
• CPTs
• Joint distribution:
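
The joint distribution line is an equation image on the slide; the standard BN factorization it refers to is

$$P(X_1, \dots, X_n) \;=\; \prod_{i=1}^{n} P\!\left(X_i \mid \mathbf{Pa}_{X_i}\right).$$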

How many parameters in a BN?

• Discrete variables X1, …, Xn
• Graph
  • Defines parents of Xi, PaXi
• CPTs – P(Xi | PaXi)
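
The count is worked on the board; the standard tally is

$$\#\text{parameters} \;=\; \sum_{i=1}^{n} \bigl(|\mathrm{Val}(X_i)| - 1\bigr) \prod_{X_j \in \mathrm{Pa}_{X_i}} |\mathrm{Val}(X_j)|,$$

e.g., if the five variables of the sinus network are binary this gives $1+1+4+2+2 = 10$ parameters, versus $2^5 - 1 = 31$ for an unstructured joint.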

Another example

• Variables:
  • B – Burglar
  • E – Earthquake
  • A – Burglar alarm
  • N – Neighbor calls
  • R – Radio report
• Both burglars and earthquakes can set off the alarm
• If the alarm sounds, a neighbor may call
• An earthquake may be announced on the radio

Another example – Building the BN

• B – Burglar
• E – Earthquake
• A – Burglar alarm
• N – Neighbor calls
• R – Radio report
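
The graph itself is drawn on the board; the structure suggested by the statements on the previous slide is $B \to A \leftarrow E$, $A \to N$, $E \to R$, giving

$$P(B,E,A,N,R) \;=\; P(B)\,P(E)\,P(A \mid B, E)\,P(N \mid A)\,P(R \mid E)$$

(10 parameters if all five variables are binary, versus 31 for the full joint).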

Independencies encoded in BN

• We said: all you need is the local Markov assumption
  • (Xi ⊥ NonDescendantsXi | PaXi)
• But then we talked about other (in)dependencies
  • e.g., explaining away
• What are the independencies encoded by a BN?
  • Only assumption is local Markov
  • But many others can be derived using the algebra of conditional independencies!!!

Understanding independencies in BNs – BNs with 3 nodes

Local Markov Assumption: A variable X is independent of its non-descendants given its parents

• Indirect causal effect: X → Z → Y
• Indirect evidential effect: X ← Z ← Y
• Common cause: X ← Z → Y
• Common effect: X → Z ← Y

Understanding independencies in BNs – Some examples

[Figure: example DAG over variables A, B, C, D, E, F, G, H, I, J, K]

An active trail – Example

[Figure: DAG over A, B, C, D, E, F, F’, F’’, G, H]

When are A and H independent?

Active trails formalized

• A path X1 – X2 – ··· – Xk is an active trail when variables O ⊆ {X1,…,Xn} are observed if, for each consecutive triplet in the trail (see the sketch below):
  • Xi-1 → Xi → Xi+1, and Xi is not observed (Xi ∉ O)
  • Xi-1 ← Xi ← Xi+1, and Xi is not observed (Xi ∉ O)
  • Xi-1 ← Xi → Xi+1, and Xi is not observed (Xi ∉ O)
  • Xi-1 → Xi ← Xi+1, and Xi is observed (Xi ∈ O) or one of its descendants is observed
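
A small Python sketch of the triplet rules above (written for this transcript; the helper names and the example network are mine): it walks a given path, classifies each consecutive triplet as a v-structure or not, and applies the corresponding rule.

```python
def is_active_trail(path, parents, observed):
    """Return True if the path X1 - ... - Xk is an active trail when the
    variables in `observed` are observed.  `parents` maps node -> set of parents."""
    def descendants(node):
        children = {}
        for v, ps in parents.items():
            for p in ps:
                children.setdefault(p, set()).add(v)
        stack, seen = [node], set()
        while stack:
            for c in children.get(stack.pop(), ()):
                if c not in seen:
                    seen.add(c)
                    stack.append(c)
        return seen

    for a, b, c in zip(path, path[1:], path[2:]):
        if a in parents[b] and c in parents[b]:
            # v-structure a -> b <- c: active only if b or one of its descendants is observed
            if b not in observed and not (descendants(b) & observed):
                return False
        else:
            # causal chain or common cause: active only if b is NOT observed
            if b in observed:
                return False
    return True

# The five-variable sinus network from earlier slides:
parents = {'Flu': set(), 'Allergy': set(), 'Sinus': {'Flu', 'Allergy'},
           'Headache': {'Sinus'}, 'Nose': {'Sinus'}}
print(is_active_trail(['Flu', 'Sinus', 'Headache'], parents, set()))      # True  (causal chain)
print(is_active_trail(['Flu', 'Sinus', 'Headache'], parents, {'Sinus'}))  # False (blocked by Sinus)
print(is_active_trail(['Flu', 'Sinus', 'Allergy'], parents, set()))       # False (inactive v-structure)
print(is_active_trail(['Flu', 'Sinus', 'Allergy'], parents, {'Nose'}))    # True  (descendant of Sinus observed)
```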

Active trails and independence?

• Theorem: Variables Xi and Xj are independent given Z ⊆ {X1,…,Xn} if there is no active trail between Xi and Xj when the variables Z ⊆ {X1,…,Xn} are observed

[Figure: example DAG over variables A, B, C, D, E, F, G, H, I, J, K]

The BN Representation Theorem

• If the conditional independencies in the BN are a subset of the conditional independencies in P, then we obtain the joint probability distribution (it factorizes over the BN).
  • Important because: every P has at least one BN structure G
• If the joint probability distribution factorizes over the BN, then the conditional independencies in the BN are a subset of the conditional independencies in P.
  • Important because: read independencies of P from the BN structure G

“Simpler” BNs

• A distribution can be represented by many BNs:
• Simpler BN requires fewer parameters

Learning Bayes nets

[Table filled in on the board – rows: fully observable data, missing data; columns: known structure, unknown structure]

Data {x(1), …, x(m)}  →  structure and parameters (CPTs – P(Xi | PaXi))

Learning the CPTs

For each discrete variable Xi (see the counting sketch below)

Data: x(1), …, x(m)
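
The estimator the slide sets up on the board is simple counting (maximum likelihood); a minimal sketch, assuming fully observed records stored as dicts (the function and variable names are mine):

```python
from collections import Counter

def mle_cpt(records, child, parent_list):
    """Estimate P(child | parents) by counting, from fully observed data.
    `records` is a list of dicts mapping variable name -> value."""
    joint = Counter((tuple(r[p] for p in parent_list), r[child]) for r in records)
    parent_counts = Counter(tuple(r[p] for p in parent_list) for r in records)
    return {(pa, val): n / parent_counts[pa] for (pa, val), n in joint.items()}

# e.g., P(Sinus | Flu, Allergy) from records over the five sinus-network variables:
# cpt = mle_cpt(records, 'Sinus', ['Flu', 'Allergy'])
```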

Queries in Bayes nets

• Given BN, find:
  • Probability of X given some evidence, P(X | e)
  • Most probable explanation, max_{x1,…,xn} P(x1,…,xn | e)
  • Most informative query
• Learn more about these next class

What you need to know

• Bayesian networks
  • A compact representation for large probability distributions
  • Not an algorithm
• Semantics of a BN
  • Conditional independence assumptions
• Representation
  • Variables
  • Graph
  • CPTs
• Why BNs are useful
• Learning CPTs from fully observable data
• Play with applet!!! ☺

Acknowledgements

• JavaBayes applet
  • http://www.pmr.poli.usp.br/ltd/Software/javabayes/Home/index.html
