
Vietnam National University of HCMC

International University
School of Computer Science and Engineering

INTRODUCTION TO ARTIFICIAL INTELLIGENCE


(IT097IU)
LECTURE 05: REASONING UNDER UNCERTAINTY

Instructor: Nguyen Trung Ky


Our Status in Intro to AI
 We’re done with Part I: Intelligent Agents and Search!

 Part II: Reasoning Under Uncertainty and Machine Learning
 Diagnosis
 Speech recognition
 Tracking objects
 Robot mapping
 Genetics
 Spell corrector
 … lots more!
Inference in Ghostbusters
 A ghost is in the grid somewhere
 Sensor readings tell how close a square is to the ghost
    On the ghost: red
    1 or 2 away: orange
    3 or 4 away: yellow
    5+ away: green

 Sensors are noisy, but we know P(Color | Distance)

[Demo: Ghostbuster – no probability (L12D1) ]


Video of Demo Ghostbuster – No probability
Uncertainty
 General situation:
 Observed variables (evidence): Agent knows certain things
about the state of the world (e.g., sensor readings or
symptoms)
 Unobserved variables: Agent needs to reason about other
aspects (e.g. where an object is or what disease is present)
 Model: Agent knows something about how the known
variables relate to the unknown variables

 Reasoning under uncertainty:
    A rational agent is one that makes rational decisions in order to maximize its performance measure
    A rational decision depends on the likelihood that, and the degree to which, the agent’s goals will be achieved
    Probability theory is the main tool for handling degrees of belief and uncertainty
Today
 Probability
 Random Variables and Events
 Joint and Marginal Distributions
 Conditional Distribution
 Product Rule, Chain Rule, Bayes’ Rule
 Inference
 Independence

 You’ll need all this stuff A LOT for the next few weeks, so make sure you go over it now!
Random Variables
 A random variable is some aspect of the world about
which we (may) have uncertainty
 L = Where is the ghost?
 R = Is it raining?
 T = Is it hot or cold?
 D = How long will it take to drive to work?

 We denote random variables with capital letters

 Like variables in a CSP, random variables have domains
    L in possible locations, maybe {(0,0), (0,1), …}
    R in {true, false} (often written as {+r, -r})
    T in {hot, cold}
    D in [0, ∞)
Probability Distributions
 Associate a probability with each value

 Temperature:                Weather:

    T      P                    W        P
    hot    0.5                  sun      0.6
    cold   0.5                  rain     0.1
                                fog      0.3
                                meteor   0.0
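As a minimal Python sketch (added for illustration, not part of the original slides), each table can be stored as a dictionary from value to probability; the assertions check that every entry is non-negative and that each table sums to one.

```python
# Each distribution is a dict mapping a value to its probability.
P_T = {"hot": 0.5, "cold": 0.5}
P_W = {"sun": 0.6, "rain": 0.1, "fog": 0.3, "meteor": 0.0}

for dist in (P_T, P_W):
    assert all(p >= 0 for p in dist.values())        # probabilities are non-negative
    assert abs(sum(dist.values()) - 1.0) < 1e-9      # and sum to one
```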
Probability Distributions
 Unobserved random variables have distributions

    T      P              W        P
    hot    0.5            sun      0.6
    cold   0.5            rain     0.1
                          fog      0.3
                          meteor   0.0

 Shorthand notation: P(hot) = P(T = hot), P(rain) = P(W = rain), …
    OK if all domain entries are unique

 A distribution is a TABLE of probabilities of values

 A probability (lower case value) is a single number, e.g. P(W = rain) = 0.1

 Must have: P(X = x) ≥ 0 for every x, and Σx P(X = x) = 1


Joint Probability Distributions
 A joint probability distribution (JPD) over a set of random variables X1, …, Xn
   specifies a real number for each assignment (or outcome): P(X1 = x1, …, Xn = xn)

    T      W      P
    hot    sun    0.4
    hot    rain   0.1
    cold   sun    0.2
    cold   rain   0.3

 Must obey: P(x1, …, xn) ≥ 0, and the sum over all assignments is 1

 Size of distribution if n variables with domain sizes d?  d^n entries

 For all but the smallest distributions, impractical to write out!
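A small sketch (added, not from the slides) of the joint table above as a dict keyed by complete assignments; `P_TW` is an ad hoc name.

```python
# Joint distribution P(T, W): keys are complete assignments (t, w).
P_TW = {
    ("hot", "sun"): 0.4, ("hot", "rain"): 0.1,
    ("cold", "sun"): 0.2, ("cold", "rain"): 0.3,
}

assert abs(sum(P_TW.values()) - 1.0) < 1e-9   # entries must sum to one

# With n variables of domain size d, the table has d**n rows; here n = 2, d = 2.
assert len(P_TW) == 2 ** 2
```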
Events
 An event is a set E of outcomes: P(E) = sum over the outcomes (x1, …, xn) in E of P(x1, …, xn)

    T      W      P
    hot    sun    0.4
    hot    rain   0.1
    cold   sun    0.2
    cold   rain   0.3

 From a joint distribution, we can calculate the probability of any event
    Probability that it’s hot AND sunny?
    Probability that it’s hot?
    Probability that it’s hot OR sunny?

 Typically, the events we care about are partial assignments, like P(T = hot)
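As an added illustration (not from the slides), the three event probabilities above can be computed by summing rows of the joint table; `prob` and `P_TW` are ad hoc names.

```python
P_TW = {("hot", "sun"): 0.4, ("hot", "rain"): 0.1,
        ("cold", "sun"): 0.2, ("cold", "rain"): 0.3}

def prob(event):
    """P(E): add up the joint probabilities of the outcomes in the event E."""
    return sum(p for outcome, p in P_TW.items() if outcome in event)

print(prob({("hot", "sun")}))                                     # P(hot AND sunny) = 0.4
print(prob({o for o in P_TW if o[0] == "hot"}))                   # P(hot)           = 0.5
print(prob({o for o in P_TW if o[0] == "hot" or o[1] == "sun"}))  # P(hot OR sunny)  = 0.7 (up to float rounding)
```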
Quiz: Events
    X     Y     P
    +x    +y    0.2
    +x    -y    0.3
    -x    +y    0.4
    -x    -y    0.1

 P(+x, +y) ?
 P(+x) ?
 P(-y OR +x) ?
Marginal Distributions
 Marginal distributions are sub-tables which eliminate variables
 Marginalization (summing out): Combine collapsed rows by adding

    T      W      P              T      P
    hot    sun    0.4            hot    0.5
    hot    rain   0.1            cold   0.5
    cold   sun    0.2
    cold   rain   0.3            W      P
                                 sun    0.6
                                 rain   0.4
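A short sketch (added, not from the slides) of marginalization by summing out; `marginal` is a hypothetical helper name.

```python
from collections import defaultdict

P_TW = {("hot", "sun"): 0.4, ("hot", "rain"): 0.1,
        ("cold", "sun"): 0.2, ("cold", "rain"): 0.3}

def marginal(joint, index):
    """Sum out every variable except the one at `index` in each assignment tuple."""
    out = defaultdict(float)
    for assignment, p in joint.items():
        out[assignment[index]] += p
    return dict(out)

print(marginal(P_TW, 0))   # P(T): {'hot': 0.5, 'cold': 0.5}
print(marginal(P_TW, 1))   # P(W): {'sun': 0.6, 'rain': 0.4} (up to float rounding)
```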
Quiz: Marginal Distributions

    X     Y     P            X      P
    +x    +y    0.2          +x
    +x    -y    0.3          -x
    -x    +y    0.4
    -x    -y    0.1          Y      P
                             +y
                             -y
Conditional Probabilities
 A simple relation between joint and conditional probabilities
 In fact, this is taken as the definition of a conditional probability:

    P(a | b) = P(a, b) / P(b)

               W = sun   W = rain   P(T)
    T = hot      0.4       0.1       0.5
    T = cold     0.2       0.3       0.5
    P(W)         0.6       0.4        1

    T      W      P
    hot    sun    0.4
    hot    rain   0.1
    cold   sun    0.2
    cold   rain   0.3
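A sketch (added for illustration) of the definition above applied to the joint table; the helper `p` is hypothetical.

```python
P_TW = {("hot", "sun"): 0.4, ("hot", "rain"): 0.1,
        ("cold", "sun"): 0.2, ("cold", "rain"): 0.3}

def p(t=None, w=None):
    """Probability of a (possibly partial) assignment under the joint table."""
    return sum(prob for (tv, wv), prob in P_TW.items()
               if (t is None or tv == t) and (w is None or wv == w))

# P(W = sun | T = cold) = P(cold, sun) / P(cold) = 0.2 / 0.5 = 0.4
print(p(t="cold", w="sun") / p(t="cold"))
```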
Conditional Distributions
 Conditional distributions are probability distributions over some variables given fixed values of others

 Joint distribution P(T, W):
    T      W      P
    hot    sun    0.4
    hot    rain   0.1
    cold   sun    0.2
    cold   rain   0.3

 Conditional distributions:
    P(W | T = hot):    sun 0.8, rain 0.2
    P(W | T = cold):   sun 0.4, rain 0.6
Quiz: Conditional Probabilities
    X     Y     P
    +x    +y    0.2
    +x    -y    0.3
    -x    +y    0.4
    -x    -y    0.1

              Y = +y   Y = -y   P(X)
    X = +x      0.2      0.3     0.5
    X = -x      0.4      0.1     0.5
    P(Y)        0.6      0.4      1

 P(+x | +y) ?
 P(-x | +y) ?
 P(-y | +x) ?
Normalization Trick
 A trick to get a whole conditional distribution at once:
 Select the joint probabilities matching the evidence
 Normalize the selection (make it sum to one)
    T      W      P               T      W      P                T      P
    hot    sun    0.4   Select    hot    rain   0.1   Normalize  hot    0.25
    hot    rain   0.1   ------->  cold   rain   0.3   ---------> cold   0.75
    cold   sun    0.2
    cold   rain   0.3
 Why does this work? Sum of selection is P(evidence)! (P(r), here)
To Normalize
 (Dictionary) To bring or restore to a normal condition
    Here: all entries sum to ONE
 Procedure:
 Step 1: Compute Z = sum over all entries
 Step 2: Divide every entry by Z

 Example 1 (Z = 0.5):
    W      P                 W      P
    sun    0.2   Normalize   sun    0.4
    rain   0.3   -------->   rain   0.6

 Example 2 (Z = 50):
    T      W      P                 T      W      P
    hot    sun    20                hot    sun    0.4
    hot    rain    5    Normalize   hot    rain   0.1
    cold   sun    10    -------->   cold   sun    0.2
    cold   rain   15                cold   rain   0.3
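A minimal `normalize` sketch (added, not from the slides) reproducing both examples above.

```python
def normalize(table):
    """Step 1: Z = sum over all entries.  Step 2: divide every entry by Z."""
    z = sum(table.values())
    return {k: v / z for k, v in table.items()}

print(normalize({"sun": 0.2, "rain": 0.3}))                      # Example 1: Z = 0.5
print(normalize({("hot", "sun"): 20, ("hot", "rain"): 5,
                 ("cold", "sun"): 10, ("cold", "rain"): 15}))    # Example 2: Z = 50
```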
Probabilistic Models
 A probabilistic model is a joint distribution over a set of variables

 Inference: given a joint distribution, we can reason about unobserved variables given observations (evidence)

 General form of a query: P(Q | e1, …, ek)

 This conditional distribution is called a posterior distribution or the belief function of an agent which uses this model
Probabilistic Inference
 Probabilistic inference: compute a desired probability
from other known probabilities (e.g. conditional from
joint)

 We generally compute conditional probabilities


 P(on time | no reported accidents) = 0.90
 These represent the agent’s beliefs given the evidence

 Probabilities change with new evidence:


 P(on time | no accidents, 5 a.m.) = 0.95
 P(on time | no accidents, 5 a.m., raining) = 0.80
 Observing new evidence causes beliefs to be updated
Inference by Enumeration
 General case:
    Evidence variables:  E1 … Ek = e1 … ek
    Query* variable:     Q
    Hidden variables:    H1 … Hr
    (together, these are all the variables)
 We want: P(Q | e1 … ek)
    (* works fine with multiple query variables, too)

 Step 1: Select the entries consistent with the evidence
 Step 2: Sum out H to get the joint of Query and evidence:
    P(Q, e1, …, ek) = sum over h1 … hr of P(Q, h1, …, hr, e1, …, ek)
 Step 3: Normalize:
    P(Q | e1, …, ek) = P(Q, e1, …, ek) / Σq P(q, e1, …, ek)
Inference by Enumeration
    S        T      W      P
    summer   hot    sun    0.30
    summer   hot    rain   0.05
    summer   cold   sun    0.10
    summer   cold   rain   0.05
    winter   hot    sun    0.10
    winter   hot    rain   0.05
    winter   cold   sun    0.15
    winter   cold   rain   0.20

 P(W)?
 P(W | winter)?
 P(W | winter, hot)?
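A sketch (added for illustration, not from the slides) of the three enumeration steps on the table above; `enumerate_query` is a hypothetical helper.

```python
from collections import defaultdict

VARS = ("S", "T", "W")
joint = {  # P(S, T, W) from the table above
    ("summer", "hot",  "sun"): 0.30, ("summer", "hot",  "rain"): 0.05,
    ("summer", "cold", "sun"): 0.10, ("summer", "cold", "rain"): 0.05,
    ("winter", "hot",  "sun"): 0.10, ("winter", "hot",  "rain"): 0.05,
    ("winter", "cold", "sun"): 0.15, ("winter", "cold", "rain"): 0.20,
}

def enumerate_query(query_var, evidence):
    """P(query_var | evidence): select consistent rows, sum out hidden variables, normalize."""
    q = VARS.index(query_var)
    totals = defaultdict(float)
    for row, p in joint.items():
        if all(row[VARS.index(v)] == val for v, val in evidence.items()):   # Step 1: select
            totals[row[q]] += p                                             # Step 2: sum out
    z = sum(totals.values())
    return {value: p / z for value, p in totals.items()}                    # Step 3: normalize

print(enumerate_query("W", {}))                            # P(W)
print(enumerate_query("W", {"S": "winter"}))               # P(W | winter)
print(enumerate_query("W", {"S": "winter", "T": "hot"}))   # P(W | winter, hot)
```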
Inference by Enumeration

 Obvious problems:
    Worst-case time complexity O(d^n)
    Space complexity O(d^n) to store the joint distribution
 Solutions:
    Better techniques
    Better representation
    Simplifying assumptions
    Bayesian Networks
Bayes’ Rule

 Two ways to factor a joint distribution over two variables:
    P(x, y) = P(x | y) P(y) = P(y | x) P(x)

 Dividing, we get Bayes’ rule (“That’s my rule!”):
    P(x | y) = P(y | x) P(x) / P(y)

 Why is this at all helpful?
    Lets us build one conditional from its reverse
    Often one conditional is tricky but the other one is simple
    Foundation of many systems we’ll see later (e.g. ASR, MT)

 In the running for most important AI equation!
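As a small added illustration (not from the slides), Bayes’ rule applied to the earlier T/W example, using P(T) and P(W = sun | T) from the conditional-distributions slide, recovers P(T | W = sun).

```python
# Prior P(T) and likelihood P(W = sun | T), taken from the earlier T/W example.
P_T = {"hot": 0.5, "cold": 0.5}
P_sun_given_T = {"hot": 0.8, "cold": 0.4}

# Bayes' rule: P(t | sun) = P(sun | t) P(t) / P(sun), with P(sun) = sum_t P(sun | t) P(t)
unnormalized = {t: P_sun_given_T[t] * P_T[t] for t in P_T}
p_sun = sum(unnormalized.values())
P_T_given_sun = {t: v / p_sun for t, v in unnormalized.items()}
print(P_T_given_sun)    # roughly {'hot': 0.667, 'cold': 0.333}, matching 0.4 / 0.6 from the joint
```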


The Product Rule
 Sometimes have conditional distributions but want the joint:
    P(d, w) = P(d | w) P(w)

 Example:
    P(W)               P(D | W)                  P(D, W)
    W      P           D      W      P           D      W      P
    sun    0.8         wet    sun    0.1         wet    sun    0.08
    rain   0.2         dry    sun    0.9         dry    sun    0.72
                       wet    rain   0.7         wet    rain   0.14
                       dry    rain   0.3         dry    rain   0.06
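A short sketch (added, not from the slides) of the product rule on the example above.

```python
# P(W) and P(D | W) from the example; the product rule gives the joint P(D, W).
P_W = {"sun": 0.8, "rain": 0.2}
P_D_given_W = {("wet", "sun"): 0.1, ("dry", "sun"): 0.9,
               ("wet", "rain"): 0.7, ("dry", "rain"): 0.3}

P_DW = {(d, w): P_D_given_W[(d, w)] * P_W[w] for (d, w) in P_D_given_W}
print(P_DW)   # approximately {('wet','sun'): 0.08, ('dry','sun'): 0.72, ('wet','rain'): 0.14, ('dry','rain'): 0.06}
```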
The Chain Rule

 More generally, can always write any joint distribution as an incremental product of conditional distributions:

    P(x1, x2, x3) = P(x1) P(x2 | x1) P(x3 | x1, x2)
    P(x1, …, xn) = product over i of P(xi | x1, …, xi-1)

 Why is this always true?


Independence
 Two variables are independent if:
    P(x, y) = P(x) P(y)   for all x, y
    This says that their joint distribution factors into a product of two simpler distributions
 Another form:
    P(x | y) = P(x)   for all x, y
 We write: X ⫫ Y

 Independence is a simplifying modeling assumption
    Empirical joint distributions: at best “close” to independent
    What could we assume for {Weather, Traffic, Cavity, Toothache}?
Example: Independence?

    P(T)                 P(W)
    T      P             W      P
    hot    0.5           sun    0.6
    cold   0.5           rain   0.4

    P1(T, W)                    P2(T, W)
    T      W      P             T      W      P
    hot    sun    0.4           hot    sun    0.3
    hot    rain   0.1           hot    rain   0.2
    cold   sun    0.2           cold   sun    0.3
    cold   rain   0.3           cold   rain   0.2
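A short added check (not from the slides) of whether each joint table above equals the product of its marginals; `independent` is a hypothetical helper.

```python
from itertools import product

P1 = {("hot", "sun"): 0.4, ("hot", "rain"): 0.1, ("cold", "sun"): 0.2, ("cold", "rain"): 0.3}
P2 = {("hot", "sun"): 0.3, ("hot", "rain"): 0.2, ("cold", "sun"): 0.3, ("cold", "rain"): 0.2}

def independent(joint):
    """True iff P(t, w) == P(t) * P(w) for every entry (up to float tolerance)."""
    P_t = {t: sum(p for (tv, _), p in joint.items() if tv == t) for t in {k[0] for k in joint}}
    P_w = {w: sum(p for (_, wv), p in joint.items() if wv == w) for w in {k[1] for k in joint}}
    return all(abs(joint[(t, w)] - P_t[t] * P_w[w]) < 1e-9 for t, w in product(P_t, P_w))

print(independent(P1))   # False: e.g. P(hot, sun) = 0.4 but P(hot) P(sun) = 0.5 * 0.6 = 0.3
print(independent(P2))   # True: every entry equals the product of its marginals
```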
Example: Independence
 N fair, independent coin flips:

    P(X1)          P(X2)          …          P(Xn)
    H    0.5       H    0.5                  H    0.5
    T    0.5       T    0.5                  T    0.5
Conditional Independence
 P(Toothache, Cavity, Catch)
 If I have a cavity, the probability that the probe catches in it doesn't
depend on whether I have a toothache:
 P(+catch | +toothache, +cavity) = P(+catch | +cavity)
 The same independence holds if I don’t have a cavity:
 P(+catch | +toothache, -cavity) = P(+catch| -cavity)
 Catch is conditionally independent of Toothache given Cavity:
 P(Catch | Toothache, Cavity) = P(Catch | Cavity)
 Equivalent statements:
 P(Toothache | Catch , Cavity) = P(Toothache | Cavity)
 P(Toothache, Catch | Cavity) = P(Toothache | Cavity) P(Catch | Cavity)
 One can be derived from the other easily
Conditional Independence
 Unconditional (absolute) independence very rare (why?)

 Conditional independence is our most basic and robust form of knowledge about uncertain environments:
    X is conditionally independent of Y given Z (written X ⫫ Y | Z) iff
    P(x, y | z) = P(x | z) P(y | z)   for all x, y, z
    (equivalently, P(x | z, y) = P(x | z))

 What about this domain:
    Traffic
    Umbrella
    Raining
Probability Summary
Model-based Classification with Naïve Bayes

 A general Naive Bayes model:
    P(Y, F1, …, Fn) = P(Y) times the product over i of P(Fi | Y)

    [Figure: a Bayes net with class Y as the parent of features F1, F2, …, Fn]
      P(Y):                |Y| parameters
      P(Fi | Y) tables:    n x |F| x |Y| parameters
      (a full joint table over Y, F1, …, Fn would need |Y| x |F|^n values)

 We only have to specify how each feature depends on the class
    Total number of parameters is linear in n
    Model is very simplistic, but often works anyway
Inference for Naïve Bayes
 Goal: compute posterior distribution over label variable Y

 Step 1: get joint probability of label and evidence for each label:
    P(y, f1, …, fn) = P(y) times the product over i of P(fi | y)
 Step 2: sum to get probability of evidence:
    P(f1, …, fn) = Σy P(y, f1, …, fn)
 Step 3: normalize by dividing Step 1 by Step 2:
    P(y | f1, …, fn) = P(y, f1, …, fn) / P(f1, …, fn)
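A minimal sketch (added, not from the slides) of the three steps; the label set, features, and all numbers below are hypothetical.

```python
# Hypothetical numbers: a label Y in {spam, ham} and two binary features F1, F2.
P_Y = {"spam": 0.4, "ham": 0.6}
P_F_given_Y = [                      # P(F_i = true | Y), one dict per feature
    {"spam": 0.8, "ham": 0.1},       # F1, e.g. "contains the word FREE"
    {"spam": 0.3, "ham": 0.2},       # F2
]

def posterior(observed):
    """P(Y | f1, ..., fn) for a list of observed True/False feature values."""
    joint = {}
    for y, prior in P_Y.items():                        # Step 1: P(y) * prod_i P(f_i | y)
        p = prior
        for table, f in zip(P_F_given_Y, observed):
            p *= table[y] if f else 1 - table[y]
        joint[y] = p
    z = sum(joint.values())                             # Step 2: P(f1, ..., fn)
    return {y: p / z for y, p in joint.items()}         # Step 3: normalize

print(posterior([True, False]))                         # posterior over Y given F1 = +, F2 = -
```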


General Naïve Bayes
 What do we need in order to use Naïve Bayes?

 Inference method (we just saw this part)


 Start with a bunch of probabilities: P(Y) and the P(Fi|Y) tables
 Use standard inference to compute P(Y|F1…Fn)
 Nothing new here

 Estimates of local conditional probability tables
    P(Y), the prior over labels
    P(Fi | Y) for each feature (evidence variable)
    These probabilities are collectively called the parameters of the model and denoted by θ
    Up until now, we assumed these appeared by magic, but…
    …they typically come from training data counts: we’ll look at this soon
Example: Spam Filter
 Input: an email
 Output: spam/ham
 Setup:
    Get a large collection of example emails, each labeled “spam” or “ham”
    Note: someone has to hand label all this data!
    Want to learn to predict labels of new, future emails
 Features: The attributes used to make the ham / spam decision
    Words: FREE!
    Text Patterns: $dd, CAPS
    Non-text: SenderInContacts
    …

 Example emails:
    “Dear Sir. First, I must solicit your confidence in this transaction, this is by virture of its nature as being utterly confidencial and top secret. …”
    “TO BE REMOVED FROM FUTURE MAILINGS, SIMPLY REPLY TO THIS MESSAGE AND PUT "REMOVE" IN THE SUBJECT.”
    “99 MILLION EMAIL ADDRESSES FOR ONLY $99”
    “Ok, Iknow this is blatantly OT but I'm beginning to go insane. Had an old Dell Dimension XPS sitting in the corner and decided to put it to use, I know it was working pre being stuck in the corner, but when I plugged it in, hit the power nothing happened.”
A Spam Filter
 Naïve Bayes spam filter

 Data:
    Collection of emails, labeled spam or ham
    Note: someone has to hand label all this data!
    Split into training, validation, test sets

 Classifiers
    Learn on the training set
    (Tune it on a validation set)
    Test it on new emails

 (Example emails: same as on the previous slide.)
Naïve Bayes for Text
 Bag-of-words Naïve Bayes:
    Features: Wi is the word at position i (the word at position i, not the ith word in the dictionary!)
    As before: predict label conditioned on feature variables (spam vs. ham)
    As before: assume features are conditionally independent given label
    New: each Wi is identically distributed

 Generative model: P(Y, W1, …, Wn) = P(Y) times the product over i of P(Wi | Y)

 “Tied” distributions and bag-of-words
    Usually, each variable gets its own conditional probability distribution P(F | Y)
    In a bag-of-words model:
       Each position is identically distributed
       All positions share the same conditional probs P(W | Y)
       Why make this assumption?
    Called “bag-of-words” because the model is insensitive to word order or reordering
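A bag-of-words sketch (added, not from the slides); the vocabulary, all probabilities, and the "<unk>" fallback for unseen words are hypothetical, and log probabilities are used to avoid underflow on long emails.

```python
import math

# Hypothetical prior P(Y) and tied word distributions P(W | Y); in practice these
# come from training-data counts (later lectures). "<unk>" stands in for unseen words.
P_Y = {"spam": 0.5, "ham": 0.5}
P_W_given_Y = {
    "spam": {"free": 0.05, "million": 0.02, "meeting": 0.001, "<unk>": 0.001},
    "ham":  {"free": 0.005, "million": 0.001, "meeting": 0.02, "<unk>": 0.001},
}

def posterior(words):
    """Bag-of-words Naive Bayes: score log P(y) + sum_i log P(w_i | y), then normalize."""
    log_joint = {
        y: math.log(P_Y[y]) + sum(math.log(P_W_given_Y[y].get(w, P_W_given_Y[y]["<unk>"]))
                                  for w in words)
        for y in P_Y
    }
    m = max(log_joint.values())                  # shift by the max for numerical stability
    unnorm = {y: math.exp(v - m) for y, v in log_joint.items()}
    z = sum(unnorm.values())
    return {y: v / z for y, v in unnorm.items()}

print(posterior("free million free".split()))    # strongly favors spam with these numbers
```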
Training and Testing
Important Concepts
 Data: labeled instances, e.g. emails marked spam/ham
    Training set
    Validation set
    Test set

 Features: attribute-value pairs which characterize each x

 Experimentation cycle
    Learn parameters (e.g. model probabilities) on training set
    (Tune hyperparameters on validation set)
    Compute accuracy on test set
    Very important: never “peek” at the test set!

 Evaluation
    Accuracy: fraction of instances predicted correctly

 Overfitting and generalization
    Want a classifier which does well on test data
    Overfitting: fitting the training data very closely, but not generalizing well
    We’ll investigate overfitting and generalization formally in a few lectures
