Unit 4 Uncertainty Measure
Unit 4 Uncertainty Measure
Probability Theory
Introduction
• Intelligent system – have some degree of uncertainty
• Uncertainty - Not always possible to represent data in precise and
consistent manner
• Missing data, unreliable data, ambiguous due to errors in measurement ,
multiple conflicting measurements
• Default data will have exceptions leading to errors in intelligent systems
• Represented knowledge is based on best guesses of experts based on
observations and statistical conclusion – not appropriate at all situations
• Implementing uncertainty management - consider 3 issues
• 1.representation of uncertain data
• 2.combination of two or more pieces of uncertain data
• 3.drawing inference using uncertain data
• Methods for handling uncertainty includes – probability theory, fuzzy
logic , model and temporal logic
Probability Theory
• Probability – way of turning an opinion or an expectation into a number
lying between 0 and 1
• Reflects the likelihood of an event or a chance that a particular event will
occur
• Assume a set S (sample space) consisting of independent events
representing all possible outcomes of random experiment.
• Every non-empty subset A of sample space S is called an event – empty set
is called impossible event – S is called sure event
• Probability of an event A by P(A)
• For the first card the chance of drawing a King is 4 out of 52 (there are 4
Kings in a deck of 52 cards):
P(A) = 4/52
• But after removing a King from the deck the probability of the 2nd card
drawn is less likely to be a King (only 3 of the 51 cards left are Kings):
P(B|A) = 3/51
And so:
P(A and B) = P(A) x P(B|A) = (4/52) x (3/51) = 12/2652 = 1/221
• So the chance of getting 2 Kings is 1 in 221, or about 0.5%
Example
Bayes’ Theorem
•
Proof
Prove
Example
Extension of Bayes Theorem
Cumulative Probabilities
Bayesian Method: Advantages & Disadvantages
• Advantages
• Bayesian method is based on a strong theoretical foundation in probability theory;
it is currently the most advanced of all certainty reasoning methods.
• This method has well-defined semantics for decision making
• Disadvantages
• The system using Bayesian approach need quite a large amount of probability data
to construct a knowledge base.
• For example a diagnostic system handling 50 detectable conclusions (H) and 300
relevant and observable characteristics E requires a minimum of 15,050(H*E+H)
probability values assuming that all of the conclusions are mutually exclusive.
Handling large amount of data not practically
possible
• Since conditional probabilities are based on statistical data the sample
sizes must be sufficient to ensure that the probabilities obtained from
them are accurate
• If conditional probabilities are based on human experts, then the
question of values being consistent and comprehensive arises
• Since associations between the hypotheses and evidences are
reduced to numbers, it eliminates the actual knowledge embedded
within the data. Therefore a system’s ability to explain its reasoning
and browsing through the hierarchy of evidences to hypothesis to
user is lost.