
Probabilistic models

Haixu Tang
School of Informatics
Probability
• Experiment: a procedure involving chance that leads to different results;
• Outcome: the result of a single trial of an experiment;
• Event: one or more outcomes of an experiment;
• Probability: the measure of how likely an event is.
Example: a fair 6-sided dice
• Outcome: the possible outcomes of this experiment are 1, 2, 3, 4, 5 and 6;
• Events: 1; 6; even;
• Probability: outcomes are equally likely to occur.
– P(A) = (number of ways event A can occur) / (total number of possible outcomes)
– P(1) = P(6) = 1/6; P(even) = 3/6 = 1/2
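A minimal Python sketch of this counting definition of probability; the outcomes list and the prob helper are illustrative names, not part of the lecture:

from fractions import Fraction

outcomes = [1, 2, 3, 4, 5, 6]  # a fair 6-sided dice

def prob(event):
    # P(A) = number of ways event A can occur / total number of outcomes
    favorable = sum(1 for o in outcomes if event(o))
    return Fraction(favorable, len(outcomes))

print(prob(lambda o: o == 1))      # 1/6
print(prob(lambda o: o % 2 == 0))  # P(even) = 1/2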
Probability distribution
• Probability distribution: the assignment of a probability P(x) to each outcome x.
• A fair dice: outcomes are equally likely to occur → the probability distribution over all six outcomes is P(x) = 1/6, x = 1, 2, …, 6.
• A loaded dice: outcomes are unequally likely to occur → the probability distribution over all six outcomes is P(x) = f(x), x = 1, 2, …, 6, with Σ_x f(x) = 1.
Example: DNA sequences
• Event: observing a DNA sequence S = s1s2…sn, si ∈ {A, C, G, T};
• Random sequence model (or independent and identically distributed, i.i.d., model): si occurs at random with probability P(si), independent of all other residues in the sequence;
• P(S) = ∏_{i=1}^{n} P(si)
• This model will be used as a background model (also called the null hypothesis).
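A minimal sketch of the i.i.d. background model, assuming hypothetical uniform base frequencies (base_prob and null_model_prob are illustrative names):

from math import log, prod

base_prob = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}  # assumed uniform frequencies

def null_model_prob(seq):
    # P(S) = product over i of P(s_i) under the i.i.d. model
    return prod(base_prob[s] for s in seq)

def null_model_log_prob(seq):
    # log-space version, avoiding underflow for long sequences
    return sum(log(base_prob[s]) for s in seq)

print(null_model_prob("ACGT"))  # 0.25**4 = 0.00390625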
Conditional probability
• P(i | D): the measure of how likely an event i happens under the condition D;
– Example: two dices D1, D2
• P(i | D1): probability of picking i using dice D1
• P(i | D2): probability of picking i using dice D2
Joint probability
• Two experiments X and Y
– P(X, Y) → joint probability (distribution) of experiments X and Y
– P(X, Y) = P(X | Y) P(Y) = P(Y | X) P(X)
– If P(X | Y) = P(X), X and Y are independent
• Example: experiment 1 (selecting a dice), experiment 2 (rolling the selected dice)
– P(y): y = D1 or D2
– P(i, D1) = P(i | D1) P(D1)
– If P(i | D1) = P(i | D2), the experiments are independent
Marginal probability
• P(X) = Σ_Y P(X | Y) P(Y)
• Example: experiment 1 (selecting a dice), experiment 2 (rolling the selected dice)
– P(y): y = D1 or D2
– P(i) = P(i | D1) P(D1) + P(i | D2) P(D2)
– If P(i | D1) = P(i | D2) (independent events):
• P(i) = P(i | D1) (P(D1) + P(D2)) = P(i | D1)
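The two-dice example, sketched in Python; the distributions below are hypothetical stand-ins for D1 and D2:

p_dice = {"D1": 0.5, "D2": 0.5}                       # P(y): which dice is selected
p_roll = {
    "D1": {i: 1 / 6 for i in range(1, 7)},            # a fair dice
    "D2": {6: 0.5, **{i: 0.1 for i in range(1, 6)}},  # a loaded dice
}

def joint(i, y):
    # P(i, y) = P(i | y) P(y)
    return p_roll[y][i] * p_dice[y]

def marginal(i):
    # P(i) = sum over y of P(i | y) P(y)
    return sum(joint(i, y) for y in p_dice)

print(joint(6, "D2"))  # 0.5 * 0.5 = 0.25
print(marginal(6))     # (1/6)(0.5) + (0.5)(0.5) ≈ 0.333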
Probability models
• A system that produces different outcomes with different probabilities.
• It can simulate a class of objects (events), assigning each an associated probability.
• Simple objects (processes) → probability distributions
Example: continuous variable
• The whole set of outcomes X (x ∈ X) can be infinite.
• Continuous variable x ∈ [x0, x1]
– As the interval shrinks, P(x0 ≤ x ≤ x1) → 0
– P(x − Δx/2 ≤ x ≤ x + Δx/2) = f(x)Δx; ∫ f(x) dx = 1
– f(x): probability density function (density, pdf)
– P(x ≤ y) = ∫_{x0}^{y} f(x) dx: cumulative distribution function (cdf)
Mean and variance
• Mean
– m = Σ_x x P(x)
• Variance
– σ^2 = Σ_k (k − m)^2 P(k)
– σ: standard deviation
Typical probability distributions
• Binomial distribution
• Gaussian distribution
• Multinomial distribution
• Dirichlet distribution
• Extreme value distribution (EVD)
Binomial distribution
• An experiment with binary outcomes: 0 or 1;
• Probability distribution of a single experiment: P('1') = p and P('0') = 1 − p;
• Probability distribution of N tries of the same experiment:
– Bi(k '1's out of N tries) = (N choose k) p^k (1 − p)^{N − k}
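A sketch of the binomial probability using only the standard library (binomial_pmf is an illustrative name):

from math import comb

def binomial_pmf(k, n, p):
    # Bi(k '1's out of n tries) = C(n, k) * p**k * (1 - p)**(n - k)
    return comb(n, k) * p**k * (1 - p)**(n - k)

# e.g. exactly two '1's in twelve tries with p = 1/6
print(binomial_pmf(2, 12, 1 / 6))  # ≈ 0.296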
Gaussian distribution
• N → ∞: Bi → Gaussian distribution
• Define the new variable u = (k − m)/σ
– f(u) = (1/√(2π)) exp(−u^2 / 2)
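A sketch of this limit, comparing an exact binomial probability with the Gaussian density of the standardized variable u; the parameter values below are arbitrary:

from math import comb, exp, pi, sqrt

def gaussian_pdf(u):
    # standard normal density f(u) = exp(-u**2 / 2) / sqrt(2*pi)
    return exp(-u * u / 2) / sqrt(2 * pi)

N, p, k = 1000, 0.5, 520
m, sd = N * p, sqrt(N * p * (1 - p))          # binomial mean and standard deviation
exact = comb(N, k) * p**k * (1 - p)**(N - k)  # exact binomial probability
approx = gaussian_pdf((k - m) / sd) / sd      # Gaussian approximation
print(exact, approx)                          # both ≈ 0.0113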
Multinomial distribution
• An experiment with K independent outcomes with probabilities θi, i = 1, …, K, Σ_i θi = 1.
• Probability distribution of N tries of the same experiment, getting ni occurrences of outcome i, Σ_i ni = N:
– M(n | θ) = (N! / ∏_i ni!) ∏_{i=1}^{K} θi^{ni}
Example: a fair dice
• Probability: outcomes (1, 2, …, 6) are equally likely to occur
• Probability of rolling one dozen times (12) and getting each outcome twice:
– (12! / (2!)^6) (1/6)^12 ≈ 3.4 × 10^−3
Example: a loaded dice
• Probability: outcomes (1, 2, …, 6) are unequally likely to occur: P(6) = 0.5, P(1) = P(2) = … = P(5) = 0.1
• Probability of rolling one dozen times (12) and getting each outcome twice:
– (12! / (2!)^6) × 0.5^2 × 0.1^10 ≈ 1.87 × 10^−4
Dirichlet distribution
• Outcomes: θ = (θ1, θ2, …, θK)
• Density: D(θ | α) ~ ∏_{i=1}^{K} θi^{αi − 1}, with Σ_{i=1}^{K} θi = 1
• α = (α1, α2, …, αK) are constants → a different α gives a different probability distribution over θ.
• K = 2 → Beta distribution
Example: dice factories
• Dice factories produce all kinds of dices: θ = (θ(1), θ(2), …, θ(6))
• A dice factory distinguishes itself from the others by its parameters α
• The probability of producing a dice θ in the factory α is determined by D(θ | α)
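A sketch of drawing a dice θ from a factory with parameters α, using the standard gamma-normalization construction of the Dirichlet (sample_dirichlet is an illustrative name):

import random

def sample_dirichlet(alpha):
    # normalize independent gamma draws to get theta ~ D(theta | alpha)
    draws = [random.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]

theta = sample_dirichlet([1.0] * 6)  # a factory with uniform alpha
print(theta, sum(theta))             # one random dice; probabilities sum to 1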
Extreme value distribution
• Outcome: the largest number among N samples from a density g(x) is larger than x;
• For a variety of densities g(x), the distribution of the maximum approaches the same extreme value (Gumbel) form, with location μ and scale λ:
– pdf: f(x) = λ exp(−λ(x − μ)) exp(−exp(−λ(x − μ)))
– cdf: F(x) = P(max ≤ x) = exp(−exp(−λ(x − μ)))
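A sketch of the Gumbel pdf and cdf above; the default parameter values here are arbitrary:

from math import exp

def gumbel_cdf(x, mu=0.0, lam=1.0):
    # P(max <= x) = exp(-exp(-lam * (x - mu)))
    return exp(-exp(-lam * (x - mu)))

def gumbel_pdf(x, mu=0.0, lam=1.0):
    # f(x) = lam * exp(-lam*(x - mu)) * exp(-exp(-lam*(x - mu)))
    z = -lam * (x - mu)
    return lam * exp(z) * exp(-exp(z))

print(gumbel_cdf(3.0))  # ≈ 0.951: most maxima fall below 3
print(gumbel_pdf(0.0))  # ≈ 0.368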
Probabilistic model
• Selecting a model
– Probability distributions
– Machine learning methods
• Neural nets
• Support Vector Machines (SVMs)
– Probabilistic graphical models
• Markov models
• Hidden Markov models
• Bayesian models
• Stochastic grammars
• Model → data (sampling)
• Data → model (inference)
Sampling
• Probabilistic model with parameters θ → P(x | θ) for event x;
• Sampling: generate a large set of events xi with probability P(xi | θ);
• Random number generator: the function rand() picks a number randomly from the interval [0, 1) with uniform density;
• Sampling from a probabilistic model → transforming P(xi | θ) to a uniform distribution
– For a finite set X (xi ∈ X), find i such that P(x1) + … + P(xi−1) < rand(0, 1) ≤ P(x1) + … + P(xi−1) + P(xi)
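A sketch of this transformation for a finite outcome set, scanning the cumulative sums until they exceed a uniform draw (sample is an illustrative helper):

import random

def sample(dist):
    # find i such that P(x1)+...+P(x_{i-1}) <= u < P(x1)+...+P(x_i)
    u = random.random()  # uniform on [0, 1)
    cumulative = 0.0
    for outcome, p in dist.items():
        cumulative += p
        if u < cumulative:
            return outcome
    return outcome  # guard against floating-point rounding at the boundary

loaded = {6: 0.5, 1: 0.1, 2: 0.1, 3: 0.1, 4: 0.1, 5: 0.1}
rolls = [sample(loaded) for _ in range(100_000)]
print(rolls.count(6) / len(rolls))  # ≈ 0.5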
Inference (ML)
• Estimating the model parameters (inference): from large sets of trusted examples
• Given a set of data D (training set), find the model parameters θ with maximal likelihood P(D | θ);
Example: a loaded dice
• Loaded dice: estimate the parameters θ1, θ2, …, θ6 from N observations D = d1, d2, …, dN
• θi = ni / N, where ni is the number of observations of outcome i, is the maximum likelihood solution (11.5)
• Inference from counts
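A sketch of maximum likelihood inference from counts (ml_estimate and the data are illustrative):

from collections import Counter

def ml_estimate(observations, outcomes=range(1, 7)):
    # maximum likelihood: theta_i = n_i / N, the observed frequency of outcome i
    counts = Counter(observations)
    n = len(observations)
    return {o: counts[o] / n for o in outcomes}

data = [6, 6, 1, 3, 6, 2, 6, 5, 6, 6]  # hypothetical observed rolls
print(ml_estimate(data))               # theta_6 = 6/10 = 0.6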


Bayesian statistics
• P(X | Y) = P(Y | X) P(X) / P(Y)
• P(θ | D) = P(θ) [P(D | θ) / P(D)] = P(θ) P(D | θ) / Σ_θ′ P(D | θ′) P(θ′)
• P(θ) → prior probability; P(θ | D) → posterior probability
Example: two dices
• Fair dice: prior 0.99; loaded dice: prior 0.01, with P(6) = 0.5, P(1) = … = P(5) = 0.1
• 3 consecutive '6's:
– P(loaded | 3 '6's) = P(loaded) × [P(3 '6's | loaded) / P(3 '6's)] = 0.01 × (0.5^3 / C)
– P(fair | 3 '6's) = P(fair) × [P(3 '6's | fair) / P(3 '6's)] = 0.99 × ((1/6)^3 / C)
– Posterior ratio: P(loaded | 3 '6's) / P(fair | 3 '6's) < 1
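The same computation as a sketch in Python, with C computed explicitly as the marginal P(3 '6's):

prior = {"fair": 0.99, "loaded": 0.01}
p6 = {"fair": 1 / 6, "loaded": 0.5}

likelihood = {m: p6[m] ** 3 for m in prior}              # P(3 '6's | model)
evidence = sum(prior[m] * likelihood[m] for m in prior)  # C = P(3 '6's)
posterior = {m: prior[m] * likelihood[m] / evidence for m in prior}

print(posterior)                                # the fair dice is still more probable
print(posterior["loaded"] / posterior["fair"])  # ≈ 0.27 < 1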
Inference from counts: including prior knowledge
• Prior knowledge is important when the data is scarce
• Use the Dirichlet distribution as the prior:
– P(θ | n) = D(θ | α) [P(n | θ) / P(n)]
– Equivalent to adding αi as pseudo-counts to the observations ni (11.5)
– We can forget about statistics and use pseudo-counts in the parameter estimation!
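A sketch of estimation with Dirichlet pseudo-counts: the posterior mean adds αi to each observed count (the names and data below are illustrative):

from collections import Counter

def pseudo_count_estimate(observations, alpha, outcomes=range(1, 7)):
    # posterior mean under a Dirichlet prior:
    # theta_i = (n_i + alpha_i) / (N + sum of alpha)
    counts = Counter(observations)
    total = len(observations) + sum(alpha.values())
    return {o: (counts[o] + alpha[o]) / total for o in outcomes}

data = [6, 6, 6]                       # scarce data: three rolls, all '6's
alpha = {o: 1.0 for o in range(1, 7)}  # assumed uniform pseudo-counts
print(pseudo_count_estimate(data, alpha))  # theta_6 = 4/9; each other theta_i = 1/9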
Entropy
• Probability distribution P(xi) over K events
• H(x) = −Σ_i P(xi) log P(xi)
– Maximized for the uniform distribution P(xi) = 1/K
– A measure of average uncertainty
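A sketch of the entropy computation (in bits), comparing the fair and loaded dice from earlier:

from math import log2

def entropy(dist):
    # H = -sum of p * log2(p) over outcomes with nonzero probability
    return -sum(p * log2(p) for p in dist.values() if p > 0)

fair = {i: 1 / 6 for i in range(1, 7)}
loaded = {6: 0.5, **{i: 0.1 for i in range(1, 6)}}
print(entropy(fair))    # log2(6) ≈ 2.585 bits, the maximum for K = 6
print(entropy(loaded))  # ≈ 2.161 bits: less average uncertainty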
Mutual information
• Measure of the independence of two random variables X and Y
• P(X | Y) = P(X), X and Y are independent → P(X, Y) / (P(X) P(Y)) = 1
• M(X; Y) = Σ_{x,y} P(x, y) log [P(x, y) / (P(x) P(y))]
– M(X; Y) = 0 → independent
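A sketch computing M(X; Y) from a joint distribution given as a dict keyed by (x, y) pairs (mutual_information is an illustrative helper):

from math import log2

def mutual_information(joint):
    # M(X;Y) = sum over (x, y) of P(x,y) * log2(P(x,y) / (P(x) P(y)))
    px, py = {}, {}
    for (x, y), p in joint.items():
        px[x] = px.get(x, 0.0) + p  # marginal P(x)
        py[y] = py.get(y, 0.0) + p  # marginal P(y)
    return sum(p * log2(p / (px[x] * py[y]))
               for (x, y), p in joint.items() if p > 0)

independent = {(x, y): 0.25 for x in "HT" for y in "HT"}
correlated = {("H", "H"): 0.5, ("T", "T"): 0.5}
print(mutual_information(independent))  # 0.0: independent
print(mutual_information(correlated))   # 1.0 bit: fully dependent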
