Probability Distributions
Probability Distributions
A mathematical formula
2
Types of Probability Distributions
Discrete probability distributions
Binomial distribution
Multinomial distribution
Poisson distribution
Hypergeometric distribution
5
T YPES OF P ROBABILITY D ISTRIBUTION [PD]
Probability
Distrib ution
Continuous
Discrete PD
PD
Binomial
Distribution Normal
Distribution
Poisson 3
Distribution
Discrete Probability Distributions
A discrete random variable is a variable that can assume only a
countable number of values
Many possible outcomes:
number of complaints per day
number of TV’s in a household
number of rings before the phone is answered
temperature of a solution
height, in inches
4
T REE DIAGRAM –
A FAIR COIN IS TOSSED TWICE
1st 2nd
H HH
T HT Possible
Outcomes
H TH
T
T TT
Attach probabilities
1st 2nd
H HH P(H,H)=½x½=¼
½
½ H
½
T HT P(H,T)=½x½=¼
½ H TH P(T,H)=½x½=¼
½ T
½
T TT P(T,T)=½x½=¼
H
HT P(H,T)=½x½=¼
*
*
½ TH P(T,H)=½x½=¼
½ T
½
T TT P(T,T)=½x½=¼
8
B INOMIAL D ISTRIBUTION
There are certain phenomena in nat ur e which can be
identified as Bernoulli’s processes, in which:
Each trial has only two possible outcomes say success or failure,
true or false etc.
Probability of occurrence of any outcome remains same over
successive trials
Trials are statistically independent
9
B INOMIAL D ISTRIBUTION
9
THE BINOMIAL DISTRIBUTION
BERNOULLI RANDOM VARIABLES
Imagine a simple trial with only two possible outcomes
Success (S)
Failure (F)
Examples
Toss of a coin (heads or tails)
Sex of a newborn (male or female)
Survival of an organism in a region (live or die)
Suppose that the probability of success is p
contract or not
A marketing research firm receives survey
reject it
THE BINOMIAL DISTRIBUTION
OVERVIEW
Example
What is the probability of obtaining 2 heads from a coin that
was tossed 5 times?
2. Mean = E(x) = np
values of p and n 9
Application
11
M EASURES OF C ENTRAL T ENDENCY AND
D ISPERSION FOR T H E B INOMIAL D ISTRIBUTION
Mean of BD: µ = np
S t andard Deviation of BD: σ = 𝑛𝑝𝑞
12
F ITTING OF B INOMIAL D ISTRIBUTION
Four coins are tossed 160 times and the following results
were obtained:
No. of heads 0 1 2 3 4
Frequency 17 52 54 31 6
13
T PHE OISSON D ISTRIBUTION
outcomes
The average number of outcomes of interest per time or space
interval is
The number of outcomes of interest are random, and the occurrence
2. Mean = λ
3. Variance = λ.
14
THE POISSON DISTRIBUTION
If we substitute µ/n for p, and let n tend to infinity, the
binomial distribution becomes the Poisson distribution:
e -µµx
P(x) =
x!
S t andard Deviation of PD is 𝜆= 𝑛𝑝
It is always right skewed.
PD is a good approximation to BD when n > or = 20
and p< or = 0.05
16
THE Gaussian or NORMAL DISTRIBUTION
Importance lies in the Central Limit Theorem, which states that the
sum of a large number of independent random variables (binomial,
Poisson, etc.) will approximate a normal distribution
Definition
A continuous random variable x is said to be a normal distribution with
parameters µ and σ2 , if the density function is given by the probability law .
following conditions.
a) n, the number of trials is indefinitely large and
b) Neither p nor q is very small.
iii) Constants of normal distribution are mean = m, variation =s2 , Standard
deviation = s.
Properties of normal distribution
1. The normal curve is bell shaped and is symmetric at x = m.
2. Mean, median, and mode of the distribution are coincide
i.e., Mean = Median = Mode = m
3. It has only one mode at x = m (i.e., unimodal)
4. The points of inflection are at x = m ± s
5. The maximum ordinate occurs at x = m and its value is =
6. Area Property P(m - s < ´ < m + s) = 0.6826
P(m - 2s < ´ < m + 2s) = 0.9544
P(m - 3s < ´ < m + 3s) = 0.9973
The Normal Distribution
‘Bell Shaped’
Symmetrical
µ
The random variable has an infinite theoretical
Mean
range: + to
= Median
= Mode
N ORMAL D ISTRIBUTION
It is a continuous PD i.e. random variable can take on any
value within a given range. Ex: Height, Weight, Marks etc.
Developed by eighteenth century mathematician – astronomer
Karl Gauss, so also called Gaussian Distribution.
It is symmetrical, unimodal (one peak).
Since it is symmetrical, its mean, median and mode all
coincides i.e. all three are same.
The tails are asymptotic to horizontal axis i.e. curve goes to
infinity without touching horizontal axis.
X axis represents random variable like height, weight etc.
Y axis represents its probability density function.
Area und er the curve tells the probability.
The total area und er the curve is 1 (or 100%)
Mean = µ, SD = σ 17
D EFINING A N ORMAL D ISTRIBUTION
Only two parameters are considered: Mean &
S tan d ard Deviation
Same Mean, Different Standard Deviations
Same SD, Different Means
Different Mean & Different Standard Deviations
18
68-95-99.7 RULE
68% of
the data
21
S TANDARD N ORMAL PD
In stand ard Normal PD, Mean = 0, SD = 1
𝑥− 𝜇
Z=
𝜎
Z = No. of std. deviations from x to mean. Also called Z Score
Let X be random variable which follows normal distribution with mean m and
variance s2 .The standard normal variate is defined as which follows
standard normal distribution with mean 0 and standard deviation 1 i.e., Z ~
N(0,1).
22
P RACTICE P ROBLEMS – N ORMAL
D ISTRIBUTION
Mean height of gurkhas is 147 cm with SD of 3 cm. W hat is
the probability of:
Height being greater t h a n 152 cm. 4.75%
Height between 140 a nd 150 cm. 83.14%