0% found this document useful (0 votes)
130 views

RV and Their Prob Dist Complete

(1) Random variables can be discrete or continuous. A discrete variable can only take countable values, while a continuous variable can take any value within a range. (2) The probability mass function (PMF) describes the probability of a discrete random variable taking on particular values. It must be greater than or equal to 0 and sum to 1. (3) Examples show how to calculate the PMF for tossing coins and rolling dice. The PMF provides the complete probability distribution.

Uploaded by

Hamza Niaz
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
130 views

RV and Their Prob Dist Complete

(1) Random variables can be discrete or continuous. A discrete variable can only take countable values, while a continuous variable can take any value within a range. (2) The probability mass function (PMF) describes the probability of a discrete random variable taking on particular values. It must be greater than or equal to 0 and sum to 1. (3) Examples show how to calculate the PMF for tossing coins and rolling dice. The PMF provides the complete probability distribution.

Uploaded by

Hamza Niaz
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 13

3 Random Variables and their Probability Distributions (1)

Random Variables
(i) Discrete random variables
(ii) Continuous random variables
Random Variables and their Probability Distributions
A variable, such as the strength of a concrete or any other material or physical quantity, whose value is uncertain or
unpredictable or nondeterministic is called a random variable or a variate if its distribution is known.
Discrete Random Variable
A discrete variable can only assume at most a countable set of isolated values, as already stated, such as
{0, 1, 2, 3, 4, 5}, {1, 2, 3, 4, …}, { 1/2 , 1/3 , 5/7, 4/9 }, or { 0.1 , 0.15, 0.75, 1.45 } etc. {1, 4, 9, 16, 25, … }
(1) Probability Mass Function of a Discrete Random Variable
If a random variable X assumes discrete values (finite or countably infinite); x1, x2, … , xn, …… , it is called discrete
random variable. The function f(x) defined as
f(x) = P[X = x] if x  RX
is called the probability function (or the probability mass function, pmf ) of the random variable X with the properties:
(i) 0  P [X = x]  1 0  f(x)  1
 
(ii)  P [X = xi ] = 1  f(xi ) = 1
i=1 i=1
(iii)
In the following examples, for defined random variables, we can find out the probability distribution.
Examples
(1) The probability distribution for the sample space for tossing three coins is
Let X denotes the number of heads; X = 0, 1, 2, 3
S = {HHH, HHT, HTH, THH, TTH, THT, HTT, TTT} 23 = 8
X: 0 1 2 3 Total
Probability P(X) 1/8 3/8 3/8 1/8 1
Called a complete probability distribution for the random variable X
Examples (1)
Find the probability distribution of the sum of the dots when two dice are thrown.
Sol.
We are familiar with the sample space when two dice are thrown:
Let X denotes the sum of the dots of the top faces, so the range of
X = 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
X = 2 means the event {(1,1)}, therefore P[X=2] =1/36
X = 3 means, the event {(1,2), (2,1)}, therefore P[X=3] = 2/36
X = 4 means the event {(1,3), (3,1), (2,2)}, therefore P[X=4] = 3/36
…. …. ….
X = 12 means the event {(6,6)}, therefore P[X=12] = 1/36
Hence the complete probability distribution is
Xi 2 3 4 5 6 7 8 9 10 11 12 Total
PX(xi) = P[X=xi] 1/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36 1

x P[X = xi] 2/36 6/36 12/36 20/36 30/36 42/36 40/36 36/36 30/36 22/36 12/36 E(X) =  x P[X = xi] = 252/36
x2 P[X=xi] 4/36 18/36 48/36 E(X2) =
Var (X) = E(X2 )  [E(X)]2

Exercise 1.
Many manufacturers have quality control programs that include inspection of incoming materials for defects.
Suppose a computer manufacturer receives computer boards in lots of five. Two boards are selected from each lot for
inspection. Suppose that boards 1 and 2 are the only defective boards in a lot of five.
List the ten different possible outcomes. Lot = {1, 2, 3, 4, 5 }
55 = 25 cases we have, when the selection is with replacement
5C = 10 cases are there when we draw, we pick up,
2
(1,1) (1,2) (1,3) (1,4) (1,5)
(2,1) (2,2) (2,3) (2,4) (2,5)
(3,1) (3,2) (3,3) (3,4) (3,5)
(4,1) (4,2) (4,3) (4,4) (4,5)
(5,1) (5,2) (5,3) (5,4) (5,5)
3 Random Variables and their Probability Distributions (2)

Suppose that boards 1 and 2 are the only defective boards in a lot of five. Two boards are to be chosen at random.
Define X to be the number of defective boards observed among those inspected. Find the probability distribution of X; X = 0,1,2
Sample space = {(1,2), (1,3), (1,4), (1,5), (2,3), (2,4), (2,5), (3,4), (3,5), (4,5) } , 5C2 = 10

X: 0 1 2 Total
P(X): 3/10 6/10 1/10 1
Exercise 2.
Let X be the number of years before a particular type of machines will need replacement. Assume that X has the
probability function f(1) = 0.1, f(2) = 0.2, f(3) = 0.2, f(4) = 0.2, f(5) = 0.3. Find the probability that machine needs no replacement
during the first 3 years. (Kreyszig 7th Ex 24.5)
P(that machine needs no replacement during the first 3 years) = 1  P(that machine needs replacement during the first 3 years)
= 1  {0.1 + 0.2 + 0.2}

OR
P(that machine needs no replacement during the first 3 years) = P(that machine will need replacement during 4 th or 5th year)
= 0.2 + 0.3 = 0.5

Exercise 3.
If X has the probability function f(x) = k/2x ( X = 0, 1, 2, 3, 4, 5, 6, … ) What is the value of k and find P(X  4) (Kreyszig 7th Ex 24.5)
Solution: as f is a probability function, therefore
 f(x) = 1
  k/2x = 1
K  1/2x = 1
a 1
By solving k[1 + ½ + 1/2 2 + 1/23 + ……] = 1 S = = 1=2
1r
12
2k = 1
We get k = ½ now our pmf becomes f(x) = (½)/2x = 1/ 2x+1
Hence p.m.f becomes f(x) = 1/2x+1 X = 0, 1, 2, 3, …
P(X  4) = P[X = 4] + P[X = 5] + …
= 1/25 + 1/26 + …… is G.P with a = ½5 and r = ½
a 1/25 1/25 2 1
now solve it S = = 1 = 1 = 1  25 = 1/ 2 = 1/16
4
1r
12 2
Repeat the same problem f(x) = k/3 ( X= 0, 1, 2, 3, 4, 5, 6, … )
x

Expectation
(Mean and Variance of a Discrete Random Variable)
Two numbers are often used to summarize a probability distribution for a random variable X. The mean is a measure
of the center or middle of the probability distribution, and the variance is a measure of the dispersion, or variability in the
distribution.
Definition
The mean or expected value of a random variable X denoted by E(X) indicates its weighted mean of all of its possible
values x1,x2, … ,xn each weighted by the respective probability.
Let X be a discrete random variable having values x1, x2, … , xn with corresponding probabilities P(X=x1) , P(X=x2), … ,
P(X=xn) such that  P(X=xi) = 1. Then the mathematical expectation or the expectation or the expected value of X, denoted by
E(X) is defined as
n
E(X) = x1 P(X=x1) + x2 P(X=x2) + … + xn P(X=xn) =  xi P(X=xi)
i =1
The measure of variability of a random variable X is referred to as the variance of the random variable or the variance
of the probability distribution of X is denoted by Var (X) and defined as:
n
Var(X) = E[(X  )2 ] =  (X  )2 P(X=xi) , if X is discrete and
i =1
Var(X) = E(X2)  [E(X)]2
Examples (2)
Two new product designs are to be compared on the basis of revenue potential. Marketing feels that the revenue
from design A can be predicted quite accurately to be $3 million. The revenue potential of design B is more difficult to assess.
Marketing concludes that there is a probability of 0.3 that the revenue from design B will be $7 million, but there is a 0.7
probability that the revenue will be only $2 million. Which design do you prefer?
3 Random Variables and their Probability Distributions (3)

Solution
Let X denote the revenue from design A. Because there is no uncertainty in the revenue from design A, we can model
the distribution of the random variable X as $3 million with probability 1. Therefore, E(X) = $3 million.
Let Y denote the revenue from design B. The expected value of Y in millions of dollars is
E(Y) = $ 7(0.3) + $2(0.7) = $3.5
Because E(Y) exceeds E(X), we might prefer design B. However, the variability of the result from design B is larger.
That is,
2 = (7 – 3.5)2 (0.3) + (2 – 3.5)2 (0.7)
= 5.25 millions of dollars squared
 Standard Deviation =  = 2 = $2.29
Because the units of standard deviation are the same as the units of the random variable, the standard deviation is
easier to interpret. In this example, we can summarize our results as “the average deviation of Y from its mean is $2.29 million.’
Examples (3)
How many heads would you expect if you flipped a coin twice? S = {HH, HT, TH, TT}
X = number of heads = {0, 1, 2} with p(0) = ¼, p(1) = ½ and p(2) = ¼
Weighted Average = E(X) = 0¼ + 1½ + 2¼ = 1
and E(X2) = 02¼ + 12½ + 22¼ = 3/2
Then Var (X) = E(X2)  [ E(X) ]2 = 3/2  1 = ½
Examples (4)
Roll a die. If the side comes up is odd, you win the $ equivalent of that side. If it is even, you lose $ 4. Let X denotes
our earning, so the possible values of the variable X are: 1,3,5, -4, therefore
X 1 3 5 -4
P(X=x) 1/6 1/6 1/6 1/6
E(X) = 11/6 + 31/6 + 51/6 + (-4)  3/6 = - ½
E(X2) = 121/6 + 321/6 + 521/6 + (-4)2  3/6 = 1/6 + 9/6 + 25/6 + 48 / 6 = 13.8
Var (X) = 13.8  0.25 = 13.55
Exercise 4.
If it rains, an umbrella salesman can earn $30 per day. If it is fair he can lose $6 per day. What is his expectation, if the
probability of rain is 0.3?
X = 30, 6 , P[X = 30] = 0.30 , P[X =  6] = 0.7
E(X) = 30  0.3 + (6)  0.7 = $ (9  4.2) = $ 4.8
Exercise 5.
A coin is biased such that a head is three times as likely to occur as a tail. Find the expected number of tails when this
coin is tossed twice. (Walpole Problem 4.4)
S = {HH, HT, TH, TT}
p + 3p = 1  p = ¼ i.e. the probability of the tail, hence the probability of the head is ¾
X denoted the number of tails; X = 0, 1, 2
P[X = 0] = (¾)2
P[X = 1] = 2C1 (¼)(¾) HHHHT, HHHTH, HHTHH, HTHHH, THHHH = 5C1 = 5
P[X = 2] = (¼) 2

E(X) =
Exercise 6.
A man draws two balls from a bag containing 3 white and 5 black balls. If he receives $.70 for every white ball he
draws and $.7 for every black ball, find his expectation?
S = {WW, WB, BW, BB}
Let denotes the winning amount, X = $ 140, 77, 14
Case (i) Both white , corresponding amount is $140, P[X = 140] = 3/8  2/7
Case (ii) one white and one black, corresponding amount is $77, P[X = 77] = 3/8  5/7 + 5/8  3/7
= 2  (3/8  5/7)
Case (iii) both black , corresponding amount $14 , P[X = 14] = 5/8  4/7
E(X) = $(140) (3/8  2/7) + $77  (3/8  5/7 + 5/8  3/7 ) + $14 (23/8  5/7)

Exercise 7.
In summer season, a dealer of desert room cooler can earn Rs.500 per day if the day is hot and can earn Rs.200 per
day if it is fair and loses Rs.60 per day if it is cloudy. Find the expectation if the probability of the day being hot is 0.40, for being
fair it is 0.35 and being cloudy it is 0.25?
Exercise 8.
One thousand tickets are sold at $1 each for a color television valued at $350. What is the expected value of the gain
if you purchase one ticket?
3 Random Variables and their Probability Distributions (4)

Example 5.12, page 264, “Introductory Statistics” by Bluman


(1) Bernoulli Probability Distribution:
An activity is repeated one time with two possible outcomes, called Beroulli trial / experiment
(i) Tossing a coin and observe number of heads
If X denotes number of heads then X = 0, 1, S = {T, H} and P[X = 0] = ½ , P[X = 1] = ½
X 0 1 Total
P[X=x] ½ ½ 1
Suppose a random experiment has two outcomes, namely Success and Failure. Let probability of success be
p and probability of failure be q = 1  p. Such an experiment is called Bernoulli experiment or Bernoulli trial.
Trials of random experiment are called Bernoulli trials, if they satisfy the following conditions:
1. There is only one trial
2. There are exactly two outcomes: success or failure with probability p and q respectively
3. The outcomes (or the events) are independent.
4. The sum of the probabilities of success and failure is equal to 1. i.e. p + q = 1
(2) Binomial Probability Distribution
Binomial experiment is a Bernoulli trial repeated finite many times
In binomial experiment, a trial is repeated finite many times and every time we have exactly two possible outcomes.
Consider a fixed number n of mutually independent Bernoulli trails. Suppose these trials have same probability of
success, say p. A random variable X is called a binomial random variable if it represents the total number of successes in n
independent Bernoulli trials.
FFFSFFSFSFFFSSSFFFSSFFFSF (n outcomes)
Thus, to find the probability density function of X we have to find the probability of x successes in n independent trails.
If we have x successes in n trails, then the probability of each n-tuple with x successes and n − x failures is
px (1 − p)n−x. as p + q = 1
However, there are nCx tuples with x successes and n − x failures in n trials.
x nx
Hence P[X = x] = nCx p q ( x = 0, 1, 2, … ,n )
Therefore the probability mass function of X is
x nx
f(x) = nCx p q ( x = 0, 1, 2, … ,n )
The random variable X is called the binomial random variable with parameters n and p, if its probability density
function is of the form
Probability for x successes out of n
x nx
P[X =x] = nCx p q ( x = 0, 1, 2, … ,n )
Mean and variance for a binomial random variable X
Mean = E(X) = np , Variance = E(X2)  [E(X)]2 = npq
Example suppose a coin is tossed 10 time, probability of a head is twice than the probability of a tail. Observe the
number of heads and find a complete probability distribution. Particularly, find the probability to obtain 6 heads in this activity.
Assume w is the probability of a tail, so 2w will be the probability of a head
W + 2W = 1  W = 1/3 i.e the probability of a tail
If X denotes number of heads ; X = 0, 1, …, 10
P = probability of a head = 2/3 and q = 1-p = probability of a tail = 1/3 such that p + q = 1
Now the probability mass function for X
P[X = x] = nCx  (p)x (q)n-x , X = 0, 1, 2, … ,10
P[X = 6] = 10C6  (2/3)6 (1/3)4 HHHHHHTTTT
Example
A fair coin is tossed 5 times, and consider the number of heads find the probability to get 2 heads. (binomial)
Find the probability that we get 2nd head at 5th toss of a coin. (negative binomial) THTTH 4C
1 p2 q3
X denotes number of heads; X = 0, 1, 2, 3, 4, 5, p = probability of one head = ½ , q = 1 – p = ½
P(2 heads) = P[X = 2] = 5C
2 p2 q3 = 5C2  ( ½ )2  (½)3 = 5C2  p2 q3
HHTTT p2 q3
THHTT
TTHHT
TTTHH
THTHT
HTHTT
TTHTH
HTTHT
3 Random Variables and their Probability Distributions (5)

HTTTH
THTTH
P(X=2) = 10  p2 q3
Repeat the question:
A coin is biased such that a head is three times as likely to occur as a tail. and consider the number of heads find the
probability to get 2 heads.
w + 3w = 1  4w = 1  w = ¼ i.e the probability of a tail, therefore the probability of a head is ¾
X denotes the number of heads, X = 0,1,2 p = probability of one head = ¾ hence q = ¼
P(2 heads) = P[X = 2] = 5C2  ( ¾ )2  (¼)3 =
Exercise 9.
If 20% of the bolts produced by a machine are defective, determine the probability that out of 4 bolts chosen
at random, there is one bolt defective.
Let X denotes the number of defective bolts in a sample of 4, X = 0,1,2,3,4
p = probability of a defective bolt = 0.20, q = 1-p = 0.80
P[X=x] = nCx px qn-x X = 0,1,2,3,4
(a) P[X = 1] = 4C
1  (0.2)1  (0.8)3 = 0.65 i.e. probability of getting 1 defective bolt
(b) P[X = 0] = 4C0  (0.2)0  (0.8)4 =
(c) P[X<2] = P[X = 0] + P[X = 1]
(d) Mean = np = 4  0.20 , var(X) = npq = 4  0.20  0.80
(negative binomial: what is the probability that 2nd defective bolt comes at 4th draw    D 3C
1 p2 q2 )
Examples (5)
On a five-question multiple-choice test there are five possible answers, of which one is correct. If a student guesses
randomly and independently.

(i) what is the probability that she is correct only on questions 1 and 4? pqqpq (1/5)2  (4/5)3
(ii) what is the probability that she is correct only on two questions? 2  (1/5)  (4/5)
5C 2 3

Solution (i)
1 4
The probability of a success is p = 5 , and thus 1  p = 5 , therefore, the probability that she is correct on question 1 and
4 is
1 2 4
3
64
P(correct on question 1 and 4) = p2 (1  p)3 =  5  = 55 = 0.02048
  5
Solution (ii)
1 2 4
3
64
P(correct on any two questions) = 5C2 p2 (1  p)3 = 10   5  = 55 = 0.2048
  5
Mean and Variance
The mean and variance of the binomial distribution b(x; n, p) are
 = np and 2 = npq
S = {HHH, HHT, HTH, THH, HTT, THT, TTH, TTT}

Examples (6)
Tests for impurities commonly found in drinking water from private wells showed that 30% of all wells in a particular
county have impurity A. If a random sample of five wells is selected from the large number of wells in the county, what is the
probability that:
a. Exactly three will have impurity A? P[X = 3] = 5C3  (0.3)3  (0.7)2
b. At least three? P[X  3] = P[X = 3] + P[X = 4] + P[X = 5]
c. Fewer than three? P[X < 3] = P[X = 0] + P[X = 1] + P[X =2]
Solution
X denotes the number of wells having impurity A, X = 0, 1, 2, 3, 4,5
p = probability of a well having impurity A = 0.30 q = 1-p = 0.70
The sampling procedure represents a Binomial experiment with n = 5 and p = 0.3.
3 Random Variables and their Probability Distributions (6)

Examples (7)
A large chain retailer purchases a certain kind of electronic device from a manufacturer. The manufacturer indicates
that the defective rate of the device is 3%.
(a) The inspector of the retailer randomly picks 20 items from a shipment. What is
the probability that there will be at least one defective item among these 20?
(b) Suppose that the retailer receives 10 shipments in a month and the inspector randomly tests 20 devices per
shipment. What is the probability that there will be 3 shipments containing at least one defective device out
of 20?
(Example 5.6, Book: Probability and Statistics by Walpole 8th Ed)
Solution (a)
Denote by X the number of defective devices among the 20. This follows that
, X = 0, 1, 2, …, 20 and p = 0.03 = probability for one defective, n = 20
P[X =x] = nCx px qn-x , X = 0, 1, 2, …, 20
the probability of a defective device, therefore
P[at least one defective] = P(X  1) = 1 - P(X = 0) = 1 – 20 C0 p0 q20-0
= 1 - 0.03°(1 - 0.03)20 = 0.4562
This is now a criteria for part (b), that in a shipment if we have at least one defective item out of 20
Solution (b)
Y denotes the number of shipments having at least one defective in a sample of 20, Y = 0, 1, 2, … , 10
p = the probability of a shipment having at least one defective in a sample of 20, = 0.4562
obviously q = 1-p = 0.5438
P[Y = 3] = 10C3 p3 q7
OR
In this case, each shipment contains at least one defective. Hence, testing the result of each shipment is a trial with p
= 0.4562 from part (a). Assuming the independence from shipment to shipment and denoting by Y the number of shipments
containing at least one defective item Y follows another binomial distribution b(x; 10, 0.4562). Therefore, the answer to this
question is
P(Y = 3) = 10 C3 0.45623(1 - 0.4562)7 = 0.1602
(3) The Poisson Probability Distribution
Definition
When n is large and value of p is < 0.05, we prefer Poisson distribution
Given an interval of real numbers, assume counts occur at random throughout the interval. If the interval can be
partitioned into subintervals of small enough length such that
(1) the probability of more than one count in a subinterval is zero,
(2) the probability of one count in a subinterval is the same for all subintervals and
proportional to the length of the subinterval, and
(3) the count in each subinterval is independent of other subintervals,
the random experiment is called a Poisson process.
The random variable X that equals the number of counts in the interval is a Poisson random variable with parameter 0 < , and
xe-
the probability mass function of X is P(X = x) = x! , x = 0, 1, 2, … where  = np
The characteristics of a Poisson random variable are listed below:
the experiment consist of counting the number X of times a particular event occur during a given unit of time, or
in a given area of volume (or weight, distance, or any other unit of measurement).
The probability that an event occurs in a given unit of time, area, or volume is the same for all the units.
The number of events that occur in one unit of time, area, or volume is independent of the number that occurs in
other units.
The mean (or expected) number of events in each unit will be denoted by the Greek letter meu, 
The Mean and Variance
Mean =  and 2 = 
 is called the parameter of a Poisson distribution.
Examples (8)
In a certain industrial facility, accidents occur infrequently. It is known that the probability of an accident on any given
day is 0.005 and accidents are independent of each other.
(a) What is the probability that in any given period of 400 days there will be an accident on one day?
(b) What is the probability that there are at most three days with an accident? (Walpole Example 5.19)
Solution
Let X denotes number of days in which we have accident; X = 0, 1, 2, … , 400
We prefer here Poisson distribution, n = 400 and p = the probability of one day to have an accident = 0.005. Thus,
 = np = 2.
3 Random Variables and their Probability Distributions (7)

xe- 2xe-2
Using the Poisson approximation, P(X = x) = x! = , x = 0, 1, 2, … where  = 2
1!
2xe-2
(a) P(that there will be an accident on one day) = P(X = 1) = 1! = e−221 = 0.271 and
3
(b) P(X ≤ 3) = P[x = 0] + P[X=1] + P[X = 2] + P[X = 3] =  e-22x/x! = 0.857
x=0
Examples (9)
In a manufacturing process where glass products are made, defects or bubbles occur, occasionally rendering the piece
undesirable for marketing. It is known that, on average, 1 in every 1000 of these items produced has one or more bubbles.
What is the probability that a random sample of 8000 will yield fewer than 7 items possessing bubbles?
Solution
This is essentially a binomial experiment with n = 8000 and p = 0.001. Since p is very close to 0 and n is quite large, we
shall approximate with the Poisson distribution using
 = np = (8000)(0.001) = 8.
Hence, if X represents the number of items having bubbles, we have
6 xe- 80e- 81e-8 xe- xe- xe- xe- xe-
P(X < 7) =  x! = 0! + 1! + x! + x! + x! + x! x! = 0.3134.
x=0
Examples (10)
If there are 200 typographical errors randomly distributed in a 500-page manuscript, find the probability that a given
page contains exactly 3 errors. (Bluman Example 5.27)
Solution
First, find the mean number  of errors. Since there are 200 errors distributed over 500 pages, each page has an
average of
200 2
 = 500 = 5 = 0.4
xe- (0.4)3e-0.4
P(X; ) = x! = 3! = 0.0072
Thus, there is less than a 1% chance that any given page will contain exactly 3 errors.
Examples
For the case of the thin copper wire, suppose that the number of flaws follows a Poisson distribution with a mean of
2.3 flaws per millimeter.
(i) Determine the probability of exactly two flaws in 1 millimeter of wire.
Let X denote the number of flaws in 1 millimeter of wire. Then, E(X) = 2.3 flaws and
e-2.3 (2.3)2
P(X = 2) = 2! = 0.265
(ii) Determine the probability of 10 flaws in 5 millimeters of wire.
Let X denote the number of flaws in 5 millimeters of wire. Then, X has a Poisson distribution with
E(X) = 5 mm  2.3 flaws/mm = 11.5 flaws
e-11.5 (11.5)10
Therefore P(X = 10) = 10! = 0.113
(iii) Determine the probability of at least one flaw in 2 millimeters of wire.
Let X denote the number of flaws in 2 millimeters of wire. Then, X has a Poisson distribution with
E(X) = 2 mm  2.3 flaws/mm = 4.6 flaws
Therefore,
P(X  1) = 1  P(X = 0) = 1  e-4.6 = 0.9899
Practical Interpretation: Notice that when a probability was requested for 2 mm of wire 2 was adjusted to 4.6, the
mean number of flaws in 2 mm. With such adjustments, probabilities can be calculated for intervals of any size.
Examples (11)
Suppose that number Y of a company’s employees absent on Mondays has approximately a Poisson probability
distribution. Furthermore, assume that the average number of Monday absent is 2.5.
(a) Find the mean and the standard deviation of Y, the number of employees absent on Monday.
(b) Find the probability that exactly 5 employees are absent on a given Monday.
(c) Find the probability that 2 or more employees are absent on a Monday.
Solution
a) The mean and variance of a Poisson random variable are both equals to . Thus, for this example
 =  = 2.5 and 2 =  = 2.5
then the standard deviation is
 = 2.5 = 1.58
b) We want the probability that exactly 5 employees are absent on Monday.
3 Random Variables and their Probability Distributions (8)

ye-
The probability distribution for y is P(y) = y!
Then, since  = 2.5, y = 5 and e-2.5 = 0.082085
(2.5)5e-2.5 (2.5)2 (0.082085)
 P(5; 2.5) = 5! = 5 . 4 . 3 . 2 . 1 = 0.067
c) to find the probability that two or more employees or absent on Monday,
we need to find

P(y 2) = P(2) + P(3) + P(4) + … =  P(y)
y=2
In order to find the probability of this event, we must consider the complementary event.
Thus, P(y 2) = 1 – P(y  1) = 1 – [P(0) + P(1)]
(2.5)0e-2.5 (2.5)1e-2.5 1 . (0.082085) (2.5)2 (0.082085)
=1- 0! - 1! = 1 - 1 - 1 = 1 – 0.287 = 0.713
According to our Poisson model, the probability that two or more employees are absent on Monday is 0.713.
Examples (12)
The probability that a man aged 50 years will die within a year is 0.01125. What is the probability that of 12 such men
at least 11 will reach their fifty-first birthday?
Solution
Required probability that at least 11 will not die within a year.
Let X denotes the number of men who will die within a year (i.e. they will not reach their 51 st birthday)
X = 0, 1, 2, 3, … , 12
Here p = 0.01125 i.e. a man will die within a year, and n = 12. We compute the desired probability by means of
xe-
Poisson distribution because the probability of death is very small. P[X = x] = x!
Therefore  = np = 12  (0.01125) = 0.135, and the Poisson distribution is
e-0.135(0.135)x
P[X = x] = x! , X = 0, 1, 2, 3, … , 12
P(at least 11 will reach their 51 birthday )
st

= P(at least 11 will not die within a year)


= P(11 will not die) + P(12 will not die)  P(X =11) + P(X =12)
= P(1 will die) + P(0 will die)
= P(X = 1) + P(X =0)
e-0.135(0.135)1 e-0.135(0.135)0
= 1! + 0!
= 0.8737 + 0.1179
= 0.9916
Exercise
1. If on the average 2 cars enter a certain parking lot per minute, what is the probability that during any given
minute 4 or more cars will enter the lot? Ans. 14.3% (Kreyszig 10 th Ed)
2. A business receives order at an average rate of 1 per minute. What is the probability of getting three orders in
one minute? (0.0613 or 6%)
3. An emergency service receives an average of 2.1 false alarms per day. What is the probability of getting four
false alarms in a given day? (0.0992 or 10%)
4. On average the demand for a certain product is four per week. If the stock at the beginning of each week is
renewed so that there are always 6 in store, what is the probability of running out of stock in any week (13.4 %)
5. A call centre has a capacity to deal with 25 calls per minute on average. What is the probability of getting 30 calls
in any minute period? (4.5%)
6. The average time taken for a worker to assemble a certain product is 45 minutes. There are 10 workers
employed to make these assemblies. What is the probability of assembling 10 units in an hour? (8%)

A discussion

1. (bernoulli) A biased coin is tossed with p = 2/3 and q = 1/3, X denotes the number of heads, X = 0, 1 find their
prob distribution.
2. A coin is tossed 10 times find the probability to get exactly 6 heads. 10C6 p6 q106
3. Find the probability that 6th head comes at 10th toss. (generally we can say, how many times we have to toss a
coin to get 6 number of heads) (Negative Binomial distribution)
3 Random Variables and their Probability Distributions (9)

4. Find the probability that first head comes at 10th toss. (geometric distribution)

Let a coin is biased, with the probability of a head twice than the probability of a tail.
p+2p = 1  p = 1/3 i.e. the probability of a tail, therefore probability of a head = 2/3

H T H H T T H H T H

P[X = 10] = 9C5  (2/3)6 (1/3)4


P[X = x] = x-1Ck-1 pk qx-k general formula for negative binomial distribution

X denotes the number of trials to get k successes, X = k , K+1 , k+2, …


P[X = x] = x-1Ck-1 pk qxk
(2) Geometric what is the probability to get 1st head in 10th toss
T T T T T T T T T H P[X = 10] = 10-1C1-1 p1 qx1 = p qx-1
1 . (1/3)9 (2/3)1
P[X = x] = 1. p1 qx1

Observe these two statements


(i) A coin is tossed 10 times, what is the probability to 4 heads. 10C4 p4 q6 (binomial)
(ii) What is the probability to get 4th head on 10th toss. HTHTTHTTTH 9C3 p4 q6 (negative binomial)

Note that in first statement trials are fixed but successes vary
But in second statement successes are fixed whereas the trials vary
NNNYNNNY 7C1 p2q6

(4) The Negative Binomial Probability Distribution Probability Mass Function


If repeated independent trials can result in a success with probability p and a failure with probability q = 1 − p, then
the probability distribution of the random variable X, the number of the trial on which the kth success occurs, is

P[X = x] = ( xk -- 11 ) p q k x–k , X = k, k+1, k+2, …


The mean and variance of a negative binomial random variable are, respectively,
k kq
=p and 2 = p2

TTTTTHTTTTTTTTTTHTTH 19C
2 p3 q17
Examples (1)
For a certain manufacturing process, it is known that, on the average, 10 in every 100 items is defective. What is the
probability that the 20th item inspected is the third defective item found? NDNNNNNNNNNDNNNNNNND 19C p3 q17
2
(geometric: what is the probability that 20 item is the first defective item NNNNNNNNNNNNNNNNNNNND C0  p q19 = p1q19
th 19 1

Solution

P[X = x] = ( xk -- 11 ) p q
3 5–3

here X = 20th trial containing k = 3rd defective item and p = 10/100 = 0.10

Therefore, P[X = 20]. = ( 203 --11 ) (0.10) (0.90) 3 20 – 3 = ( 192 ) (0.10) (0.90)
3 17

Exercise 1.
To attach the housing on a motor, a production line assembler must use electrical tool to select and tighten four bolts.
Suppose that the probability of setting and tightening a bolt in any 1-second is p = 0.8. if if the assembler fails in the first second,
the probability of success during the second 1-second interval is 0.8, find

(i) probability distribution of X, the length of time until complete housing is attached. P[X = x] = ( xk -- 11 ) p q
k x

–k

(ii) probability at X = 6
(iii) the mean and variance of X
Exercise 2.
The probability that a person will install a black telephone in a residence is estimated to be 0.3. find the probability
that the 10th phone installed in a new sub-division is the 5th black phone.
Exercise 3.
3 Random Variables and their Probability Distributions (10)

A Web site randomly selects among 10 products to discount each day. The color printer of interest to you is discounted today.
(a) What is the expected number of days until this product is again discounted?
(b) What is the probability that this product is first discounted again exactly 10 days from now?
(c) If the product is not discounted for the next five days, what is the probability that it is first discounted again 15 days from now?
(d) What is the probability that this product is first discounted again within three or fewer days?
(5) The Geometric Probability Distribution Probability Mass Function
If repeated independent trials can result in a success with probability p and a failure with probability q = 1 − p, then
the probability distribution of the random variable X, the number of the trial on which the first success occurs, is
P[X = x] = p qx -1 , x = 1, 2, 3, …
The mean and variance of a geometric random variable are, respectively,
1 1-p
=p and 2 = p2
Examples (2)
For a certain manufacturing process, it is known that, on the average, 1 in every 100 items is defective. What is the
probability that the fifth item inspected is the first defective item found? FFFF S 1  p1 q4
Solution
Using the geometric distribution with x = 5 and p = 0.01, we have
P[X = x] = p qx -1 , x = 1, 2, 3, …
P[X = 5] = (0.01)(0.99)4 = 0.0096.
Geometric and Negative Binomial Distributions
Exercise 4.
The probability of a successful optical alignment in the assembly of an optical data storage product is 0.8. Assume the
trials are independent.
(a) What is the probability that the first successful alignment requires exactly four trials?
(b) What is the probability that the first successful alignment requires at most four trials?
(c) What is the probability that the first successful alignment requires at least four trials?
(Douglas Montgomery 3rd Problem 3.73)
Exercise 5.
A particularly long traffic light on your morning commute is green 20% of the time that you approach it. Assume that
each morning represents an independent trial.
(i) What is the probability that the first morning that the light is green is the fourth morning that you approach it?
(ii) What is the probability that the light is not green for 10 consecutive mornings?
(Douglas Montgomery 3rd Problem 3.73 and 3.75)
Exercise 6.
For a certain manufacturing process, it is known that, on the average, 1 in every 100 items is defective. What is the
probability that the fifth item inspected is the first defective item found?
(Douglas Montgomery 5th Example 5.15)
3 Random Variables and their Probability Distributions (11)

(6) The Hypergeometric Probability Distribution


Characteristics of Hypergeometric Random Variable
The probability of selecting x successes from the k items labeled successes and n − x failures from the N − k items
labeled failures when a random sample of size n is selected from N items. This is known as a hypergeometric experiment, that is,
one that possesses the following two properties:
1. A random sample of size n is selected without replacement from N items.
2. Of the N items, k may be classified as successes and N − k are classified as failures.
Example a box contains 3 white and 5 black balls , 2 are randomly selected, what is the probability that our sample contains
exactly 1 white.

P[one white] = 3/85/7 + 5/83/7 =


(1) (1)
3 5

=
(2)
8

P[both white] = 3/8  2/7 =


(2) (0)
3 5

(82)
The Probability Distribution Function
The number X of successes of a hypergeometric experiment is called a hypergeometric random variable. Accordingly,
the probability distribution of the hypergeometric variable is called the hypergeometric distribution, and its values are denoted
by h(x;N, n, k),
The hypergeometric probability distribution is given by

P(X = x) =
(x) (n-x)
r N-r

[x = Maximum {0,n-(n-r)}, … ,Minimum (r,n)]


(Nn)
Where N = total no. of elements
r = no. of S’s in the N elements
n = no. of elements drawn
x = no. of S’s drawn in the n elements
The mean and variance of the hypergeometric random variable are, respectively,
nr r(N-r) n (N-n)
 = N and 2 = N2(N-1)
Example: two cards are drawn from a deck of 52 playing cards, what is the probability that one is heart.
If we draw 2 cards from a deck of 52 playing cards without replacement, then construct a probability distribution for
the hearts in our sample.
Let X denotes the number of hearts in our sample; X = 0, 1, 2,
39C  13C
2 0
P[X = 0] = 39/52  38/51 = 52C =
2
39C  13C
1 1
P[X = 1] = 2C1  13/52  39/51 = 52C
2
13C  39C
2 0
P[X =2] = 13/52  12/51 = 52C
2

Exercise 7.
A lot of 12 compressor tanks is checked to see whether there are any defective tanks. Three tanks are checked for
leaks. If 1 or more of the 3 is defective, the lot is rejected. Find the probability that the lot will be rejected if there are actually 3
defective tanks in the lot. (Bluman)
Total = 12, Defectives = 3 and non-defective = 9
Let X denoted the number of defective tanks in the sample; X = 0, 1, 2, 3 DDN, DND, NDD
P[that the lot will be rejected] = P[1 or more of the 3 is defective] = P[at least one is defective]
= P[X  1] = P[X=1] + P[X=2] + P[X=3] = 3 3/129/118/10 + 3 3/122/119/10 +
3C  9C
1 2
3/122/11  1/10 DNN, NDN, NND 12C
3
= 1  P[X=0]
= 1  P[ 0 is defective] =
= 1  9/12  8/11  7/10
3 Random Variables and their Probability Distributions (12)

3
9C 3C
0
=1 12C =?
3

Exercise 8.
Lots of 40 components each are deemed unacceptable if they contain 3 or more defectives. The procedure for
sampling a lot is to select 5 components at random and to reject the lot if a defective is found. What is the probability that
exactly 1 defective is found in the sample if there are 3 defectives in the entire lot?
(Example 5.9, Book: “Probability and Statistics for Engineers and Scientists” by Walpole 9 th Ed)

Total = 40, defectives = 3, non-defective = 37 sample size = 5


X denotes number of defective items in our sample X = 0,1,2,3,4,5, DNNNN, NDNNN, NNDNN, NNNDN, NNNND
1  C4
3C 37
P[X = 1] = 5C1  3/40  37/39  36/38  35/37  34/36 = 40C =?
5
Exercise 9.
A shipment of 24 electric typewriters is rejected if 3 are checked for defects and at least 1 is found to
be defective. Find the probability that the shipment will be returned if there are actually 6 typewriters that are defective.
(Problem 5.4.21, “Introductory Statistics” by Bluman)
P[that the lot will be rejected] = P[at least one is defective] = P[X  1] = P[X=1] + P[X=2] + P[X=3]
= 1  P[X=0] = 1  18/24  17/23  16/22
18C  6C
3 0
=1 24C =?
3

Exercise 10.
A batch of parts contains 100 parts from a local supplier of tubing and 200 parts from a supplier of tubing in the next
state. If four parts are selected randomly and without replacement, what is the probability they are all from the local supplier?
(Douglas Montgomery 3rd Example 3.27)

Exercise 11.
A lot of 75 washers contains 5 in which the variability in thickness around the circumference of the washer is
unacceptable. A sample of 10 washers is selected at random, without replacement.
(a) What is the probability that none of the unacceptable washers is in the sample?
(b) What is the probability that at least one unacceptable washer is in the sample?
(c) What is the probability that exactly one unacceptable washer is in the sample?
(d) What is the mean number of unacceptable washers in the sample?
(Douglas Montgomery 3rd problem 3.90)
Exercise 12.
Lots of 40 components each are deemed unacceptable if they contain 3 or more defectives. The procedure for
sampling a lot is to select 5 components at random and to reject the lot if a defective is found.
a) What is the probability that exactly 1 defective is found in the sample if there are 3 defectives in the entire
lot?
b) we have a lot of 100 items of which 12 are defective. What is the probability that in a sample of 10, 3 are
defective?
c) Find the mean and variance of the random variable (Walpole Example 5.9)
3 Random Variables and their Probability Distributions (13)

1. Descriptive statistics : graphs, mean, weighted mean, hypergeometric mean, mode, quartiles, deciles, percentiles, also using
ogive, Q.D, M.D, Var, S.D, Coeff of Var, Coeff of S.D, Coeff of dispersion, imperical rule, Chebshev theorem, moments, moments
ratios, skewness and Kurtosis
2. Probability
3. Discrete random variables: random variables and their distributions, mathematical expectation, Bernoulli, binomial, Poisson,
negative binomial, geometric, hypergeometric probability distributions.
4. Continuous random variables: uniform dist, normal dist, exponential dist

You might also like