Module 2 - Lesson 2
Module 2 - Lesson 2
MATH 212
ENGINEERING DATA ANALYSIS
1|Page
ENGINEERING DATA ANALYSIS
Faculty Information:
Getting help
2|Page
ENGINEERING DATA ANALYSIS
TABLE OF CONTENTS
CONTENTS PAGE
Lesson 2 ………………………………………. 4
Application 2…………………………………………. 12
References …………………………………………. 15
3|Page
ENGINEERING DATA ANALYSIS
Learning Outcome:
o Apply the fundamentals of probability to determine the probability of a
discreet random variable.
Introduction:
Discrete random variables are used in many practical applications. The three
important discreet random variables—the binomial, the Poisson, and the
hypergeometric and their usefulness in different physical situations are presented in this
lesson. These random variables are often used to describe the number of occurrences
of a specified event in a fixed number of trials or a fixed unit of time or space.
Abstraction:
1. Basic Concept: Probability Distribution
A probability distribution may be defined as a distribution showing the
different values of a random variable with their corresponding probabilities.
Generally, when it is not advisable to list all the possible values of the random
variable because the process is too cumbersome, a mathematical function is used to
characterize or generalize the correspondence involved. This correspondence which
assigns probabilities to the values of the random variable is referred to as a
probability function.
For a random variable X, the probability that it takes on x as value is denoted
by either
f(x) or P(X = x)
Thus if we toss a coin three times and denote the number of heads that fall as
X, then we have:
f(0) = P(X = 0) = 1/8
f(1) = P(X = 1) = 3/8
f(2) = P(X = 2) 3/8
f(3) = P(X = 1) = 1/8
In an experiment of tossing a die, if we denote the number that occurs by X,
the probability function may be written as:
f(x) = 1/6, x = 1, 2, 3, 4, 5, 6.
4|Page
ENGINEERING DATA ANALYSIS
5|Page
ENGINEERING DATA ANALYSIS
f(x, k) = , x = 1, 2, 3, . . . , n
f(x, 15) = , x = 1, 2, 3, . . . , 15
P(x < 6) = + + + + = =
Most often an experiment consists of repeated trials, each with two possible
outcomes that may be defined as a success or a failure. This referred to as the
Bernoulli process and each trial is called a Bernoulli trial.
The number, x, of successes in n Bernoulli trials is called a binomial random
variable that has a probability distribution called the binomial distribution.
A coin-tossing experiment is a simple example of an important discrete random
variable called the binomial random variable.
6|Page
ENGINEERING DATA ANALYSIS
k = 0, 1, 2, ..., n.
The random variable x, the number of successes in n trials, has a probability
distribution with this center and spread:
Mean : μ = np
Variance : σ2 = npq
Standard deviation: σ = √
Example 3: Over a long period of time it has observed that a given rifleman can hit a
target on a single trial with probability of 0.8. Suppose he fires four shots at the
target.1) What is the probability that he will hit the target exactly two times? 2) What
is the probability that he will hit the target at least once?
Solution: A ―trial‖ is a single shot at the target, and you can define a ―success‖ as a
hit and a ―failure‖ as a miss, so that n = 4 and p = 0.8. If you assume that the
rifleman’s chance of hitting the target does not change from shot to shot, then the
number x of times he hits the target is a binomial random variable.
Let:
A ―trial‖ is a single shot at the target, ―success‖ as a hit ; ―failure‖ as a miss,
n = 4 and p = 0.8.
If you assume that the rifleman’s chance of hitting the target does not change
from shot to shot, then the number x of times he hits the target is a binomial random
variable.
P(x = 2)
• p(2) = C42 (0.8)2 (0.2)4 – 2 = 0.1536 the probability that he will hit the target
exactly two time.
• P(at least 1) = P(x ≥ 1) = p(1) + p(2) + p(3) + p(x=4)
= 1 – p(0)
= 1 – C40 (0.8)0 (0.2)4 = 0.9984
Example 4: Given that 6 fair coins are flipped once, what is the probability of getting
a) exactly 6 heads?
b) 4, 5, or 6 heads?
c) less than 2 heads?
b) ∑ where k = 4, 5, 6
P (4 x ) = 6 C4 + 6C5
+ 6 C6
= 0.3438
c) ∑ where k = 0, 1
p(0 x ) = 6 C0 + 6 C1
= 0.1094
7|Page
ENGINEERING DATA ANALYSIS
If we let each trial to have more than 2 possible outcomes, then the binomial
experiment becomes a multinomial experiment. An example is the drawing of a card
from a deck with replacement if one is interested in the four suits as outcomes.
∑ and ∑ =1
Example 5: The probabilities that a delegate to a convention arrived by air, ferry, bus,
or taxi are 0.5, 0.2, 0.2, and 0. 1, respectively. What is the probability that among the
ten delegates randomly selected at this convention, 4 arrived by air, 3 arrived by ferry,
2 arrived by bus and 1 arrived by taxi.
Solution:
f( 0.5, 0.2, 0.2, 0.1, 10) =
= = 0.0252
Recall that in the binomial experiment the random variable X represents the
number of successes obtained in a series of n independent and identical Bernoulli
trials; the number of trial is fixed and the number of successes will vary from
experiment to experiment.
The negative binomial random variable represents the number of trials needed
to obtain exactly k successes.; here, the number of successes is fixed and the number
of trials will vary from experiment to experiment.
8|Page
ENGINEERING DATA ANALYSIS
Solution:
Let ―success‖ – obtaining a defective box
p = 0.1, k = 3
The probability that x = 15 is given by
f(15) = = 0.0257
Note that if we consider the special case of the negative binomial distribution
for which k = 1, the negative binomial reduces to the form
f(x) = b*(x; 1, p) =
f(x) = g(x; p) =
where q = 1 – p; 0 , p , 1; x = 1, 2, 3,. . .
The mean of a geometric random variable is
9|Page
ENGINEERING DATA ANALYSIS
Solution.
x = 3 , p = 0.1
g(3; 0.1) =
Example 8: Thirteen cards are drawn from a deck of ordinary playing cards. Find the
probability of selecting 8 red cards and 5 black cards.
Solution:
N = 52; n = 13; x = 8;
N – k = 26; n – x = 13- 8 = 5
* +[ ]
f(x) = h(x; N, n, k) =
[ ]
10 | P a g e
ENGINEERING DATA ANALYSIS
* +* +
f(x) = h(8; 52, 13, 26) = = 0.1618
* +
Example 9: A company ships his products in boxes of size 20. Since no
manufacturing process is perfect, defective items are inevitable. The buyer, before
accepting a box, selected and tested three items. Suppose that a given box actually
contains five defective items sampled.
a. Find the probability distribution function of the hypergeometric random
variable X.
b. What is the probability that exactly 2 defective are found in the sample?
c. Find the expected number of defective items in the sample.
d. Find the variance for the random variable X.
Solution:
a) The probability distribution function for X:
* +[ ]
f(x) = h(x; N, n, k) =
* +
where x = 0, 1, 2, 3
b) If x = 2, then
f(2) = h(x; N, n, k)
= h(2; 20, 3, 5)
= = 0.1316
c) The expected number of defective items in a sample of size 3 is
E(X) = = =
d) The variance for X, Var(X) = 3 ( ) ( ) ( ) =
Poisson random variables usually arise in connection with what are called
Poisson Processes. Such process is named after the French mathematician Simeon
Denis Poisson (1871-1840).
Examples of experiments for which the random variable x can be modeled by the
Poisson random variable.
11 | P a g e
ENGINEERING DATA ANALYSIS
In each example, let x represents the number of events that occur in a period of
time or space during which an average of μ such events can be expected to occur.
Let μ be the average number of times that an event occurs in a certain period of time
or space. The probability of k occurrence of this event is
The mean and standard deviation of the Poisson random variable x are
Mean: μ
Standard deviation: σ = √
Solution:
Application: Exercise #2
Solve the following problems.
1. Let x be a binomial random variable with n = 7, p= 0.3. Find these values:
a) P(x = 4) b) P(x 1) c) P(x > 1
d) μ = np e) σ = √
12 | P a g e
ENGINEERING DATA ANALYSIS
a. What is the probability that at most five cases of E.coli per 100,000 are
reported in California this year?
b. What is the probability that more than five cases of E. coli are reported in
California this year?
c. Approximately 95% of occurrences of E. coli involve at most how many
cases?
13 | P a g e
ENGINEERING DATA ANALYSIS
Closure
Congratulations! You have successfully completed the tasks and activities for
Lesson 2. It is expected that you have gained insights about discreet probability
distribution.
Now if you are ready, please proceed to Lesson 3 of this module which will
introduce you to another probability distribution, the continuous probability
distribution.
14 | P a g e
ENGINEERING DATA ANALYSIS
References
Peck, R., Olsen, C. and Devore, J.L. (2012): Introduction to Statistics and Data
Analysis(4th edition). Brooks/Cole/Cengage Learning, 20 Channel
Center Street Boston, MA 02210, USA
Walpole, RE, & Myers, RH.(1993). Probability and Statistics for Engineers and (5th
ed.). Macmillan Publishing Company, New York.
15 | P a g e