Week 7
Week 7
Variables 2
Distribution of Sample Means
Distribution of Sample Means
• A sampling distribution of sample means is a
distribution obtained by using the means computed
from random samples of a specific size taken from a
population.
• Sampling error is the difference between the
sample measure and the corresponding population
measure due to the fact that the sample is not a
perfect representation of the population.
Properties of the Distribution of
Sample Means
• The mean of the sample means will be the same as
the population mean.
• The standard deviation of the sample means will be
smaller than the standard deviation of the
population, and will be equal to the population
standard deviation divided by the square root of
the sample size.
The Central Limit Theorem
• As the sample size n increases, the shape of the
distribution of the sample means taken with
replacement from a population with mean and
standard deviation will approach a normal
distribution.
• The mean of the sample means equals the
population mean. X = .
• The standard deviation of the sample means is
called the standard error of the mean.
X = n.
The Central Limit Theorem
• The central limit theorem can be used to answer
questions about sample means in the same manner
that the normal distribution can be used to answer
questions about individual values.
• A new formula must be used for the z values:
X − X X −
z= =
X n
Example: Hours of Television
• A. C. Neilsen reported that children between the
ages of 2 and 5 watch an average of 25 hours of
television per week. Assume the variable is
normally distributed and the standard deviation is 3
hours. If 20 children between the ages of 2 and 5
are randomly selected, find the probability that the
mean of the number of hours they watch television
will be greater than 26.3 hours.
Example 6-13: Hours of Television
X − 26.3 − 25
z= = = 1.94
n 3 20
90 − 96 100 − 96
z= = −2.25 z= = 1.50
16 36 16 36
X − 224 − 218.4
z= = = 0.22
25
• The area between 0 and z = 0.22 is 0.08771 and below zero is 0.5 giving
us 0.5871.
• Hence, the probability of selecting an individual who consumes less
than 224 pounds of meat per year is 0.5871, or 58.71%.
Example: Meat Consumption
• The average number of pounds of meat that a
person consumes per year is 218.4 pounds.
Assume that the standard deviation is 25 pounds
and the distribution is approximately normal.
• b. If a sample of 40 individuals is selected, find the
probability the sample will be less than 224 pounds per
year.
Example 6-15: Meat Consumption
X − 224 − 218.4
z= = = 1.42
n 25 40
• The area between zero and z = 1.42 is 0.4222. The area to the left of
zero (mean) is 0.5.
• So the area to the left of z is 0.9222.
• Hence, the probability that the mean of a sample of 40 individuals is
less than 224 pounds per year is 0.9222, or 92.22%.
The Normal Approximation to
the Binomial Distribution
The Normal Approximation to the
Binomial Distribution
• A normal distribution is often used to solve
problems that involve the binomial distribution
since when n is large (say, 100), the calculations are
too difficult to do by hand using the binomial
distribution.
The Normal Approximation to the
Binomial Distribution
• The normal approximation to the binomial is
appropriate when np > 5 and nq > 5 .
• In addition, a correction for continuity may be used
in the normal approximation to the binomial.
• The continuity correction means that for any
specific value of X, say 8, the boundaries of X in the
binomial distribution (in this case, 7.5 to 8.5) must
be used.
The Normal Approximation to the
Binomial Distribution
• Binomial
• When finding: Normal
• P(X = a) Use: P(a – 0.5 < X < a + 0.5)
• P(X a) P(X > a – 0.5)
• P(X > a) P(X > a + 0.5)
• P(X a) P(X < a + 0.5)
• P(X < a) P(X < a – 0.5)
For all cases, = np, = npq , np 5, nq 5
The Normal Approximation to the
Binomial Distribution
• Procedure Table
• Step 1: Check to see whether the normal approximation
can be used.
• Step 2: Find the mean µ and the standard deviation .
• Step 3: Write the problem in probability notation, using
X.
• Step 4: Rewrite the problem by using the continuity
correction factor, and show the corresponding area
under the normal distribution.
• Step 5: Find the corresponding z values.
• Step 6: Find the solution.
Example: Reading While Driving
• A magazine reported that 6% of American drivers read
the newspaper while driving. If 300 drivers are selected
at random, find the probability that exactly 25 say they
read the newspaper while driving.
𝑎𝑛𝑑 0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
• The mean and the standard deviation are:
𝑎+b b−𝑎
μ𝑥 = and 𝜎𝑥 =
2 12
Shape of the Uniform distribution
• Because of its shape the uniform distribution is
sometimes called the rectangular distribution
0.25
0.2
f(x)
0.15
0.1
0.05
0
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
x
1 1.5
𝑝 𝑥 ≥ 2.5 = 2.5 ≤ 𝑥 ≤ 4 = 4 − 2.5 ∗ = = 0.375
4 4
The probability that a randomly selected patron will spend at least 2.5
minutes in the elevator is 38percent
Example 2 – Unform Distribution
• Suppose that an airline quotes a flight time of 2hours,
10 mins between two cities. Furthermore, suppose that
the historical flight records indicate that the actual
flight time between the two cities, x, is uniformly
distributed between 2 hours and 2 hours, 20 minutes.
Letting the time unit be one minute,
a. write the formula for the probability curve of x
b. Graph the probability curve of x.
c. Find 𝑝(125 ≤ 𝑥 ≤ 135)
d. Find the probability that a randomly selected flight
between the two cities will be at least five minutes late.
Solution to Example 2
• First convert all time periods into minutes
1
a. 𝑓 𝑥 = 20
𝑓𝑜𝑟 120 ≤ 𝑥 ≤ 140
b. Probability Curve
𝒑(𝟏𝟐𝟓 ≤ 𝒙 ≤ 𝟏𝟑𝟓)
𝒑(𝟏𝟑𝟓 ≤ 𝒙 ≤ 𝟏𝟒𝟎)
1 10
𝑝 125 ≤ 𝑥 ≤ 135 = 135 − 125 ∗ = = 0.5
20 20
1 5
𝑝 135 ≤ 𝑥 ≤ 140 = 140 − 135 ∗ = = 0.25
20 20
Exponential distribution
Exponential Distribution
• The exponential distribution is a continuous
distribution that models the time between
randomly occurring events.
• it is often used in applications such as modelling:
• the time between customer arrivals to a service system,
or
• the time to failure of machines, lightbulbs, and other
mechanical or electrical components.
Exponential Distribution
• A key property of the exponential distribution is
that it is memoryless;
• that is, the current time has no effect on future
outcomes.
• Example: the length of time until a machine failure
has the same distribution no matter how long the
machine has been running.
• Similar to the Poisson distribution, the exponential
distribution has one parameter 𝜆.
Exponential Distribution
• The exponential distribution is closely related to
the Poisson;
• if the number of events occurring during an interval of
time has a Poisson distribution, then the time between
events is exponentially distributed.
• If the number of arrivals at a bank is Poisson
distributed, say with mean 12 per hour, then the
time between arrivals is exponential, with mean
1/12 hour, or 5 minutes.
12
• Average number per minute = 0.2
60
60
• Time between arrivals = 5 minutes
12
Exponential Distribution
• If 𝜆 is a positive number, then the equation
describing the exponential distribution is:
𝑓 𝑥 = 𝜆𝑒 −𝜆𝑋 𝑓𝑜𝑟 𝑋 ≥ 0 and 0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
• The mean and standard deviation are: μ𝑥 =
1 1
and 𝜎𝑥 =
𝜆 𝜆
Graph of Exponential Distribution
• For a given value of 𝜆, the exponential distribution is
found by substituting 𝜆 and different values for x into
𝑓 𝑥 = 𝜆𝑒 −𝜆𝑋
𝑝 1 ≤ 𝑥 ≤ 2 = 𝑒 −0.4(1) − 𝑒 −0.4(2)
= 𝑒 −0.4 − 𝑒 −0.8
= 0.6703 − 0.4493
= 0.221
Example – Online Support Chat
• The Haines Services Company provides online chat
support for small business customers who have
purchased accounting software.
• It has been determined that the number of customers
who attempt to connect to the chat service per hour is
Poisson distributed with 𝜆 = 30 per hour.
• At the current staffing level, if customer requests are too
close together—45 seconds (0.75 minute) or less—the
chat connection will fail and the customer will get a
message saying to try again later.
• The managers at Haines are analyzing whether they
should add additional service representatives to
eliminate this problem.
• What is the probability that a customer will fail to
connect?
Solution: Online Support Chat
30
• Average number of calls per minute: 𝜆 = = 0.5
60
• Mean time between attempted connects is;
1 1
= = 2 minutes
𝜆 0.5
• The probability that a customer will fail to connect
means finding 𝑝(𝑥 ≤ 0.75 𝑚𝑖𝑛𝑢𝑡𝑒𝑠):
𝑝 𝑥 ≤ 0.75 = 𝑝 0 ≤ 𝑥 ≤ 0.75
= 𝑒 −0.5(0) − 𝑒 −0.5 0.75
= 1 − 0.6873 = 0.3127
Thus, the probability that a customer will fail to connect
is 31%. Thus a third of all trials will receive the try again
message at the current staffing level.
Thank you