MATH+270 Chapter+8
MATH+270 Chapter+8
Sampling Distributions
Sample Statistic x s2 s p̂
• If the information (data) collected is from the entire population, we can use descriptive statistics to draw
conclusions about the population being studied and the statistical process is over.
• However, it is often difficult or impossible to gain access to populations. The sample data are used to make
inferences about the population. For example, we might compute a sample mean x from the data and use
this information to draw conclusions regarding the population mean .
• In this chapter, we continue our discussion of probability distributions where statistics, such as x , will be
the random variable. Statistics are random variables because the value of a statistic varies from sample to
sample. For this reason, statistics have probability distributions associated with them. For example, there
is a probability distribution for the sample mean x , sample proportion p̂ , and so on. We use probability
distributions to make probability statements regarding the statistic.
• Our goal in this chapter is to describe the distribution of a statistic such as x and p̂ . Remember, when we
describe a distribution, we do so in terms of its shape, center (mean), and spread (standard deviation).
1
(1) Sampling Distribution of the Sample Mean, x
2
The Mean and Standard Deviation of the Sampling Distribution of x
Suppose that a simple random sample of size n is drawn from a population with mean and standard deviation
. The sampling distribution of x has mean x = and standard deviation x = . The standard deviation
n
of the sampling distribution of x , x , is called the standard error of the mean.
Summary
Suppose a random sample of size n is drown from a population with mean 𝜇 and standard deviation 𝜎. Then, we
can describe the distribution of the sample mean based on the following two situations:
(a) If the population distribution is Normal:
x will be normal for any sample size n with mean x = and standard deviation x = .
n
(b) If the population distribution is Nonnormal:
(i) When the sample size n is sufficiently large (n 30):
x will be approximately normal with mean x = and standard deviation x = .
n
(ii) When the sample size n is small (n 30):
Although it is still true that x = and x = , unfortunately there is no general result about the shape
n
of the distribution of x (usually x has similar distribution shape as the population).
Example1. A simple random sample of size n = 36 is obtained from a population with = 89 and = 18.
3
Example 2. Answer the following questions for the sampling distribution of the sample mean shown to the
right.
(a) What is the value of x ? Answer:
Answer:
(c) If the sample size is n = 16, what is likely true about the shape of the population?
Answer: Look at the graph to the right. The shape of x is normal even the sample size is small (n = 16 30).
Therefore, the population that we were sampling from must be normal.
(d) If the sample size is n = 16, what is the standard deviation of the population from which the sample was
drawn?
Answer:
Example 3. The upper leg of 20 to 29 year old males is normally distributed with a mean length of 43.7cm and
a standard deviation of 4.2cm.
= 43.7 and = 4.2
(a) What is the probability that a randomly selected 20- to 29-year old male has an upper leg length that is less
than 40 cm?
Answer:
(b) A random sample of 9 males who are 20 to 29 years old is obtained. What is the probability that the mean
upper leg length is less than 40 cm?
Answer:
(c) What is the probability that a random sample of 12 males who are 20–29 years old results in a mean upper
leg length that is less than 40 cm?
Answer:
(d) What effect does increasing the sample size have on the probability? Provide an explanation for this result.
4
Answer: Increasing the sample size n ______________ the probability that x 40 . This is because the
standard deviation of the sample mean x = _______________ as the sample size increases.
n
(e) A random sample of 15 males who are 20 to 29 years old results in a mean upper leg length greater than 46
cm. Do you find the result unusual? Why?
Answer: P( x 46) = 0.0169. Since 0.0169 is smaller than 0.05, finding a random sample of 15 males who are
20 to 29 years old results in a mean upper leg length greater than 46 cm is unusual.
Of course, gaining access to all the individuals in a population is rare, so we usually obtain estimates of
population parameters such as the population proportion p using the sample proportion p̂ .
Definition: Suppose that a random sample of size n is obtained from a population in which each individual
either does or does not have a certain characteristic. The sample proportion, denoted p̂ (read “p-hat”), is
x
given by pˆ = where x is the number of individuals in the sample with the specified characteristic. The sample
n
p̂
proportion, , is a statistic that estimates the population proportion, p.
Note that x can be thought of as the number of successes in n trials of a binomial experiment.
Example 1. The Harris Poll conducted a survey of 1200 adult Americans who vacation during the summer and
asked whether the individuals plan to work while on summer vacation. Of those surveyed, 552 indicated that
they plan to work while on vacation. Find the sample proportion of individuals surveyed who plan to work
while on summer vacation.
Note that a second survey of 1200 American adults would likely have a different estimate of the proportion
of Americans who plan to work while on summer vacation because different individuals would be in the
sample. Because the value of p̂ varies from sample to sample, it is a random variable and has a probability
distribution (shape, center, and spread).
Figure (a) shows the histogram of 2000 sample proportions p̂ from samples of size n = 10. Notice that the shape
of the distribution is skewed left. The mean of the 2000 sample proportions is 0.76 and the standard deviation is
0.136. Notice that the mean of the sample proportions equals the population proportion.
Figure (b) shows the histogram for 2000 sample proportions p̂ from samples of size n = 25. Notice that the
histogram is slightly skewed left (although not as skewed as the histogram with n = 10). The mean of the 2000
sample proportions for a sample of size n = 25 is 0.76 and the standard deviation is 0.086.
Figure (c) shows the histogram for 2000 sample proportions p̂ from samples of size n = 60. Notice that the
histogram is approximately normal. The mean of the 2000 sample proportions is 0.76 and the standard deviation
is 0.054.
5
Notice the following points regarding the sampling distribution of the sample proportion:
• Shape: As the size of the sample increases, the shape of the distribution of the sample proportion becomes
approximately normal.
• Center: The mean of the distribution of the sample proportion equals the population proportion, p.
• Spread: The standard deviation of the distribution of the sample proportion decreases as the sample size
increases.
Conditions/Requirements:
1. Simple Random Sampling (SRS): Any sample we take needs to be a simple random sample. Often, we’ll be
told in the problem that sampling was random.
2. The normality condition: np(1 – p) ≥10.
3. The independence condition: the sample size n is no more than 5% of the population size N.
Example 2. Based on a study conducted by the Gallup organization, 76% of Americans believe that the state of
moral values in the United States is getting worse. Suppose we obtain a simple random sample of n = 60
Americans and determine which believe that the state of the moral values in the United States is getting worse.
Describe the sampling distribution of the sample proportion for Americans with this belief.
Answer:
6
Example 3. A simple random sample of size n = 75 is obtained from a population whose size is N = 10,000 and
whose population proportion with a specified characteristic is p = 0.8.
(a) Describe the sampling distribution of p̂ .
Answer:
✓ Simple Random Sample (SRS). Shape: p̂ is approximately normal
✓ np(1 − p) = 75(0.8)(1 − 0.8) = 12 10
✓ n / N = 75 /10000 = 0.0075 0.05 Center: pˆ = p = 0.8 .
p(1 − p) 0.8(1 − 0.8)
Spread: pˆ = = = 0.046.
n 75
(b) What is the probability of obtaining x = 63 or more individuals with the characteristic?
Answer:
(c) What is the probability of obtaining x = 51 or fewer individuals with the characteristic?
Answer:
Answer:
(b) What is the probability that in a random sample of 500 adults more than 30% do not own a credit card?
Answer:
(c) What is the probability that in a random sample of 500 adults between 25% and 30% do not own a credit
card?
Answer:
(d) Would it be unusual for a random sample of 500 adults to result in 125 or fewer who do not own a credit
card? Why?
Answer: