STATPROB - Moodle - LC3 - Random Sapling and Central Limit Theorem
STATPROB - Moodle - LC3 - Random Sapling and Central Limit Theorem
Lecture/Examples
Definition of terms:
A population may refer to an entire group of people, objects, events, or measurements.
A sample refers to a set of observations drawn from a population
Parameters are descriptive measures computed from a population, while descriptive measures
computed from a sample are called statistics.
ab s &
A sampling distribution of sample means is a frequency distribution using the means computed
from all possible random samples of a specific size.
Finite population is one that consists of a finite or fixed number of elements, measurements, or
observations
y
ob tic
Example 1:
Considering a population consisting of 3,4,6,7, and 9. Suppose samples of size 3 are drawn from this
population. Compute for the mean, variance and standard deviation of the sampling distribution of the
a
sample means.
St
A – Samples of size 3
B – Means
C – Sample Means
D – Frequency
• If all possible samples of size n are drawn from a population of size N when mean is 𝜇 and variance
𝜎 # , then the sampling distribution of the sample means has the following properties:
ab s &
The mean of the sampling distribution of the sample means is equal to the population mean 𝜇
That is 𝜇$ = 𝜇.
The variance of the sampling distribution of the sample means is given by:
y
ob tic
() ,-*
𝜎 #$ = ∙ ,-. ; for Finite population
ilit
*
()
𝜎 #$ = ; for Infinite population
Pr tis
• The standard deviation of the sampling distribution of the sample means is just the SQUARE ROOT of
its variance.
a
• It measures the degree of accuracy of the sample mean as an estimate of the population mean.
St
Using the data in Example 1, compute for the Population Mean, Population Variance and Variance of the
Sampling Distribution.
∑ $
• Solution for Population Mean: 𝜇 = ,
3+4+6+7+9
𝜇 = = 5.8
5
∑ ($-9 ))
Solution for Population Variance: 𝜎 # = ,
Therefore, the mean of the sampling distribution is 5.8 and its variance is 0.76
Example 2:
Given:
𝜎=5 𝑛 = 16 𝜇 = 60
Solution:
The problem does not indicate the size of the population; it is assumed that the population is
ab s &
infinite. Therefore,
Mean: 𝜇$ = 60
( <
Standard Deviation: 𝜎$ = = = 1.25
y √* √.=
ob tic
ilit
CENTRAL LIMIT THEOREM
Pr tis
The Central Limit Theorem states that if random samples of size ‘ n ’ are drawn from a population, then
as ‘ n ’ becomes larger, the distribution of the sample means will be approximately normally distributed,
regardless of the shape of the population distribution.
a
$-9
The formula 𝑧 = E is used when computing for the probability that will take on a value within a given
St
√F
range in the sampling distribution of 𝑋.
where:
𝑋 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑚𝑒𝑎𝑛
𝜇 = 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑚𝑒𝑎𝑛
𝜎 = 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
𝑛 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒
Example 3:
The average time it takes a group of college students to complete a certain examination is 46.2 minutes.
The standard deviation is 8 minutes. Assume that the variable is normally distributed. What is the
probability that a randomly selected college student will complete the examination in less than 43
minutes?
Given:
Solution:
43 − 46.2
𝑧= = −0.40
8
The problem is asking for P(X < 43). Therefore,
ab s &
Finally, the probability that a randomly selected college student will complete the examination in less
than 43 minutes is 0.3446 or 34.46%
y
ob tic
ilit
The average time it takes a group of college students to complete a certain examination is 46.2 minutes.
The standard deviation is 8 minutes. Assume that the variable is normally distributed. If 50 randomly
Pr tis
selected college students take the examination, what is the probability that the mean time it takes the
group to complete the test will be less than 43 minutes?
Given:
a
Solution:
$-9
The problem is dealing with data about the sample mean. Thus, the formula 𝑧 = E will be used. The
√F
problem is asking for P(< 43).
𝑋−𝜇 43 − 46.2
𝑧= 𝜎 = = −2.83
8
√𝑛 √50
• The problem is asking for P(X < 43). Therefore,
Example 4:
In the frequency of 30 times of a TV advertisement, the sales of a product has a mean of Php 100,000 a
week and with standard deviation of Php 75,000.
a. What is the probability that the sample mean belongs to the interval of Php 90,000 to
Php 110,000?
b. What is the interval for the mean sale cover of the middle 95% of the distribution of the
sample mean?
Given:
Both problems deal with sample mean. Thus, this formula will be used.
ab s &
$-9
𝑧= E
√F
The problem in (a) is asking for P(Php 90,000 < X < Php 110,000)
y
ob tic
𝑋. − 𝜇 𝑋# − 𝜇
𝑃(𝑃ℎ𝑝 90,000 < 𝑥 < 𝑃ℎ𝑝 110,000) = 𝑃( 𝜎 < 𝑧 < 𝜎 )
ilit
√𝑛 √𝑛
$\ -9 $) -9
So, 𝑃(𝑃ℎ𝑝 90,000 < 𝑥 < 𝑃ℎ𝑝 110,000) = 𝑃( <𝑧< )
Pr tis
E E
√F √F
= 0.2673 + 0.2673
= 0.5346 𝑜𝑟 53.46%
St
Therefore, the probability of 53.46% represents the sample mean that belongs to the interval of Php
90,000 to Php 110,000.
(b)
Since the interval covers the middle 95% of the distribution, 47.5% of the distribution is required from
both sides from the mean. The z-scores that represent 47.5% of the distribution is ±1.96.
Solving for 𝑋,
𝜎
𝑋 = 𝑧 _ ` + 𝜇
√𝑛
So, at z = - 1.96
𝑃ℎ𝑝 75,000
𝑋 = −1.96 _ ` + 100,00 = 𝑃ℎ𝑝 73,161.59
√30
• At z = 1.96
𝑃ℎ𝑝 75,000
𝑋 = 1.96 _ ` + 100,00 = 𝑃ℎ𝑝 126,838.41
√30
Therefore, 95% of the distribution sample mean is represented by the interval 𝑃ℎ𝑝 73,161.59 <
𝑋 < 𝑃ℎ𝑝 126,838.41. ab s &
TRY THIS
1. A population consists of five numbers: 85, 86, 87, 88, and 89. Consider samples of size 3, find
the sample mean, sample mean frequency, sample mean probability, and describe the sampling
distribution by computing its mean, variance and standard deviation.
y
ob tic
2. The average production of all machines in a fabrication is 15 finished materials in one hour with
a
a standard deviation of 3 materials per hour. What is the probability that a sample of 5
machines will produce more than 17 materials per hour? 6.81%
St
3. In a study of the life expectancy of 400 people in a certain geographic region, the mean age at
death is 70 years and the standard deviation is 5.1 years. If a sample of 50 people from this
region was selected, what is the probability that the mean life expectancy of the sample will be
less than 68 years? .28%
4. A bottling company uses a filling machine to fill plastic bottles with a popular cola. The bottles
are supposed to contain 300 milliliters (ml). In fact, the contents vary according to a normal
distribution with mean of 303 ml and standard deviation of 3 ml.
(a) Now take a random sample of 10 bottles. What are the mean and standard deviation of the
sample mean contents of these 10 bottles?
(b) What is the probability that an individual bottle contains less than 300 ml? 15.87%
(c) What is the probability that the sample mean contents of the 10 bottles is less than 300 ml?
0.01%
5. In a study done on the life expectancy of 500 people in a certain geographic region, the mean
age at death was 72 years and the standard deviation was 5.3 years.
(b) If a sample of 50 people from this region is selected, and the probability that the mean life
expectancy will be less than 70 years? 0.38%
(c) What are the mean and standard deviation of the sample mean length of life expectancy if you
get 20 random samples?
Baccay, E., Belecina, R., & Mateo, E. (2016). Statistics and Probability. Sampaloc, Manila, Philippines: Rex
Book Store, Inc.
Random
ab s &
Variable. Retrieved
https://owl.english.purdue.edu/owl/resource/560/10/.
October 25, 2016 from
y
ob tic
ilit
Pr tis
a
St