Week 7 STUDY GUIDE Statistics Probability 1
Week 7 STUDY GUIDE Statistics Probability 1
A.Y. 2021-2022
MODULE 06 | 4th QUARTER, WEEK 07
LESSON 06: Random Sampling and the Central Limit Theorem
What I Know?
In the previous lessons, you learned how to convert a normal random variable to a standard normal
random variable. You were also able to calculate probabilities, percentile and areas using the
standard normal tables and solve problems involving normal curve.
In the 3rd Quarter Performance Task, you were able to gather and collect data from an individual in a
sample of the study interest. Gathering and collecting data from every individual in a population is
tedious, costly. For this reason, it would be more practical to collect data from a subset of the
population. But how do you know if the sample truly represents the whole population? This can be
ensured by using methods of random sampling. This week’s study guide will help you understand in
Random Sampling and how the Central Limit Theorem plays an important role in sampling
distribution. Also, real-life situations are provided here to deepen your understanding on the said
topic.
Learning continues despite the circumstances that we’re facing. Continue to be more so you can do
more. You can do it! Good luck, stay safe, and enjoy learning.
1|Page
Statistics & Probability
A.Y. 2021-2022
What is It?
This module is divided into the two parts the following are:
I. Random Sampling
II. The Central Limit Theorem.
I. Random Sampling
In order to understand a random sampling let us first recall some important concepts.
1. Population is the number of observations in a population is called the population size (N).
It is not generally limited to groups of people; they may also refer to objects, living or non-living
things.
2. Sample is a subset or a smaller group of the population.
3. Parameter describes an entire population. Population Parameter refers to any numerical
value or measurement describing a characteristics of a population.
μ population mean
σ population standard deviation
4. Statistic describes only the sample. Sample Statistic refers to any numerical value or
measurement describing a characteristics of a sample.
x̄ sample mean
s sample standard deviation
Example:
1. Suppose you want to know the mean income of teachers in Makati City. The mean here is a
parameter of a population. But if you draw a random sample of 100 teachers and determine
their mean income, the mean you get here is a statistic of a sample. If you get a mean income
₱17 500 from the sample, you conclude that the population mean income 𝜇 is likely to be close
to ₱17 500 as well. This is an example of a statistical inference.
2|Page
Statistics & Probability
A.Y. 2021-2022
e. If 800 of the 1,200 test-takers said they have eaten breakfast before taking the test.
- Parameter
In sampling, the first task is to identify and define precisely the population of interest or the target
population.Then, we select a sample by an appropriate method/procedure that ensures the sample
represents the general traits or characteristics of the large group. In random samples, the criteria in
choosing a random sample is that all the members of the population have an equal and independent
chance of being selected into the sample.
Let us now define and discover the following terms and concepts:
1. Sampling is the method of gathering information about a population. However, we cannot
expect the sample to yield accurate information about the population. We should expect a
certain amount of error from using the sample. This error is called sampling error.
2. Sampling Frame is a list of members of the population.
3. The sample size is usually pre-determined by a formula or table. For now, our concern is how
to choose a random sample based on a pre-determined sample size.
4. Sampling with replacement means that once a person/object in the population is selected to
be in the sample, that person/object is placed back to be possibly selected again.
5. Sampling without replacement means that once a person/object, that person/object will not
be selected again.
6. Random Sampling/ Probability Sampling is a process whose members had an equal
of being selected from the population
7. Non-random Sampling/Non-probability Sampling is a sampling procedure where samples
are selected in a deliberate manner with little or no attention to randomization.
3|Page
Statistics & Probability
A.Y. 2021-2022
8. Simple Random Sampling is a process of selecting n sample size in the population until the
desired number of subjects or through lottery.
Example, in an organization of 200 employees, if the HR team decides on conducting
team building activities, it is highly likely that they would prefer picking chits out of a
bowl.
In this case, each of the 200 employees has an equal opportunity of being selected.
9. Systematic Sampling is a process of selecting kth element in the population until desired
number of subjects or respondents are attained.
Example, a researcher intends to collect a systematic sample of 500 people in a
population of 5000. He/ she numbers each element of the population from 1-5000 and
will choose every 10th individual to be a part of the sample (Total population/ Sample
size = 5000/ 500 = 10)
10. Stratified Sampling is a process of subdividing the population into subgroups or strata and
drawing members at random from each subgroup or stratum.
Example, the company has 800 female employees and 200 male employees. You want
to ensure that the sample reflects the gender balance of the company, so you sort the
population into two strata based on gender. Then you use random sampling on each
group, selecting 80 women and 20 men, which gives you a representative sample of
100 people.
11. Cluster Sampling is a process of selecting clusters from a population which is very large or
wide geographical area.
Example, the company has offices in 10 cities across the country all with roughly the
same number of employees in similar roles. You don’t have the capacity to travel to
every office to collect your data, so you use random sampling to select 3 offices —
these are your clusters.
4|Page
Statistics & Probability
A.Y. 2021-2022
12. Convenience Sampling is a simple procedure where sample are selecting a group of
individuals who (conveniently) are available for study.
Example, startups and NGOs usually conduct convenience sampling at a mall to
distribute leaflets of upcoming events or promotion of a cause — they do that by
standing at the mall entrance and giving out pamphlets randomly.
13. Purposive /Judgement Sampling is a technique when the researcher relies on his/her
personal/sound judgement in choosing to participate in the study or the sample selected is
based on the opinion of the expert.
Example, when researchers want to understand the thought process of people
interested in studying for their master’s degree. The selection criteria will be: “Are you
interested in doing your masters in …?” and those who respond with a “No” are
excluded from the sample.
14. Quota Sampling is applied when an investigator survey collects information from an assigned
number, or quota of individuals from one of several sample units fulfilling certain prescribed
criteria or belonging to one stratum.
Example, an interviewer might be told to go out and select 20 adult men, 20 adult
women, 10 teenage girls and 10 teenage boys so that they could interview them about
their television viewing.
15. Snowball Sampling is a technique in which one or more members of a population are located
and used to lead the researchers to the other members of the population.
Example, an interviewer might be told to go out and select 20 adult men, 20 adult
women, 10 teenage girls and 10 teenage boys so that they could interview them about
their television viewing.
5|Page
Statistics & Probability
A.Y. 2021-2022
16. Voluntary Sampling is a technique when sample are composed of respondents who are self-
select into the study/survey. Most of the times samples have strong interest in the topic or
study.
Example, a news show asking their viewers to participate in an on-line poll. The
samples are viewers who have chosen themselves and not the survey administrator.
Hope it is now clear to you the terms and concepts in illustrating the Random Sampling. Also, you
may click on the web page below for a simple discussion on the sampling techniques discussed as
well as other sampling techniques not discussed here is available:
https://towardsdatascience.com/sampling-techniques-a4e34111d808
If random samples of size n are drawn from a population, as n becomes larger, then the
sampling distribution of the mean approaches the normal distribution with
and standard deviation
6|Page
Statistics & Probability
A.Y. 2021-2022
or
Example: The average time it takes a group of college students to complete a certain
examination is 46.2 minutes. The standard deviation is 8 minutes. Assume that the
variable is normally distributed. If 50 randomly selected students take the examination,
what is the probability that the mean time it takes the group to complete the test will be
less than 43 minutes?
7|Page
Statistics & Probability
A.Y. 2021-2022
So, the probability that the mean time it takes the 50 randomly selected college
students will complete the test in less than 43min is 0.0023 or 0.23%.
2. Mean of the Sampling Distribution of the Sample Mean is the mean of the sampling
distribution of the sample mean is equal to the population mean.
Example: The mean of a set of data is μ=20. It follows that the mean of the sampling
distribution of the sample mean is also μ X =20.
3. Standard Deviation of the Sampling Distribution of the Sample Mean is if the population is
finite, the standard deviation of the sampling distribution of the sample mean is given by
σ X= ⋅
σ
√ N−n
√ n N−1
, where σ is the population standard deviation, n is the sample size, and N is
σ
the population size. If the population is infinite, the standard deviation is σ X = .
√n
Example 1: a population is considered finite if the sample drawn is at least 5% of the
population.
From a population of size N=500 , a sample of size n=30 is drawn. The population standard
deviation is σ =12.5. Since
n=30 is more than 5% of the population N=500 , the population is considered finite. So, we will
use the formula
σ
σ X= ⋅
√
N−n
√ n N−1
.
8|Page
Statistics & Probability
A.Y. 2021-2022
σ X ∧¿
σ
⋅
√ n N−1√
N−n
¿= ⋅
√
12.5 500−30
√30 500−1
¿ ≈ 2.21
Therefore, the standard deviation of the sampling distribution of the sample mean is σ X =2.21 .
Example 2: A population has a mean of 220 and a standard deviation of 30 . Using a sample
size of 150, what is the mean and the standard deviation of the sampling distribution of the
sample mean?
Solution:
1. List the given information.
The population mean is μ=220.
The population standard deviation is σ =30 .
The sample size is n=150.
2. Since the sample size is greater than 30, by the Central Limit Theorem, the sampling
distribution of the sample mean is normally distributed. It follows that the mean of the
sampling distribution of the sample mean is equal to the population mean.
μ X ∧¿ μ
¿=220
3. Compute for the standard deviation of the sampling distribution of the sample mean.
σ
σ X ∧¿
√n
30 Therefore, μ X =220 and σ X ≈ 2.45.
¿=
√150
¿ ≈ 2.45
9|Page
Statistics & Probability
A.Y. 2021-2022
What’s More?
Now, it’s your time to learn statistics by doing statistics! Try answering on your own the activities
below.
10 | P a g e
Note: Try answering the activity on your own and check your answers. The answer key in this activity
is provided in the last page of this material. Statistics & Probability
A.Y. 2021-2022
11 | P a g e
Statistics & Probability
A.Y. 2021-2022
Assessment
Let’s see how far you’ve learned! Answer the following activities to assess your
understanding in the topic.
12 | P a g e
Statistics & Probability
A.Y. 2021-2022
6. The school guidance counselor is interested in the mean IQ scores of all entering Senior High
School applicants in their school. From the recorded IQ scores of all applicants she obtained a
mean IQ of 105. Determine whether the situation describes a parameter or statistic.
a. Parameter
b. Statistic
7. There are 2,140 marathon runners who signed up, and 100% had a drink of at least 3 bottles
of water along the course. Determine whether the situation describes a parameter or statistic.
a. Parameter
b. Statistic
8. Suppose a population has mean 80 and standard deviation 10. Then, we get a sample of 90
cases. What is the probability that this sample will have a mean between 78 and 82?
a. 94.256%
b. 94.346%
c. 95.390%
d. 95.452%
9. In a study of life expectancy of 400 people in a certain geographic region, the mean age at
death was 70 years, and the standard deviation was 5.1 years. If a sample of 50 people from
this region is selected, what is the probability that the mean life expectancy will be less than 68
years?
a. 0.189%
b. 0.250%
c. 0.280%
d. 0.294%
10. The average public high school has 468 students with a standard deviation of 87. If a random
sample of 38 public schools is selected, what is the probability that the number of students
enrolled is between 445 and 485?
a. 68.345%
b. 79.672%
c. 80.451%
d. 83.338%
13 | P a g e
Statistics & Probability
A.Y. 2021-2022
References
- https://link.quipper.com/en/organizations/5f22837b28943218902a973c/curriculum#curriculum
14 | P a g e
Statistics & Probability
A.Y. 2021-2022
WHAT’S MORE: ANSWER KEY
I. Determine whether the following statements are true or false.
1. The group surveyed when the survey is not administered to all members of the population is called
a sample. TRUE
2. Sampling can be done when it would be impractical to survey the entire population. TRUE
3. The choice of sampling technique is dependent on the research questions and objectives and the
achievability of gaining access to the data. TRUE
4. Simple random sampling involves creating subgroups of the entire population, usually based on the
geographic location. FALSE
5. Cluster sampling uses strata in determining samples. FALSE
2. A teacher divides her classroom into juniors and seniors. In each group, she divides her students
into boys and girls. From each of these four new groups, she picks one student randomly. What type
of sampling method does she use?
a. cluster sampling
b. multistage sampling
c. stratified random sampling
d. systematic random sampling
3. Cluster sampling, stratified sampling, and systematic sampling are types of what kind of sampling
method?
a. direct sampling
b. indirect sampling
c. random sampling
d. non random sampling
4. This is a type of sampling where each element of the population has an equal chance of being
included in the sample.
a. irregular sampling
b. error free sampling
c. cluster sampling
d. simple random sampling
5. A population has mean µ = 40 and standard deviation σ = 15. What is the probability that a random
sample of size 36 will have a mean greater than 47?
15 | P a g e
Statistics & Probability
A.Y. 2021-2022
Solution:
STEP 1. Identify the given information.
µ = 40
σ = 15
X = 47
n = 10
STEP 3. Standardize the sample mean 47. (Solve for the z-score of 47)
X −μ
z=
σ
√n
47−40
z=
15
√10
z=2.8
STEP 4. Determine the area to the left of the z- score z = 2.8
16 | P a g e