100% found this document useful (1 vote)
56 views

Week 7 STUDY GUIDE Statistics Probability 1

This document provides an overview of random sampling and the central limit theorem. It discusses key concepts related to random sampling such as population, sample, parameter, statistic, and different sampling methods including simple random sampling, systematic sampling, stratified sampling, cluster sampling, and others. It explains that random sampling ensures samples are representative of the overall population. The document also indicates that the central limit theorem plays an important role in sampling distributions.

Uploaded by

Hannah Martinez
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
56 views

Week 7 STUDY GUIDE Statistics Probability 1

This document provides an overview of random sampling and the central limit theorem. It discusses key concepts related to random sampling such as population, sample, parameter, statistic, and different sampling methods including simple random sampling, systematic sampling, stratified sampling, cluster sampling, and others. It explains that random sampling ensures samples are representative of the overall population. The document also indicates that the central limit theorem plays an important role in sampling distributions.

Uploaded by

Hannah Martinez
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 16

Statistics & Probability

A.Y. 2021-2022
MODULE 06 | 4th QUARTER, WEEK 07
LESSON 06: Random Sampling and the Central Limit Theorem

What I Know?

In the previous lessons, you learned how to convert a normal random variable to a standard normal
random variable. You were also able to calculate probabilities, percentile and areas using the
standard normal tables and solve problems involving normal curve.

What I Need to Know?

In the 3rd Quarter Performance Task, you were able to gather and collect data from an individual in a
sample of the study interest. Gathering and collecting data from every individual in a population is
tedious, costly. For this reason, it would be more practical to collect data from a subset of the
population. But how do you know if the sample truly represents the whole population? This can be
ensured by using methods of random sampling. This week’s study guide will help you understand in
Random Sampling and how the Central Limit Theorem plays an important role in sampling
distribution. Also, real-life situations are provided here to deepen your understanding on the said
topic.

The module is about:


o Random Sampling and the Central Limit Theorem

After going through this module, you are expected to:


a. illustrate random sampling;
b. distinguishes between parameter and statistics; and
c. illustrate the Central Limit Theorem.

Learning continues despite the circumstances that we’re facing. Continue to be more so you can do
more. You can do it! Good luck, stay safe, and enjoy learning.

1|Page
Statistics & Probability
A.Y. 2021-2022

What is It?

This module is divided into the two parts the following are:
I. Random Sampling
II. The Central Limit Theorem.

I. Random Sampling
In order to understand a random sampling let us first recall some important concepts.
1. Population is the number of observations in a population is called the population size (N).
It is not generally limited to groups of people; they may also refer to objects, living or non-living
things.
2. Sample is a subset or a smaller group of the population.
3. Parameter describes an entire population. Population Parameter refers to any numerical
value or measurement describing a characteristics of a population.
μ  population mean
σ  population standard deviation

4. Statistic describes only the sample. Sample Statistic refers to any numerical value or
measurement describing a characteristics of a sample.
x̄  sample mean
s  sample standard deviation
Example:
1. Suppose you want to know the mean income of teachers in Makati City. The mean here is a
parameter of a population. But if you draw a random sample of 100 teachers and determine
their mean income, the mean you get here is a statistic of a sample. If you get a mean income
₱17 500 from the sample, you conclude that the population mean income 𝜇 is likely to be close
to ₱17 500 as well. This is an example of a statistical inference.

2. Determine whether the situation describes a parameter or statistic.


a. For the population of 1.5 billion facebook users worldwide, 78% of facebook users use
mobile phones to browse facebook. - Parameter
b. For the FB users in the Philippines, 70% use mobile phones or gadgets to browse
facebook. - Statistics
c. A bunch of lanzones is taken from a big box is 90% sweet. - Statistics
d. All the applicants taking the SHS entrance test were asked how they feel and 75% said
they are nervous. - Parameter

2|Page
Statistics & Probability
A.Y. 2021-2022
e. If 800 of the 1,200 test-takers said they have eaten breakfast before taking the test.
- Parameter

In sampling, the first task is to identify and define precisely the population of interest or the target
population.Then, we select a sample by an appropriate method/procedure that ensures the sample
represents the general traits or characteristics of the large group. In random samples, the criteria in
choosing a random sample is that all the members of the population have an equal and independent
chance of being selected into the sample.

Let us now define and discover the following terms and concepts:
1. Sampling is the method of gathering information about a population. However, we cannot
expect the sample to yield accurate information about the population. We should expect a
certain amount of error from using the sample. This error is called sampling error.
2. Sampling Frame is a list of members of the population.
3. The sample size is usually pre-determined by a formula or table. For now, our concern is how
to choose a random sample based on a pre-determined sample size.
4. Sampling with replacement means that once a person/object in the population is selected to
be in the sample, that person/object is placed back to be possibly selected again.
5. Sampling without replacement means that once a person/object, that person/object will not
be selected again.
6. Random Sampling/ Probability Sampling is a process whose members had an equal
of being selected from the population
7. Non-random Sampling/Non-probability Sampling is a sampling procedure where samples
are selected in a deliberate manner with little or no attention to randomization.

3|Page
Statistics & Probability
A.Y. 2021-2022
8. Simple Random Sampling is a process of selecting n sample size in the population until the
desired number of subjects or through lottery.
Example, in an organization of 200 employees, if the HR team decides on conducting
team building activities, it is highly likely that they would prefer picking chits out of a
bowl.
In this case, each of the 200 employees has an equal opportunity of being selected.

9. Systematic Sampling is a process of selecting kth element in the population until desired
number of subjects or respondents are attained.
Example, a researcher intends to collect a systematic sample of 500 people in a
population of 5000. He/ she numbers each element of the population from 1-5000 and
will choose every 10th individual to be a part of the sample (Total population/ Sample
size = 5000/ 500 = 10)

10. Stratified Sampling is a process of subdividing the population into subgroups or strata and
drawing members at random from each subgroup or stratum.
Example, the company has 800 female employees and 200 male employees. You want
to ensure that the sample reflects the gender balance of the company, so you sort the
population into two strata based on gender. Then you use random sampling on each
group, selecting 80 women and 20 men, which gives you a representative sample of
100 people.

11. Cluster Sampling is a process of selecting clusters from a population which is very large or
wide geographical area.
Example, the company has offices in 10 cities across the country all with roughly the
same number of employees in similar roles. You don’t have the capacity to travel to
every office to collect your data, so you use random sampling to select 3 offices —
these are your clusters.

4|Page
Statistics & Probability
A.Y. 2021-2022

12. Convenience Sampling is a simple procedure where sample are selecting a group of
individuals who (conveniently) are available for study.
Example, startups and NGOs usually conduct convenience sampling at a mall to
distribute leaflets of upcoming events or promotion of a cause — they do that by
standing at the mall entrance and giving out pamphlets randomly.

13. Purposive /Judgement Sampling is a technique when the researcher relies on his/her
personal/sound judgement in choosing to participate in the study or the sample selected is
based on the opinion of the expert.
Example, when researchers want to understand the thought process of people
interested in studying for their master’s degree. The selection criteria will be: “Are you
interested in doing your masters in …?” and those who respond with a “No” are
excluded from the sample.

14. Quota Sampling is applied when an investigator survey collects information from an assigned
number, or quota of individuals from one of several sample units fulfilling certain prescribed
criteria or belonging to one stratum.
Example, an interviewer might be told to go out and select 20 adult men, 20 adult
women, 10 teenage girls and 10 teenage boys so that they could interview them about
their television viewing.

15. Snowball Sampling is a technique in which one or more members of a population are located
and used to lead the researchers to the other members of the population.
Example, an interviewer might be told to go out and select 20 adult men, 20 adult
women, 10 teenage girls and 10 teenage boys so that they could interview them about
their television viewing.

5|Page
Statistics & Probability
A.Y. 2021-2022

16. Voluntary Sampling is a technique when sample are composed of respondents who are self-
select into the study/survey. Most of the times samples have strong interest in the topic or
study.
Example, a news show asking their viewers to participate in an on-line poll. The
samples are viewers who have chosen themselves and not the survey administrator.

Hope it is now clear to you the terms and concepts in illustrating the Random Sampling. Also, you
may click on the web page below for a simple discussion on the sampling techniques discussed as
well as other sampling techniques not discussed here is available:

https://towardsdatascience.com/sampling-techniques-a4e34111d808

II. The Central Limit Theorem.


Let us now define and discover the following terms and concepts:
1. Central Limit Theorem is the sampling distribution of the sample mean of any population
approaches a normal distribution as the sample size increases, regardless of the distribution of
the population. The Central Limit Theorem implies that as long as the sample size n is
sufficiently large, that is, n ≥ 30, the sampling distribution of the sample mean is approximately
normal.
Example, in a given population, a sample of size n=50 is drawn. Since the sample size
is greater than 30, it follows that the sampling distribution of the sample means is
normally distributed according to the Central Limit Theorem.

 If random samples of size n are drawn from a population, as n becomes larger, then the
sampling distribution of the mean approaches the normal distribution with
and standard deviation

6|Page
Statistics & Probability
A.Y. 2021-2022

What happens to the distribution of the sample mean as the n


(sample size) increases?

 CLT basically says that whether data from a population is


normally distributed or not, the distribution of the sample
means is approximately normally distributed as long as the sample size is large enough
(usually at least 30).
 Thus for sampling distribution of the mean, the value of standard normal variable z is given by:

or

Example: The average time it takes a group of college students to complete a certain
examination is 46.2 minutes. The standard deviation is 8 minutes. Assume that the
variable is normally distributed. If 50 randomly selected students take the examination,
what is the probability that the mean time it takes the group to complete the test will be
less than 43 minutes?

7|Page
Statistics & Probability
A.Y. 2021-2022

So, the probability that the mean time it takes the 50 randomly selected college
students will complete the test in less than 43min is 0.0023 or 0.23%.

2. Mean of the Sampling Distribution of the Sample Mean is the mean of the sampling
distribution of the sample mean is equal to the population mean.
Example: The mean of a set of data is μ=20. It follows that the mean of the sampling
distribution of the sample mean is also μ X =20.

3. Standard Deviation of the Sampling Distribution of the Sample Mean is if the population is
finite, the standard deviation of the sampling distribution of the sample mean is given by
σ X= ⋅
σ
√ N−n
√ n N−1
, where σ is the population standard deviation, n is the sample size, and N is
σ
the population size. If the population is infinite, the standard deviation is σ X = .
√n
Example 1: a population is considered finite if the sample drawn is at least 5% of the
population.

From a population of size N=500 , a sample of size n=30 is drawn. The population standard
deviation is σ =12.5. Since
n=30 is more than 5% of the population N=500 , the population is considered finite. So, we will
use the formula
σ
σ X= ⋅

N−n
√ n N−1
.

8|Page
Statistics & Probability
A.Y. 2021-2022

σ X ∧¿
σ

√ n N−1√
N−n

¿= ⋅

12.5 500−30
√30 500−1
¿ ≈ 2.21
Therefore, the standard deviation of the sampling distribution of the sample mean is σ X =2.21 .

Example 2: A population has a mean of 220 and a standard deviation of 30 . Using a sample
size of 150, what is the mean and the standard deviation of the sampling distribution of the
sample mean?
Solution:
1. List the given information.
The population mean is μ=220.
The population standard deviation is σ =30 .
The sample size is n=150.

2. Since the sample size is greater than 30, by the Central Limit Theorem, the sampling
distribution of the sample mean is normally distributed. It follows that the mean of the
sampling distribution of the sample mean is equal to the population mean.
μ X ∧¿ μ
¿=220
3. Compute for the standard deviation of the sampling distribution of the sample mean.
σ
σ X ∧¿
√n
30 Therefore, μ X =220 and σ X ≈ 2.45.
¿=
√150
¿ ≈ 2.45

9|Page
Statistics & Probability
A.Y. 2021-2022

What’s More?

Now, it’s your time to learn statistics by doing statistics! Try answering on your own the activities
below.

I. Determine whether the following statements are true or false.


1. The group surveyed when the survey is not administered to all members of the population is called
a sample.
2. Sampling can be done when it would be impractical to survey the entire population.
3. The choice of sampling technique is dependent on the research questions and objectives and the
achievability of gaining access to the data.
4. Simple random sampling involves creating subgroups of the entire population, usually based on the
geographic location.
5. Cluster sampling uses strata in determining samples.
II. Choose the letter of the best answer.
1. Which of the following employs simple random sampling?
a. picking a house randomly and then picking 5 more on the same street
b. drawing names randomly from a covered box
c. selecting every 10th person that you see
d. picking a random location on a map and then the nearest two houses are chosen
2. A teacher divides her classroom into juniors and seniors. In each group, she divides her students
into boys and girls. From each of these four new groups, she picks one student randomly. What type
of sampling method does she use?
a. cluster sampling
b. multistage sampling
c. stratified random sampling
d. systematic random sampling
3. Cluster sampling, stratified sampling, and systematic sampling are types of what kind of sampling
method?
a. direct sampling
b. indirect sampling
c. random sampling
d. non random sampling
4. This is a type of sampling where each element of the population has an equal chance of being
included in the sample.
a. irregular sampling
b. error free sampling
c. cluster sampling
d. simple random sampling
5. A population has mean µ = 40 and standard deviation σ = 15. What is the probability that a random
sample of size 36 will have a mean greater than 47?

10 | P a g e
Note: Try answering the activity on your own and check your answers. The answer key in this activity
is provided in the last page of this material. Statistics & Probability
A.Y. 2021-2022

11 | P a g e
Statistics & Probability
A.Y. 2021-2022

Assessment

Let’s see how far you’ve learned! Answer the following activities to assess your
understanding in the topic.

ACTIVITY 1. This activity will be answered individually in Quipper.


I. Identify the sampling technique illustrated in the following:
1. Every 8th person boarding a plane is searched thoroughly.
a. Simple Random Sampling
b. Stratified Random Sampling
c. Systematic Random Sampling
d. Cluster Random Sampling
2. A researcher for an airline interviews all of the passengers on ten randomly selected flights.
a. Simple Random Sampling
b. Stratified Random Sampling
c. Systematic Random Sampling
d. Cluster Random Sampling
3. A market researcher randomly selects 100 drivers under 35 years of age and 50 drivers over
35 years of age.
a. Simple Random Sampling
b. Stratified Random Sampling
c. Systematic Random Sampling
d. Cluster Random Sampling
4. The names of 50 contestants are written on 50 cards. The cards are placed in a bag then three
names are picked from the bag.
a. Simple Random Sampling
b. Stratified Random Sampling
c. Systematic Random Sampling
d. Cluster Random Sampling

II. Determine what is asked.


5. A researcher who wanted to determine the benefits of using a new study technique to
Statistics students obtained permission from a school district to select 50 senior high school
students as participants. The researcher selected 50 senior high school students at random.
The researcher selected 25 of them to participate in the new study program. The researcher
gave a training session on traditional study techniques to the other 25 students and asked
them to use these methods.
Who is the target population in this study?
a. Statistics students in the district
b. All students in the district
c. All Statistics students
d. The 25 students who learned the new study techniques

12 | P a g e
Statistics & Probability
A.Y. 2021-2022
6. The school guidance counselor is interested in the mean IQ scores of all entering Senior High
School applicants in their school. From the recorded IQ scores of all applicants she obtained a
mean IQ of 105. Determine whether the situation describes a parameter or statistic.
a. Parameter
b. Statistic
7. There are 2,140 marathon runners who signed up, and 100% had a drink of at least 3 bottles
of water along the course. Determine whether the situation describes a parameter or statistic.
a. Parameter
b. Statistic
8. Suppose a population has mean 80 and standard deviation 10. Then, we get a sample of 90
cases. What is the probability that this sample will have a mean between 78 and 82?
a. 94.256%
b. 94.346%
c. 95.390%
d. 95.452%
9. In a study of life expectancy of 400 people in a certain geographic region, the mean age at
death was 70 years, and the standard deviation was 5.1 years. If a sample of 50 people from
this region is selected, what is the probability that the mean life expectancy will be less than 68
years?
a. 0.189%
b. 0.250%
c. 0.280%
d. 0.294%
10. The average public high school has 468 students with a standard deviation of 87. If a random
sample of 38 public schools is selected, what is the probability that the number of students
enrolled is between 445 and 485?
a. 68.345%
b. 79.672%
c. 80.451%
d. 83.338%

13 | P a g e
Statistics & Probability
A.Y. 2021-2022

References

- Singh, Seema. “Sampling Techniques.” Towards Data Science. Retrieved 12 July 2019.

- https://link.quipper.com/en/organizations/5f22837b28943218902a973c/curriculum#curriculum

14 | P a g e
Statistics & Probability
A.Y. 2021-2022
WHAT’S MORE: ANSWER KEY
I. Determine whether the following statements are true or false.

1. The group surveyed when the survey is not administered to all members of the population is called
a sample. TRUE
2. Sampling can be done when it would be impractical to survey the entire population. TRUE
3. The choice of sampling technique is dependent on the research questions and objectives and the
achievability of gaining access to the data. TRUE
4. Simple random sampling involves creating subgroups of the entire population, usually based on the
geographic location. FALSE
5. Cluster sampling uses strata in determining samples. FALSE

II. Choose the letter of the best answer.


1. Which of the following employs simple random sampling?
a. picking a house randomly and then picking 5 more on the same street
b. drawing names randomly from a covered box
c. selecting every 10th person that you see
d. picking a random location on a map and then the nearest two houses are chosen

2. A teacher divides her classroom into juniors and seniors. In each group, she divides her students
into boys and girls. From each of these four new groups, she picks one student randomly. What type
of sampling method does she use?
a. cluster sampling
b. multistage sampling
c. stratified random sampling
d. systematic random sampling

3. Cluster sampling, stratified sampling, and systematic sampling are types of what kind of sampling
method?
a. direct sampling
b. indirect sampling
c. random sampling
d. non random sampling

4. This is a type of sampling where each element of the population has an equal chance of being
included in the sample.
a. irregular sampling
b. error free sampling
c. cluster sampling
d. simple random sampling

5. A population has mean µ = 40 and standard deviation σ = 15. What is the probability that a random
sample of size 36 will have a mean greater than 47?

15 | P a g e
Statistics & Probability
A.Y. 2021-2022
Solution:
STEP 1. Identify the given information.
µ = 40
σ = 15
X = 47
n = 10

STEP 2. Identify what is asked.


P( X > 47)

STEP 3. Standardize the sample mean 47. (Solve for the z-score of 47)
X −μ
z=
σ
√n
47−40
z=
15
√10
z=2.8
STEP 4. Determine the area to the left of the z- score z = 2.8

The area to the left of z = 2.8 is 0.99744


To determine the area more than z = 2.8, subtract 0.99744 from 1 since the total area of the normal
curve is 1.
P( X > 47) = 1 – 0.99744
= 0.00256
Therefore, the probability that a random sample size of 36 will have a mean greater than 47 is
0.00256 or 0.256%.

16 | P a g e

You might also like