RM - Cha 8
RM - Cha 8
1. Census survey:
2. Sample survey:
1. Census survey:
The researcher:
units
2. Sample survey:
The researcher:
• Can be:
I. Finite
II. Infinite
Finite Population:
• A population is called finite if it is possible to count or label
its individuals
Ex:
Ex:
Sample:
Sample size - n:
Sampling:
• Is a characteristic of a sample
Parameter:
• Is a characteristic of a population
Ex:
➢mean,
➢ mean,
➢Etc
• Population mean () is a parameter
statistics
Sampling error:
Non-sampling errors:
• So on.
Sampling distribution:
population.
SAMPLING DESIGN PROCESS
Sampling design Define Population . . . involved in
is the steps . . . sample planning
Census Vs Sample
Determine Sampling
Probability Sampling Procedure Non-probability
1. Simple Sampling
2. Systematic 1. Quota
3. Stratified 2. Snowball
4. Cluster 3. Convenience
5. Multi-stage 4. Judgmental
Determining appropriate
Sample Size
Estimate Cost of Planning
Reliability:
Detailed information:
• Expensiveness:
population
• Less accuracy:
• Misleading conclusion:
conditions.
• Vast data:
• Infinite population:
• Homogeneity:
Essentials of an ideal sample
• 4 basic characteristics.
1) Representativeness:
4) Homogeneity:
III. Sample design - C
C
• Sample design should answer the following
1. What type of
sample to use?
C
2. What is the
appropriate
sample unit?
Sampling unit can be
D
Iv. Sample size determination . . .
researcher
Methods to Determine Sample Size:
1. Personal judgments: Subjective decision
2. Nature of universe: Homogenous or heterogenous in nature
3. Budgetary approach:
Ex:
• If cost of surveying of one individual or unit is 30
birr and if the total available fund for survey is say
1800 birr.
• Determine the sample size?
Answer:
unit survey
• Information required:
Where
• n represent sample size
• Z represents standardization value indicating a confidence level
• E represents acceptable magnitude of error an error factor
• S represents sample SD or an estimate of the population SD
• p and q (q = 1-p) are proportion/percentage
Z-scores for the most common confidence
levels
n = (Z.S/E)2
= (1.96 *200/10)2
= 153,664/100
= 1536.64
= 1537
Note that:
Since n = (Z.S/E)2
decreases
proportion:
Non- Probability
probability
sampling sampling
Sampling
Technique
s
1. Non-Probability
Sampling
Non-Probability Sampling
• Samples are selected based on the subjective
judgment of the researcher, rather than random
selection
• Less complicated
• Less expensive
✓People in my class
of unity.
respondents.
2. Judgment/Purposive/Deliberate sampling
• Is a form of convenience sampling
• The population
elements are
selected based
on the
judgments of
the researcher.
• The researcher’s knowledge is instrumental in
of error.
• Select elements that are believed to be typical or
a sample to serve a
specific purpose.
✓disliked or
Quota Sampling
• The selection of individuals from each strata/group
First stage
Second stage:
cheaper
• Selection bias
Probability Sampling
Simple Systematic
Random Stratified Cluster
Sampling
Sampling Sampling Sampling
Multi-stage
Sampling
1.Simple Random Sampling - SRS
I. Lottery method
selection
of items
thus
depends
on chance
➢ It can be made with replacement or without replacement
• If the selected cards are not replaced before the next draw
units
• Simplicity
• The bias that could occur when the sample set is not
large enough
• Random start
= N/n
Ex 1:
Ex 2: • Assume N = 9 • Select a random
• Want n = 3 number from 1 to 3
• Sampling Interval = N/n = 9/3 = 3 • Assume it is 3
Class Activity:
technique.
homogeneous group.
• The population is divided into various classes
or sub-population.
• Individually more
total population
• The heterogeneous population of
size N units is sub-divided
into L homogeneous non overlapping sub
populations called Strata
• The ith stratum having Ni units (i =1, 2, 3,…,L) such
Males Females
Heterogeneous Population
Children
• Age • Income • Religion
•• Gender
Age •• Race
Income •• Religion
Etc
• Gender • Race • Etc
Example
❑ Suppose a researcher wishes to collect information
regarding income expenditure of the male
population of, say Jimma Town.
Service providers
Businessmen
Sellers
Others
❑From these d/t groups, the researcher will
select
elements using
random
sample
technique.
• Questions to be considered in stratified sampling
• Strata are:
➢ purposively formed and
➢ usually based on past experience of the
researcher and
➢ personal judgment of the researcher
Tips:
to manage
each stratum?
disproportionately
Merits:
each stratum.
4. Cluster sampling
• A method of probability sampling that is often used
to study:
➢large populations,
➢particularly those
that are widely
geographically
dispersed
• Researchers divide a population into smaller
multiple groups known as clusters
• A sample of
such
clusters is
then
selected
• Groups like
➢ schools,
➢ manufacturing unit,
Then take a sample
➢ households,
➢block of city,
➢Etc
• After randomly selecting the primary sample
unit (city, part of city), we survey or interview
all families or elements in that selected
primary sample unit.
studied
• The area sample is the commonly used type of
cluster sampling.
How to cluster a sample?
areas is selected
One - stage (1st stage) sampling:
• All of the elements within selected cluster are
included in the sample
• Has 4 steps
Step 1: Define your population
Step 2: Divide your sample into clusters
Step 3: Randomly select clusters to use as your sample
Step 4: Collect data from the sample
Two-stage (2nd stage) sampling:
• A subset of elements within selected clusters are
randomly selected for inclusion in the sample
Ex 1: Assume a researcher wants to evaluate consumer
spending on various modes of transportation in Bole of Addis
Ababa. Since Bole is a large sub-city with 14 woredas the
researcher needs to apply clustering approach to take a sample
out of it.
• The key stages he needs to apply for the application of
cluster sampling for this research would be:
Stage 1: Defining the total population:
✓ The target population is people living in Bole of Addis Ababa
Stage 2: Dividing population into 14 clusters:
✓ The area can be divided into 14 woredas or clusters
Stage 3: Choosing a sample of clusters out of the total:
✓ Ex: he may choose 3 out of the total 14 woredas
✓ HHs residing in 3 woredas will represent samples for the study
Stage 4: Choosing individual households to be included in the study
Ex 2:
• If you are interested in the average reading and
writing level of all the 5th graders in Addis Ababa
• Difficult to obtain a list of all 5th graders and collect
data from a random sample spread across Addis.
• So define the population:
➢ The 5th graders in Addis Ababa – select all schools
➢ Cluster the population – cluster the 5th graders by
school they attend
➢ Randomly select clusters to use as your sample -
sample within the school
➢ Collect data from the sample
Generally:
Limitation:
advantage it possesses.
Cluster Sampling vs Stratified Sampling
BASIS FOR STRATIFIED CLUSTER
COMPARIS SAMPLING SAMPLING
ON
Meaning • Stratified sampling • Cluster sampling
is one, in which the refers to a sampling
population is method wherein the
divided into members of the
homogeneous population are
segments, and then selected at random,
the sample is from naturally
randomly taken occurring groups
from the segments. called 'cluster'.
Sample • Randomly selected • All the individuals
individuals are taken are taken from
Cluster Sampling vs Stratified Sampling
BASIS FOR STRATIFIED CLUSTER
COMPARISON SAMPLING SAMPLING
representation. efficiency.
What’s their Similarities?
SIMILARITY
STRATIFIED SAMPLING CLUSTER SAMPLING
• Probability sampling • Probability sampling methods
methods
• Divide a population into • Divide a population into distinct
distinct groups – Strata groups -Cluster
• Tend to be quicker and more • Tend to be quicker and more cost-
cost-effective ways of effective ways of obtaining a
obtaining a sample from a sample from a population compared
population compared to a to a simple random sample.
simple random sample.
• Between groups • Within group
• Imposed by the researcher • Naturally occurring groups
• To increase precision and • To reduce cost and improve
representation. efficiency.
What is their Differences?
Difference
STRATIFIED SAMPLING CLUSTER SAMPLING
• A probability sampling procedure in • Cluster Sampling is a sampling
which the population is separated into technique in which the units of the
different homogeneous segments population are randomly selected from
called ‘strata’, and then the sample is already existing groups called ‘cluster.’
chosen from the each stratum
randomly, is called Stratified
Sampling.
• The individuals are randomly selected • The sample is formed when all the
from all the strata, to constitute the individuals are taken from randomly
sample. selected clusters.
• Population elements are selected • Population elements are selected in
individually from each stratum. aggregates.
• There is homogeneity within the • Homogeneity is found between groups.
group.
• Heterogeneity occurs between groups. • The members of the group are
heterogeneous in cluster sampling.
• Categories are imposed by the • The categories are already existing
Stratified Versus Cluster Sampling
5.Multi-stage sampling
random.
surveys.
Ex 1: If we wish to estimate say yield per hectare of a given
crop say coffee in Gedeo zone. We begin by random selection
of say 5 districts in the first instance.
250 Farms
• Of these 5 districts, 10
villages per district will be
chosen in the same manner. 5 farms/
Villages
• In final stage we will select
again randomly 5 farms
from every village.
• Thus, we shall examine per 10 Villages
and
Non-Sampling Error
1). Sampling Error
❖Is the difference between the result of a sample and
the result of census.
procedure/sampling method
be minimized
Error
❖Why?
law of probability
sampling theory.
• Sampling theory is design to attain one or more of
the following objectives:
✓Statistical estimation: Sampling theory helps in
estimating unknown population parameters from
knowledge of statistical measurement on sample
studies.
✓Testing of hypothesis: It enables us to decide
whether to accept or to reject the stated hypothesis.
• That is, observed differences are actually due to
chance or whether they are really significant
✓Statistical inference: Sampling theory helps in
making generalization about the population from the
studies based on samples drawn from it. It also helps
in determining the accuracy of such generalization
CHAPTER SUMMARY QUESTIONS
1. Systematic sampling is more random than a simple random
sampling method. A) True B) False.
2. Cluster sampling is one of the probability sampling
methods where individuals are randomly selected as the
primary sampling units finally. A) True B) False.
3. The first and last stages while conducting a census for the
research study is defining the target population and
execution of the plan respectively. A). True B). False
4. Sampling technique is more preferable to census when the
population is heterogenous. A). True B). False
5. The heterogeneous population of size N units is sub-
divided into H homogeneous non-overlapping sub
populations called Strata. A). True B). False
1. Why probability sampling are the most preferred?
A. It is very convenient in the situation when the sample to be
selected is very small and the researcher wants to get some
idea of the population characteristics
B. The sample units are selected based on the decision of the
researcher
C. Each unit of the population has some unknown probability
of entering the sample
D. The processes of sampling is automatic in which the
selection of one exclusively depends on the selection of the
other
E. All
F. None
2. Why is sampling so important to research?
A. Maximizing efficiency and being representative
B. It is usually impossible to study the entire population
C. Need for specialized knowledge
D. All E. Except C F. None
3. Basiliyos is worried that the d/c b/n the results of his
study and what is actually true for the population may be
too large. Which of the following would best define what is
Basiliyos worried about?
A. The randomness of his study
B. The sampling frame in his study
C. The population in his study
D. The error of his study E. All F. None
4. Which of the following is not true about non-probability
sampling
A. Samples are selected based on the subjective judgment
B. Random sample selection are not the intended use so that it
does not give equal chance
C. Sampling units are selected at the discretion of the
researcher
D. Used when the representativeness of the population is not
the prime issue
E. One person could have a 15% chance of being selected and
another person could have a 50% chance of being selected.
F. None
5. The list of elements from which the sample drawn is:
D). A and B
A). Element B). Sampling Unit C). Sampling frame D). All
E). A and B F). None
A). Simple B). Systematic C). Cluster D). Purposive E). All
8. Natanim wants to study the teaching-learning
methodology of graduating students under FBE
departments of Unity University in 2024. She thinks
using a stratified random sample would be the best
option for her study. Which of the following methods
would ensure Natanim takes a stratified random
sample
A. Draw a sample from each department of the faculty
B. Use lottery method
C. Employ table of random numbers
D. Order 4th year students by their average grade level and
choose every 50th participants
E. Choose economics department students from each section
of the year
F. None
9. The error due to the d/c b/n the sample estimation
and the actual value of the population is ________
A). Variance B). Systematic C). Sampling
D). Non-sampling
10. The most preferred type of sampling is ______
A). Convenience B). Judgmental C). Quota
D) Snowball E). All F). None
11.Secondary data should be verified in terms of _
A). Reliability B). Suitability C). Adequacy
D). Sufficiency E). All F) None
12. A hit or miss procedure of sample is:
A. SRSWOR
C. Cluster sampling
D. Systematic sampling
E. Multi-stage sampling
F. None
16. Hiwot is trying to put a sample together for her
current study. Her advisor suggests that Hiwot use
probability sampling to create a sample. She agrees
and decides to use a simple random sample. Which
of the following methods could Hiwot use to ensure
that she is creating a simple random sample?
A. Choosing every 150th name in a sampling frame
B. Choosing people from d/t segments of a society
C. Looking at the subject’s profile and choosing the
best participant
D. Lottery Method
E. All
F. None
17. Which of the following explains the d/c b/n
probability and non-probability sampling?
A. In a probability sample, the participants are chosen through
an interview selection process while in a non-probability
sample the participants are chosen using random number
generator.
B. In a probability sample, the participants are chosen using a
convenient sample interview selection process while in a
non-probability sample the participants are chosen using
lottery method.
C. In a non-probability sample, the participants are chosen via
in a non-random selection process while in a probability
sample the participants are chosen randomly.
D. In a probability sample, the participants are chosen by
using judgment while in a non-probability sample the
participants are chosen using a computer program. E. All
18. As Soreti conducts her research, she becomes
concerned that her sample might not really represent the
population under study in the research. Why is this a
major concern for Soreti?
A. The research is biased if the sample is not truly
representative of the study population
B. The research cannot be statistically significant if the
sample is not truly representative
C. The research cannot be replicated if the sample is not truly
representative.
D. The research cannot be generalized to the wider population
if the sample is not truly representative.
E. All
F. None
19.Which of the following is NOT a step in the sampling
process?
A. Identify the population and sampling frame
B. Estimate the cost of planning
C. Determine the sample size
D. Determine the sampling procedure
E. Identify the sampling tool
F. None
20. I am one of the 5 million people who bought a lottery
ticket that rewards ETB 30 million, with the outcomes
supposed to be revealed on the eve of the upcoming
Christmas. What is my probability of winning the lottery?
A. 50% B. 0.00002% C. 0.6% D. 0.0000002% E. Can’t be
determined F. None
END OF CHAPTER EIGHT
THANK YOU!