Research Chapter 5 Editedd
Research Chapter 5 Editedd
Plan procedure
for selecting sampling units
Conduct fieldwork
Relevant population
Operationally define
A list of elements from which the sample may
be drawn
Working population
Mailing lists - data base marketers
Sampling frame error
Group selected for the sample
◦ Primary Sampling Units (PSU)
◦ Secondary Sampling Units
◦ Tertiary Sampling Units
The difference between the sample results
and the result of a census conducted using
identical procedures
Statistical fluctuation due to chance variations
Nonsampling errors
Unrepresentative sample results
Not due to chance
Due to study design or imperfections in
execution
Probability sampling
Known, nonzero probability for every
element
Nonprobability sampling
Probability of selecting any particular
member is unknown
Convenience
Judgment
Quota
Snowball
Also called haphazard or accidental sampling
The sampling procedure of obtaining the
desire
It should not be confused with stratified
sampling.
A variety of procedures
Initial respondents are selected by probability
methods
Additional respondents are obtained from
characteristic
Do not confuse with quota sample
The purpose of cluster sampling is to sample
economically while retaining the
characteristics of a probability sample.
The primary sampling unit is no longer the
disadvantage.
Sample size requirements can be met
38
Confidence level: How certain you are (or need to be)
that the statistic obtained from your sample is an
accurate estimate of the population as a whole.
• The issue of how much sampling error can be tolerated only
applies to probability samples.
• Confidence level refers to the percentage that the sample
differs from the theoretical population within the confidence
interval.
• It can range from one percent to 99 percent.
• Realistically, confidence level should never fall below 90, 95
and 99 percent being the most common.
39
Confidence interval: The calculated range within
which the true population value lies and for which we
can express 95% confidence level(the standard, but
may vary).
• So if the significance interval is three percent, it would mean that the
data from the sample fall within the three percent range as compared
to what the results would be using the whole theoretical population.
40
For example: you might read in the paper that 48% favor
raising taxes and 52% oppose raising taxes .
Assuming Significance level is (+/-5%)
What this means is that if everyone in the population were
asked, the actual proportions would be somewhere between
43% to 53% (48 +/- 5) favoring raising taxes, and 47% to 57%
(52 +/- 5) _opposing.
Most evaluations accept a sampling error of 5%.
In the tax example, if we had a 5% margin of error, then the
true picture of opinions would be between 43% to 53% favoring
raising taxes and between 47% to 57% opposing raising taxes.
41
Option 1: For dichotomous/binomial variable or for population
characteristic expressed as a proportion (Approve or disapprove, for
or against, yes/no)
n= p% x q% x [z/e%]2
Where ,
◦ n is the minimum sample size required ,
◦ P% is the proportion belonging to specified category
◦ q% is the proportion not belonging to the specified category
◦ z is the z value corresponding to the level of confidence required
◦ e is the margin of error required
Level of confidence Z value
90% certain 1.65
95% certain 1.96
99% certain 2.57
42
Assume that 30 percent of the students say they will support the current
continuous assessment approach. We decide to estimate the true
proportion in the population with in 10 percentage points of this figure
(p= 30% + or -). Assume further that we want to be 95 percent confident
that the population parameter is within plus or minus of the sample
proportion.
We can be 95 percent confident that 30 percent of the respondents
would say they would support the approach with a margin of error of +
or- 10
Find the minimum sample size?
n= p% x q% x [z/e%]2
P% =30% Or Given: P= 0.3, q=0.7, z = 1.96, e=0.1
q% =70%
z =1.96
e = 10%
n= 81
43
Exercise:
To answer a research question you need to estimate the
proportion of total population of 4000 home care clients who
receive a visit from their home health care (extension workers)
at least once a week. You have been told that you need to be 95
percent certain that the estimate is accurate ( the level of
confidence in the estimate); this corresponds to a z score of
1.96. You have also been told that your estimate needs to be
accurate with in plus or minus 5 percent of the true percentage
(the margin of error that can be tolerated).
You still need to estimate the proportion of responses who
receive a visit from their home health care assistant at least once
a week. From your pilot survey you discovered that 12 out of
the 30 clients receive a visit at least once a week.
n= 369
45
Let us assume that a political party wants to conduct a poll to
estimate the percentage of voting for the party within ±5 per
cent points and that the party wishes to be 95 per cent confident
of the result. Also assume that the percentage of voting for the
party is believed to be 35 per cent. What sample is appropriate
in this case?
46
• Determine the size of the population.
• Decide on the margin of error. (As much as possible the margin of
error should not be higher than 5%. Probably 3% is an ideal one)
• Determine sample size
n= N__
1+NE²
n = sample size
N = the size of the population
e = the margin of error
48
o Though we use the above formulas, there are no fixed rules in
determining the size of a sample needed.
49
Refer to the standardized table (that can be
obtained from research methods’ books) to
find minimum/base sample size
50
End of Chapter Five