RM 7
RM 7
WHAT IS SAMPLING?
Sampling is the process of choosing a representative portion of a population.
A sample is a subset of individuals from which data are collected to make inferences
about the characteristics of the population.
This type of correspondence between the sample and the larger population is most
important when a researcher wants to know what proportion of the population has a
certain characteristic – like a particular opinion or a demographic feature.
2
WHY SAMPLE?
We know what the target population is, why not simply use the whole
population?
In some cases, where the target population is relatively small, and where
the need to generalize beyond the sample isn't a requirement, then
the whole population could indeed be used.
In reality there are a number of reasons for not using the whole
population:
9
FACTORS IN SELECTING WHICH
SAMPLING TECHNIQUES
Availability of and access to the individuals in the representative
group.
10
ADVANTAGES OF USING A SAMPLE
It is faster and cheaper.
It can yield more comprehensive information because a small sample can be
thoroughly investigated whereas the cost of such thoroughness would be prohibitive
for a large population.
It is more accurate because a small highly skilled group of workers is likely to make
fewer errors in the collection and handling of data than a large census would.
Because of the savings it permits in time and money, the sample survey makes
possible the study of much larger and much more varied populations than would be
possible for the same expenditure if one were making a complete enumeration
11
SAMPLE SIZE AND SAMPLING ERROR
In most cases, there is no prior knowledge about the variance of the population. In
such a case, the best thing that can be done is simply to take a certain
percentage of the whole population, say 2, 5, 10, 20, or 50 percent of N. Some
pointers maybe used as guides in selecting the percentage to be used:
13
SAMPLE SIZE AND SAMPLING ERROR
Large
ERROR
Small Large
SIZE OF SAMPLE
14
SAMPLE SIZE AND SAMPLING ERROR
Whenever a mean, a percentage, or other statistics is calculated
from a sample, a population value is being estimated. The question
is: “How much error is there likely to be in the statistics calculated
from samples of different sizes?”
The curve in the figure roughly expresses the relations between
sample size and error, error meaning deviation from population
values. The curve says that “the smaller the sample, the larger the
error and the larger the sample, the smaller the error”. This is
because statistics calculated from large samples are more
accurate, other things being equal, than those calculated from small
samples.
15
SAMPLE SIZE AND SAMPLING ERROR
All other things being equal, smaller samples (e.g., those with fewer than
1,000 respondents) have greater sampling error than larger
samples.
The sampling error is a number that describes the precision of an
estimate from any one of those samples. It is usually expressed as
a margin of error associated with a statistical level of confidence.
For example, a presidential preference poll may report that the
incumbent is favored by 51% of the voters, with a margin of error of
plus-or-minus 3 points at a confidence level of 95%. This means
that if the same survey were conducted with 100 different samples
of voters, 95 of them would be expected to show the incumbent
favored by between 48% and 54% of the voters (51% ± 3%).
16
SAMPLE SIZE AND SAMPLING ERROR
If you increase your n, you decrease standard error (SE) resulting
to higher statistics.
This results to the rejection of the null hypothesis (Ho). Including
Design Efficiency (Deff) factor in your sampling and survey design
will correct this. In most cases, without Deff, you overestimate
statistics that result to outright rejection of the Ho.
17
PROBABILITY SAMPLING TECHNIQUES
Simple Random Sampling
Systematic Sampling (with a random start)
Stratified Sampling
Cluster Sampling
Multi-stage Sampling
18
SIMPLE RANDOM SAMPLING
It is a sampling technique wherein every element in the sampled
population is given an equal non-zero chance of entering the sample.
It is sampling wherein all possible samples of fixed size n have the same
probability of being selected
A sample drawn at random is unbiased in the sense that no member of
the population has any more chance of being selected than any other
member. Random methods of selection do not allow our own biases
or any other systematic selection factors to operate.
May be either: (a) with replacement or (b) without replacement.
Sampling is done in a single stage with each element selected
independently.
19
ADVANTAGES OF SIMPLE RANDOM SAMPLING
The sample mean is an unbiased estimate of the population mean
(samples are used to estimate population parameters).
Estimation methods are simple and easy.
20
DISADVANTAGES OF SIMPLE
RANDOM SAMPLING
21
WHEN TO USE SIMPLE RANDOM SAMPLING
If the population is not widely spread geographically.
If the population is more or less homogenous with respect to the
characteristics under study (Use stratified sampling if individuals are
heterogenous).
22
SYSTEMATIC SAMPLING
It is a method of selecting a sample by taking every kth unit from an
ordered population, the first unit being selected at random.
25
ADVANTAGES OF SYSTEMATIC SAMPLING
Selection of the sample is administratively easier, quicker, and
cheaper than simple random sampling.
26
DISADVANTAGES OF SYSTEMATIC SAMPLING
A systematic sample may give poor precision when unsuspected
periodicity is present in the population (e.g. every 25th household in
the sample or population is rich).
27
STRATIFIED SAMPLING
There may often be factors which divide up the population into sub-
populations (groups/strata) and we may expect the measurement of
interest to vary among the different sub-populations.
28
WHEN TO USE STRATIFIED SAMPLING
When the population is heterogeneous, or dissimilar.
29
ADVANTAGES OF STRATIFIED SAMPLING
It is more efficient than simple random sampling.
It is administratively convenient.
30
DISADVANTAGES OF STRATIFIED SAMPLING
Stratification of the population may mean the nee for additional prior
information about the population and its subpopulations.
31
HOW TO DO STRATIFIED SAMPLING
Stratified sampling may be done thru:
(a) Proportional Allocation
(b) Quota/Non-proportional Allocation
In proportional allocation, we determine the size of each stratum and
the sample in each stratum is determined in terms of their percentage
to the total sample size.
Example: Suppose that in a company there are the following staff: (A)
male, full time=90, (B) male, part time=18, (C) female, full time=9, and
(D) female, part time=63. You want to have a sample size of n=80.
A= 90/180 x 80 = 40 C= 9/180 x 80 =4
B= 18/180 x 80 = 8 D= 63/180 x 80 =28
32
CLUSTER SAMPLING
Cluster sampling is a sampling technique where the entire population is
divided into groups, or clusters, and a random sample of these
clusters are selected. All observations in the selected clusters are
included in the sample.
Cluster sampling is typically used when the researcher cannot get a
complete list of the members of a population they wish to study but
can get a complete list of groups or 'clusters' of the population.
It is also used when a random sample would produce a list of subjects
so widely scattered that surveying them would prove to be far too
expensive.
This sampling technique may well be more practical and/or economical
than simple random sampling or stratified sampling. 33
CLUSTER SAMPLING
Example 1
Suppose we would like to investigate the use of pesticides by farmers in a
barangay. A cluster sample could be taken by identifying the different sitio or
purok as clusters. A sample of these sitio or purok (clusters) would then be
chosen at random, so all farmers in those sitio/purok will be included in the
sample. It can be seen here then that it is easier to visit several farmers in
the same purok/sitio than it is to travel to each farm in a random sample to
observe the use of pesticides.
Example 2
In conducting a poll of voter preferences for a nationwide election, we can
randomly select congressional districts (or some other applicable grouping of
voters), and then conduct the poll among the people in the chosen
congressional district. Many voter polls that utilize cluster sampling would
carefully choose their clusters so that they best represent the voter
preferences.
34
CLUSTER SAMPLING
Similar to to strata in stratified random sampling, clusters are mutually
exclusive subpopulations which together comprise the entire
population. Unlike strata, however, clusters are comprised of
heterogenous elements so that each cluster is typical of the
population.
36
DISADVANTAGES OF CLUSTER SAMPLING
It is not as effective as random or stratified random sampling in
ensuring representativeness. In actual field applications, for example,
adjacent households tend to have more similar characteristics than do
households distantly apart. This affects the representativeness of the
sample.
37
MULTI-STAGE SAMPLING
It In multistage sampling, we combine several techniques of sampling
into two or more phases of selection.
Except for the first stage units, a sampling frame is required only for
those units selected in order to sample the subunits.
39
DISADVANTAGES OF MULTI-STAGE SAMPLING
The major disadvantage of multistage sampling is its complexity in
theory, which maybe difficult to apply in the field. The estimation
procedures are difficult for non-statisticians to follow.
40
NON-PROBABILITY SAMPLING TECHNIQUES
Quota Sampling
Purposive Sampling
Accidental Sampling
Snowball Sampling
41
QUOTA SAMPLING
This is usually done in surveys where the sampling frame is not used at
all and the choice of the actual sample units to be interviewed is left to
the discretion of the interviewer.
The interviewer is given quota controls. A control maybe socioeconomic
variables such as age, income, sex, occupation, place of residence,
etc.
It is widely used in opinion polling and market research. Interviewers are
each given a quota of subjects of specified type to attempt to recruit
for example, an interviewer might be told to go out and select 20 adult
men and 20 adult women, 10 teenage girls and 10 teenage boys so
that they could interview them about their television viewing.
It suffers from a number of methodological flaws, the most basic of
which is that the sample is not a random sample and therefore the
sampling distributions of any statistics are unknown.
42
PURPOSIVE SAMPLING
This is characterized by the use of judgment by the researcher and a
deliberate effort to obtain representative samples by including
presumably typical areas or groups in the sample. Use this method
with extreme care.
43
ACCIDENTAL OR CONVENIENCE SAMPLING
This a form of non-probability sampling technique in which one takes
available samples at hand. This form of sampling should not be used
at all.
Because some members of the population have no chance of being
sampled, the extent to which a convenience sample – regardless of its
size – actually represents the entire population cannot be known.
44
SNOWBALL SAMPLING
Snowball sampling may simply be defined as a technique for finding research
subjects. One subject gives the researcher the name of another subject, who
in turn provides the name of a third, and so on (Vogt, 1999).
45
SNOWBALL SAMPLING
Although they violate the principles of sampling, the use of snowball
strategies provides a means of accessing vulnerable and more
impenetrable social groupings.
46
DEFICIENCIES OF SNOWBALL SAMPLING
Problems of representativeness and sampling principles.
47