Cluster Sampling
Cluster Sampling
Population
Step-2: Create Groups/Clusters
Now, we divide the population into clusters or groups, and the groups do
not overlap each other. Each cluster must represent the entire
population. There are also naturally occurring clusters like schools,
cities etc.
Create groups/clusters
Step-3: Randomly Select Clusters
Within each selected cluster, list all the elements within that cluster.
For example, if the selected cluster is of grade 8th students in one
school, we need to list all the students in that class. This step is done for
our ease and understanding.
Collect data from every individual in the list we made. The data collection
can be done in various ways like surveys, interviews, observations, or
any other method according to the type of population and our topic of
interest.
data collection from clusters
The final step after collecting the data is to perform analysis on data and
draw conclusions about the population. This can be done through various
data analysis techniques and we can take decisions according to the
output obtained
Advantages:
Low cost/high frequency of use/ Cheap: This method is
cheaper than other sampling methods, like simple random
sampling or stratified sampling. It’s because this method
reduces the need to survey each and every element in the
population and the efforts to sample each and every
individual is decreased.
Practical: This is practically possible when we cannot survey
each individual in a population because clusters/groups can be
more easily recognized and can be accessed.
Requires list of all clusters, but only of individuals within
chosen clusters.
Reduces cost.
Increased Efficiency: This method increases efficiency in data
collection, if the clusters are already naturally occurring groups (for
example, households, schools, geographic regions) that are easier
to sample together.
Disadvantages
Larger error for comparable size than other probability
methods
Multistage very expensive and validity depends on other
methods used
Risk of Bias: If the clusters are not good representation of the
entire population or is not evenly distributed, it may result in
the biased/wrong result