Chapter9-Sampling in Quantitative Research
Chapter9-Sampling in Quantitative Research
● Quantitative research can be characterized as exhibiting certain preoccupations, the most central of
which are: measurement; causality; generalization; and replication.
● Quantitative research has been subjected to many criticisms by qualitative researchers. These
criticisms tend to revolve around the view that a natural science model is inappropriate for studying
the social world.
ONLINE RESOURCES
www.oup.com/uk/brm5e/
Visit the Interactive Research Guide that accompanies this book to complete an exercise focused on
the nature of quantitative research.
CHAPTER
9
SAMPLING IN
QUANTITATIVE RESEARCH
Introduction 186 Time and cost 196
Non-response 196
Introduction to sampling 187
Heterogeneity of the population 197
Sampling error 189
Types of non-probability sampling 197
Types of probability sample 191 Convenience sampling 197
Simple random sample 191 Quota sampling 198
Systematic sample 191 Limits to generalization 201
Stratified random sampling 192
Error in survey research 202
Multi-stage cluster sampling 192
The qualities of a probability sample 193 Sampling issues for online surveys 202
Absolute and relative sample size 195 Questions for review 205
CHAPTER OUTLINE
This chapter and the three that follow it are very much concerned with principles and practices associated
with social survey research. Sampling principles are not exclusively a concern for survey research; for
example, they are relevant to the selection of documents for content analysis (see Chapter 13). However,
in this chapter the emphasis will be on sampling in connection with the selection of people who would be
asked questions by interview or questionnaire. The chapter explores:
• the related ideas of generalization (also known as external validity) and of a representative sample (the
latter allows the researcher to generalize findings from a sample to a population);
• the idea of a probability sample—that is, one in which a random selection process has been employed;
• the main types of probability sample: the simple random sample; the systematic sample; the stratified
random sample; and the multi-stage cluster sample;
• different types of non-probability sample, including quota sampling, which is widely used in market
research and opinion polls;
This chapter is concerned with some important aspects Decide on topic/area to be researched
of conducting a survey, but it presents only a partial pic-
ture, because there are many other steps. In this chapter
Review literature/theories relating to topic/area
we are concerned with the issues involved in selecting
respondents for survey research, although the principles
involved apply equally to other approaches to quantitative Formulate research questions
research, such as content analysis. Chapters 9, 10, and 11
deal with the data collection aspects of conducting a sur- Consider whether a social survey is appropriate
vey, while Chapters 15 and 16 deal with the analysis of (if not, consider an alternative research design)
data.
Figure 9.1 is intended to outline the main steps
Consider what kind of population will be appropriate
involved in planning and conducting survey research.
The planning will begin with an examination of gen-
eral research issues that are to be investigated. These Consider what kind of sample design will be employed
are gradually narrowed down so that they become
research questions, which may take the form of hypoth- Explore whether there is a sampling frame that can be
eses, though this is not necessarily the case. The move- employed
ment from research issues to research questions is
likely to be the result of reading the literature relating Decide on mode of administration
to the issues. (face-to-face; telephone; postal; email; online)
Once the research questions have been formulated,
the planning of the fieldwork can begin. In practice,
Develop questions (and devise answer
decisions relating to sampling and to the research alternatives for closed questions)
instrument will overlap, but they are presented in Fig-
ure 9.1 as part of a sequence. The survey researcher
Review questions and assess face validity
needs to decide what kind of population is suited to
the investigation of the topic and also needs to formu-
late a research instrument and decide how it should Pilot questions
be administered. The ‘research instrument’ may be a
structured interview schedule (discussed in Chapter 10) Revise questions
or a self-completion questionnaire (Chapter 11). There
are several different ways of administering such instru-
Finalize questionnaire/schedule
ments. Figure 9.2 outlines the main types that are likely
to be encountered.
Quantitative research almost invariably involves sam- Sample from population
pling. In this chapter, we will be mostly concerned with
sampling for social survey research involving data collec- Administer questionnaire/schedule to sample
tion by structured interview or questionnaire. In social
survey research, sampling constitutes a key step in the
research process, as illustrated in Figure 9.1. However, Transform completed questionnaires/schedules
into computer-readable data (coding)
other methods of quantitative research also involve
sampling considerations, as will be seen in Chapter 13,
where we will examine structured observation and content Analyse data
analysis. The principles of sampling involved are more or
less identical in connection with these other methods, Interpret findings
though frequently other considerations come to the fore
as well.
Consider implications of findings for research questions
Introduction to sampling 187
FIGURE 9.2
Main modes of administration of a survey
Survey
Structured Self-completion
interview questionnaire
Supervised Postal
Face-to-face Telephone Internet
5 6
Embedded Attached
7 8
Introduction to sampling
Many of the readers of this book will be university or them, since conducting survey research by interview is
college students. At some point in your stay at your uni- considerably more expensive and time-consuming, all
versity (we will use this term from now on to include things being equal, than by questionnaire (see Chapter
colleges) you may have wondered about the attitudes of 10). It is almost certain that you will need to sample
your fellow students to various matters, or about their students from the total population of students in your
behaviour in certain contexts, or something about their university to collect data from a manageable number of
backgrounds. If you were to decide to examine any or respondents.
all of these three areas, you might consider conducting But will any old sample suffice? Would it be sufficient
structured interviews or sending out questionnaires in to locate yourself in a central position on your campus
order to find out about their behaviour, attitudes, and (if your university has one) and then interview the stu-
backgrounds. You would, of course, have to consider dents who come past you and whom you are in a posi-
how best to design your interviews or questionnaires, tion to interview? Alternatively, would it be sufficient to
and the issues that are involved in the decisions that go around your student union asking people to be inter-
need to be made about designing these research instru- viewed? Or to send questionnaires to everyone on your
ments and administering them will be the focus of course?
Chapters 9–11. However, before getting to that point, The answer depends on whether or not you want to be
you are likely to be confronted with a problem. Let us able to generalize your findings to the entire student body
say that your university has around 10,000 students. in your university. If you do—and as discussed in Chap-
It is extremely unlikely that you will have the time and ter 7, one of the preoccupations of quantitative research-
resources to collect data from all these students. It is ers is generalization—it is unlikely that any of the three
unlikely that you would be able to send questionnaires sampling strategies proposed in the previous paragraph
to all 10,000 and then process the results, and even would provide you with a representative sample of all stu-
more unlikely that you would be able to interview all of dents in your university. To be able to generalize your
188 9 Sampling in quantitative research
findings from your sample to the population from which sample? There are various reasons, of which the follow-
it was selected, the sample must be representative. This ing stand out.
is a fundamentally important point, because the aim of
analyzing data from a sample is to make inferences about • The first two approaches depend heavily upon the
availability of students during the time or times that
the larger population from which the sample was drawn,
you search them out. Not all students are likely to be
without having to try to collect data from the whole pop-
equally available at that time, so the sample will not
ulation. See Key concept 9.1 for an explanation of terms
reflect those students who are not available.
concerning sampling.
Why might the strategies for sampling students previ- • The first two approaches also depend on the students
ously outlined be unlikely to produce a representative going to the locations. Not all students will necessarily
Sampling error 189
pass the point where you locate yourself or go to the Three sources of bias can be identified (see Key con-
student union, or they may vary significantly in the cept 9.1 for an explanation of terms):
frequency with which they do so. Their movements
are likely to reflect such things as where their halls of • If a non-probability or non-random sampling method is
used. If the method used to select the sample is not ran-
residence or accommodation are situated, or where
dom, there is a possibility that human judgement will
their departments are located, or their social habits.
affect the selection process, making some members of
Again, to rely on these locations would mean missing
the population more likely to be selected than others.
out on students who do not frequent them.
This source of bias can be eliminated through the use
• It is possible, not to say likely, that your decisions about of probability or random sampling, the procedure for
which people to approach will be influenced by your which is described later in this chapter.
judgements about how friendly or cooperative the
people concerned are likely to be or by how comforta- • If the sampling frame is inadequate. If the sampling
frame is not comprehensive or is inaccurate, or suffers
ble you feel about interviewing students of the same
from some other kind of similar deficiency, the sample
(or opposite) gender to yourself, as well as by many
that is derived cannot represent the population, even if
other factors.
a random or probability sampling method is employed.
• The problem with the third strategy is that students on
your course by definition take the same subject as each • If some sample members refuse to participate or cannot be
contacted—in other words, if there is non-response. The
other and therefore will not be representative of all
problem with non-response is that those who agree to
students in the university.
participate may differ in various ways from those who do
In other words, in the case of all three sampling not agree to participate. Some of the differences may be
approaches, your decisions about whom to sample are significant to the research question or questions. If the
influenced too much by personal judgements, by prospec- data are available, it may be possible to check how far,
tive respondents’ availability, or by your implicit criteria when there is non-response, the resulting sample differs
for inclusion. Such limitations mean that, in the language from the population. It is often possible to do this in
of survey sampling, your sample will be biased. A biased terms of characteristics such as gender or age, or, in the
sample is one that does not represent the population from case of something like a sample of university students,
which the sample was selected. As far as possible, bias whether the sample’s characteristics reflect the entire
should be removed from the selection of your sample. In population in terms of the students’ areas of study. How-
fact, it is incredibly difficult to remove bias altogether and ever, after non-response it is usually extremely difficult
to derive a truly representative sample. What needs to be to determine whether differences exist between the
done is to ensure that steps are taken to keep bias to an population and the sample in terms of ‘deeper’ factors,
absolute minimum. such as attitudes or patterns of behaviour.
Sampling error
In order to appreciate the significance of sampling we would expect our sample of 50 to be equally split
error for achieving a representative sample, consider in terms of this variable, as in Figure 9.4. If there is a
Figures 9.3–9.7. Imagine we have a population of small amount of sampling error, so that we have one
200 employees and we want a sample of 50. Imagine employee too many who is not appraised and one too
as well that one of the variables of concern to us is few who is, it will look like Figure 9.5. In Figure 9.6 we
whether or not employees receive regular performance see a rather more serious degree of over-representa-
appraisals from their immediate supervisor, and that tion of employees who do not receive appraisals. This
the population is equally divided between those who time there are three too many who are not appraised
do and those who do not. This split is represented by and three too few who are. In Figure 9.7 we have a
the vertical line that divides the population into two very serious over-representation of employees who do
halves (see Figure 9.3). If the sample is representative, not receive performance appraisals, because there are
190 9 Sampling in quantitative research
Have performance Do not have performance Have performance Do not have performance
appraisal appraisal appraisal appraisal
Have performance Do not have performance Have performance Do not have performance
appraisal appraisal appraisal appraisal
random selection process. It is important to ensure, how- than one stratifying criterion. Thus, it may be that you
ever, that there is no inherent ordering of the sampling would want to stratify by department, by gender, and by
frame, since this may bias the resulting sample. If there whether or not employees are above or below a certain
is some ordering to the list, the best solution is to rear- salary level or occupational grade. If it is feasible to iden-
range it. tify employees in terms of these stratifying criteria, it is
possible to use pairs of criteria or several criteria (such
as departmental membership plus gender plus occupa-
Stratified random sampling tional grade).
In our imaginary study of company employees, one of Stratified sampling is really feasible only when the
the features that we might want our sample to exhibit relevant information is available. In other words, when
is a proportional representation of the different depart- data are available that allow the ready identification of
ments in which employees work. It might be that the members of the population in terms of the stratifying cri-
kind of department an employee works in is viewed as terion (or criteria), it is sensible to employ this sampling
relevant to a wide range of attitudinal features that are method. But it is unlikely to be practical or economical
in turn relevant to the study of skill development and if the identification of population members for stratifica-
training. Generating a simple random sample or a sys- tion purposes entails a great deal of work because there
tematic sample might yield such a representation, so that is no available listing in terms of strata.
the proportion of employees from the sales and market-
ing department in the sample would be the same as that
Multi-stage cluster sampling
in the employee population and so on. Thus, if there are
1800 employees in the sales and marketing department, In the example we have been dealing with, employees
using our sampling fraction of 1 in 20, we would expect to be interviewed are located in a single company. Inter-
to have 90 employees in our sample from this department viewers will have to arrange their interviews with the
of the company. However, because of sampling error, it sampled employees, but, because they are all working
is unlikely that this will occur and more likely that there on the same premises (for the purposes of this hypotheti-
will be a difference, so that there may be, say, 85 or 93 cal example we are assuming that the company has only
from this department. one location), they will not be involved in a lot of travel.
Because it is very likely that the company will include However, imagine that we wanted a national sample of
in its records the department in which employees are employees. It is likely that interviewers would have to
based, or indeed may have separate lists of employees travel the length and breadth of the country to interview
for each department, it will be possible to ensure that the sampled individuals. This would add a great deal
employees are accurately represented in terms of their to the time and cost of doing the research. This kind of
departmental membership. In the language of sampling, problem occurs whenever the aim is to interview a sam-
this means stratifying the population by a criterion (in ple that is to be drawn from a widely dispersed popula-
this case, departmental membership) and selecting tion, such as a national population, or a large region, or
either a simple random sample or a systematic sample even a large city.
from each of the resulting strata. In the present example, One way in which it is possible to deal with this poten-
if there are five departments we would have five strata, tial problem is to employ cluster sampling. With cluster
with the numbers in each stratum being one-twentieth of sampling, the primary sampling unit (the first stage of
the total for each department. the sampling procedure) is not the units of the popula-
The advantage of stratified sampling in a case like tion to be sampled but groupings of those units. It is the
this is clear: it ensures that the resulting sample will be latter groupings or aggregations of population units that
distributed in the same way as the population in terms are known as clusters. Imagine that we want a nationally
of the stratifying criterion. If you use a simple random representative sample of 5000 employees who are work-
or systematic sampling approach, you may end up with ing for the 100 largest companies in the UK (this infor-
a distribution like that of the stratified sample, but it is mation is publicly available and generated through the
unlikely. Two points are relevant here. First, you can con- FTSE—Financial Times Stock Exchange—index; com-
duct stratified sampling sensibly only when it is relatively pany size is measured in terms of market capitalization).
easy to identify and allocate units to strata. If it is not Using simple random or systematic sampling would yield
possible or it would be very difficult to do so, stratified a widely dispersed sample, which would result in a great
sampling will not be feasible. Secondly, you can use more deal of travel for interviewers. One solution might be
The qualities of a probability sample 193
to sample companies and then employees from each of be sampled from each of the industries, and then approxi-
the sampled companies. A probability sampling method mately 400 employees from each of the companies would
would need to be employed at each stage. Thus, we be interviewed. Thus, there are three separate stages:
might randomly sample ten companies from the entire
population of the 100 largest companies in the UK, thus
• group 100 largest UK companies by market capitaliza-
tion;
yielding ten clusters, and we would then interview 500
randomly selected employees at each of the ten compa- • sample one company from each of the major industries;
nies. Research in focus 9.2 gives an example of a study • sample 400 employees from each of the companies.
that used cluster sampling. In a sense, cluster sampling is always a multi-stage
A potential problem, however, is that there is no guar- approach, because clusters are always sampled first and
antee that these ten companies reflect the diverse range then something else—either further clusters or popula-
of industrial activities that are engaged in by the popula- tion units—is sampled.
tion as a whole. One solution to this problem would be Many examples of multi-stage cluster sampling entail
to group the 100 largest UK companies by industry. This stratification. We might, for example, want further to strat-
could be done using the codes from the Standard Indus- ify the companies according to whether their headquarters
trial Classification system or SIC, which is used widely are located in the UK or abroad. To do this we would group
to classify companies for the purposes of collecting data. companies according to whether their headquarters were
The next step would be to randomly sample companies based in the UK or elsewhere and then select one or two
from each of the major groups. One company might then companies per industry from each of the two strata.
sample (X) to estimate the population mean (m). The the population mean will have known margins of error.
mean, or more properly the arithmetic mean, is the simple For more detail on this, see Tips and skills ‘Generalizing
average. This generalization from the sample mean to from a random sample to the population’.
FIGURE 9.8
The distribution of sample means
Number of samples
This consideration is important, because sampling theory tells us that 68 per cent of all sample means will lie
between plus or minus 1.00 standard error from the population mean and that 95 per cent of all sample means
will lie between plus or minus 1.96 standard errors from the population mean. It is this second calculation that is
crucial, because it is at least implicitly employed by survey researchers when they report their statistical findings.
They typically employ 1.96 standard errors as the crucial criterion in how confident they can be in their findings.
Essentially, the criterion implies that you can be 95 per cent certain that the population mean lies within plus or
minus 1.96 sampling errors from the sample mean.
If a sample has been selected according to probability sampling principles, we know that we can be 95 per cent
certain that the population mean will lie between the sample mean plus or minus 1.96 multiplied by the standard
error of the mean. This is known as the confidence interval. If the mean number of training days in the previous
Sample size 195
twelve months in our sample of 450 employees is 6.7 and the standard error of the mean is 1.3, we can be 95 per
cent certain that the population mean will lie between
and
If the standard error was smaller, the range of possible values of the population mean would be narrower; if the
standard error was larger, the range of possible values of the population mean would be wider.
If a stratified sample is selected, the standard error of the mean will be smaller; this is because the variation
between strata is essentially eliminated, because the population will be accurately represented in the sample in
terms of the stratification criterion or criteria employed. This consideration demonstrates the way in which
stratification injects an extra increment of precision into the probability sampling process, since a possible source
of sampling error is eliminated.
By contrast, a cluster sample without stratification exhibits a larger standard error of the mean than a comparable
simple random sample. This occurs because a possible source of variability between employees (that is,
membership of one department rather than another, which may affect levels of training undertaken) is
disregarded. If, for example, some departments have a culture of learning in which a large number of employees
were involved, and if these departments were not selected because of the procedure for selecting clusters, an
important source of variability would have been omitted. It also implies that the sample mean would be on the low
side, but that is another matter.
Sample size
One question about research methods that we are asked national probability sample of 1000 individuals in the UK
by students almost more than any other relates to the size is as likely to be representative as a national probability
of the sample: ‘How large should my sample be?’ or ‘Is my sample of 1000 individuals in the USA, even though the
sample large enough?’ The decision about sample size is latter has a much larger population. It also means that
not a straightforward one: it depends on a number of con- increasing the size of a sample increases the precision of a
siderations, and there is no one definitive answer. This is sample. This means that the 95 per cent confidence inter-
frequently a source of great disappointment to those who val referred to in Tips and skills ‘Generalizing from a ran-
pose such questions. Moreover, most of the time decisions dom sample to the population’ narrows. However, a large
about sample size are affected by considerations of time sample cannot guarantee precision, so that it is probably
and cost (see Tips and skills ‘Sample size and probability better to say that increasing the size of a sample increases
sampling’). Therefore, invariably decisions about sample the likely precision of a sample. This means that as sam-
size represent a compromise between the constraints of ple size increases, sampling error decreases. Therefore,
time and cost, the need for precision, and a variety of fur- an important component of any decision about sample
ther considerations that will now be addressed. size could be how much sampling error one is prepared
to tolerate. The less sampling error one is prepared to tol-
erate, the larger a sample will need to be. Fowler (1993)
Absolute and relative sample size however warns against a simple acceptance of this crite-
One of the most basic considerations, and one that is pos- rion. He argues that in practice researchers do not base
sibly the most surprising, is that, contrary to what you their decisions about sample size on a single estimate
might have expected, it is the absolute size of a sample of a variable. Most survey research is intended to gen-
that is important, not its relative size. This means that a erate a host of estimates—that is, of the variables that
196 9 Sampling in quantitative research
make up the research instrument that is administered. cost at such a juncture, since striving for smaller and
He also observes that it is not normal for survey research- smaller increments of precision becomes an increasingly
ers to be in a position to specify in advance ‘a desired level uneconomic proposition.
of precision’ (1993: 34). Moreover, since sampling error
will be only one component of any error entailed in an
Non-response
estimate, the notion of using a desired level of precision
as a factor in a decision about sample size is not realis- Considerations about sampling error do not end here.
tic. Instead, to the extent that this notion does enter into The problem of non-response should also be borne in
decisions about sample size, it often does so in a general mind. Most sample surveys include a certain amount
rather than a calculated way. of non-response. Thus, it is likely that only some of our
sample will agree to participate in the research. If it is
our aim to ensure as far as possible that 450 employ-
Time and cost ees are interviewed and if we think that there may be
Time and cost considerations become very relevant in a 20 per cent rate of non-response, it may be advis-
this context. In the previous paragraph it is clearly being able to sample 540–550 individuals, on the grounds
suggested that the larger the sample size, the greater that approximately 90 will be non-respondents. For
the precision (because the amount of sampling error example, of the 143 survey questionnaires sent to com-
will be less). By and large, up to a sample size of around panies in T. C. Powell’s (1995) study of total quality
1000, the gains in precision are noticeable as the sample management, only 40 were returned and of these only
size climbs from low figures of 50, 100, 150, and so on 36 were usable, making a response rate of 25 per cent.
upwards. After a certain point, however, often in the This raises the question of whether or not this sample
region of 1000, the sharp increases in precision become is big enough to represent companies in the geographi-
less pronounced, and, although it does not plateau, there cal area of the north-eastern USA that the study claims
is a slow down in the extent to which precision increases to represent (see Chapter 11 for a further discussion of
(and hence the extent to which the sample error of the acceptable response rates). The issue of non-response,
mean declines). Considerations of sampling size are and in particular of refusal to participate, is of par-
likely to be profoundly affected by matters of time and ticular significance, because it has been suggested by
Types of non-probability sampling 197
some researchers that response rates (see Key con- organizational research. Strategies that can improve
cept 9.3) to surveys are declining in many countries. responses to survey instruments such as structured
This implies that there is a growing tendency towards interviews and questionnaires will be examined in
people refusing to participate in survey research. As Chapters 10 and 11.
long ago as 1973, an article in the American maga-
zine Business Week carried an article ominously
Heterogeneity of the population
entitled ‘The Public Clams up on Survey Takers’. The
magazine asked survey companies about their experi- Yet another consideration is the homogeneity and het-
ences and found considerable concern about declin- erogeneity of the population from which the sample
ing response rates (Business Week 1973). Similarly, is to be taken. When a sample is very heterogeneous,
in Britain, a report from a working party on the Mar- such as a sample of a whole country or city, the popu-
ket Research Society’s Research and Development lation which it represents is likely to be highly varied.
Committee in 1975 pointed to concerns among mar- When it is relatively homogeneous, such as members of
ket research companies. However, analyses of this a company or of an occupation, the amount of variation
issue by by Baruch and Holtom (2008) and Anseel is less. The implication of this is that, the greater the
et al. (2010) suggested that there is little evidence of heterogeneity of a population, the larger a sample will
a generalized decline in response rates in surveys in need to be.
will be a good response rate. The findings may prove common and in some areas, such as consumer behav-
quite interesting, but the problem with such a sampling iour research, they have become the norm. Nonetheless,
strategy is that it is impossible to generalize the findings, there is evidence that we should be cautious in general-
because we do not know of what population this sample izing from convenience samples, particularly when they
is representative. They are simply a group of managers are samples of undergraduate students (Peterson and
who are available to the researcher. They are almost cer- Merunka 2014).
tainly not representative of managers as a whole—the It is common in some kinds of business research to
very fact they are taking this degree programme marks use university undergraduate students as a convenience
them out as different from managers in general. sample, particularly in experimental research. This has
This is not to suggest that convenience samples should been criticized by Hooghe et al. (2010), who argue that it
never be used. Let us say that our lecturer/researcher is can endanger the validity and generalizability of findings
developing a battery of questions that are designed to because undergraduate students are likely to think and
measure the ethical decision-making processes used by act differently from the general population. For instance,
managers. It is highly desirable to pilot such a research people from lower socio-economic groups are under-
instrument before using it in an investigation, and admin- represented in undergraduate student populations; stu-
istering it to a group who are not a part of the main study dents are likely to have more socio-economic resources
may be a legitimate way of carrying out some preliminary than is typical of the general population; and students
analysis of such issues as whether or not respondents may be inclined to exert more-than-typical cognitive
tend to answer a particular question in identical ways, or effort in order to give the ‘right’ answer. Moreover, it is
whether or not one question is often omitted when manag- highly problematic to make claims about managerial or
ers respond to it. In other words, for this kind of purpose, workplace phenomena based on such samples, as many
a convenience sample may be acceptable, though not undergraduates have little or no full-time workplace or
ideal. A second kind of context in which it may be at least managerial experience. This has led some management
fairly acceptable to use a convenience sample is when the journals to refuse to publish papers based on student
chance presents itself to gather data from a convenience samples, unless the sample is merely a convenient way
sample and it represents too good an opportunity to miss. to sample managers, as in the case of part-time MBAs
The data will not allow definitive findings to be generated, discussed above. The study described in Research in
because of the problem of generalization, but they could focus 9.4 used a convenience sample taken from the ‘real
provide a springboard for further research or allow links world’ rather than from among students.
to be forged with existing findings in an area.
It also perhaps ought to be recognized that conve-
Quota sampling
nience sampling probably plays a more prominent role
than is sometimes supposed. In the field of business and Quota sampling is used intensively in commercial
management, convenience samples are certainly very research, such as market research and political opinion
STUDENT EXPERIENCE
Using convenience sampling in a dissertation
Jordan conducted research on self-expression in the workplace. Her approach required a fairly large sample of
employees. The one-year duration of her honours program, and the need to fit her research around a demanding
coursework load, meant that she had to find an efficient way to generate a sample which would allow her to
conduct her analysis. She chose to survey graduate employees in a private-sector organization, who were
undergoing a training program run by an NGO she had previously worked for. All 515 graduate employees who
underwent the training program were given a paper survey to complete during their training program. She
explained: ‘[the NGO] provides training to this particular organisation and to all of the graduate employees … 515
of them. They … provide this training in two places in Melbourne and Sydney … the reason why I got 515 surveys
was because time was set aside during the workshop.’ Jordan’s use of convenience sampling provided her with an
effective means to collect data which allowed her to conduct her analysis in a situation in which she did not have
the time or resources to generate a random sample of employees.
Types of non-probability sampling 199
The study was conducted in Belgium. The authors needed a large sample for their study, but there was no
sampling frame available from which to draw a random sample of respondents, so the sample was drawn from
membership lists of HR professional associations, business publications, and the researchers’ own networks. This
generated a sample of 1463 respondents, of whom 424 agreed to participate. Participants were all Caucasian.
Clearly this strategy did not generate a random sample and the sample cannot be regarded as representative in a
statistical sense. Nonetheless, the sampling strategy represented a viable way to draw a large sample of actual HR
professionals which could not otherwise have been generated.
The researchers provided each respondent with job advertisements for two different jobs in two different kinds of
businesses. They were each also provided with four fictional CVs, each including a photo of a fictional applicant. The
CVs showed a variety of combinations of skin tone (light versus dark) and name (Flemish versus Arab/Maghreb).
Respondents were asked to rate each applicant using a three-item measure with Likert scale responses. For
example, ‘Given all the information you read about this applicant, how likely is it that you would invite this applicant
for a job interview?’ (1 = not likely at all; 7 = very likely). The findings showed the equally-qualified candidates with
a dark skin tone were systematically rated less suitable for jobs than those with light skin tone, with the effect
varying depending on the nature of the job. Name did not appear to matter. The findings suggested that there was
systematic ethnically-based discrimination in CV screening and that there were subtle effects which arose from the
kinds of jobs involved.
polling. The aim of quota sampling is to produce a sample Accordingly, an interviewer may know that among the
that reflects a population in terms of the relative propor- various subgroups of people, he or she must find and
tions of people in different categories, such as gender, interview five Asian, 25-to-34-year-old, lower-middle-
ethnicity, age groups, socio-economic groups, and region class females in the area in which the interviewer has
of residence, and in combinations of these categories. been asked to work. The interviewer usually asks people
However, unlike a stratified sample, the sampling of indi- who are available to him or her about their characteris-
viduals is not carried out randomly, since the final selec- tics in order to determine their suitability for a particu-
tion of people is left up to the interviewer. lar subgroup. Once a subgroup quota (or a combination
Once the categories and the number of people to be of subgroup quotas) has been achieved, the interviewer
interviewed within each category (known as quotas) will no longer be concerned to locate individuals for that
have been decided upon, it is then the job of interview- subgroup.
ers to select people who fit these categories. The quo- The choice of respondents is left to the interviewer,
tas will typically be interrelated. In a manner similar to subject to the requirement of all quotas being filled, usu-
stratified sampling, the population may be divided into ally within a certain time period. Those of you who have
strata in terms of, for example, gender, social class, age, ever been approached on the street by a person toting
and ethnicity. Census data might be used to identify the a clipboard and interview schedule and have been asked
number of people who should be in each subgroup. about your age, occupation, and so on, before being
The numbers to be interviewed in each subgroup will asked a series of questions about a product or whatever,
reflect the population. Each interviewer will probably have almost certainly encountered an interviewer with
seek out individuals who fit several subgroup quotas. a quota sample to fill. Sometimes, he or she will decide
200 9 Sampling in quantitative research
Error
Limits to generalization
One point that is often not fully appreciated is that even were involved in heavy industry, such as steel-making or
when a sample has been selected using probability sam- shipbuilding. The population from which the sample was
pling, any findings can be generalized only to the pop- selected consisted of all accountants and engineers who
ulation from which that sample was taken. This is an worked for these companies. Respondents were chosen
obvious point, but it is easy to think that findings from a randomly according to certain criteria for stratification,
study have some kind of broader applicability. If we take including age, job title, level in the company, and length
our imaginary study of training and skill development of service. It is interesting that there is no mention of
among employees of a company, any findings could be gender in the study, although we can fairly safely assume
generalized only to that company. In other words, you that, given that this was a study of accountants and engi-
should be very cautious about generalizing to employ- neers in the late 1950s, there is likely to be a male bias to
ees at other companies. There are many factors that may the study. The maximum number of individuals selected
imply that the level of training and skill development is for interview in each company was approximately 50. As
higher (or lower) in that company than among company the authors acknowledge, ‘the fact that this work was
employees as a whole. There may be a higher (or lower) done within a thirty-mile radius around Pittsburgh will
level of skill required in order to do the jobs that the com- inevitably raise questions about the degree to which the
pany requires its employees to do; there may be more (or findings are applicable in other areas of the country’
less) money in the company’s training budget; there may (1959: 31). The findings may also reflect the values of
be more (or less) of a culture of learning at this company; high individualism, self-interest, and high masculinity
or the company may recruit a higher (or lower) propor- that have been identified as characteristic of American
tion of employees who are already skilled. There may be culture (Hofstede 1984). This is part of the reason that
many other factors too. there have been so many attempts to replicate the study
Similarly, we should be cautious of overgeneralizing in on other occupational groups and in other localities,
terms of locality. A frequent criticism made of research including different cultures and nationalities.
on employee motivation relates to the limited extent to However, there could even be a further limit to gen-
which it can be assumed to be generalizable beyond the eralization that is implied by the Herzberg et al. sample.
confines of the national or regional culture on which The main study was conducted in the late 1950s. One
the study is based. For example, Herzberg et al. (1959) issue that is rarely discussed in this context, and that
conducted semi-structured interviews with 203 accoun- is almost impossible to assess, is whether or not there
tants and engineers in the Pittsburgh area in the USA. is a time limit on the findings that are generated. Quite
Most of the companies that constituted sites for the study aside from the fact that we need to appreciate that
202 9 Sampling in quantitative research
the findings cannot (or at least should not) be gener- twenty-first century. Quite aside from changes that
alized beyond the Pittsburgh area, is there a point at might have occurred naturally, the erosion and virtual
which we have to say, ‘Well, those findings applied to dismantling of the student grant system in the UK and
the Pittsburgh area then, but things have changed and the introduction of tuition fees have changed the ways
we can no longer assume that they apply to that or any students finance their education, including perhaps
other locality’? We are, after all, used to assuming that a greater reliance on part-time work (Lucas 1997), a
changed circumstances will prompt changed behaviour greater reliance on parents, and use of loans. But, even
and attitudes. To take a simple example: no one would when there is no definable or recognizable source of rel-
be prepared to assume that the findings of a study in evant change of this kind, there is nonetheless the pos-
1980 of UK university students’ budgeting and personal sibility (or even likelihood) that findings are temporally
finance habits would apply to students in the early specific.
a combination of both was more effective than text of the way through, resulting in unusable partially com-
messages alone. pleted questionnaires in most cases. Interestingly, they
2. As with postal questionnaire surveys, follow up non- also found that respondents were most likely to abandon
respondents at least once. their questionnaires part of the way through when in
the middle of completing a series of open questions. The
3. There is evidence that incentives can increase re-
implications of this finding reflect the advice you will
sponse rates in online surveys (Pedersen and Nielsen
encounter in Chapter 11: that it is probably best to ask
2016).
as few open questions in self-completion questionnaires
The case for the first of these two strategies in boost- as possible.
ing response rates is not entirely clear (Sheehan 2001), Further evidence suggests that having a progress indi-
but seems to be generally advisable. However, as previ- cator in an online survey can reduce the number of people
ously noted, with many online surveys it is impossible to who abandon the questionnaire part of the way through
calculate a response rate, since, when participants are completion (Couper et al. 2001). A progress indicator
recruited through invitations and postings on discussion is usually a diagrammatic representation of how far the
boards, etc., the size of the population of which they are respondent has progressed through the questionnaire at
a sample is almost impossible to determine. any particular point. Couper et al. also found that it took
Crawford et al. (2001) report the results of a survey less time for respondents to complete related items (for
of students at the University of Michigan that experi- example, a series of Likert items) when they appeared
mented with a number of possible influences on the on a screen together than when they appeared singly.
response rate. Students in the sample were initially Respondents also seemed less inclined to omit related
sent an email inviting them to visit the website, which questions when they appeared together on a screen
allowed access, via a password, to the questionnaire. rather than singly.
Some of those emailed were led to expect that the ques- However, it is important not be too sanguine about
tionnaire would take 8–10 minutes to complete (in some of these findings. One difficulty with them is that
fact, it would take considerably longer); others were the samples derive from populations whose members are
led to expect that it would take 20 minutes. As might not as different from one another as would almost cer-
be expected, those led to believe it would take longer tainly be found in samples deriving from general popu-
were less likely to accept the invitation, resulting in a lations. Another is that it must not be forgotten that, as
lower response rate for this group. However, Crawford previously noted, access to the internet is still not univer-
et al. also found that those respondents who were led to sal, and there is evidence that those with internet access
believe that the questionnaire would take only 8–10 min- differ from those without, both in terms of personal char-
utes were more likely to give up on the questionnaire part acteristics and attitudinally.
!
KEY POINTS
● Probability sampling is a mechanism for reducing bias in the selection of samples.
● Ensure you become familiar with key technical terms in the literature on sampling, including
representative sample; random sample; non-response; population; sampling error; etc.
● Randomly selected samples are important because they permit generalizations to the population and
because they have certain known qualities.
● Sampling error decreases as sample size increases.
● Quota samples can provide reasonable alternatives to random samples, but they suffer from some
deficiencies.
● Convenience samples may provide interesting data, but it is crucial to be aware of their limitations in
terms of generalizability.
● Sampling and sampling-related error are just two sources of error in social survey research.
● Online surveys pose specific challenges in terms of sampling.
Questions for review 205
Sampling error
● What is the significance of sampling error for achieving a representative sample?
Sample size
● What factors would you consider in deciding how large your sample should be when devising a
probability sample?
● What is non-response and why is it important to the question of whether or not you will end up with a
representative sample?
Limits to generalization
● ‘The problem of generalization to a population is not just to do with the matter of getting a
representative sample.’ Discuss.