IS221_Lecture2
IS221_Lecture2
• There are several basic approaches to research, but the mostly used
are: quantitative approach, qualitative approach and mixed approach
• In quantitative approach, there is the generation of data in
quantitative form which can be subjected to rigorous quantitative
analysis in a formal and rigorous manner.
• Qualitative approach to research is concerned with subjective
assessment of attitudes, opinions and behavior. Research in such a
situation is a function of researcher's insights and impressions.
QUANTITATIVE APPROACH
• Quantitative Designs
Survey research provides a quantitative or numeric description of
trends, attitudes, or opinions of a population by studying a sample of
that population. It includes cross-sectional and longitudinal studies
using questionnaires or structured interviews for data collection—
with the intent of generalizing from a sample to a population
Experimental research seeks to determine if a specific treatment
influences an outcome. The researcher assesses this by providing a
specific treatment to one group and withholding it from another and
then determining how both groups scored on an outcome.
RESEARCH DESIGNS
• Qualitative Designs
Narrative research is a design of inquiry from the humanities in
which the researcher studies the lives of individuals and asks one or
more individuals to provide stories about their lives
Phenomenological research is a design of inquiry coming from
philosophy and psychology in which the researcher describes the
lived experiences of individuals about a phenomenon as described by
participants. This description culminates in the essence of the
experiences for several individuals who have all experienced the
phenomenon
RESEARCH DESIGNS
• Qualitative Designs
Grounded theory is a design of inquiry from sociology in which the researcher
derives a general, abstract theory of a process, action, or interaction grounded in
the views of participants. This process involves using multiple stages of data
collection and the refinement and interrelationship of categories of information
Ethnography is a design of inquiry coming from anthropology and sociology in
which the researcher studies the shared patterns of behaviors, language, and
actions of an intact cultural group in a natural setting over a prolonged period of
time. Data collection often involves observations and interviews.
RESEARCH DESIGNS
• Qualitative Designs
• Case studies are a design of inquiry found in many fields,
especially evaluation, in which the researcher develops an in-
depth analysis of a case, often a program, event, activity,
process, or one or more individuals.
• Cases are bounded by time and activity, and researchers collect
detailed information using a variety of data collection procedures
over a sustained period of time
RESEARCH DESIGNS
• Mixed Methods
• Procedures for expanding mixed methods developed such as follows:
Ways to integrate the quantitative and qualitative data, such as one
database, could be used to check the accuracy (validity) of the other
database.
One database could help explain the other database, and one database
could explore different types of questions than the other database.
One database could lead to better instruments when instruments are not
well-suited for a sample or population.
One database could build on other databases, and one database could
alternate with another database back and forth during a longitudinal study.
RESEARCH DESIGNS
• Mixed Methods
Convergent parallel mixed methods is a form of mixed methods
design in which the researcher converges or merges quantitative
and qualitative data in order to provide a comprehensive analysis of
the research problem. In this design, the investigator typically
collects both forms of data at roughly the same time and then
integrates the information in the interpretation of the overall
results. Contradictions or incongruent findings are explained or
further probed in this design.
RESEARCH DESIGNS
• Mixed Methods
Explanatory sequential mixed methods is one in which the researcher
first conducts quantitative research, analyzes the results and then
builds on the results to explain them in more detail with qualitative
research. It is considered explanatory because the initial quantitative
data results are explained further with the qualitative data
RESEARCH DESIGNS
• Mixed Methods
Exploratory sequential mixed methods is the reverse sequence
from the explanatory sequential design. In the exploratory
sequential approach the researcher first begins with a qualitative
research phase and explores the views of participants. The data are
then analyzed, and the information used to build into a second,
quantitative phase.
RESEARCH DESIGNS
• Mixed Methods
Transformative mixed methods is a design that uses a
theoretical lens drawn from social justice or power as an
overarching perspective within a design that contains
both quantitative and qualitative data. The data in this
form of study could be converged or it could be ordered
sequentially with one building on the other
ALTERNATIVE RESEARCH DESIGNS
HOW TO APPROACH RESEARCH? (…)
• 3. Development of Hypothesis:
Hypothesis may be defined as a tentative statement showing a relationship
among variables under study
Hypotheses are typically used in experimental and quasi- experimental designs
and survey research
Data analysis tests this supposition
Interpretation of data will either disprove (reject) or support (uphold) the
hypothesis
Hypothesis translates the problem statement into a precise, unambiguous
prediction of expected outcomes
RESEARCH PROCESS
• 4. Preparing the research design: the conceptual structure within which research
would be conducted
The function of research design is to provide for the collection of relevant evidence with
minimal expenditure of effort, time and money depending on Research purposes, that is (i)
Exploration, (ii) Description, (iii) Diagnosis, and (iv) Experimentation
The preparation of the research design, appropriate for a particular research problem, involves
usually the consideration of the following:
(i) the means of obtaining the information;
(ii) the availability and skills of the researcher and his staff (if any);
(iii) explanation of the way in which selected means of obtaining information will be organized
and the reasoning leading to the selection;
(iv) the time available for research; and
(v) the cost factor relating to research, i.e., the finance available for the purpose.
RESEARCH PROCESS
• 8. Analysis of data:
• The analysis of data requires a number of closely related operations
such as establishment of categories, the application of these categories
to raw data through coding, tabulation and then drawing statistical
inferences
• In the process of analysis, relationships or differences supporting or
conflicting with original or new hypotheses should be subjected to tests
of significance to determine with what validity data can be said to
indicate any conclusion(s)
RESEARCH PROCESS
• 9. Hypothesis-testing:
• the researcher is in a position to test the hypotheses, if any, he had
formulated earlier.
• Hypothesis-testing will result in either accepting the hypothesis or in
rejecting it
• It uses tests such as Chi square test, t-test, F-test
RESEARCH PROCESS
2. Extraneous Variable
• Extraneous variables are independent variables that are not directly
linked with the study but may influence the dependent variable.
3. Control
• Control is essentially devised to minimize the effects of extraneous
variables. This is an important char acteristic of a good research design
RESEARCH DESIGN
4. Confounded Relationship
• When a dependent variable is affected by the influence of an extraneous
variable, then the relation among the dependent and independent variables is
confused or confounded by an extraneous variable.
5. Research Hypothesis
• If a hypothesized relationship or prediction or an assumption has to be tested
using scientific methods, it is called research hypothesis. A research
hypothesis links an independent variable to a dependent variable. It should
generally contain a dependent and an independent variable.
RESEARCH DESIGN
• 9. Experiment
• The process involving checking the validity of a hypothesis statement
of a research problem is called an experiment
• 10 Experimental Units
• Prespecified plots or blocks, where various treatments are used, are
called experimental units.
• Experimental units need to be defined very carefully
RESEARCH DESIGN
70
Objectives
71
Sampling
Why sample?
72
Why sample?
Cost in terms of money, time and manpower
Accessibility
Utility e.g. to do diagnostic laboratory test you
don’t draw the whole of patient’s blood.
A census is a sample consisting of the entire population.
Even though a census is not full proof, it gives detailed
information about every small area of the population.
It has the following disadvantages:
Expensive
Takes a long time
Cumbersome & therefore inaccurately done ( a careful sample
produces a more accurate data than a census.)
73
Sampling…..
Sampling is the process of selecting a representative sample
from populations.
It Selecting cases (elements)—or locating people (or other units of analysis)
—from a target population in order to study the population.
sampling
Sample
Inference
Population
74
Cont’d
The process of obtaining information from a subset (sample) of a larger
group (population)
The results for the sample are then used to make estimates of the larger
group
Faster and cheaper than asking the entire population
Two keys
1. Selecting the right people
Have to be selected scientifically so that they are representative of the population
2. Selecting the right number of the right people
To minimize sampling errors I.e. choosing the wrong people by chance
75
Population Vs. Sample
Population of Interest
Population Sample
Sample
Parameter
Statistic
76
Characteristics of Good Samples
o Representation
Sample surveys are almost never conducted for the
purposes of describing the particular sample under
study. Rather they are conducted for purposes of
understanding the larger population from which the
sample was initially selected
A great deal of work has been done over the years in
developing sampling methods that provide
representative samples for the general population.
E.g. international survey programs such as the DHS series, EPI
coverage surveys have perfected the art of household sampling.
77
Characteristics of Good Samples
79
Basic term cont’d….
80
Basic Terms cont’d…
81
Basic Terms cont’d…
82
Basic term cont’d….
84
Hierarchy of sampling Study subjects
The actual
participants in
the study
Sample
Subjects who are
selected
Sampling Frame
The list of potential subjects
from which the sample is
drawn
Source population
The Population from whom the study
subjects would be obtained
Target population 85
Sampling or Random
Errors
Non-sampling or
systematic
86
1. Sampling error
87
Sampling error cont’d…
88
Sampling error cont’d…
89
The cause of sampling error
91
Non-sampling Error……
o The basic types of non-sampling error
Non-response error
Response or data error
o A non-response error occurs when units selected as part of the
sampling procedure do not respond in whole or in part
If non-respondents are not different from those that did
respond, there is no non-response error
When non-respondents constitute a significant proportion of
the sample (about 15% or more
92
Non-sampling Error…….
o A response or data error is any systematic bias
that occurs during data collection, analysis or
interpretation
Respondent error (e.g., lying, forgetting, etc.)
Interviewer bias
Recording errors
Poorly designed questionnaires
93
Non-Sampling Error cont’d …
Random error can distort the results in any given direction but
tend to balance out on average
Thus, the total survey error
94
Advantage of sampling
95
Disadvantage of Sampling
If the population is very large and there are many sections and
subsections, the sampling procedure becomes very complicated
96
Characteristics Of A Good Sample Design
From what has been stated above, we can list down the characteristics
of a good sample design as:
Sample design must result in a truly representative sample.
Sample design must be such which results in a small sampling
error.
Sample design must be viable in the context of funds available for
the research study.
Sample design must be such so that systematic bias can be
controlled in a better way.
Sample should be such that the results of the sample study can be
applied, in general, for the universe with a reasonable level of
confidence.
97
Types of Sampling
98
Types of Sampling Methods
Sampling Method
Convenience
Multistage Random
Sampling
99
Probability Sampling Method …
In probability sampling
A sampling frame exists or can be compiled.
should have an equal or at least a known or nonzero chance
of being included in the sample.
Generalization is possible (from sample to population)
Simple Random Sampling,
Systematic Sampling,
Stratified Random Sampling,
Cluster Sampling
Multistage Sampling.
101
1. Simple Random Sampling(SRS)
102
Simple Random Sampling cont’d …
103
Simple Random Sampling cont’d …
Lottery method is appropriate if the total population is not too
large, otherwise if the population is too large then it will be very
difficult to use lottery method.
Thus, table of random number or computer generated random
number is the feasible method to be used.
Sampling schemes may be
o without replacement- no element can be selected more than once in the
same sample, N possible samples.
n
o with replacement- an element may appear multiple times in the one sample
possible samples.n
N
104
Example
ti on is
pop ul a
ou g h t h e
m a y not be
Th
g e n e o u s, there
homo f r ame .
avail a bl e
so
what?
105
2. Systematic Random Sampling
106
Steps in systematic sampling:
107
E.g. systematic sampling
• N = 1200, and n = 60
sampling fraction = 1200/60 = 20
• List persons from 1 to 1200
• Randomly select a number between 1 and 20
(e.g. 8)
• 1st person selected = the 8th on the list
• 2nd person = 8 + 20 = 28th list e.t.c.
108
Systematic sampling….
109
Though the frame available, the population may
not be homogeneous, so what?
110
3. Stratified Random Sampling
So, you divide your sample into male and female members and
randomly select the required sample size within each subgroup
(or "stratum")
112
Steps involve in stratified sampling method:
Define the population
Determine the desired sample size
Identify the variable and subgroups (strata) for which you want to
guarantee appropriate representation (either proportional or equal)
Then the total sample size will be the sum of all samples from each
subgroup. 113
There are two methods to get the study subject from each subgroup,
proportional allocation or
equal allocation.
We use proportional allocation technique when our subgroups vary dramatically in size
in our population
Let N be total population and N1, N2 . . . . Nk be the subtotal population for strata 1, 2,
…. K respectively. Moreover let n be the total sample size and n1, n2…..nk be th
subsample for strata 1, 2…..k respectively in which N = N1 + N2 +….. …+ NK
and n = n1 + n2 + …………..+ nk
Then the subsample “ni “which will be selected from subgroup Ni can be computed by
n Ni
ni where i 1, 2, 3........k
N
114
The higher the population in the subgroup, the higher the
sample size will be.
115
Advantage of stratified sampling over simple random sampling
DEMERIT
Sampling frame for the entire population has to be prepared
separately for each stratum.
116
4. Cluster Random Sampling
117
Steps in cluster sampling are:
119
5. Multistage Random Sampling
120
Non-Probability Sampling Method
121
Cont’d……….
Advantages
Cheaper and faster than probability
Reasonably representative if collected in a thorough manner
122
1. Judgment Sampling/ Purposive sampling
The researcher selects the sample based on judgment. A researcher exerts some effort in
selecting a sample that seems to be most appropriate for the study.
This is used primarily when there is a limited number of people that have
expertise in the area being researched
123
2. Convenience Sampling
124
Cont’d………..
125
3. Quota sampling
126
Cont’d
127
Cont’d
In quota sampling the selection of the sample is non-random.
128
4. Snowball sampling
129
Cont’d
While this technique can dramatically lower search costs,
it comes at the expense of introducing bias because the
technique itself reduces the likelihood that the sample will
represent a good cross section from the population.
130
Sample Size Determination
The answer will depend on the aims, nature and scope of the
study and on the expected result. All of which should be
carefully considered at the planning stage.
131
Sample……
o If sample (“n”) is
Large
Increase accuracy
Costy / complex
Take
Optimum
Small sample
o Decrease accuracy
o Less costy
How ?
132
Factors to determine sample size
Size of population
Resources – subjects, financial, manpower
Method of Sampling- random, stratified
Degree of difference to be detected
Variability (S.D.) – pilot study, historical
Degree of Accuracy (or errors)
- Type I error (alpha) p<0.05
- Type II error (beta) less than 0.2 (20%)
- Power of the test : more than 0.8 (80%)
Statistical Formulae
Dropout rate, non-compliance to Rx
133
o Sample size determination depending on outcome variables.
134
The third category covers continuous response variables
such as birth weight, age at first marriage, blood
pressure and cerium uric acid level, for which
numerical measurement are usually made.
135
Sample Size………...
136
Sample for Single population
To estimate sample size for single survey using simple or systematic
random sampling, need to know:
oEstimate of the prevalence of the outcome
o Precision desired
o Design effect
o Size of total population
oLevel of confidence (always use 95%)
137
Sample size for single population mean
138
Maximum acceptable difference (w): This is the maximum
amount of error that you are willing to accept.
Desired confidence level (Z/2 ) : is your level of certainty that
the sample mean does not differ from the true population mean
by more than the maximum acceptable difference. Commonly
we use a 95% confidence level.
Then the sample size determination formula for single
population mean is defined by:
z22 2
n
w2
139
Sample size for single population mean cont’d…
Where
α= The level of significance which can be obtain as 1-
confidence level.
σ=Standard deviation of the population
w= Maximum acceptable difference
z α/2 = The value under standard normal table for the
given value of confidence level
140
Sample Size for Single Population Proportion
141
Then the formula for the sample size of single population proportion is
defined as:
z22 * p (1 p )
n 2
w
142
Example 1
One of MPH student want to conduct a research on the prevalence of ANC utilization
of mothers in DABAT district. Given that the prevalence from the previous study found
to be 45.7% , what will be the sample size he should take to address his objective?
Solution:
Margin of error d= 5%
A confidence level of 95% will give the value of as Zα/2=1.96.
Then using the formula :
2 2
Z P (1 P ) Z
0.05 0.457 (1 0.457 )
n 2
2
2
W 0.05 2
1.96 0.457(0.543)
2
0.05 2
382
143
Some Considerations
The final sample size will be corrected for
Nonresponse, lost to follow up, lack of compliance and so on
Consider the total size of the population (N): if N <10000 then we need correction the formula
which is defined by
no
nf
n
1 o
N
Where nf = final sample size, no = sample size from the above formula and N total
population.
Take the design effect in to account if needed
144
Incorrect sample size will lead to
o Wrong conclusions
o Poor quality research (Errors)
o Type II error can be minimized by increasing the sample size
o Waste of resources
o Loss of money
o Ethical problems
o Delay in completion
145
Example 2
146
147