Slides_W6C1 Internal and external validity (1)
Slides_W6C1 Internal and external validity (1)
TOGETHER
W I T H YO U R
P RO J E C T
G RO U P S
T O DAY ! ! !
Ways to improve
I N T E R N A L A N D E X T E R N A L VA L I D I T Y
AG E N DA
• Internal validity and control variables
• Randomization and blinding
3
• External validity
• Group activity: analyze a real study
I N T E R N A L VA L I D I T Y
❖ Internal validity is the degree to which we can appropriately conclude that the
changes in X caused the changes in Y. Causality is the state of nature in which an
active change in one variable directly changes the probability distribution of
another variable (Seltman, 2018)
• Association (non-zero correlation) does not necessarily imply causation. 4
Randomization
Have more controlled 5
(including block-
variables
randomization)
Blinding: Single -
Control group
Double - Triple
I N T E R N A L VA L I D I T Y – B E T T E R
C O N T RO L O F VA R I A B L E S
❖ “Falling objects” experiment
• When done in air, more than one
thing is different between the
6
subjects: mass and air resistance
→ air resistant is a confounding
variable → invalid conclusion
that heavier objects fall faster.
• Done in vacuum: remove the
confounding → improve internal
validity.
I N T E R N A L VA L I D I T Y – B E T T E R
C O N T RO L O F VA R I A B L E S
In this report, the phrase “after
controlling for other factors” appears
8 times!
❖ The more controlled variables you
can identify, the better. 7
Food
A 10
Food
B
Population Sample Units
B L O C K R A N D O M I Z AT I O N
• Block randomiz ation is a sub-class of randomization, done when we know or have good
reasons to believe a certain factor has significant impact on the result: separate the
subjects into blocks based on the factor → then randomize within the blocks.
11
Consider
2 blocks
separately
• Too broad (claim that results would be true for a large population beyond the
sample) → risk of over-generalization.
• Depends on sample size and how representative it is of the population
SAMPLE SIZE
❖ Havinga large sample size is the first step in improving external validity. A larger
sample would more likely to represent the population.
• Example: population: FUV students.
• Randomly sample 3 students (i.e. sample size = 3) vs randomly sample 30 vs 300
(more when we learn statistics)
18
❖ So how large is “large enough”? There is no hard-and-fast rules:
• For some specific case such as the FDA drug trial there’re standards.
• Generally speaking, the smaller the population, the larger the sampling ratio needed.
• For populations under 1000, a minimum ratio of 30% is advisable to ensure
representativeness of the sample.
• For larger populations, such as a population of 10,000, a comparatively small
minimum ratio of 10 percent of individuals is required to ensure representativeness of
the sample.
E X T E R N A L VA L I D I T Y
• High internal validity is a requirement for every good experiment, but the requirement for
external validity is a bit more case-by-case, depending on the goal of the study/the standard
practice of that research area (paradigm strikes again!)
• Example:
Study on a new drug’s effect on patients → high Study on how to MAKE that drug through chemical
requirement for external validity due to synthesis → we synthetic chemists normally don’t
heterogenous population & high-stake even think about external validity (reliability is a 19
much more important requirement here)
consequence (human health)
C A S E S O F C AU T I O N S
❖3 cases of low generaliz ation worth talking about more (Seltman, 2018):
• Non-participation
• Drop-out
• Environment and treatment application variability
❖ Non-participation: People who agree to take part in a study might share something in 20
common, which may be a factor influencing the results → sample less representative of
broader population.
• Example 1: A study about effect of coffee, people who
agree to participate might be those who like coffee and
have been consuming it regularly for a long time.
• Example 2: Paid survey about consumer habit, those
who agree might have a certain view on money (that
influences them to spend the time to go through the
survey with that amount of payment)
C A S E S O F C A U T I O N S – D RO P - O U T
❖ Drop-out: people who start the study but did not complete it.
• This reduces the sample size → in general that’s bad for external validity
• The people who drop-out might share some common characters which
affect the randomization, this can lower both external and internal validity.
Example:
• A study on impacts of different genres of music on students’ learning effectiveness: groups 21
of 20 participants each, there’s one group assigned to listen to Heavy metal.
• After 1 week, 15 participants in this group dropped out.
• Sample size much smaller: ¼ of the original → external validity drops
• Those dropped out might be people who can’t stand this genre, leaving 5 Heavy metal
fans in the study group, thus any conclusion from this group is not representative of
broader student body → external validity drops
• Comparison of impacts of different genres on students are not conclusive, for example if
these 5 students out-performed other groups, it might be because they get the genre they
like → internal validity drops.
C A S E S O F C A U T I O N S – E N V I RO N M E N T
A N D T R E AT M E N T A P P L I C AT I O N
❖ Environment and treatment application variability: this is not about the subjects, but
the quality requirements of the environment: the lower these requirement, the more
the research result can be applied to environment other than the one in the study.
❖ Examples:
22
• A new chemical reaction that could be use with many chemicals, but requires very
high pressure;
• A clinical treatment that requires lab-like controlled which is difficult to obtain in a
normal doctor’s office;
• A drug that requires special storage;
• A study on effect of online learning that uses stable, high speed internet connection
…..
AG E N DA
• Internal validity and control
• Randomization and blinding
23
• External validity
• Group activity: analyze a real
study
L E T ’ S A N A LY Z E A
PA P E R