Matching Methods
Matching Methods
to make them
qsmile
cost saving
treatment
q effect
balanced
Mattea atop
Implementing it
given
Analysis q given
0 Before Class
SUTVA: Stable Unit Treatment Value Assumption. Outcome of one individual is not a ected by
treatment assignment of another individual.
Poorly treatments us
defined
Effects of causes us
Sutor
Yi Jill Yi o
ti Y 1 8 Troi'D
Improving water quality and its impact on hospitilization. SUTVA violation because
SUTVA isnt the worst thing. Just a better design. Explicitly incorporate that into your research design.
Intro to Matching:
Example: What is the e ect of a windmill on farmland values. Nick Pates, Mark.
Matching Estimator usually makes sense when there are a lot more potential control units/observations
than treatment units/observations.
You only want the controls that are similar to the treatment cases.
List all the covariates and their means. In RCTs you want them both to be similar to one another.
With pre outcome analysis, you can select a sample and construct a balance table. And just look if
both look similar.
RCTs assess balance in covariate means between treatment and control units/groups
Matching estimators require us to assess likeness of covariate distribution between treatment and
control.
xx
k
Same mean but di erent distribution.
k
Distance Metric: Euclidean Distance. Pick the variable, say age, pick the groups that are closest in age.
Mahalanobis Distance: Takes into account weights which down-weights potential control units when
distance is farther in one or more dimensions.
Control
treats
if
Protector
low probably I
being in treetet
Bad
potential cartons
mrs
Homework: bysort miss: sum names of variables that you want to summarize
For variables you observe, compare the characteristics of missing values vs non-missing values.
Just generate a summary stats table for which you have observations on all the participants. Compare
the characteristics.
*** Approach for assessing a paper/ writing a referee report: Prepare and structure
• Provide a summary
• Note to self:
• Presentation of results
• Writing quality
Dataset size?
ATE, ATOT.
Dont start a sub section with a table. Always start with text before a table.
If two samples are not the same, then we can assume that they are going to be di erent across
unobservables which we can not control for thereby leading OLS to be biased.
Laxmi's Paper:
Objective
Pre outcome analysis: Highlights that we are trying to approximate an experiment (i.e an RCT)
Unconfoundedness Assumption:
ftp.Ditpzzit Ei
Yi d
regress Di Lot 4 Zi t 2
ataman
D unexplained
m
Assumption:
variation
error term of the second equation is unrelated to the error term of the mDi
rst equation.
if this were not hold, then there is something in treatment group that is making them getting the
Lil Yi
o
treatment.
Dit ti
That something will go to the error term of the rst equation.
Overlap assumption
redid freebut
lil
f treat
fothoot
pulpit
Banc Pins Ei
Di Pot Binet
ftp.niitpiniitpsns
x1, x2, x3
Di
Discussion of LPM:
Di Rot Fini t É
prob Diii
Gedn y
Because LPM generates negative or greater than 1 probabilities, we often prefer to estimate the
propensity score using a logit or probit estimator (both of which arex non-linear estimator).
propensity score is just a predicted probability (can use both linear and non-linear probability models)
Matching: To recover causal e ects. Embedded within the procedure is a predictive objective.
time
Prediction excercise: We don't care about the right hand side of the equation. Or biasedness. All we
care about is do we get predictive values that are close to the true value.
to look at the treatment and control group for all the covariates:
pairwise comparison
I L
t E