Metrics - WT - 2023-24 - Unit13 - Problems+tricks With IV
Metrics - WT - 2023-24 - Unit13 - Problems+tricks With IV
ρ(z, u) σu
= β1 + ·
ρ(z, x) σx
So we should:
I Prefer IV over OLS if Corr (z, u)/Corr (z, x) < Corr (x, u)
I But in the other case, IV makes thing even worse!
I This is for instance the case when the Corr (z, x) is very small, i.e. when the
instruments are “poor”
The variance of the IV estimator is always larger than the variance of the OLS
estimator
By how much depends on the partial R 2 of the first stage regression and thus on
the predictive power of the instrument
IV more biased than OLS when:
Bias(β̂IV ) ρ(z, u)
= >1
Bias(β̂OLS ) ρ(z, x)ρ(x, u)
The key element to understand the “quality” of our IV procedure is the relative bias
2. Instrument Exogeneity
z }| {
ρ(z, u)
ρ(z, x) ρ(x, u)
| {z } | {z }
3. Instrument Power 1. Endogeneity
where
0
1 endogeneity: plim XNu = ρ(x, u) 6= 0
0
2 instrument exogeneity:plim ZNu = ρ(z, u) = 0
0
3 instrument power: plim XNZ = ρ(z, x) = large
Bias(β̂IV ) 1
∼
Bias(β̂OLS ) F
Bias(β̂IV )
H0 : > k%
Bias(β̂OLS )
1 Compute the F for the excluded instruments in the 1st stage regression
2 Check whether F > critical value (computed by Stock & Watson). Critical
values depend on the number of instruments and the number of endogenous
variables
Rule of thumb: no weak instruments if F > 11 (sometimes one sees F > 10)
Use and check intuition: a picture of Orvieto, the capital of the Etruscans! Do whatever you can to
persuade reader your instruments are good
Sometimes we face the issue of whether to add instruments. There is a trade off:
I An extra instrument may help identification because adds an extra source of
variation
I But if extra instrument weak, it may worsen the bias of IV!
The idea of the Hausman test is therefore to see if the estimates from OLS and IV
are similar. If they are different, we conclude that the variable(s) that we
instrumented for must be endogenous
Where is the problem with this test?
I We still have to assume that the instrument are exogenous, i.e. that
E [u|Z] = 0
I If this is not true (i.e. the instruments are invalid), IV is also not consistent
Hence, the Hausman test cannot differentiate between these two failures: the test
of the assumption about the exogeneity of X is conditional on the instruments
being truly exogenous
While it is a good idea to see if IV and OLS have different implications, it is easier
to use a regression “test” for endogeneity
If x2 is endogenous, then η2 (residuals from the reduced form equation) and u1
from the structural model will be correlated
I save the residuals from the first stage
I include the residuals in the structural equation (which of course has x2 in it)
and estimate it with OLS
I if the coefficient on the (predicted) residuals η̂2 is statistically different from
zero, reject the null of exogeneity
I we may want to use a heteroskedasticity-robust t-test
I if there are multiple endogenous variables, jointly test the residuals from each
first stage
Suppose you just have one endogenous regressor and two instruments Z1 and Z2
We can estimate the model using only Z1 , compute the residuals ^
u(Z1 ) and then
test whether Z2 0^
u/N = 0
We could do also the opposite to test wether Z1 is uncorrelated with the error term
In practice we test each instrument against the residuals that we obtain in the 2SLS
estimator using X̂ as instrument
If each instrument is valid then they should all give a consistent estimate of β
Hence all should converge to the same estimate
Problem: if the null that instruments are valid is rejected how should we interpret
it?
I It could be that instruments are invalid
I But it could also be that the model is incorrectly specified because instruments
belong to the second stage
The test statistic that we use (Sargan J statistic) is
(y − Xβ̂ IV )0 PZ (y − Xβ̂ IV )
J= ∼ χ2q−k
σû2
This is the minimized value of the objective function in the IV divided by the
variance of the residual
High values reject the null of validity