Problem Set 9
Problem Set 9
Problem Set #5
Date Due: Thursday, May 21
Y = β0 + β1 X1 + β2 X2 + β3 X3 + U .
E[XU ] = 0
and that X is not perfectly colinear. Suppose further that E[Y 4 ] < ∞ and
E[Xj4 ] < ∞ for 1 ≤ j ≤ 3. Using a large sample of i.i.d. observations from
(Y, X1 , X2 , X3 ), you estimate this equation using OLS. Let β̂ = (β̂0 , . . . , β̂3 ) de-
note the resulting estimate of β = (β0 , . . . , β3 ).
Y = X 0β + U ,
where E[U |X] = 0 (remember that this is a stronger assumption than our usual
assumption that E[XU ] = 0). Suppose Y is binary.
1
4. Download the dataset for this problem from the TA’s webpage. The dataset has
4 variables in the following order: college (an indicator for completing college), hs
(an indicator for completing high school, but not college), wage (average hourly
wage) and fem (an indicator for female). In order to load the data into Stata,
you may need to increase the memory. To do this, type
in Stata. This will increase the memory from the default of 1 mb to 10 mb.
2
To do this regression, you will have to create a new variable that is the sum
of hs and college.
i. How do the coefficients γ0 , . . . , γ3 relate to the coefficients β0 , . . . , β3
defined in part (a)?
ii. Test H0 : γ2 = 0 versus H1 : γ2 > 0 at the 5% significance level. How do
the results of your test compare to what you found in part (v) of part
(a) above? Explain briefly.
(c) Consider the following model of the determinants of income:
wage =
β0 + β1 fem + β2 college + β3 fem × college + β4 hs + β5 fem × hs + U .