Final2017 Solution PDF
Final2017 Solution PDF
Part A:
Answer ALL Questions on the Multiple Choice Answer Sheet.
Each Question is worth 3 marks (30 Marks Total):
yi = β1 + β2 xi2 + · · · + βK xiK + ei
yt = β1 + β2 xt2 + · · · + βK xtK + et ,
et = ρet−1 + vt with − 1 < ρ < 1
where vt ’s are independent random error terms with mean zero and variance σv2 .
Choose the wrong statement.
6. For each observation i = 1, · · · , n, the regressor xi,2 indicates “female,” and xi,3
indicates lipstick users. From the data, it is confirmed that xi,2 and xi,3 are similar, but
not the same. That is, in the sample, there are some women not using a lipstick, and
there are some men using a lipstick. Now, suppose that an econometrician regresses
yi on all the regressors including xi,2 and xi,3 and finds that the standard errors of
the OLS estimates are large. Choose the wrong statement.
7. In order to estimate the wage equation, an econometrician regresses the log of wage
on individual’s observed characteristics including years of schooling. In this problem,
however, the error term contains unobserved characteristics such as motivation, and it
is likely that the error is correlated with the years of schooling, i.e., a highly motivated
person tends to study more and also make more money. Choose the wrong statement.
Economic theory says the firm should increase advertising expenditure to the point
where an extra $1 of expenditure results in an extra $1 of sales (i.e., marginal cost =
marginal revenue). Choose the wrong statement.
E[yt ] = µ < ∞
V ar(yt ) = σ 2 < ∞
Cov(yt , yt−s ) = γs , (covariance depends on s, not t)
where vt are independent random error terms with zero means and variances σv2 .
Choose the wrong statement.
Part B:
Short answer questions – answer all questions in the Answer Booklet.
Marks are as indicated. Total value: 70 marks
1. Data on the weekly sales of a major brand of canned tuna by a supermarket chain in
a large mid-western US city during a mid-1990’s calendar year are contained in the
file tuna.dta. The file contains 52 observations on the following variables
(a) In order to see if this model has omitted variables, we ran the following command,
estat ovtest
and obtain the p value of 0.057. State precisely the null hypothesis associated
with this test (5 marks) and decide whether to reject the null hypothesis using
your own level of the test (5 marks).
(Answer)
H0 : the model has no omitted variables (5 marks)
Since p-value is larger than any conventionally used level of the test such as 1%
or 5%, we do not reject the null hypothesis. (5 marks)
If they say we accept the null hypothesis, take 1 mark off.
If they used 10% level of significance or even larger, they should reject the null
hypothesis. Mark accordingly.
(b) Discuss and interpret the estimates of β3 . Especially, show, step-by-step, that
the marginal effect is given as
∂sal1
= β3 sal2
∂apr2
(5 marks). Then, explain the effect of a one-unit increase in the price of Brand
2 on sales using the Stata outcome. (5 marks).
(Answer)
we can re-write the regression equation as
So,
∂sal1
= exp (β1 + β2 apr1 + β3 apr2 + β4 apr3 + β5 disp + β6 dispad + e) β3 = β3 sal1.
∂apr2
Therefore,
∂sal1
∂apr2
β3 =
sal1
Hence, a unit change in apr2 would lead to approximately, β3 × 100% change in
sal1. The estimates says that a one-unit increase in the price of Brand 2 will
lead to a 115% increase in sales.
(c) What is the estimated percentage increase in sales from a display and no adver-
tisement? (5 marks)
(Answer)
Hence, the estimated percentage increase in sales from a display and no adver-
tisement is
h i
exp(β̂5 ) − 1 ×100% = [exp(0.4237) − 1]×100% = [1.5276 − 1]×100% = 52.76%
where the errors are assumed to be independent N (0, σ 2 ) random variables. The
variable nwiffinc = faminc − wage × hours. The variable lfp is a binary variable
that takes the value 1 if a woman worked and 0 if she did not. The variable exper is
the years of experience.
(d) The following computer log and output are related to a hypothesis testing to
see whether log(wage) is endogenous using exper and exper2 as instrumental
variables. Precisely state the null hypothesis (5 marks), explain the test proce-
dure step-by-step and make a conclusion about the endogeneity using the Stata
outputs (5 marks).
(Answer)
The null hypothesis is that log(wage) is uncorrelated with the error terms, i.e.,
log(wage) is not endogenous (5 marks).
Testing the null hypothesis is equivalent to testing whether the coefficient of
vhat is zero. This is a standard t-test. The p-value is < 0.05 so we reject H0
and conclude that the explanatory variables are correlated with the error term
(i.e., we have an endogeneity problem). (5 marks)
3. The file mexican.dta contains data collected in 2001 from the transactions of 754
Mexican sex workers. There is information on four transactions per worker. The data
are a subset of that used by Gentler, Shah and Bertozzi, ‘Risky Business: The Market
for Unprotected Sex,’ Journal of Political Economics, (2005), 113, 518–550. There are
four transactions per worker. The labels id and trans are used to describe a particular
woman and a particular transaction. There are three categories of variables,
• Sex worker characteristics: age, attractive (indicator variable), school (if Year
12 or higher).
• Transaction characteristics: lnprice (log of the price), nocondom, bar (=1 ini-
tiated at a bar), street (=1 initiated in the street)
We estimated a model with lnprice as the dependent variable using client and trans-
action characteristics as independent variables and obtained the following Stata log.
(a) Test the null hypothesis that all individual has the same intercept at the level of
5 % and interpret your test result (5 marks).
(Answer)
The associated F -statistic and p-value appear at the bottom of the output. Since
the p-value is very small, we reject H0 . Hence, there are fixed effects (for some
individuals).
(b) Explain why sex workers characteristics should be omitted (5 marks).
(Answer)
Sex worker characteristics are omitted because they are time-invariant over the
time in which the 4 transactions took place. Their effect cannot be separated
from the individual effects given by the coefficients of the fixed-effects dummy
variables.
(c) nocondom can be argued to be an indicator of a risk premium (a higher price
might be paid for not wearing a condom). What is the estimate of the risk
premium? (5 marks)
(Answer)
The estimated risk premium for not using a condom is approximately 17% (log-
linear model). The exact estimate is 100(exp(0.170282) − 1)% = 18.6%
(d) Explain briefly about the main message from the Stata log below (5 marks).
(Answer)
Since the p-values are smaller than 0.05, we reject the null hypothesis that the RE
estimates and the FE estimates are equal for each of nocondom, rich, regular,
and alcohol.
End of Examination
Page 14 of 16
Semester One Final Deferred Examinations, 2017 ECON7310 Elements of Econometrics
Some Formulas:
Simple Regression A:
P P P
N i xi yi − i xi i yi
b1 = ȳ − b2 x̄ and b2 = P 2
N i xi − ( i xi )2
P
2
P 2
σ x σ2 −x̄σ 2
V ar(b1 ) = P i i 2 , V ar(b2 ) = P 2
, and Cov(b ,
1 2b ) = P 2
N i (xi − x̄) i (xi − x̄) i (xi − x̄)
ŷi = b1 + b2 xi
1 X 1 X 2
σ̂2 = (yi − ŷi )2 = êi
N −2 N −2
i i
σ̂ 2 i x2i
P q
\
V ar(b1 ) = and se(b1 ) = V\ ar(b1 )
N i (xi − x̄)2
P
σ̂ 2
q
V\ar(b2 ) = P 2
and se(b2 ) = V\
ar(b2 )
i (xi − x̄)
bk − c
bk ± tc se(bk ) and t = for k = 1, 2 and Pr(|N (0, 1)| > 1.96) = 0.05
se(bk )
Simple Regression B:
Multiple Regression A:
Page 15 of 16
Semester One Final Deferred Examinations, 2017 ECON7310 Elements of Econometrics
Multiple Regression B:
SSE 2K
AIC = log +
N N
SSE K log(K)
SIC = log +
N N
Autocorrelation:
Cov(et , et−k )
Corr(et , et−k ) =
V ar(et )
PT
\
Cov(et , et−k ) t=k+1 êt × êt−k
rk = = PT 2
V\ar(et ) t=k+1 êt−k
PT
(êt − êt−1 )2
d = t=2PT ≈ 2(1 − r1 )
2
t=2 êt
Endogeneity:
In the case of the simple regression model y = β1 + β2 x + e and one IV zi , the variance of
the IV estimator is
σ2
V ar(β̂2 ) = 2
.
(xi − x̄)2
P
rzx
Page 16 of 16