ch06ans
ch06ans
5TH EDITION
ANSWERS TO ODD-NUMBERED
EXERCISES IN CHAPTER 6
1
Chapter 6, Exercise Answers, Principles of Econometrics, 5e 2
EXERCISE 6.1
SSE 2132.65
(a) R2 1 1 0.5501
SST 4740.11595
( SST SSE ) ( K 1) (4740.11595 2132.65) 2
(b) F 28.73
SSE ( N K ) 2132.65 (50 3)
At 0.01 , the critical value is F(0.99, 2, 47) 5.09 . We reject H 0 . There is evidence from
the data to suggest that 2 0 and/or 3 0 .
The critical value for significance level 0 .0 1 is F(0.99,2,45) 5.11 . Since the calculated
F is greater than the critical F we reject H 0 and conclude that the model is misspecified.
SSE 401.179
(d) R2 1 1 0.9154
SST 4740.11595
F
SSER SSEU J (401.179 388.684) 2
0.707
(e)
SSEU ( N K ) 388.684 (50 6)
The critical value for significance level 0.05 is F(0.95,2,44) 3.209 . Since the
calculated F is less than the critical F we fail to reject H 0 . There is no evidence from
RESET to suggest the model is misspecified. Including zi2 appears to have remedied the
initial problem.
EXERCISE 6.3
(a) From the definition of wi , we have
b2 wi yi y wi yi
wi 1 2 xi 3 zi ei
1 wi 2 wi xi 3 wi zi wi ei
2 wi xi x 3 wi zi wi ei
2 3 wi zi wi ei
wi xi x xi x xi x 1.
2 2
with the last equality holding because
1
N
xi x zi z
2 3
1
xi x
2
N
x, z
cov
2 3
x
var
wi2 var ei | X
xi x
2
2 2
2
xi x N var(
x)
2 2
xi x 2
(d) From equation (5.13),
2
var b2 | X
1 r x x
2
xz i
2
var b2 | X
Because 1 r 2
1, var b2 | X
xz
1 r 2
xz
and hence var b2* | X var b2 | X . Although b2 suffers from omitted variable bias, it
does have a lower variance than b2 .
EXERCISE 6.5
The following assumptions are sufficient
E EXPER | EDUC , AGE E EXPER | AGE 1 2 AGE 3 AGE 2
E e | EDUC , AGE 0
EXERCISE 6.7
0 76.974 and
The point and interval predictions for SALES from Example 6.15 are SALES
(67.533, 86.415), respectively.
The point estimate for E ( SALES | PRICE 6, ADVERT 1.9) is
The standard error is se E ( SALES ) 0.84215 0.9177 , leading to the following interval estimate for
expected sales
76.974 1.9939 0.9177 75.144, 78.804
This interval is much narrower than the one for the prediction interval in Example 6.15. Because it
is concerned with estimating average sales from all cities rather than predicting sales for a specific
city, the error variance for an observation from a specific city is not included.
EXERCISE 6.9
Consider, for example, the model
y 1 2 x 3 z e
EXERCISE 6.11
There are several ways in which the restrictions can be substituted into the model, with each one
resulting in a different restricted model. We have chosen to substitute out 1 and 3 , leading to
the model
EXERCISE 6.13
(a) Let 2 the coefficient of GUNRATE in the firearm suicide equation. To answer the
question is there evidence that the gun buyback has reduced firearm suicides, we test
H 0 : 2 0 against the alternative H1 : 2 0 . The t-value for this test is
t 0.223 0.069 3.232 . The 1% critical value is t(0.01, 195) 2.346 . Because
3.232 2.346 , we reject H 0 : 2 0 at 10%, 5% and 1% significance levels. We
conclude that the gun buyback has been successful in reducing the number of firearm
suicides.
Let 2 the coefficient of GUNRATE in the non-firearm suicide equation. To answer the
question has there been substitution away from firearms to other means of suicide, we test
H 0 : 2 0 against the alternative H1 : 2 0 . The t-value for this test is
t 0.553 0.144 3.840 . The 1% critical value is t(0.99, 195) 2.346 . Because
3.840 2.346 , we reject H 0 : 2 0 at 10%, 5% and 1% significance levels. Although the
gun buyback has been successful in reducing the number of firearm suicides, it has led to
an increase in the number of non-firearm suicides. There has been substitution away from
firearms to other means of suicide.
Let 5 the coefficient of YEAR in the equation for the overall suicide rate. To answer the
question is there a separate trend in the suicide rate, we test H 0 : 5 0 against the
alternative H1 : 5 0 . The t-value for this test is t 0.056 0.393 0.142 . The 10%
critical values are t(0.95, 195) 1.653 . Because 1.653 0.142 1.653 , we fail to reject
H 0 : 5 0 at 10%, 5% and 1% significance levels. There is no evidence to suggest there
is a separate overall trend in the suicide rate.
Tests for the coefficients of YEAR in the other two equations suggest there has been a
decreasing trend in firearm suicides and an increasing trend in non-firearm suicides, not
related to the gun buyback, but there is no evidence of any overall trend in the suicide rate.
(b) Let 3 the coefficient of URATE in the equation for the overall suicide rate. To answer
the question is there evidence that greater unemployment increases the suicide rate, we test
H 0 : 3 0 against the alternative H1 : 3 0 . The t-value for this test is
t 1.147 1.206 0.951 . The 10% critical value is t(0.90, 195) 1.286 . Because
0.951 1.286 , we fail to reject H 0 : 3 0 at 10%, 5% and 1% significance levels. There
is no evidence to suggest that greater unemployment increases the suicide rate.
A similar conclusion is reached from the firearm equation. In the non-firearm equation,
testing H 0 : 3 0 against the alternative H1 : 3 0 , t 1.710 ; we reject H 0 : 3 0
at a 5% significance level, but not at a 1% significance level. There is some evidence that
greater unemployment increases the non-firearm suicide rate.
F
SSER SSEU J
50641 29745 2
68.494
SSEU ( N K ) 29745 (200 5)
F
SSER SSEU J
131097 129122 2
1.491
SSEU ( N K ) 129122 (200 5)
F
SSER SSEU J
175562 151890 2
15.195
SSEU ( N K ) 151890 (200 5)
The 1%, 5% and 10% critical F-values are, respectively, F(0.99, 2, 195) 4.716 ,
F(0.95, 2, 195) 3.042 , and F(0.90, 2, 195) 2.330 . All three significance levels lead to the same
conclusion. URATE and CITY contribute to the firearm equation and the overall equation
(15.195 and 68.494 are greater than 4.716), but not to the non-firearm equation (1.491 is
less than 2.330).
EXERCISE 6.15
(a) We test the null hypothesis H 0 : 3 0, 4 0 against the alternative that one of the
coefficients is not zero. The F-test statistic is
SSER SSEU J 1513.6 1358.7 2
F 2.74
SSEU N K 1358.7 52 4
The 10% critical value is 2.417. We reject the null hypothesis that both coefficients are zero
at the 10% level of significance and conclude that at least one of the competing brands prices
should be included in the equation.
(b) When PRB and PRC are omitted from the equation, the coefficient of PRA will change by
3.63944 . The intercept will change by 17.7507 .
(c) Let m E (CANS | PRA 0.91, PRB 0.91, PRC 0.90) , then mˆ 3.4392
We estimate that at the given prices the sales are between zero and 14,598 cans. This interval
is very much wider than the interval for the average number of cans sold.
(f) 2 is
The RESET statistic to test the significance of CANS
RESET
1358.7 1198.9 6.2645759
1198.9 47
For degrees of freedom 1, 47 the 10%, 5% and 1% critical values are 2.8154381,
4.0470999, 7.2068389, respectively. Thus, we reject the null hypothesis of no specification
error at 10% and 5%, but not at 1%.
EXERCISE 6.17
Values for the various model selection criteria for each of the equations are reported in the table
below. All 4 criteria suggest that all the new models with BATHS and BEDROOMS included are
an improvement over the best model in Example 6.16. From the set of new models, no.11, with the
extra variables BATHS and BEDROOMS SQFT , is favored by R 2 , AIC and SC, while the
RMSE for the predictions in the hold-out sample favors model no.10, with
BATHS and BEDROOMS .
EXERCISE 6.19
(a) The change in E (WFOOD | TOTEXP, NK ) from adding an extra child when
TOTEXP 90 is
1 2 ln(90) 3 NK 1 4 NK 1 ln(90)
(b) The change in E (WFOOD | TOTEXP, NK ) from an increase in total expenditure from
£80/week to £120/week when NK 2 is
1 2 ln(120) 23 4 2ln(120)
2 ln 1.5 2 4 ln 1.5
(d) For H 0(1) , the relationship between the F and 2 test values is F = 2 . The relationship
between the critical values is F(1 ,1,846) (12 ,1) . The relationship between the t and F tests
for H 0(1) is F t 2 for the test values and F(1 ,1,846) t(12 2,846) for the critical values.
For H 0(2) , the relationship between the F and 2 test values is F = 2 2 . The relationship
between the critical values is F(1 , 2,846) (12 , 2) 2 .
For H 0(3) , the relationship between the F and 2 test values is F = 2 3 . The relationship
between the critical values is F(1 ,3,846) (12 ,3) 3 .
Number of observations F 2 F 2 F 2
(f) With the exception of going from 100 to 400 observations for H 0(1) , increasing the number
of observations decreases the level of significance at which the null hypothesis is rejected.
This finding is in line with a general result that increasing the number of observations
increases the power of a test. Adding more hypotheses does not necessarily increase nor
decrease the level of significance at which a null hypothesis is rejected. With 400
observations and a 10% significance level we reject H 0(2) , but after adding an extra
condition for testing H 0(3) the null hypothesis is no longer rejected at this significance
level. There are no dramatic differences between the F-test outcomes and the 2 -test
outcomes.
EXERCISE 6.21
(a) (i) A 95% interval estimate for the saturation level is given by
ˆ ˆ t(0.975, 43) se ˆ ˆ 1.37767 0.05722 2.0167 2.4777 (19.08, 29.07)
ˆ ˆ
t(0.975, 43) se
1 exp ˆ
1 exp ˆ
0.81438
2.0167 0.00671869 (0.1505, 0.1776)
1 exp(1.37767)
(iv) The predicted shares up to 2050 are given in Table 6.21. They are plotted, along with
their 95% bounds, in Figure 6.21. The predictions are approaching the saturation level
very slowly. From 2040 to 2050, they increase only a small amount, from 0.7624 to
0.7842, when the saturation level is 0.8144. The 95% bounds become wider the further
into the future we are forecasting, reflecting the extra uncertainty from doing so.
Figure 6.21 Predicted share of electric arc furnace technology and 95% interval bounds
(b) The saturation level is given by and the point of inflection is given by t I .
Having 25 is equivalent to 25 0 . Thus, we set up the hypotheses
H 0 : 0.85, 25 0
H1 : at least one of the equalities in H 0 does not hold
Values of the F(2, 43) and (2)
2
test statistics for testing this hypothesis are F 4.460 (p-
value = 0.0174) and 2 8.92 (p-value = 0.0116). Since both p-values are less than 0.05,
both test statistics lead to the same conclusion: we reject H 0 .
Since 0.85 and tI 25 both lie within the respective 95% interval estimates for
and t I , it is perhaps surprising that the joint hypothesis H 0 : 0.85, tI 25 is rejected.
However, the separate interval estimates do not allow for the correlation between the
estimates for and t I . If these estimates are negatively correlated, then it is unlikely that
both and t I could take on these values which occur in the upper regions of their
respective interval estimates.
(c) When the restrictions 0.85 and 25 0 are imposed, the model becomes
0.85
yt et
1 exp(25 t )
The F-test value is
F
SSER SSEU J
0.020462 0.017272 2
3.97
SSEU (T K ) 0.017272 43
which has a corresponding p-value of 0.0262. The test outcome is the same. The null
hypothesis is rejected. However, the F-test value is different. The equivalence of the
approaches only holds for equations which are linear in the parameters.
EXERCISE 6.23
(a)
̂ 2.13711 0.43777
̂ 0.77897 0.10978
̂ 0.27247 0.74451
̂ 0.63834 0.09435
(b)
Interval estimates
Parameter Estimate Standard Error
Lower Upper
0.1180 0.05165 0.0153 0.2207
0.7790 0.1098 0.5607 0.9973
1.3745 1.4066 1.4226 4.1716
0.6383 0.0944 0.4507 0.8260
(b) Graphing Eˆ [ln( PRICE ) | SQFT 22, AGE ] against AGE, we obtain
5.1
5.0
4.9
log_PRICE
4.8
4.7
4.6
0 10 20 30 40 50 60 70 80 90
AGE
(c) Simplifying and combining into the one joint null hypothesis, we have
753 63754 0
H0 :
82 253 8754 0
H 1 : at least one equality in H 0 does not hold
The value of the F-statistic for testing H 0 is F 1.710 with p-value = 0.1814. Also, the
5% critical value is F(0.95, 2, 896) 3.006 . Thus, we fail to reject H 0 . There is no evidence to
suggest the joint hypothesis about expected log prices is incorrect.
(d) Simplifying and combining into one joint null hypothesis, we have.
1004 0
H0 : 3
1 222 503 25004 4.60517
H 1 : at least one equality in H 0 does not hold
The value of the F-statistic for testing H 0 is F 3.109 with p-value = 0.0451. Also, the
5% critical value is F(0.95, 2, 896) 3.006 . Thus, we reject H 0 . The data do not support the
joint hypothesis about expected log prices.
(f) Adding BATHS and SQFT BEDROOMS will have improved the predictive ability of
the model if 5 0 and/or 6 0 . Thus, we test H 0 : 5 0, 6 0 against
H1 :5 0 and/or 6 0 . The calculated F-value for this test is F 37.358 with p-value
= 0.0000. Also, the 5% critical value is F(0.95, 2, 894) 3.006 . Thus, we reject H 0 . Adding
BATHS and SQFT BEDROOMS improves the predictive ability of the model.
(g) Using the “natural predictor”, before adding one extra bedroom and bathroom the value of
the house is estimated to be
BEFORE exp b 23b 2b (3 23)b 165.525
PRICE 1 2 5 6
After adding one extra bedroom and bathroom the value of the house is estimated to be
AFTER exp b 25.6b 3b (4 25.6)b 205.463
PRICE 1 2 5 6
(h) Assuming there is no inflation in house prices, and the same equation is relevant after 20
years, in 20 years’ time the estimated values without and with the extra bedroom and
bathroom are
BEFORE exp b 23b 20b 202 b 2b (3 23)b 127, 466
PRICE 1 2 3 4 5 6
The estimated increase in value is $158,221 $127,466 = $30,755. The extra bedroom and
bathroom has less value in the older house.
EXERCISE 6.27
(a) The least-squares estimated equation using observations where EDUC 7 ( N 1178) is
ln(WAGE ) 1.1391 0.11673EDUC 0.008321EXPER
(se) (0.0869) (0.00547) (0.001059)
The hypotheses for the claim about the marginal effects of education and experience are
H 0 : 2 0.112, 3 0.008 H1 : 2 0.112 or 3 0.008
We reject H 0 if the computed value for the F-statistic, Fcalc , is greater than or equal to
F(0.95, 2, 1175) 3.003 . We find Fcalc 0.389 3.003 , and hence do not reject H 0 . There is
no evidence to dispute the claim made about the marginal effects of experience and
education.
(b) Augmenting the equation with the squares of the predictions, the RESET F-value for testing
the adequacy of the model is
F
SSER SSEU 1 261.8566 260.4315 6.424
SSEU 1174 260.4315 1174
Since 6.424 F(0.95, 1, 1174) 3.849 (p-value = 0.0114), we conclude that the squares of the
predictions have significant explanatory power and hence the model is inadequate.
Augmenting the equation with the squares and cubes of the predictions, the RESET F-value
for testing the adequacy of the model is
F
SSER SSEU 2
261.8566 259.8446 2
4.541
SSEU 1173 259.8446 1173
Since 4.541 F(0.95, 2, 1173) 3.003 (p-value = 0.0108), we conclude that the squares and
cubes of the predictions have significant explanatory power and hence the model is
inadequate.
The hypotheses and test results for each part of the question are given in the following table.
2 20 4 5 6 0.112
(i) 16.039 3.003 0.0000 Reject H 0
3 10 5 10 6 0.008
2 20 4 5 6 0.112
(ii) 0.580 3.003 0.5599 Fail to reject H 0
3 10 5 10 6 0.008
2 20 4 5 6 0.112
(iii) 13.638 3.003 0.0000 Reject H 0
3 10 5 10 6 0.008
F
SSER SSEU 1 252.7137 252.6801 0.156
SSEU 1171 252.6801 1171
Since 0.156 F(0.95, 1, 1171) 3.849 (p-value = 0.6930), we conclude that the squares of the
predictions do not have significant explanatory power; the test does not suggest the model
is inadequate.
With the squares and cubes of the predictions, the RESET F-value is
F
SSER SSEU 2
252.7137 251.8972 2
1.896
SSEU 1170 251.8972 1170
Since 1.896 F(0.95, 2, 1170) 3.003 (p-value = 0.1506), we conclude that the squares and
cubes of the predictions do not have significant explanatory power; the test does not suggest
the model is inadequate.
(e) The claim is not true for all levels of education and experience. The medians for EDUC and
EXPER are 14 and 23, respectively, values that approximately correspond to those tested in
part (ii). Thus, the claim is reasonable for those at the median levels of education and
experience, but it does not hold for values of EDUC and EXPER which are much greater or
less than the medians.
EXERCISE 6.29
(a) The estimated equation is
ln( SHARE ) 2.9569 0.015074 TAX
(se) (0.0405) (0.001038)
We find that an increase in the marginal tax rate for top earners by one percentage point is
associated with a decrease in the income share of the top earners by 1.5 percent. It is unlikely
we could interpret this estimate as causal. SHARE will depend not only on TAX, but also on
variables related to the general state of the economy. If these variables are correlated with
TAX, which is likely, 2 cannot be given a causal interpretation.
The correlations between TAX and YEAR and TAX and YEAR 2 are 0.630 and 0.463,
respectively. These values are moderately high and sufficient for omission of the trend from
the equation to result in overestimation of the effect of changes in TAX on ln(SHARE). A
regression of TAX on YEAR and YEAR 2 gives R 2 0.751 , a large enough value to explain
the change in magnitude for ̂ 2 .
(c) SHARE will be smallest when ln(SHARE) is smallest. To find that year, we solve the
following equation for YEAR
ln( SHARE ) 3
3 2 4YEAR 0 giving YEARMIN
YEAR 2 4
MIN 1.243246
Its standard error is given by se YEAR
MIN t
YEAR
(0.975, 76) se YEAR MIN 41.27, 46.23
After rounding to the closest year, the value YEAR 43.75 corresponds to the year 1964
while YEAR 41.27 and YEAR 46.23 correspond to the years 1961 and 1966,
respectively. Thus, we estimate that SHARE will be smallest in 1964, but, based on a 95%
interval estimate, the year when it is smallest could be as early as 1961 or as late as 1966.
The smallest actual value for SHARE is 7.74 in 1973, a year which does not fall within the
interval estimate.
with p-value Pr t(76) 1.299 0.1979 . Thus, we fail to reject H 0 at all conventional
significance levels. The data do not contradict the conjecture about the expected log income
share of top earners in the year 2000 for a marginal tax rate of 50%.
The lack of changes can be explained by the low correlations between GWTH and the other
right-hand side variables in the equation. These correlations are given in the following
matrix. They are all less than 0.1.
(g) To obtain an expected log income share of ln(12) in 2001, when GWTH 2001 3 , we require
7.514183 .
Using the delta rule to get its standard error, we obtain se TAX
We estimate that a marginal tax rate of 60.5% would be required. However, the interval
estimate that ranges from 45.6% to 75.5% shows there is a great deal of uncertainty about
this estimate.
EXERCISE 6.31
(a) Estimates for the model CSUMPTN 1 2 HOURS 3GOV 4 R 5 INC e are
given in the following output.
The coefficient of 3 , with p-value 0.2559 and the intercept with a p-value of 0.2257 are
not significantly different from zero at a 5% significance level.
(b) The positive estimates for 2 and 3 suggest that both hours worked and government
consumption are complements for private consumption. Testing H 0 : 2 0, 3 0 against
the alternative that at least one of 2 or 3 is zero yields a test value of F 2.662 .
Comparing this value with the critical value F(0.95, 2, 32) 3.295 , or examining its p-value
of 0.0852, we find we cannot reject H 0 : 2 0, 3 0 at a 5% significance level. Thus, it
is possible that both hours worked and government consumption are neither complements
nor substitutes for private consumption.
(c) Re-estimating the equation CSUMPTN 1 2 HOURS 3GOV 4 R 5 INC e
with GOV omitted yields the following results:
Omitting the variable GOV has led to a decrease in the magnitude of the coefficient estimate
for HOURS, making it no longer significantly different from zero at a 5% level of
significance, and an increase in the magnitude of the coefficient estimate for INC. The
coefficient estimate for R has changed very little.
(d) Estimates for the equation GOV 1 2 HOURS 3 R 4 INC v are
The p-values in this output suggest that GOV is highly correlated with HOURS and INC, but
less so with R. These results explain why omission of GOV from the equation in part (c) has
led to relatively large changes in the coefficient estimates for HOURS and INC, and only a
small change in the coefficient estimate for R. Also, the signs of the coefficients of HOURS
and INC in part (d) explain the direction of the changes. In part (c) the effect on CSUMPTN
of a change in HOURS includes the negative indirect effect of HOURS on GOV. Similarly,
the effect on CSUMPTN of a change in INC includes the positive indirect effect of INC
on GOV. The changes can be reconciled explicitly as follows:
E CSUMPTN | HOURS , R, INC 1 31 2 32 HOURS 4 33 R 5 34 INC
1 2 HOURS 3 R 4 INC
Estimates for 1, 2 , 3 , 4 were given in part (c). We can derive those estimates from
those given in parts (a) and (d):
ˆ 1 b1 b3a1 0.003980 0.164079 0.013171 0.006141
ˆ 2 b2 b3a2 0.424608 0.164079 0.710646 0.308006
ˆ 3 b4 b3a3 0.243435 0.164079 0.100382 0.259906
ˆ 4 b5 b3a4 0.608275 0.164079 0.845469 0.746999
(e) Re-estimating the models in parts (a) and (c) with the year 2007 omitted yields:
(f) The 2007 realized value for consumption growth was CSUMPTN 2007 0.003606 . The
model in part (c) produced the more accurate forecast for 2007.
Casual inspection suggests separate equations are not needed for the three different varieties.
Coefficient estimates for corresponding variables are all similar in magnitude, particularly
when considered relative to their standard errors.
(b) SSEU 5.924196 2.933984 4.006701 12.864881
(c) The total number of coefficients in the 3 equations is 12. If a single equation is estimated
for all 3 varieties, the number of parameters is 4. Thus, if we restrict corresponding
coefficients for all varieties to be equal, there are 12 4 8 restrictions.
(d) Using all the data to estimate one equation gives the following results:
(e) Let the coefficients of the cabernet, pinot and other varieties be denoted by
C1 , C 2 , C 3 , C 4 , P1, P 2 , P3 , P 4 , and O1 , O 2 , O3 , O 4 , respectively. The null and
alternative hypotheses are
H 0 : C1 P1 O1 , C 2 P 2 O 2 , C 3 P 3 O 3 , C 4 P 4 O 4
H1 : At least one of the inequalities in H 0 does not hold.
The value of the test statistic is
F
SSER SSEU J
12.93496 12.864881 8
0.688
SSEU ( N K ) 12.864881 1010
This value is less than the 5% critical value F(0.95, 8, 1010) 1.948 . Thus, the null hypothesis
is not rejected. There is no evidence to suggest that there should be different equations for
different varieties.