Holtz-Eakin, Newey and Rosen (1988)
Holtz-Eakin, Newey and Rosen (1988)
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact [email protected].
The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica.
http://www.jstor.org
This paper considers estimation and testing of vector autoregression coefficients in panel
data, and applies the techniques to analyze the dynamic relationships between wages and
hours worked in two samples of American males. The model allows for nonstationary
individual effects, and is estimated by applying instrumental variables to the quasi-dif-
ferenced autoregressive equations. Particular attention is paid to specifying lag lengths,
forming convenient test statistics, and testing for the presence of measurement error. The
empirical results suggest the absence of lagged hours in the wage forecasting equation. Our
results also show that lagged hours is important in the hours equation, which is consistent
with alternatives to the simple labor supply model that allow for costly hours adjustment or
preferences that are not time separable.
KEYwORDS:Vector autoregression, panel data, causality tests, labor supply.
1. INTRODUCTION
1This research was supported in part by NSF Grants SES-8419238 and SES-8410249. We are
grateful to Joseph Altonji, the Editors, and three referees for useful comments. Joseph Altonji and
David Card graciously provided us with the data used in the empirical analysis.
2Nevertheless, our techniques are appropriate for more " traditional" macroeconomic applications.
For example, Taylor (1980) examined and compared the time series properties of several key
macroeconomic variables for a number of European countries. Our methods could be used to execute
formal tests of similarity between them.
1371
2. THE MODEL
where the a 's and 8's are the coefficients of the linear projection of yt onto a
constant and past values of Yt and xt, and the lag length m is sufficiently large to
ensure that ut is a white noise error term. While it is not essential that the lag
lengths for y and x are equal, we follow typical practice by assuming that they
are identical.
Consistent estimation of the parameters of equation (2.1) requires many
observations of x and y values. In time series applications these observations
typically are obtained from a record of x and y over a long period of time. In
contrast, panel data usually have a relatively small number of time series
observations. Instead, there often are a great number of cross-sectional units,
with only a few years of data on each unit. To estimate the parameters of
equation (2.1) one must pool data from different units, a procedure which
imposes the constraint that the underlying structure is the same for each
cross-sectional unit.
The constraint that the time series relationship of x and y is the same for each
cross-sectional unit is likely to be violated in practice, so that it is desirable to be
able to relax this restriction. One way to relax the pooling constraint is to allow
for an "individual effect," which translates in practice into an individual specific
intercept in equation (2.1). Changes in the intercept of a stationary vector
autoregression correspond to changes in the means of the variables, so that
allowing for an individual effect allows for individual heterogeneity in the levels
of x and y. A second way to allow for individual heterogeneity is to allow the
variance of the innovation in equation (2.1) to vary with the cross-section unit.
Changes in the innovation variance of a vector autoregression correspond
to changes in the variance of the variables, so that allowing for cross-section
heteroskedasticity in the innovation variance allows for individual heterogeneity
in the variability of x and y. In what follows we allow for both an individual
effect and cross-section heteroskedasticity in the variance of the innovation.
It is likely that the level and variability of the variables are important sources
of individual heterogeneity, but it would also be nice to allow for individual
heterogeneity in the time series correlation pattern of x and y. In this context,
allowing for such heterogeneity is difficult because the variables on the right-hand
3 The asymptotic theory does require that various moments of the data exist. Existence of moments
in models with unit or explosive roots requires an assumption concerning the initial conditions of the
data such as the assumption that the first point in the life of the individual units is a constant.
4 Pakes and Griliches (1984) have considered a similar identification issue in the context of
entire past depends only on the past m observations. In the next section we will
discuss identification, estimation, and inference under this assumption on lag
length, as well as ways of testing restrictionson the parametersof equation (2.2).
3. STATISTICALINFERENCE
A. Identification
The specification of equation (2.2) as a projection equation implies that the
error term u , satisfies the orthogonality condition
Note that it would be valid to use nonlinear functions of lagged values of x and v as instruments
only if equation (2.2) could be interpreted as a conditional expectation rather than a linear projection.
We choose to work with a linear projection specification because specification of the form of the
conditional expectation of y1, given lagged values of x and v is difficult. It seems likely that this
conditional expectation would involve nonlinear functions of lagged values of x and Y.
6 If the first and second moments of the data are the same for different cross-section units, then the
minimum distance methods of MaCurdy (1981a) and Chamberlain(1983) could be used to estimate
the parameters from cross-section moments. In this case it will be easier to identify the parameters,
because the orthogonality conditions (3.1) do not involve all the cross-section moments.
7One common technique is to compute the differencebetween each variable and its time mean (by
cross-section unit) to eliminate the individual effect. See, e.g., Lundberg(1985). In the current context,
this procedure will yield inconsistent estimates, even when the parametersare stationary, because of
the presence of lagged endogenous variables. See Nickell (1981).
where
(3.3) at =aot rtfaOt,
cl,-=rt + a,t,t
=
dit 81t'
Note that in the special case of r,= 1 for each t, then this transformation is
simple differencing of equation (2.2). This has been suggested for use in estima-
tion of univariate autoregressive models in panel data by Anderson and Hsiao
(1982). More generally, this transformation is a quasi-differencing transformation
that has been suggested by Chamberlain (1983). We will proceed by first
discussing identification of the parameters of the transformed equation (3.2), and
then discussing identification of the original parameters of equation (2.2) from
(3.2).
The orthogonality conditions of equation (3.1) imply that the error term of the
transformed equation (3.2) satisfies the orthogonality condition
x See Fisher (1966). A sufficient condition for identification is that in the limit, the cross-product
matrix between the instruments and the right-hand side variables have rank equal to the number of
right-hand side variables.
effect. We will ignore identification of the constant terms (i.e., the a0o,'s)since
they are usually considered nuisance parameters.
To identify the original parameters there must be at least as many parameters
in the transformed equation as in the original equation. Note that it is possible to
estimate the parameters of (3.2) for a total of T - m - 2 time periods. Ignoring
the constant terms, there is a total of (T - m - 2)2(m + 1) parameters in the
transformed equation, which involve a total of (T - m - 2) + (T - m - 1)2m
original parameters. Thus, it will not be possible to identify the original parame-
ters unless T - m - 2 > 2m, i.e., the number of estimable time periods is at least
as large as twice the lag length. Also, because many of the parameters of the
transformed equation consist of nonlinear functions of the original parameters
with complicated interactions across time periods, for some values of the parame-
ters it may not be possible to recover the original parameters, even when
T? 3m + 2.
Importantly, there is no need to recover the original parameters to test certain
interesting hypotheses. For example, the hypothesis that x does not (Granger)
cause y conditional on the individual effect restricts 8, = 0 for each / and t. Since
this further implies that d,, = 0 for each / and t, this noncausality hypothesis can
be tested by testing for zero coefficients for the lagged x variables in the
transformed equation.
It is also useful to note that additional restrictions on the original parameters
can aid their identification. When the restriction rt= 1 for each t is imposed, the
parameters of the estimable transformed equations involve only (T- m - 1)2m
original parameters. There will be at least as many parameters in the estimable
time periods for the transformed equation as original parameters when T - m - 2
>?m. Thus, when rt = 1 and the number of estimable time periods is at least as
large as the lag length, recovery of the original parameters from the transformed
parameters is possible and, as is apparent from equation (3.3), straightforward.
Identification of the original parameters is easiest when the individual effect
coefficients and the lag coefficients are stationary. In this case the transformed
equation (3.2) can be written:
Here there are only 2m + 1 right-hand side variables, so that there are enough
instruments to identify the parameters if t > m + 2. In the stationary case it is
possible to obtain estimates of the lag parameters when T > m + 2.
A final case that is of interest occurs when measurement error is present.
Suppose that xit and yit are unobserved, and that instead we observe
1=1
where
(3.8) ) Qg ?e+e1(~1e~
=v~ +(t X
it=it e1e)-
I it-- I E a11eX
eit -et -1)
By the assumption that the measurement errors are uncorrelated across time, the
vector
will be uncorrelated with vit and thus qualify as the vector of instrumental
variables for equation (3.7).9 Here there are only 2(t - m - 2) + 1 instrumen-
tal variables, so that the requirement that there are at least as many instrumental
variables as right-hand side variables becomes t > 2m + 2. Thus, it will only be
possible to estimate the lag parameters in the stationary case with uncorrelated
measurement error when T > 2m + 2.
B. Estimation
The presentation requires some additional notation. Let
Yt=Ylt I YNtI and Xt = [xlt, , X
XNtI
be N x 1 vectors of observations on units for a given time period. Let
=- t-m-1, t-19 ' Xt-m-1]
be the N X (2m + 3) vector of right-hand side variables for equations (3.2), where
e is an N x 1 vector of ones. Let
Vt=[vIt,,.,, VNt]
To combine all the observations for each time period, we can "stack" equations
(3.5). Let
Y[ Ym +33.*.. 9 Y. ]I
((T- m - 2)N x 1)
B=lBm+39 B`T I 9
((T- m - 2)(2m + 3) x 1)
V = I Vm+ 3, * * , VT, I'
((T-m-2)Nx 1)
W=diagIWMI+39... WTI]
(zM+ 3Vm+ 3) =N
plim (Z'V)/N = plim 0.
N -oo N- oc ZTVT)/N
given by
2= E { Z'VV'Z}.
(3.14) B= Hy+ G
(ii) Efficiency
Several comments concerning the efficiency of B are in order. First, B is
efficient in the class of instrumental variable estimators which use linear combi-
nations of the instrumental variables. This follows directly from the results of
Hansen (1982). (See also White (1982).) However, just as 3SLS on an entire
system of equations may be more efficient than 3SLS on a subset of the
equations, it may be possible to improve the efficiency by jointly estimating both
the equation for yit given past values of y and x and the equation for xi, given
the history of x and y.
Second, recall that our procedure involves dropping the equations for the first
m + 2 time periods. When the parameters are nonstationary this procedure
involves no loss in efficiency. Although the equations that are dropped may be
correlated with the remaining equations, there are no cross equation restrictions,
and they are underidentified. When the parameters are stationary, dropping the
first m + 2 periods may involve some loss in efficiency. Because there are
cross-equation restrictions, efficiency can be improved by adding back t = m + 2
and t = m + 1 period equations, both of which have observable lags. Also, if
there is no heteroskedasticity (across time or individuals) in the innovation
variance for yit and xit, then all of the parameters for the joint Yit and xit
process can be estimated without the earliest cross-section moments, so that it
may be possible to further improve efficiency by using these moments. Cross-sec-
tion moment based estimation of moving average (but not autoregressive) time
series models in panel data has been considered by MaCurdy (1981a).
C. Hypothesis Testing
In this section we discuss the computation of statistics to test the hypotheses
that x does not cause y, that the parameters are stationary, that m is the correct
lag length, and other possible hypotheses. In each case, the test statistic revolves
around the sum of squared residuals, resulting in tests with chi-square distribu-
tion in large samples. Further, we consider two additional topics: tests when
parameters are not identified under the alternative hypothesis and sequences of
tests.
We consider only tests of linear restrictions on the estimated parameters, B.
Consider the null hypothesis:
(3.16) Ho: B = Hy + G
where the notation is as before. As we have shown, it is straightforwardto impose
this restriction during estimation. Let
(3.18) L=QR-Q.
(i) UnidentifiedParameters
When executing the tests, it is often the case that some parameters are not
identified under the alternative hypothesis. For example, under the null hypothe-
sis that x does not cause y, lagged x's can be used as instrumental variables for
lagged y's. This is because lagged x's will be correlated with lagged y's via the
individual effect. Use of these instruments permits us to identify the parameters
in (3.11). Under the alternative hypothesis, the greater number of parameters
means that not all of the parameters are identified. Nonetheless, a test of the null
To see this, let P be the matrix such that PP = - 1. Then premultiplying (3.11) by P results in
Ys= WsBs + Vs
are not identified (in the absence of the restrictions imposed by the null
hypothesis) for time period s, i.e. Zs has fewer elements than Bs (and, hence
fewer than Ws). That is, in the equation
(3.19) ZsYs= Zs:WsBs
+ Zsv
the number of rows in Zs Ys is fewer than the number of parametersin Bs.
The appropriate test statistic once again uses the difference between the
restricted and unrestricted sum of squared residuals, but care must be taken in
constructing the covariance matrix 2. Since the same covariance matnrxmust be
used when computing both the restricted and unrestricted sum of squared
residuals, the following procedure is appropnate.
First obtain the restricted sum of squares, incorporating the fact that BS is
identified under the null hypothesis by adding equation (3.19) to the list of
equations to be estimated. Let B* = [B', Bs']' be the coefficients for the equa-
tions for all time periods. The parameters B are identified under either hypothe-
sis, but those for time period s are not. Consider the null hypothesis
(3.20) Ho: B* =Hy+ G
where the elements of y are identified. Using similar notation. let
V* = [V Vs', . Y* =[y Ys]
W*= diag[ W, Ws], Z* = diag[Z, Zs].
Under the null hypothesis, we may add equation (3.19) to equation (3.11) as:
17The analogy is not exact because we consider more general hypotheses than simplv hypotheses
which impose equality across equations and because the Joint covariance matnx across (3. 11), above.
and (3.19), below, is not diagonal.
18 Importantly, the submatrix must be obtained from the estimated covanance matrix, S2, prior to
inverting the matrix and constructing the unrestrictedsum of squares.
HI: B=Hy+G,
and the second hypothesis, nested within H1,
112: y=-+G.
Let Q be the unrestricted sum of squares, QR1 the restricted sum of squares from
imposing H1, and QR2 the sum of squares from imposing both H1 and H2, i.e.,
the restriction
1) This result is a simple extension of similar results for the likelihood ratio test for maximum
likelihood.
20A similar procedure based upon Wald tests is discussed by Sargan (1980) in the closely related
context of testing for dynamic specification of time series models.
IV. AN EXAMPLE
A. The Issues
The conventional approach to analyzing the relationship between hours worked
and the wage rate is to specify and estimate a model in which hours worked in a
given period depend upon that period's wage rate. Implicitly, past hours and
wages are assumed to have no impact on current hours. Similarly, the possibility
that the past history of wages and hours affects the current wage is ruled out.
However, on theoretical grounds it is quite plausible to expect intertemporal
relationships between wages and hours worked. For example, maximization of
utility in some life cycle models leads to labor supply functions which depend on
wages in other periods. (See, e.g., MaCurdy (1983).) Moreover, if there are costs
to adjusting hours of work in response to changes in wages, one might expect that
past hours of work would help predict current hours of work. At the same time,
some human capital accumulation models suggest that present wage rates depend
on past hours of work. (As hours increase, so does expertise on the job, leading to
a higher subsequent wage.) Alternatively, one can imagine incentive schemes that
link a worker's current wage rate to his past hours of work. (Hamilton (1986)
argues that such schemes may help explain the behavior of medical interns,
associates in law firms, and assistant professors.)
To fix ideas, consider the equation pair
(4.1a) hi =, + W + ??h + E,
discussed, then the individual would take into account the effect of today's choice
of hours on tomorrow's wages, and the labor supply equation would take a
different form.
Substituting w1, from equation (4.1b) into equation (4.1a) gives
(4.3) h1i= (ah + paw) + /81w, 1 + **+
+ (th + pw) + (Eh +
which together with equation (4.1b) gives a VAR of the form considered in
Section 2. Note that lagged hours is excluded from equation (4.1a) and (4.3) and
that cross-equation restrictions are present. Evidence of the presence of lagged
hours in either equation might therefore be interpreted as evidence against this
specification. The presence of lagged hours in the wage equation might be
interpreted as presence of the kind of human capital or incentive effects previ-
ously mentioned, while the presence of lagged hours in the hours equations might
occur because of preferences that are not time separable or costs of adjusting
hours. Of course, the presence of lagged hours in the hours equation could also
be due to the omission of relevant variables or the violation of the assumption of
no serial correlation in e. Since serial correlation is often thought to be present
in such models, this perhaps reduces the substantive implications of the finding
that lagged hours appears in the hours equation.
The important point here is that this model and similar models imply the
presence of dynamic interrelationships between wages and hours, and these
interrelationships can be investigated using panel data on wages and hours. This
fact, of course, has been recognized by earlier investigators. Lundberg (1985)
used panel data to test whether hours Granger-cause wages. However, her
estimation procedure involved taking deviations from means to account for the
presence of individual effects. As we argued in Section II, such a procedure leads
to inconsistent estimates. Abowd and Card (1986) analyzed the time series
properties of the first differences in hours and earnings. Like MaCurdy (1981b)
and Lundberg, they assumed that the individual effect was stationary. Moreover,
although it would be possible to work backward from their estimates to learn
about the time series properties of the levels, this would be extremely cumber-
some, and they made no attempt to do so. In contrast, the procedures developed
in Section 3 allow us to obtain consistent estimates of the time series properties
of the levels of wages and hours without having to impose stationary individual
effects, and without having to employ difficult nonlinear methods.
B. The Data
We estimate equations for wages and hours using a sample of 898 males from
the Panel Study of Income Dynamics (PSID) covering the years 1968 to 1981.21
21
Our data include the Survey of Economic Opportunity (SEO) subsample, which oversamples low
income households. We performed all of the tests reported in the next section deleting the SEO
subsample. The results, which are available upon request, do not in general difler substantially from
those presented below. For the one exception, see footnote 22.
We study two variables for each individual. First is the log of the individual's
annual hours of work ("hours", denoted h,), and second is the log of his annual
average hourly earnings ("wages", denoted w,). (For a complete description of
the data, see Altonji and Paxson (1986).) As discussed below, we also check some
of our results using data from the National Longitudinal Survey of Men 45-59.
The wage variable in both data sets is constructed by dividing total earnings by
hours worked. As a result, to the extent that measurement errors are present, they
will be correlated across variables. While this presents a problem for full-infor-
mation methods, the single equation techniques used here are unaffected by this
correlation. Finally, to the extent that measurement error induces a serial
correlation in the composite error terms of the autoregression, this problem will
reveal itself as a correlation between instrumental variables and the transformed
errors.
TABLE I
WAGE EQUATION
Q L DF P
(i) m = 3 0.00 - 0 -
(ii) All Parameters
Stationary 26.22 26.22 34 0.828
(iii) m= 2 27.14 0.92 2 0.631
(iv) m= 1 29.31 2.17 2 0.338
(v) m= 0 43.40 14.09 2 0.001
(vi) Exclude Hours
m= 1 29.94 0.63 1 0.427
TABLE II
ESTIMATESa
UNCONSTRAINEDPARAMETER
(1) (2)
Wage Equation Hours Equation
methods of Section III.C to conduct the appropriate tests. The lag length results
are recorded in lines (iii) through (v) of Table I, which show the results for the
sequence m = 2, m = 1, and m = 0, respectively. The results provide no evidence
that the wage equation contains more than a single lag of hours and wages.
Finally, we conduct a test of the hypothesis that hours do not cause wages.
Line (vi) of Table I shows that one cannot reject the hypothesis that lagged hours
may be excluded from the wage equation. The estimate of this single autoregres-
sive parameter is shown in column (1) of Table III. Note that the exclusion of
lagged hours from the wage equation rejects the notion that past hours of work
affect the current wage. To the extent that workers face a market locus of hours
and wages, it contains at most contemporaneous tradeoffs between hours and
wages.
TABLE III
CONSTRAINED PARAMETER ESTIMATES'
(m=2) (n=1)
Aht 1 - 0.156 0.156
(0.0206) (0.0175)
,Aht r - 0.002
(0.0181) -
Aust, 0.135 -0.001 -
(0.0368) (0.05) _
-0.045 -
si2r - (0.0174) _
TABLE IV
HOURS EQUATION
Q I. DF p
(i) =3 0.00 - - -
(ii) Parameters
Stationary 26.69 26.69 34 0.810
(iii) ot = 2 27.33 0.64 2 0.726
(iv) "n = 1 34.09 6.76 2 0.034
(v) Exclude Wages,
nt = 2 37.47 10.14 2 0.006
(vi) Exclude Wages.
t71= 1 37.62 3.53 1 0.060
the case with the wage equation, we cannot reject the hypothesis that the
appropriate specification is a first-differencedequation with constant lag parame-
ters (see line (ii)). The unconstrained parameter estimates for this specification
are presented in column (2) of Table II. As in the wage equation, the strongest
effect both from the point of view of the absolute value of the coefficient and
statistical significance, is the own first lag. However, while in the wage equation
nothing else seems to "matter," in the hours equation, the second lag on wages is
statistically significant.22 Indeed, as in Table IV one cannot reject the hypothesis
that m = 2 (see line iii). Further restrictions depend, however, on the chosen level
of significance. As line (iv) indicates, one cannot reject the hypothesis that m = 1
at the 1%level. However, this conclusion is reversed using a 5%significance level.
Therefore, we test the hypothesis that lagged wages do not cause hours
conditional on both m = 1 and m = 2. Using m = 2 (and, thus, adopting a 5%
significance level) one can reject the hypothesis that lagged wages may be
excluded from the equation for hours (line (v)). The parameter estimates for this
AR(2) model are shown in column (2) of Table III. Using the 1% level of
22
When the SEO subsample was excluded, the data did not reject the hypothesis that lagged wages
could be excluded from the hours equation.
significance leads to different results. As shown in line (vi) of Table IV, one
cannot reject the hypothesis that lagged wages may be excluded from the AR(1)
specification of the hours equation at a 5% significance level. The resulting
parameter estimate is shown in column (3) of Table III.
Thus, using a 1% significance level, one is left with a parsimonious representa-
tion of the dynamic behavior of hours and wages. Both variables may be
represented as autonomous autoregressive processes with a single lag. The robust-
ness of this result is examined below.
(iv) MeasurementError
The estimation procedure used in the wage and hours equations makes no
special allowance for the possibility of measurement error. Altonji (1986) has
estimated that a large part of the yearly variation in PSID data is due to
measurement error. As noted in Section 3, the estimation procedure may be
modified to accommodate measurement error by simply using a different set of
instrumental variables.
We examine the effect of measurement error on our results by re-estimating the
final form of both the wage and hours equations. In order to isolate the effect of
measurement error, we focus on the correlation between the composite error term
and the instrumental variables. For the AR(1) specification, measurement error
will produce a correlation between instrumental variables dated t - 2 and the
composite error. In the absence of measurement error, no such correlation will be
present. (See equation (3.8).) We estimate the equation for wages using two
different sets of instrumental variables: (i) lagged wages dated both t - 2 and
t- 3 and (ii) lagged wages dated t- 3.23 We estimate the hours equation using
the corresponding sets of instrumental variables. One can formally test the null
hypothesis of no correlation between the instrumental variables dated t - 2 and
23
Note that this set of instruments is more restrictive than that used in our previous estimations.
This change is an attempt to increase the power of the test for measurement error.
TABLE V
MEASUREMENT ERROR CORRECTION a
Wage Hours
Equation Equation
aFigures in parentheses are standard errors. Estimates in part (i) of the table use variables dated t - 2 and t - 3 as
instrumental variables; the estimates in part (ii) use only variables dated t - 3. Part (iii) contains the test statistic for a test of
the null hypothesis of no correlation.
24 A Hausman test on the difference between the two coefficients also fails to reject the hypothesis,
although the p value of about 0.10 provides somewhat stronger evidence against the null hypothesis.
TABLE VI
NLS RESULTS
Hours Equation
Q L DF P
m = 1, All Parameters
Stationary 21.21 - 16 0.170
Exclude wages 21.36 0.15 1 0.699
Wage Equation
m = 1, All Parameters
Stationary 25.95 - 16 0.055
Exclude Hours 26.00 0.05 1 0.823
NLS sample consists of 1446 men who had positive earnings and hours in each of
the survey years 1966, 1967, 1969, 1971, 1973, and 1975. (See Abowd and Card
(1986) for details on the construction of the sample.) As in the PSID, we use two
data series: the log of annual hours of work and the log of average hourly
earnings. The latter is constructed as the difference between the log of annual
earnings and the log of annual hours.
Use of the NLS is complicated by the above-noted fact that the data are not
available for consecutive years. For the simple AR(1) model
(4.4) yit =fi + alyit,l + uit,
the problem of missing years can be circumvented by successive substitution to
yield a relationship between two-year differences:
(4.5) y=.-Yit-2
Yi Yt 2 = a2(y,2
Yt2Yt4 u ? aluit
? uia1t_ 1-it3 1-it2
Equation (4.5) is estimable using our methods. However, for an AR(2) (or longer
lag), successive substitution gives an infinite order lag specification that is not
estimable using our methods. Fortunately, the results from the PSID discussed
above indicate (using the 1% significance level) that equation (4.4) may be an
adequate representation of both wages and hours. For these reasons, we con-
centrate on the estimation of AR(1) models. We test the assumption of excluda-
bility and the overidentifying restrictions implied by the initial assumptions on
stationarity and lag length.
The test results are contained in Table VI. For both wages and hours, we
cannot reject the initial assumptions of stationarity and lag length; although the
test for wages is borderline at the 5% level.25 Under the identification restriction
that both own lag coefficients are nonzero and have the same sign, we can test for
causality by testing whether the wage is significant in the hours equation, and
whether hours is significant in the wage equation. The result of these tests is that
one cannot reject the hypotheses that hours do not cause wages and wages do not
25 Note that we only identify the square of the matnrx of autoregressive coefficients from the
biannual data. These restrictions allow us to solve for the original matrix in terms of its square.
TABLE VII
NLS PARAMETER ESTIMATESa
Ah, 2 0.0263
(0.0410)
'A w 2 0.0492
(0.0259)
TRANSFORMED NLS PARAMETER ESTIMATESa
Wage Equation Hours Equation
zAh, 0.162
- (0.127)
,A 0.222
(0.0584)
cause hours. The result in both cases is an equation of the form specified in (4.5).
Thus, neither the PSID nor NLS data lend support to the notion that the current
wage rate depends upon past hours of work.
The parameter estimates are shown in Table VII. Of course, because of the
presence of the al in equation (4.5), these parameters do not correspond directly
to those we obtained using PSID data. To make the estimates comparable, we
take square roots, imposing an identifying assumption that the underlying
coefficients are positive. These transformed estimates (and their standard errors)
are shown at the bottom of Table VII.
The correspondence between these point estimates and those in Table III is
striking. This is particularly compelling when one considers that the two data sets
cover different years and contain different types of individuals: the PSID has
only hourly employees, while the NLS has only relatively old workers. With
respect to the statistical significance of the estimates, in both data sets the lagged
wage is significant at conventional levels. There is some disagreement with
respect to lagged hours, however. For the PSID the t statistic for the coefficient
of lagged hours in the hours equation is 8.90; for the NLS it is 1.28. While this
evidence is mixed it does suggest that it is potentially dangerous to exclude
lagged hours from the autoregressive hours equation. Whether this fact has
consequences for the appropriate specification of the structural hours equation is
unclear; it depends on whether the presence of lagged hours is due to serial
correlation .2
26
26 Using PSID data, we tested for the presence of first order serial correlation in eh, of equation
(4.1a) using a Wald test of the common factor restriction that the coefficient of H, times the
coefficient of w, is equal to the negative of the coefficient of w, 2 (This test is valid under the
assumption that only w, l appears in the wage equation.) The test indicated that one can reject
the hypothesis of first order autocorrelation at a 3 percent significance level. Of course, testing for
other patterns of serial correlation might produce a diflerent result. Hence, while this test is suggestive
that the presence of lagged hours is indeed due to "structural" considerations, it is not conclusive.
TABLE VIII
NLS MEASUREMENT ERROR CORRECTIONa
aFigures in parentheses are standard errors. Estimates in part (i) of the table use variables dated - 2 and t --- 4 as
instrumental variables, the estimates in part (ii) use onlv variables dated t -- 4. Part (iii) contains the test statistic for a test of
the null hvpothesis of no correlation.
Finally, in the same fashion as for the PSID we check for the importance of
measurement error by re-estimating the equations using alternative sets of
instrumental variables. The results for the untransformed parameter estimates are
shown in Table VIII. Comparing these results to those in the top of Table VII,
we see that much like the results from the PSID, the most seriously affected
coefficient is that in the wage equation. Here, however, the formal test for the
wage equation supports the hypothesis of measurement error at close to the 1%
level.27As before, for the hours equation, the hypothesis of measurement error is
not supported.
5. CONCLUSION
27
A Hausman test also supports the measurement error hypothesis for the wage equation; the p
value is about 0.02.
REFERENCES
ABOWD,JOHNM., AND DAVID CARD (1986): "On the Covariance Structure of Earnings and Hours
Changes," mimeo, Princeton University, October, 1986.
ALTONJI, J. (1986): "Intertemporal Substitution in Labor Supply: Evidence from Micro Data,"
Journal of Political Economy, 94, S176-S215.
ALTONJ, J., AND C. PAXSON (1986): "Job Characteristics and Hours of Work," Research in Labor
Economics.
ANDERSON, T. W., AND C. HSIAO (1982): "Formulation and Estimation of Dynamic Models Using
Panel Data," Journal of Econometrics,18, 47-82.
ASHENFELTER,ORLEY, AND D. CARD(1982): "Time Series Representations of Economic Variables
and Alternative Models of the Labour Market," Review of Economic Studies, 49, 761-782.
BROWNING, M. J., A. S. DEATON, AND M. IRISH(1985): "A Profitable Approach to Labor Supply and
Commodity Demands Over the Life-Cycle," Econometrica, 53, 503-544.
CHAMBERLAIN,GARY (1983): "Panel Data," Chapter 22 in The Handbook of Econometrics VolumeII,
ed. by Z. Griliches and M. Intrilligator. Amsterdam: North-Holland Publishing Company.
FISHER, FRANKLIN (1966): The Identification Problem in Econometrics. Huntington, N.Y.: Krieger
Publishing Company.
GALLANT, RONALD, AND D. JORGENSON(1979): "Statistical Inference for a System of Nonlinear,
Implicit Equations in the Context of Instrumental Variables Estimation," Journal of Econometrics,
11, 275-302.
GRILICHES, ZVI, AND J. HAUSMAN (1984): "Errors in Variables in Panel Data," National Bureau of
Economic Research Technical Working Paper No. 37, May, 1984.
HAMILTON, BRUCE (1986): "Merit Pay Increases and the Supply of Labor," mimeo, Johns Hopkins
University, November, 1986.
HANSEN, LARS (1982): "Large Sample Properties of Generalized Method of Moments Estimators,"
Econometrica, 50, 1029-1054.
HECKMAN, J. J., AND T. E. MACuRDY (1980): "A Life Cycle Model of Female Labor Supply," Review
of Economic Studies, 47, 47-74.
HOLTz-EAKIN, D. (FORTHCOMING):"Testing for Individual Effects in Autoregressive Models,"
forthcoming in Journal of Econometrics.
HOLTz-EAKIN, D., W. NEWEY, AND H. ROSEN (FORTHCOMING):"The Revenues-Expenditures Nexus:
Evidence from Local Government Data," forthcoming in International Economic Review.
LUNDBERG, S. (1985): "Tied Wage-Hours Offers and the Endogeneity of Wages," Review of
Economics and Statistics, 67, 405-410.
MACuRDY,T. (1981a): "Time Series Models Applied to Panel Data," mimeo, Stanford University.
(1981b): "An Empirical Model of Labor Supply in a Life Cycle Setting," Journal of Political
Economy, 89, 1059-1086.
(1983): "A Simple Scheme for Estimating an Intertemporal Model of Labor Supply and
Consumption in the Presence of Taxes and Uncertainty," International Economic Review, 24,
265-289.
NEWEY,WHITNEY,AND K. WEST(1987): "Hypothesis Testing with Efficient Method of Moment
Estimation," International Economic Review, 28, 777-787.
NICKELL,STEPHEN(1981): "Biases in Dynamic Models with Fixed Effects," Econometrica, 49,
1417-1426.
PAKES,ARIEL,AND Zvi GRILICHES (1984): "Estimating Distributed Lags in Short Panels with an
Application to the Specification of Depreciation Patterns and Capital Stock Constructs," Review of
Economic Studies, 51, 243-262.
SARGAN,J. D. (1980): "Some Tests of Dynamic Specification for a Single Equation," Econometrica,
48, 879-898.
TAYLOR,JOHN (1980): "Output and Price Stability-An International Comparison," Journal of
Economic Dynamics and Control, 2, 109-132.
WHITE, HALBERT(1980): "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a
Direct Test for Heteroskedasticity," Econometrica,48, 817-838.
(1982): "Instrumental Variables Regression with Independent Observations," Econometrica,
50, 483-500.