Capitulo 4 Primera Edicion
Capitulo 4 Primera Edicion
t;hapter 4
If you solve (A3.5) for A. it is possible to verify
,
y; =O-A.:)L!/1-1Y1-j
Fi
Since A 1 < 1, the summation is such that (1 - A 1 )IA.:'1 - 1 = l. Hence, the optima!
1 1
l
a covariance stationary series:
l. Exhibits mean reversion in that it fluctuates around a constant long-run mean. Since the estimate of a 1 is directly related to the value of p 1 , the estimated value
2. Has a finite variance that is time-invariant. of a 1 is biased to be below its true value of unity. The estimated model will mimic
that of a stationary AR( 1) process with a near unit root. Hence, the usual t-test can- "
3. Has a theoretical correlogram that diminishes as lag length increases.
not be used to test the hypothesis a 1 = 1.
On the other hand, a nonstationary series necessarily has pennanent components. Figure 4.1 shows the sarnple correlograrn for a simulated random walk process.
The mean andJor variance of a nonstationary series are time-dependent. To aid in One hundred norrnally distributed random deviates were obtained so as to mimic
the identification of a nonstationary series, we know that: the {E,) sequence. Assuming y 0 =O, we can calculate the next 100 values in the (y,}
1. There is no long-run mean to which the series retums. sequence as y,= y,_, + E1• This particular correlogram is characteristic of most sam-
2. The variance is time-dependen! and goes to infinity as time approaches infinity. ple correlograms constructed from nonstationary data. The estimated value of p 1 is
close to unity and the sample autocorrelations die out slowly. If we did not know
3. Theoretical autocorrelations do not decay but, in finite samples, the sample cor-
the way in which the data were generated, inspection of Figure 4.1 might lead us to
relogram dies out slowly.
falsely conclude that the data were generated from a stationary process. With this
. · Although the properties of a sample correlogram are useful tools for detecting particular data, esti mates of an AR( 1) model with and without an intercept yield
the possible presence of unit roots, the method is necessarily irnprecise. What may (standard errors are in parentheses):
appear as a unit root to one observer may appear as a stationary process to another.
The problem is difficult because a near unit root process will have the same shaped y,::: 0.9546yt-1 +E,, R 2 = 0.86 (4.3)
ACF as a unit root process. For example, the correlogram of a stationary AR(l) (0.030)
process such that p(l) = 0.99 will exhibit the type of gradual decay indicative of a y,::: 0.164 + 0.9247y,_I +E,, R 2 = 0.864 (4.4)
nonstationary process. To illustrate sorne of the issues involved, suppose that we (0.037)
know a series is generated from the following first-order process: 1
Examining (4.3), a careful researcher would not be willing to dismiss the possi-
y,::: ª1Y1-1 +E, (4.1) bility of a unit root since the estimated value of a 1 is only 1.5133 standard devia-
tions from unity. We might correctly recognize that under the null hypothesis of a
where (E,} is generated frorn a white-noise process. unit root, the estimate of a 1 will be biased below unity. If we knew the true distrib-
First, suppose that we wish to test the null hypothesis that a 1 = O. Under the ution of a 1 under the null of a unit root, we could perfonn such a significance test.
maintained null hypothesis of a 1 =O, we can estimate (4.1) using OLS. The fact Of course, if we did not know the true data-generating process, we might estimate
that E, is a white-noise process and \a 1 1 < 1 guarantees that the [y,} sequence is the model with an intercept. In (4.4), the estímate of a, is more than two standard
stationary and the estimate of a 1 is efficient. Calculating the standard error of the deviations from unity: (1 - 0.9247)/0.037 = 2.035. However, it would be wrong to
estimate of a 1 , the researcher can use a r-test to detennine whether a 1 is signifi- use this infonnation to reject the null of a unit root. After ali, the point of this sec-
cantly different from zero. tion has been to indicate that such t-tests are inappropriate under the null of a unit ·
The situation is quite different if we want to test the hypothesis a 1 = l. Now, un- roo t.
der the null hypothesis, the {y,} sequence is generated by the nonstationary process: Fortunately, Dickey and Fuller (1979, 1981) devised a procedure to fonnally test
, for the presence of a unit root. Their methodology is similar to that used in con-
structing the data reported in Figure 4.1. Suppose that we generated thousands of
y,::: LE¡ (4.2) such random walk sequences and for each we calculated the estimated value of a 1 •
i=l Although most ali of the estimates would be close to unity, sorne would be further
Figure 4.1 The application of these Dickey-Fuller critica/ values to tests for unit roots is
/
/-
A simulated random walk process. straightforward. Suppose we did not know the true-data generating process and
/
10
were trying to ascertain whether the data used in Figure 4.1 contained a unit root.
Using these Dickey-Fuller statistics, we would not reject the null of a unit root in
8
(4.4). The estimated value of a 1 is only 2.035 standard deviations from unity. In
6 fact, if the true value of a 1 does equal unity, we should find the estimated value to
be within 2.58 standard deviations from unity 90% of the time.
4 Be aware that stationarity necessitates -1 < a 1 < l. Thus, if the estimated value
of a 1 is close to -1, you should also be concerned about nonstationarity. If we de-
2 fine y :::: a 1 - 1, the equivalent restriction is -2 < y < O. In conducting a Dickey-
3
Fuller test, it is possible to check that the estimate1 value of y is greater than -2.
o
Monte Cario Simulation
-2
The procedure Dickey and Fuller (1979, 1981) used to obtain their critica! values is
-4 typical of that found in the modern time series literature. Hypothesis tests concem-
o
ing the coefficients of non-stationary variables cannot be conducted using tradi-
tional t-tests or F-tests. The distributions of the appropriate test statistics are non-
standard and cannot be analytically evaluated. However, given the trivial cost of
Correlogram of the process. computer time, the non-standard distributions can easily be derived using a Monte
.. Cario simulation .
The first step in the procedure is to computer generate a set of random numbers
0.8 (sometimes called pseudo-random numbers) from a given distribution. Of course,
the numbers cannot be entirely random since ali computer algorithms rely on a de-
terministic number generating mechanism. However, the numbers are drawn so as
0.6
to mimic a random process having sorne specified distribution. Usually, the num-
bers are designed to be nonnally distributed and serially uncorrelated. The idea is to
0.4 use these numbers to represent one replication of the entire {e,} sequence.
Ali major statistical packages have a built-in random number generator. An inter-
0.2
esting experiment is to use your software package to draw a set of 100 random
numbers and check for serial correlation. In almost ali circumstances, they will be
highly correlated. In your own work, if you need to use serially uncorrelated num-
o 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 bers, you can model the computer generated numbers using the Box Jenkins
methodology. The residuals should approximate white noise.
(b)
The second step is to specify the parameters and initial conditions of the {y,} se-
quence. Using these parameters, initial conditions, and random numbers, the {y,}
from unity than others. In performing this experiment, Dickey and Fuller found that can be constructed. Note that the simulated ARCH processes in Figure 3.9 and ran-
in the presence of an intercept: dom-walk process in Figure 4.1 were constructed in precisely this fashion.
Similarly, Dickey and Fuller (1979, 1981) obtained 100 values for {E,}. set a 1 == 1,
Ninety percent of the estimated val ues of a 1 are lcss than 2.5 8 standard errors from
y 0 ==O, and calculated l 00 values for {y,} accordíng to (4.1 ). At thís point, the para-
unity.
meters of interest (such as the estímate of a 1 or the in-sample variance of y,) can be
Ninety-five percent of the estimated values of a 1 are less than 2.89 standard errors obtained.
from unity. The beauty of the method is that all important attributes of the constructed {y,}
Ninety-nine percent of the estimated values of a 1 are less than 3.51 standard errors sequence are known to the researcher. For this reason, a Monte Cario simulation is
from unity. 2 often referred to as an "experiment." The only problem is that the set of random
• .. _..,,.Ó )'-" • ' - ,...,._, ...,,.... '- "•• • •v ·
In their Monte Cario analysis, Granger and Newbold generated many such sam-
numbers drawn is just one possible outcome. Obviously, the estimates in (4.3) and
(4.4) are dependent on the values of the simulated { E1 } sequence. Different out- ples and for each sample. estimated a regression in the form of (4.5). Since the (y 1}
comes for { E1 } will yield different values of the simulated (y 1 } sequence. and {.zrl sequences ar md pendent of each other, Equation (4.5) is necessarily
This is why the Monte Cario studies perform many replications of the process meamngles ; a y relat10nsh1p between the two variables is spurious. Surprisingly,
outlined above. The third step is to replicate steps 1 and 2 thousands of times. The at the 5% s1g01ficance leve.!, they were able to reject the null hypothesis a 1 = O in
goal is to ensure that the statistical properties of the constructed (y, l sequence are approx1mately 750:º of the t1 e. Moreover, the regressions usually had very high R 2
in accord with the true distribution. Thus, for each replication, the parameters of in- va!ues and the est1mated res1duals exhibited a high degree of autocorrelation.
terest are tabulated and critica! values (or confidence intervals) obtained. As such, To xplain the ?ranger a d Newbold findings, note that the regression equation
the properties of your data set can be compared to the properties of the simulated (4.5! is ne essanly meanmgless if the residual series {e,} is nonstationary.
data so that hypothesis tests can be performed. This is the justification for using the Obv1ously, 1f the {e uence has a stochastic trend, any error in period t never
Dickey-Fuller critica! values to test the hypothesis a 1 = 1. dec ys, so th.at tne deviation from the mOcieCíSpei:1nanerit. It is hard to imagine at-
tachmg any 1mportance to an economic model having perrnanent errors. The sim-
One limitation of a Monte Cario experiment is that it is specific to the assump-
plest way to examine the properties of the {e,) sequence is to abstract from the in-
tions used to generate the simulated data. If you change the sample size, include (or
tercept term a 0 and rewrite (4.5) as
delete) an additional parameter in the data generating process, or use altemative ini-
tial conditions an entirely new simulation needs to be performed. Nevertheless, you
shou!d be able to envision many applications of Monte Cario simulations. As dis-
cussed in Hendry, Neale, and Ericsson (1990), they are particularly use ful for
studying the small sample properties of time-series data. As you will see shortly, lf z, and Y1 are id by (4.6) and (4.7), we can impose the initial eortdi:tions
Monte Cario simulations are the workhorse of unit root tests. Yo = X 0 =O, so that:
CASE2
o o
The {y,} and {z,} sequences are integrated of different orders. Regression equations
y, z, using such variables are meanini?iless. For exampl , replace (4.7) by the stationary
-2 -2 process z, = pz,_ 1 + Ezr• where 1 p \ < 1. Now (4.8) IS replaced by e,= LEy; - L:p'Ezr_,.
Although the expression Lp;E,,_; is convergent, the ( Ey,} sequence still contains a
trend component. 5
-4 -4
o 50 100 o 50 100
CASE 3
Since the ( Eyr} and { Ezr} sequences are independent, the regression of y, on z, is spu- The nonstationary {y,} and {z,} sequences are integrated of the same order and the
rious. Given the realizations of the random disturbances, it appears as if the two se- residual sequence contains a stochastic trend. This is the case in which the regres-
quences are related. In the scatter plot of y, against z,, you can see that y, tends to sion is spurious. The results from such spurious regressio!ls Wt:. meaningless in that
rise as z, decreases. The regression equation of y, on z, will capture this tendency. errors_a.:r!!J> !!ll -In thls case, Ttls often-rec;i'luñendedthat-th·é-reg;:; sio
The correlation coefficient between y, and z, is --0.372 and a linear regression yields equation be estimated in first differences. Consider the first difference of (4.5):
y, = --0.46z, - 0.31. However, the residuals from the regression equation are nonsta-
tionary.
Scatter plot of y, and z, Regression residuals Since y,, z,, and e, each contain unit roots, the first difference of each is stationary.
2 2 Hence, the usual asymptotic results apply. Of course, if one of the trends is deter-
++ + +
+++ ++/ ministic and the other is stochastic, first-differencing each is not appropriate.
o + +
+ i: .t + + o CASE4
.t
...
+ +
+.t+ :e
-2 1.+ ... -2 The nonstationary {y,} and { z,} sequences are integrated of the same order and the
+ residual sequence i station ry. In this circumstance, {y,} and (z,} are cointe-
grated. A trivial example of a coiritegrated system occurs if Ezr and Ey, are perfectly
-4 -4
-3 -2 -1 o 2 o 50 100 correlated. If Ezr = Ey,, then (4.8) can be set equal to zero (which is stationary) by
+
Scatter plot
setting a, = l. To considera more interesting example, suppose that both z, and y,
-Regression line are the random walk plus noise processes:
y,=µ,+ Eyt Cases 2 or 3 apply. If the variables are cointegrated, the results of Chapter 6 apply.
Z, = µ, + E, 1 The rcmainder of this chapter considers the formal test procedures for the presence
of unit roots and/or deterministic time trends.
where Ey, and E" are white-noise processes and µ, is the random walk process µ, =
µ,_ 1 + Er Note that both {z,) and {y,) are unit root processes, but y, - z, = Ey, - E,, is
stationary.
Ali of Chapter 6 is devoted to the issue of cointegrated variables. For now, it is 2. DICKEY-FULLER TESTS
sufficient to note that pretesting the variables in a regression for nonstationarity is
extremely importan t. Estimating a regression in the form of (4.5) is meaningless if The last section outlíned a simple procedure to detennine whether a 1 = l in the
model y,= a 1 y,_ 1 +E,. Begin by subtracting y,_ 1 from each side of the equation in
order to write the equivalent form: !::.y,= yy,_ 1 +E" where y= a 1 - l. Of course, test-
WORKSHEET 4.2 Spurious Regressions: Example 2 ing the hypothesis a 1 = 1 is equivalent to testing the hypothesis y= O. Dickey and
Fuller ( 1979) actually consider three different regression equations that can be u sed
Consider the two random walk plus drift processes: to test for the presence of a unit root:
y,= 0.2 + y,_ 1 + Ey 1 z,= -0.1 + z,_, +E" !::.yr == YY1-1 +e., (4.9)
30 o t.yr ==ªo+ 'YYr-1 + e., (4.10)
20
t.yr == ªº + YY1-1 + a2t +e., (4.l l)
-5 --
The difference between the three regressions concems the presence of the deter-
10 z, ministic elements a 0 and a 2 t. The first is a pure random walk model, the second
-10 adds an intercept or drift term, and the third includes both a drift and linear time
o tren d.
The parameter of interest in ali the regression equations is y, if y= O, the {y1 } se-
-10 -150 quence contains a unit root. The test involves estimating one (or more) of the equa-
o 50 100 50 100
tions above using OLS in order to obtain the estimated value of y and associated
standard error. Comparing the resulting t-statistic with the appropriate value re-
ported in the Dickey-Fuller tables allows the researcher to determine whether to ac-
Again, the {e.Y,) and {E."} sequences are independent, so that the regression of y, on
cept or reject the null hypothesis y= O.
z, is spurious. The scatter plot of y, against z, strongly suggests that the two series
are related. It is the deterministic time trend that causes the sustained increase in y, Recall that in (4.3), the estímate of y,= a 1 y,_ 1 +e., was such that a 1 = 0.9546 with
and sustained decrease in z,. The residuals from the regression equation y, = -2z, + a standard error of 0.030. Clearly, the OLS regression in the form !::.y,= yy,_ 1 + e.,
will yield an estímate of y equal to --0.0454 with the same standard error of 0.030.
e, are nonstationary.
Hence, the associated r-statistic for the hypothesis y= O is -1.5133 (-0.0454/0.03 =
-1.5133).
Scatter plot of y, and z, Regression residuals
4 The methodology is precisely the same, regardless of which of the three forms of
the equations is estimated. However, be aware that the critica! values of the r-statis-
tics do depend on whether an intercept and/or time trend is included in the regres-
20 2
sion equation. In their Monte Cario study, Dickey and Fuller (1979) found that the
y, 10 y,+ 2·z, o critica! values for y= O depend on the form of the regression and sample size. The
statistics labeled -r, -rµ, and -r, are the appropriate statistics to use for Equations
(4.9), (4.10), and (4.11), respectively.
o -2
Now, look at Table A at the end of this book. With 100 observations, there are
-10L---'................. -'- J three different critica! values for the t-statistic y= O. For a regression without the
-15 -10 -5 o 50 100 intercept and trend terms (a 0 = a2 = 0), use the section labeled -r. With 100 observa-
z, tions, the critica! values for the r-statistic are -1.61, -1.95 and -2.60 at the 10, 5,
/
,,and 1 % significance levels, respectively. Thus, in the hypothetical example with y= r = number of restrictions
-0.0454 and a standard error of 0.03 (so that t = -1.5133), it is not possible to reject T = number of usable observations
the null of a unit root at conventional significance levels. Note that the appropriate k = number of parameters estimated in the unrestricted model
critica] values depend on sample size. As in most hypothesis tests, for any given
leve] of significance, the critica! values of the t-statistic decrease as sample size in- Hence, T - k = degrees of freedom in the unrestricted model.
creases. Comparing the calculated value of <ji¡ to the appropriate value reported in Dickey
Including an intercept term but nota trend term (only a 2 =O) necessitates the use and Fuller ( 1981) allows you to determine the significance leve! at which the re-
of the critica! values in the section labeled 'tµ- Estimating (4.4) in the form ti.y, = striction is binding. The null hypothesis is that the data are generated by the re-
ªº + YY1-1 +E, necessarily yields a value of y equal to (0.9247 - 1) = -0.0753 with a stricted model and the altemative hypothesis is that the data are generated by the
standard error of 0.037. The appropriate calculation for the 'tµ statistic yields unrestricted model. If the restriction is not binding, RSS(restricted) should be close
-0.0753/0.037 = -2.035. If we read from the appropriate row of Table A, with the to RSS(unrestricted) and <ji¡ should be small; hence, large values of <jl¡ suggest a
same 100 observations, the critica] values are -2.58, -2.89, and -3.51 at the 1O, 5, binding restriction and rejection of the null hypothesis. Thus, if the calculated value
and 1 % significance levels, respectively. Again, the null of a unit root cannot be re- of <ji, is smaller than that reported by Dickey and Fuller, you can accept the re-
jected at conventional significance levels. Finally, with both intercept and trend, stricted model (i.e., you do not reject the null hypothesis that the restriction is not
use the critica] values in the section labeled 't,; now the critica! values are -3.45 binding). If the calculated value of <)l; is larger than reported by Dickey and Fuller,
and -4.04 at the 5 and 1 % significance levels, respectively. The equation was not you can reject the null hypothesis and conclude that the restriction is binding. The
estimated using a time trend; inspection of Figure 4.1 indicates there is little reason critica] values of the three <)l; statistics are reported in Table C at the end of this text.
to include a deterministic trend in the estimating equation. Finally, it is possible to test hypotheses conceming the significance of the drift
As discussed in the next section, these critica! values are unchanged if (4.9), term a0 and time trend a 2 . Under the null hypothesis y= O, the test for the presence
(4.10), and (4.11) are replaced by the autoregressive processes: 6 of the time trend in (4.14) is gíven by the 't¡h statistic. Thus, this statistic is the test
a 2 =O given that y= O. To test the hypothesis a 0 =O, use the 'ª' statistic if you esti-
p
where RSS =residual sum of squares, and standard errors are in parentheses.
To test the null hypothesis that the data are generated by (4.17) against the alter-
native that (4.15) is the "true" model, use the qi 2 statistíc. Dickey and Fuller test the Next, add and subtract (ap-- 1 + aP)y,_r+ 2 to obtain
null hypothesis a0 = a 2 =y= O as follows. Note that the residual sums of squares of
the restricted and unrestricted models are 0.065966 and 0.056448 and the null hy-
pothesis entails three restrictions. With l JO usable observations and four estimated
parameters, the unrestricted model contains 106 degrees of freedom. Sin ce Continuing in this fashion, we get
0.056448/106 = 0.000533, the (j> 2 statistic is given by
p
qi2 = (0.065966 - 0.056448)/ 3(0.000533) = 5.95 6.yr =ªo+ 'Y Yr-1 + L ¡LlYr-i+I + E1
i=2
With l JO observations, the critica! value of (j> 2 calculated by Dickey and Fuller is
5.59 at the 2.5% significance leve!. Hence, it is possible to reject the null hypothe-
where 'Y=-(!- f., a;)
r=I
sis of a random walk against the altemative that the data contain an intercept and/or p
a unit root and/or a deterministic time trend (i.e., rejecting a0 = a 2 = y= O means
that one or more of these parameters does not equal zero).
¡= Lª1
j=i
(4.19)
Dickey and Fuller also test the null hypothesis a2 =y= O given the altemative of
(4.15). Now ifwe view (4.16) as the restricted model and (4.15) as the unrestricted In (4.19), che coefficient of interest is y; if y= O, the equation is entirely in first
model, the <j> 3 statistic is calculated as differences and so has a unit root. We can test for the p esence of a unit root using
the same Dickey-Fuller statistics discussed above. Again, the appropriate statistic
<j>3 = (0.063211 - 0.056448)/ 2(0.000533) = 6.34 to use depends on the deterministic components included in the regression equa-
tion. Without an intercept or trend, use the 1statistic; with only the intercept, use
With 11O observations, the critica! value of <j> 3 is 6.49 at the 5% significance the 1:µ statistic; and with both an intercept and trend, use the e, statistic. It is worth-
leve! and 5.47 at the 10% significance level. 7 At the 10% leve!, they reject the null while pointing out that the results here are perfectly consistent with our study of
hypothesis. However, at the 5% leve!, the calculated value of <\> 3 is smaller than the difference equations in Chapter 1. If the coefficients of a difference equation sum to
critica! value; they do not reject the null hypothesis that the data contain a unit root 1, ar least one characteristic root is unity. Here, if Úl; = 1, y= O and the system has
and/or deterministic time trend. a unit root.
To compare with the 1, test (i.e., the hypothesis that only y= 0) note that Note that the Dickey-Fuller tests assume that the errors are independent and
ha ve a constant variance. This raises four important problems related to the fact that
1, = --0.119/0.033 =-3.61. we do not know the true data-generating process. First, the true data-generating
process may contain both autoregressive and moving average components. We
which rejects the null of a unit root at the 5% leve!. need to know how to conduct the test if the order of the moving average terms (if
unknown. Second, we cannot properly estímate y and its standard error un- increased number of lags necessitates the estimation of additional parameters and a
/ less ali the autoregressive terms are included in the estimating equation. Clearly, loss of degrees of freedom. The degrees of freedom decrease since the number of
the simple regression 6y, = a 0 + yy,_ 1 + E, is inadequate to this task if (4.18) is the parameters estimated has increased and because the number of usable observations
'
true data-generating process. However, the true order of the autoregressive process has decreased. (We lose one observation for each additional lag included in the au- j
is usually unknown to the researcher, so that the problem is to select the appropriate toregression.) On the other hand, too few lags will not appropriately capture the ac-
1
lag length. The third problem stems from the fact that the Dickey-Fuller test con- tual error process, so that y and its standard error will not be well estimated.
siders only a single unit root. However, a pth-order autoregression has p character- How <loes the researcher select the appropriate lag length in such circumstances'l
istic roots; if there are m :S: p unit roots, the series needs to be differenced m times to One approach is to start with a relatively long lag length and pare down the model
achieve stationarity. The fourth problem is that it may not be known whether an in- by the usual t-test and/or F-tests. For example, one could estímate Equation (4.20)
tercept and/or time trend belongs in (4.18). We consider the first three problems be- using a lag length of n*. If the t-statistic on lag n* is insignificant at sorne specified
low. Section 7 is con cerned with the issue of the appropriate deterministic regres- critica! value, reestimate the regression using a lag length of n* - 1. Repeat the
sors. process until the lag is significantly different from zero. In the pure autoregressive
Since an invertible MA model can be transformed into an autoregressive model, case, such a procedure will yield the true lag length with an asymptotic probability
the procedure can be generalized to allow for moving average components. Let the of unity, provided that the initial choice of lag length includes the true length. With
{y,} sequence be generated from the mixed autoregressive/moving average process: seasonal data, the process is a bit different. For example, using quarterly data, one
could start with 3 years of lags (n = 12). If the t-statistic on lag 12 is insignificant at
A(L)y, = C(L)E, sorne specified critica! value and an F-test indicates that lags 9 to 12 are also in-
significant, move to lags 1 to 8. Repeat the process for lag 8 and lags S to 8 until a
where A(L) and C(L) = polynomials of orders p and q, respectively reasonable lag length has been determined.
Once a tentative lag length has been determined, diagnostic checking should be
If the roots of C(L) are outside the unit circle, we can write the {y,} sequence as
conducted. As always, plotting the residuals is a most important diagnostic too!.
the autoregressive process: There should not appear to be any strong evidence of structural change or serial
correlation. Moreover, the correlogram of the residuals should appear to be white
A(L)y, /C(L) = E, noise. The Ljung-Box Q-statistic should not revea! any significant autocorrelations
among the residuals. It is inadvisable to use the altemative procedure of beginning
or, defining D(l) = A(l)IC(L), we get with the most parsimonious model and keep adding lags until a significant lag is
found. In Monte Cario studies, this procedure is biased toward selecting a value of
D(L)y, =E,
n that is less than the true value.
Even though D(l) will generally be an infinite-order polynomial, in principie we
Multiple Roots
can use the same technique as used to obtain (4.19) to form the infinite-order au-
toregressi ve model: Dickey and Pantula (1987) suggest a simple extension of the basic procedure if
more than one unit root is suspected. In essence, the methodology entails nothing
more than performing Dickey-Fuller tests on successive differences of (y,}. When
t,.y, = 'YY1-l + IJl;6Y,-i+l +E, (4.20) exactly one root is suspected, the Dickey-Fuller procedure is to estímate an equa-
i=2 tion such as 6y, = a0 + yy,_ 1 + E,. Instead, if two roots are suspected, estímate the
equation:
As it stands, (4.20) is an infinite-order autoregression that cannot be estimated Now, the second problem conceming the appropriate lag length arises. Including
using a finite data set. Fortunately, Said and Dickey (1984) have shown that an un- too many lags reduces the power of the test to reject the null of a unitroot since the
known ARIMA(p, 1, q) process can be well approximated by an ARIMA(n, 1, O)
autoregression of order no more than T 113 • Thus, we can solve the first problem by
using a finite-order autoregression to approximate (4.20). The test for y= O can be
conducted using the aforementioned Dickey-Fuller 1, "tµ, or 1, test statistics.
(4.21)
Use the appropriate statistic (i.e., 1, 'µ• or 1:, depending on the deterministic
ele- ments actually included in the regression) to determine whether 1 is
significantly different from zero. If you cannot reject the null hypothesis that
1 =O, conclude
that the {y,} sequence is /(2). If 1 does differ from zero, go on to determine
whether there is a single unit root by estimating
--------------------------------------------·- •., .1 ............. ''-"'-.!
- - -- -
...................
(4 22) Tourist visits to Spain have a unit root and seasonal unit root. Since seasonality
is a key feature of many economic series, a sizable literature has developed to test
Since there are not two unit roots, you should find that P1 and/or P2 differ from for seasonal unit roots. Befare proceeding, note that the first difference of a sea-
zero. Under the null hypothesis of a single unit root, P1 <O and P2 =O; un er the al- sonal unit root process will not be stationary. To keep matters simple, suppose that
temative hypothesis, (y }1 is stationary, so that P 1 and p2 are both negat1ve. Thus, the quarterly observations of (y1 ) are generated by
estimate (4.22) and use the Dickey-Fuller critica! values to test the null hypothes1s
p 1 =O. If you reject this null hypothesis, conclude that {y 1).is stationary. Yr = Yr-4 + Er
As a rule of thumb, economic series do not need to be d1fferenced more than two
times. However, in the odd case in which at most r unit roots are suspected, the Here, the fourth difference of {y,} is stationary; using the notation of Chapter 2,
procedure is to first estímate we can write 64,}', = E1. Given the initial condition y 0 = y_ 1 = ... = O, the solution for
Ar (:t Ar-]
y, is:
Ll Yr =ªo+ 1-'1Ll Yr-1 +E,
If !J..ry, is stationary, you should find that -2 < 1 <O. If the Dickey:-Fuller critica] y,= E1 + E 1-4 + E 1_ 8 + ...
values for p 1 are such that it is not possible to reject the nul of a ntt root, you ac-
cept the hypothesis that (y,} contains r unit roots. If you reJect th1s null of exactly so that
r unit roots, the next step is to test for r - 1 roots by estimating
1/4 1/4
Y1 - Y1-1 = L
i=O
E4; - L
i=O
E4i-l
If both p 1 and p2 differ from zero, reject the null hypothesis of r - 1 u.nit root .
You can use the Dickey-Fuller statistics to test the null of exactly r - 1 unlt roots if Hence, Ay, equals the difference between two stochastic trends. Since the vari-
the t statistics for p 1 and p2 are both statistically different from zero. If you can re- ance of !J..y1 increases without Iimit as t increases, the (Liy1} sequence is not station-
ject this null, the next step is to form ary. However, the seasonal difference of a unit root process may be stationary. For
¡.,,ry, = ao +
(:t Ar-!
+ (:t F-2
+ (:t Ar-3 +E example, if {y1} is generated by y 1 = y 1_ 1 + E 1, the fourth difference (i.e., Ll4,}' 1 = E 1 +
Yi-1 Yr-1 Yr-1 r E,_ 1 + E,_ 2 + E,_ ) is stationary. However, the variance of the fourth difference is
J-l¡Ll l-'2Ll l-'3Ll 3
larger than the variance of the first difference. The point is that the Dickey-Fuller
As long as it is possible to reject the null hypothesis that the variou& values of the procedure must be modified in order to test for seasonal unit roots and distinguish
p; are nonzero, continue toward the equation: between seasonal versus nonseasonal roots.
There are severa! alternative ways to treat seasonality in a nonstationary se-
quence. The most direct method occurs when the seasonal pattern is purely deter-
ministic. For example, Jet D 1 , D2 , and D 3 represent quarterly seasonal dummy vari-
Continue in this fashion until it is not possible to reject the null of a unit root or ables such that the value of D; is unity in season iand zero otherwise. Estímate the
the y 1 series is shown to be stationary. Notice that this procedure i.s quite diffe:ent regression equation:
from the sequential testing for successively greater numbers of untt ro.ots. It m1ght
seem tempting to test for a single unit root and, if the null cannot be. reJected, go on
(4.23)
to test for the presence of a second unit root. In repeated samples, th1s method tends
to select too few roots.
where y, is the regression residual, so that y 1 can be viewed as the deseasonalized
Seasonal Unit Roots value of y 1•
Next, use these regression residuals to estímate the regression:
You will recall that the best-fitting model for the monthly Spanish tourism data
used in Chapter 2 had the form: p
If a 1 = !, one homogeneous solution to (4.25) is y,= y,_ 1 • As such, the {y,} se- Since A(L) evaluated at ª1 = ª2 = a3 = a4 = 1 is (1 - L4), it is possible to approxi-
quence tends to repeat itself each and every period. This is the case of a nonsea- mate (4.25) by
sonal unit root.
[(l-L4)-L(I +L+L2+L3)(a1
2
- l)+(I-L+L2-L3)L(a2- l)
Define y, such that y, =(a, - 1) and note that (1 + iL)i = ¡- L and (1 _ iL)i = ¡+
If a 2 =
1, one homogeneous solution to (4.25) is y, + y,_ 1 = O. In this instance, the L; hence,
equence tends to replicate itself at 6-month intervals, so that there is a semiannual
unit root. For example, if y,= 1, it follows that Yr+i = -1, Yr+2 = + 1, Yr+J = -1, Yr+4 =
1, etc.
Y1=0 Y2=0 Ys=r6=0
Intercept -2.88 -1.95 3.08
(1 - L4 )y, = y (1 +L.+ L
1
2 + L3 )y,_ 1 -y2 (1 - L + L2 -L3)y,_1 Intercept plus seasonal dummies -2.95 -2.94 6.57
. + (1 - L2)[()'3 - 1'4)i - ('{3 + )'4)L]y 1_1 + E 1 (4.26) Intercept plus seasonal dummies -3.53 -2.94 6.60
plus time trend
To purge the imaginary numbers from (4.26), define Ys and '{6 such. that 2'.J =
-y - iy and 2y =..;.y + iy . Hence, (y - y4 )i = y5 and y3 + '{4 = )'6. Subst1tutmg mto
6 5 4 6 5 3
(4.26) yields 4. EXAMPLES OF THE AUGMENTED DICKEY-FULLER
TEST
The last chapter reviewed the evidence reported by Nelson and Plosser (1982) sug-
gesting that macroeconomic variables are difference stationary rather than trend
Thus, to implement the procedure, use the following steps:
stationary. We are now in a position to consider their formal tests of the hypothesis.
For each series under study, Nelson and Plosser estimated the regression:
STEP 1: Form the following variables:
The chosen lag lengths are reported in the column labeled p in Table 4.2. The es-
STEP 2: Estimate the regression:
timated values a 0, a 2 , and y are reported in columns 3, 4, and 5, respectively.
You might want to modify the form of the equation by including an in- Table 4.2 Nelson and Plosser's Tests for Unit Roots
tercept, deterministic seasonal dummies, and a linear time trend. As in the
4
augmented form of the Dickey-Fuller test, lagged val u es of ( 1 - L )Y1-;
p
ªº ª2 r r+1
may also be included. Perform the appropriate diagnos ic checks t? ens re Real GNP 2 0.819 0.006 -0.175 0.825
that the residuals from the regression equation approx1mate a whtte-1101se (3.03) (3.03) (-2.99)
process. Nominal 2 1.06 0.006 -0.101 0.899
GNP >o•·..
(2.37) (2.34) (-2.32)
STEP 3: Form the t-statistic for the null hypothesis y 1 = O; the appropriate critica] Industrial 6 0.103 0.007 -0.165 0.835
"· · values are reported in Hylleberg et al. ( 1990). If you do not reject the hy- production (4.32) (2.44) (-2.53)
pothesis y, = O, conclude that a 1 = 1, so there is a nonseasonal unit root. Unemployment 4 0.513 --0.000 -0.294* 0.706
Next, form the t-test for the hypothesis y2 =O. If you do not reJeCt the null rate (2.81) (-0.23) (-3.55)
hypothesis, conclude that a 2 = 1 and there is a unit root with a semiannual
frequency. Finally, perform the F-test for the hypothesis Ys = y6 =O. If the Notes: l. p is the chosen lag length. Coefficients divided by their standard errors are in parentheses.
Thus. entries in parentheses represent the /-test for the null hypothesis that a coefficient is
calculated value is less than the critica! value reported 1n Hylleberg et al.
equal to zero. l)nder the null of nonstationary, it is necessary to use the Dickey-Fuller criti-
(1990), conclude that y5 and/or y is zero, so that there is a unit root with ca! values. At the 0.05 significance leve!, the critica! value for the t-statistic is -3.45.
6
an annual frequency. Be aware that the three null hypotheses are not alter- 2. An asterisk (*) denotes significance at the 0.05 leve!. For real and nominal GNP and indus-
natives; a series may have nonseasonal, semiannual, and annual umt roots. trial production, it is not possible to reject the null y= O at the 0.05 leve!. Hence, the unem-
ployment rate appears to be stationary.
At the 5% significance leve!, Hylleberg et al. ( 1990) report that the critica! val- 3. The expression y+ 1 is the estimate of the partía! autocorrelation between y, and y,_ 1 •
ues using 100 observations are:
/ Jesllng ;ar J renas una v111• /\ULJ•j
Recall that the traditional view of business cycles maintains that the GNP and In applied work, p, and P1 usually refer to national price indices in t relative to a
production Jevels are trend stationary rather than difference stationary. An adherent base year, so that e, refers toan index of the domestic currency price of foreign ex-
of this view must assert that y is different from zero; if y= O, the series has a unit change relative to a base year. For example, if the U.S. inflation rate is 10% while
root and is difference stationary. Given the sample sizes uscd by Nelson and the foreign inflation rate is 15%, the dallar price of foreign exchange should fall by
Plosser (1982), at the 0.05 leve!, the critica! value of the t-statistic for the null hy- approximately 5%. The presence of the term d, allows for short-run deviations from
pothesis y= O is -3.45. Thus, only if the estimated value of y is. more than 3.45 PPP.
standard deviations from zero, is it possible to reJeCt the hypothes1s that Y= O. As Because of its simplicity and intuitive appeal, PPP has been used extensively in
can be seen from inspection of Table 4.2. the estimated values of y for real GNP, theoretical models of exchange rate determination. However, as in the well-known
nominal GNP, and industrial production are not statistically different from zero. Dombusch (1976) "overshooting" model, real economic shocks, such as productiv-
Only the unemployment rate has an estimated value of y that is significantly differ- ity or demand shocks, can cause permanent deviations from PPP. For our purposes.
the theory of PPP serves as an excellent vehicle to illustrate many time-series test-
ent from zero at the 0.05 leve!.
ing procedures. One test of long-run PPP is to determine whether d, is stationary.
After ali, if the deviations from PPP are nonstationary (i.e., if the deviations are
Unit Roots and Purchasing-Power Parity
permanent in nature), we can reject the theory. Note that PPP does allow for persis-
Purchasing-power parity (PPP) is a simple relationship linking national price levels tent deviations; the autocorrelations of the {d,} sequence need not be zero. One
and exchange rates. In its simplest form, PPP asserts that the rate of currency depre- popular testing procedure is to define the "real" exchange rate in period t as
ciation is approximately equal to the difference betwecn the domestic and foreign
intlation rates. If p and p* denote the logarithms of the U.S. and foreign price levels
r, = e,+ P7 - p,
ande the logarithm of the dollar price of foreign exchange, PPP implies
e,= p, - P7 + d, Long-run PPP is said to hold if the { r1} sequence is stationary. For example, in
Enders (1988), I constructed real exchange rates for three major U .S. trading part-
ners: Germany, Canada, and Japan. The data were divided into two periods:
where d, represents the deviation from PPP in period t.
January 1960 to April 1971 (representing the fixed exchange rate period) and
January 1973 to November 1986 (representing the flexible exchange rate period).
Each nation's Wholesale Price Index (WPI) was multiplied by an index of the U.S.
dallar price of the foreign currency and then divided by the U.S. WPI. The log of
Figure 4.2 Real exchange rates. the constructed series is the {r,) sequence. Updated values of the real exchange rate
1.6 data used in the study are in the file REAL.PRN contained on the data disk. As an
exercise, you should use this data to verify the results reported below.
A critica! first step in any econometric analysis is to visually inspect the data.
1.4 The plots of the three real exchange rate series during the flexible exchange rate pe-
riod are shown in Figure 4.2. Each series seems to meander in a fashion characteris-
tic of a random walk process. Notice that there is Iittle visual evidence of explosive
1.2
behavior or a deterministic time trend. Consider Figure 4.3 that shows the autocor-
relation function of the Canadian real rate in levels, part (a), and first differences,
part (b). This autocorrelation pattem is typical of ali the series in the analysis. The
autocorrelation function shows little tendency to decay, whereas the autocorrela-
tions of the first differences display the classic pattem of a stationary series. In
0.8 graph (b), al! autocorrelations (with the possible exception of p 11 that equals 0.18)
are not statistically different from zero at the usual significance levels.
To formally test for the presence of a unit root in the real exchange rates, aug-
06
· 1973 1975 1977 1979 mented Dickey-Fuller tests of the form given by (4.19) were conducted. The re-
-Canada -Germany -Japan gression !:,.r, = a0 + yr,_ 1 + 2!:,.r,_ 1 + 3!:,.r,_2 + ... was estimated based on the follow-
ing considerations:
(Jan. 1973 = 1.00)
i esttng ¡or J renas ana u1111 J<.uul>
Figure 4.3 ACF of Canada's real exchange rate. tistic for the null y= O is never more than l.59. The economic interpretation is
Levels. First dJfferences. that real productivity and/or demand shocks have had a pennanent influence on
1 1 1 1 1 1 1 1 1 1 1
real exchange rates.
0.4 2. As measured by the sample standard deviation (SD), real exchange rates were
0.8 - far more volatile in the 1973 to 1986 period than the 1960 to 1971 period.
0.2 Moreover, as measured by the standard error of the estimate (SEE), real ex-
0.6 change rate volatility is associated with unpredictability. The SEE during the
o flexible exchange rate period is severa! hundred times that of the fixed rate pe-
0.4 '--- riod. It seems reasonable to conclude that the change in the exchange rate
(0.2) regime (i.e., the end of Bretton-Woods) affected the volatility of the real ex-
0.2 - change rate.
(0.4) 3. Care must be taken to keep the appropriate null hypothesis in mind. Under the
o 1 2 3 4 5 6 7 8 9 10 11 12
null of a unit root, classical test procedures are inappropriate and we resort to the
1 2 3 4 5 6 7 8 9101112 statistics tabulated by Dickey and Fuller. However, classical test procedures
(a) (b) _(which assume stationary variables) are appropriate under the null that the real
rates are stationary. Thus, the following possibility arises. Suppose that the t-sta-
tistic in the Canadian case happened to be -2.16 instead of -1.42. Using the
1. The theory of PPP does not allow for a deterministic time trend or multiple unit Dickey-Fuller critica! values, you would not reject the null of a unit root; hence,
roots. Any such findings would refute the theory as posited. Although ali the se- you could conclude that PPP fails. However, under the null of stationarity
ries decline throughout the early l 980s and ali rise during the rnid to late l 980s, (where we can use classical procedures), y is more than two standard deviations
there is no a priori reason to expect a structural change. Pretesting the data using from zero and you would conclude PPP holds since the usual t-test becornes ap-
the Dickey-Pantula (1987) strategy showed no evidence of multiple unit roots. plicable.
Moreover, there was no reason to entertain the notion of trend stationarity; the This apparent dilemma commonly occurs when analyzing series with roots
expression a 2 t was not included in the estimating equation. close to unity in absolute value. Unit root tests do not have much power in dis-
criminating between characteristic roots close to unity and actual unit roots. The
2. In both time periods, F-tests and the SBC indicated that p3 through P 24 could be
dilemma is only apparent since the two null hypotheses are quite different. It is
set equal to zero. For Germany and Japan during the flexible rate period, 2 was
statistically different from zero; in the other four instances, p2 could be set equal perfectly consistent to maintain a null that PPP holds and not be able to reject a
to zero. In spite of these findings, with monthly data it is always important to null that PPP fails! Notice that this dilemma does not actually arise for any of
entertain the possibility of a lag length no shorter than 12 months. As such, tests the series reported in Table 4.3; for each, it is not possible to reject a null of
y= O at conventional significance levels.
were conducted using the short lags selected by the F-tests and SBC and using a
lag length of 12 months. 4. Looking at sorne of the diagnostic statistics, we see that ali the F-statistics indi-
cate that it is appropriate to exclude lags 2 (or 3) through 12 from the regression
For the Canadian case during the 1973 to 1986 period, the t-statistic for the null
equation. To reinforce the use of short lags, notice that the first-order correlation
hypothesis that y= O is -1.42 using no lags and -1.51 using ali 12 lags. Gi ven the
coefficient of the residuals (p) is low and the Durbin-Watson statistic close to 2.
critica! value of the 'tµ statistic, it is not possible to reject the null of a unit root in It is interesting that ali the point estimates of the characteristic roots indicate that
the Canadian/U.S. real exchange rate series. Hence, PPP fails for these two nations. real exchange rates are convergent. To obtain the characteristic roots, rewrite
In the 1960 to 1971 period, the calculated value of the t-statistic is -1.59; again, it
the estimated equations in the autoregressíve fonn r, = a0 + a 1 rr-1 or r, = a 0 +
is possible to conclude that PPP faíls. a 1 r,_1 + a 2 r,_ 2 . For the four AR( 1) models, the point estimates of the slope coef-
Table 4.3 reports the results of ali six estimations using the short lag lengths sug- ficients are ali less than unity. In the post-Bretton-Woods period (1973-1986),
gested by the F-tests and SBC. Notice the following properties of the estimated the point esti mates of the characteristic roots of Japan' s second-order process
models: are 0.931 and 0.319; for Germany, the roots are 0.964 and 0.256. However, this
l. For ali síx models, it is not possible to reject the null hypothesís that PPP fails. is precisely what we would expect if PPP fails; under the null of a unit root, we
As can be seen from the last column of Table 4.3, the absolute value of the t-sta- know that y is biased downward.
°V': o
'T 5. PHILLIPS-PERRON TESTS
1
-
11
The distribution theory supporting the Dickey-Fuller tests assumes that the errors
are statistícally independent and have a constant variance. In using this methodol-
ogy' care-mu-st be taken to ensure that the error terrnsare uncorrelated and have
constant variance. Phillips and Perron ( 1988) developed a generalization of the
Dickey-Fuller procedure that allows for fairly mild assumptions conceming the
'° r--
00 -
distribution of the errors.
0 "'
N
To briefly explain the procedure, consider the following regression equations:
(4.27)
and
(4.28)
°o'
<n 00
::::: ó s 00
O"¡ r<l
00
ci ....:
00
ci 0i
- N
O N
o
ci ....: °' where T = number of observations and the disturbance term µ, is such that
1 1
Eµ,= O, but there is no requirement that the disturbance term is serially un-
correlated or homogeneous. Instead of the Dickey-Fuller assumptions of
independence and homogeneity, the Phillips-Perron test allows the dístur-
bances to be weakly dependent and heterogeneously distributed.
e
00
O': Phillips and Perron characterize the distributions and derive test statistics that
o
can be used to test hypotheses about the coefficients di' and ii; under the null hy-
pothesis that the data are generated by
e
cr-, N
oo Z(tat): Used to test the hypothesis af = 1
1
ºoºo Z(tií 1 ): Used to test the hypothesis d 1 = 1
Z(tá 2 ): Used to test the hypothesis á2 =O
Z(<j> 3): Used to test the hypotheses d 1 = 1 and d2 =O
The critica! values for the Phillips-Perron statistics are precisely those given for
the Dickey-Fuller tests. For example, the critica! values for Z(taf) and Z(tá 1) are
e:
"o'. those given in the Dickey-Fuller tables under the headings 'tµ and 't,, respectively.
...., The critica! values of Z(<h) are given by the Dickey-Fuller <jl 3 statistic.
"'
Do not be deceived by the apparent simplicity of Equations (4.27) and (4.28). In or
reality, it is far more general than the type of data-generating process allowable by
the Dickey-Fuller procedure. For example, suppose that the { µr) sequence is gener-
ated by the autoregressive process µr = (C(L)/B(L)]E,, where B(L) and C(L) are
Sr+90 - f r = Pt
polynomials in the lag operator. Given this fonn of the error process, we can write
Equation (4.27) in the forrn used in the Dickey-Fuller tests; that is, where Pr == per unit profit from speculation
E,p, =O
Thus, the efficient market hypothesis requires that for any time period t, the 90-
day forward rate (i.e., fr) be an unbiased estimator of the spot rate 90 days from t.
or
Suppose that a researcher collected weekly data of spot and forward exchange
rates. The data set would consist of the forward rates f r• Ír+?• f r+ 14, ••• and spot
(1 - af L)B(L)y, =a+ C( L)E,
rates s/! sr+?• sr+ 14 , • ••. By using these exchange rates, it is possible to construct the
sequence sr+ 9o - Ír = pf! s,+7 + 9o - Ír+? == Pr+?• sr+ 14+ 9o - Ír+t 4 == Pr+t4• .. .. Normalize
where a B(L) =a
the time period to 1 week, so that y 1 == pf! y 2 == Pr+?• y 3 == Pr+ 14, • •• and consider the
Thus, the Phillips-Perron procedure can be applied to mixed processes in the regression equation (where - is dropped for simplicity):
same way as the Dickey-Fuller tests.
Table 4.4 Returns to Forward Speculation In practice, the choice of the most appropriate test can be difficult since you
./
.. ncver know the true data-generating process. A safe choice is to use both types of
/ ªº ª1 ªz unit roots tests. If they reinforce each other, you can ha ve confidence in the results.
Switzerland -D.117E-2 0.941 -0.11 !E-4 Sometimes, economic theory will be helpful in that it suggests the most appropriate
(0.106E-2) (0.159E-l) (0.834E-5) test. In the Corbae and Ouliaris example, excess returns should be positively corre-
Z(ta 0 ) = -1.28 Z(ta 1 ) = -4.06 Z(ta 2 ) = -1.07 latcd; hence, the Phillips-Perron test is a reasonable choice.
Canada -D.65 lE-3 0.907 O.l 16E-5
(0.409E-3) (0.191E-1) (0.298E-5)
Z(ta 0 ) = -1.73 Z(ta 1 ) = -5.45 Z(ta 2 ) = -1.42 6. STRUCTURALCHANGE
U.K. -D.779E-3 0.937 -D.132E-4
(0.903E-3) (0.163E-l) (0.720E-5) In performing unit root tests, special care must be taken if it is suspected that struc-
Z(ta 0) = -D.995 Z(la 1 ) = -4.69 Z(ta 2 ) = -1.50 tural change has occurred. When there are structural breaks, the various Dickey-
Notes: l. Standard errors are in parentheses. Fuller and Phillips-Perron test statistics are biased toward the nonrejection of a unit
2. Z(ta ) and Z(ta ) are the Phillips-Perron adjusted t-statistics for the hypothescs that a = O root. To explain, consider the situation in which there is a one-time change in the
0 2 0
and a 2 =O. respectively. Z(ta 1 ) is the Phillips-Perron adjusted r-statistic for the hypothesis mean of an otherwise stationary sequence. In the top graph (a) of Figure 4.4, the
that a,= l. {y,) sequen ce was constructed so as to be stationary around a mean of zero for t =
O, ... , SO and then to fluctuare around a mean of 6 for t = 51, ... , 1OO. The se-
and Singleton (1982), Corbae and Ouliaris are able to reject the null of a unit root quence was formed by drawing 100 normally and independently distributed values
in ali series examined. Thus, shocks to the retum from forward exchange market for the {E,) sequence. By setting Yo = O, the next 100 values in the sequence were
speculation do not ha ve permanent effects. generated using the formula:
A second necessary condition for the efficient market hypothesis to hold is that
the intercept term a0 equal zero. A nonzero intercept term suggests a predictable (4.29)
gap between the forward rate and spot rate in the future. If a0 =fa O, on average,
there are unexploited profit opportunities. lt may be that agents are risk-averse or where DL is a dummy variable such that DL =O for t = 1, ... , SO and DL = 3 for t =
profit-maximizing speculators are not fully utilizing ali available inforrnation in de- S 1, ... , 1OO. The subscript L is designed to indicare that the leve! of the dummy
terrnining their forward exchange positions. In absolute value, all the Z(td 0) statis- changes. At times, it will be convenient to refer to the value of the dummy variable
tics are less than the critica! value, so that Corbae and Ouliaris cannot reject the in period tas DL(t); in the example at hand, DL(SO) =O and DL(51) = 3.
null a 0 =O. In the same way, they are notable to reject the null hypothesis of no de- In practice, the structural change may not be as apparent as the break shown in
terrninistic time trend (i.e., that a 2 = 0). The calculated Z(tii2 ) statistics indicate that the figure. However, the large simulated break is useful for illustrating the problem
the estimated coefficients of the time trend are never more than l .SO standard errors of using a Dickey-Fuller test in such circumstances. The straight line shown in the
from zero. figure highlights the fact that the series appears to have a detenninistic trend. In
At this point, you might wonder whether it would be possible to perform the fact, the straight line is the best-fitting OLS equation:
same sort of analysis using an augmented Dickey-Fuller (ADF) test. After ali, Said
and Dickey (1984) showed that the ADF test can be used when the error process is
a moving average. The desirable feature of the Phillips-Perron test is that it allows
for a weaker set of assumptions conceming the error process. Also, Monte Cario
In the figure, you can see that the fitted value of a0 is negative and the fitted
studies find that the Phillips-Perron test has greater power to reject a false null hy-
value of a 2 is positive. The proper way to estímate (4.29) is to fit a simple AR(l)
pothesis of a unít root. However, there is a cost entailed with the use of weaker as- model and allow the intercept to change by including the dummy variable DL.
sumptions. Monte Cario studies have also shown that in the presence of negative However, suppose that we unsuspectingly fit the regression equation:
moving average terrns, the Phillips-Perron test tends to reject the null of a unit root
whether or not the actual data-generating process contains a negative unit root. It is
Yr = ªº + ªtYr-1 +e, (4.30)
preferable to use the ADF test when the true model contains negative moving aver-
age terms and the Phillips-Perron test when the true model contains positive mov- As you can infer from Figure 4.4, the estimated value of at is necessarily biased
ing average terms. toward unity. The reason for this upward bias is that the estimated value of a 1 cap-
1
Figure 4.4 Two models of structural change.
Yr = Yo + ªot +LE¡
i=I
Thus, the misspecified equation (4.30) will tend to rnimic the trend line shown in
Figure 4.4 by biasing a 1 toward unity. This bias in a 1 means that the Dickey-Fuller
6 test is biased toward accepting the null hypothesis of a unit root, even though the
series is stationary within each of the subperiods.
Of course, a unit root process can exhibit a structural break also. The lower graph
4 (b) of Figure 4.4 simulates a random walk process with a structural change occur-
ring at t = 51. This second simulation used the same 100 realizations for the {E,)
sequence and set Yo= 2. The 100 realizations of the {y,} sequence were constructed
2 as
y,=y,_ 1 +E,+Dp
one-time change in the intercept of a trend stationary process. Formally, let the null STEP 1: Detrend the data by estimating the altemative hypothesis and calling the
and altemative hypotheses be residuals y,.
Henc , each val u e of y, is the residual from the regression y, = a 0 + a 2 t +
H 1 :y,=a 0 +yH +µ 1 Dp+t 1 (4.31)
µ2DL +y,. '
A 1 : y,= a0 + a 2 t + µ2DL +E, (432)
//'' The t-statistic for the null a 1 = 1 can then be compared to the appropriate critica! Perron's Test for Structural Changa
value calculated by Perron. In addition, the methodology is quite general in that it Perron ( 1989) used his analysis of structural change to challenge the findings of
can also allow for a one-time change in the drift or one-time change in both the Nelson and Plosser (1982). With the very same variables used, his results indicate
mean and drift For example, it is possible to test the null hypothesis of a permanent that most macroeconomic variables are not characterized by unit root processes.
change magnitude of the drift term versus the altemative of a change in the slope of Instead, the variables appear to be TS processes coupled with structural breaks.
the trend. Here, the null hypothesis is According to Perron (l989), the stock market crash of 1929 and dramatic oil price
increase of 1973 were exogenous shocks having permanent effects on the mean of
most macroeconomic variables. The crash induced a one-time fall in the mean.
Otherwise, macroeconomic variables appear to be trend stationary.
where DL = 1 if t > 't and zero otherwise. With this specification, the {y,} sequence Ali variables in Perron's study (except real wages, stock prices, and the station-
is generated by t:iy, = a0 +E, up to period r and t:iy, = (a0 + µ2 ) +E, thereafter. If µ1 > ary unemployment rate) appeared to have a trend with a constant slope and exhib-
O, the slope coefficient of the deterministic trend increases for t > 't. Similarly, a ited a major change in the level around 1929. In arder to entertain various hypothe-
slowdown in trend growth occurs if µ2 < O. ses concerning the effects of the stock market crash, consider the regression
The altemative hypothesis posits a trend stationary series with a change in the equation:
slope of the trend for t > 't:
k
y,= ªo +µ.,DL +µ.2Dp +a2t+a1Y1-1 + Lf3;f1Y1-i +E1
i=l
where DT = t - 't for t > 't and zero otherwise. For example, suppose that the break
occurs in period S 1 so that 't = 50. Thus, D,,.( 1) through D,,.(50) are all zero, so that
for the first 50 periods, {y,} evolves as y,= a 0 + a 2 t + E,. Beginning with period S 1, where Dp( 1930) = 1 and zero otherwise
D(S l)T = 1, D(S2)T = 2, ... , so that for t > 't, {y,} evolves as y 1 = a0 + (a2 + µ3)t + DL = l for ali t beginning in 1930 and zero otherwise
E,. Hence, DT changes the slope of the deterministic trend line. The slope of the Under the presumption of a one-time change in the mean of a unit root process,
trend is a2 for t:::; 't and a 2 + µ3 for t > 't. a 1 = 1, a 2 =O, and µ2 =O. Under the altemative hypothesis of a permanent one-time
To be even more general, it is possible to combine the two null hypotheses H 1 break in the trend stationary model, a 1 < 1 and µ1 =O. Perron' s {1989) results using
and H2 • A change in both the leve! and drift of a unit root process can be repre- real GNP, nominal GNP, and industrial production are reported in Table 4.5. Given
sented by the !ength of each series, the 1929 crash means that A. is 1/3 for both real and nomi-
nal GNP and egua! to 2/3 for industrial production. Lag lengths (i.e., the values of
k) were determined using t-tests on the coefficients ;· The value k was selected if
the t-statistic on k was greater than 1.60 in abso!ute value and the t-statistic on .
where Dp and DL = the pulse and leve! dummies defined above for i> k was less than 1.60.
The appropriate altemative for this case is First, consider the results for real GNP. When we examine the last column of the
table, it is clear that there is little support for the unit root hypothesis; the estimated
value of a 1 = 0.282 is significantly different from unity at the 1% level. Instead,
real GNP appears to have a deterministic trend (a2 is estimated to be over five stan-
dard deviations from zero). Al so note that the point estimate µ1 = -0.189 is signifi-
Again, the procedure entails estimating the regression A 2 or A 3 . Next using the
can ti y different from zero at conventional levels. Thus, the stock market crash is es-
residuals yl' estímate the regression: timated to have induced a permanent one-time decline in the intercept of real GNP.
These findings receive additional support since the estimated coefficients and
.Yr = ª1Y1-1 +e, their t-statistics are quite similar across the three equations. Ali values of a 1 are
about five standard deviations from unity, whereas the coefficients of the determin-
If the errors from this second regression equation do not appear to be white-
istic trends (a 2) are ali over five standard deviations from zero. Since ali estimated
noise, estímate the equation in the form of an augmented Dickey-Fuller test. The
t-statistic for the null hypothesis a 1 = 1 can be compared to the critica! values calcu- values of µ1 are significant at the 1% leve! and negative, the data seem to support
lated by Perron ( 1989). For A.= 0.5, Perron reports the critica! value of the t-statistic the contention that real macroeconomic variables are TS, except for a structural
at the 5% significance leve! to be -3.96 for H 2 and -4.24 for H3. break resulting from the stock market crash.
.?"
/º"""6JV' •••"---··- •
>' Table 4.5 Retesting Nelson and Plosser's Data for Structural Change Dickey-Fuller tests yield
T A. k
ªº µ.¡ J1i ª2 ª1 y,= -Ü.0233y,_ 1 + E,, t-statistic for y= O: -0.98495
Real GNP 62 0.33 8 3.44 -0.189 -0.018 0.027 0.282 y,= 0.0661 - Ü.0566yl-I + E1, t-statistic for y= O: -1.70630
(5.07) (-4.28) (-0.30) (5.05) (-5.03) Yt = -Ü.0488 -Ü.1522y 1_¡ + 0.004t + E1, t-statistic for y= O: -2.73397
Nominal 62 0.33 8 5.69 -3.60 0.100 0.036 0.471
GNP (5.44) (-4.77) ( 1.09) (5.44) (-5.42) Diagnostic tests indicate that longer lags are not needed. Regardless of the p res-
ence of the constant or the trend, the {y,} sequence appears to be difference stat ion-
Industrial 111 0.66 8 0.120 -0.298 -0.095 0.032 0.322
production (4.37) (-4.56) (-.095) (5.42) (-5.47)
ary. Of course, the problem is that the structural break biases the data toward sug-
gesting a unit root.
Notes: 1.T = number of observations Now, with the Perron procedure, the first step is to estimate the model y f =a o +
A
A = proposition of observations occurring befare the structural change a2t + µ2DL + y,. The residuals from this equation are the detrended {Y,} sequence.
k = lag length
2. The appropriate 1-statistics are in parentheses. For a 0, µ,, µ,.anda,. the null is that the coef- The second step is to test for a unit root in the residuals by estimating yA 1 = a 1 yA t-1 +
ficient is equal to zero. For a 1 , the null hypothesis is a 1 = l. Note that al! estimated values of E1• The resulting regression is:
a 1 are significantly different from unity at the 1% leve!.
y,= 0.4843 y,_ 1 + E 1
In the third step, ali the diagnostic statistics indicate that {E,) approximates a
white-noise process. Finally, the t-statistic for a, = 1 is 5.396. Hence, we can reject
Tests with Simulated Data the null of a unit root and conclude that the simulated data are stationary around a
To further illustrate the procedure, 100 random numbers were drawn to represent breakpoint at t = 51.
the {E,} sequence. By setting y 0 == O, the next 100 values in the {y,} sequence were Sorne care must be used in using Perron's procedure since it assumes that the
drawn as date of the structural break is known. In your own work, if the date of the break is
uncertain, you should consult Perron and Vogelsang (1992). In fact, entire issue of
the July 1992 Journal of Business and Economic Statistics is devoted to break-
points and unit roots.
Lag: 1 2 3 4 5 6 Power
0.94 0.88 0.84 O.SI 0.77 0.72 Formally, the power of a test is equal to the probability of rejecting a false null hy-
pothesis (i.e., 1 minus the probability of a type II error). Monte Carlo simulations
and the ACF of the first differences is: have shown that the power of the various Dickey-Fuller and Phillips-Perron tests
is very low; unit root tests do not have the power to distinguish between a unit root
Lag: 1 2 3 4 5 6 and near unit root process. Thus, these tests will too often indicate that a series con-
-0.002 -0.201 -0.112 0.079 -0.010 -0.061 tains a unit root. Moreover, they have little power to distinguish between trend sta-
·onary and drifting processes. In finite samples, any trend stationary process can be Figure4.6
arbitrarily well approximated by a unit root process, and a unit root process can be
arbitrarily well approximated by a trend stationary process. These results should not Stationary and unit root processes.
be too surprising after examining Figure 4.6. The top graph (a) of the figure shows 3¡-- .,- -r-- .- -.-
a stationary process and unit root process. So as not to bias the results in any partic-
ular direction, the simulation uses the same 100 values of {E,} that were used in 2
Figure 4.4. Using these 100 realizations of {E,}, we constructed two sequences as:
W, = 1 + 0.02! + E1 20 40 60 80 100
X1 = 0.02 + X 1_ 1 + E,/3 - Unit root process -Stationaryprocess
where x 0 = 1 (a)
Here, the trend and drift terms domínate the time paths of the two sequences.
Trend stationary and unit root processes.
Again, ·¡t is very difficult to distinguish between the sequences. This is especially
4r- --,-----,-----r-----r----
true since dividing each realization of E, by 3 acts to smooth out the {x,} sequence.
Just as it is difficult for the naked eye to perceive the differences in the sequences,
it is also difficult for the Dickey-Fuller and Phillips-Perron tests to select the cor-
rect specification. 3
It is easy to show that a trend stationary process can be made to mimic a unit root
process arbitrarily well. As discussed in Chapter 3, it is possible to write the ran-
dom walk plus noise model in the forrn:
2
y,=µ,+ T),
µ,=µ,_¡+E,
where T), and E, are both independent white-noise processes with variances of 0
and 02 , respectively. Suppose that we can observe the (y,} sequence, but
cannot directly observe the separate shocks affecting y,. If the variance of
E, is not zero, {y,} is the unit root process:
1
- Random walk plus drift - Trend stationary process
Yr = µo +LE¡ + T11 (4.33)
i l (b)
On the other hand, if cr 2 = O, then ali values of {E,} are constant, so that: E, = the test. Reduced power means that the researcher may conclude that the process
E,_ 1 = ... = E:0 . To maintain the same notation as in previous chapters, define this ini- contains a unit root when, in fact, none is present. The second problem is that the
tial value of E0 as a 0 • lt follows that µ, = µ0 + a 0t, so {y,} is the trend stationary appropriate statistic (i.e., the 1, 1µ, and 1,) for testing y= O depends on which re-
process: gressors are included in the model. As you can see by examining the three Dickey-
Fuller tables, for a given significance leve!, the confidence intervals around y= O
(4.34) dramatically expand if a drift and time trend are included in the model. This is quite
different from the case in which {y,} is stationary. The distribution of the t-statistic
Thus, the difference between the difference stationary process of (4.33) and trend does not depend on the pre:;ence of the other regressors when stationary variables
process of (4.34) concems the variance of E1• Having observed the composite ef- are used.
fects of the two shocks-but not the individual components r¡, and E 1-we see that The point is that it is important to use a regression equation that mimics the ac-
there is no simple way to determine whether cr 2 is exactly equal to zero. This is par- tual data-generating process. If we inappropriately omit the intercept or time trend,
ticularly true if the data-generating process is such that cr is large relative to cr2 . In the power of the test cango to zero. 10 For example, if as in (4.35), the data-generat-
a finite sample, arbitrarily increasing cr will make it virtually impossible to distin- ing process includes a trend, omitting the term a 2t imparts an upward bias in the es-
guish between a TS and DS series.
It also follows that a trend stationary process can arbitrarily well approximate a
timated value of y. On the other hand, extra regressors increase the absolute value
of the critica! values so that you may fail to reject the null of a unit root. .,
unit root process. If the stochastic portian of the trend stationary process has suffi- To illustrate the problem, suppose that the time series {y,) is assumed to be gen-
cient variance, it will not be possible to distinguish between the unit root and trend erated by the random walk plus drift process:
!
stationary hypotheses. For example, the random walk plus drift model (a difference
astationary process) can be2 arbitrarily well represented by the model y, = a0 +
1y,_ 1 +E, by increasing cr and allowing a
-
the other models. 9
\
. Estímating (4.13) or (4.14), we observe that the 'tµ, 't,, <jl 1 , <jl 2 , and <jl 3 statistics if the trend is significant, retest for the presence of a unit root (i.e., "( == 0)
have the asymptotic distributions tabulated by Dickey and Fuller ( 1979, 1981). using the standardized normal distribution. After all, if a trend is inappro-
The critica\ values of the various statistics depend on sample size. However, the priately included in the estimating equation, the limiting distribution of a2
sample variance of {y,} will be dominated by the presence of a trend or drift. is the standardized normal. If the nUll of a unit root is rejected, proceed no
We saw an example of this phenomenon in Figure 3.12 of Chapter 3. The time further; conclude that the {y,} sequence <loes not contain a unit root.
path of the random walk plus drift model in graph (b) is swamped by the pres- Otherwise, conclude that the {y,} sequence contains a unit root.
ence of the drift term. The fact that the stochastic trend is precisely the same as
in graph (a) has little effect on the overall appearance of the series. Although the Estimate (4.35) without the trend [i.e., estímate a model in the form of
proof is beyond the scope of this text, the 'tµ and 't, statistics converge to the (4.13)]. Test for the presence of a unit root using the 'tµ statistic. If the null
standardized normal. Specifically, is rejected, conclude that the model does not contain a unit root. If the null
hypothesis of a unit root is not rejected, test for the significance of the
constant (e.g., use the 'taµ statistic to test the significance of a0 given "( ==
0). Additional confirmation of this result can be obtained by testing the
t=I hypothesis a0 = y= O using the <P 1 statistic. If the drift is not significant, es-
',; ¡'
if a 0 7' O and a 2 = O timate an equation in the form of (4.12) and proceed to Step 4. If the drift
is significant, test for the presence of a unit root using the standardized
Only when both a0 and a 2 equal zero in the regression equation and data-gen-
erating process do the nonstandard Dickey-Fuller distributions dominate. If the data-
generating process is known to contain a trend or drift, the null hypothesis y= O Figure 4.7 A procedure to test for unit roots.
can be tested using the standardized normal distribution. Estimate L'.y, = a 0 + yy, -1 + ª2 t +I.{3¡L'.y, -i + Er
The direct implication of these four findings is that the researcher may fail to re- No STOP: Conclude
Is y= O? '
ject the null hypothesis of a unit root because of a misspecification conceming the no unit root.
deterministic part of the regression. Too few or too many regressors may cause a
¡Yes: Test for the presence No
failure of the test to reject the null of a unit root. Although we can never be sure of the trend.
that we are including the appropriate deterministic regressors in our econometric
model, there are sorne useful guidelines. Doldado, Jenkinson, and Sosvilla-Rivero Is a 2 =O No Is y= O Yes Conclude {y,) has
given using normal
(1990) suggest the following procedure to test for a unit root when the form of the a unit root.
y= O? distribution?
data-generating process is unknown. The following is a straightforward modifica-
tion of their method: ¡Yes
STEP 1: As shown in Figure 4.7, start with the least restrictive of the plausible Estimate No STOP: Conclude
L'.y, = ªº + yy, - 1 + I.f3J1y, - 1 +e,
i
models (which will generally include a trend and drift) and use the 't, sta- Is y= O?
no unit root.
tistic to test the null hypothesis y= O. Unit root tests ha ve low power to re-
Yes: Test for the presence No
ject the null hypothesis; hence, if the null hypothesis of a unit root is re-
of the drift.
jected, there is no need to proceed. Conclude that the (y,} sequence does
not contain a unit root. Is a 0 =O No lsy =O using Yes Conclude {y,) has
given ' normal
y= O? a unit root.
distribution?
STEP 2: If the null hypothesis is not rejected, it is necessary to determine whether
too many deterministic regressors were included in Step l above. 11 Test
for the significance of the trend term under the null of a unit root (e.g., use
i Yes
the 'tp, statistic to test the significance of a 2). You should try to gain addi- No Conclude
Estimate no unit root.
tional confirmation for this result by testing the hypothesis a2 = "( = O using L'.y,= YY1-1 +I./3¡L'.y, -1+e, Yes
the <j> 3 statistic. If the trend is not significant, proceed to Step 3. Otherwise, Is y= O? Conclude {y,} has
a unit root.
normal. If the null hypothesis of a unit root is rejected, conclude that the where LGDP, == log(GDP,), so that MGDP, is the growth rate of real GDP, and
{y,} sequence does not contain a unit root. Otherwise, conclude that the standard errors are in parentheses.
{y,) sequence contains a unit root. The model is well estimated in that the residuals appear to be white-noise and ali
coefficients are of high quality. For our purposes, the interesting point is that the
STEP 4: Estimate (4.35) without the trend or drift, that is, esti mate a model in the Lllog(GDP,) series appears to be a stationary process. Integrating suggests that
form of (4.12). Use 1to test for the presence of a unit root. If the null hy- log(GDP,) has a stochastic and deterministic trend. The deterministic quarterly
pothesis of a unit root is rejected, conclude that the {y,} sequence does not growth rate of 0.007018--close to a 3% annual rate-appears to be quite reason-
contain a unit root. Otherwíse, conclude that the {y,} sequence contains a able. Now consider the three augmented Dickey-Fuller equations with t-statistics in
unit root. parentheses:
Although the methodology outlined in Figure 4.7 can be very useful, it does have (1.52736)
its problems. Each step in the procedure involves a test that is conditioned on ali RSS = 0.0093334206
i the previous tests being correct; the significance leve! of each of the cascading tests
! is impossible to ascertain. MGDP, = 0.000279LGDP,_ 1 + 0.2633 lMGDP,_ 1 + 0.15443.:V.GDP,_ 2 (4.39)
l The procedure and its inherent dangers are nicely illustrated by trying to deter- (3.82135) (2.93959) (1.72964)
li mine if real gross domestic product (GDP) has a unit root. The data are contained in
the file entitled US.WKl on the data disk; it is a good idea to replicate the results
RSS = 0.0096582756
1
reported below. If we use quarterly data over the 1960:1to1991:4 period, the cor- From (4.37), the t-statistic for the null hypothesis y= O is -2.54309. Critica\ val-
relogram of the logarithm of real GDP exhibits slow decay. However, the first 12
j autocorrelations and partial autocorrelations of the logarithmic first difference are
ues with 125 usable observations are not reported in the Dickey-Fuller table. 12
However, with 100 observations, the critica] value of 1, at the 5% significance leve\
is -3.45; hence, it is not possible to reject the null hypothesis of a unit root given
the presence of the drift term and time trend.
\ ACF of the logarithmic first difference of real GDP:
Lag 1: 0.3093189 0.2316683 0.0572363 0.0556556 --0.0604932 0.0336679 The power of the test may have been reduced due to the presence of an unneces-
7: --0.0476200 --0.1453376 --0.0461222 0.0600729 0.0101171 -0.1695323 sary time trend and/or drift term. In Step 2, you test for the presence of the time
trend given the presence of a unit root. In (4.37), the t-statistic for the null hypothe-
PACF of the logarithmic first difference of real GDP: sis that a 2 =O is 2.27941. Do not Jet this large value fool you into thinking that a 2 is
Lag 1: 0.3093189 0.1503780 -0.0567524 0.0220048 --0.0876589 0.0696282 significantly different from zero. Remember, in the presence of a unit root, you
cannot use the critica] values of a Hable; instead, the appropriate critica] values are
7: --0.0507211 --0.1605942 0.0669240 0.1200468 -0.0353431 -0.2423071
given by the Dickey-Fuller 113, statistic. As you can see in Table 4.1, the critica!
value of 113, at the 5% significance leve] is 2.79; hence, it is reasonable to conclude
Despite the somewhat large partial correlation at lag 12, the Box-Jenkins procc-
that a 2 == O. The <j> 3 statistic to test the joint hypothesis a 2 == y== O reconfirms this re-
dure yields the ARIMA(O, 1, 2) model:
sult. If we view (4.37) as the unrestricted model and (4.39) as the restricted model,
there are two restrictions and 120 degrees of freedom in the unrestricted model; the
MGDP, == 0.007018 + (1+0.262169L + 0.197547L2)E,
<j> 3 statistic is
(0.001144) (0.088250) (0.082663)
/ ' = [(0.0096582756 -0.0089460783)/2] /(0.0089%0783/120) between real business cycles and the more traditional formulations, the nature of
- =4.7766 the trend may have important theoretical implications.
The usual t-statistics and F-statistics are not applicable to determine whether or
Since the critica! value of <!> 3 is 6.49, it is possible to conclude that the restriction nota sequence has a unit root. Dickey and Fuller (1979, 1981) provide the appro-
a2 = y = O is not binding. Thus, proceed to Step 3 where you estímate the model priate test statistics to determine whether a series contains a unit root, unit root plus
without the trend. In (4.38), the t-statistic for the null hypothesis y= O is -1.96196. drift, and/or unit root plus drift plus a time trend. The tests can also be modified to
Since the critica! value of the 'tµ statistic is -2.89 at the 5% significance leve!, the account for seasonal unit roots. If the residuals of a unit root process are heteroge-
null hypothesis of a unit root is not rejected at conventional significance levels. neous or weakly dependent, the altemative Phillips-Perron test can be used.
Again, the power of this test will have been reduced if the drift term does not be- Structural breaks will bias the Dickey-Fuller and Phillips-Perron tests toward
long in the model. To test for the presence of the drift, use the 'tm statistic. The cal- the nonrejection of a unit root. Perron (1989) shows how it is possible to incorpo-
culated t-statistic is 2.05219, whereas the critica! value at the 5% significance leve! rate a known structural change into the tests for unit roots. Caution needs to be ex-
is 2.54. The <!> 1 statistic also suggests that the drift term is zero. Comparing (4.38) ercised since it is always possible to argue that structural change has occurred; each
and (4.39), we obtain year has something different about it than the previous year. In an interesting exten-
sion, Perron and Vogelsang (1992) show how to test for a unit root when the pre-
qi 1 = (0.0096582756 - 0.0093334206)/(0.0093334206/121) cise date of the structural break is unknown.
= 4.21147365 Ali the aforementioned tests have very low power to distinguish between a unit
root and near unit root process. A trend stationary process, can be arbitrarily well
approximated by a unit. root process, and a unit root process can be arbitrarily well
Proceeding to Step 4 yields (4.39). The point is that the procedure has worked it-
approximated by a trend stationary process. Moreover, the testing procedure is con-
self into an uncomfortable comer. The problem is that the positive coefficient for y
founded by the presence of the deterministic regressors (i.e., the intercept and de-
(i.e., the estimated value of y= 0.000279 is almost four standard deviations from
terministic trend). Too many or too few regressors reduce the power of the tests.
zero) suggests an explosive process. In Step 3, it was probably unwise to conclude
An alternative is to take a Bayesian approach and avoid specific hypothesis test-
that the drift term is equal to zero. As you should verify in Exercise 4 at the end of
ing altogether. West and Harrison (1989) provide an accessible introduction to
this chapter, the simple Box-Jenkins ARIMA(O, 1, 2) model with an intercept of
Bayesian analysis in the context of regression analysis. Zellner (1988) discusses
0.007018 performs better than any of the altematives.
sorne of the philosophical underpinnings of the approach and Leamer (1986) pro-
vides a straightforward application to estimating the determinants of inflation. Sims
( 1988) is the standard reference for the Bayesian approach to unit roots.
You should find that the data have the following properties:
D. If your software package can perform Phillips-Perron tests, reestimate part C
using the Phillips-Perron rather than Dickey-Fuller procedure. You should
Observa- Standard find that the t-statistics for y= O are
Series tions Mean Error Minimum Maximum
RCAN 203 0.93041911330 0.05685010789 0.83472000000 1.03930000000 Series No lags 12 lags Trend + 12 lags
RGER 203 1.07711822660 0.15732887872 0.64541000000 1.34009000000 RCAN -1.82209 -1.60022 -1.10882
RJAP 203 1.16689172414 0.13981473422 0.91620000000 1.50787000000 RJAP -1.82886 -2.03795 -2.19736
RUK 203 1.09026873892 0.14524762980 0.70991900000 1.38482000000 RGER -1.65117 -1.85319 -1.88371
A. For each sequence, find the ACF and PACF of (i) the leve! of the real ex- RUK -1.56654 -1.81530 -2.01424
change rate; (ii) the first difference of the real exchange rate; and (iii) the de-
trended real exchange rate. For example, for Canada you should find E. Why do you suppose the results from parts C and D are so similar?
F. Determine whether an intercept term belongs in the regression equations.
ACF:
Determine whether the time trend should be included in the equations.
1: 0.95109959 0.91691527 0.89743824 0.86897993 0.84708012 0.81911904 Determine whether the intercept and time trend belong in the equations.
7: 0.79706303 0.77888188 0.75410092 0.72946966 0.70020306 0.65782904
G. Use the Japanese data to show that you can reject the null hypothesis of two
unit roots.
ACF of the first difference:
1: --0.1562001 --0.1531103 0.0443029 --O.O 152957 0.1053500 -0.0740475 2. The second column in the file labeled BREAK.PRN contains the simulated data
7: --0.0475489 0.0597755 --0.0255490 0.0142241 0.1810469 --0.1151413 u sed in Section 6. You should find:
B. Explain why it is not possible to determine whether the seqence is stationary Observa- Standard
or nonstationary by the simple examination of the ACF and PACF. Series tions Mean Error Mínimum Maximum
C. Including a constant, use Dickey-Fuller and augmented Dickey-Fuller tests Yl 100 0.98802 0.99373 -0.78719 2.654697
(with 12 monthly lags) to test whether the series are unit root processes. You
should find that the t-statistics for y= O are A. Plot the data to see if you can recognize the effects of the structural break.
B. Verify the results reported in Section 6.
Series No lags 12 lags Trend + 12 lags
3. The third column in the file labeled BREAK.PRN contains another simulated
RCAN -1.81305 -1.50810 -0.85650
RJAP -1.81978 -2.30579 -2.61854 data set with a structural break at t = 51. You should find
RGER -1.64297 -2.10719 -2.09955
Observa- Standard
RUK -1.55877 -2.51668 -2.57493 Series tions Mean Error Mínimum Maximum
i. The last entry in the table means that y is more than 2.57 standard deviations Y2 100 2.21080 1.7816 -1.3413 5.1217
from zero. A student's t-table indicates that at the 95% significance leve!, the
critica! value is about 1.96 standard deviations. Why is it incorrect to conclude A. Plot the data. Compare your graph to those of Figures 4.4 and 4.5.
that the null hypothesis of a unit root can be rejected since the calculated 1-sta-
B. Obtain the ACF and PACF of the {Y2,} sequence and first difference of the
tistic is more than 1.96 standard deviations from zero? sequence. Do the data appear to be difference stationary?
11. For each entry reported in the table, what are the appropriate statistics to use (1:,
C. If as in (4.11), a Dickey-Fuller test is performed including a constant and
1:µ, or 1:,) in order to test the null hypothesis of a unit root? trend, you should obtain
··r1· .. ··-··•· · ......, .. · ..•....--· -····· · ·-·
Standard ENDNOTES
Coefficient Estimate Error t-Statistic Significance
s, serial
Constant 0.072445666 0.071447971 1.01396 l. Issues concerning the possibility of higher-order equations, longer lag length
0.31314869 correlation in the residuals, structural change, and the presence of detenninistic compo-
TREND -0.000101438 0.002120465 -0.04784 0.96194514 nents will be considered in due course.
Y2,_ 1 -0.022398360 0.034013944 -0.65851 0.51178974 2. The critica! values are reported in Table A at the end of this text.
3. Suppose that the estimated value of y is -1.9 (so that the estimate of a 1 is -0.9) with a
i. In what ways is this regression equation inadequate? standard error of 0.04. Since the estimated value of y is 2.5 standard errors from -2 [(2
- 1.9)/0.04 = 2.5], the Dickey-Fuller statistics indicate that we cannot reject the null
ii. What diagnostic checks would you want to perform? hy- pothesis y= -2 at the 95% significance leve!. Unless stated otherwise, the
discussion in the text assumes that a 1 is positive. Also note that if there is no prior
D. Estimate the. equation Y2, = a 0 + a 2 t + µ 2 DL and save tite residúals. You
information con- ceming the sign of a 1 , a two-tailed test can be conducted.
should obtain
4. Here we use the notation e,. rather than e,. to highlight that the residuals from such a re-
gression will generally not be white-noise.
Standard 5. For the same reason, it is also inappropriate to use one variable that is trend stationary
Coefficient Estima te Error t-Statistic Significancc and another that is difference stationary. In such instances, "time" can be included as a
Constant 0.4185991020 0.1752103414 2.38912 0.01882282 so-called explanatory variable or the variable in question can be detrended.
6. Tests using lagged changes in the {Ci.y,} sequence are called augmented Dickey-Fu!ler
DUMMY 2.8092054550 0.3097034669 9.07063 0.00000000
tests.
TREND 0.0076752509 0.0053644896 1.43075 0.15571516 7. In their simulations, Dickey and Fuller (1981) found that 90% of the calculated <j> 3 statís-
tícs were 5.47 or Jess and 95% were 6.49 or less when the actual data were generated ac-
E. Perform a pickey-Fuller test on the saved residuals. You should find y, =
cordíng to the null hypothesis.
0.965247ly,_ 1 +e,, where the standard error of a 1 =0.0372. Also perform the 8. Perron's Monte Cario study allows for a drift and deterministic trend. Nonetheless, the
appropriate diagnostíc tests on this regression to ensure that the residuals ap- value of a 1 is biased toward unity in the presence of the deterministic trend.
proximate white noise. You should conclude that the series is a unit root 9. Moreover, Evans and Savin (1981) find that for an AR(l) model, the li.niting distribu-
process with a one-time pulse at t = 51. tion of the autoregressive pararneter has a normal asymptotic distribution (for p < 1).
However, when the parameter is near l, the unit root distribution is a better finite sample
F. Retum to part D but now eliminate the insignificant time trend. How is your
approximation than the asymptotic correct distribution.
answer to part E affected?
10. Campbell and Perron (1991) report that omitting a variable that is growing at least as
fast as any other of the appropriately included regressors causes the power of the tests to
4. The sixth column in the file labeled US.WKl contains the real GDP data used in approach zero.
Section 7. The quarterly series runs from 1960:1to1991:4 and each entry is ex- 11. Using the most general model in Step l is meant to address the problem of omitting irn-
pressed in 1985 dollars. You should find that the properties of the series are such portant deterrnínistic regressors.
that 12. The sample period 1960:1to1991:4 contains 128 total observations. Three observations
are lost by creating the two lagged changes.
Series Name Observations Mean
GDP85 l28 3.220373E+ 12
APPENDIX: Phillips-Perron Test Statistics
A. Plot the logarithm of real GDP. Do the data suggest any particular form of
the trend? Suppose that we observe observations 1, 2, ... , T of the {y,} sequence and esti-
B. Use the Box-Jenkins methodology to verify that an ARIMA(O, 1, 2) model mate the regression equation:
performs better than an ARIMA(2, 1, 0) model.
C. Calculate the various Dickey-Fuller statistics reported in Section 7. Are
there any indications that might be inappropriate to accept the hypothesis In this appendix, we modify our notation slightly for those wishing to read the
a0 = O? work of Phillips and Perron. Fortunately, the changes are minor; simply replace ií 0
D. Repeat the procedure using the Phillips-Perron tests. with µ, ií 1 with a, and ií 2 with · Thus, suppose we have estimated the regression: