Economatrics Postmte 1
Economatrics Postmte 1
HSC - 205
1 / 46
Assumptions of OLS
Homoskedasticity: The variance of all the error terms is same.
Var (ui |X ′ s) = constant.
No Autocorrelation: Error terms are mutually uncorrelated.
corr (ut , ut−i ) = 0. this proplem mainly arises for time series data
No Multicollinearity or perfect/high collinearity.
occurs when two or more explanatory variables in a regression model are highly
correlated
All error terms are independent of all x variables (strict exogeneity).
We treat the explanatory variables as fixed. (deterministic)
cov (ut , X ′ s) = 0 or
Var (ϵi ) = σ for all i. If the error terms do not have constant variance,
they are said to be heteroscedastic.
Heteroscedasticity is likely if there exists
Measurement Error. For Example: in a primary survey some
respondents might provide more accurate responses than others.
3 / 46
Homoscedasticity
Example
Now families with low incomes will spend relatively less on vacations
and further vacations across such families will be small.
4 / 46
Homoscedasticity
OLS Estimation in the presence of Heteroscedasticity
Yi = β1 + β2 Xi + ui
σi2
Its variance is Var (βˆ2 ) = Σxi2
Σxi2 σi2
With heteroscedasticity, var (βˆ2 ) = (Σxi2 )2
5 / 46
Homoscedasticity
Consequences of its non-existence
OLS estimates are no longer BLUE. This means that among all the
unbiased estimators, OLS does not provide the estimate with the
smallest variance. So, our β̂2 is still unbiased and consistent but it is
inefficient.
Formal Tests:
Park Test: Park formalizes the graphical method by suggesting that
σi2 is some function of the explanatory variable Xi .
σi2 = σ 2 Xiβ e vi
var(Y)=var(u)=ESS=e^2
Yi = β1 + β2 X2 i + β3 X3 i + ui
GQ
Goldfeld-Quandt Test: This method is applicable if it is assumed
that the heterosedastic variance (σi2 ) is positively related to one of
the explanatory variables in the model.
Y1 = β1 + β2 Xi + ui
σi2 = σ 2 Xi2
where σ 2 is constant.
So, σi2 would be larger with the larger Xi .
12 / 46
Homoscedasticity
Detecting Heteroscedasticity cont...
RSS2 /df
λ=
RSS1 /df
where df = n−c2 −k
λ follows F-distribution. If λ is greater than Fcritical then the null
hypothesis of no heterogeneity will be rejected.
Remember the ability of GQ test depends on how you choose the c.
13 / 46
Homoscedasticity
Remedies
Remedies depends upon whether σi2 is known and when σi2 is not
known.
When σi2 is known: Generalised Least Square (GLS) or Weighted
Least Square (for simplicity using WLS and GLS interchangeably here).
Yi = β1 X0i + β2 Xi + ui
14 / 46
Homoscedasticity
Remedies cont...
15 / 46
Homoscedasticity
Remedies Example
Y = β1 + β2 X2 + β3 X3 + U (1)
Say, EU 2 = σ 2 Z 2 . =var(Y)
18 / 46
Homoscedasticity
Remedies cont...
1 2 2
= σ Z
Z2
So, our weight i.e. w = 1/z.
19 / 46
Homoscedasticity
Remedies cont...
lnYi = β1 + β2 lnXi + ui
20 / 46
Multicollinearity
What is the nature of Multicollinearity?
22 / 46
Multicollinearity
Example
imperfect corr
perfect corr
X1 X2 X3
10 50 52
15 75 75
18 90 97
24 120 129
30 150 152
23 / 46
Multicollinearity
What is the nature of Multicollinearity?
Note that till now multicollinearity is defined only in the form of linear
relationship among X variables.
But it does not rule out the nonlinear relationships among them. For
example: Consider the following regression model:
Yi = β0 + β1 Xi + β2 Xi2 + β3 Xi3 + ui
But there is a very high correlation between Xi , Xi2 and Xi3 which will
make it difficult to estimate the parameters with great precision.
24 / 46
Multicollinearity
Sources
25 / 46
Multicollinearity
Estimation in the presence of Perfect Multicollinearity
0
=
0
27 / 46
Multicollinearity
Estimation in the presence of Imperfect Multicollinearity
P
where vi is the stochastic error term and x2i vi = 0.
σ2
var (βˆ3 ) = P
x3i2 (1 − r23
2 )
The speed with which variances and covariances increase can be seen
with the variance-inflation factor (VIF) which is defined as:
1
VIF = 2
1 − r23
31 / 46
Multicollinearity
VIF
σ2
var (βˆ2 ) = P 2 VIF
x2i
σ2
var (βˆ3 ) = P 2 VIF
x3i
32 / 46
Multicollinearity
Example One
33 / 46
Multicollinearity
Example Two
35 / 46
Multicollinearity
Remedies
= β1 + β2 Xi + vi
where Xi = X2i + 0.10X3i . One you obtain βˆ2 can estimate βˆ3 from
the postulated relationship.
36 / 46
Multicollinearity
Remedies
37 / 46
Autocorrelation
What is the nature of autocorrelation?
But if the researcher does not use the quadratic term in her model
then it will be an incorrect linear cost curve.
Yt = β1 + β2 Xt + ut (6)
where Yt−1 , Xt−1 and ut−1 are the lagged values of Y, X and u
respectively.
Subtract 6 and 7 we will get first difference operator as
Nonstationarity
In simple terms, a stationary series is a flat looking series i.e. it moves
around a constant.
43 / 46
Autocorrelation
Visualization of Positive and negative Autocorrelation
44 / 46
Autocorrelation
OLS Estimation in the presence of Autocorrelation
Yt = β1 + β2 Xt + ut
Lets assume that the disturbance, or error, terms are generated by the
following mechanism:
ut = ρut−1 + ϵt (8)
σϵ2
var (ut ) = E (ut2 ) =
1 − ρ2
σϵ2
cov (ut , ut−s ) = E (ut ut − s) = ρs
1 − ρ2
46 / 46