Violation of the Classical Assumptions
REGRESSION DIAGNOSTIC I:
MULTICOLLINEARITY
Damodar Gujarati
Econometrics by Example
MULTICOLLINEARITY
One of the assumptions of the classical linear regression model (CLRM) is that there is no exact linear relationship among the regressors.
If there are one or more such relationships among
the regressors, we call it multicollinearity, or
collinearity for short.
Perfect collinearity: An exact linear relationship exists among two or more regressors.
Imperfect collinearity: The regressors are highly (but not perfectly) collinear.
CONSEQUENCES
If collinearity is not perfect, but high, several
consequences ensue:
The OLS estimators are still BLUE, but one or more regression
coefficients have large standard errors relative to the values of
the coefficients, thereby making the t ratios small.
Even though some regression coefficients are statistically
insignificant, the R2 value may be very high.
Therefore, one may conclude (misleadingly) that the true values
of these coefficients are not different from zero.
Also, the regression coefficients may be very sensitive to small
changes in the data, especially if the sample is relatively small.
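To make these consequences concrete, here is a hedged simulation in Python with statsmodels (all data hypothetical): two nearly collinear regressors produce a high $R^2$ alongside small individual t ratios.

```python
# A minimal, hypothetical simulation of the consequence described above:
# near-perfect collinearity inflates standard errors, shrinking t ratios
# even though the overall fit (R^2) remains high.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 50
x2 = rng.normal(size=n)
x3 = x2 + rng.normal(scale=0.05, size=n)      # x3 is nearly identical to x2
y = 1 + x2 + x3 + rng.normal(size=n)

res = sm.OLS(y, sm.add_constant(np.column_stack([x2, x3]))).fit()
print("R2:", round(res.rsquared, 3))          # typically high
print("t ratios:", np.round(res.tvalues, 2))  # slope t ratios typically small
```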
VARIANCE INFLATION FACTOR
For the following regression model:
$$Y_i = B_1 + B_2 X_{2i} + B_3 X_{3i} + u_i$$
It can be shown that:
$$\operatorname{var}(b_2) = \frac{\sigma^2}{\sum x_{2i}^2\,(1 - r_{23}^2)} = \frac{\sigma^2}{\sum x_{2i}^2}\,\mathrm{VIF}$$
and
$$\operatorname{var}(b_3) = \frac{\sigma^2}{\sum x_{3i}^2\,(1 - r_{23}^2)} = \frac{\sigma^2}{\sum x_{3i}^2}\,\mathrm{VIF}$$
where $\sigma^2$ is the variance of the error term $u_i$, $r_{23}$ is the coefficient of correlation between $X_2$ and $X_3$, and lowercase $x$ denotes deviations from sample means.
VARIANCE INFLATION FACTOR (CONT.)
$$\mathrm{VIF} = \frac{1}{1 - r_{23}^2}$$
is the variance-inflating factor.
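As an illustration, the VIF can be computed directly with statsmodels' variance_inflation_factor; a minimal sketch on hypothetical, deliberately collinear simulated data:

```python
# A minimal sketch: computing VIFs with statsmodels on hypothetical,
# deliberately collinear simulated regressors.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(0)
x2 = rng.normal(size=100)
x3 = 0.9 * x2 + rng.normal(scale=0.3, size=100)   # x3 nearly collinear with x2
X = sm.add_constant(np.column_stack([x2, x3]))    # columns: const, X2, X3

for i, name in enumerate(["const", "X2", "X3"]):
    print(name, variance_inflation_factor(X, i))  # VIF_i = 1/(1 - R_i^2)
```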
DETECTION OF MULTICOLLINEARITY
1. High R2 but few significant t ratios
2. High pair-wise correlations among explanatory
variables or regressors
3. High partial correlation coefficients
4. Significant F test for auxiliary regressions (regressions of each regressor on the remaining regressors); see the sketch after this list
5. High Variance Inflation Factor (VIF) and low Tolerance Factor (TOL, the reciprocal of VIF)
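A hedged sketch of detection method 4, the auxiliary regression, again on hypothetical simulated data:

```python
# A minimal sketch of an auxiliary regression: regress one regressor on
# the remaining ones and inspect the F test. Data are hypothetical.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
x2 = rng.normal(size=100)
x3 = 0.9 * x2 + rng.normal(scale=0.3, size=100)

aux = sm.OLS(x2, sm.add_constant(x3)).fit()   # auxiliary regression of X2 on X3
print("R2:", aux.rsquared)
print("F p-value:", aux.f_pvalue)             # significant F flags collinearity
```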
REMEDIAL MEASURES
What should we do if we detect multicollinearity?
Nothing, for we often have no control over the data.
Redefining the model by excluding variables may attenuate the problem, provided we do not omit relevant variables.
Principal components analysis: Construct artificial
variables from the regressors such that they are orthogonal
to one another.
These principal components become the regressors in the
model.
Yet the interpretation of the coefficients on the principal
components is not as straightforward.
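A hedged sketch of the principal-components remedy, assuming scikit-learn is available; the data and dimensions are hypothetical. Note that the estimated coefficients now refer to the components, which is the interpretive cost noted above.

```python
# A hedged sketch: replace collinear regressors with orthogonal principal
# components, then run OLS on the components. Data are hypothetical.
import numpy as np
import statsmodels.api as sm
from sklearn.decomposition import PCA

rng = np.random.default_rng(3)
x2 = rng.normal(size=100)
x3 = 0.9 * x2 + rng.normal(scale=0.3, size=100)
y = 1 + 2 * x2 + 3 * x3 + rng.normal(size=100)

pcs = PCA(n_components=2).fit_transform(np.column_stack([x2, x3]))
res = sm.OLS(y, sm.add_constant(pcs)).fit()   # components are orthogonal
print(res.params)                             # coefficients on PCs, not on X2/X3
```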
CHAPTER 5
HETEROSCEDASTICITY
One of the assumptions of the classical linear regression model (CLRM) is that the variance of $u_i$, the error term, is constant, or homoscedastic. When this assumption is violated, the errors are heteroscedastic.
Reasons are many, including:
The presence of outliers in the data
Incorrect functional form of the regression model
Incorrect transformation of data
Mixing observations with different measures of scale
(such as mixing high-income households with low-
income households).
CONSEQUENCES
If heteroscedasticity exists, several consequences
ensue:
The OLS estimators are still unbiased and consistent, yet the
estimators are less efficient, making statistical inference less
reliable (i.e., the estimated t values may not be reliable).
Thus, estimators are not best linear unbiased estimators
(BLUE); they are simply linear unbiased estimators (LUE).
In the presence of heteroscedasticity, the BLUE estimators
are provided by the method of weighted least squares
(WLS).
DETECTION OF HETEROSCEDASTICITY
Graph histogram of squared residuals
Graph squared residuals against predicted Y (see the sketch after this list)
Breusch-Pagan (BP) Test
White’s Test of Heteroscedasticity
Other tests such as Park, Glejser, Spearman’s rank
correlation, and Goldfeld-Quandt tests of
heteroscedasticity
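A minimal sketch of the residuals-vs-fitted plot, using hypothetical data whose error variance grows with the regressor:

```python
# A minimal sketch of the graphical check on hypothetical data whose
# error standard deviation grows with the regressor.
import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm

rng = np.random.default_rng(4)
x = rng.uniform(1, 10, 200)
y = 2 + 0.5 * x + rng.normal(scale=x, size=200)  # heteroscedastic errors
res = sm.OLS(y, sm.add_constant(x)).fit()

plt.scatter(res.fittedvalues, res.resid**2, s=10)
plt.xlabel("fitted Y")
plt.ylabel("squared residuals")  # a fan/funnel shape suggests heteroscedasticity
plt.show()
```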
BREUSCH-PAGAN (BP) TEST
Estimate the OLS regression and obtain the squared OLS residuals from this regression.
Regress the squared residuals on the k regressors included in the model.
You can also choose other regressors that might have some bearing on the error variance.
The null hypothesis here is that the error variance is homoscedastic – that is, all the slope coefficients are simultaneously equal to zero.
Use the F statistic from this regression, with (k − 1) numerator and (n − k) denominator degrees of freedom, to test this hypothesis.
If the computed F statistic is statistically significant, we can reject
the hypothesis of homoscedasticity. If it is not, we may not reject
the null hypothesis.
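A minimal sketch of the BP test with statsmodels' het_breuschpagan, which runs the auxiliary regression internally; the simulated data are hypothetical:

```python
# A minimal sketch of the Breusch-Pagan test. het_breuschpagan returns the
# LM statistic, its p-value, the F statistic, and the F p-value.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.diagnostic import het_breuschpagan

rng = np.random.default_rng(5)
x = rng.uniform(1, 10, 200)
X = sm.add_constant(x)
y = 2 + 0.5 * x + rng.normal(scale=x, size=200)  # hypothetical heteroscedastic data
res = sm.OLS(y, X).fit()

lm, lm_pval, f_stat, f_pval = het_breuschpagan(res.resid, X)
print("F statistic:", f_stat, "p-value:", f_pval)  # small p rejects homoscedasticity
```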
WHITE’S TEST OF HETEROSCEDASTICITY
Regress the squared residuals on the regressors, the squared terms of these regressors, and the pairwise cross-products of the regressors.
Obtain the $R^2$ value from this regression and multiply it by the number of observations.
Under the null hypothesis of homoscedasticity, this product follows the chi-square distribution, with df equal to the number of regressors (excluding the intercept) in the auxiliary regression.
The White test is more general and more flexible than the
BP test.
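A minimal sketch using statsmodels' het_white, which builds the squares and cross-products internally; the data are hypothetical:

```python
# A minimal sketch of White's test. het_white internally augments the
# regressors with their squares and pairwise cross-products.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.diagnostic import het_white

rng = np.random.default_rng(6)
x2 = rng.uniform(1, 10, 200)
x3 = rng.uniform(1, 10, 200)
X = sm.add_constant(np.column_stack([x2, x3]))
y = 1 + x2 + x3 + rng.normal(scale=x2, size=200)  # variance tied to x2
res = sm.OLS(y, X).fit()

lm, lm_pval, f_stat, f_pval = het_white(res.resid, X)
print("n*R2:", lm, "p-value:", lm_pval)  # small p rejects homoscedasticity
```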
REMEDIAL MEASURES
What should we do if we detect heteroscedasticity?
Use method of Weighted Least Squares (WLS)
Divide each observation by the (heteroscedastic) $\sigma_i$ and estimate the transformed model by OLS (yet the true error variance is rarely known).
If the true error variance is proportional to the square of one of the
regressors, we can divide both sides of the equation by that variable
and run the transformed regression.
Take the natural log of the dependent variable, which often compresses the scale and stabilizes the error variance.
Use White's heteroscedasticity-consistent standard errors, also known as robust standard errors.
These are valid in large samples (see the sketch below).
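A hedged sketch of two of these remedies, assuming (hypothetically) that the error variance is proportional to $X^2$; data are simulated:

```python
# A hedged sketch of two remedies, assuming the error variance is
# proportional to x^2: WLS with weights 1/x^2, and OLS with White's
# robust (heteroscedasticity-consistent) standard errors.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(7)
x = rng.uniform(1, 10, 200)
X = sm.add_constant(x)
y = 2 + 0.5 * x + rng.normal(scale=x, size=200)

wls = sm.WLS(y, X, weights=1.0 / x**2).fit()  # weights = 1/variance
robust = sm.OLS(y, X).fit(cov_type="HC1")     # robust standard errors
print("WLS s.e.:", wls.bse)
print("Robust s.e.:", robust.bse)
```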
CHAPTER 6
AUTOCORRELATION
One of the assumptions of the classical linear regression model (CLRM) is that the covariance between $u_i$, the error term for observation i, and $u_j$, the error term for observation j, is zero.
Reasons for autocorrelation include:
The possible strong correlation between the shock in time t and the shock in time t + 1
More common in time-series data
CONSEQUENCES
If autocorrelation exists, several consequences ensue:
The OLS estimators are still unbiased and consistent.
They are still normally distributed in large samples.
They are no longer efficient, meaning that they are no longer
BLUE.
In most cases standard errors are underestimated.
Thus, the hypothesis-testing procedure becomes suspect, since
the estimated standard errors may not be reliable, even
asymptotically (i.e., in large samples).
DETECTION OF AUTOCORRELATION
Graphical method
Plot the values of the residuals, $e_t$, chronologically
If a discernible pattern exists, autocorrelation is likely a problem
Durbin-Watson test
Breusch-Godfrey (BG) test
DURBIN-WATSON (d) TEST
The Durbin-Watson d statistic is defined as:
$$d = \frac{\sum_{t=2}^{n} (e_t - e_{t-1})^2}{\sum_{t=1}^{n} e_t^2}$$
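A minimal sketch computing d with statsmodels' durbin_watson on hypothetical data whose errors follow an AR(1) scheme:

```python
# A minimal sketch: Durbin-Watson d on residuals from a hypothetical
# regression whose errors follow an AR(1) scheme with rho = 0.7.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.stattools import durbin_watson

rng = np.random.default_rng(8)
n = 100
u = np.zeros(n)
for t in range(1, n):
    u[t] = 0.7 * u[t - 1] + rng.normal()  # AR(1) errors
x = np.arange(n, dtype=float)
y = 1 + 0.5 * x + u

res = sm.OLS(y, sm.add_constant(x)).fit()
print("d =", durbin_watson(res.resid))    # well below 2: positive autocorrelation
```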
DURBIN-WATSON (d) TEST ASSUMPTIONS
Assumptions are:
1. The regression model includes an intercept term.
2. The regressors are fixed in repeated sampling.
3. The error term follows the first-order autoregressive, AR(1), scheme:
$$u_t = \rho u_{t-1} + v_t$$
where ρ (rho) is the coefficient of autocorrelation, a value between −1 and 1.
4. The error term is normally distributed.
5. The regressors do not include the lagged value(s) of the
dependent variable, Yt.
DURBIN-WATSON (d) TEST (CONT.)
Two critical values of the d statistic, dL and dU, called the lower and upper
limits, are established
The decision rules are as follows:
1. If d < dL, there probably is evidence of positive autocorrelation.
2. If d > dU, there probably is no evidence of positive autocorrelation.
3. If dL < d < dU, no definite conclusion about positive autocorrelation.
4. If dU < d < 4 - dU, probably there is no evidence of positive or negative
autocorrelation.
5. If 4 - dU < d < 4 - dL, no definite conclusion about negative autocorrelation.
6. If 4 - dL < d < 4, there probably is evidence of negative autocorrelation.
The d value always lies between 0 and 4.
The closer it is to zero, the greater the evidence of positive autocorrelation; the closer it is to 4, the greater the evidence of negative autocorrelation. If d is about 2, there is no evidence of positive or negative first-order autocorrelation.
BREUSCH-GODFREY (BG) TEST
This test allows for:
(1) Lagged values of the dependent variable to be included as regressors
(2) Higher-order autoregressive schemes, such as AR(2), AR(3), etc.
(3) Moving average terms of the error term, such as $u_{t-1}$, $u_{t-2}$, etc.
The error term in the main equation follows the AR(p) autoregressive structure:
$$u_t = \rho_1 u_{t-1} + \rho_2 u_{t-2} + \cdots + \rho_p u_{t-p} + v_t$$
BREUSCH-GODFREY (BG) TEST (CONT.)
The BG test involves the following steps:
Regress $e_t$, the residuals from our main regression, on the regressors in the model and the p autoregressive terms given in the equation on the previous slide, and obtain $R^2$ from this auxiliary regression.
If the sample size is large, Breusch and Godfrey have shown that $(n - p)R^2 \sim \chi^2_p$
That is, in large samples, (n − p) times $R^2$ follows the chi-square distribution with p degrees of freedom.
Rejection of the null hypothesis implies evidence of autocorrelation.
As an alternative, we can use the F value obtained from the auxiliary
regression.
This F value has p numerator and (n − k − p) denominator degrees of freedom, where k represents the number of parameters in the auxiliary regression (including the intercept term). See the sketch below.
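A minimal sketch of the BG test via statsmodels' acorr_breusch_godfrey, on hypothetical data with AR(1) errors:

```python
# A minimal sketch of the BG test. acorr_breusch_godfrey takes a fitted
# results object and the assumed AR order p (nlags); it returns the LM
# statistic, its p-value, the F statistic, and the F p-value.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.diagnostic import acorr_breusch_godfrey

rng = np.random.default_rng(9)
n = 200
u = np.zeros(n)
for t in range(1, n):
    u[t] = 0.6 * u[t - 1] + rng.normal()  # hypothetical AR(1) errors
x = rng.normal(size=n)
y = 1 + 2 * x + u
res = sm.OLS(y, sm.add_constant(x)).fit()

lm, lm_pval, f_stat, f_pval = acorr_breusch_godfrey(res, nlags=2)
print("LM p-value:", lm_pval)             # small p rejects "no autocorrelation"
```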
REMEDIAL MEASURES
First-Difference Transformation
If autocorrelation is of the AR(1) type, we have $u_t = \rho u_{t-1} + v_t$
Assume ρ = 1 and run the first-difference model (taking first differences of the dependent variable and all regressors)
Generalized Transformation
Estimate ρ by regressing the residuals on their lagged values, then use this estimate to run the transformed (quasi-differenced) regression
Newey-West Method
Generates HAC (heteroscedasticity- and autocorrelation-consistent) standard errors (see the sketch below)
Model Evaluation
Re-examine the model specification, since autocorrelation can be a symptom of a mis-specified model
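A hedged sketch of two of these remedies on hypothetical AR(1) data: Newey-West HAC standard errors, and statsmodels' GLSAR as an iterative implementation of the generalized transformation:

```python
# A hedged sketch of two remedies on hypothetical AR(1) data: Newey-West
# HAC standard errors, and GLSAR (statsmodels' iterative feasible-GLS fit,
# close in spirit to the generalized transformation above).
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(10)
n = 200
u = np.zeros(n)
for t in range(1, n):
    u[t] = 0.6 * u[t - 1] + rng.normal()
x = rng.normal(size=n)
X = sm.add_constant(x)
y = 1 + 2 * x + u

hac = sm.OLS(y, X).fit(cov_type="HAC", cov_kwds={"maxlags": 4})  # Newey-West
glsar = sm.GLSAR(y, X, rho=1).iterative_fit(maxiter=10)  # estimates rho, refits
print("HAC s.e.:", hac.bse)
print("GLSAR rho:", glsar.model.rho)
```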