Lecture 2_regression_multiple_regressors

The document discusses the application of Ordinary Least Squares (OLS) regression with multiple regressors to address omitted variable bias (OVB) and improve causal inference in empirical economics. It highlights the importance of including relevant variables to obtain unbiased estimates and explains the conditions under which OLS estimates can be interpreted as causal. Additionally, it covers the implications of multicollinearity, model selection, and the significance of adjusted R² in evaluating model fit.

Empirical Economics

Jacopo Bonan – [email protected]


Part 1.2 - Linear Regression with Multiple Regressors
University of Brescia

Preview

• OLS with one regressor can give a biased estimate of β1 if E(u|X ) = 0 does not hold
• OLS with multiple regressors can address omitted variable bias and recover causal
effects
• Ceteris paribus condition
• OLS with multiple regressors can improve predictions

Example - California schools

Characteristics that are important drivers of the final score are also likely to be correlated
with STR.
For example, because of the large immigration into California, the % of students who are still
learning English is important for test results and may also be related to class size.
Example - The role of non-native English speakers
• Students who are still learning English might perform worse on standardized tests
than native speakers. Thus, districts with a higher % of non-native speakers might
have, on average, lower scores.
• Districts with many migrants could have larger classes (why?)
• Then, OLS could erroneously produce an estimate of β1 that is too large in magnitude. It
mixes the impact of class size with that of migration; it compares small classes with few
non-native speakers (high performers) vs large classes with many non-native speakers
(low performers)
• The effect of STR is biased!
• What if we know the % of non-English speakers in each district (elpcti = English
learners percent)?
Example - California schools

Correlation of elpct with str and score

corr(str, elpct) = 0.19        corr(testscr, elpct) = −0.64

Omitted Variable Bias

OVB in OLS with one regressor


The Omitted Variable Bias is a systematic bias of the OLS for the causal effect of X on
Y , due to the fact that X is correlated with a variable that has been omitted from the
regression model (is in the error term).
Two conditions must hold for OVB to arise:

1. the regressor X must be correlated with the omitted variable;

2. the omitted variable must be a determinant of Y .

When both conditions hold, assumption A1 is violated and the OLS
estimator is biased. Neither changing the sample, nor increasing the number of
observations, would solve the problem.

What is the size of the bias?
Formula for the OVB
Let us suppose that all the assumptions A2-A3 are verified and let us define
ρXu = corr(Xi , ui ). Then the OLS estimator satisfies:

    β̂1  →p  β1 + ρXu · (σu / σX )        (1)

where the second term, ρXu · (σu / σX ), is the bias.

Size and sign of the bias:

1. the size of the bias depends upon:

   • the correlation between the omitted variable (in u) and the regressor X ;
   • the standard deviation of the error (σu ): how relevant is the OV in explaining Y ?

2. the sign of the bias depends only on whether the correlation between X
   and u is positive or negative:
   • ρXu > 0 ⇒ bias > 0 ⇒ we are overestimating the true β1
   • ρXu < 0 ⇒ bias < 0 ⇒ we are underestimating the true β1

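A minimal R simulation sketch of the OVB formula (all names and values are illustrative, not from the lecture): the regressor x is positively correlated with an omitted variable w that lowers y, so the short regression underestimates the true β1 = 1.

# Minimal OVB simulation (illustrative values, not from the lecture)
set.seed(1)
n <- 10000
w <- rnorm(n)                     # omitted variable
x <- 0.5 * w + rnorm(n)           # regressor, positively correlated with w
y <- 2 + 1 * x - 3 * w + rnorm(n) # true beta1 = 1, w lowers y

coef(lm(y ~ x))      # short regression: estimate of beta1 is biased downward
coef(lm(y ~ x + w))  # long regression: estimate of beta1 is close to 1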
Can I cancel/reduce the bias?

To eliminate the bias we should include all the omitted variables in the model.
A first method to reduce it is to split the sample into groups such that within each
group the omitted variable is kept constant (e.g. districts for which the % of non-English-
speaking students is similar), but also such that the variable of interest
has sufficient variability (e.g. the student-to-teacher ratio).
Using this grouping strategy (here a quartile split) we can compute the difference in
average score between large and small classes within groups of schools with similar elpct
and use a simple t-test to check whether the difference is significant.
Sample splitting

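A sketch of this grouping strategy in R, assuming the CASchools data shipped with the AER package (the 20-student cutoff defining "small" classes is illustrative):

# Quartile split on elpct, then small-vs-large class t-tests within each group
library(AER)
data("CASchools")
CASchools$str   <- CASchools$students / CASchools$teachers
CASchools$score <- (CASchools$read + CASchools$math) / 2
CASchools$elq   <- cut(CASchools$english,
                       breaks = quantile(CASchools$english, probs = 0:4 / 4),
                       include.lowest = TRUE, labels = 1:4)
CASchools$small <- factor(CASchools$str < 20, labels = c("large", "small"))

# t-test of the score difference between small and large classes, within each quartile
by(CASchools, CASchools$elq,
   function(d) t.test(score ~ small, data = d))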
Linear model with multiple regressors

The grouping approach partially solves OVB, but has limitations:

• it does not provide a precise causal effect of class size, holding constant the fraction
of English learners
• it becomes complicated if one wants to hold constant more than one omitted variable
• it becomes impractical as the number of comparison cells increases and the samples
within each cell shrink

One solution is to extend the single-variable OLS model to a multiple regression model.
This allows us to estimate the causal effect on Yi of changing X1i , while holding constant
the other regressors (X2i , X3i , etc.), which are confounding factors causing OVB in the
univariate OLS.
For prediction, the multiple regression model can improve accuracy.

The Multiple Regression Model

The Linear Multiple Regression Model with k regressors is:

Yi = β0 + β1 X1,i + β2 X2,i + · · · + βk Xk,i + ui (2)

• Same components as in the univariate model

• βk = ∆Y /∆Xk , holding constant X1 , ..., Xk−1 → partial effect on Y of Xk , holding
  constant all other factors; expected difference in Yi associated with a unit difference
  in Xk , ceteris paribus
• β0 is the expected value of Y when all the X s are equal to 0

The Multiple Regression Model

The Linear Multiple Regression Model with k regressors is:

Yi = β0 + β1 X1,i + β2 X2,i + · · · + βk Xk,i + ui (3)

and can be written in compact notation as:

    Y = X β + u        (4)

with dimensions Y : n×1, X : n×(k+1), β : (k+1)×1, u : n×1, where:

    Y = (Y1 , Y2 , . . . , Yn )′ ,   β = (β0 , β1 , . . . , βk )′ ,   u = (u1 , u2 , . . . , un )′ ,

and X is the n×(k+1) matrix whose i-th row is (1, X1,i , . . . , Xk,i ).

Alternatively you can write it as:

    Yi = Xi′ β + ui    with    Xi′ = [1, X1,i , . . . , Xk,i ]

The OLS estimator

The minimization problem now is that of choosing a vector β that contains k + 1


parameters (k regressors + the constant). But it is the usual problem:

• the objective function is the sum of the squared deviations;

• the choice variables are the parameter values.

    argmin over b of  Σi [Yi − b0 − b1 X1,i − · · · − bk Xk,i ]²        (5)

and the FOCs lead us to a (linear) system of k + 1 equations in k + 1 unknowns:

    X ′ (Y − Xb) = 0        (6)

OLS general formulation

Solving for b we obtain the OLS estimator:

    β̂ = (X ′ X )−1 X ′ Y        (7)

Note: to compute (X ′ X )−1 we need this product of matrices to be invertible. This
condition is satisfied if X has full rank, i.e. there is no perfect multicollinearity.
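As a sanity check of equation (7), the estimator can be computed directly by matrix algebra and compared with lm(); a minimal sketch on simulated data (illustrative names):

# OLS by matrix algebra vs. lm(), on simulated data (illustrative)
set.seed(42)
n  <- 500
X1 <- rnorm(n); X2 <- rnorm(n)
y  <- 1 + 2 * X1 - 0.5 * X2 + rnorm(n)

X <- cbind(1, X1, X2)                       # n x (k+1) design matrix
beta_hat <- solve(t(X) %*% X, t(X) %*% y)   # (X'X)^{-1} X'y
beta_hat
coef(lm(y ~ X1 + X2))                       # same numbers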
California Schools

Example - California schools

score = 686.0 − 1.10 × str − 0.65 × elpct
        (8.7)    (0.43)      (0.03)

• After including elpct, the coefficient on str changes (roughly halved in magnitude).
• Why such a drastic change in the estimate?
• In the univariate model, β1 was underestimated (negative OVB)
• Now, the OVB is attenuated. Completely removed?

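This regression (and the correlations quoted earlier) can be reproduced with lm(); a sketch assuming the CASchools data from the AER package:

# Sketch assuming the CASchools data from the AER package
library(AER)
data("CASchools")
CASchools$str   <- CASchools$students / CASchools$teachers   # student-teacher ratio
CASchools$score <- (CASchools$read + CASchools$math) / 2     # average test score

cor(CASchools$str, CASchools$english)     # ~ 0.19
cor(CASchools$score, CASchools$english)   # ~ -0.64

fit <- lm(score ~ str + english, data = CASchools)
summary(fit)   # coefficients close to 686.0, -1.10, -0.65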
Assumptions of the Multiple Regression Model

Conditions for ALL OLS estimates to be interpreted as causal become:

A1: the conditional distribution of the errors, given the regressors, has zero mean –
    E(u|X ) = E(ui |X1,i , X2,i , . . . , Xk,i ) = 0.
A2: observations are i.i.d. – (X , Y ) = (X1,i , X2,i , . . . , Xk,i , Yi ) ∼ i.i.d.
A3: large outliers are unlikely – 0 < E[Xj,i⁴], E[Yi⁴] < ∞ ∀ j = 1, . . . , k.
A4: no perfect multicollinearity between regressors – rank(X ) = k + 1

Perfect multicollinearity

The regressors are said to exhibit perfect multicollinearity (or to be perfectly
multicollinear) if one of the regressors is a linear function of some other regressors.

Perfect multicollinearity
Formally, we have perfect multicollinearity if a regressor j can be expressed as:

    Xj,i = Σ_{h ≠ j} αh Xh,i    ∀ i = 1, . . . , n

Assumption A4 above requires that this is not the case.

Note: most modern software automatically checks for this and drops one of the
redundant regressors.

Example - California schools
The dummy variable trap
Suppose we partition the school districts into three categories: rural, suburban, urban,
and we create three dummy variables (i.e. Xrural , Xsuburban , Xurban ) with value 1 if
district i is of that specific category, and value 0 if not.
Imagine we want to estimate:

    score = β0 + β1 rural + β2 suburban + β3 urban

However, because every district belongs to exactly one of the three categories, we will have
that:

    rurali + suburbani + urbani = 1    ∀ i

but the vector of ones is a regressor already included in the model (associated with the
constant). Thus, to estimate this model, we'll need to drop either one of the three
dummy variables (which becomes the reference category) or the constant. For example:

    score = β0 + β1 rural + β2 suburban

However, this changes the interpretation of the coefficients β0 , β1 , β2 .
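A short R illustration of the trap on simulated data (illustrative names): lm() reports an NA coefficient when the constant and all three dummies are included, while factor coding picks a reference category automatically.

# Dummy variable trap, on simulated data (illustrative)
set.seed(7)
type  <- sample(c("rural", "suburban", "urban"), 200, replace = TRUE)
score <- 650 + 10 * (type == "rural") + 5 * (type == "suburban") + rnorm(200, sd = 5)

rural    <- as.numeric(type == "rural")
suburban <- as.numeric(type == "suburban")
urban    <- as.numeric(type == "urban")

coef(lm(score ~ rural + suburban + urban))  # one coefficient is NA: redundant column dropped
coef(lm(score ~ factor(type)))              # factor coding: one category is the reference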


Imperfect Multicollinearity

When two (or more) of the regressors are highly correlated, imperfect
multicollinearity arises.
Imperfect multicollinearity does not pose any problem for the theory of the OLS
estimators. However, if the regressors are imperfectly multicollinear, the coefficient
on at least one individual regressor will be imprecisely estimated – in particular, it will
have a large sampling variance.

Example - California schools

Consider the regression of score on str and elpct. Suppose we were to add also the
percentage of the district's residents who are first-generation immigrants (pctimm).
First-generation immigrants often speak English only as a second language, so the
variables elpct and pctimm will be highly correlated and it will be difficult to precisely
estimate their individual effects. In particular, there will be little information on test
scores in schools with low elpct and high pctimm, and vice versa. This leads to a larger
variance (less precision) of the estimator.

Control variables and causality

In the multiple regression we are not interested in the causal effects of all the variables.
Some of them might be there only to avoid OVB in the causal interpretation of the
variables of interest. Thus we have:

• variables of interest (X ): for which we aim to estimate the causal effect;

• control variables (W ): included only to reduce the omitted variable bias; no interest in
their causal effects

This allows us to relax assumption A1, which becomes:

Conditional Mean Independence
A1-bis: the error u has a conditional mean that does not depend on X , given W .
Formally: E(u|W , X ) = E(u|W ), or in extended form
E(ui |X1,i , X2,i , . . . , Xk,i , W1,i , . . . , Wr,i ) = E(ui |W1,i , . . . , Wr,i ).

The conditional mean of u given W does not change even after taking into account the
knowledge about X . Thus, when controlling for W , X becomes uncorrelated with u (as
if it were randomly assigned).
If A1-bis holds, the coefficients for the variables of interest (X ) have a causal
interpretation, while those for the controls (W ) can be biased.
Goodness of Fit in the Multiple Regression

Similarly to the single-regressor case, we can measure the quality of the model by means
of the SER and the R².
The standard error of the regression is:

    SER = sû = √( Σi ûi² / (n − k − 1) ) = √( SSR / (n − k − 1) )        (8)

The denominator adjusts for the degrees of freedom lost due to the estimation of the k + 1
parameters. In large samples, this adjustment is negligible.
The R² is as in the univariate case:

    R² = ESS / TSS = 1 − SSR / TSS        (9)

where ESS = Σi (Ŷi − Ȳ)² and TSS = Σi (Yi − Ȳ)².

However, the R² increases (by construction) every time we add a new variable to
our model, because any added variable contributes to decreasing the SSR.

Adjusted R 2

To correct for this issue, it is better to use the adjusted R² (often denoted R̄²):

    R̄² = 1 − [(n − 1)/(n − k − 1)] · (SSR/TSS)        (10)

When adding a new regressor (k increases) the formula for the R̄² entails a trade-off:

• it reduces the ratio SSR/TSS
• it increases the ratio (n − 1)/(n − k − 1)

so the decision (whether or not to add the regressor) depends on which effect dominates.
Notes: R̄² is always less than R² and can take negative values.

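As a quick check of formula (10), a sketch comparing summary()'s R² and R̄² with the manual computation, on simulated data (illustrative names):

# R-squared and adjusted R-squared: summary() vs. formula (10), simulated data
set.seed(3)
n  <- 200; k <- 2
x1 <- rnorm(n); x2 <- rnorm(n)
y  <- 1 + 0.8 * x1 + rnorm(n)          # x2 is irrelevant on purpose

fit <- lm(y ~ x1 + x2)
SSR <- sum(resid(fit)^2)
TSS <- sum((y - mean(y))^2)

c(manual_R2    = 1 - SSR / TSS,
  manual_adjR2 = 1 - (n - 1) / (n - k - 1) * SSR / TSS)
c(summary(fit)$r.squared, summary(fit)$adj.r.squared)   # should match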
Goodness of Fit and Model Selection

A Note of Caution
When choosing the most appropriate model (among a set of models), the R² or the R̄²
should not be the only criterion.
A high value of the R² only means that your regression model explains a large share of
the variability in Y.
It does not imply that:

• you have an unbiased estimator for the causal effect (i.e. that you have eliminated all
possible OVB);
• the variables in the model are statistically significant.

The sampling distribution of β̂
Properties of the OLS estimator
As for the single-regressor model, under A1-A4 the OLS estimator is unbiased and consistent.

Formally, if A1-A4 hold true we have:

    E(β̂) = β                                           (11)
    Var(β̂) = σ²β̂ = (X ′X )⁻¹ (X ′ Σu X ) (X ′X )⁻¹      (12)
    β̂ →p β                                             (13)

where:

    Σu = (1/(n − k)) E(uu′)                             (14)

which (being unobservable) can be estimated as Σ̂u = (1/(n − k)) û û′. In most applications
we do not impose homoskedasticity as an additional assumption, and we compute
heteroskedasticity-robust SEs (the software does it!).
In large samples, thanks to the CLT, the OLS estimator is approximately distributed as a
multivariate Normal, and

    β̂j →d N(βj , σ²β̂j )    ∀ j = 0, . . . , k          (15)
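In practice, heteroskedasticity-robust standard errors can be computed with the sandwich/lmtest packages (loaded by AER); a sketch, again assuming the CASchools data from the AER package:

# Heteroskedasticity-robust (HC1) standard errors, assuming the CASchools setup above
library(AER)            # also loads lmtest and sandwich
data("CASchools")
CASchools$str   <- CASchools$students / CASchools$teachers
CASchools$score <- (CASchools$read + CASchools$math) / 2

fit <- lm(score ~ str + english, data = CASchools)
coeftest(fit, vcov = vcovHC(fit, type = "HC1"))   # robust t-tests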
Hypothesis testing in the multiple regression model

We can rewrite:

    (β̂j − E[β̂j ]) / SE(β̂j )  ∼  N(0, 1)    ∀ j = 0, . . . , k

Which implies that:

• hypothesis testing on a single element βj of the vector β can be carried out using
the usual t-test;
• a 95% confidence interval for a single element βj of the vector β can be computed as
β̂j ± 1.96 SE(β̂j ).

Note: because Var(β̂) also contains the covariances between the different estimates,
the t-tests on single elements of the vector β are not independent. Therefore
including/omitting a regressor will change the outcome of every single t-test.

Hypothesis testing in the multiple regression model

Example - California schools


All modern software reports the relevant information, and we'll get something similar to:

score = 686.0 − 1.10 × str − 0.65 × elpct
        (8.7)    (0.43)      (0.03)

Single regressor vs. multiple regressors:

Hypothesis testing in the multiple regression model
Example - California schools
What if we add expenditure per pupil as a further control?

The coefficient of STR becomes −0.29 (0.48) → it becomes non-significant, which flips the
policy implication with respect to the beginning. However, STR and PPexpenditure
are correlated (imperfect multicollinearity) – hence one may want to test jointly that
β1 = 0 and β2 = 0.
Testing joint hypothesis

In a multiple regression model, we can also test for joint hypotheses.


H0 : β1 = β1,0 , β2 = β2,0 , . . . , βq = βq,0
H1 : at least one of the q restrictions in H0 is not true.
Why using only single t-tests is never a good idea
Let's set q = 2 and let's use separate t-tests at the 5% level on each restriction. If
the t-tests are independent, then we do not reject H0 if and only if:

    |t1 | ≤ 1.96 and |t2 | ≤ 1.96

Therefore:

    PrH0 (|t1 | ≤ 1.96 and |t2 | ≤ 1.96) = 0.95² = 90.25%

and the test size (the probability of rejecting H0 when it is true) is 9.75%, not 5%.
Conclusion: you make many more type-I errors than you would expect (see the simulation
sketch below).

The problem worsens:

• if q increases;
• if regressors are correlated
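A minimal simulation sketch of the 9.75% figure under independent t-statistics (illustrative, not from the lecture):

# Size of the "two separate t-tests" procedure under H0 (independent statistics)
set.seed(123)
R  <- 100000
t1 <- rnorm(R)                        # under H0, t-stats are approx. N(0,1)
t2 <- rnorm(R)
reject <- abs(t1) > 1.96 | abs(t2) > 1.96
mean(reject)                          # close to 0.0975, not 0.05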
The F-statistic on two restrictions

Definition for q = 2
If q = 2, we can define the F-statistic as:

    F = (1/2) · (t1² + t2² − 2 ρ̂t1,t2 t1 t2 ) / (1 − ρ̂²t1,t2 )        (16)

where ρ̂t1,t2 is the estimated correlation between t1 and t2. The F-stat therefore takes into
account the correlation between the different t-stats.
If the single t-stats are uncorrelated (ρ̂t1,t2 = 0), the F-stat is simply the average
of the two squared t-statistics:

    F = (1/2)(t1² + t2²)

The F-stat is distributed as an F2,∞. If its value is "sufficiently large", we reject H0.

F-statistic

• Reject H0 if F > Fα , where Fα is the critical value for a given significance level α

Testing multiple coefficients

Sometimes a single restriction might involve more than one parameter. For example,
economic theory might suggest a restriction stating that two parameters have the same
value (e.g. β1 = β2 , or equivalently β1 − β2 = 0).
In this case a single restriction (q = 1) involves several estimated parameters.
To test this restriction we can transform the regression model into a form in which the
t-test refers to a single parameter.
Example - equality restriction
Let's suppose our model is

    Yi = β0 + β1 X1,i + β2 X2,i + ui        (17)

and we want to test H0 : β1 = β2 vs. H1 : β1 ̸= β2 .

By adding and subtracting β2 X1,i we get:

    Yi = β0 + (β1 − β2 ) X1,i + β2 (X1,i + X2,i ) + ui
       = β0 + γ1 X1,i + β2 V1,i + ui

with γ1 = β1 − β2 and V1,i = X1,i + X2,i , and it is then sufficient to run a t-test of H0 : γ1 = 0.
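A sketch of both routes in R on simulated data (illustrative names): the reparametrised t-test and the equivalent test via linearHypothesis() from the car package:

# Testing beta1 = beta2: reparametrised t-test vs. linearHypothesis(), simulated data
library(car)
set.seed(11)
n  <- 300
X1 <- rnorm(n); X2 <- rnorm(n)
y  <- 1 + 0.5 * X1 + 0.5 * X2 + rnorm(n)   # true beta1 = beta2

V1 <- X1 + X2
summary(lm(y ~ X1 + V1))                    # t-test on X1 tests gamma1 = beta1 - beta2 = 0

fit <- lm(y ~ X1 + X2)
linearHypothesis(fit, "X1 = X2")            # same restriction, tested as an F-test with q = 1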


F-test in R

In R, using the linearHypothesis() function from the car package:

# heteroskedasticity-robust F-test of H0: the coefficients on STR and expenditureK are both zero
library(car)
linearHypothesis(model, c("STR = 0", "expenditureK = 0"), white.adjust = "hc1")

Model specification

How to decide what variables to include in a regression

1. Identify the variable of interest (X )


2. Think about OVB: what variables are we omitting that could bias the β attached to the
variable of interest?
3. Include those omitted variables (or proxies) as control variables, after checking their
correlation with Y and X. This will be the base specification.
4. Specify a range of plausible alternative models, which include additional candidate
control variables: alternative specifications.
5. Estimate your base model and the plausible alternative specifications ("sensitivity
checks"):
• do the candidate variables affect the coefficient of interest (β)?
• are the candidate variables significant?
• don't just try to maximize R²: the objective is an unbiased estimator of the
coefficient of interest (the causal effect!), not the best fit
