Econometrics Lecture 2: Simple Regression
▶ The simple regression model can be used to study the relationship between two variables.
THE SIMPLE REGRESSION MODEL
INTERPRETATION OF THE SIMPLE LINEAR REGRESSION MODEL
▶ The simple linear regression model is rarely applicable in practice but its discussion is
useful for pedagogical reasons.
EXAMPLES
Linearity: a one-unit change in x has the same effect on y, regardless of the initial value of x. This is often unrealistic. For example, in the wage-education example, we might want to allow for increasing returns: the next year of education has a larger effect on wages than the previous year did.
CAUSAL INTERPRETATION
POPULATION REGRESSION FUNCTION (PRF)
▶ This means that the average value of the dependent variable can be expressed as a linear
function of the explanatory variable.
▶ It is important to understand that the equation tells us how the average value of y changes
with x; it does not say that y equals β0 + β1 x for all units in the population.
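In equation form (standard notation, consistent with the β0 and β1 used throughout these slides):

```latex
% Population regression function: the conditional mean of y is linear in x
E(y \mid x) = \beta_0 + \beta_1 x
% Individual outcomes deviate from this mean by an error term u with E(u \mid x) = 0
y = \beta_0 + \beta_1 x + u
```

The second line makes the point above explicit: individual units differ from the line by the error u, so only the average of y, not every y, lies on the line.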
GRAPHICAL FORM
This graph represents the linear relationship between the dependent variable y and the independent variable x for the entire population. It is important to note that we do not observe the entire population. Instead, we typically work with a sample.
DERIVING THE ORDINARY LEAST SQUARES (OLS) ESTIMATES
The residual for observation i is the difference between the actual yi and its fitted value: ûi = yi − ŷi = yi − β̂0 − β̂1 xi.
We choose β̂0 and β̂1 to make the sum of squared residuals as small as possible; this gives the same result as the method of moments.
▶ OLS estimators: β̂1 = Σi (xi − x̄)(yi − ȳ) / Σi (xi − x̄)² and β̂0 = ȳ − β̂1 x̄
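The closed-form OLS estimators can be computed by hand. A minimal Python sketch on hypothetical toy data (the lecture's own scripts are in R on Moodle; the numbers here are made up for illustration):

```python
# OLS by hand: slope = sample covariance of (x, y) / sample variance of x,
# intercept = ybar - slope * xbar. Toy data, hypothetical.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 8.1, 9.9]
n = len(x)
xbar = sum(x) / n
ybar = sum(y) / n
cov_xy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
var_x = sum((xi - xbar) ** 2 for xi in x)
beta1 = cov_xy / var_x            # slope estimate
beta0 = ybar - beta1 * xbar       # intercept estimate
print(beta0, beta1)
```

The same two formulas are what any regression routine (for example R's `lm`) computes in the simple-regression case.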
Interpreting β1
1. The more x and y move together (high covariance), the larger β̂1: covariance measures the strength of the linear association between x and y, so a strong positive association yields a larger slope estimate, reflecting that changes in x strongly track changes in y.
2. Holding the covariance fixed, a larger variance of x yields a smaller β̂1: the slope is the covariance divided by the variance of x, so the same co-movement spread over more variation in x translates into a smaller estimated effect per unit change in x.
R stats: Let’s see how to calculate these coefficients in R. Open on Moodle R scripts / Lecture 2
- Simple OLS
▶ OLS fits a regression line through the data points as well as possible
EXAMPLE OF A SIMPLE REGRESSION
Fitted regression
▶ Causal interpretation?
PROPERTIES OF OLS ON ANY SAMPLE OF DATA
R stats: Let’s see if we can confirm the OLS properties in R. Open on Moodle R scripts /
Lecture 2 - Simple OLS
GOODNESS OF FIT
Measures of variation:
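The three standard measures (total, explained, and residual sums of squares) and R-squared can be sketched in Python on hypothetical toy data (the lecture's scripts are in R; the identity SST = SSE + SSR and R² = SSE/SST are the standard definitions):

```python
# Decomposing the variation in y: SST = total sum of squares,
# SSE = explained sum of squares, SSR = residual sum of squares,
# R^2 = SSE / SST. Toy data, hypothetical.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 8.1, 9.9]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / \
     sum((xi - xbar) ** 2 for xi in x)
b0 = ybar - b1 * xbar
yhat = [b0 + b1 * xi for xi in x]                       # fitted values
SST = sum((yi - ybar) ** 2 for yi in y)                 # total variation
SSE = sum((yh - ybar) ** 2 for yh in yhat)              # explained variation
SSR = sum((yi - yh) ** 2 for yi, yh in zip(y, yhat))    # residual variation
r2 = SSE / SST
print(round(SST, 3), round(SSE + SSR, 3), round(r2, 3))
```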
So a low R-squared here means that the single explanatory variable x explains only a small part of the variation in salaries.
Caution: A high R-squared does not necessarily mean that the regression has a causal
interpretation!
INCORPORATING NONLINEARITIES: SEMI-LOGARITHMIC FORM
We might want to allow for increasing returns: the next year of education has a larger effect on
wages than did the previous year.
Fitted regression
▶ The log-log form postulates a constant-elasticity model, whereas the semi-log form assumes a constant semi-elasticity
▶ Linear regression must be linear in the parameters (coefficients), but the variables can undergo nonlinear transformations such as x^2, log(x), or e^x.
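A Python sketch of the semi-log idea on made-up data (not the lecture's wage data): regressing log(y) on x turns the slope into an approximate proportional effect, so 100·β1 is roughly the percentage change in y per one-unit change in x.

```python
import math

# Toy data where y grows exactly 8% per unit of x (hypothetical numbers).
x = [0, 1, 2, 3, 4, 5]
y = [10.0 * 1.08 ** xi for xi in x]

# Transform the dependent variable, then run ordinary OLS on log(y).
ly = [math.log(yi) for yi in y]
n = len(x)
xbar, lbar = sum(x) / n, sum(ly) / n
b1 = sum((xi - xbar) * (li - lbar) for xi, li in zip(x, ly)) / \
     sum((xi - xbar) ** 2 for xi in x)

# In log(y) = b0 + b1*x, the slope recovers log(1.08) ~ 0.077,
# i.e. about a 7.7% (~8%) increase in y per unit of x.
print(round(b1, 3))
```

Note that the regression is still linear in the parameters; only the variable y was transformed.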
EXPECTED VALUES AND VARIANCES OF THE OLS ESTIMATORS
▶ The estimated regression coefficients are random variables because they are calculated
from a random sample
▶ The question is what the estimators estimate on average and how large their variability is in repeated samples
STANDARD ASSUMPTIONS FOR THE LINEAR REGRESSION MODEL
ASSUMPTIONS FOR THE LINEAR REGRESSION MODEL (CONT.)
Interpretation of unbiasedness:
▶ The estimated coefficients may be smaller or larger, depending on the sample resulting
from a random draw. However, on average, they will be equal to the values that
characterize the true relationship between y and x in the population.
▶ “On average” means across repeated sampling, i.e. if drawing a random sample and estimating the model were repeated many times.
▶ In a given sample, estimates may differ considerably from the true values. But if we repeated the estimation over many different samples, we would obtain the true values on average.
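This repeated-sampling idea can be demonstrated with a small Monte Carlo simulation (a sketch with hypothetical parameters, not part of the lecture's R scripts):

```python
import random

random.seed(1)

# Unbiasedness sketch: the true model is y = 1 + 2x + u with a standard
# normal error u (hypothetical parameters). Each replication draws a fresh
# sample and re-estimates the OLS slope; the estimates average close to 2.
def ols_slope(x, y):
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    return sum((a - xbar) * (b - ybar) for a, b in zip(x, y)) / \
           sum((a - xbar) ** 2 for a in x)

slopes = []
for _ in range(2000):
    x = [random.uniform(0, 10) for _ in range(50)]
    y = [1 + 2 * xi + random.gauss(0, 1) for xi in x]
    slopes.append(ols_slope(x, y))

mean_slope = sum(slopes) / len(slopes)
print(mean_slope)  # close to the true value 2 on average
```

Any single replication can be noticeably off from 2; it is the average across replications that hits the true value.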
Assumption SLR.5 (homoskedasticity) plays no role in showing that β̂0 and β̂1 are unbiased. We add Assumption SLR.5 because it simplifies the variance calculations and because it implies that OLS has certain efficiency properties, which we will see next class.
Homoskedasticity does not hold: more educated people likely have a wider variety of job
opportunities, which could lead to more wage variability at higher levels of education. People
with low levels of education have fewer opportunities and often must work at the minimum wage;
this reduces wage variability at low education levels.
Conclusion:
▶ The sampling variability of the estimated regression coefficients increases with the variability of the unobserved factors and decreases with the variation in the explanatory variable.
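This tradeoff appears directly in the textbook sampling-variance formula for the slope under assumptions SLR.1 to SLR.5 (a standard result, stated here for reference):

```latex
\operatorname{Var}(\hat{\beta}_1) = \frac{\sigma^2}{\sum_{i=1}^{n} (x_i - \bar{x})^2}
```

The error variance σ² (the unobserved factors) sits in the numerator, and the total variation in x sits in the denominator, exactly matching the conclusion above.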
ESTIMATING THE ERROR VARIANCE
The formulas on the previous slides isolate the factors that contribute to Var(β̂1) and Var(β̂0). But these variances are unknown, because σ² is an unknown population parameter. Nevertheless, we can use the data to estimate σ², which in turn allows us to estimate Var(β̂1) and Var(β̂0).
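A Python sketch of the estimation step on hypothetical toy data: the unbiased estimator of σ² divides the sum of squared residuals by n − 2 (two degrees of freedom are used up estimating β̂0 and β̂1), and its square root feeds into the standard error of the slope.

```python
import math

# Estimating sigma^2 and the standard error of the slope. Toy data, hypothetical.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 8.1, 9.9]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
sst_x = sum((xi - xbar) ** 2 for xi in x)
b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / sst_x
b0 = ybar - b1 * xbar

# Sum of squared residuals, then the unbiased error-variance estimator SSR/(n-2).
ssr = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
sigma2_hat = ssr / (n - 2)
se_b1 = math.sqrt(sigma2_hat / sst_x)   # standard error of the slope
print(round(sigma2_hat, 4), round(se_b1, 4))
```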
▶ The estimated standard deviations of the regression coefficients are called “standard
errors.” They measure how precisely the regression coefficients are estimated.
REGRESSION ON A BINARY EXPLANATORY VARIABLE
▶ This regression allows the mean value of y to differ depending on whether x = 0 or x = 1
▶ Note that the statistical properties of OLS are unchanged when x is binary
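With a binary x, the OLS estimates have a simple closed form: the intercept is the mean of y in the x = 0 group and the slope is the difference in group means. A Python sketch on made-up numbers:

```python
# Regression on a binary explanatory variable (toy data, hypothetical):
# b0 equals the mean of y when x = 0, and b1 equals the difference in
# sample means of y between the x = 1 and x = 0 groups.
x = [0, 0, 0, 1, 1, 1]
y = [3.0, 4.0, 5.0, 7.0, 8.0, 9.0]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / \
     sum((xi - xbar) ** 2 for xi in x)
b0 = ybar - b1 * xbar

mean0 = sum(yi for xi, yi in zip(x, y) if xi == 0) / 3
mean1 = sum(yi for xi, yi in zip(x, y) if xi == 1) / 3
print(b0, b1)  # b0 = mean of y in the x=0 group, b1 = mean1 - mean0
```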
REFERENCES
Heiss, F. (2020). Using R for Introductory Econometrics, 2nd edition. Chapter 2: The Simple Regression Model.