
Lecture 29

Simple linear regression.

29.1 Method of least squares.


Suppose that we are given a sequence of observations
$$(X_1, Y_1), \ldots, (X_n, Y_n)$$
where each observation is a pair of numbers $(X_i, Y_i)$. Suppose that we want to
predict the variable $Y$ as a function of $X$ because we believe that there is some underlying
relationship between $Y$ and $X$ and, for example, $Y$ can be approximated by a function
of $X$, i.e. $Y \approx f(X)$. We will consider the simplest case when $f(x)$ is a linear function
of $x$:
$$f(x) = \beta_0 + \beta_1 x.$$

Figure 29.1: The least-squares line.

Of course, we want to find the line that fits our data best and one can define the
measure of the quality of the fit in many different ways. The most common approach

is to measure how $Y_i$ is approximated by $\beta_0 + \beta_1 X_i$ in terms of the squared difference
$(Y_i - (\beta_0 + \beta_1 X_i))^2$, which means that we measure the quality of approximation globally
by the loss function
$$L = \sum_{i=1}^{n} (\underbrace{Y_i}_{\text{actual}} - \underbrace{(\beta_0 + \beta_1 X_i)}_{\text{estimate}})^2 \ \longrightarrow\ \text{minimize over } \beta_0, \beta_1$$
and we want to minimize it over all choices of parameters $\beta_0, \beta_1$. The line that minimizes this loss is called the least-squares line. To find the critical points we write:
$$\frac{\partial L}{\partial \beta_0} = -\sum_{i=1}^{n} 2\,(Y_i - (\beta_0 + \beta_1 X_i)) = 0$$
$$\frac{\partial L}{\partial \beta_1} = -\sum_{i=1}^{n} 2\,(Y_i - (\beta_0 + \beta_1 X_i))\,X_i = 0$$

If we introduce the notations
$$\bar{X} = \frac{1}{n}\sum X_i, \quad \bar{Y} = \frac{1}{n}\sum Y_i, \quad \overline{X^2} = \frac{1}{n}\sum X_i^2, \quad \overline{XY} = \frac{1}{n}\sum X_i Y_i,$$
then the critical point conditions can be rewritten as
$$\beta_0 + \beta_1 \bar{X} = \bar{Y} \quad \text{and} \quad \beta_0 \bar{X} + \beta_1 \overline{X^2} = \overline{XY},$$
and solving it for $\beta_0$ and $\beta_1$ we get
$$\hat{\beta}_1 = \frac{\overline{XY} - \bar{X}\bar{Y}}{\overline{X^2} - \bar{X}^2} \quad \text{and} \quad \hat{\beta}_0 = \bar{Y} - \hat{\beta}_1 \bar{X}.$$
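As a quick numerical illustration of these closed-form formulas (the data below are made up, and Python/numpy is just one convenient choice), the least-squares line can be computed directly from the sample averages:

```python
import numpy as np

# Illustrative data (made up): n observations (X_i, Y_i).
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

# Sample averages used in the closed-form solution.
X_bar  = X.mean()            # (1/n) sum X_i
Y_bar  = Y.mean()            # (1/n) sum Y_i
X2_bar = (X ** 2).mean()     # (1/n) sum X_i^2
XY_bar = (X * Y).mean()      # (1/n) sum X_i Y_i

beta1_hat = (XY_bar - X_bar * Y_bar) / (X2_bar - X_bar ** 2)
beta0_hat = Y_bar - beta1_hat * X_bar

print(beta0_hat, beta1_hat)  # intercept and slope of the least-squares line
```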
If each $X_i$ is a vector $X_i = (X_{i1}, \ldots, X_{ik})$ of dimension $k$ then we can try to
approximate the $Y_i$'s as a linear function of the coordinates of $X_i$:
$$Y_i \approx f(X_i) = \beta_0 + \beta_1 X_{i1} + \ldots + \beta_k X_{ik}.$$
In this case one can also minimize the squared loss:
$$L = \sum (Y_i - (\beta_0 + \beta_1 X_{i1} + \ldots + \beta_k X_{ik}))^2 \ \longrightarrow\ \text{minimize over } \beta_0, \beta_1, \ldots, \beta_k$$
by taking the derivatives and solving the system of linear equations to find the parameters $\beta_0, \ldots, \beta_k$.
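A minimal sketch of this multivariate case (again with made-up data): stacking the vectors $X_i$, with a leading column of ones for the intercept, into a design matrix turns the problem into a standard linear least-squares problem, which numpy can solve directly (equivalently, one could form and solve the normal equations).

```python
import numpy as np

# Illustrative data: each X_i is a k-dimensional vector (here k = 2).
rng = np.random.default_rng(0)
n, k = 50, 2
X = rng.normal(size=(n, k))
Y = 1.0 + 2.0 * X[:, 0] - 0.5 * X[:, 1] + rng.normal(scale=0.1, size=n)

# Prepend a column of ones so that beta_0 plays the role of the intercept.
A = np.column_stack([np.ones(n), X])

# Minimizing the squared loss is a linear least-squares problem in (beta_0, ..., beta_k).
beta_hat, *_ = np.linalg.lstsq(A, Y, rcond=None)
print(beta_hat)  # [beta_0, beta_1, ..., beta_k]
```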

29.2 Simple linear regression.


First of all, when the response variable $Y$ in a random couple $(X, Y)$ is predicted as
a function of $X$ then one can model this situation by
$$Y = f(X) + \varepsilon$$
where the random variable $\varepsilon$ is independent of $X$ (it is often called random noise)
and on average it is equal to zero: $\mathbb{E}\varepsilon = 0$. For a fixed $X$, the response variable $Y$ in
this model on average will be equal to $f(X)$ since
$$\mathbb{E}(Y \mid X) = \mathbb{E}(f(X) + \varepsilon \mid X) = f(X) + \mathbb{E}(\varepsilon \mid X) = f(X) + \mathbb{E}\varepsilon = f(X),$$
and $f(x) = \mathbb{E}(Y \mid X = x)$ is called the regression function.


Next, we will consider a simple linear regression model in which the regression
function is linear, i.e. $f(x) = \beta_0 + \beta_1 x$, and the response variable $Y$ is modeled as
$$Y = f(X) + \varepsilon = \beta_0 + \beta_1 X + \varepsilon,$$
where the random noise $\varepsilon$ is assumed to have normal distribution $N(0, \sigma^2)$.
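For concreteness, here is a small simulated sample from this model (the parameter values below are arbitrary choices for illustration):

```python
import numpy as np

# Simulate one sample from Y = beta_0 + beta_1 * X + eps, eps ~ N(0, sigma^2).
rng = np.random.default_rng(1)
beta0, beta1, sigma = 1.0, 2.0, 0.5

n = 100
X = np.linspace(0.0, 10.0, n)                    # think of the X_i as fixed design points
eps = rng.normal(loc=0.0, scale=sigma, size=n)   # random noise
Y = beta0 + beta1 * X + eps
```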


Suppose that we are given a sequence $(X_1, Y_1), \ldots, (X_n, Y_n)$ that is described by
the above model:
$$Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i$$
and $\varepsilon_1, \ldots, \varepsilon_n$ are i.i.d. $N(0, \sigma^2)$. We have three unknown parameters, $\beta_0$, $\beta_1$ and $\sigma^2$,
and we want to estimate them using the given sample. Let us think of the points
$X_1, \ldots, X_n$ as fixed and non-random and deal with the randomness that comes from
the noise variables $\varepsilon_i$. For a fixed $X_i$, the distribution of $Y_i$ is equal to $N(f(X_i), \sigma^2)$
with p.d.f.
$$f(y) = \frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-\frac{(y - f(X_i))^2}{2\sigma^2}}$$
and the likelihood function of the sequence $Y_1, \ldots, Y_n$ is:
$$f(Y_1, \ldots, Y_n) = \Big(\frac{1}{\sqrt{2\pi}\,\sigma}\Big)^n e^{-\frac{1}{2\sigma^2}\sum_{i=1}^{n}(Y_i - f(X_i))^2} = \Big(\frac{1}{\sqrt{2\pi}\,\sigma}\Big)^n e^{-\frac{1}{2\sigma^2}\sum_{i=1}^{n}(Y_i - \beta_0 - \beta_1 X_i)^2}.$$

Let us find the maximum likelihood estimates of $\beta_0$, $\beta_1$ and $\sigma^2$ that maximize this
likelihood function. First of all, it is obvious that for any $\sigma^2$ we need to minimize
$$\sum_{i=1}^{n} (Y_i - \beta_0 - \beta_1 X_i)^2$$
over $\beta_0, \beta_1$, which is the same as finding the least-squares line and, therefore, the MLEs
of $\beta_0$ and $\beta_1$ are given by
$$\hat{\beta}_0 = \bar{Y} - \hat{\beta}_1 \bar{X} \quad \text{and} \quad \hat{\beta}_1 = \frac{\overline{XY} - \bar{X}\bar{Y}}{\overline{X^2} - \bar{X}^2}.$$
Finally, to find the MLE of $\sigma^2$ we maximize the likelihood over $\sigma^2$ and get:
$$\hat{\sigma}^2 = \frac{1}{n}\sum_{i=1}^{n}(Y_i - \hat{\beta}_0 - \hat{\beta}_1 X_i)^2.$$
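As a sanity check on this derivation (a sketch with made-up parameter values), one can compare the closed-form estimates with a direct numerical maximization of the likelihood:

```python
import numpy as np
from scipy.optimize import minimize

# Simulated sample from the model (parameter values are arbitrary).
rng = np.random.default_rng(0)
n = 200
beta0, beta1, sigma = 1.0, 2.0, 0.5
X = rng.uniform(0.0, 10.0, size=n)
Y = beta0 + beta1 * X + rng.normal(0.0, sigma, size=n)

# Closed-form MLEs from the lecture.
X_bar, Y_bar = X.mean(), Y.mean()
b1_hat = ((X * Y).mean() - X_bar * Y_bar) / ((X ** 2).mean() - X_bar ** 2)
b0_hat = Y_bar - b1_hat * X_bar
sigma2_hat = np.mean((Y - b0_hat - b1_hat * X) ** 2)

# Direct numerical maximization of the log-likelihood
# (parameterized by log sigma so that sigma stays positive).
def neg_log_likelihood(theta):
    a, b, log_s = theta
    s2 = np.exp(2.0 * log_s)
    resid = Y - a - b * X
    return 0.5 * n * np.log(2.0 * np.pi * s2) + np.sum(resid ** 2) / (2.0 * s2)

res = minimize(neg_log_likelihood, x0=np.array([0.0, 0.0, 0.0]))
print(b0_hat, b1_hat, sigma2_hat)                   # closed-form MLEs
print(res.x[0], res.x[1], np.exp(2.0 * res.x[2]))   # numerical MLEs (should agree)
```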

Let us now compute the joint distribution of $\hat{\beta}_0$ and $\hat{\beta}_1$. Since the $X_i$'s are fixed, these
estimates are written as linear combinations of the $Y_i$'s, which have normal distributions,
and, as a result, $\hat{\beta}_0$ and $\hat{\beta}_1$ will have normal distributions. All we need to do is find
their means, variances and covariance. First, if we write $\hat{\beta}_1$ as
$$\hat{\beta}_1 = \frac{\overline{XY} - \bar{X}\bar{Y}}{\overline{X^2} - \bar{X}^2} = \frac{1}{n}\,\frac{\sum (X_i - \bar{X})\,Y_i}{\overline{X^2} - \bar{X}^2}$$
then its expectation can be computed:
$$\mathbb{E}\hat{\beta}_1 = \frac{\sum (X_i - \bar{X})\,\mathbb{E} Y_i}{n(\overline{X^2} - \bar{X}^2)} = \frac{\sum (X_i - \bar{X})(\beta_0 + \beta_1 X_i)}{n(\overline{X^2} - \bar{X}^2)}$$
$$= \beta_0 \underbrace{\frac{\sum (X_i - \bar{X})}{n(\overline{X^2} - \bar{X}^2)}}_{=0} + \beta_1 \frac{\sum X_i (X_i - \bar{X})}{n(\overline{X^2} - \bar{X}^2)} = \beta_1 \frac{n\overline{X^2} - n\bar{X}^2}{n(\overline{X^2} - \bar{X}^2)} = \beta_1.$$

Therefore, $\hat{\beta}_1$ is an unbiased estimator of $\beta_1$. The variance of $\hat{\beta}_1$ can be computed
(using the independence of the $Y_i$'s):
$$\mathrm{Var}(\hat{\beta}_1) = \mathrm{Var}\Big(\frac{\sum (X_i - \bar{X})\,Y_i}{n(\overline{X^2} - \bar{X}^2)}\Big) = \sum \mathrm{Var}\Big(\frac{(X_i - \bar{X})\,Y_i}{n(\overline{X^2} - \bar{X}^2)}\Big)$$
$$= \sum \Big(\frac{X_i - \bar{X}}{n(\overline{X^2} - \bar{X}^2)}\Big)^2 \sigma^2 = \frac{n(\overline{X^2} - \bar{X}^2)}{n^2(\overline{X^2} - \bar{X}^2)^2}\,\sigma^2 = \frac{\sigma^2}{n(\overline{X^2} - \bar{X}^2)}.$$
 
Therefore,
$$\hat{\beta}_1 \sim N\Big(\beta_1, \frac{\sigma^2}{n(\overline{X^2} - \bar{X}^2)}\Big).$$
A similar straightforward computation gives
$$\hat{\beta}_0 = \bar{Y} - \hat{\beta}_1 \bar{X} \sim N\Big(\beta_0, \Big(\frac{1}{n} + \frac{\bar{X}^2}{n(\overline{X^2} - \bar{X}^2)}\Big)\sigma^2\Big)$$
and
$$\mathrm{Cov}(\hat{\beta}_0, \hat{\beta}_1) = -\frac{\sigma^2 \bar{X}}{n(\overline{X^2} - \bar{X}^2)}.$$
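These distributional results are easy to check by simulation (a sketch with arbitrary parameter values): repeatedly generate samples with the same fixed design and compare the empirical moments of $\hat{\beta}_0, \hat{\beta}_1$ with the formulas above.

```python
import numpy as np

# Monte Carlo check of the sampling distributions (parameter values are made up).
rng = np.random.default_rng(2)
beta0, beta1, sigma = 1.0, 2.0, 0.5
n = 30
X = rng.uniform(0.0, 10.0, size=n)     # fixed design, reused in every replication

X_bar, X2_bar = X.mean(), (X ** 2).mean()
sx2 = X2_bar - X_bar ** 2              # \overline{X^2} - \bar{X}^2

b0s, b1s = [], []
for _ in range(20000):
    Y = beta0 + beta1 * X + rng.normal(0.0, sigma, size=n)
    b1 = ((X * Y).mean() - X_bar * Y.mean()) / sx2
    b0 = Y.mean() - b1 * X_bar
    b0s.append(b0)
    b1s.append(b1)
b0s, b1s = np.array(b0s), np.array(b1s)

# Empirical moments vs. the formulas derived above.
print(b1s.mean(), beta1)                                       # E beta1_hat = beta1
print(b1s.var(), sigma ** 2 / (n * sx2))                       # Var(beta1_hat)
print(b0s.var(), (1 / n + X_bar ** 2 / (n * sx2)) * sigma ** 2)  # Var(beta0_hat)
print(np.cov(b0s, b1s)[0, 1], -sigma ** 2 * X_bar / (n * sx2))   # Cov(beta0_hat, beta1_hat)
```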
