
Math 170S

Lecture 6

Hubeyb Gurdogan

August 8, 2024

References

Zimmerman
Chapter 6.5

Regression

▶ In general, we are interested in predicting a future (latent) random variable Y corresponding to a realized (known) variable x.
▶ Example: Let x represent the average age in a country, and Y be the annual fatality rate of influenza.
▶ For now we concentrate on estimating the conditional mean E[Y | x] (a standard predictor for Y given x).
▶ To estimate it, we observe random variables Y1, Y2, ..., Yn for x1, x2, ..., xn independently, obtaining a sample of n pairs of known numbers (x1, y1), (x2, y2), ..., (xn, yn).
▶ These pairs are then used to estimate the conditional mean E[Y | x].

Regression Assumptions
The relationship between Y and x is assumed to be

Y = µ(x) + ϵ,

where:

▶ µ(x) is a deterministic function of x.
▶ ϵ is a Normal random variable with mean 0 and variance σ², i.e. ϵ ∼ N(0, σ²).

Remark: The function µ(x) can take different forms, which give the methods their names (see the sketch after this list):

▶ Linear regression model: µ(x) = α + βx
▶ Polynomial regression model: µ(x) = α + βx + γx²
▶ A non-linear regression model: µ(x) = αx^β
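To make these three forms concrete, here is a minimal Python sketch; the parameter values α, β, γ are hypothetical, chosen only for illustration:

```python
# Hypothetical parameter values, purely for illustration.
alpha, beta, gamma = 1.0, 0.5, -0.02

def mu_linear(x):
    # Linear regression model: mu(x) = alpha + beta * x
    return alpha + beta * x

def mu_polynomial(x):
    # Polynomial regression model: mu(x) = alpha + beta * x + gamma * x^2
    return alpha + beta * x + gamma * x ** 2

def mu_nonlinear(x):
    # A non-linear regression model: mu(x) = alpha * x^beta
    return alpha * x ** beta
```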

Linear Regression Problem

We now consider the linear regression problem, where

µ(x) = α + βx.

For a random sample Y1, Y2, ..., Yn we have

Yi = α + βxi + εi,    εi ∼ N(0, σ²).

Note that this implies Y1, ..., Yn have the following distribution:

Yi ∼ N(α + βxi, σ²)

(Warning: α corresponds to the α1 in the course book.)
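A minimal simulation sketch of this model, assuming numpy is available; the parameter values and design points below are hypothetical, not taken from the course:

```python
import numpy as np

rng = np.random.default_rng(0)

alpha, beta, sigma = 30.0, 0.75, 4.5                 # hypothetical true parameters
x = np.linspace(50.0, 90.0, 10)                      # fixed, known design points

eps = rng.normal(loc=0.0, scale=sigma, size=x.size)  # eps_i ~ N(0, sigma^2)
y = alpha + beta * x + eps                           # Y_i ~ N(alpha + beta * x_i, sigma^2)
```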

Maximum Likelihood Estimators of α, β, σ²
We can use maximum likelihood estimation to obtain estimates of α, β, σ². The likelihood function is:

\[
L(\alpha, \beta, \sigma^2)
  = \prod_{i=1}^{n} \frac{1}{(2\pi\sigma^2)^{1/2}} \exp\!\left( -\frac{(y_i - (\alpha + \beta x_i))^2}{2\sigma^2} \right)
  = \frac{1}{(2\pi\sigma^2)^{n/2}} \exp\!\left( -\frac{\sum_{i=1}^{n} (y_i - (\alpha + \beta x_i))^2}{2\sigma^2} \right)
\]

The maximum likelihood estimators α̂, β̂, σ̂² of α, β, σ² are the values that maximize L(·):

\[
(\hat{\alpha}, \hat{\beta}, \hat{\sigma}^2)
  = \operatorname*{arg\,max}_{\alpha,\,\beta,\,\sigma^2}
    \frac{1}{(2\pi\sigma^2)^{n/2}} \exp\!\left( -\frac{\sum_{i=1}^{n} (y_i - (\alpha + \beta x_i))^2}{2\sigma^2} \right)
\]
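One way to see this definition in action is to maximize the log-likelihood numerically, before using the closed-form solution on the next slide. A sketch assuming numpy and scipy, applied to the student-score data from the worked example later in this lecture:

```python
import numpy as np
from scipy.optimize import minimize

def neg_log_likelihood(params, x, y):
    # Negative log-likelihood of Y_i ~ N(alpha + beta * x_i, sigma^2).
    # We optimize log(sigma) so that sigma stays positive.
    alpha, beta, log_sigma = params
    sigma2 = np.exp(2.0 * log_sigma)
    resid = y - (alpha + beta * x)
    n = y.size
    return 0.5 * n * np.log(2.0 * np.pi * sigma2) + np.sum(resid ** 2) / (2.0 * sigma2)

# Midterm (x) and final (y) scores from the worked example later in the lecture.
x = np.array([70, 74, 72, 68, 58, 54, 82, 64, 80, 61], dtype=float)
y = np.array([77, 94, 88, 80, 71, 76, 88, 80, 90, 69], dtype=float)

res = minimize(neg_log_likelihood, x0=np.array([0.0, 1.0, 1.0]), args=(x, y))
alpha_hat, beta_hat = res.x[0], res.x[1]
sigma2_hat = np.exp(2.0 * res.x[2])
print(alpha_hat, beta_hat, sigma2_hat)  # should approach the closed-form MLEs
```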

Maximum Likelihood Estimators of α, β, σ²
The maximum likelihood estimates of α, β, σ² are given by the following (a short code sketch follows the list):

▶ \( \hat{\beta} = \dfrac{\sum_{i=1}^{n} (y_i - \bar{y})(x_i - \bar{x})}{\sum_{i=1}^{n} (x_i - \bar{x})^2} = \dfrac{\sum_{i=1}^{n} y_i (x_i - \bar{x})}{\sum_{i=1}^{n} (x_i - \bar{x})^2} \)

▶ \( \hat{\alpha} = \bar{y} - \hat{\beta} \bar{x} \)

▶ \( \hat{\sigma}^2 = \dfrac{1}{n} \sum_{i=1}^{n} \left[ y_i - \hat{\alpha} - \hat{\beta} x_i \right]^2 \)
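A direct numpy transcription of the three estimators above (a sketch, assuming numpy is available):

```python
import numpy as np

def linreg_mle(x, y):
    """MLEs of alpha, beta, sigma^2 via the deviation-from-the-mean formulas."""
    x_bar, y_bar = x.mean(), y.mean()
    beta_hat = np.sum((y - y_bar) * (x - x_bar)) / np.sum((x - x_bar) ** 2)
    alpha_hat = y_bar - beta_hat * x_bar
    # The MLE divides by n; the common unbiased variance estimate divides by n - 2.
    sigma2_hat = np.mean((y - alpha_hat - beta_hat * x) ** 2)
    return alpha_hat, beta_hat, sigma2_hat
```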

Calculating α̂, β̂, σ̂² from these expressions is not hard, but it can be time-consuming. Therefore, we provide the following equivalent formulas for α̂, β̂, σ̂².

Formulas

\[
\hat{\beta} = \frac{E - \frac{AB}{n}}{C - \frac{A^2}{n}}; \qquad
\hat{\alpha} = \frac{B}{n} - \hat{\beta}\,\frac{A}{n}; \qquad
\hat{\sigma}^2 = \frac{D}{n} - \left(\frac{B}{n}\right)^2 - \hat{\beta}\,\frac{E}{n} + \hat{\beta}\,\frac{AB}{n^2},
\]

where

\[
A := \sum_{i=1}^{n} x_i, \quad
B := \sum_{i=1}^{n} y_i, \quad
C := \sum_{i=1}^{n} x_i^2, \quad
D := \sum_{i=1}^{n} y_i^2, \quad
E := \sum_{i=1}^{n} x_i y_i.
\]
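These formulas translate directly into a hand-computation-style routine; a sketch in plain Python (the function name linreg_mle_sums is ours, not from the course):

```python
def linreg_mle_sums(x, y):
    """MLEs of alpha, beta, sigma^2 from the running sums A, B, C, D, E."""
    n = len(x)
    A, B = sum(x), sum(y)
    C = sum(v * v for v in x)
    D = sum(v * v for v in y)
    E = sum(a * b for a, b in zip(x, y))
    beta_hat = (E - A * B / n) / (C - A ** 2 / n)
    alpha_hat = B / n - beta_hat * A / n
    sigma2_hat = D / n - (B / n) ** 2 - beta_hat * E / n + beta_hat * A * B / n ** 2
    return alpha_hat, beta_hat, sigma2_hat
```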

Calculations

Let x1, . . . , xn be the midterm scores of 10 students in a fictional statistics class:

70 74 72 68 58 54 82 64 80 61.

Let y1, . . . , yn be the final scores of the same 10 students:

77 94 88 80 71 76 88 80 90 69.

Calculations
The key values A, B, C, D, E are given by

\[
A = \sum_{i=1}^{n} x_i = 70 + 74 + 72 + 68 + 58 + 54 + 82 + 64 + 80 + 61 = 683;
\]
\[
B = \sum_{i=1}^{n} y_i = 77 + 94 + 88 + 80 + 71 + 76 + 88 + 80 + 90 + 69 = 813;
\]
\[
C = \sum_{i=1}^{n} x_i^2 = 70^2 + 74^2 + 72^2 + 68^2 + 58^2 + 54^2 + 82^2 + 64^2 + 80^2 + 61^2 = 47{,}405;
\]
\[
D = \sum_{i=1}^{n} y_i^2 = 77^2 + 94^2 + 88^2 + 80^2 + 71^2 + 76^2 + 88^2 + 80^2 + 90^2 + 69^2 = 66{,}731;
\]
Calculations

\[
E = \sum_{i=1}^{n} x_i y_i = (70)(77) + (74)(94) + (72)(88) + (68)(80) + (58)(71) + (54)(76) + (82)(88) + (64)(80) + (80)(90) + (61)(69) = 56{,}089.
\]

The MLEs are then given by

\[
\hat{\beta} = \frac{E - \frac{AB}{n}}{C - \frac{A^2}{n}} = 0.742; \qquad
\hat{\alpha} = \frac{B}{n} - \hat{\beta}\,\frac{A}{n} = 30.6214; \qquad
\hat{\sigma}^2 = \frac{D}{n} - \left(\frac{B}{n}\right)^2 - \hat{\beta}\,\frac{E}{n} + \hat{\beta}\,\frac{AB}{n^2} = 21.77638.
\]

(Here α̂ and σ̂² are evaluated with β̂ rounded to 0.742; with the unrounded β̂ ≈ 0.74210, one gets α̂ ≈ 30.6147 and σ̂² ≈ 21.7709.)
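Reusing the linreg_mle_sums sketch from the Formulas slide reproduces these numbers:

```python
x = [70, 74, 72, 68, 58, 54, 82, 64, 80, 61]
y = [77, 94, 88, 80, 71, 76, 88, 80, 90, 69]

alpha_hat, beta_hat, sigma2_hat = linreg_mle_sums(x, y)
print(round(beta_hat, 4))    # 0.7421
print(round(alpha_hat, 4))   # 30.6147 (the slide's 30.6214 uses beta_hat rounded to 0.742)
print(round(sigma2_hat, 4))  # 21.7709 (likewise, 21.77638 uses the rounded beta_hat)
```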

Residuals

Once we have derived the maximum likelihood estimates of α, β, σ², we can compute the predicted outcome ŷi for each i = 1, ..., n:

ŷi = α̂ + β̂xi

Residuals: the differences between the observed values yi and the predicted values ŷi are called residuals, and they provide information about how well the model fits the data.

In general, we have:

▶ Large residuals ⇒ the model is a bad fit.
▶ Small residuals ⇒ the model is a good fit.

In practice, the average of the squared residuals, σ̂², is used as a measure of fit.
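A sketch of the residual computation for the worked example, assuming numpy; the MLEs are plugged in as rounded constants:

```python
import numpy as np

x = np.array([70, 74, 72, 68, 58, 54, 82, 64, 80, 61], dtype=float)
y = np.array([77, 94, 88, 80, 71, 76, 88, 80, 90, 69], dtype=float)

alpha_hat, beta_hat = 30.6147, 0.7421   # MLEs from the worked example (rounded)
y_hat = alpha_hat + beta_hat * x        # predicted final scores
residuals = y - y_hat                   # observed minus predicted
print(np.mean(residuals ** 2))          # average squared residual, roughly sigma2_hat ≈ 21.77
```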

Scatter Plot, Regression Line

[Figure: scatter plot of the 10 (midterm, final) score pairs together with the fitted regression line ŷ = α̂ + β̂x.]
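A matplotlib sketch of the plot this slide presumably showed; matplotlib and numpy are assumed, and the fitted coefficients come from the worked example:

```python
import numpy as np
import matplotlib.pyplot as plt

x = np.array([70, 74, 72, 68, 58, 54, 82, 64, 80, 61], dtype=float)
y = np.array([77, 94, 88, 80, 71, 76, 88, 80, 90, 69], dtype=float)

plt.scatter(x, y, label="(midterm, final) pairs")
grid = np.linspace(x.min(), x.max(), 100)
plt.plot(grid, 30.6147 + 0.7421 * grid, color="red", label="fitted regression line")
plt.xlabel("midterm score")
plt.ylabel("final score")
plt.legend()
plt.show()
```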
