1 Simple Linear Regression
Reference: Montgomery, Douglas C. “Chapter 2: Simple Linear Regression.” In Introduction to Linear Regression Analysis. John Wiley & Sons.
The simple linear regression model is a model with a single regressor 𝑥 that
has a straight-line relationship with a response 𝑦:
y = β₀ + β₁x + ε

where E(ε) = 0, Var(ε) = σ², and the errors are uncorrelated, i.e., Cov(εᵢ, εⱼ) = 0 for i ≠ j.
The regressor x is controlled by the data analyst and measured with negligible error, while the response y is a random variable. At each possible value of x,
the mean of the distribution of y is

E(y|x) = E(β₀ + β₁x + ε) = β₀ + β₁x + E(ε) = β₀ + β₁x

and the variance is

Var(y|x) = Var(β₀ + β₁x + ε) = Var(ε) = σ².
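As a quick illustration (not from the text), the conditional mean and variance can be checked by simulating the model at a fixed x; the values of β₀, β₁, σ, and x below are arbitrary choices.

```python
# Simulate y = beta0 + beta1*x + eps at a fixed x and check that the
# sample mean and variance of y approach beta0 + beta1*x and sigma^2.
# beta0, beta1, sigma, and x are arbitrary illustrative values.
import random

random.seed(0)
beta0, beta1, sigma = 1.0, 0.5, 2.0
x = 10.0
ys = [beta0 + beta1 * x + random.gauss(0, sigma) for _ in range(100_000)]
mean_y = sum(ys) / len(ys)
var_y = sum((y - mean_y) ** 2 for y in ys) / len(ys)
print(mean_y, var_y)  # close to E(y|x) = 6.0 and Var(y|x) = 4.0
```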
Example 1.1.1 A study investigated whether the average number of tweets (or messages) per hour prior to a movie’s release on Twitter.com could be used to forecast the opening weekend box office revenues of movies. The two variables were measured for a sample of 23 movies.
Movie   Tweet Rate   Revenue (millions)
1       1365.8       142
2       1212.8       77
3       581.5        61
4       310.1        32
5       455          31
6       290          30
⋮       ⋮            ⋮
23      2.75         0.3
Suggest a linear regression model for the investigation. What is the practical meaning of the regression coefficients?
A simple linear regression model for the investigation is

y = β₀ + β₁x + ε

where y is the opening weekend revenue and x is the tweet rate.
• β₁ is the slope: the average revenue increase of a movie when the tweet rate increases by 1 tweet per hour.
• β₀ is the intercept: the average revenue of a movie whose tweet rate is zero.
The least-squares estimators of β₀ and β₁ are

β̂₀ = ȳ − β̂₁x̄   (y-intercept)

and

β̂₁ = (Σᵢ xᵢyᵢ − n x̄ȳ) / (Σᵢ xᵢ² − n x̄²)   (slope).

The fitted simple linear regression model is then

ŷ = β̂₀ + β̂₁x.
The difference between the observed value yᵢ and the corresponding fitted value ŷᵢ is a residual:

eᵢ = yᵢ − ŷᵢ = yᵢ − (β̂₀ + β̂₁xᵢ).
The least-squares criterion is

S(β₀, β₁) = Σᵢ (yᵢ − β₀ − β₁xᵢ)².

The least-squares estimators of β₀ and β₁, say β̂₀ and β̂₁, must satisfy

∂S/∂β₀ = −2 Σᵢ (yᵢ − β̂₀ − β̂₁xᵢ) = 0
∂S/∂β₁ = −2 Σᵢ (yᵢ − β̂₀ − β̂₁xᵢ)xᵢ = 0

which simplify to

nβ̂₀ + β̂₁ Σᵢ xᵢ = Σᵢ yᵢ
β̂₀ Σᵢ xᵢ + β̂₁ Σᵢ xᵢ² = Σᵢ yᵢxᵢ

The above equations are called the least-squares normal equations. The solution to the normal equations is

β̂₀ = ȳ − β̂₁x̄

β̂₁ = (Σᵢ yᵢxᵢ − n ȳx̄) / (Σᵢ xᵢ² − n x̄²) = Σᵢ (yᵢ − ȳ)(xᵢ − x̄) / Σᵢ (xᵢ − x̄)²
Example 1.2.2 A study investigated whether the average number of tweets (or messages) per hour prior to a movie’s release on Twitter.com could be used to forecast the opening weekend box office revenues of movies. The two variables were measured for a sample of n = 23 movies. Consider the simple linear regression model:

y = β₀ + β₁x + ε

where y = weekend box office revenues and x = the average number of tweets per hour. Below are some quantities based on the sample:
Σᵢ xᵢ = 6980.65   Σᵢ yᵢ = 576.3   Σᵢ xᵢyᵢ = 396603.2   Σᵢ xᵢ² = 4933199

Derive the least-squares estimates of β₀ and β₁.

β̂₁ = (Σᵢ yᵢxᵢ − n ȳx̄) / (Σᵢ xᵢ² − n x̄²)
   = (396603.2 − 23(576.3/23)(6980.65/23)) / (4933199 − 23(6980.65/23)²)
   = 0.07876722

β̂₀ = ȳ − β̂₁x̄ = 576.3/23 − (0.07876722)(6980.65/23) = 1.150157
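The least-squares arithmetic of Example 1.2.2 can be verified with a short Python sketch using only the summary statistics quoted above:

```python
# Least-squares estimates from the summary statistics of Example 1.2.2.
n = 23
sum_x = 6980.65     # sum of tweet rates
sum_y = 576.3       # sum of revenues (millions)
sum_xy = 396603.2   # sum of x_i * y_i
sum_x2 = 4933199.0  # sum of x_i^2

xbar, ybar = sum_x / n, sum_y / n
b1 = (sum_xy - n * xbar * ybar) / (sum_x2 - n * xbar ** 2)  # slope
b0 = ybar - b1 * xbar                                       # intercept
print(b1, b0)  # about 0.0788 and 1.1502, matching the text
```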
Part of the R summary output for the fitted model is shown below:

Residuals:
Min 1Q Median 3Q Max
-36.751 -2.302 2.468 5.083 33.270
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 1.150154 3.676108 0.313 0.757
x 0.078767 0.007938 9.923 2.22e-09 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
The estimator β̂₁ is a linear combination of the observations:

β̂₁ = Σᵢ (yᵢ − ȳ)(xᵢ − x̄) / Σᵢ (xᵢ − x̄)² = Σᵢ cᵢyᵢ

where cᵢ = (xᵢ − x̄) / Σⱼ (xⱼ − x̄)².

The estimator β̂₁ is unbiased:

E(β̂₁) = Σᵢ cᵢ E(yᵢ)
      = Σᵢ cᵢ E(β₀ + β₁xᵢ + εᵢ)
      = Σᵢ cᵢ (β₀ + β₁xᵢ)                           since E(εᵢ) = 0
      = β₀ Σᵢ cᵢ + β₁ Σᵢ cᵢxᵢ
      = β₁ Σᵢ (xᵢ − x̄)xᵢ / Σᵢ (xᵢ − x̄)²             since Σᵢ cᵢ = 0
      = β₁ Σᵢ (xᵢ − x̄)(xᵢ − x̄) / Σᵢ (xᵢ − x̄)²       since x̄ Σᵢ (xᵢ − x̄) = 0
      = β₁
Var(β̂₁) = Var(Σᵢ cᵢyᵢ)
        = Σᵢ cᵢ² Var(yᵢ) + Σᵢ Σⱼ≠ᵢ cᵢcⱼ Cov(yᵢ, yⱼ)
        = Σᵢ cᵢ² Var(β₀ + β₁xᵢ + εᵢ)                since the yᵢ are uncorrelated
        = Σᵢ cᵢ² Var(εᵢ)
        = σ² Σᵢ (xᵢ − x̄)² / (Σᵢ (xᵢ − x̄)²)²
        = σ² / Σᵢ (xᵢ − x̄)²
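A minimal Monte Carlo sketch of the two results just derived, E(β̂₁) = β₁ and Var(β̂₁) = σ²/Σᵢ(xᵢ − x̄)²; the regressor values and parameters below are arbitrary choices, not from the text:

```python
# Repeatedly simulate the model and fit the slope by least squares;
# the average fitted slope should be near beta1 and its variance near
# sigma^2 / Sxx.
import random

random.seed(1)
x = [float(i) for i in range(1, 21)]  # fixed regressor values
beta0, beta1, sigma = 2.0, 0.5, 1.0
xbar = sum(x) / len(x)
Sxx = sum((xi - xbar) ** 2 for xi in x)

def fit_slope():
    y = [beta0 + beta1 * xi + random.gauss(0, sigma) for xi in x]
    ybar = sum(y) / len(y)
    return sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / Sxx

slopes = [fit_slope() for _ in range(20_000)]
mean_b1 = sum(slopes) / len(slopes)
var_b1 = sum((b - mean_b1) ** 2 for b in slopes) / len(slopes)
print(mean_b1, var_b1, sigma ** 2 / Sxx)
```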
The fitted regression line ŷ = β̂₀ + β̂₁x has the following properties:
o Σᵢ (yᵢ − ŷᵢ) = Σᵢ eᵢ = 0
o Σᵢ yᵢ = Σᵢ ŷᵢ
o The line passes through the centroid of the data: ȳ = β̂₀ + β̂₁x̄
o Σᵢ xᵢeᵢ = 0
o Σᵢ ŷᵢeᵢ = 0
The estimator β̂₀ is also a linear combination of the observations:

β̂₀ = ȳ − β̂₁x̄
   = (1/n) Σᵢ yᵢ − x̄ Σᵢ cᵢyᵢ
   = Σᵢ (1/n − cᵢx̄) yᵢ
   = Σᵢ dᵢyᵢ

where

dᵢ = 1/n − (xᵢ − x̄)x̄ / Σⱼ (xⱼ − x̄)².
Proofs of the properties:
a) Σᵢ (yᵢ − ŷᵢ) = Σᵢ eᵢ = 0
b) ȳ = β̂₀ + β̂₁x̄
c) Σᵢ xᵢeᵢ = 0
d) Σᵢ ŷᵢeᵢ = 0

(a) Σᵢ (yᵢ − ŷᵢ) = Σᵢ (yᵢ − β̂₀ − β̂₁xᵢ) = (−1/2) ∂S/∂β₀ evaluated at (β̂₀, β̂₁), which is 0 by the first normal equation.

(b) β̂₀ = ȳ − β̂₁x̄ ⇒ β̂₀ + β̂₁x̄ = ȳ.

(c) Σᵢ xᵢeᵢ = Σᵢ xᵢ(yᵢ − ŷᵢ) = Σᵢ xᵢ(yᵢ − β̂₀ − β̂₁xᵢ) = (−1/2) ∂S/∂β₁ evaluated at (β̂₀, β̂₁), which is 0 by the second normal equation.

(d) Σᵢ ŷᵢeᵢ = Σᵢ (β̂₀ + β̂₁xᵢ)eᵢ = β̂₀ Σᵢ eᵢ + β̂₁ Σᵢ xᵢeᵢ = 0 by (a) and (c).
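Properties (a)-(d) can be confirmed numerically on a small made-up data set (illustrative values, not from the text):

```python
# Fit least squares to a tiny data set and confirm the residual
# identities (a)-(d) hold up to floating-point error.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 8.1, 9.8]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
Sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
Sxx = sum((xi - xbar) ** 2 for xi in x)
b1 = Sxy / Sxx
b0 = ybar - b1 * xbar
yhat = [b0 + b1 * xi for xi in x]
e = [yi - yh for yi, yh in zip(y, yhat)]

a = sum(e)                                   # (a) sum of residuals
b = b0 + b1 * xbar - ybar                    # (b) line passes through (xbar, ybar)
c = sum(xi * ei for xi, ei in zip(x, e))     # (c) sum of x_i * e_i
d = sum(yh * ei for yh, ei in zip(yhat, e))  # (d) sum of yhat_i * e_i
print(a, b, c, d)  # all essentially zero
```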
1.4 ESTIMATION OF 𝝈𝟐

An unbiased estimator of σ² is

σ̂² = Σᵢ eᵢ² / (n − 2) = SS_Res / (n − 2) = MS_Res
SS_Res = Σᵢ (yᵢ − ŷᵢ)²
       = Σᵢ (yᵢ − β̂₀ − β̂₁xᵢ)²
       = Σᵢ (yᵢ − ȳ + β̂₁x̄ − β̂₁xᵢ)²                              substituting β̂₀ = ȳ − β̂₁x̄
       = Σᵢ (yᵢ − ȳ)² + 2β̂₁ Σᵢ (yᵢ − ȳ)(x̄ − xᵢ) + β̂₁² Σᵢ (x̄ − xᵢ)²
       = Σᵢ (yᵢ − ȳ)² − 2β̂₁ S_xy + β̂₁² S_xx
       = SS_T − β̂₁ S_xy                                         since β̂₁ = S_xy / S_xx

where SS_T = Σᵢ (yᵢ − ȳ)² is the total sum of squares, S_xy = Σᵢ (yᵢ − ȳ)(xᵢ − x̄), and S_xx = Σᵢ (xᵢ − x̄)².
Example 1.4.1 Estimate the variance σ² of the error term of the model proposed in Example 1.1.2. Below are some quantities based on the sample:

Σᵢ xᵢ = 6980.65   Σᵢ yᵢ = 576.3   Σᵢ xᵢyᵢ = 396603.2   Σᵢ yᵢ² = 35626.09
σ̂² = SS_Res / (n − 2)
   = (SS_T − β̂₁ S_xy) / (n − 2)
   = (Σᵢ yᵢ² − nȳ² − β̂₁(Σᵢ yᵢxᵢ − n x̄ȳ)) / (23 − 2)
   = (35626.09 − (576.3)²/23 − (0.07876722)(396603.2 − (6980.65)(576.3)/23)) / (23 − 2)
   = 177.3297
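The computation in Example 1.4.1 can be reproduced from the quoted summary statistics:

```python
# Estimate of sigma^2 from the summary statistics of Example 1.4.1.
n = 23
sum_x, sum_y = 6980.65, 576.3
sum_xy, sum_y2 = 396603.2, 35626.09
b1 = 0.07876722                    # slope estimate from Example 1.2.2

SS_T = sum_y2 - sum_y ** 2 / n     # total sum of squares
Sxy = sum_xy - sum_x * sum_y / n
SS_Res = SS_T - b1 * Sxy           # residual sum of squares
sigma2_hat = SS_Res / (n - 2)      # MS_Res, unbiased for sigma^2
print(SS_T, SS_Res, sigma2_hat)
```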
The error terms, εᵢ’s, are normally and independently distributed with mean 0 and variance σ², abbreviated NID(0, σ²).
The appropriate hypotheses for testing that the intercept equals a constant, say c₀, are

H₀: β₀ = c₀,   H₁: β₀ ≠ c₀

The test statistic is

t₀ = (β̂₀ − c₀) / se(β̂₀)

where se(β̂₀) = √(MS_Res(1/n + x̄²/S_xx)). Under H₀, t₀ follows a t distribution with n − 2 degrees of freedom, so H₀ is rejected at significance level α if |t₀| > t_(α/2, n−2).
The appropriate hypotheses for testing that the slope equals a constant, say c₁, are

H₀: β₁ = c₁,   H₁: β₁ ≠ c₁

The test statistic is

t₀ = (β̂₁ − c₁) / se(β̂₁)

where se(β̂₁) = √(MS_Res/S_xx). Again, H₀ is rejected at significance level α if |t₀| > t_(α/2, n−2).
The test about the slope is related to the test of the significance of regression: failing to reject H₀: β₁ = 0 is equivalent to saying that there is no linear relationship between y and x.
The total sum of squares can be partitioned:

Σᵢ (yᵢ − ȳ)² = Σᵢ (yᵢ − ŷᵢ + ŷᵢ − ȳ)²
            = Σᵢ (yᵢ − ŷᵢ)² + 2 Σᵢ (yᵢ − ŷᵢ)(ŷᵢ − ȳ) + Σᵢ (ŷᵢ − ȳ)²
            = Σᵢ (yᵢ − ŷᵢ)² + 2 Σᵢ ŷᵢ(yᵢ − ŷᵢ) − 2ȳ Σᵢ (yᵢ − ŷᵢ) + Σᵢ (ŷᵢ − ȳ)²
            = Σᵢ (yᵢ − ŷᵢ)² + 2 Σᵢ ŷᵢeᵢ − 2ȳ Σᵢ (yᵢ − ŷᵢ) + Σᵢ (ŷᵢ − ȳ)²
            = Σᵢ (yᵢ − ŷᵢ)² + Σᵢ (ŷᵢ − ȳ)²

since Σᵢ ŷᵢeᵢ = 0 and Σᵢ ŷᵢ = Σᵢ yᵢ.

We usually write the identity as

SS_T = SS_R + SS_Res

where SS_T = Σᵢ (yᵢ − ȳ)², SS_R = Σᵢ (ŷᵢ − ȳ)², and SS_Res = Σᵢ (yᵢ − ŷᵢ)².
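The identity can be checked numerically on a small made-up data set (illustrative values, not from the text):

```python
# Fit least squares to a small illustrative data set and verify the
# partition SS_T = SS_R + SS_Res.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [1.2, 1.9, 3.5, 3.8, 5.4, 5.9]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
b1 = sum((a - xbar) * (b - ybar) for a, b in zip(x, y)) / sum((a - xbar) ** 2 for a in x)
b0 = ybar - b1 * xbar
yhat = [b0 + b1 * a for a in x]

SS_T = sum((v - ybar) ** 2 for v in y)               # total sum of squares
SS_R = sum((h - ybar) ** 2 for h in yhat)            # regression sum of squares
SS_Res = sum((v - h) ** 2 for v, h in zip(y, yhat))  # residual sum of squares
print(SS_T, SS_R + SS_Res)  # equal up to rounding
```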
Under H₀: β₁ = 0, the ratio F* = (SS_R/1) / (SS_Res/(n − 2)) = MS_R/MS_Res follows an F distribution with (1, n − 2) degrees of freedom, so the significance of regression can be tested by comparing F* with F_(α, 1, n−2).
Example 1.5.1 Test for the significance of regression of the model proposed in Example 1.1.2. Use a 5% level of significance.
H₀: β₁ = 0,   H₁: β₁ ≠ 0
(a zero slope means no linear relation between y and x)

Approach 1 (t test):

t₀ = β̂₁ / se(β̂₁) = 0.07876722 / √(177.3297 / (4933199 − 23(6980.65/23)²)) = 9.92333

Since |t₀| = 9.92333 > t_(0.025, 21) ≈ 2.080, we reject H₀ at the 5% level.
Approach 2 (F test):

SS_T = Σᵢ yᵢ² − nȳ² = 35626.09 − (576.3)²/23 = 21186.02

SS_R = β̂₁ S_xy = (0.07876722)(396603.2 − (6980.65)(576.3)/23) = 17462.09

Source of     Sum of     Degrees of     Mean Square   F*
Variation     Squares    Freedom        (SS/df)
Regression    17462.09   1              17462.09      98.47228
Residual      3723.93    23 − 2 = 21    177.33
Total         21186.02   22

F* = (SS_R/df_R) / (SS_Res/df_Res) = MS_R / MS_Res = 98.47228

Since F* = 98.47228 > F_(0.05, 1, 21) ≈ 4.32, we again reject H₀. Note that F* = t₀², since 9.92333² ≈ 98.47228.
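Both test statistics in Example 1.5.1 can be recomputed from the quantities quoted in the text:

```python
import math

# t test and F test for significance of regression (Example 1.5.1),
# using only quantities quoted in the text.
n = 23
b1 = 0.07876722
Sxx = 4933199.0 - 6980.65 ** 2 / n
MS_Res = 177.3297                  # estimate of sigma^2 from Example 1.4.1
SS_R = 17462.09                    # regression sum of squares

se_b1 = math.sqrt(MS_Res / Sxx)    # standard error of the slope
t0 = b1 / se_b1                    # t statistic for H0: beta1 = 0
F0 = (SS_R / 1) / MS_Res           # F statistic with (1, n - 2) df
print(t0, F0)  # F0 equals t0 squared
```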
A 100(1 − α) percent confidence interval on the intercept β₀ is

(β̂₀ − t_(α/2, n−2) se(β̂₀),  β̂₀ + t_(α/2, n−2) se(β̂₀))

and a 100(1 − α) percent confidence interval on the slope β₁ is

(β̂₁ − t_(α/2, n−2) se(β̂₁),  β̂₁ + t_(α/2, n−2) se(β̂₁))

The formulas of the confidence intervals are based on the fact that

(β̂₀ − β₀)/se(β̂₀) ~ t_(n−2)   and   (β̂₁ − β₁)/se(β̂₁) ~ t_(n−2).
The mean response at x = x₀, E(y|x₀) = μ_(y|x₀) = β₀ + β₁x₀, is estimated by

μ̂_(y|x₀) = β̂₀ + β̂₁x₀.

A 100(1 − α) percent confidence interval on μ_(y|x₀) is

(μ̂_(y|x₀) − t_(α/2, n−2) √(MS_Res(1/n + (x₀ − x̄)²/S_xx)),  μ̂_(y|x₀) + t_(α/2, n−2) √(MS_Res(1/n + (x₀ − x̄)²/S_xx)))
The result is based on the fact that μ̂_(y|x₀) is a linear combination of the yᵢ’s and thus a normally distributed random variable; the variance of μ̂_(y|x₀) is

Var(μ̂_(y|x₀)) = Var(β̂₀ + β̂₁x₀)
             = Var(ȳ − β̂₁x̄ + β̂₁x₀)
             = Var(ȳ + β̂₁(x₀ − x̄))
             = Var(ȳ) + Var(β̂₁(x₀ − x̄)) + 2 Cov(ȳ, β̂₁(x₀ − x̄))
             = Var(ȳ) + Var(β̂₁(x₀ − x̄))                        since Cov(ȳ, β̂₁) = 0
             = Var(Σᵢ (β₀ + β₁xᵢ + εᵢ)/n) + (x₀ − x̄)² Var(β̂₁)
             = Var(Σᵢ εᵢ/n) + (x₀ − x̄)²σ²/S_xx
             = σ²/n + (x₀ − x̄)²σ²/S_xx
             = σ²(1/n + (x₀ − x̄)²/S_xx)
To predict the future observation y₀ at x = x₀, we consider the 100(1 − α) percent prediction interval on a future observation at x₀:
(ŷ₀ − t_(α/2, n−2) √(MS_Res(1 + 1/n + (x₀ − x̄)²/S_xx)),  ŷ₀ + t_(α/2, n−2) √(MS_Res(1 + 1/n + (x₀ − x̄)²/S_xx)))

where ŷ₀ = β̂₀ + β̂₁x₀.
The interval follows from

(y₀ − ŷ₀)/√(V̂ar(y₀ − ŷ₀)) = [(y₀ − ŷ₀)/√(Var(y₀ − ŷ₀))] / √(V̂ar(y₀ − ŷ₀)/Var(y₀ − ŷ₀))
                          = [(y₀ − ŷ₀)/√(Var(y₀ − ŷ₀))] / √(MS_Res/σ²) ~ t_(n−2)

where

Var(y₀ − ŷ₀) = Var(y₀) + Var(−ŷ₀) + 2 Cov(y₀, −ŷ₀)
            = Var(y₀) + Var(ŷ₀)                                since y₀ is independent of ŷ₀
            = σ² + σ²/n + (x₀ − x̄)²σ²/S_xx
            = σ²(1 + 1/n + (x₀ − x̄)²/S_xx)

and

V̂ar(y₀ − ŷ₀) = MS_Res(1 + 1/n + (x₀ − x̄)²/S_xx).

Hence, we have

P(−t_(α/2, n−2) ≤ (y₀ − ŷ₀)/√(V̂ar(y₀ − ŷ₀)) ≤ t_(α/2, n−2)) = 1 − α

P(ŷ₀ − t_(α/2, n−2) √(V̂ar(y₀ − ŷ₀)) ≤ y₀ ≤ ŷ₀ + t_(α/2, n−2) √(V̂ar(y₀ − ŷ₀))) = 1 − α
Similarly, a 100(1 − α) percent prediction interval on the mean of m future observations at x = x₀ is

(ŷ₀ − t_(α/2, n−2) √(MS_Res(1/m + 1/n + (x₀ − x̄)²/S_xx)),  ŷ₀ + t_(α/2, n−2) √(MS_Res(1/m + 1/n + (x₀ − x̄)²/S_xx)))
Example 1.6.1 Construct a 95% confidence interval on μ_(y|x₀) and a 95% prediction interval on a future value of a movie with tweet rate x₀ = 100.
ŷ₀ = μ̂_(y|x₀) = β̂₀ + β̂₁x₀ = 1.150157 + (0.07876722)(100) = 9.026879

Confidence interval on the mean response:

(μ̂_(y|x₀) − t_(0.025, 21) √(MS_Res(1/n + (x₀ − x̄)²/S_xx)),  μ̂_(y|x₀) + t_(0.025, 21) √(MS_Res(1/n + (x₀ − x̄)²/S_xx)))
= [2.345137, 15.708621]

Prediction interval:

(ŷ₀ − t_(0.025, 21) √(MS_Res(1 + 1/n + (x₀ − x̄)²/S_xx)),  ŷ₀ + t_(0.025, 21) √(MS_Res(1 + 1/n + (x₀ − x̄)²/S_xx)))
= [−19.46604, 37.51980]
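The two intervals of Example 1.6.1 can be recomputed from the quoted quantities; the quantile t_(0.025, 21) ≈ 2.07961 is taken from standard t tables:

```python
import math

# Confidence and prediction intervals at x0 = 100 (Example 1.6.1).
n = 23
b0, b1 = 1.150157, 0.07876722
MS_Res = 177.3297
xbar = 6980.65 / n
Sxx = 4933199.0 - 6980.65 ** 2 / n
t_crit = 2.07961                   # t quantile for 0.025 tail, 21 df
x0 = 100.0

y0_hat = b0 + b1 * x0              # estimated mean response at x0
h = 1 / n + (x0 - xbar) ** 2 / Sxx
ci = (y0_hat - t_crit * math.sqrt(MS_Res * h),
      y0_hat + t_crit * math.sqrt(MS_Res * h))        # CI on mean response
pi = (y0_hat - t_crit * math.sqrt(MS_Res * (1 + h)),
      y0_hat + t_crit * math.sqrt(MS_Res * (1 + h)))  # prediction interval
print(ci, pi)
```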
The coefficient of determination is

R² = SS_R / SS_T = 1 − SS_Res / SS_T
Example 1.7.1 Calculate the proportion of variation in the movie revenue explained by
the Tweet rate.
R² = SS_R / SS_T = 17462.09 / 21186.02 = 0.8242 = 82.42%
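The ratio in Example 1.7.1 checks out directly:

```python
# Coefficient of determination for Example 1.7.1.
SS_R, SS_T = 17462.09, 21186.02
R2 = SS_R / SS_T
print(R2)  # about 0.8242: 82.42% of revenue variation is explained
```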