0% found this document useful (0 votes)

61 views

10-4 Variation and Prediction Intervals

The document discusses key concepts in correlation and regression analysis including: 1) Explained variation which is accounted for by the relationship between x and y, and unexplained variation which is due to chance or other variables. 2) The coefficient of determination (r2) which indicates the proportion of total variation that is explained by the regression model. 3) The standard error of estimate which measures the accuracy of predicted y-values based on the regression model. 4) Prediction intervals which provide a range of values that future observations of y are likely to fall within based on a given value of x.

Uploaded by

Javed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

61 views

10-4 Variation and Prediction Intervals

Uploaded by

Javed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

10-4 Variation and Prediction Intervals

Explained and unexplained variation

In this section, we study two measures used in correlation and regression studies.
(The coefficient of determination and the standard error of estimate.) We also
learn how to construct a prediction interval for y using a regression line and a
given value of x. To study these concepts, we need to understand and calculate
the total variation, explained deviation, and the unexplained deviation for each
ordered pair in a data set.

Assume that we have a collection of paired data containing the sample point

(x , y), that 𝑦 is the predicted value of y, and that the mean of the sample y-values
is 𝑦.

The total variation about a regression line is the sum of the squares of the
differences between the y-value of each ordered pair and the mean of y.

total variation = (𝒚 − 𝒚)𝟐

The explained variation is the sum of the squared of the differences between
each predicted y-value and the mean of y.

explained variation = (𝒚 − 𝒚)𝟐

The unexplained variation is the sum of the squared of the differences between
the y-value of each ordered pair and each corresponding predicted y-value.

unexplained variation = (𝒚 − 𝒚)𝟐

The sum of the explained and unexplained variations is equal to the total
variation.

Total variation = Explained variation + Unexplained variation

As its name implies, the explained variation can be explained by the relationship
between x and y. The unexplained variation cannot be explained by the
relationship between x and y and is due to chance or other variables.
Consider the advertising and sales data used throughout this section with a
regression line of 𝑦 = 50.729 x + 104.061.

Using the data point (2.0, 220) we can find the total, explained, and unexplained
variation:

The Coefficient of determination

The coefficient of determination r2 is the ratio of the explained variation to the

total variation.
𝑒𝑥𝑝𝑙𝑎𝑖𝑒𝑛𝑑 𝑣𝑎𝑟𝑖𝑎𝑡𝑖𝑜𝑛
𝑟2 =
𝑡𝑜𝑡𝑎𝑙 𝑣𝑎𝑟𝑖𝑎𝑡𝑖𝑜𝑛
We can compute 𝑟 2 by using the definition or by squaring the linear correlation
coefficient r.

Ex 1)

The correlation coefficient for the following advertising expenses and company
sales data is 0.913. Find the coefficient of determination. What does this tell you
about the explained variation of the data about the regression line? About the
unexplained variation? (r= 0.913 suggests a strong positive linear correlation)

𝒓𝟐 = 0.834

About 83.4% of the variation in the company sales can be explained by the
variation in the advertising expenditures. About 16.6% of the variation is
unexplained and is due to chance or other variables.
Advertising expenses Company sales xy x2 y2

(1000s of $), x (1000s of $), y

2.4 225 540 5.76 50,625

1.6 184 294.4 2.56 33,856

2.0 220 440 4 48,400

2.6 240 624 6.76 57,600

1.4 180 252 1.96 32,400

1.6 184 294.4 2.56 33,856

2.0 186 372 4 34,596

2.2 215 473 4.84 46,225

Sums

15.8 1634 3289.8 32.44 337,558

𝑦 =(1634/8) =204.25 𝑥 =(15.8/8)=1.975 ,

The Standard Error of Estimate

The Standard Error of Estimate se is the standard deviation of the observed

y-values about the predicted 𝑦-value for a given x-value. It is given by

(𝑦 − 𝑦 )2
𝑠𝑒 =
𝑛−2

Or as the following equivalent formula:

𝑦 2 − 𝑏0 𝑦 − 𝑏1 𝑥𝑦
𝑠𝑒 =
𝑛−2

Ex 2)

The regression equation of the advertising expenses and company sales data in
example 1) is

𝑦 = 50.729 x + 104.061

Find the standard error of estimate.

x y 𝑦 (𝑦 − 𝑦)2

2.4 225 225.81 0.6561

1.6 184 185.23 1.5129

2.0 220 205.52 209.6704

2.6 240 235.96 16.3216

1.4 180 175.08 24.2064

1.6 184 185.23 1.5129

2.0 186 205.52 381.0304

2.2 215 215.66 0.4356

Sum 635.3463
The standard error of estimate of the company sales for a specific advertising
expense is about $10,290.

In chapter 7, we saw that point estimates will not give us any information about
how accurate they might be. Thus, we developed confidence interval estimates to
overcome this advantage. In this section we follow the same approach to
construct a prediction interval.

A prediction interval is an interval estimate of a predicted value of y.

Given a linear regression equation 𝑦 = 𝑏0 + 𝑏1 𝑥 and x0, a specific value of x, a

prediction interval for y is

𝑦−𝐸 <𝑦 <𝑦+𝐸

Where

1 𝑛 𝑥0 − 𝑥 2
𝐸= 𝑡𝛼 𝑠𝑒 1+ +
2 𝑛 𝑛 𝑥2 − 𝑥 2

With n-2 degrees of freedom.

Ex3)

Using the results of previous example, construct a 95% prediction interval for the
company sales when the advertising expenses are $2100. What can you
conclude?

Module 1 Statistic PDF
No ratings yet
Module 1 Statistic PDF
7 pages
Coefficient of Determination
No ratings yet
Coefficient of Determination
7 pages
05 - Statind2 - Regresi Linier Sederhana Dan Korelasi
No ratings yet
05 - Statind2 - Regresi Linier Sederhana Dan Korelasi
15 pages
2023 Statistics Fin 11
No ratings yet
2023 Statistics Fin 11
19 pages
Module Three: Determining Cause and Making Reliable Forecasts
No ratings yet
Module Three: Determining Cause and Making Reliable Forecasts
44 pages
Chapter 13
No ratings yet
Chapter 13
15 pages
01 SLR Final
No ratings yet
01 SLR Final
37 pages
Antwerpen2014sessie5 (Regression)
No ratings yet
Antwerpen2014sessie5 (Regression)
42 pages
Presentation REGRESSION (9)
No ratings yet
Presentation REGRESSION (9)
26 pages
Regression
100% (1)
Regression
43 pages
Regression (Manual)
No ratings yet
Regression (Manual)
7 pages
L1 QM07 High Yield Notes
No ratings yet
L1 QM07 High Yield Notes
4 pages
Week-4 BA Linear Regression
No ratings yet
Week-4 BA Linear Regression
16 pages
Chap 13 - Correlation and Linear Regression
No ratings yet
Chap 13 - Correlation and Linear Regression
55 pages
Econometrics For Finance
100% (1)
Econometrics For Finance
54 pages
Regression Equation
No ratings yet
Regression Equation
56 pages
Chapter 5 - STATISTICAL TESTS OF THE LEAST SQUARES ESTIMATES
No ratings yet
Chapter 5 - STATISTICAL TESTS OF THE LEAST SQUARES ESTIMATES
10 pages
Linear Regression and Correlation: Mcgraw-Hill/Irwin
No ratings yet
Linear Regression and Correlation: Mcgraw-Hill/Irwin
29 pages
Session 5 Marked B PDF
No ratings yet
Session 5 Marked B PDF
36 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
36 pages
Chapter 13 PowerPoint
No ratings yet
Chapter 13 PowerPoint
36 pages
Linear Regression and Correlation: Mcgraw Hill/Irwin
No ratings yet
Linear Regression and Correlation: Mcgraw Hill/Irwin
37 pages
Lecture 12
No ratings yet
Lecture 12
47 pages
Linear Regression Full Version
No ratings yet
Linear Regression Full Version
34 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
49 pages
Course 6 Econometrics Regression
No ratings yet
Course 6 Econometrics Regression
6 pages
Statistical Inference AP TV
No ratings yet
Statistical Inference AP TV
20 pages
10 CH 10 Linear Regression and Correlation
No ratings yet
10 CH 10 Linear Regression and Correlation
92 pages
Regrion
No ratings yet
Regrion
19 pages
Linier Regression
No ratings yet
Linier Regression
19 pages
Correlation and Regression Analysis
No ratings yet
Correlation and Regression Analysis
8 pages
linearregression-Rupak_(1)
No ratings yet
linearregression-Rupak_(1)
32 pages
Business Stat 10 12 .PDF
No ratings yet
Business Stat 10 12 .PDF
144 pages
Lecture 8 Correlation and Linear Regression
No ratings yet
Lecture 8 Correlation and Linear Regression
66 pages
Regression Analysis
No ratings yet
Regression Analysis
5 pages
STB1003_Unit-3 bsc
No ratings yet
STB1003_Unit-3 bsc
12 pages
M. Amir Hossain PHD: Course No: Emba 502: Business Mathematics and Statistics
No ratings yet
M. Amir Hossain PHD: Course No: Emba 502: Business Mathematics and Statistics
31 pages
Business Statistics: A First Course: Simple Linear Regression
No ratings yet
Business Statistics: A First Course: Simple Linear Regression
65 pages
Unit 5
No ratings yet
Unit 5
34 pages
Ch. 8 Measures of Association
No ratings yet
Ch. 8 Measures of Association
8 pages
Lab Report 4 (2017MC75) PS
No ratings yet
Lab Report 4 (2017MC75) PS
3 pages
The Bucharest University of Economic Studies Bucharest Business School Romanian - French INDE MBA Program
No ratings yet
The Bucharest University of Economic Studies Bucharest Business School Romanian - French INDE MBA Program
67 pages
Chapter13 MAS202
No ratings yet
Chapter13 MAS202
32 pages
The Simple Linear Regression Model and Correlation
100% (1)
The Simple Linear Regression Model and Correlation
64 pages
6034 - Classical Linear Regression Model
No ratings yet
6034 - Classical Linear Regression Model
30 pages
Regression Equation For SI
No ratings yet
Regression Equation For SI
12 pages
ANOVA Table and Prediction Intervals
No ratings yet
ANOVA Table and Prediction Intervals
7 pages
12 W12NSE6220 - Fall 2023 - Zeng
No ratings yet
12 W12NSE6220 - Fall 2023 - Zeng
44 pages
Chapter 4 Demand Estimation
No ratings yet
Chapter 4 Demand Estimation
9 pages
AP Stats 3.2
No ratings yet
AP Stats 3.2
57 pages
09 Inference For Regression Part1
No ratings yet
09 Inference For Regression Part1
12 pages
Econometrics for Finace Lecture II-Session Three
No ratings yet
Econometrics for Finace Lecture II-Session Three
32 pages
Regression
No ratings yet
Regression
46 pages
CH 14
No ratings yet
CH 14
12 pages
4-Biol 605-Regression Models (1)
No ratings yet
4-Biol 605-Regression Models (1)
25 pages
Simple LR Lecture
No ratings yet
Simple LR Lecture
60 pages
Simple LR Lecture
No ratings yet
Simple LR Lecture
60 pages
Regression Analysis
No ratings yet
Regression Analysis
9 pages
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
From Everand
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
Stuart A. Klugman
4/5 (1)
Solutions Manual to accompany Introduction to Linear Regression Analysis
From Everand
Solutions Manual to accompany Introduction to Linear Regression Analysis
Douglas C. Montgomery
1/5 (1)
MRM301T
No ratings yet
MRM301T
2 pages
Growth and Survival Rates of Narra Trees (Pterocarpus Indicus) in Seed Ball Substrate
No ratings yet
Growth and Survival Rates of Narra Trees (Pterocarpus Indicus) in Seed Ball Substrate
13 pages
Pre Experimental and True Experimental Research Design
No ratings yet
Pre Experimental and True Experimental Research Design
20 pages
Part A
No ratings yet
Part A
4 pages
Factor Analysis True/False Questions
100% (1)
Factor Analysis True/False Questions
3 pages
PSY 201 L6 Factorial Designs
No ratings yet
PSY 201 L6 Factorial Designs
43 pages
Session 1 Handin 2023
No ratings yet
Session 1 Handin 2023
2 pages
Table of Contents
No ratings yet
Table of Contents
5 pages
Time Series Forecasting Business Report
No ratings yet
Time Series Forecasting Business Report
42 pages
Chapter5 - Hypothesis Testing and Statistical Inference
No ratings yet
Chapter5 - Hypothesis Testing and Statistical Inference
50 pages
Testing of Hypothesis For Large Sample
No ratings yet
Testing of Hypothesis For Large Sample
11 pages
Data Mining Notes Unit 4
No ratings yet
Data Mining Notes Unit 4
30 pages
87-Article Text-285-1-10-20220108
No ratings yet
87-Article Text-285-1-10-20220108
20 pages
Quili Methods
29% (7)
Quili Methods
82 pages
Chapter Two
50% (2)
Chapter Two
13 pages
Methods in Behavioral Research 12th Edition (eBook PDF)instant download
100% (2)
Methods in Behavioral Research 12th Edition (eBook PDF)instant download
54 pages
Correlation and Regression
No ratings yet
Correlation and Regression
22 pages
Quiz 3 (Solution) Done
No ratings yet
Quiz 3 (Solution) Done
5 pages
AP Statistics HW - Unit 1 MC
No ratings yet
AP Statistics HW - Unit 1 MC
3 pages
Loyola College (Autonomous), Chennai - 600 034: B.SC - Degree Examination - Statistics ST 5509-Regression Analysis
No ratings yet
Loyola College (Autonomous), Chennai - 600 034: B.SC - Degree Examination - Statistics ST 5509-Regression Analysis
2 pages
Template Critical Appraisal
No ratings yet
Template Critical Appraisal
9 pages
3-Applying multiple linear Regression
No ratings yet
3-Applying multiple linear Regression
5 pages
Chapter 9, Hypothesis Testing - Edit
No ratings yet
Chapter 9, Hypothesis Testing - Edit
54 pages
Elementary Statistics 11 th Edition Robert Johnson - Download the ebook now to start reading without waiting
100% (3)
Elementary Statistics 11 th Edition Robert Johnson - Download the ebook now to start reading without waiting
56 pages
7 Training Evaluation Report
No ratings yet
7 Training Evaluation Report
6 pages
End-to-End Machine Learning Project (Bootcamp)
No ratings yet
End-to-End Machine Learning Project (Bootcamp)
415 pages
Chapter-4-Revised
No ratings yet
Chapter-4-Revised
5 pages
CHP 3 Notes, Gujarati
No ratings yet
CHP 3 Notes, Gujarati
4 pages
Chi-Square Assignment
No ratings yet
Chi-Square Assignment
4 pages

10-4 Variation and Prediction Intervals

Uploaded by

10-4 Variation and Prediction Intervals

Uploaded by

10-4 Variation and Prediction Intervals

Explained and unexplained variation

total variation = (𝒚 − 𝒚)𝟐

explained variation = (𝒚 − 𝒚)𝟐

unexplained variation = (𝒚 − 𝒚)𝟐

Total variation = Explained variation + Unexplained variation

The Coefficient of determination

The coefficient of determination r2 is the ratio of the explained variation to the

(1000s of $), x (1000s of $), y

2.4 225 540 5.76 50,625

1.6 184 294.4 2.56 33,856

2.0 220 440 4 48,400

2.6 240 624 6.76 57,600

1.4 180 252 1.96 32,400

1.6 184 294.4 2.56 33,856

2.0 186 372 4 34,596

2.2 215 473 4.84 46,225

15.8 1634 3289.8 32.44 337,558

𝑦 =(1634/8) =204.25 𝑥 =(15.8/8)=1.975 ,

The Standard Error of Estimate

The Standard Error of Estimate se is the standard deviation of the observed

y-values about the predicted 𝑦-value for a given x-value. It is given by

Or as the following equivalent formula:

Find the standard error of estimate.

2.4 225 225.81 0.6561

1.6 184 185.23 1.5129

2.0 220 205.52 209.6704

2.6 240 235.96 16.3216

1.4 180 175.08 24.2064

1.6 184 185.23 1.5129

2.0 186 205.52 381.0304

2.2 215 215.66 0.4356

A prediction interval is an interval estimate of a predicted value of y.

Given a linear regression equation 𝑦 = 𝑏0 + 𝑏1 𝑥 and x0, a specific value of x, a

𝑦−𝐸 <𝑦 <𝑦+𝐸

With n-2 degrees of freedom.

You might also like