0% found this document useful (0 votes)

7 views40 pages

QBM101 Chapter10

Uploaded by

bonachinh111

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views40 pages

QBM101 Chapter10

Uploaded by

bonachinh111

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 40

QBM 101 Business Statistics

Department of Business Studies

Faculty of Business, Economics & Accounting
HELP University
SUBJECT OUTLINE:
 Module 1: Introduction; organizing
and graphing data; numerical
descriptive measures

 Module 2: Probability, discrete random

variables; continuous random variables
and the normal distribution

 Module 3: Sampling distributions;

estimation; hypothesis testing

 Module 4: Simple linear regression

CHAPTER 10:
SIMPLE LINEAR REGRESSION
 10.1 Simple linear regression
 10.2 Standard deviation of errors
and coefficient of determination
 10.3 Inferences about B
 10.4 Linear correlation
 10.5 Regression analysis: A complete
example
 10.6 Interpretation of Excel output
A regression model is a mathematical equation
that describes the relationship between two or
more variables. A simple regression model
includes only two variables: one independent and
one dependent. The dependent variable is the one
being explained, and the independent variable is
the one used to explain the variation in the
dependent variable.

A (simple) regression model that gives a straight-

line relationship between two variables is called a
linear regression model.
Regression: describing the nature of relationship
between variables – positive, negative, linear, or
nonlinear.
Correlation: determining whether a relationship
between variables exists

Questions: Are the two variables related? If so,

what is the strength? What kind of relationship?
What prediction can be made?
Examples: Height and weight of human, number
of cigarettes smoked vs weights of infants;
time spent on studying and exam marks.
 Dependent variable (DV) (y, the one being
explained) vs. independent variable (IV) (x, used
to explain the variation).
 Simple (only 1 IV) vs. multiple (> 1 IV)
regression
 Linear (straight-line relationship) vs. nonlinear
regression
SIMPLE LINEAR REGRESSION ANALYSIS

In the regression model y = A + Bx + ε, A

is called the y-intercept or constant term, B
is the slope, and ε is the random error term.
The dependent and independent variables
are y and x, respectively.

In the model ŷ = a + bx, a and b, which are

calculated using sample data, are called the
estimates of A and B, respectively.
SCATTER PLOT/DIAGRAM
ERROR SUM OF SQUARE (SSE)
The error sum of squares, denoted SSE, is

SSE   e 2   ( y  yˆ )2

The values of a and b that give the minimum

SSE are called the least square estimates of A
and B, and the regression line obtained with
these estimates is called the least squares line.
Least square/best-fit line:
yˆ  a  bx
  x
2

SS xx   x 2

n
  y
2

SS yy   y 2

n

SS xy   xy 
  x   y 
n
SS xy
b
SS xx
a  y  bx
Least square/best-fit line:

x
 x  386  55.1429, y   y  108  15.4286
n 7 n 7

SS xy   xy 
  x   y   6403   386 108  447.5714
n 7
 x
2
(386) 2
SS xx   x 
2
 23058   1772.8571
n 7
SS xy 447.5714
b   0.2525
SS xx 1772.8571
a  y  bx  15.4286  (0.2525)(55.1429)  1.5050
yˆ  a  bx  1.5050  0.2525x
Least square/best-fit line (estimation and its
reliability):
SS xy 447.5714
b   0.2525
SS xx 1772.8571
a  y  bx  15.4286  (0.2525)(55.1429)  1.5050
yˆ  a  bx  1.5050  0.2525 x
Estimate the amount of food expenditures when the income is $6100.
yˆ  a  bx  1.5050  0.2525(61)  $16.9075 hundred  $1690.75
Error, e  y  yˆ  16  16.9075  $0.9075 hundred  $90.75
Estimate the amount of food expenditures when the income is $6000.
yˆ  a  bx  1.5050  0.2525(60)  $16.655 hundred  $1665.50
The estimation is reliable because 60  (33,83)
Estimate the amount of food expenditures when the income is $2000.
yˆ  a  bx  1.5050  0.2525(20)  $6.555 hundred  $655.50
The estimation is not reliable because 20  (33,83) *Extrapolation
ERROR OF PREDICTION
Least square/best-fit line (interpretation of
regression coefficients):

yˆ  a  bx  1.5050  0.2525 x
y  intercept, a  1.5050
A family with RM 0 income will
spend RM1.5050 hundred
=RM150.50 on food.
Slope coefficient, b  0.2525
For every one unit (RM100) of increment
in income, the expenditure on food will
increase by RM0.2525 hundred = RM25.25.
Degrees of Freedom for a Simple Linear
Regression Model

The degrees of freedom for a simple linear

regression model are

df = n – 2
Standard deviation of errors:

  is estimated by se
SSE
se  , where SSE   ( y  yˆ ) 2

n2
df  n  2
SS yy  bSS xy
se 
n2
Standard deviation of errors:

SS xy 447.5714
b   0.2525
SS xx 1772.8571

SS xy   xy 
  x   y   6403   386 108  447.5714
n 7
 y
2
(108) 2
SS yy   y 2
  1792   125.1743
n 7
SS yy  bSS xy 125.1743  (0.2525)(447.5714)
se    1.5939
n2 72
Coefficient of determination (COD)

bSS xy
r 
2
,0  r 1
2

SS yy
b  0.2525, SS xy  447.5714, SS yy  125.7143
bSS xy 0.2525(447.5714)
r 
2
  0.899  89.9%
SS yy 125.7143
Interpretation: 89.9% of the total variation in food expenditures
of household can be explained by the variation in incomes, and
the remaining 10.1% is due to randomness and other variables.
Coefficient of correlation (COC)

SS xy
r , 1  r  1
SS xx SS yy
SS xx  1772.8571, SS xy  447.5714, SS yy  125.7143
SS xy 447.5714
r   0.9481
SS xx SS yy 1772.8571125.7143
Interpretation: Positive or negative sign/correlated.
Very weak, average/moderate, strong, very strong
r  0.9481: very strong and positively correlated
Other example:
r  0.1111: very weak and negatively correlated
bB se
Test statistic: tcalc  , df  n  2, sb 
sb SS xx
H0 : B  0
H1 : B  0 (two-tailed test)
B  0 (positive), B  0 (negative) (one-tailed test)
  is unknown, use the t distribution.
HT about the slope coefficient, B
Test at the 1% significance level whether the
slope of the regression line is positive.
H 0 : B  0, H1 : B  0 (one-tailed test)
  0.01
df  n  2  7  2  5
b  B 0.2525  0
tcalc    6.662
sb 0.0379
tcritical  t ,n  2  t0.01,5  3.365
tcritical  3.365  tcalc  6.662
Reject H 0 . There is sufficient evidence to conclude
that the slope is positive, or, income determines
food expenditure positively.
A random sample of eight drivers selected from a small city
insured with a company and having similar minimum
required auto insurance policies was selected. The following
table lists their driving experiences (in years) and monthly
auto insurance premiums (in dollars).
Regression Analysis: A Complete Example

(a) IV and DV. Do you expect a positive or negative relationship?

(b) Compute SS xx , SS yy , and SS xy .
(c) Find the least square regression line.
(d) Interpret the regression coefficients in (c).
(e) Calculate the COC and COD. Interpret their meanings.
(f) Predict the monthly premium for a driver with 10 years of experience.
Comment on the reliability of the estimation.
(g) Compute the standard deviation of errors.
(h) Test at a 5% significance level whether B is negative.
Regression Analysis: A Complete Example
(a) IV: Driving experience, DV: Monthly auto insurance premium
A negative linear relationship.
Regression Analysis: A Complete Example

(b) x 
 x 90
  11.25, y 
 y 474
  59.25
n 8 n 8

SS xy   xy 
  x   y 
 4739 
(90)(474)
 593.5
n 8
  x
2
(90) 2
SS xx   x 2
  1396   383.5
n 8
 y
2
(474) 2
SS yy   y 2   29, 642   1557.5
n 8

SS xy 593.5
(c) b    1.5476
SS xx 383.5
a  y  bx  59.25  (1.5476)(11.25)  76.6605
yˆ  a  bx  76.6605  1.5476 x
Regression Analysis: A Complete Example
(d) yˆ  a  bx  76.6605  1.5476 x
y  intercept, a  76.6605
A driver with 0 years of driving experience will need to pay
a monthly premium of $76.66.
Slope coefficient, b  1.5476
For every one extra year of driving experience, the monthyly
premium will decrease by $1.55.
SS xy 593.5
(e) COC, r    0.7679
SS xx SS yy (383.5)(1557.5)
A moderately strong and negatively correlation.
bSS xy (1.5476)(593.5)
r 
2
  0.5897
SS yy 1557.5
Alternative: COD,r 2   0.7679   0.5897
2

58.97% of the variation in monthly premium can be explained by

driving experience, whereas the remaining 41.03% is due to
randomness and other unaccounted factors.
Regression Analysis: A Complete Example

(f) yˆ (10)  76.6605  1.5476(10)  $61.18

The estimstion is reliable because 10  (2,25).

SS yy  bSS xy 1557.5  (1.5476)(593.5)

(g) se    10.3199
n2 82
Regression Analysis: A Complete Example
(h) H 0 : B  0, H1 : B  0
  0.05, df  n  2  8  2  6
b  B 1.5476  0 1.5476  0
tcalc     2.937
sb 10.3199 0.5270
383.5
tcritical  t ,df  t0.05,6  1.943
tcalc  2.937  tcritical  1.943
Reject H 0 . There is sufficient evidence to conclude that the slope is negative.
The hypothesis test on B can be
performed using the p-value approach,
using the output obtained from
statistical software.
EXCEL OUTPUT

Source: http://www.excel-easy.com/examples/regression.html
EXCEL
EXCEL
EXCEL
SUMMARY
 Identify IV (x) and DV (y)
 Calculate SS of xx, yy, and xy

 Determine the best fit line

 Calculate and interpret regression coefficients

 Calculate and interpret COC and COD

 Estimate and comment on its reliability

 Hypothesis test on B (critical value approach

using manual calculation, or p-value
approach from the Excel output)
 Finding missing values from the given Excel
output

Topic 6 Simple Linear Regression
No ratings yet
Topic 6 Simple Linear Regression
57 pages
Bio IA - Kwok
No ratings yet
Bio IA - Kwok
12 pages
Manele in Romania 2015
No ratings yet
Manele in Romania 2015
2 pages
QBM 101 Lecture 10
No ratings yet
QBM 101 Lecture 10
45 pages
Lecture 8 Correlation and Linear Regression
No ratings yet
Lecture 8 Correlation and Linear Regression
66 pages
Regression Models - Follow
No ratings yet
Regression Models - Follow
7 pages
Lecture 6 Correlation and Regression
No ratings yet
Lecture 6 Correlation and Regression
10 pages
Corelation and Regression
No ratings yet
Corelation and Regression
137 pages
Correlation and Regression
No ratings yet
Correlation and Regression
10 pages
Linear Regression Lecture
No ratings yet
Linear Regression Lecture
18 pages
Regression Analysis
No ratings yet
Regression Analysis
21 pages
Regression Analysis
No ratings yet
Regression Analysis
22 pages
Chapter 14 Multiple Regression and Correlation Analysis
No ratings yet
Chapter 14 Multiple Regression and Correlation Analysis
25 pages
Regression Analysis
No ratings yet
Regression Analysis
65 pages
BES - Lecture 10 - Simple Linear Regression
No ratings yet
BES - Lecture 10 - Simple Linear Regression
15 pages
Chap 014
No ratings yet
Chap 014
20 pages
Simple Linear Regression and Correlation
No ratings yet
Simple Linear Regression and Correlation
50 pages
The Bucharest University of Economic Studies Bucharest Business School Romanian - French INDE MBA Program
No ratings yet
The Bucharest University of Economic Studies Bucharest Business School Romanian - French INDE MBA Program
67 pages
9 Regression (Statistics IEM 2-2)
No ratings yet
9 Regression (Statistics IEM 2-2)
32 pages
Chapter No 11 (Simple Linear Regression)
No ratings yet
Chapter No 11 (Simple Linear Regression)
3 pages
CH 2
No ratings yet
CH 2
31 pages
Regression and Correlation
No ratings yet
Regression and Correlation
14 pages
Simple Regression and Correlation
No ratings yet
Simple Regression and Correlation
30 pages
Chapter 13
No ratings yet
Chapter 13
129 pages
Multivariate Ana
No ratings yet
Multivariate Ana
20 pages
Chapter 13
No ratings yet
Chapter 13
108 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
64 pages
Lecture-3---Linear-Regression-imran-20022025-092939am
No ratings yet
Lecture-3---Linear-Regression-imran-20022025-092939am
46 pages
Chapter 3 - Classical Simple Linear Regression
No ratings yet
Chapter 3 - Classical Simple Linear Regression
52 pages
03 - Simple Linear Regression
No ratings yet
03 - Simple Linear Regression
13 pages
01 SLR Final
No ratings yet
01 SLR Final
37 pages
Chap 10 Regression Analysis
No ratings yet
Chap 10 Regression Analysis
68 pages
Ch13 ZKH3 Final ZkH3
No ratings yet
Ch13 ZKH3 Final ZkH3
96 pages
CH 08
No ratings yet
CH 08
13 pages
5 Chapter Fi
No ratings yet
5 Chapter Fi
29 pages
Regression and Correlation
No ratings yet
Regression and Correlation
13 pages
Lecture 12
No ratings yet
Lecture 12
47 pages
Regression Models Notes
No ratings yet
Regression Models Notes
13 pages
Chapter 17
No ratings yet
Chapter 17
31 pages
Topic - chapter 12 - Regression models
No ratings yet
Topic - chapter 12 - Regression models
1 page
Simple Regression
100% (1)
Simple Regression
50 pages
01 - Simple Linear Regression
No ratings yet
01 - Simple Linear Regression
24 pages
Regression Models Course Notes
No ratings yet
Regression Models Course Notes
102 pages
Correlation & Regression Analysis
100% (1)
Correlation & Regression Analysis
39 pages
Linear correlation and linear regression
No ratings yet
Linear correlation and linear regression
37 pages
13 Predictive Analysis - Tests of Association- Regression
No ratings yet
13 Predictive Analysis - Tests of Association- Regression
70 pages
Regression
No ratings yet
Regression
3 pages
8-Simple Regression Analysis
No ratings yet
8-Simple Regression Analysis
9 pages
Brief Lecture Notes On Simple Linear Regression Regression Analysis
No ratings yet
Brief Lecture Notes On Simple Linear Regression Regression Analysis
8 pages
Regression Equation For SI
No ratings yet
Regression Equation For SI
12 pages
Simple Regression
No ratings yet
Simple Regression
35 pages
Regression Student
No ratings yet
Regression Student
20 pages
Simple Lin Regress Inference
No ratings yet
Simple Lin Regress Inference
51 pages
Regression and Factor
No ratings yet
Regression and Factor
95 pages
Regression Models for Data Science in R
No ratings yet
Regression Models for Data Science in R
137 pages
Chapter Fourteen: Multiple Regression and Correlation Analysis
No ratings yet
Chapter Fourteen: Multiple Regression and Correlation Analysis
27 pages
QMM Epgdm 5
No ratings yet
QMM Epgdm 5
58 pages
06 Least Squar Regression
No ratings yet
06 Least Squar Regression
25 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
36 pages
Coefficient of Determination
No ratings yet
Coefficient of Determination
7 pages
Shortcuts to College Calculus Refreshment Kit
From Everand
Shortcuts to College Calculus Refreshment Kit
Juan Acevedo
No ratings yet
Student Solutions Manual to Accompany Economic Dynamics in Discrete Time, secondedition
From Everand
Student Solutions Manual to Accompany Economic Dynamics in Discrete Time, secondedition
Yue Jiang
4.5/5 (2)
Chapter 6 Supplement: Transportation and Assignment Solution Procedures
90% (10)
Chapter 6 Supplement: Transportation and Assignment Solution Procedures
49 pages
HOW TO BUILD AN ARGUMENT
No ratings yet
HOW TO BUILD AN ARGUMENT
2 pages
Syllabus Financial Modeling 22.08.2022 FINAL
0% (1)
Syllabus Financial Modeling 22.08.2022 FINAL
7 pages
Writing Literature Reviews Jose Galvan
100% (1)
Writing Literature Reviews Jose Galvan
8 pages
Review 2007
No ratings yet
Review 2007
96 pages
Implementation: E-Rupi
No ratings yet
Implementation: E-Rupi
19 pages
Delm 116
100% (2)
Delm 116
9 pages
How Is The Satisficing Decision Maker Best
No ratings yet
How Is The Satisficing Decision Maker Best
7 pages
4 5854898563208709653
No ratings yet
4 5854898563208709653
38 pages
RMG Assignment
No ratings yet
RMG Assignment
27 pages
Testbank for POWER Learning and Your Life Essentials of Student Success 5th Edition Feldman Instant Download
No ratings yet
Testbank for POWER Learning and Your Life Essentials of Student Success 5th Edition Feldman Instant Download
18 pages
Online Reading
No ratings yet
Online Reading
69 pages
Nabling Rocesses: Governance Practice Inputs Outputs EDM03.01 Evaluate Risk Management. From Description Description To
0% (1)
Nabling Rocesses: Governance Practice Inputs Outputs EDM03.01 Evaluate Risk Management. From Description Description To
2 pages
Aurora National Science High School
No ratings yet
Aurora National Science High School
4 pages
Pimpari-Chinchwad Parisaratil Balgunhegari: Ek Samajik Va Arthik Samasya
No ratings yet
Pimpari-Chinchwad Parisaratil Balgunhegari: Ek Samajik Va Arthik Samasya
10 pages
An Eye For An Eye in The Electronic Age - Gauging
No ratings yet
An Eye For An Eye in The Electronic Age - Gauging
19 pages
Defining The Relationship Between Fine Motor Visual-Spatial Integration and Reading and Spelling
No ratings yet
Defining The Relationship Between Fine Motor Visual-Spatial Integration and Reading and Spelling
22 pages
Oxford Textbook - Chapters 7 and 8 - Worked Solutions-4
No ratings yet
Oxford Textbook - Chapters 7 and 8 - Worked Solutions-4
32 pages
L7 SCNM Assignment Guide 2024
No ratings yet
L7 SCNM Assignment Guide 2024
9 pages
SBL SD20 Examiner's Report
No ratings yet
SBL SD20 Examiner's Report
16 pages
Chapter 1-2-3 Final
No ratings yet
Chapter 1-2-3 Final
20 pages
Interval Estimation
No ratings yet
Interval Estimation
33 pages
Applied Social Research A Tool For The Human Services 10th Edition Timothy P Hilton Peter R Fawson Thomas J Sullivan Cornell R Dejong Download PDF
100% (3)
Applied Social Research A Tool For The Human Services 10th Edition Timothy P Hilton Peter R Fawson Thomas J Sullivan Cornell R Dejong Download PDF
23 pages
Lab 5 InstruLecture
No ratings yet
Lab 5 InstruLecture
13 pages
MSU Study: Student Appearance and Academic Performance
No ratings yet
MSU Study: Student Appearance and Academic Performance
38 pages
A.S.P.E.N. Enteral Nutrition Practice Recommendations
No ratings yet
A.S.P.E.N. Enteral Nutrition Practice Recommendations
8 pages
Mtech Transportation Punjabi Uni Syllabus 2018 20
No ratings yet
Mtech Transportation Punjabi Uni Syllabus 2018 20
31 pages
Guidelines For SIP
No ratings yet
Guidelines For SIP
24 pages

QBM101 Chapter10

Uploaded by

QBM101 Chapter10

Uploaded by

QBM 101 Business Statistics

Department of Business Studies

 Module 2: Probability, discrete random

 Module 3: Sampling distributions;

 Module 4: Simple linear regression

A (simple) regression model that gives a straight-

Questions: Are the two variables related? If so,

In the regression model y = A + Bx + ε, A

In the model ŷ = a + bx, a and b, which are

The values of a and b that give the minimum

The degrees of freedom for a simple linear

(a) IV and DV. Do you expect a positive or negative relationship?

58.97% of the variation in monthly premium can be explained by

(f) yˆ (10)  76.6605  1.5476(10)  $61.18

SS yy  bSS xy 1557.5  (1.5476)(593.5)

 Determine the best fit line

 Calculate and interpret regression coefficients

 Calculate and interpret COC and COD

 Estimate and comment on its reliability

 Hypothesis test on B (critical value approach

You might also like