0% found this document useful (0 votes)

3 views

STAT2507.Chapter3Part2.W22

Chapter 3 – Part 2 discusses numerical measures for quantitative bivariate data, focusing on dependent and independent variables, covariance, correlation coefficients, and least-squares regression lines. It explains how to predict one variable based on another, compute sample covariance and correlation coefficients, and derive the least-squares regression line for linear relationships. The chapter includes practical exercises to apply these concepts using real data on blood alcohol content and beer consumption.

Uploaded by

bilalpasha1528910a

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

STAT2507.Chapter3Part2.W22

Uploaded by

bilalpasha1528910a

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

Chapter 3 – Part 2

Numerical Measures for

Quantitative Bivariate Data

© 2022 Wayne Horn (excluding images)

Outline
▪ In Chapter 3 – Part 2, we will discuss the following topics:
▪ The Dependent Variable and Independent Variable
▪ Covariance
▪ The Correlation Coefficient
▪ The Least-Squares Regression Line

2
The Dependent Variable and Independent Variable
▪ We are usually interested in investigating the relationship between
two quantitative variables because we wish to predict the value of
one variable based on the value of the other variable.
▪ Examples:
▪ A real estate agent wants to predict the selling price of a house
based on the number of bedrooms.
▪ A personal trainer wants to predict the number of calories
somebody burns based on the total time spent exercising.
▪ A student wants to predict their final exam grade for a course
based on their midterm exam grade.

3
The Dependent Variable and Independent Variable
▪ The variable whose values we want to predict is referred to as the
dependent variable and is denoted by Y.
▪ The variable on which we base our predictions is referred to as the
independent variable and is denoted by X.
▪ As we saw earlier, a scatterplot of the observed (x, y) values will
show us the nature and strength of the relationship between X and Y.
▪ In this chapter, we will focus our attention on variables that have an
approximately linear relationship.

4
Population Covariance
▪ The population covariance of two variables X and Y, denoted by  XY ,
is a measure of the linear dependence between X and Y.
▪ Covariance can assume any real value.
▪ If the covariance is positive, then as one variable changes the other
variable tends to move in the same direction.
▪ If the covariance is negative, then as one variable changes the other
variable tends to move in the opposite direction.

5
Sample Covariance
▪ In order to estimate the population covariance, we take a random
sample of n pairs of values

(x1, y1), (x2, y2), …, (xn, yn)

and compute the sample covariance sxy as follows:

 n  n 
n x i y i 
 i =1   i =1 
n

 ( x i − x )( y i − y )  x y
i i −
n
s xy = i =1
= i =1
n −1 n −1
6
Exercise 1
In the undergraduate statistics project Beers BAC Beers BAC
at Ohio State University, the relationship 5 0.100 3 0.020
between blood alcohol content (BAC)
2 0.030 5 0.050
and the number of beers consumed
appear to have an approximately 9 0.190 4 0.070
linear relationship. 8 0.120 6 0.100
Compute and interpret the sample 3 0.040 5 0.085
covariance between BAC and the 7 0.095 7 0.090
number of beers consumed. 3 0.070 1 0.010
5 0.060 4 0.050

7
Exercise 1

8
0.20

0.15
xi − x
(x i , y i )

yi −y
BAC

0.10

y = 0.07375

0.05

x = 4.8125
0.00
0 1 2 3 4 5 6 7 8 9
Beers

9
Population Correlation Coefficient
▪ The population correlation coefficient of two random variables X and
Y, denoted by  , is computed as
 xy
= .
 x y
▪ The correlation coefficient is a unitless measure of the direction and
strength of the linear dependence between X and Y.

10
Population Correlation Coefficient
▪ The population correlation coefficient always equals a value between
–1 and 1, inclusive.

–1 0 +1

perfect negative no linear perfect positive

linear relationship relationship linear relationship

11
Sample Correlation Coefficient
▪ In order to estimate the population correlation coefficient, we take a
random sample of n pairs of values
(x1, y1), (x2, y2), …, (xn, yn)
and compute the sample correlation coefficient r as follows:

s xy
r= .
s xs y
▪ The sample correlation coefficient always equals a value between –1
and 1, inclusive.

12
Examples of the Sample Correlation Coefficient

16
30.0

14
27.5
12

25.0
10

8 22.5
y

y
6 20.0

4
17.5

2
15.0
0

0 1 2 3 4 5 0 1 2 3 4
x x

r=1 r = –1
perfect positive perfect negative
linear relationship linear relationship
13
Examples of the Sample Correlation Coefficient

35
90

30 80

70
25
y

y
60
20

15
40

10 30
0 1 2 3 4 0 1 2 3 4
x x

r = –0.874 r = 0.408
strong negative moderate positive
linear relationship linear relationship
14
Examples of the Sample Correlation Coefficient

60
30
50

40
25
30

20 20

y
y

0 15

-10
10
-20

-30
0 1 2 3 4 5 10 15 20 25 30 35
x x

r = 0.165 r = –0.009
weak positive no discernible
linear relationship linear relationship
15
Examples of the Sample Correlation Coefficient

y
54

7 8 9 10 11 12 13
x

r=0
perfect quadratic relationship

16
Exercise 2
In the undergraduate statistics project Beers BAC Beers BAC
at Ohio State University, the relationship 5 0.100 3 0.020
between blood alcohol content (BAC)
2 0.030 5 0.050
and the number of beers consumed
appear to have an approximately 9 0.190 4 0.070
linear relationship. 8 0.120 6 0.100
Compute and interpret the sample 3 0.040 5 0.085
correlation coefficient between BAC 7 0.095 7 0.090
and the number of beers consumed. 3 0.070 1 0.010
5 0.060 4 0.050

17
Exercise 1

18
The Least-Squares Regression Line
▪ If X and Y appear to have an approximately linear relationship, then
we can approximate the relationship using the equation of a line
Y = a + bX .
▪ The value a is the y-intercept. It is the value of Y when X= 0.
▪ The value b is the slope. It is the amount by which Y will change if we
increase X by 1 unit.
▪ The best-fitting line relating Y to X is called the least-squares
regression line.

19
The Least-Squares Regression Line
▪ The least-squares regression line is found by minimizing the sum of
the squared vertical differences between the data points and the line.
y
. .
. . Y = a + bX

. . .
. . .
x
20
The Least-Squares Regression Line
▪ The least-square regression line is Y = a + bX , where

sy
b=r and a = y − bx .
sx

▪ Note: Since sy and sx are both positive, b and r have the same sign.

21
Exercise 3
In the undergraduate statistics project Beers BAC Beers BAC
at Ohio State University, the relationship 5 0.100 3 0.020
between blood alcohol content (BAC)
2 0.030 5 0.050
and the number of beers consumed
appear to have an approximately 9 0.190 4 0.070
linear relationship. 8 0.120 6 0.100
(a) Find the least-squares regression line 3 0.040 5 0.085
for predicting BAC based on the number 7 0.095 7 0.090
of beers consumed. 3 0.070 1 0.010
(b) Predict the BAC for a student who 5 0.060 4 0.050
consumed 5 beers.
22
Exercise 1

23
Exercise 3
Fitted Line Plot
BAC = - 0.01270 + 0.01796 Beers
0.20

0.15
BAC

0.10

0.05

0.00
0 1 2 3 4 5 6 7 8 9
Beers

Chapter 6 - Correlation and Regression
No ratings yet
Chapter 6 - Correlation and Regression
9 pages
MetNum1 2023 1 Week 13
No ratings yet
MetNum1 2023 1 Week 13
70 pages
Regression and Correlation
No ratings yet
Regression and Correlation
54 pages
Part 2 Exploring Relationships Among Variables
No ratings yet
Part 2 Exploring Relationships Among Variables
8 pages
Statistics Overview Part II
No ratings yet
Statistics Overview Part II
29 pages
Linear Regression and Correlation
No ratings yet
Linear Regression and Correlation
35 pages
A2.2-2.5andS4.1 (1) (1)
No ratings yet
A2.2-2.5andS4.1 (1) (1)
21 pages
Stats10_Chapter+4 2
No ratings yet
Stats10_Chapter+4 2
54 pages
Topic V
No ratings yet
Topic V
30 pages
Chapter 5 - Regression
No ratings yet
Chapter 5 - Regression
7 pages
Regression
No ratings yet
Regression
50 pages
Describing Bivariate Numerical Data - Honors 281
No ratings yet
Describing Bivariate Numerical Data - Honors 281
34 pages
6 Continuous Data Analysis
No ratings yet
6 Continuous Data Analysis
49 pages
Lecture 8 Correlation and Linear Regression
No ratings yet
Lecture 8 Correlation and Linear Regression
66 pages
Chap3-Bivariate Multivariate Data Distribution Upload
No ratings yet
Chap3-Bivariate Multivariate Data Distribution Upload
60 pages
M. Amir Hossain PHD: Course No: Emba 502: Business Mathematics and Statistics
No ratings yet
M. Amir Hossain PHD: Course No: Emba 502: Business Mathematics and Statistics
31 pages
Practical Biostatistics BMB-308: Torial Port and Presentation
No ratings yet
Practical Biostatistics BMB-308: Torial Port and Presentation
28 pages
ES12005 Lecture 2.5 2024-25 (1)
No ratings yet
ES12005 Lecture 2.5 2024-25 (1)
75 pages
Correlation
No ratings yet
Correlation
72 pages
Chapter 8
No ratings yet
Chapter 8
8 pages
05 Class RegressionCorrelation
No ratings yet
05 Class RegressionCorrelation
57 pages
Regression2024 MBA
No ratings yet
Regression2024 MBA
25 pages
Simple Linear Regression
100% (1)
Simple Linear Regression
50 pages
Econometrics For Finance
100% (1)
Econometrics For Finance
54 pages
Unit 4 Regression analysis
No ratings yet
Unit 4 Regression analysis
28 pages
Chapter-23 Bivariate Statistical Analysis: Measurement of Association
No ratings yet
Chapter-23 Bivariate Statistical Analysis: Measurement of Association
30 pages
Course: Statistiek Voor Premasters
No ratings yet
Course: Statistiek Voor Premasters
51 pages
Asynchronus Learning Module - Sesi 8
No ratings yet
Asynchronus Learning Module - Sesi 8
9 pages
chapter8
No ratings yet
chapter8
8 pages
Chapter Five Regression
No ratings yet
Chapter Five Regression
12 pages
Linear Regression and Correlation
No ratings yet
Linear Regression and Correlation
26 pages
Chapter 5 - Eng
No ratings yet
Chapter 5 - Eng
20 pages
13simple linear regression
No ratings yet
13simple linear regression
127 pages
LP-III Lab Manual
No ratings yet
LP-III Lab Manual
49 pages
Correlacion y Regresion Lineal
No ratings yet
Correlacion y Regresion Lineal
49 pages
Regression and correlation notes
No ratings yet
Regression and correlation notes
28 pages
Handout 5 Correlation and Regression (Recovered)
No ratings yet
Handout 5 Correlation and Regression (Recovered)
6 pages
4_5870483869949498067
No ratings yet
4_5870483869949498067
3 pages
Correlation and Regression
No ratings yet
Correlation and Regression
31 pages
Looking at Data: Relationships: Least-Squares Regression
No ratings yet
Looking at Data: Relationships: Least-Squares Regression
23 pages
ML Assignment No. 1: 1.1 Title
No ratings yet
ML Assignment No. 1: 1.1 Title
8 pages
Lectures 14 15
No ratings yet
Lectures 14 15
66 pages
Week 12+13
No ratings yet
Week 12+13
47 pages
Unit 2 - Scatterplots Correlation and Regression Summer 2021
No ratings yet
Unit 2 - Scatterplots Correlation and Regression Summer 2021
43 pages
regression.2
No ratings yet
regression.2
6 pages
Correlation and Linear Regression
No ratings yet
Correlation and Linear Regression
46 pages
Dupont - Simple Linear Regression (STATISTICAL MODELING FOR BIOMEDICAL RESEARCHERS)
No ratings yet
Dupont - Simple Linear Regression (STATISTICAL MODELING FOR BIOMEDICAL RESEARCHERS)
52 pages
Scatter Plot/Diagram Simple Linear Regression Model
No ratings yet
Scatter Plot/Diagram Simple Linear Regression Model
43 pages
Investigating Variables
No ratings yet
Investigating Variables
15 pages
Correlation and Linear Regression
No ratings yet
Correlation and Linear Regression
25 pages
MAP 716 Lecture 4 Simple Linear Regression
No ratings yet
MAP 716 Lecture 4 Simple Linear Regression
23 pages
Lecture 8 and 9 Regression Correlation and Index
No ratings yet
Lecture 8 and 9 Regression Correlation and Index
32 pages
Correlation and Regression
No ratings yet
Correlation and Regression
16 pages
Linear correlation and linear regression
No ratings yet
Linear correlation and linear regression
37 pages
Actividad - Evaluable2.1 - Chicaiza - Iza - 5582 (Ingles Version)
No ratings yet
Actividad - Evaluable2.1 - Chicaiza - Iza - 5582 (Ingles Version)
7 pages
Notes Scatter Plots
No ratings yet
Notes Scatter Plots
39 pages
Midterm Answer
No ratings yet
Midterm Answer
5 pages
5 Bivariate Data. Double The Data, Double The Fun: 5.1 Covariance and Correlation
No ratings yet
5 Bivariate Data. Double The Data, Double The Fun: 5.1 Covariance and Correlation
10 pages
5-Correlation, Regression and Rank Correlation-08-03-2024
No ratings yet
5-Correlation, Regression and Rank Correlation-08-03-2024
29 pages
Correlation and Regression: Six Sigma Thinking, #8
From Everand
Correlation and Regression: Six Sigma Thinking, #8
Sumeet Savant
5/5 (1)
Coordinate Systems and Transformations: Topics
No ratings yet
Coordinate Systems and Transformations: Topics
19 pages
Bfin 332 Topic 4
No ratings yet
Bfin 332 Topic 4
26 pages
Session 11 - Circular Failure
No ratings yet
Session 11 - Circular Failure
36 pages
15 Hardest SAT Math Questions to Improve Your Score
No ratings yet
15 Hardest SAT Math Questions to Improve Your Score
1 page
Outlines of JSCE Recommendations For Design and Co
No ratings yet
Outlines of JSCE Recommendations For Design and Co
9 pages
SWD-CSA-A23.3-04 Shear Wall Design Manual
No ratings yet
SWD-CSA-A23.3-04 Shear Wall Design Manual
82 pages
Seminar Topics 1
No ratings yet
Seminar Topics 1
7 pages
Finding CB-H Parameters in Terms of CE-h Parameters. (Assumptions and H)
No ratings yet
Finding CB-H Parameters in Terms of CE-h Parameters. (Assumptions and H)
5 pages
Mértan Elméleti Összefoglaló
No ratings yet
Mértan Elméleti Összefoglaló
127 pages
AAI 2 Part 12 PSA 530-Audit Sampling
No ratings yet
AAI 2 Part 12 PSA 530-Audit Sampling
171 pages
9th (ATSO) PDF
50% (4)
9th (ATSO) PDF
4 pages
Register Free: Syllabus Revision 20% Guaranteed Score Doubt Solving Nasa
No ratings yet
Register Free: Syllabus Revision 20% Guaranteed Score Doubt Solving Nasa
18 pages
IV Practicum P
No ratings yet
IV Practicum P
2 pages
PROTHERM 100 PROTHERM 200 Operating Manual PDF
No ratings yet
PROTHERM 100 PROTHERM 200 Operating Manual PDF
75 pages
Pet 6 - 16 4 24
No ratings yet
Pet 6 - 16 4 24
9 pages
Pythagorean Theorem Instagram
0% (2)
Pythagorean Theorem Instagram
2 pages
Corrigendum Result - 4th SEM
No ratings yet
Corrigendum Result - 4th SEM
60 pages
Complexity and Solution Architecture
100% (1)
Complexity and Solution Architecture
21 pages
Factorials A Module 1 2
100% (1)
Factorials A Module 1 2
2 pages
(1986) Dommel - Et - Al
No ratings yet
(1986) Dommel - Et - Al
7 pages
ANSWERS PSRM 2023 Semester Test 3 Information and Additional Exercises_024345
No ratings yet
ANSWERS PSRM 2023 Semester Test 3 Information and Additional Exercises_024345
7 pages
ETAG 001: Guideline For European Technical Approval OF
No ratings yet
ETAG 001: Guideline For European Technical Approval OF
19 pages
Geospatial Data: Instructors
No ratings yet
Geospatial Data: Instructors
33 pages
BS en 13586-2020
100% (1)
BS en 13586-2020
33 pages
Learning Goals:: X X X X X X
No ratings yet
Learning Goals:: X X X X X X
3 pages
Building Descrip/ve Models: Simula/on: Center For Transportation & Logistics
No ratings yet
Building Descrip/ve Models: Simula/on: Center For Transportation & Logistics
25 pages
1 s2.0 S1018363918306767 Main PDF
No ratings yet
1 s2.0 S1018363918306767 Main PDF
13 pages
Label Propagation On Graphs: Leonid E. Zhukov
No ratings yet
Label Propagation On Graphs: Leonid E. Zhukov
26 pages
Counting Permutations Combinations Edit1
No ratings yet
Counting Permutations Combinations Edit1
30 pages
Vertical Axis Tidal Current Genrator Paper Salter and Taylor
No ratings yet
Vertical Axis Tidal Current Genrator Paper Salter and Taylor
19 pages

STAT2507.Chapter3Part2.W22

Uploaded by

STAT2507.Chapter3Part2.W22

Uploaded by

Chapter 3 – Part 2

Numerical Measures for

© 2022 Wayne Horn (excluding images)

(x1, y1), (x2, y2), …, (xn, yn)

and compute the sample covariance sxy as follows:

perfect negative no linear perfect positive

You might also like