Topic 7.1_Correlation and Simple Linear Regression

The document discusses correlation analysis, simple linear regression, and model building in psychology, emphasizing the relationship between variables. It explains the coefficient of correlation (r), its interpretation, and how to use least squares to determine a regression equation. An example involving sales calls and copier sales illustrates the concepts and calculations involved in establishing a predictive model.

PSY2032 Statistical Methods in Psychology I

Topic 7

Correlation,
Simple Linear Regression, and
Model Building

[7.1] What is Correlation Analysis?

[7.2] The Coefficient of Correlation (r)

[7.3] Simple Linear Regression

7.3.1 Least Squares Principle
7.3.2 Drawing the Line of Regression
7.1 What is Correlation Analysis?
 Example: Is there a relationship between the number of hours that a student
studies for an exam and the score earned?

 Correlation Analysis is the study of the relationship between variables. It is a
group of techniques to measure the association between two variables.

 The basic idea of correlation analysis is to report the association between two
variables. The usual first step is to plot the data in a scatter diagram.

Example
Copier Sales of America sells copiers to businesses of all sizes throughout the United
States and Canada. Ms. Marcy Bancer was recently promoted to the position of
national sales manager. At the upcoming sales meeting, the sales representatives
from all over the country will be in attendance. She would like to impress upon
them the importance of making that extra sales call each day. She decides to gather
some information on the relationship between the number of sales calls and the
number of copiers sold.
She selected a random sample of 10 sales representatives and determined the
number of sales calls they made last month and the number of copiers they sold.
The sales information is reported in the following table.
What observations can you make about the relationship between the number of sales
calls and the number of copiers sold? Develop a scatter diagram to display the
information.
Sales Representative    Number of Sales Calls (X)    Number of Copiers Sold (Y)
Tom Keller                         20                           30
Jeff Hall                          40                           60
Brian Virost                       20                           40
Greg Fish                          30                           60
Susan Welch                        10                           30
Carlos Ramirez                     10                           40
Rich Niles                         20                           40
Mike Kiel                          20                           50
Mark Reynolds                      20                           30
Soni Jones                         30                           70
 Based on the information in the table, Ms. Bancer suspects there is a relationship
between the number of sales calls made in a month and the number of copiers
sold.
 Soni Jones sold the most copiers last month, and she was one of three
representatives making 30 or more sales calls.
 Susan Welch and Carlos Ramirez made only 10 calls last month. Ms. Welch
was tied for the lowest number of copiers sold among the sampled representatives.
 The implication is that the number of copiers sold is related to the number of sales
calls made. As the number of sales calls increases, it appears the number of
copiers sold also increases.
 Common Practice to draw the scatter diagram
 Independent variable (number of sales calls) – Horizontal or X-axis
 Dependent variable (copiers sold) - Vertical or Y-axis
Independent variable – the variable that provides the basis for estimation.
It is the predictor variable; here, the number of sales calls.
Dependent variable – the variable that is being predicted or estimated;
here, the number of copiers sold.
 The scatter diagram shows graphically that the sales representatives who make
more calls tend to sell more copiers.
 Note that while there appears to be a positive relationship between the two
variables, not all the points fall on a line.
 In the following section you will measure the strength and direction of this
relationship between two variables by determining the coefficient of correlation.
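As a rough numerical companion to the scatter diagram, the sketch below groups the ten representatives by number of sales calls and averages the copiers sold in each group. The grouping is only an illustration added here, not part of the original example, and the variable names are hypothetical.

```python
from collections import defaultdict
from statistics import mean

# (sales calls X, copiers sold Y) for the 10 sampled representatives
data = [(20, 30), (40, 60), (20, 40), (30, 60), (10, 30),
        (10, 40), (20, 40), (20, 50), (20, 30), (30, 70)]

# Collect the Y values observed at each distinct X value
by_calls = defaultdict(list)
for calls, copiers in data:
    by_calls[calls].append(copiers)

# Average copiers sold at each level of sales calls, in increasing order of calls
mean_by_calls = {calls: mean(sold) for calls, sold in sorted(by_calls.items())}

for calls, avg in mean_by_calls.items():
    print(f"{calls:>2} calls: mean copiers sold = {avg}")
```

The group means rise from 35 copiers at 10 calls to 65 at 30 calls, while the single representative with 40 calls sold 60. This matches the picture of a positive relationship in which the points do not all fall on one line.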
7.2 The Coefficient of Correlation (r)
 Coefficient of Correlation – describes the strength of the relationship between
two sets of interval-scaled or ratio-scaled variables.
ρ  Population coefficient of correlation
r  Sample coefficient of correlation
 −1.00 ≤ r ≤ +1.00
 A correlation coefficient of -1 or +1 indicates perfect correlation.
 If there is absolutely no linear relationship between the two sets of variables,
Pearson's r is zero.
 A coefficient of correlation r close to 0 (say 0.08) shows that the linear
relationship is quite weak. The same conclusion is drawn if r = -0.08.
 Coefficients of -0.91 and +0.91 have equal strength; both indicate very strong
correlation between the two variables. Thus the strength of the correlation does
not depend on the direction (either + or -).

In Excel, r can be computed with = CORREL (x, y), where x and y are the two data ranges.
 Coefficient of Correlation (r) – A measure of the strength of the linear
relationship between two variables.
 The sample coefficient of correlation is identified by the lower-case letter r.
 It shows the direction and strength of the linear (straight line) relationship
between two variables.
Sales Representative    Calls (X)    Sales (Y)    X − X̄    Y − Ȳ    (X − X̄)(Y − Ȳ)
Tom Keller                 20           30          -2      -15           30
Jeff Hall                  40           60          18       15          270
Brian Virost               20           40          -2       -5           10
Greg Fish                  30           60           8       15          120
Susan Welch                10           30         -12      -15          180
Carlos Ramirez             10           40         -12       -5           60
Rich Niles                 20           40          -2       -5           10
Mike Kiel                  20           50          -2        5          -10
Mark Reynolds              20           30          -2      -15           30
Soni Jones                 30           70           8       25          200
X̄ = 22    Ȳ = 45    Σ(X − X̄)(Y − Ȳ) = 900

$$r = \frac{\sum (X - \bar{X})(Y - \bar{Y})}{(n - 1)\, S_X S_Y} = \frac{900}{(10 - 1)(9.189)(14.337)} = 0.759$$
 The coefficient is positive, which confirms our reasoning based on the scatter diagram.
It is fairly close to 1, so the association is strong.
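The calculation of r above can be checked with a short sketch. It uses only Python's standard library; the helper name `pearson_r` is hypothetical, and `statistics.stdev` is the sample standard deviation (divisor n − 1), which is what the formula assumes.

```python
from statistics import mean, stdev

calls = [20, 40, 20, 30, 10, 10, 20, 20, 20, 30]    # X
copiers = [30, 60, 40, 60, 30, 40, 40, 50, 30, 70]  # Y

def pearson_r(x, y):
    """Sample correlation: sum of cross-products over (n - 1) * S_X * S_Y."""
    n = len(x)
    x_bar, y_bar = mean(x), mean(y)
    cross = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    return cross / ((n - 1) * stdev(x) * stdev(y))

s_x, s_y = stdev(calls), stdev(copiers)
r = pearson_r(calls, copiers)
print(f"S_X = {s_x:.3f}, S_Y = {s_y:.3f}, r = {r:.3f}")
```

Rounded to three decimals, the sketch gives S_X = 9.189, S_Y = 14.337 and r = 0.759, in agreement with the hand calculation.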
Positive Correlation

[Figure: scatter diagram divided into four quadrants by lines through the point of means (X̄, Ȳ): [1] upper left, [2] upper right, [3] lower left, [4] lower right. Source: Clare Morris, Quantitative Approaches in Business Studies, 6/e © Clare Morris 2003]

In quadrant [2], both (X − X̄) and (Y − Ȳ) are positive, so their product is positive (+ × + = +),
while in quadrant [3], both (X − X̄) and (Y − Ȳ) are negative, so their product is again
positive (− × − = +).
The products (X − X̄)(Y − Ȳ) will therefore nearly all be positive, as will the sum
Σ(X − X̄)(Y − Ȳ) over all the points.
Negative Correlation

[Figure: scatter diagram with quadrants [1] to [4], points concentrated in quadrants [1] and [4]. Source: Clare Morris, Quantitative Approaches in Business Studies, 6/e © Clare Morris 2003]

In quadrant [1], (X − X̄) is negative and (Y − Ȳ) is positive (− × + = −), while in
quadrant [4], (X − X̄) is positive and (Y − Ȳ) is negative (+ × − = −), so the products
(X − X̄)(Y − Ȳ) and the sum Σ(X − X̄)(Y − Ȳ) over all the points will be negative.

No Linear Relationship – Zero Correlation

[Figure: scatter diagram with quadrants [1] to [4], points spread across all four quadrants. Source: Clare Morris, Quantitative Approaches in Business Studies, 6/e © Clare Morris 2003]

For no correlation, the points are pretty uniformly scattered throughout all four
quadrants, so the products (X − X̄)(Y − Ȳ) will be fairly evenly balanced between
positive and negative.
Thus, when we sum them, the positives and negatives will tend to balance out, so that the
total will be close to zero.
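The quadrant argument can be illustrated with the copier data. The sketch below is only an illustration under the assumption that the ten sampled representatives are used as the points; it computes each product (X − X̄)(Y − Ȳ) and separates the representatives by the sign of that product.

```python
reps = ["Tom Keller", "Jeff Hall", "Brian Virost", "Greg Fish", "Susan Welch",
        "Carlos Ramirez", "Rich Niles", "Mike Kiel", "Mark Reynolds", "Soni Jones"]
calls = [20, 40, 20, 30, 10, 10, 20, 20, 20, 30]    # X
copiers = [30, 60, 40, 60, 30, 40, 40, 50, 30, 70]  # Y

x_bar = sum(calls) / len(calls)      # mean of X
y_bar = sum(copiers) / len(copiers)  # mean of Y

# Product of deviations for each representative
products = {name: (x - x_bar) * (y - y_bar)
            for name, x, y in zip(reps, calls, copiers)}

positive = [name for name, p in products.items() if p > 0]  # quadrants [2] and [3]
negative = [name for name, p in products.items() if p < 0]  # quadrants [1] and [4]
total = sum(products.values())

print(len(positive), "positive products;", "negative:", negative, "; sum =", total)
```

Nine of the ten products are positive. The only negative one belongs to Mike Kiel, who made fewer calls than average but sold more copiers than average (quadrant [1]), so the sum is strongly positive at 900.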
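Sample Correlation Coefficient (r)

$$r = \frac{\sum (X - \bar{X})(Y - \bar{Y})}{(n - 1)\, S_X S_Y} = \frac{\sum XY - n\bar{X}\bar{Y}}{(n - 1)\, S_X S_Y}$$

Covariance: the numerator divided by (n − 1), that is, Σ(X − X̄)(Y − Ȳ) / (n − 1), is the sample covariance of X and Y.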
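The two forms of the numerator can be compared on the copier data. This is a minimal sketch with hypothetical variable names; it checks that Σ(X − X̄)(Y − Ȳ) and ΣXY − nX̄Ȳ agree, and divides by n − 1 to obtain the sample covariance.

```python
calls = [20, 40, 20, 30, 10, 10, 20, 20, 20, 30]    # X
copiers = [30, 60, 40, 60, 30, 40, 40, 50, 30, 70]  # Y

n = len(calls)
x_bar = sum(calls) / n
y_bar = sum(copiers) / n

# Deviation form: sum of (X - X_bar)(Y - Y_bar)
deviation_form = sum((x - x_bar) * (y - y_bar) for x, y in zip(calls, copiers))

# Computational form: sum of XY minus n * X_bar * Y_bar
sum_xy = sum(x * y for x, y in zip(calls, copiers))
shortcut_form = sum_xy - n * x_bar * y_bar

# Sample covariance: the shared numerator divided by n - 1
covariance = deviation_form / (n - 1)

print(sum_xy, deviation_form, shortcut_form, covariance)
```

Here ΣXY = 10800 and nX̄Ȳ = 10 × 22 × 45 = 9900, so both forms give 900 and the sample covariance is 100.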

Correlation and Causation

 If there is a strong relationship between two variables, we are tempted to assume
that an increase or decrease in one variable causes a change in the other variable.
 However, a strong correlation does not by itself establish causality. A strong
correlation between variables that have no causal link is called a spurious correlation.
 What we can conclude when we find two variables with a strong correlation is
that there is a relationship or association between the two variables, not that a
change in one causes a change in the other.

Pearson Correlation – SPSS

https://www.youtube.com/watch?v=VOI5IlHfZVE
Exercise
Employee    Y: Annual Salary ($000)    X1: Years of Experience    X2: Years of Postsecondary Education
1                    54.9                       5.5                            4.0
2                    60.5                       9.0                            6.0
3                    58.9                       6.0                            5.0
4                    59.0                       8.0                            5.5
5                    57.5                       6.5                            5.0

Which factor (X1 or X2) has a higher correlation with Annual Salary (Y)?

Correlation matrix:

        Y           X1          X2
Y       1
X1      0.813164    1
X2      0.962216    0.924995    1
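The correlation matrix for the exercise can be reproduced with a few lines of code. The sketch below is one possible way to do it with the standard library only; the function and variable names are hypothetical.

```python
from statistics import mean, stdev

salary = [54.9, 60.5, 58.9, 59.0, 57.5]   # Y, annual salary ($000)
experience = [5.5, 9.0, 6.0, 8.0, 6.5]    # X1, years of experience
education = [4.0, 6.0, 5.0, 5.5, 5.0]     # X2, years of postsecondary education

def pearson_r(x, y):
    """Sample correlation: sum of cross-products over (n - 1) * S_X * S_Y."""
    n = len(x)
    x_bar, y_bar = mean(x), mean(y)
    cross = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    return cross / ((n - 1) * stdev(x) * stdev(y))

r_y_x1 = pearson_r(salary, experience)
r_y_x2 = pearson_r(salary, education)
r_x1_x2 = pearson_r(experience, education)

print(f"r(Y, X1)  = {r_y_x1:.6f}")
print(f"r(Y, X2)  = {r_y_x2:.6f}")
print(f"r(X1, X2) = {r_x1_x2:.6f}")
```

The larger of r(Y, X1) and r(Y, X2) identifies the factor more strongly correlated with salary in this sample of five employees.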
7.3 Simple Linear Regression
 In this section we wish to develop an equation to express the linear relationship
between two variables.
 The technique used to develop the equation and provide the estimates is called
regression analysis.
 Regression Equation – An equation that expresses the linear relationship between
two variables.

7.3.1 Least Squares Principle

 The scatter diagram is reproduced with a line drawn with a ruler through the dots
to illustrate that a straight line would probably fit the data.
 Disadvantage: the position of a hand-drawn line is based in part on the judgment
of the person drawing the line. In the accompanying figure, all the lines except
line A seem to be reasonable.
 Judgment is eliminated by determining the regression line using a mathematical
method called the least squares principle. This method gives us the “best-fitting”
line.
 Least Squares Principle – Determining a regression equation by minimizing
the sum of the squares of the vertical distances between the actual Y values and
the predicted values ŷ.

The Least Squares Line

 The point (X = 3, Y = 8) deviates by 2 from the line, found by 10 − 8, where 10 is
the value on the line at X = 3.
 The deviation squared is 4.
 The squared deviation for the point (X = 4, Y = 18) is 16.
 The squared deviation for the point (X = 5, Y = 16) is 4.
 The sum of the squared deviations is 24, found by 4 + 16 + 4.
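The three-point illustration can be checked numerically. The equation of the plotted line is not given in the text. The sketch below assumes it is the least squares line for the three points, which works out to ŷ = −2 + 4x and reproduces the stated deviations. The slope is computed as Σ(X − X̄)(Y − Ȳ) / Σ(X − X̄)², an algebraically equivalent form of b = r(S_Y / S_X). The comparison line ŷ = 2 + 3x is a hypothetical hand-drawn alternative.

```python
points = [(3, 8), (4, 18), (5, 16)]

def sum_squared_deviations(a, b, pts):
    """Sum of squared vertical distances between each Y and the line a + b*x."""
    return sum((y - (a + b * x)) ** 2 for x, y in pts)

def least_squares(pts):
    """Intercept a and slope b that minimize the sum of squared deviations."""
    n = len(pts)
    x_bar = sum(x for x, _ in pts) / n
    y_bar = sum(y for _, y in pts) / n
    b = (sum((x - x_bar) * (y - y_bar) for x, y in pts)
         / sum((x - x_bar) ** 2 for x, _ in pts))
    a = y_bar - b * x_bar
    return a, b

a, b = least_squares(points)
best = sum_squared_deviations(a, b, points)       # least squares line
other = sum_squared_deviations(2.0, 3.0, points)  # hypothetical alternative line

print(f"least squares line: y_hat = {a} + {b}x, sum of squares = {best}")
print(f"alternative line:   y_hat = 2.0 + 3.0x, sum of squares = {other}")
```

The least squares line gives a sum of squared deviations of 24, matching 4 + 16 + 4, while the alternative line gives 26. Any other straight line through these points would also give a sum larger than 24.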
 General form of linear regression equation

$$\hat{y} = a + bx$$
Where
 ŷ, read "Y hat," is the predicted value of the Y variable for a selected X value.

 a is the Y-intercept. It is the estimated value of Y when X = 0.

Another way to put it is: “a” is the estimated value of Y where the regression
line crosses the Y-axis when X is zero.

 b is the slope of the line, or the average change in ŷ for each change of one
unit (either increase or decrease) in the independent variable x.

 x is any value of the independent variable that is selected.

The formulas for a and b are:

 Slope of the regression line:

$$b = r\,\frac{S_Y}{S_X}$$

 r is the correlation coefficient
 S_Y is the standard deviation of Y (the dependent variable)
 S_X is the standard deviation of X (the independent variable)

 Y-intercept:

$$a = \bar{Y} - b\bar{X}$$

 Ȳ is the mean of Y (the dependent variable)

 X̄ is the mean of X (the independent variable)
Recall the example involving Copier Sales of America. The sales manager gathered
information on the number of sales calls made and the number of copiers sold for a
random sample of 10 sales representatives. As a part of her presentation at the upcoming
sales meeting, Ms. Bancer, the sales manager, would like to offer specific information
about the relationship between the number of sales calls and the number of copiers sold.

Use the least squares method to determine a linear equation to express the relationship
between the two variables. What is the expected number of copiers sold by a
representative who made 20 calls?

Sales Representative    Calls (X)    Sales (Y)    X − X̄    Y − Ȳ    (X − X̄)(Y − Ȳ)
Tom Keller                 20           30          -2      -15           30
Jeff Hall                  40           60          18       15          270
Brian Virost               20           40          -2       -5           10
Greg Fish                  30           60           8       15          120
Susan Welch                10           30         -12      -15          180
Carlos Ramirez             10           40         -12       -5           60
Rich Niles                 20           40          -2       -5           10
Mike Kiel                  20           50          -2        5          -10
Mark Reynolds              20           30          -2      -15           30
Soni Jones                 30           70           8       25          200
X̄ = 22    Ȳ = 45    Σ(X − X̄)(Y − Ȳ) = 900
Solution

The calculations necessary to determine the regression equation are:

$$r = \frac{\sum (X - \bar{X})(Y - \bar{Y})}{(n - 1)\, S_X S_Y} = \frac{900}{(10 - 1)(9.189)(14.337)} = 0.759$$

$$b = r\,\frac{S_Y}{S_X} = 0.759\left(\frac{14.337}{9.189}\right) = 1.1842$$

$$a = \bar{Y} - b\bar{X} = 45 - (1.1842)(22) = 18.9476$$

 Thus, the regression equation is ŷ = 18.9476 + 1.1842x, and it can be shown on the
scatter diagram.
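The solution can be reproduced in code as a check on the arithmetic. This is a minimal sketch using the standard library and the formulas b = r(S_Y / S_X) and a = Ȳ − bX̄; the variable names are hypothetical.

```python
from statistics import mean, stdev

calls = [20, 40, 20, 30, 10, 10, 20, 20, 20, 30]    # X
copiers = [30, 60, 40, 60, 30, 40, 40, 50, 30, 70]  # Y

n = len(calls)
x_bar, y_bar = mean(calls), mean(copiers)
s_x, s_y = stdev(calls), stdev(copiers)  # sample standard deviations (n - 1)

# Correlation coefficient from the sum of cross-products
r = sum((x - x_bar) * (y - y_bar)
        for x, y in zip(calls, copiers)) / ((n - 1) * s_x * s_y)

b = r * s_y / s_x      # slope
a = y_bar - b * x_bar  # Y-intercept

print(f"y_hat = {a:.4f} + {b:.4f} x")
```

At full precision the intercept is about 18.9474 rather than 18.9476. The small difference arises because the hand calculation rounds b to 1.1842 before multiplying by 22.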
7.3.2 Drawing the Line of Regression

[Figure: scatter diagram with the regression line, labeled with the equation ŷ = 18.9476 + 1.1842x and the predicted value ŷ = 42.6316.]

 The a value of 18.9476 is the point where the regression line crosses the Y-axis. A
literal translation is that if no sales calls are made, that is, X = 0, 18.9476 copiers
will be sold.

 The b value of 1.1842 means that for each additional sales call made the sales
representative can expect to increase the number of copiers sold by about 1.2.

 The regression equation is ŷ = 18.9476 + 1.1842x. If a salesperson makes 20 calls,
he or she can expect to sell ŷ = 18.9476 + 1.1842(20) = 42.6316 copiers.
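The interpretation of a and b can be sketched as a small prediction function. The coefficients are the rounded values from the worked solution; the function name `predict_copiers` is hypothetical.

```python
A = 18.9476  # Y-intercept from the worked solution
B = 1.1842   # slope from the worked solution

def predict_copiers(calls):
    """Predicted number of copiers sold for a given number of sales calls."""
    return A + B * calls

at_zero = predict_copiers(0)   # value where the line crosses the Y-axis
at_20 = predict_copiers(20)    # the case asked about in the example
at_21 = predict_copiers(21)    # one additional call

print(f"0 calls:  {at_zero:.4f}")
print(f"20 calls: {at_20:.4f}")
print(f"gain from one extra call: {at_21 - at_20:.4f}")
```

The prediction at 20 calls is 42.6316 copiers, and each extra call adds 1.1842 copiers to the prediction, which is the "about 1.2" quoted in the text.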

How to Use SPSS: Simple Linear Regression

https://www.youtube.com/watch?v=xp4Sffz5bbA
