Core_11_Statistics-and-Probability_q4_CLAS8-Correlation-JOSEPH-AURELLO
Core_11_Statistics-and-Probability_q4_CLAS8-Correlation-JOSEPH-AURELLO
NAME:_____________________________________
STRAND/BLOCK:__________________________
STATISTICS
AND PROBABILITY
Quarter IV – Week 8
Pearson’s Sample Correlation Coefficient
and Correlation Analysis
Republic Act 8293, Section 176 states that: No copyright shall subsist in any work
of the Government of the Philippines. However, prior approval of the government agency or
office wherein the work is created shall be necessary for the exploitation of such work for a
profit. Such agency or office may, among other things, impose as a condition the payment
of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this CLAS are owned by their respective copyright holders.
Every effort has been exerted to locate and seek permission to use these materials from
their respective copyright owners. The publisher and authors do not represent nor claim
ownership over them.
Management Team:
Servillano A. Arzaga, CESO V, SDS
Loida P. Adornado, PhD, ASDS
Cyril C. Serador, PhD, CID Chief
Ronald S. Brillantes, EPS-LRMS Manager
Marie Vic C. Velasco, PhD, EPS-Mathematics
Eva Joyce C. Presto, PDO II
Rhea Ann A. Navilla, Librarian II
______1. Which of the following measures the strength and direction of a linear relationship
between two quantitative variables?
A. hypothesis
B. Pearson’s sample correlation coefficient
C. regression
D. scatter Plot
𝑛(Σxy)−(Σ𝑥)(Σy)
B. 𝑟 =
√[𝑛(Σ𝑥)2 −(Σ𝑥)2 ][𝑛 (Σ𝑦2 )−(Σy)2 ]
𝑛(Σxy)−(Σ𝑥)(Σy)²
C. 𝑟 =
√[𝑛(Σ𝑥)²−(Σ𝑥)2 ][𝑛 (Σ𝑦2 )]
(Σxy)−(Σ𝑥)(Σy)²
D. 𝑟 =
√[𝑛(Σ𝑥)²−(Σ𝑥)2 ][𝑛 (Σ𝑦2 )]
_______4. A survey shows that as the weather gets hotter, air conditioning consumption
increases. Which of the following values of r represents the result?
A. 0.31
B. 0.95
C. 0
D. -0.88
x y
6 28
2 38
0 41
6 34
3 31
5 22
______5. In the given bivariate data, n or the total numbers of ordered pairs is:
A. 5 B. 6
C. 12 D. 22
1
______6. In solving the Pearson correlation coefficient, what is the next step after
constructing a table, computing for the product paired scores xy, getting the
square of x scores?
A. Construct a table
B. Get the sum of all entries in all columns in the table
C. Simplify and compute for the value of r
D. Substitute all the sum and n in the formula
_______9. “As the weather becomes hotter, the number of milk teas sold increases.” Which
of the following is the appropriate value of r for the statement?
A. -0.89
B. 0
C. 0.78
D. 0.45
2
Lesson 1
Pearson’s Sample
Correlation Coefficient
MELC: The learner calculates the Pearson’s sample correlation coefficient.
(M11/12SP-IVh-2)
𝑛(Σxy) − (Σ𝑥)(Σy)
𝑟=
√[𝑛(Σ𝑥2 ) − (Σ𝑥)2 ][𝑛 (Σ𝑦2 ) − (Σy)2 ]
Where:
n = number of data pairs Σxy = the sum of products of paired scores
Σx = sum of x scores Σx² = sum of squared x scores
Σy = sum of y scores Σy² = sum of squared y scores
3
Example 1. A researcher wishes to see whether there is a relationship between number of
absences and the final grade of the seven students in the statistics class. Compute the
value of the correlation coefficient for the data obtained in the study.
Student Number of absences Final grade (%)
A 6 82
B 2 86
C 15 88
D 9 75
E 12 80
F 5 90
G 8 78
Solution
Step 1. Construct a table.
Student Number Final grade xy x² y²
of absences x y (%)
A 6 82
B 2 86
C 15 73
D 9 75
E 12 80
F 5 90
G 8 78
Step 2. Find the values of xy (multiply the entries in x column with the entries in y column),
x² (square the values of x), and y² (square the values of y); place these values in the
corresponding columns of the table.
A 6 82 492 36 6,724
B 2 86 172 4 7,396
C 15 73 1,095 225 5,329
D 9 75 675 81 5,625
E 12 80 960 144 6,400
F 5 90 450 25 8,100
G 8 78 624 64 6,084
Step 3. Get the sum of all the entries in x, y, xy, x² and y² column.
A 6 82 492 36 6,724
B 2 86 172 4 7,396
C 15 73 1,095 225 5,329
D 9 75 675 81 5,625
E 12 80 960 144 6,400
F 5 90 450 25 8,100
G 8 78 624 64 6,084
4
Step 4. Substitute the values in the formula and solve for r. Here, n=7 because there are
seven (7) pairs of values.
Given:
n=7
Σx = 57
Σy = 564
Σxy = 4,468
Σx² = 579
Σy² = 45,658
𝑛(Σxy)−(Σ𝑥)(Σy)
𝑟=
√[𝑛(Σ𝑥 2 )−(Σ𝑥)2 ][𝑛 (Σ𝑦 2 )−(Σy)2 ]
7(4,468)−(57)(564)
𝑟=
√[7(579)−(57)2 ][7 (45,658)−(564)2 ]
31,276−32,148
𝑟=
√[4053−3249][319,606−318096]
31,276−32,148
𝑟=
√[804][1510]
31,276−32,148
𝑟=
√1,214,040
−872
𝑟=
1101.8348
𝒓 = - 0.79
Pearson r Assumptions:
• Correlation requires that both variables be quantitative.
• Correlation describes linear relationships. Correlation does not describe curve
relationships between variables, no matter how strong the relationship is.
5
Let’s Practice
Directions: Fill the blank parts of the table with the correct values. After completing
the table, get the sum of each column. Then, substitute the values obtained in the
given formula. Lastly, perform the indicated operations to calculate the value of r.
A researcher wants to find out if a relationship exists between the height and
weight of Senior High School Students of Marcelino A. Javarez National High School.
Data were as follows:
Student 1 2 3 4 5 6
Height in cm (x) 160 162 148 155 170 165
Weight (y) 62 60 50 63 75 65
x y Xy x² y²
160 62 9,920 25,600 3,844
162 60 26,244
148 50 7,400 2,500
155 63 24,025
170 75 12,750 5,625
165 65 27,225
𝛴𝑥 =960 Σy = Σxy = 60,280 Σ𝑥² = Σ𝑦² =23,763
𝑛(Σxy)−(Σ𝑥)(Σy)
𝑟=
√[𝑛(Σ𝑥 2 )−(Σ𝑥)2 ][𝑛 (Σ𝑦 2 )−(Σy)2 ]
6(60,280)−(960)( )
𝑟=
√[6( )−(960)2 ] [6 (23,763)−(375)2 ]
361,680−360,000
𝑟=
√[923,388−921,600][319,606− ]
361,680−360,000
𝑟=
√[1,788][1,953]
361,680−360,000
𝑟=
√3,491,964
1,680
𝑟=
𝒓 = 0.90
6
Directions: Using the completed table for bivariate data below, substitute each to the formula
of Pearson’s sample correlation coefficient. Then, solve for the value of r. Choose your answer
from the box. Write your answer on the space provided for the value of r.
n 𝛴x 𝛴y 𝛴xy 𝛴x 𝛴y r
1. 5 58 149 1,753 702 660,565
2. 8 47 151 911 305 125, 605
3. 7 108 77 1,126 1,794 197,418
4. 9 240 55 1,469 6,520 281, 813
Let’s Do More
Directions: Arrange from 1-4 the following steps in calculating the Pearson’s sample
correlation coefficient, r. Write your answers on the space provided.
______1. Get the sum of all the entries in x, y, xy, x² and y² column.
______2. Find the values of xy (multiply the entries in x column with the entries in y
column), x² (square the values of x), and y² (square the values of y); place
these values in the corresponding columns of the table.
7
Directions: Read and understand the statements below. Write T if the statement is
correct and F if it is not. Write your answer before the number.
_________5. Karl Pearson is the mathematician who developed the correlation coefficient
that bears his name.
Let’s Sum It Up
Activity 1
Directions: Complete the statement by filling in the blanks with the words in the boxes
below.
correlation
0 coefficient -1 to +1 negative +1
The 1.)____________________ computed from the sample data measures the strength and
direction of the relationship between two quantitative variables. The symbol for the
correlation coefficient is r. The range of the correlation coefficient is from
2.) ____________________. If there is a strong positive linear relationship between the
variables, the value of r will be close to 3.) ____________________. If there is a strong
4.) ____________________linear relationship between the variables, the value of r will be close
to -1. When there is no linear relationship between the variables or only a weak
relationship, the value of r will be close to 5.) ____________________.
8
Lesson 2
Solving Problems Involving
Correlation Analysis
no
perfect perfect
correlation
negative positive
correlation correlation
9
The sign of the coefficient of correlation indicates the direction of the relationship
between the two variables. A positive correlation coefficient means that variables obtaining
high scores in the first variable tend to obtain high scores in the second variable. A negative
correlation coefficient means that for every increase in one variable, there is a
corresponding decrease on the second variable or for every decrease in one variable, there
is a corresponding increase on the second variable.
10
The images are
data graphed in a catter
plot. This illustrates
what the relationships
might look like at
different degrees of
strength or different
values of r.
For a correlation
coefficient of zero, the
points have no
direction, the shape is
almost round, and a line
does not fit to the points
on the graph. As you
can observe, as the
correlation coefficient increases, the observations group closer together in a linear shape.
The line is difficult to detect when the relationship is weak (e.g., r = -0.3), but becomes
clearer as relationships become stronger (e.g., r = -0.99).
Let’s Practice
Directions: Given the following details, make an interpretation for each data. The first
one is done for you.
1. In a survey, the correlation coefficient r between engine size and fuel consumption
was found to be -0.9.
Interpretation: There is a strong negative correlation between the engine size and
fuel consumption.
2. In a survey, the correlation between the number of hours per week studetns spent
studying and their performance in an exam was found to be 0.72.
Interpretation: ________________________________________________________________
______________________________________________________________________________.
3. In a survey, the correlation between the height of Senior High School students
and their shoe size was found to be 0.48.
Interpretation: ________________________________________________________________
______________________________________________________________________________.
11
Directions: Using the correlation scale, identify the strength of each r value.
Let’s Do More
a. Perfect positive
1. Correlation
b. Perfect Negative
Correlation
c. No Correlation
a. Strong positive
Correlation
2.
b. Moderate
Negative Correlation
c. No Correlation
a. Weak positive
Correlation
b. Weak Negative
3.
Correlation
c. Perfect negative
Correlation
12
a. Weak positive
Correlation
4.
b. Weak Negative
Correlation
c. Perfect negative
Correlation
a. Strong positive
Correlation
5.
b. Moderate
Negative Correlation
c. No Correlation
0.9 -1 -0.5 1 0
Directions: With the given data, calculate the Pearson’s sample correlation
coefficient and interpret the result using the correlation scale. Show your solution.
The table shows the average number of hours spent weekly by students watching
korean drama and the average mark they achieved in their summative exams.
Hours spent
watching Korean 6 2 0 6 3 5
Drama weekly
Average scores in
28 38 41 34 31 22
their exam
Solution:
13
Let’s Sum It Up
Activity 1
5. 6.
4.
3. 7.
8. The correlation coefficient between the exam score in Probability and Statistics and exam
score in Earth and Life Science was found to be 0.76. How will you interpret the result?
____________________________________________________________________________________
_____________________________________________________________________________________
14
Let’s Assess
Directions: Read each question carefully. Write the letter of the correct answer on the
space provided before the number.
______3. Which of the following measures the strength and direction of a linear relationship
between two quantitative variables?
A. hypothesis
B. Pearson’s sample correlation coefficient
C. regression
D. scatter Plot
𝑛(Σxy)−(Σ𝑥)(Σy)
B. 𝑟 =
√[(Σ𝑥)2 −(Σ𝑥)2 ][𝑛 (Σ𝑦2 )−(Σy)2 ]
𝑛(Σxy)−(Σ𝑥)(Σy)²
C. 𝑟 =
√[𝑛(Σ𝑥)²−(Σ𝑥)2 ][ (Σ𝑦2 )]
(Σxy)−(Σ𝑥)(Σy)²
D. 𝑟 =
√[𝑛(Σ𝑥)²−(Σ𝑥)2 ][𝑛 (Σ𝑦2 )]
_______5. A survey shows that as the weather gets hotter, air conditioning consumption
increases. Which of the following values of r represents the result?
A. 0.31
B. 0.95
C. 0
D. -0.88
x y
6 28
2 38
0 41
6 34
3 31
5 22
15
______6. In the given bivariate data, n or the total numbers of ordered pairs is:
A. 5 B. 6 C. 12 D. 22
______7. In solving the Pearson correlation coefficient, what is the next step after
constructing a table, computing for the product paired scores xy, getting the
square of x scores?
A. Construct a table
B. Get the sum of all entries in all columns in the table
C. Simplify and compute for the value of r
D. Substitute all the sum and n in the formula
_______10. “As the weather becomes colder, the number of coffees sold increases.” Which
of the following is the appropriate value of r for the statement?
A.-0.89 B. 0 C. 0.78 D. 0.25
16
Answer Key
LESSON 1
Let’s Try
1. B 1. 0.95
2. A 2. 0.44
3. A 3. 0.56
4. B Let’s Practice 4. 0.27
5. B
Let’s Do More
6. B
7. A
8. C
9. C 1. 3
10. D 2. 2
3. 4
4. 1
1. T
2. F
3. T
4. F
5. T
Let’s Sum It Up
1. Correlation Coefficient
2. -1 to +1
3. +1
4. Negative
5. 0
LESSON 2
Let’s Practice Let’s Do More
1. A,1
2. There is a strong positive correlation
2. C, 0
between the number of hours per week
3. B, 0.5
students spent studying and their
4. C , -1
performance in an exam.
5. A, 0.9
3. There is a weak positive correlation
between the height of Senior High School There is a strong
students and their shoe size negative
correlation
average number
Let’s Sum It Up on the hours
spent weekly by
1.pearson correlation coefficient students
2. High watching korean
3. perfect negative correlation drama and the
4. moderate negative correlation average mark
5. negligible/weak correlation they achieved in
6. strong positive correlation their summative
7. perfect positive correlation exams.
8. strong positive correlation
Let’s Assess
1.A 2.D 3. B 4.A 5.B 6.B 7. B 8. A 9. A 10. A
17
References
1. PDF
Allan G. Bluman. Elementary Statistics: A Step by Step Approach, Eight Edition, Connect Statistics
Hosted by ALEKS Corporation)
2. Websites
Pierce, Rod. "Scatter (XY) Plots". Math Is Fun. Retrieved 24 May 2021,
http://www.mathsisfun.com/data/scatter-xy-plots.html
Diana Mindrila, Ph.D. and Phoebe Balentyne, M.Ed. “Based on Chapter 4 of The Basic Practice
Statistics (6th ed)”. Accessed May 24, 2021 from
www.westga.edu/academics/research/vrc/assets/docs/scatterplots_and_correlation_notes.pdf
Deborah J. Rumsey, PhD. How to Interpret a Correlation Coefficient r. Accessed May 24, 2021,
www.dummies.com/education/math/statistics/how-to-interpret-a-correlation-coefficient-r/
18
FEEDBACK SLIP
4. Was there any part of this CLAS that you found difficult?
If yes, please specify what it was and why.
None
NAME OF SCHOOL:
Teacher’s Name and Signature:
Parent’s / Guardian’s Name and
Signature:
Date Received:
Date Returned:
19