Pearson's Sample Coefficient Correlation: Lesson 1
Pearson's Sample Coefficient Correlation: Lesson 1
I. PRELIMINARIES
Competenc Calculates the Pearson’s sample coefficient correlation.
y Solves problems involving correlation analysis.
Objective After going through this module, you are expected to:
calculates the Pearson’s sample coefficient correlation , and
solves problems involving correlation analysis.
Topic/ Pearson’s sample coefficient correlation (r)
Subject Strength of the correlation
Matter Problems involving correlation analysis
Textbook
Materials
Copyrights
Total Points
Date
Pearson’s r
strength of
proportional
correlation
relation
Correlational
Analysis
n ( ∑ xy )−(∑ x)(∑ y )
Formula: r= where x & y are the variables and n is the number
2 2
√ [ n ∑ ( x )−( ∑ x ) ][ n ∑ ( y )− (∑ y ) ]
2 2
of data pairs
Values of Pearson’s r
– possible values of r ranges: -1 ≤ r ≤ 1
Strength of Correlation
There are no hard rules in determining the strength of the linear relationship based on the
correlation coefficient, but you may want to use the following guide in order to interpret the
correlation:
Example: Determining the Pearson’s r between the given two sets of data below.
x 0 3 5 1 4 6
y 15 20 30 15 25 35
Solution:
We begin by completing the table below so it would be easier to solve for Pearson’s r and with n = 5.
The first few rows of columns 3 to 5 in the table showed the sample calculations.
x y xy x2 y2
0 15 0 x 15 = 0 0x0=0 15 x 15 = 225
3 20 3 x 2 = 60 3x3=9 20 x 20 = 400
5 30 5 x 30 = 150 5 x 5 = 25 30 x 30 = 900
1 15 15 1 225
4 25 100 16 625
6 35 210 36 1225
∑x = 0 + 3 + 5 ∑y = 15 +20 +30 +
+1+4+6= 15 + 25 + 25 + 35 ∑xy = 535 ∑(x2) = 87 ∑(y2) = 3600
19 = 140
The values obtained in the last row of the table can then be used to calculate for Pearson’s r as shown
below:
Pearson’s r :
Correlation analysis is a method of statistical evaluation used to study the strength of a relationship
between two, numerically measured, continuous variables (e.g. height and weight). This particular type of
analysis is useful when a researcher wants to establish if there are possible connections between
variables.
The strength of the correlation is measured using the correlation coefficient. There are several types
of correlation coefficient but for this module you will consider just one of them, the Pearson r.
Strength of Correlation
There are no hard rules in determining the strength of the linear relationship based on the
correlation coefficient, but you may want to use the following guide in order to interpret the
correlation:
Example 1: Do a correlation analysis of the variables x and y of the example presented in Lesson 1.
Solution:
Since Pearson’s r is already solved, the analysis would be like this:
With an r = 0.97, then there is a positive, strong correlation between variable x and y; that is,
the as the values of x increases, the value of y also increases.
Example 2: Do correlation and regression analysis by determining the Pearson’s r of the relationship that
exist between the time spent in playing ML, Mobile Legends®, and the grades of senior high school (SHS).
The data is presented below:
Respondents A B C D E F G H I J K L M N O P Q R S T
Time Spent in ML 1 4 2 5 1 4 2 1 2 4 4 2 1 3 1 3 2 3 2 2
Grades 89 91 88 87 88 81 79 91 78 90 79 79 87 76 88 85 91 79 79 75
Solution:
First assign which variable will be x and which will y; it is not relevant which is which but for uniformity
purposes, assign x to the variable first presented and y the other. It is also not necessary to arrange the
data set in ascending or descending order; what is important is that the data pairing will not be
interchanged.
Then, as in the table in the example in Lesson 1, do the same as shown below:
Pearson’s r :
n ( ∑ xy )−( ∑ x ) ( ∑ y ) 20 ( 4100 )−( 49 )( 1680 ) 82000−82320
r= = =
2 2
2 2
√ [ n ∑ ( x )−( ∑ x ) ][ n ∑ ( y )− (∑ y ) ] √ [ 20 ( 149 )−( 49 ) ][ 20 ( 141,710 )−( 1680 ) ]
2 2
√ [ 2980−2401 ] [ 2,834,200−2,822,40
¿−0.12
Since r = -0.12, then there is a negative, weak correlation between time spent in ML and the grades of
the senior high school students; it can then be said that somehow, in some cases (weak correlation),
the more amount of time spent in playing ML the lesser will be the grades.
IV. ACTIVITIES
Activity 1 (Competency 1)
Date: May 31, 2021
Direction: Determine the Pearson’s r of the following data set. Score: ___
x y xy x2 y2
10 30
13 40
15 60
11 30
14 50
16 70
∑x = ∑y = ∑xy = ∑(x2) = ∑(y2) =
(Scoring Rubric: 1 point for each entry in the table, for the value of Pearson r and the analysis.)
Pearson’s r:
Activity 2 (Competency 2)
Date: June 01, 2021
Title: Analyze me!
Direction: Perform correlational analysis by determining the Person’s r of the relationship that
exist between the number of hours of sunshine and the amount of ice cream sold at a shop Score: ___
Hours of Sunshine 2 3 5 7 9
Ice Cream Sold 4 5 7 10 15
Construct & complete the table, then solve Person’s r write a correlational analysis on the space
below:
(Scoring Rubric: 1 point for each entry in the table, for the value of Pearson r and the analysis.)
Activity 3 (Competency 2)
Date: June 02, 2020
Title: A Statistical Inquiry!
Direction: Perform a statistical inquiry applying correlational analysis in the situation below. Score: ___
An inquiry is to be conducted to determine whether a correlation exist between a student’s attitude in class
to his or her academic performance. A sample data was collected; shown below.
Respondent
A B C D E F
s
Attitude 5 8 7 6 4 9
Academic
78 90 83 87 76 92
Performance
I. Encircle the letter of the best answer and write it on your answer sheet.
5. A Person’s r is calculated between two sets of data x and y. What would be the correlational analysis if r =
0.84?
a. There is a strong and negative correlation between x and y; that is the values of x increases as the
values of y decreases.
Weight Height (6.) (7.) (8.)
(x) (y) xy x2 y2
47 157
45 158
51 155
51 152
53 153
43 147
45 145
48 155
(9.) ∑x = (10.) ∑y = (11.) ∑xy = (12.) ∑(x2) = (13.) ∑(y2) =
b. There is a strong negative correlation between x and y; that is the values of x increases as the
values of y increases.
c. There is a strong and positive correlation between x and y; that is the values of x increases as the
values of y decreases.
d. There is a strong and positive correlation between x and y; that is the values of x increases as the
values of y decreases.
II. Perform a correlational analysis of the situation below. Show your solution by completing the table, writing
the calculated Person’s r and analysis in the space below.
A survey is conducted to determine if a correlation exists between the weight, in kilogram, and height, in
centimeters, of the students of a certain high school. Eight sample students were selected for the
survey and the resulting data is tabulated below.
(14.) Pearson’s r:
PARENTS’/GUARDIANS’ FEEDBACK
Activity 2:
x2 y2 n ( ∑ xy )−( ∑ x ) ( ∑ y )
x y xy r=
2 2
2 4 8 4 16
√ [ n ∑ ( x )−( ∑ x ) ][ n ∑ ( y )− (∑ y ) ]
2 2
5 ( 263 )− ( 26 ) ( 41 )
3 5 15 9 25 ¿ 2 2
√[ 5 ( 168 ) −( 26 ) ][ 5 ( 415 )−( 41 ) ]
5 7 35 25 49
1315−1066
¿
7 10 70 49 100 √[ 840−676 ] [ 2075−1681 ]
9 15 135 81 225 249 249 249
¿ = = =0.98
√[ 164 ][ 394 ] √64616 254.197
There is a strong, positive correlation between the variables (r
∑x = 26 ∑y = 41 ∑xy = 263 ∑(x2) = 168 ∑(y2) = 415 = 0.98).
VIII. REFERENCES
Albert, J. R. (2008). Basic Statistics for the Tertiary Level. Rex Bookstore.
Commision on Higher Education (2016). Teaching Guide for Senior High School: Statistics and
Probability. Commission on Higher Education.
De Veau, R. D., Velleman, P. F., & Bock, D. E. (2006). Intro Stats. Pearson Ed. Inc.
Febre, Francisco Jr. (1987). Introduction to Statistics. Phoenix Publishing House