Math_I
Math_I
Table of Contents
Introduction: 3
Information/Mesurement: 4
Mathematical Processes: 9
Interpretation of Results: 16
Validity: 17
Areas of Improvement: 18
Appendix: 18
bibliography: 20
3
Criterion A: Introduction
I’m going to be doing my research on the correlation of hours of sleep and grades
since grades are a very important part of every student's life and it reflects on the
student's academic performance. The purpose of this research is to find out whether or
not a students grades and the of sleep they get are related. In order to achieve the
answer, I will be engaged in primary research. For Primary research, I will be distributing
questionaries amongst the students of DP12 and DP11 with a specific set of questions
(Table 1) through the use of Google questionnaires. Afterward I will be analyzing these
results using multiple mathematical methods such as the calculation of the least squares
continuity (a form of the chi squared test). I decided to pick this as my topic of research
because i have always strived to get my grades better, and I’ve always been told that
sleeping is very important to have good grades, which i had some doubts in that
statement. So this research should answer my doubts and let me know if the grades and
4
sleeping hours are actually related. Personally, i believe that the answer will be no, and
as some of my other friends from all around the world, the total number adding up to 54
people. students were asked to fill in their total score 100% being the perfect grade, and
80 7.5
85 8.5
90 8
70 7.5
5
70 6.5
90 8
80 5.5
60 6.5
80 8
67 4
80 8
80 8
66 8
95 6
80 8
25 4
99 7
98 7
92 5
80 8
99 8
92 5
100 4
85 6
87 7
80 6
75 8
85 8
6
65 2.5
82 7
80 4
90 8.5
90 6.5
97 8
95 5
60 7
65 8
89 10
96 8
70 6.5
94 7
75 9
75 6.5
100 3
31 7.5
68 7
90 9
7
The graphs below have been imported from my personal google questionnaire. I created
distributed the link among two different communities on the internet, i happened to be an
administrator in one of the communities so the response was very big. A lot of people
were able to use the form, not only my international friends from all around the world
(USA, Canada, France, Spain, Pakistan, Philippines, China, Sweden etc.) but my
Using Excel i created a scatter diagram of Hours of sleep correlating with the
students grades. i calculated the r2 using Excel as well, the regression line has also been
· The chi squared test is used to measure whether two classification (or factors) from
Null Hypothesis is “The grades of the students are independent of the amount of sleep
they get.”
Alternative Hypothesis is “The grades of the students is not independent of the amount of
Observed Values
G<40 1 1 0 2
40 ≤ G < 80 2 9 1 12
G ≥ 80 3 28 2 33
11
Total 6 38 3 47
Expected Values
|(F o − F e )| 2 −0.5
C hi 2 = ∑ Fe
(|(1 − 0.255| −0.5) 2 |(1 − 1.53| −0.5) 2 |(0 − 0.16| −0.5) 2 (|2 − 1.53| −0.5) 2
C hi 2 = ∑ 0.255
+ 1.53
+ 0.16
+ 1.53
+
(|9 − 9.70| −0.5) 2 (|1 −0.78| −0.5) 2 (|3 − 4.21| −0.5 2 (|28 − 26.7| −0.5) 2 (|2 − 2.13| −0.5) 2
9.70
+ 0.78
+ 4.21
+ 26.7
+ 2.13
C hi 2 = 1.271686215
Df = (Rows − 1)(Columns − 1)
Df = (3 − 1)(3 − 1)
Df = 4
12
C hi 2 value is less than the critical value, 1.271686215<9.488, the null hypothesis is
accepted and the grades of the students are independent of the amount of sleep they get.
· Least squares regression identifies the relationship between the independent variable
“x” “Grades of the Students” and the dependent variable “y” “ The ammount of sleep
they get” .
S xy
y−y = (S x ) 2
(x − x)
Sx = √ ∑(x−x) 2
n S xy is the covariance =
∑ xy
n - x y.
x is the mean value or average of the grades received by the students
y is the mean value or average of the hours of sleep that students got.
x y x−x y−y (x − x) 2 (y − y ) 2
(x − x) * (y − y )
80 7.5 -0.468085106 0.670212766 0.219103667 0.449185152 -0.313716614
85 8.5 4.531914894 1.670212766 20.5382526 2.789610684 7.56926211
90 8 9.531914894 1.170212766 90.85740154 1.369397918 11.15436849
70 7.5 -10.46808511 0.670212766 109.5808058 0.449185152 -7.015844273
70 6.5 -10.46808511 -0.329787234 109.5808058 0.10875962 3.452240833
90 8 9.531914894 1.170212766 90.85740154 1.369397918 11.15436849
13
3782
x= 47 = 80.468
321
y = 47 = 6.83
Sx = √ ∑(x−x) 2
n Sx = 11583.70213
47 = √246.46 = 15.7
∑ xy
n - x y.
25958
47 - 80.47*6.83 =2.69
15
√11583.70213
47
S xy
y−y = (S x ) 2
(x − x)
2.69
y − 6.83 = (15.7 ) 2
(x − 80.468)
y=0.011x + 5.9424
The regression line represents the relation between the two factors the grades of the
students and The grades of the students. This will help us to predict the future values.
We have a steady line since the slope is a very small number: 0.011, this means that the
line is not increasing or decreasing and the line is steady. This means that there is no clear
● Pearson’s correlation coefficient indicates the strength of the relationship between the
independent variable “x” “Grades of the students” and the dependent variable “y” “ The
hours of sleep.”
√ √
∑(x−x) 2 ∑(y−y) 2
S xy
r= SxSy Where Sx = n
Sy = n
∑ xy
S xy is the covariance n - xy
Sx = √ ∑(x−x) 2
n Sx = 11583.70213
47 = √246.46 = 15.7
√
∑(y−y) 2
125.6382979
Sy = n Sx = 47 = √2.673 = 1.635
25958
S xy= 47 - 80.47*6.83 =2.69
2.69
r= 15.7*1.635 = 0.28
The correlation between “Grades of the students” and “The hours of sleep" is a weak
positive correlation, since the “r” is 0.28, which proves that the correlation is weak.
17
Interpretation
To investigate whether the grades of the students and the hours of sleep that they
got are related I performed three tests, which are Calculation of the least squares
continuity (a form of the chi squared test). All three tests provided results that suggested
that the two variables are independent. This proves that the grades of a student and the
directly corresponds with a weak correlation. Also the yates’ correction for continuity
test showed that the critical value at 5% significance with 4 degrees of freedom was
9.488 which is way higher than our critical value (1.271686215) this lead me to accept
the null hypothesis and conclude that the two variables are independent of each other.
Calculation of the least squares regression also showed us that we had a steady line since
the slope was a very small number: 0.011, This means that there is no clear correlation
One way we may see this relationship is that every student is an individual. They
may not let the sleep they get affect their grades. Student A may go home from school,
study everything in order to get great grades and go to sleep at a reasonable time and get
great grades while Student B may do exactly the same as Student A only after finishing
everything play on their computer, watch television or engage in some leisure, either
18
way the student will not get as much of sleep as Student A did but this does not affect
what he has studied and the grades the student will get.
Validity
I first performed the linear regression to check the relationship between the two
variables. I created a scatter plot which clearly showed a weak positive linear correlation
between the two variables. A very low r 2 value showed that the correlation between the
two values was very weak. To further support the claim I used a form of the chi-squared
test called Yates’ correction for continuity which also suggested that the two variables
are independent of each other. All together tests provide strong support that the hours of
sleep that a student gets and their grades are not related.
were done several times to ensure accuracy. For the Pearson’s Correlation Coefficient i
used two different ways to calculate the r value, Graphing Calculator and Math by hand,
both resulted in the same figures. For the regression line and the r 2 i used three different
ways to calculate it. Graphing Calculator, Math by hand and The Microsoft Excel graph.
In addition, I performed three different operations to show that the hours of sleep and the
Areas of Improvement
To be completely sure that the two variables are independent this study can be
improved. The research was only limited to 47 students, which is not a lot but all i could
gather. If there was more data of 100+ students the answer would be more accurate. I
also only had data from High School students, having students from Universities could
Appendix:
Hours Of
Grades Sleep
80 7.5
85 8.5
90 8
70 7.5
70 6.5
90 8
80 5.5
60 6.5
80 8
67 4
20
80 8
80 8
66 8
95 6
80 8
25 4
99 7
98 7
92 5
80 8
99 8
92 5
100 4
85 6
87 7
80 6
75 8
85 8
65 2.5
82 7
80 4
90 8.5
90 6.5
97 8
95 5
60 7
65 8
89 10
96 8
21
70 6.5
94 7
75 9
75 6.5
100 3
31 7.5
68 7
90 9
Bibliography
https://en.wikipedia.org/wiki/Yates%27s_correction_for_continuity
Google Questionaries
https://www.statisticshowto.datasciencecentral.com/probability-and-statistics/correlation-
coefficient-formula/
https://faculty.elgin.edu/dkernler/statistics/ch04/4-2.html