Bivariate Statistics: by The End of This Sub-Topic, Learners Should Be Able To
Bivariate Statistics: by The End of This Sub-Topic, Learners Should Be Able To
Introduction
Bivariate statistics is used when analysing a data set with two
variables which are usually denoted as X and Y.
Bivariate statistics is also used for two sets of variables that are
dependent on each other such as volume of traffic flow into a
town’s central business district and time.
Graphs such as the scatter plot and box plot are used to plot
bivariate data.
ρ=1-6∑di2n(n2-1)
Where n = number of values in the sample
di = difference in paired ranks
The values are ranked in a descending order with the highest value
given a rank of 1.
Example 2.3.1
Table 2.3.2
∑di2= 16+4+0+4+16
= 40
Now we put this into the formula.
ρ=1-6∑di2n(n2-1)
= =1-6(40)5(52-1)
=1-240120
= 1-2
= -1
Therefore, there is a weak relationship between the values.
The correlation coefficient r shows how far away the data points are
to the line of best fit.
It therefore, attempts to draw a line of best fit through the two data
variables’ scatter points.
Example
A study into the number of reported outbreaks of typhoid
in Mbare for the past 7 years shows the following data.
Example
A study carried out in Harare to determine if there is any
relationship between typhoid outbreaks and the rain season of
every year. The following table shows the findings.
Use this information to find out if there is typhoid outbreaks are on
the increase every rain season.
Solution