Biostatistics Unit 10. Measures of Relationship
Biostatistics Unit 10. Measures of Relationship
P. K. MUTETI
[email protected]
Definition
P. K. MUTETI 2
Cont
As a child grows from an infant into a toddler into a young child, both the child's
height and weight tend to change.
Those changes are not always tightly locked to one another, but they do tend to
occur together.
So if we took a sample of children from a few weeks old to 3 years old and
measured the height and weight of each child, we would likely see a positive
relationship between the two.
With the relationship between height and weight in children, it is likely that the
general growth of children, which increases both height and weight, accounts for
the observed correlation.
P. K. MUTETI 3
Cont
P. K. MUTETI 4
Graphic representation of correlation coefficient of height and
weight in children
P. K. MUTETI 5
Types of correlation coefficients.
P. K. MUTETI 7
Curved correlation
P. K. MUTETI 8
Types of measures of relationship
P. K. MUTETI 9
Cont
As you move from zero to either end of this scale, the strength of the relationship
increases.
You can think of the strength of a linear relationship as how tightly the data points
in a scatter plot cluster around a straight line.
In a perfect relationship, either negative or positive, the points all fall on a single
straight line.
P. K. MUTETI 10
Cont
The symbol for the Pearson correlation is a lowercase r, which is often subscripted
with the two variables.
For example, rxy stand for the correlation between the variables X and Y.
P. K. MUTETI 11
Karl pearsons coefficient of correlation
P. K. MUTETI 12
Where
r = Correlation coefficient
xi = values of the x-variable in a sample
x̄ the mean of the values of the x-variable
y = values of the y-variable in a sample
ȳ = the mean of the values of the y-variable
P. K. MUTETI 13
Cont
P. K. MUTETI 14
Cont
P. K. MUTETI 15
Spearman rank-order correlation formula
P. K. MUTETI 16
Cont
3. Phi correlation coefficient. The Phi coefficient is an index of the degree of relationship between
two variables that are measured on a nominal scale. Because variables measured on a nominal scale
are simply classified by type, rather than measured in the more general sense, there is no such thing
as a linear relationship. Nevertheless, it is possible to see if there is a relationship.
For example, suppose you want to study the relationship between religious background and
occupations. You have a classification systems for religion that includes Catholic, Protestant,
Muslim, Other, and Agnostic/Atheist. You have also developed a classification for occupations that
include Unskilled Laborer, Skilled Laborer, Clerical, Middle Manager, Small Business Owner, and
Professional/Upper Management. You want to see if the distribution of religious preferences differ
by occupation, which is just another way of saying that there is a relationship between these two
variables.
The Phi Coefficient is not used nearly as often as the Pearson and Spearman correlations. Therefore,
we will not be devoting space here to the computational procedures. However, interested students
can consult advances statistics textbooks for the details. you can compute Phi easily as one of the
options in the crosstabs procedure in SPSS for Windows. Click on the button below to see how.
P. K. MUTETI 17
Methods of studying correlation
P. K. MUTETI 18
Scatter diagrams method
This is the simplest method in studying correlation in special type of dot chart in
two variables.
It is also known as dotogram or scatter diagram.
The given data are plotted on a graph paper in the form of dots
P. K. MUTETI 19
Cont
When the two sets of observations increase or decrease together (positive) the line
slopes upwards from left to right
When one set decreases as the other increases the line slopes downwards from left
to right.
As the line must be straight, it will probably pass through few, if any, of the dots
P. K. MUTETI 20
Cont
P. K. MUTETI 21
Perfectly positive
P. K. MUTETI 22
No correlation.
P. K. MUTETI 23
Perfect negative correlation.
P. K. MUTETI 24
Curvilinear correlation
P. K. MUTETI 25
Significance of the study of correlation
P. K. MUTETI 26
Covariance
Covariance is the measure of the joint variability of two random variables (X, Y).
For Example – Income and expense of households. The households having higher
Income (say X) will have relatively higher expenses (say Y) and vice-versa.
This kind of relationship between two variables is called joint variability and is
measured through covariance and correlation
P. K. MUTETI 27
Positive covariance:
If the variable(X) takes a higher value, the value of the corresponding variable(Y) is
also higher and vice-versa.
Eg. Income and Expense of Household. As X takes a higher value, the
corresponding values of Y is on the higher side E.x. Income and Expense of
Household. As X takes a higher value, the corresponding values of Y is on the higher
side
P. K. MUTETI 28
Positive covariance
P. K. MUTETI 29
Negative covariance
If the variable(X) takes a higher value, the value of the corresponding variable(Y) is
low and vice-versa.
Example: Price and demand. as the price of a commodity increases, its demand
decreases.
P. K. MUTETI 30
Negative covariance
P. K. MUTETI 31
Zero covariance or no covariance:
Note: The zero covariance means the covariance will be zero or near zero
P. K. MUTETI 32
Zero covariance or no covariance
P. K. MUTETI 33
Covariance formula
P. K. MUTETI 34
Regression
P. K. MUTETI 35
Simple linear regression:
P. K. MUTETI 36
The end
Thank you
P. K. MUTETI 37