Module - 2 Correlation Analysis: Contents: 2.2 Types of Correlation
Module - 2 Correlation Analysis: Contents: 2.2 Types of Correlation
2.1 Introduction
Correlation is a statistical technique that can show whether and how strongly pairs of variables are related. For example, height and weight are related; taller people tend to be heavier than shorter people. The relationship isn't perfect. People of the same height vary in weight, and you can easily think of two people you know where the shorter one is heavier than the taller one. Nonetheless, the average weight of people 5'5'' is less than the average weight of people 5'6'', and their average weight is less than that of people 5'7'', etc. Correlation can tell you just how much of the variation in peoples' weights is related to their heights. Although this correlation is fairly obvious your data may contain unsuspected correlations. You may also suspect there are correlations, but don't know which are the strongest. An intelligent correlation analysis can lead to a greater understanding of your data.
2.2.2
The distinction between simple, partial and multiple Correlation is based upon the number of variables studied Simple Correlation Correlation between only two variables, e.g. Correlation between age and height, correlation between yield of rice and amount of rainfall in a given area are examples of Simple Correlation Multiple Correlation When correlation between three or more variables are studied simultaneously, then it is called multiple Correlation Partial Correlation In this we recognize more than two variables but consider only two variables to be influencing each other, the effect of other influencing variable being kept constant. The correlation between the two variables keeping the other variables constant is called partial correlation 1 2 3 4 5 X1-Yield of rice X2-Amount of Rainfall X3-Amount of fertilizers X4-Type of soil X5-Advanced technologies used.
Correlation analysis of X1, X2, X3, X4 and X5 is an example of Multiple Correlation whereas if we only study the relation between X1 and X2 keeping other variables constant it would be an example of Partial Correlation between yield of rice and amount of rainfall.
2.2.3
The nature of the graph gives us the idea of the linear type of correlation between two variables. If the graph is in
a straight line, the correlation is called a "linear correlation" and if the graph is not in a straight line, the correlation is non-linear or curvi-linear
degree of correlation is denoted by r and its direction is given by the signs positive and negative. Example :
positive correlation
negative correlation
no correlation
crore) 16 29 43 20 3 4 6 Y =121
Mean
)2
Mean
)2
Mean
(Y
-Y
Mean
Sum of squared deviations of sales = 1310.79 Sum of cross products (SP) = 261.24
Calculation of the Pearson r r= 261.24 (53.71) (1310.79) = 261.24 70402.53 .
r = (261.24) / (265.33) = +0.985 Interpretation The magnitude of the correlation between advertisement expenditure and sales = 0.985. The direction of the relationship is positive. As the advertisement expenditure increases so does the sales of the commodity.
Convenience Store 1 2
from
Rank
Rank
between
d2
10 9
1.80 1.20
2 3.5
8 5.5
64 30.25
3 4 5 6 7 8 9 10
8 7 6 5 4 3 2 1
1 6 6 3.5 9 10 6 8
7 1 0 1.5 -5 -7 -4 -7
49 1 0 2.25 25 49 16 49 d 285.5 =
Data Table: Spearman's Rank Correlation Calculate the coefficient (R) using the formula below. The answer will always be between 1.0 (a perfect positive correlation) and -1.0 (a perfect negative correlation). When written in mathematical notation the Spearman Rank formula looks like this :
Now to put all these values into the formula. Find the value of all the d2 values by adding up all the values in the Difference 2 column. In our example this is 285.5. Multiplying this by 6 gives 1713. Now for the bottom line of the equation. The value n is the number of sites at which you took measurements. This, in our example is 10. Substituting these values into n - n we get 1000 - 10 We now have the formula: R = 1 - (1713/990) which gives a value for R: 1 - 1.73 = -0.73.
What does this R value of -0.73 mean The R value of -0.73 suggests a fairly strong negative relationship.