4. Correlation & Regression
4. Correlation & Regression
Correlation
About
2 variables are said to be correlated if a Change in one causes a corresponding change in the other variable.
It is a Numerical or Quantitative measure of relationship or association b/w any 2 variables.
Types of correlation
1. Based on the direction of change of variables i) Positive, ii) Negative
2. Based upon the constancy of the ratio of change b/w the variables i) Linear ii) Non-linear
3. Based upon the no. of variables studied i) Simple ii) Multiple iii) Partial
Measures of correlation
1. Scatter (Dot) Diagram method.
2. Karl Pearson’s coefficient of correlation.
3. Spearman’s Rank correlation coefficient
Properties
i) Correlation coefficient possess the property of symmetry. The correlation coefficient is
symmetrical w.r.t x and y. i.e., r (x, y) = r (y, x).
ii) ‘r’ is free from the unit of measurement. i.e., ‘r’ is a pure no. which is suitable for
comparison.
iii) Correlation coefficient is independent of change of origin and scale.
iv) If 2 variables are independent, then r = 0. BUT converse is not always true.
Merits
1. It gives a mathematical value, in which it summaries the degree and direction of correlation.
Demerits
1. Always assume linear relationship.
2. Calculation of ‘r’ is difficult.
3. ‘r’ is affected by extreme obs.
4. Time consuming method.
Demerits
i) This method can’t be used for finding correlation in the case of bivariate frequency
distribution.
ii) This method is very difficult to apply when the no. of items is more than 30.
If we are given data in the form of ranks BUT the highest rank in the series exceeds the no. of pairs of obs.
in such situations, ranks are treated as values and the fresh ranks are determined.
Regression
About
The literal or dictionary meaning of regression is “stepping back” or “moving backward” or “returning to
avg. value”.
Regression term vas 1st time used by Sir Francis Galton in 1877.
It is a functional or mathematical relationship b/w 2 variables.
Regression analysis
It means the estimation or prediction of the unknown value of one variable (DEPENDENT variable)
from the known values of one or more other variables (INDEPENDENT variables).
- The variable whose value is to be predicted is called the DEPENDENT/ EXPLAINED/
PREDICTED/ REGRESSED variable.
- The variables whose value are used to predict the value of dependent variable are called
INDEPENDENT/ EXPLANATORY/ PREDICTOR/ REGRESSOR variable.
The regression analysis confined to the study of only 2 variables, a dependent and an independent
variable is called Simple Regression Analysis.
Linear regression
When the relationship b/w the dependent and independent variable is linear, the technique for prediction is
called Simple Linear Regression.