0% found this document useful (0 votes)
8 views

4. Correlation Analysis

Correlation analysis examines the degree of inter-relatedness among variables, indicating whether they tend to increase or decrease together. It includes various types such as positive, negative, and zero correlation, as well as simple, partial, and multiple correlation. The correlation coefficient quantifies the strength of the relationship, ranging from -1 (perfect negative) to 1 (perfect positive), and tools like Spearman's rank correlation measure associations between ranked variables.

Uploaded by

triviatruism
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views

4. Correlation Analysis

Correlation analysis examines the degree of inter-relatedness among variables, indicating whether they tend to increase or decrease together. It includes various types such as positive, negative, and zero correlation, as well as simple, partial, and multiple correlation. The correlation coefficient quantifies the strength of the relationship, ranging from -1 (perfect negative) to 1 (perfect positive), and tools like Spearman's rank correlation measure associations between ranked variables.

Uploaded by

triviatruism
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 19

CORRELATION

ANALYSIS
Mr. Sunanda Das, Assistant Professor, Dept. of CSE, KUET
CORRELATION
• Correlation is the degree of inter-relatedness/associations among the
two or more variables.
• Correlation analysis is a process to find out the degree of relationship
between two or more variables by applying various statistical tools and
techniques.

Examples:
• Relationship between price and demand of a commodity
• Relationship between height and weight
Scope of Correlation Analysis
• The existence of correlation between two (or more) variables only
implies that these variables:

• Either tend to increase or decreased together


• An increase (or decrease) in one is accompanied by the corresponding
decrease (or increase) in the other.

# Correlation analysis does not answer the questions like why there is
cause and effect between two variables.
Types of Correlation

Types of Correlation
On the basis of degree On the basis of number of On the basis of linearity
of correlation variables
• Simple correlation
• Positive correlation • Linear correlation
• Partial correlation
• Negative correlation • Non – linear correlation
• Multiple correlation
• Zero Correlation
Positive Correlation
• When two variables move in the same direction then the correlation between these
two variables is said to be Positive Correlation.
• When the value of one variable increases, the value of other value also increases at
the same rate.

Negative Correlation
• In this type of correlation, the two variables move in the opposite direction.
• When the value of one variable increases, the value of the other variable decreases.
Zero Correlation
• When the two variables are independent and the change in one
variable has no effect in other variable
Simple correlation
• Correlation is said to be simple when only two variables are analyzed.

Partial correlation
• When three or more variables are considered for analysis but only
two influencing variables are studied and rest influencing variables
are kept constant.

• Correlation analysis is done with demand, supply and income. Where


income is kept constant.
Multiple correlation
In case of multiple correlation three or more variables are studied
simultaneously.

Rainfall, production of rice and price of rice are studied simultaneously will be
known are multiple correlation.

Linear correlation
If the change in amount of one variable tends to make changes in amount of
other variable bearing constant changing ratio it is said to be linear
correlation.
Non - Linear correlation
If the change in amount of one variable tends to make changes in
amount of other variable but not bearing constant changing ratio it is
said to be non – linear correlation.
Correlation Coefficient
• The correlation coefficient that indicates the strength of the
relationship between two variables. Ex. Pearson's correlation
coefficient.
• i.e In order to test the linear association between two variables x and
y we can use the Pearson correlation coefficient rxy
• The correlation coefficient takes values between -1 to 1
• 1: perfect/strong and positive linear correlation
• -1: perfect/strong and negative linear correlation
• 0: no linear correlation

1 0.8 0.4 0 -0.4 -0.8 -1


Correlation can have a value:

r = 1 is a perfect positive correlation


r = 0 is no correlation (the values don't seem linked at all)
r = -1 is a perfect negative correlation
GLUCOSE
SUBJECT AGE (X)
LEVEL (Y)
1 43 99
2 21 65
3 25 79
4 42 75
5 57 87
6 59 81
Rank correlation
• A rank correlation measures an ordinal association

i. The relationship between rankings of different ordinal variables or


ii. Different rankings of the same variable, where a "ranking" is the
assignment of the ordering labels "first", "second", "third", etc. to different
observations of a particular variable.

• A rank correlation coefficient measures the degree of similarity


between two rankings, and can be used to assess the significance of
the relation between them.
Spearman's rank correlation
coefficient
• Spearman's correlation coefficient, (ρ, also signified by rs) measures the
strength and direction of association between two ranked variables

• d= the difference between the ranks of corresponding variables


• n= number of observations

• #An ordinal variable is a categorical variable for which the possible values are ordered. For example, suppose
you have a variable, economic status, with three categories (low, medium and high).
• In case u individuals receive the same rank, we describe it as a tied
rank of length u. In case of a tied rank, the above given formula is
changed to

• In this formula, tj represents the jth tie length and the summation
extends over the lengths of all the ties for both the series.
References
1. Probability & Statistics for Engineers & Scientists by Ronald E.
Walpole, Raymond H. Myers, Sharon L. Myers, Keying Ye
2. Probability and statistical inference by Robert V. Hogg, Elliot Tanis,
Dale Zimmerman

• https://blog.flexmr.net/correlation-analysis-definition-exploration
• https://www.mathsisfun.com/data/correlation.html

You might also like