4 Association of Attributes
4 Association of Attributes
What is Correlation?
Correlation is a measure of the strength of the relationship between two variables. The
correlation coefficient quantifies the degree of change of one variable based on the change of
the other variable. In statistics, correlation is connected to the concept of dependence, which is
the statistical relationship between two variables
The Pearson’s correlation coefficient or just the correlation coefficient r is a value between -1
and 1 (-1≤r≤+1). It is the most commonly used correlation coefficient and valid only for a linear
relationship between the variables. If r=0, no relationship exist, and if r≥0, the relation is
directly proportional; the value of one variable increases with the increase in the other. If r≤0,
the relationship is inversely proportional; one variable decreases as the other increases.
Because of the linearity condition, correlation coefficient r can also be used to establish the
presence of a linear relationship between the variables.
What is Association?
The statistical term association is defined as a relationship between two random variables which
makes them statistically dependent. It refers to rather a general relationship without specifics of
the relationship being mentioned, and it is not necessary to be a causal relationship.
Many statistical methods are used to establish the association between two variables. Pearson’s
correlation coefficient, odds ratio, distance correlation, Goodman’s and Kruskal’s Lambda and
Spearman’s rho (ρ) are a few examples.