Correlation Analysis
A correlation is said to be positive if, as one variable increases, the other variable also tends to
increase. Similarly, a correlation is said to be negative if, as one variable increases, the other
variable tends to decrease. Examples of each are given above. A relationship is said to have nil
correlation if, as one variable increases, the other variable tends neither to increase nor to
decrease in any definite way. The degree of correlation between two variables is measured
quantitatively by parameters called correlation coefficients. The most common correlation
coefficient is the Pearson correlation coefficient r (also called the Pearson product-moment
correlation coefficient), as defined below. The figure beside (courtesy: Wikipedia) shows
various cases of positive, nil and negative correlation, with the value of the coefficient r for
each case given in the top row. Note that r can vary from 1 to –1, indicating
positive (1 ≥ r > 0), practically nil (r ≈ 0) or negative (–1 ≤ r < 0) correlation.
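$$ r = \frac{1}{n-1} \sum_{i=1}^{n} \left( \frac{x_i - \bar{x}}{s_x} \right) \left( \frac{y_i - \bar{y}}{s_y} \right) $$

Here x̄ and ȳ are the mean values of the variables x (i.e., of xi) and y (i.e., of yi) respectively, n
is the number of observations, while sx and sy are the standard deviations of x and y respectively
(taken here as sample standard deviations, i.e., computed with the divisor n – 1).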
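To make the definition concrete, here is a minimal Python sketch of it. It assumes that sx and sy are sample standard deviations (divisor n – 1), which is what `statistics.stdev` computes; the function name `pearson_r_definition` and the small data sets are illustrative assumptions, not part of the original text.

```python
import statistics


def pearson_r_definition(xs, ys):
    """Pearson r from deviations about the means, scaled by (n - 1) * sx * sy."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    x_bar = statistics.mean(xs)
    y_bar = statistics.mean(ys)
    s_x = statistics.stdev(xs)  # sample standard deviation (divisor n - 1)
    s_y = statistics.stdev(ys)
    # Sum of products of the deviations of each pair from the two means.
    total = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    return total / ((n - 1) * s_x * s_y)


# Hypothetical data: y rises exactly in step with x, then falls exactly.
r_up = pearson_r_definition([1, 2, 3, 4], [2, 4, 6, 8])
r_down = pearson_r_definition([1, 2, 3, 4], [8, 6, 4, 2])
print(r_up, r_down)
```

The two calls illustrate the limiting cases: a perfectly increasing straight-line relation gives r = 1 and a perfectly decreasing one gives r = –1, up to floating-point rounding.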
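In practice, however, the following equivalent relation is used to calculate the value of r more easily:

$$ r = \frac{n \sum x_i y_i - \sum x_i \sum y_i}{\sqrt{\left[ n \sum x_i^2 - \left( \sum x_i \right)^2 \right] \left[ n \sum y_i^2 - \left( \sum y_i \right)^2 \right]}} $$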
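The computational form of r, built only from the five sums Σxi, Σyi, Σxi², Σxi yi and Σyi², can be sketched in Python as follows. The function name `pearson_r_from_sums` and the five-point data set are hypothetical, chosen so that the arithmetic is easy to follow by hand.

```python
import math


def pearson_r_from_sums(xs, ys):
    """Pearson r from the five running sums, without computing the means first."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xx = sum(x * x for x in xs)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_yy = sum(y * y for y in ys)
    numerator = n * sum_xy - sum_x * sum_y
    x_term = n * sum_xx - sum_x ** 2
    y_term = n * sum_yy - sum_y ** 2
    if x_term == 0 or y_term == 0:
        # A variable that never changes has no defined correlation.
        raise ValueError("r is undefined when one variable is constant")
    return numerator / math.sqrt(x_term * y_term)


# Hypothetical data: the sums are 15, 15, 55, 53 and 55,
# so r = (5*53 - 15*15) / sqrt((5*55 - 15*15) * (5*55 - 15*15)) = 40 / 50.
r_small = pearson_r_from_sums([1, 2, 3, 4, 5], [2, 1, 4, 3, 5])
print(r_small)
```

This form is convenient for hand or spreadsheet work because each sum can be accumulated column by column in a single pass over the data.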
As an illustrative example, let us consider the following data of heights and weights for 8 men:
Height (cm) 165 167 170 172 173 176 180 183
Weight (kg) 62 60 64 65 65 68 73 72
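To calculate the coefficient r here, let us calculate the sums of {xi}, {yi}, {xi²}, {xi yi} and {yi²}
using the following table, where xi is the height and yi the weight (we note that here the number of points n = 8):
xi (cm)   yi (kg)   xi²      xi yi    yi²
165       62        27225    10230    3844
167       60        27889    10020    3600
170       64        28900    10880    4096
172       65        29584    11180    4225
173       65        29929    11245    4225
176       68        30976    11968    4624
180       73        32400    13140    5329
183       72        33489    13176    5184
Sum: 1386   529     240392   91839    35127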
So, r = {8 x 91839 – 1386 x 529}/√{(8 x 240392 – 1386 x 1386) x (8 x 35127 – 529 x 529)}
= 1518/√{2140 x 1175}
= 0.957
Thus it is a case of strong positive correlation, with the correlation coefficient r ≈ 0.96, close to its maximum value of 1.
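The arithmetic of this example can be checked with a short Python sketch that takes the heights and weights from the data above and forms the five sums before computing r. The variable names are illustrative assumptions.

```python
import math

# Data from the example: heights (cm) and weights (kg) of 8 men.
heights_cm = [165, 167, 170, 172, 173, 176, 180, 183]
weights_kg = [62, 60, 64, 65, 65, 68, 73, 72]

n = len(heights_cm)
sum_x = sum(heights_cm)
sum_y = sum(weights_kg)
sum_xx = sum(x * x for x in heights_cm)
sum_xy = sum(x * y for x, y in zip(heights_cm, weights_kg))
sum_yy = sum(y * y for y in weights_kg)

# The three bracketed quantities in the computational form of r.
numerator = n * sum_xy - sum_x * sum_y
x_term = n * sum_xx - sum_x ** 2
y_term = n * sum_yy - sum_y ** 2
r = numerator / math.sqrt(x_term * y_term)

print("sums:", sum_x, sum_y, sum_xx, sum_xy, sum_yy)
print("numerator:", numerator, "x term:", x_term, "y term:", y_term)
print("r =", round(r, 3))
```

Printing the intermediate quantities makes it easy to compare each step with the hand calculation, which is where slips in copying the sums usually appear.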
[To have an idea of how the two variables here vary with each other, corresponding to this
strong positive correlation, we may view the X-Y scatter plot for this data as found below:]