Professional Documents
Culture Documents
Correlation Analysis and Its Types
Correlation Analysis and Its Types
Correlation Analysis and Its Types
Analysis
Bike sales grow with income
Correct helmet use can lead to a 42% reduction in the risk of fatal
injuries and a 69% reduction in the risk of head injuries according
to the WHO
• Income growth
• Population growth
• Literacy growth VS. Two-Wheeler Growth
• Economy growth
• Rural Urban
Migration Growth
Learning Outcomes
•Positive •Simple
•Linear
correlation correlation correlation
•Partial correlation •Non – linear
•Negative
correlation correlation
•Multiple
correlation
Correlation : On the basis of
degree
Positive Correlation
if one variable is increasing and with its
impact on average other variable is
also increasing that will be positive
correlation.
Example of Positive Correlation
• The More time you spend marketing your
business, the more new customers you will
have.
• As the temperature goes up, ice cream
sales also go up.
Correlation : On the basis of
degree
Negative correlation
if one variable is increasing and with its
impact on average other variable is
decreasing that will be negative
correlation.
Example of Negative Correlation
• The weather gets colder, air conditioning
costs decrease.
• Increase in price of goods and services
decrease in sales
Scatter Plot for Positive and
Negative correlation
Correlation : On the basis of
number of variables
Simple correlation
Correlation is said to be simple when
only two variables are analyzed.
For example :
Correlation is said to be simple when it
is done between demand and supply
or we can say income and expenditure
etc.
Correlation : On the basis of
number of variables
Partial correlation :
When three or more variables are
considered for analysis but only two
influencing variables are studied and
rest influencing variables are kept
constant.
For example :
Correlation analysis is done with demand,
supply and income. Where income is
kept constant.
Correlation : On the basis of
number of variables
Multiple correlation :
In case of multiple correlation three or
more variables are studied
simultaneously.
For example :
Rainfall, production of rice and price of
rice are studied simultaneously will be
known are multiple correlation.
Correlation : On the basis of
linearity
Linear correlation :
If the change in amount of one variable
tends to make changes in amount of
other variable bearing constant
changing ratio it is said to be linear
correlation.
For example : The ratio of change
between the variables is the same
X: 10 20 30 40 50
Y: 20 40 60 80 100
Correlation : On the basis of
linearity
Non - Linear correlation :
If the change in amount of one variable
tends to make changes in amount of
other variable but not bearing constant
changing ratio it is said to be non - linear
correlation.
For example :
If the amount of fertilizers is doubled the
yield of wheat would not be necessarily be
doubled.
Importance of correlation
analysis :
Measures the degree of relation i.e.
whether it is positive or negative.
Estimating values of variables i.e. if
variables are highly correlated then we
can find value of variable with the
help of gives value of variable.
Helps in understanding economic
behavior.
Karl Pearson Coefficient of
Correlation
A Parametric Test
COEFFICIENT OF CORRELATION
The ratio indicating the degree of relationship
between two related variables.
For a perfect
. POSITIVE CORRELATION the
coefficient of correlation is +1
For a perfect NEGATIVE CORRELATION the
coefficient of correlation is -1.
Positive coefficient of correlation varies from 0
to +1.
Negative coefficient of correlation varies from 0
to -1.
Interpretation of correlation coefficient
Five examples of correlation coefficient
KARL PEARSON METHOD
Karl Pearson method was invented by KARL
PEARSON and on his name, the method is called
as KARL Pearson method.
It is represented as ‘r’ .
The standard formula used in the computation of
Pearson’s product moment correlation coefficient is as
follows :
N ( XY ) X
r
Y
[N X ( X ) ][N Y
2 2 2
(Y ) ]
2
N (X Y ) X Y
r
[NX 2 (X)2 ][NY 2(Y)2 ]
Where,
N - the total no: of scores or cases
Ʃ -the summation of the items indicated
ƩX - the sum of all X scores
ƩX²-
each X score should be squared and then those
squares summed{the sum of the X squared scores}
(ƩX)²
- X scores should be summed and the total
squared(the squares of the sum of all the X scores)
ƩY – the sum of all Y scores
ƩY² - each Y score should be squared and then those
squares summed
(ƩY)² - Y score should be summed and the total
squared
There are five subjects and the students have to
appear in two tests of each subject. We want to find
whether there is any correlation between the marks
scored in test1 and test2 of each subject or not.
B 3 15
C 2 11
D 8 10
E 6 18
SUBJECT SCORES IN SCORES IN XY X² Y²
TEST 1 (X) TEST 2 (Y)
A 5 12
B 3 15
C 2 11
D 8 10
E 6 18
A 5 12 60 25 144
B 3 15 45 9 225
C 2 11 22 4 121
D 8 10 80 64 100
E 6 18 108 36 324
± denotes direction,
‘+’ means positive correlation
‘-’ means negative correlation.
Interpretation of ‘r’
B 25 70
C 20 40
D 30 50
E 35 30
Non-Parametric
Statistics
Spearman’s Rank Correlation
When data is of ordinal level (ranked data), Pearson correlation
coefficient r cannot be applied.