Professional Documents
Culture Documents
Correlationanalysis
Correlationanalysis
Correlationanalysis
ANALYSIS
It is a statistical measure
which shows relationship
between two or more
variable moving in the same
or in opposite direction
08/01/12 anilmishra5555@rediffmail.com 1
Types of correlation
correlation
Simple ,
positive Linear
multiple
& negative & non-
&
linear
partial
08/01/12 anilmishra5555@rediffmail.com 2
Methods of correlation
• Scatter diagram
• Karl Pearson’s
• Rank correlation
08/01/12 anilmishra5555@rediffmail.com 3
Scatter diagram
• Perfectly
+ve
08/01/12 anilmishra5555@rediffmail.com 4
Perfectly -ve
08/01/12 anilmishra5555@rediffmail.com 5
Zero degree
08/01/12 anilmishra5555@rediffmail.com 6
Karl Pearson correlation coefficient
x.y
r
x . y 2 2
08/01/12 anilmishra5555@rediffmail.com 7
Where
x X X
and
y Y Y
08/01/12 anilmishra5555@rediffmail.com 8
problem
6 9 0 1 0 1 0
2 11 -4 3 16 9 -12
10 5 4 -3 16 9 -12
8 8 2 0 4 0 0
4 7 -2 -1 4 1 -2
30 40 0 0 40 20 -26
Sol.cont.
X X
30
6
N
5
N 5
Y 40
Yr x.y 8 26 26
0.92
x . y
2 2
40.20 800
spearman’s Rank correlation
R 6 D 2
1 N (N 2
where 1)
D Rx R y
Rx
rank.of .X Ry
08/01/12 rank.of .y
anilmishra5555@rediffmail.com 12
problem
R 63 D 2
1 N N
6.30
R
1 10 3
10
2
R 111
9
R 11
0.82
Chi square test χ2
• It measure the differences between what is observed
and what is expected according to a null hypothesis.
• where
• O = observed data in each category
• E = observed data in each category based on the
experimenter’s hypothesis
Goodness of Fit
• Department store, A, has four competitors: B,C,D, and
E. Store A hires a consultant to determine if the
percentage of shoppers who prefer each of the five
stores is the same. A survey of 1100 randomly
selected shoppers is conducted, and the results about
which one of the stores shoppers prefer are below. Is
there enough evidence using a significance level α =
0.05 to conclude that the proportions are really the
same?
Store A B C D E
• Denote the total no. of signs (ignoring zeros) by ‘n’ and the no. of
less frequent signs by ‘S’.
• Obtain the critical value (k) of less frequent signs ‘S’ preferably at
5% level of significance by using the formula :
K=(n-1) /2- 0.98√n
• Compare the value of ‘S’ with the critical value (k). If the value of ‘S’
is greater than value of(k) then the null hypothesis is accepted
otherwise rejected
Run Test for randomness
• This test find out that whether the
observations in a sample occur in a certain
order or they occur in random.
• Ho- The sequence of observation is random
• H1- The sequence of observation is not
random
• In this firstly, all the observations are arranged
in the order they are collectd.
• Then Median is calculated.
• Observations greater than median are given +
sign and less than median, given – sign.
ANOVA
• Analysis of Variance (ANOVA) is a statistical method
used to test differences between two or more means.
• Key Assumptions-
1. Independence of case: There should not be any
pattern in the selection of the sample.
2. Normality: Distribution of each group should be
normal.
3. Homogeneity: Homogeneity means variance
between the groups should be the same.
• One way analysis: When we are comparing more than
three groups based on one factor variable, then it said
to be one way analysis of variance (ANOVA). For
example, if we want to compare whether or not the
mean output of three workers is the same based on the
working hours of the three workers.