Professional Documents
Culture Documents
Module III 1
Module III 1
Module III 1
Correlation:-
To examine whether the two RV’s are inter-related, we collect of values
of and corresponding to repetitions of the random variable. Let them
be . Then we plot the points with co-
ordinates on a graph paper. The simple
figure consisting of the plotted points is called a scatter diagram. From the
scatter diagram, we can form a fairly good, though vague, idea of the
relationship between and . If the points are dense or closely packed, we
may conclude that and are correlated. On the other hand if the points
are widely scattered throughout the graph paper, we may conclude that
and are either not correlated or poorly correlated.
Further if the points in the scatter diagram appear to lie near a straight line,
we assume that the RV’s have linear correlation. If they cluster round a well
defined curve other than a straight line, the RV’s are assumed to be non-
linear
Karl Pearson’s Product Moment Correlation Coefficient
(Correlation Coefficient between and )
∑ ∑ ∑
∑ ∑ ∑ ∑
( , ) ( )
( )
Note:-
1) When and are independent . Hence
and thus
X 1 3 5 7 8 10
Y 8 12 15 17 18 20
Sol:-
1 8 1 64 8
3 12 9 144 36
5 15 25 225 75
7 17 49 289 119
8 18 64 324 144
10 20 100 400 200
=248 =1446
Here
X 65 67 66 71 67 70 68 69
Y 67 68 68 70 64 67 72 70
Note:-
Note:-
For the data of problem, compute the coefficients of linear partial
correlation and multiple correlation .
Sol:-
X1(Weight) X2(Height) X3(Age) u=X1-77 v=X2-55 w=X3-10 uv vw uw u^2 v^2 w^2
71 59 10 -6 4 0 -24 0 0 36 16 0
55 51 8 -22 -4 -2 88 8 44 484 16 4
58 50 7 -19 -5 -3 95 15 57 361 25 9
77 55 10 0 0 0 0 0 0 0 0 0
56 52 10 -21 -3 0 63 0 0 441 9 0
76 61 12 -1 6 2 -6 12 -2 1 36 4
68 57 9 -9 2 -1 -18 -2 9 81 4 1
𝑛 ∑ 𝑣𝑤 − ∑ 𝑣 ∑ 𝑤
𝑟 =
(𝑛 ∑ 𝑣 − ∑ 𝑣 )(𝑛 ∑ 𝑤 − ∑ 𝑤 )
.
=0.8418
.
Regression Equations
Regression Equation on
∑ ∑ ∑
Where ∑ ∑
(OR)
(OR)
( )
is called the regression coefficient on
( )
Regression Equation on
∑ ∑ ∑
Where ∑ ∑
(OR)
(OR)
22 20 -7 -7 49 49 49
26 20 -3 -7 9 49 21
29 21 0 -6 0 36 0
30 29 1 2 1 4 2
31 27 2 0 4 0 0
31 24 2 -3 4 9 -6
34 27 5 0 25 0 0
35 31 6 4 36 16 24
Regression equation on
When we have
Regression equation on
When we have
In a partially destroyed laboratory record of an analysis of correlation
data, the following results only are legible: Variable of . The
regression equations are and What
were (a) the mean values of and (b) the standard deviation of
? And (c) the correlation coefficient between and
Sol:-
A study of prices of rice at Chennai and Mumbai gave the following
data:
Chennai Mumbai
Mean 19.5 17.75
S.D 1.75 2.5
Competitors
Judges 1 2 3 4 5 6 7 8 9 10
A 6 5 3 10 2 4 9 7 8 1
B 5 8 4 7 10 2 1 6 9 3
C 4 9 8 1 2 3 10 5 7 6
Discuss which pair of judges have the nearest approach to common taste of beauty.