
Question # 06: The following data give the study time (x, in hours) and the score (y, in percentage) for math students: [8]

x 10 15 12 20 8 16 14 22
y 92 81 84 74 85 80 84 80

a) Draw a scatter plot.


b) Use a calculator to compute the sums required by the formula, and obtain the
linear correlation coefficient.

c) Construct a regression model to estimate the score of a math student (y) for a known amount of time
spent on study (x). Plot the obtained regression equation on the same graph obtained in part a.

d) Discuss the graphical interpretation of the value of ‘r’ and verify that it is consistent with the graph you
obtained in part c.

Answer # 06:
a) Scatter Plot:
[Figure: scatter plot of y (score, in percentage) against x (study time, in hours); x-axis 0 to 25, y-axis 0 to 100]

b) The linear correlation coefficient:

x     y     xy     x²     y²
10    92    920    100    8464
15    81    1215   225    6561
12    84    1008   144    7056
20    74    1480   400    5476
8     85    680    64     7225
16    80    1280   256    6400
14    84    1176   196    7056
22    80    1760   484    6400

Σx = 117
Σy = 660
Σxy = 9519
Σx² = 1869
Σy² = 54638

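$$
\begin{aligned}
r &= \frac{n\sum xy - (\sum x)(\sum y)}{\sqrt{\{n\sum x^2 - (\sum x)^2\}\{n\sum y^2 - (\sum y)^2\}}} \\
  &= \frac{8 \times 9519 - (117)(660)}{\sqrt{\{8 \times 1869 - (117)^2\}\{8 \times 54638 - (660)^2\}}} \\
  &= \frac{-1068}{\sqrt{\{1263\}\{1504\}}} \\
  &= \frac{-1068}{\sqrt{1899552}} \\
  &= \frac{-1068}{1378.242} \\
  &= -0.77
\end{aligned}
$$
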
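
The hand calculation of r can be cross-checked with a short script. This is only a sketch in plain Python using the same raw-sums formula; the function name `pearson_r` is an illustrative choice, not something from the question.

```python
import math

# Data from the question: study time (hours) and score (percent).
x = [10, 15, 12, 20, 8, 16, 14, 22]
y = [92, 81, 84, 74, 85, 80, 84, 80]


def pearson_r(xs, ys):
    """Linear correlation coefficient from the raw sums (hypothetical helper)."""
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(a * b for a, b in zip(xs, ys))
    sum_x2 = sum(a * a for a in xs)
    sum_y2 = sum(b * b for b in ys)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = math.sqrt(
        (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
    )
    return numerator / denominator


r = pearson_r(x, y)
print(round(r, 2))  # -0.77
```
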
c) A regression model to estimate the score of a math student (y) for a known amount of time
spent on study (x):
y̅ = a + bx̅

$$
\begin{aligned}
b &= \frac{n\sum xy - (\sum x)(\sum y)}{n\sum x^2 - (\sum x)^2} \\
  &= \frac{8 \times 9519 - (117)(660)}{8 \times 1869 - (117)^2} \\
  &= \frac{-1068}{1263} \\
  &= -0.8456
\end{aligned}
$$

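$$
\begin{aligned}
\bar{y} &= a + b\bar{x} \\
a &= \bar{y} - b\bar{x}
\end{aligned}
$$

$$
\bar{x} = \frac{\sum x}{n} = \frac{117}{8} = 14.63
$$

$$
\bar{y} = \frac{\sum y}{n} = \frac{660}{8} = 82.50
$$

$$
a = 82.5 - (-0.8456)(14.63) = 94.8711
$$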

ŷ = a + bx
ŷ = 94.8711 + (-0.8456)x
ŷ = 94.8711 - 0.8456x

x ŷ = a + bx
10 86.4151
15 82.1871
12 84.7239
20 77.9591
8 88.1063
16 81.3415
14 83.0327
22 76.2679
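
The least-squares coefficients can be recomputed in code as a check. The sketch below uses plain Python, and `fit_line` is an illustrative name. Because it keeps full precision, it gives an intercept of about 94.867 rather than 94.8711, which comes from the rounded values b = -0.8456 and x̄ = 14.63; the predicted scores therefore differ from the table in the third decimal place.

```python
# Data from the question: study time (hours) and score (percent).
x = [10, 15, 12, 20, 8, 16, 14, 22]
y = [92, 81, 84, 74, 85, 80, 84, 80]


def fit_line(xs, ys):
    """Return (a, b) for the least-squares line y_hat = a + b*x (hypothetical helper)."""
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(p * q for p, q in zip(xs, ys))
    sum_x2 = sum(p * p for p in xs)
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    a = sum_y / n - b * (sum_x / n)  # a = y_bar - b * x_bar
    return a, b


a, b = fit_line(x, y)
print(f"a = {a:.4f}, b = {b:.4f}")

# Predicted score for each observed study time.
for xi in x:
    print(xi, round(a + b * xi, 4))
```
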

[Figure: plot of ŷ (predicted percentage) against x (in hours); x-axis 0 to 25, y-axis 0 to 100]

d) The graphical interpretation of the value of ‘r’. Consistency of correlation coefficient:


r = -0.77 indicates a fairly strong negative linear relationship between study time (x, in hours) and the score
of math students (y, in percentage).

Because the correlation coefficient is negative, the scatter plot shows the score tending to decrease as the study
time increases, and the regression line from part c slopes downward (b = -0.8456). The points do not all lie on
the line: since |r| = 0.77 is less than 1, they are scattered around it. For example, at x = 10 the observed score
is 92 while the predicted score is 86.4151. A downward trend with moderate scatter is what r = -0.77 describes, so
the graph is consistent with the correlation coefficient.
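
To see how closely the observed scores follow the fitted line, the residuals (observed minus predicted) can be listed. This is a sketch using the rounded coefficients from part c; the variable names are illustrative.

```python
# Data from the question and the rounded coefficients from part c.
x = [10, 15, 12, 20, 8, 16, 14, 22]
y = [92, 81, 84, 74, 85, 80, 84, 80]
a, b = 94.8711, -0.8456

# Predicted scores and residuals (observed minus predicted).
predicted = [a + b * xi for xi in x]
residuals = [yi - pi for yi, pi in zip(y, predicted)]

for xi, yi, pi, ei in zip(x, y, predicted, residuals):
    print(f"x={xi:2d}  y={yi}  y_hat={pi:.4f}  residual={ei:+.4f}")

# A negative slope matches the negative sign of r.
slope_is_negative = b < 0

# Nonzero residuals mean the points do not all sit on the line.
largest_gap = max(abs(e) for e in residuals)
print(slope_is_negative, round(largest_gap, 2))
```

The residuals take both signs and the largest is a little over 5 percentage points, which fits a correlation that is clearly negative but not perfect.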
