CSF21103 Tutorial 8

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 6

CSF21103: PROBABILITY AND STATISTICAL DATA ANALYSIS

Faculty of Informatics and Computing


Tutorial 8: Correlation and Regression

1. A __________ relationship exists when both variables increase or decrease at the same
time.

2. __________ is a statistical method used to determine whether a relationship between


variables exists.

3. In a __________ relationship, as one variable increases, the other variable decreases,


and vice versa.

4. The two variables in a scatter plot are called the __________ and __________ variables.

5. Determine the type of relationship shown in the figure below.

6. The range of the correlation coefficient is from __________ to __________.

7. A correlation coefficient of 0.961 would mean that the values of x __________ as the
values of y __________.

1
Chapter 8 - Correlation and Regression

8. A study was conducted to determine if there was a linear relationship between a person's
age and his/her peak heart rate.
a. Draw the scatter plot for the variables.
b. Give a brief explanation of the type of relationship.

Age Peak Heart Rate


16 220
26 194
32 193
37 178
42 172
53 160
48 174
21 214

9. Compute the value of the correlation coefficient.

x 60 43 56 46 57
y 138 119 134 128 132

2
Chapter 8 - Correlation and Regression

10. A study was conducted to determine if there was a relationship between the prices a
non-member of a book club paid for various publications and the prices that a member
paid for the same publications. The data gathered is shown below. Compute the value of
the correlation coefficient.
Non-member Member
Price Price
58 32
42 22
46 20
32 16
25 19
75 58
35 34
63 48

11. When r is not significantly different from 0, the best predictor of y is the ____________
of the data values of y.

12. A regression line was calculated as y  9.7  3.2 x . The slope of this line is
____________.

13. The equation of a regression line is y  4.6  3.2 x . What is the intercept of this line?

14. If the equation for the regression line is y' = 7x – 9, then a value of x = 2 will result in a
predicted value for y of ____________.

3
Chapter 8 - Correlation and Regression

15. A psychologist wants to determine if there is a linear relationship between the number
of hours a person goes without sleep and the number of mistakes he/she makes on a
simple test. The following data is recorded.
a. Draw a scatter plot.
b. Determine the regression line equation and plot the regression line
on the scatter plot.

Hours without Number of


Sleep, x Mistakes, y
32 6
38 8
48 13
24 5
46 7
35 6
30 5
34 8
42 12

4
Chapter 8 - Correlation and Regression

16. A random sample of eight drivers insured with a company and having similar auto
insurance policies were selected. The following table lists their driving experiences (in
years) and monthly auto insurance premiums.

Driving Experience (years) Monthly Auto Insurance


Premium ($)
5 64
2 87
12 50
9 71
15 44
6 56
25 42
16 60

a. Does the insurance premium depend on the driving experience or does the
driving experience depend on the insurance premium? Do you expect a
positive or a negative relationship between these two variables?

b. Compute SSxx, SSyy, and SSxy.

c. Find the least squares regression line by choosing appropriate dependent and
independent variables based on your answer in part a.

5
Chapter 8 - Correlation and Regression

d. Interpret the meaning of the values of a and b calculated in part c.

e. Plot the scatter diagram and the regression line.

f. Calculate r and explain what its mean.

g. Predict the monthly auto insurance premium for a driver with 10 years of
driving experience.
.

You might also like