Professional Documents
Culture Documents
mcd1110 Sample Test 2b 2012 02
mcd1110 Sample Test 2b 2012 02
mcd1110 Sample Test 2b 2012 02
Practice Test 2 B
Question & Answer Booklet
READING TIME: 10 minutes WRITING TIME: 2 hours
Instructions to Candidates
Please enter your name and student ID number below.
Name: ___________________________________________________
Student ID Number: ________________________________________
Percentage of Final Assessment: 20%
C: Time series 21
Total Marks 75
Candidates are reminded that they should have no materials on their desks unless their use has been
specifically permitted by the following instructions.
Approved scientific and/or graphic calculators are permitted.
Mathematical instruments and templates are permitted.
A Formula Sheet is provided in the test booklet.
Circle your responses to Multiple-Choice questions in the test booklet.
Show all working/calculations.
DO NOT USE PENCIL.
You must hand in this entire question paper at the end of the test.
Do not open this booklet until instructed to do so.
COPYRIGHT WARNING:
All materials produced for teaching this course of study, including all lectures delivered all audio and visual aids to presentation of lectures
(including overheads, PowerPoint slides and any on-line materials) and any supplementary materials, are protected by copyright.
You are permitted to use these materials only for your personal study and research. Use of the materials for any other purposes, including sale of
your personal lecture notes, without express permission of the copyright owner, may infringe copyright. The copyright owner may take action against
you for infringement.
MCD1110 - DATA ANALYSIS FORMULA SHEET
count
=
percent ×100% =R largest value − smallest value
total count
n +1
IQR = Q3 − Q1
2
Upper fence = Q3 + 1.5 IQR Lower fence = Q1 − 1.5 IQR
Sample Mean
s=
∑ (x − x) 2
or s=
∑ (x − x ) 2
f
or s=
∑ (m − x ) 2
f
n −1 n −1 n −1
where n = ∑ f
R
or s≈
4
Probability
Addition rule P ( A ∪ B ) = P ( A) + P ( B ) − P ( A ∩ B )
P ( A ∪ B ) = P ( A) + P ( B )
P( A ∩ B )
Conditional probability P(A B ) =
P (B )
r=
∑ (x − =
x )( y − y )
, where s
∑ ( x − x ) and s
=
2
∑ ( y − y) 2
( n − 1) sx s y x
n −1
y
n −1
Coefficient of Determination r2
y= a + bx , where b=
rs y
and =
a = y − bx , where x
∑
=
x
,y
∑y
sx n n
Time Series
y1 + y2 + y3
3 − moving mean ( smoothed y2 ) =
3
median ( y1 , y2 , y3 )
3 − moving median ( smoothed y2 ) =
actual figure
deseasonalised figure =
seasonal index
To explore the relationship between MP3/iPOD user (yes or no) and gender (male or female), it
would be best to display the data collected in:
A. a scatter plot
B. an appropriately percentaged table
C. back to back stems plots
D. parallel box plots
Question 2
For a large sample of students at a local Primary school it was found that the correlation between
score on a test on current global affairs and height showed a coefficient of correlation, r = 0.76.
From this information it is reasonable to conclude that
Question 3
A. −0.48
B. 0.51
C. 0.45
D. −0.24
Question 4
The relationship between average monthly temperature and the number of air conditioners sold is
found to have a correlation coefficient of r = 0.72. We can therefore conclude that:
The information in the following parallel box plots relates to questions 5 and 6.
Brand X
Brand Y
The parallel box plots show the variation in salt content (mg/100g) in two brands (Brand X and
Brand Y) of wheat crackers.
Question 5
The variables Salt Content and Brand are :
Question 6
The presence of a relationship between Salt Content and Brand is best shown by considering the:
A. median
B. IQR and range
C. shape
D. all of the above
Answer ALL questions in the space provided. Show ALL working and calculations.
Question 1
Scientists have investigated the frequency of the chirps of ground crickets with the current ground
temperature. The data collected is shown in the table below.
T (0F) 89 72 93 84 81 75 70 82 69 83 69 83 81 84 76
C (rate/sec) 20 16 20 18 17 16 15 17 15 16 15 17 16 17 14
a After considering which are the independent and dependent variables, draw a fully labelled
scatter plot for this data on the axes below.
3 marks
∑ (t −=
t )(c − c ) 151.4 ∑=
(t − t )
2
748.2 ∑=
(c − c ) 2
41.6
Respondents to an on-line survey were asked ‘Will you consider converting your car to LPG (an
alternative fuel)?’ The response of 405 people, and the age group they are in, is given in the table
below. It is expected that consideration of an LPG conversion for a car and age group are related.
Age group
Response
≤ 40 year old >40 years old
Yes 72 125
No 108 100
Total 180 225
a Explain why the row headings in the table are response rather than age group.
1 mark
______________________________________________________________________________
______________________________________________________________________________
d Does the data support the contention that there is a relationship between age group and
attitude to an LPG conversion.
2 marks
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
Question 1
Question 2
Question 3
A. 0.18
B. 0.65
C. -0.18
D. -0.65
The scatter plot for hearing test result versus loud music exposure showed a point at (75, 56). The
residual value for this point is
A. −1.5
B. 19
C. −19
D. 1.5
Question 5
After analysing a scatter plot and applying a number of transformations the following analysis was
obtained.
Transformation Residuals r2
log y Vs x curved 58%
1/y Vs x curved 61%
y Vs log x random 83%
y Vs 1/x random 86%
A. log y Vs x
B. 1/y Vs x
C. y Vs log x
D. y Vs 1/x
Answer ALL questions in the space provided. Show ALL working and calculations.
Question 1
A class of secondary students was introduced to their study of bivariate data through an exercise
that required the students to measure, record and analyse their height and arm span. The data they
collected are shown below.
Arm Arm
Span(cm) Height(cm) Span(cm) Height(cm)
156 162 177 173
157 160 177 176
159 162 178 178
160 155 184 180
161 160 188 188
161 162 188 187
162 170 188 182
165 166 188 181
170 170 188 192
170 167 194 193
173 185 196 184
173 176 200 186
= =
r 0.91 =
sx 13.6 s y 11.2
= =
x 175.5 y 174.8
195
Height(cm)
190
185
180
175
170
165
160
155
150
150 160 170 180 190 200 210
Arm Span(cm)
d Give the equation to the least squares regression line and draw it on the scatter plot given.
Show the calculations that you used to mark this line in the correct position.
4 marks
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
Question 2
The atmospheric concentration of carbon dioxide, CO 2 , (in parts per million) has been measured
about every 20 years over the last two centuries. Year 1 is about 1760 and year 12 is about 1990.
Year 1 2 3 4 5 6 7 8 9 10 11 12
CO 2 (ppm) 277 280 284 283 288 290 297 302 308 317 339 356
400
350
300
CO2 ppm
250
200
150
100
50
0
0 1 2 3 4 5 6 7 8 9 10 11 12 13
Year
a A linear regression analysis of the data found the least squares equation to be:
concentration of CO 2 = 261 + 6.3 × year, where 1760 was year 1. To check the assumption
of linearity a residual analysis is performed. Use the regression equation to help complete the
last 4 entries (to the nearest whole number) in the table of residuals.
2 marks
Year 1 2 3 4 5 6 7 8 9 10 11 12
CO 2 (ppm) 277 280 284 283 288 290 297 302 308 317 339 356
Predicted
267 274 280 286 293 299 305 311 318 324
CO 2 (ppm)
residual 10 6 4 −3 −5 −9 −8 −9 −10 −7
b Interpret the following residual plot made from the table in part a.
Residual plot
15
10
5
Residual
Year
0
0 1 2 3 4 5 6 7 8 9 10 11 12
-5
-10
-15
2 marks
360
340
320
300
280
260
0 20 40 60 80 100 120 140 160
e Explain what steps you would take to decide if the transformation gave a better model of the
data than the original scatter plot.
2 marks
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
A. A time series plot will always show either an increasing or decreasing trend
B. A time series plot must have time as the dependent variable
C. A time series plot will be based on bivariate data
D. The tendency for regular fluctuations of time series values on a quarterly basis is
called cyclic variation
Question 2
10
B. cyclical only
8
C. random with trend
6
D. cyclical with trend
4
0
1 2 3 4 5 6 7 8 9 10 11 12
Quarter
Question 3
Lift ticket sales at a ski resort are recorded at the end of each month over the 4 month winter period.
A seasonal index of 0.65 for the first month tells us that
A. ticket sales in the first month are 65% below monthly average
B. ticket sales in the other 3 months must all be above 1
C. snow fell for 0.65 of the month
D. ticket sales in the first month are 35% below monthly average
Month (x) 1 2 3 4 5 6 7 8 9 10
Sales (y) 5 4 6 8 5 7 13 10 12 13
For the time series given in the table, the 2 median smoothed y value centered at x = 6 is
A. 5
B. 6
C. 7
D. 8
Quarter 1 2 3 4
Sales 240 160 190 410
Seasonal Index 0.96 0.76 1.64
Question 5
A. 0.16
B. 0.64
C. 0.32
D. 0.336
Question 6
A. 230
B. 250
C. 220
D. 260
Answer ALL questions in the space provided. Show ALL working and calculations.
Question 1
The annual flows for the Mitta Mitta River for a certain 12 year period are shown in the following
table and are graphed on the time series chart.
Year 1 2 3 4 5 6 7 8 9 10 11 12
Flow 509 710 1634 1107 401 685 1548 1578 1012 1151 1190 1690
1800
1600
1400
1200
1000
Flow
800
600
400
200
0
1 2 3 4 5 6 7 8 9 10 11 12
Year
a Use the graphical approach to smooth the time series on the above plot using 3-median
smoothing.
4 marks
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
b Comment on the effect of the smoothing and the apparent trend of the time series.
2 marks
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
c (i) Use the 3-mean smoothing method to complete (to the nearest whole number) the
empty row in the table above.
4 marks
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
2000
1800
1600
1400
1200
Flow
1000
800
600
400
200
0
1 2 3 4 5 6 7 8 9 10 11 12
Year
d Comment on the effect of the smoothing and the apparent trend of the smoothed data on the
chart in part c.
2 marks
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
End of Test