Professional Documents
Culture Documents
Further Statistical Analysis
Further Statistical Analysis
Further Statistical Analysis
90
IMPORTANT FEATURES AND TIPS FROM 2UG EXAM HISTORY
(years)
It includes content found in Standard 2's MS-S4 Bivariate Data Analysis, but notably
removes any analysis of Pearson's Correlation Co-Efficient.
60
This content area has contributed an average of 3.2% per paper over the past decade of
Gen2 exams (note past allocations are no guarantee of future contributions but can 50
nonetheless cast light on a topic's likely importance).
A. B. (C) The hours training for a marathon and the time taken to complete the marathon
(D) The number of dogs per household and the number of televisions per household
Which of the following best describes the correlation between and ? Which statement about the change in heights when comparing Y9 to Y12 is correct?
(A) Positive A. The mean increased and the standard deviation decreased.
(B) Negative B. The mean decreased and the standard deviation decreased.
(C) Positively skewed C. The mean increased and the standard deviation increased.
(D) Negatively skewed D. The mean decreased and the standard deviation increased.
7. FS Health, 2UG 2015 HSC 19 MC 9. FS Health, 2UG 2016 HSC 29e
The table shows the life expectancy (expected remaining years of life) for females at Part i: RAP Data - Bottom 17%: School result (87%) was 4% above state average (83%)
selected ages in the given periods of time.
The graph shows the life expectancy of people born between 1900 and 2000.
In 1975, a 45‑year‑old female used the information in the table to calculate the age to which
she was expected to live. Twenty years later she recalculated the age to which she was
expected to live.
What is the difference between the two ages she calculated?
(A) years
(B) years
(C) years
(D) years
(i) According to the graph, what is the life expectancy of a person born in 1932? (1 mark)
8. Data, STD2 S4 SM-Bank 2 (ii) With reference to the value of the gradient, explain the meaning of the gradient in this
context. (2 marks)
Pedro is planning a statistical investigation.
List the steps that Pedro must follow to execute the statistical investigation correctly. (2 marks)
10. Data, 2UG 2012 HSC 29a 11. FS Health, 2UG 2013 HSC 28b
Part i: RAP Data - Bottom 23%: School result (68%) was 7% above state average (61%) Part v: RAP Data - Bottom 19%: School result (67%) was 5% above state average (62%)
Part ii: RAP Data - Bottom 8%: School result (47%) was equal to state average (47%)
Ahmed collected data on the age ( ) and height ( ) of males aged 11 to 16 years.
Part iii: RAP Data - Bottom 9%: School result (70%) was 1% above state average (69%)
He created a scatterplot of the data and constructed a line of best fit to model the relationship
Tourists visit a park where steam erupts from a particular geyser. between the age and height of males.
The brochure for the park has a graph of the data collected for this geyser over a period of
time.
The graph shows the duration of an eruption and the time until the next eruption, timed from
the end of one eruption to the beginning of the next.
(i) Tony sees an eruption that lasts 4 minutes. Based on the data in the graph, what is the (i) Determine the gradient of the line of best fit shown on the graph. (1 mark)
minimum time that he can expect to wait for the next eruption? (1 mark) (ii) Explain the meaning of the gradient in the context of the data. (1 mark)
(ii) Julia saw two consecutive eruptions, one hour apart. Based on the data in the graph, (iii) Determine the equation of the line of best fit shown on the graph. (2 marks)
what was the longest possible duration of the first eruption that she saw? (1 mark)
(iv) Use the line of best fit to predict the height of a typical 17-year-old male. (1 mark)
(iii) What does the graph suggest about the association between the duration of an (v) Why would this model not be useful for predicting the height of a typical 45-year-old
eruption and the time to the next eruption? (1 mark) male? (1 mark)
12. FS Health, 2UG 2015 HSC 28e 13. Data, STD1 2017 HSC 29d
The shoe size and height of ten students were recorded. All the students in a class of 30 did a test.
The marks, out of 10, are shown in the dot plot.
8
7
Number of students
6
i. Complete the scatter plot AND draw a line of fit by eye. (2 marks)
5
4
3
2
1
0 1 2 3 4 5 6 7 8 9 10
Mark
(ii) The mean test mark is 5.4. The standard deviation of the test marks is 4.22.
Using the dot plot, calculate the percentage of the marks which lie within one standard
deviation of the mean. (2 marks)
ii. Use the line of fit to estimate the height difference between a student who wears a
size 7.5 shoe and one who wears a size 9 shoe. (1 mark)
iii. A student calculated the correlation coefficient to be 1 for this set of data. Explain why this
cannot be correct. (1 mark)
14. Data, 2UG 2006 HSC 27b 15. FS Health, 2UG 2009 HSC 28b
Each member of a group of males had his height and foot length measured and recorded. Part i: RAP Data - Bottom 13%: School result (51%) was 3% above state average (48%)
The results were graphed and a line of fit drawn.
The height and mass of a child are measured and recorded over its first two years.
(i) Why does the value of the -intercept have no meaning in this situation? (1 mark)
(ii) George is 10 cm taller than his brother Harry. Use the line of fit to estimate the
difference in their foot lengths. (1 mark)
(iii) Sam calculated a correlation coefficient of −1.2 for the data. Give TWO reasons why
Sam must be incorrect. (2 marks)
(i) Describe the correlation between the height and mass of this child, as shown in the
graph. (1 mark)
(ii) A line of best fit has been drawn on the graph.
Find the equation of this line. (2 marks)
Copyright © 2004-18 The State of New South Wales (Board of Studies, Teaching and Educational Standards NSW)
Worked Solutions 5. FS Health, 2UG 2012 HSC 11 MC
80
70
(years)
60
50
(ii)
(ii)
(iv)
(iii)
(v)
12. FS Health, 2UG 2015 HSC 28e 13. Data, STD1 2017 HSC 29d
(ii)
(ii)
ii.
(iii)
iii.
(ii)