Professional Documents
Culture Documents
Revision Sheet 5 - Stats Exemplar
Revision Sheet 5 - Stats Exemplar
Revision Sheet 5 - Stats Exemplar
1a. [1 mark]
The Malthouse Charity Run is a 5 kilometre race. The time taken for each runner to
complete the race was recorded. The data was found to be normally distributed with a
mean time of 28 minutes and a standard deviation of 5 minutes.
A runner who completed the race is chosen at random.
Write down the probability that the runner completed the race in more than 28 minutes.
Markscheme
0.5 ( 12 ,50 %) (A1) (C1)
[1 mark]
1b. [2 marks]
Calculate the probability that the runner completed the race in less than 26 minutes.
Markscheme
P ( X ≤26 ) (M1)
Note: Award (M1) for a correct mathematical statement.
OR
Award (M1) for a diagram that shows the value 26 labelled to the left of the mean and the
correct shaded region.
3.45 ( 0.344578 … ,34.5 % ) (A1) (C2)
[2 marks]
1c. [3 marks]
It is known that 20 % of the runners took more than 28 minutes and less than k minutes
to complete the race.
Find the value of k .
Markscheme
0.7 OR 0.3 (seen) (A1)
Note: Award (A1) for 0.7 or 0.3 seen.
P ( time< 7 )=0.7 OR P ( time> k )=0.3 (M1)
The Pearson’s product–moment correlation coefficient for this data is 0.948 , correct to
three significant figures.
Use your graphic display calculator to find the equation of the regression line y on x .
Markscheme
y=0.384 x+ 629
y= ( 0.384221… ) x + ( 629.421 … ) (A1)(A1) (C2)
Note: Award (A1) for 0.384 x , (A1) for 629 . If the answer is not given as an equation, award
a maximum of (A1)(A0).
[2 marks]
2b. [2 marks]
The distance from Hong Kong to Tokyo is 2900 km.
Use your regression equation to estimate the cost of a flight from Hong Kong to Tokyo with
Galois Airways.
Markscheme
y=0.384221… ×2900+ 629.421… (M1)
Note: Award (M1) for substitution into their regression equation.
1740 (1744 , 1743.66 …) (HKD) (A1)(ft) (C2)
Note: Follow through from part (a).
[2 marks]
2c. [2 marks]
Explain why it is valid to use the regression equation to estimate the airfare between Hong
Kong and Tokyo.
Markscheme
the correlation is (very) strong (R1)
2900 (km) is within the given data range (interpolation) (R1) (C2)
Note: Two correct reasons are required for the awarding of (C2).
[2 marks]
3a. [2 marks]
Sungwon plays a game where she rolls a fair 6 -sided die and spins a fair spinner with 4
equal sectors. During each turn in the game, the die is rolled once and the spinner is
spun once. The score for each turn is the sum of the two results. For example, 1 on the die
and 2 on the spinner would receive a score of 3.
The following diagram represents the sample space.
Find the probability that Sungwon’s score on her first turn is greater than 4 .
Markscheme
(
18 3
24 4 )
, 0.75 ,75 % (A1)(A1) (C2)
Note: Award (A1) for correct numerator, (A1) for correct denominator.
[2 marks]
3b. [2 marks]
Sungwon takes a second turn.
Find the probability that Sungwon scores greater than 4 on both of her first two turns.
Markscheme
18 18
× (M1)
24 24
Note: Award (M1) for the square of their probability in part (a).
Note: Follow through from part (a), provided their answer is less than or equal to 1.
[2 marks]
3c. [2 marks]
Sungwon will play the game for 11 turns.
Find the expected number of times the score on a turn is greater than 4 .
Markscheme
18
11 × (M1)
24
Markscheme
$\left( {\text{H}_{0}\text{\,:}} \right)$ choice of meal is independent of age (or
equivalent) (A1)
Note: Accept "not associated" or "not dependent" instead of independent. In lieu of "age",
accept an equivalent alternative such as "being a child or adult".
[1 mark]
4b. [1 mark]
Write down the number of degrees of freedom.
Markscheme
2 (A1)
[1 mark]
4c. [2 marks]
Show that the expected number of children who chose shrimp is 31, correct to
two significant figures.
Markscheme
69 67 69× 67
× × 150 OR (M1)
150 150 150
Note: Award (M1) for correct substitution into expected frequency formula.
30.82 ( 30.8 ) (A1)
31 (AG)
Note: Both an unrounded answer that rounds to the given answer and rounded answer
must be seen for the (A1) to be awarded.
[2 marks]
4d. [2 marks]
Write down
the χ 2 statistic.
Markscheme
¿ (G2)
[2 marks]
4e. [1 mark]
the p-value.
Markscheme
( p-value ¿) 0.265 ( 0.264803 … ) (G1)
Note: Award (G0)(G2) if the χ 2 statistic is missing or incorrect and the p-value is correct.
[1 mark]
4f. [2 marks]
State the conclusion for this test. Give a reason for your answer.
Markscheme
0.265>0.10 OR 2.66< 4.605 (R1)(ft)
the null hypothesis is not rejected (A1)(ft)
OR
the choice of meal is independent of age (or equivalent) (A1)(ft)
Note: Award (R1)(ft)) for a correct comparison of either their χ 2 statistic to the χ 2 critical
value or their p-value to the significance level.
Condone “accept” in place of “not reject”.
Follow through from parts (a) and (d).
Do not award (A1)(ft)(R0).
[2 marks]
4g. [2 marks]
A customer is selected at random.
Calculate the probability that the customer is an adult.
Markscheme
(
81 27
150 50 )
, 0.54 , 54 % (A1)(A1)(G2)
Markscheme
(
116 58
150 75 )
, 0.773 ,0.773333 … , 77.3 % (A1)(A1)(G2)
Markscheme
34
( 0.493, 0.492753 … , 49.3 % ) (A1)(A1)(G2)
69
Note: Award (A1) for numerator, (A1) for denominator.
[2 marks]
5a. [2 marks]
Consider the following graphs of normal distributions.
In the following table, write down the letter of the corresponding graph next to the given
mean and standard deviation.
Markscheme
(A1)
(A1) (C2)
Note: Award (A1) for each correct entry.
[2 marks]
5b. [2 marks]
At an airport, the weights of suitcases (in kg) were measured. The weights are normally
distributed with a mean of 20 kg and standard deviation of 3.5 kg.
Find the probability that a suitcase weighs less than 15 kg.
Markscheme
(M1)
Note: Award (M1) for sketch with 15 labelled and left tail shaded OR for a correct
probability statement, P(X < 15).
0.0766 (0.0765637…, 7.66%) (A1) (C2)
[2 marks]
5c. [2 marks]
Any suitcase that weighs more than k kg is identified as excess baggage.
19.6 % of the suitcases at this airport are identified as excess baggage.
Find the value of k .
Markscheme
(M1)
Note: Award (M1) for a sketch showing correctly shaded region to the right of the mean
with 19.6% labelled (accept shading of the complement with 80.4% labelled) OR for a
correct probability statement, P(X > k ) = 0.196 or P(X ≤ k ) = 0.804.
23.0 (kg) (22.9959… (kg)) (A1) (C2)
[2 marks]
6a. [2 marks]
The diagram shows a circular horizontal board divided into six equal sectors. The sectors
are labelled white (W), yellow (Y) and blue (B).
A pointer is pinned to the centre of the board. The pointer is to be spun and when it stops
the colour of the sector on which the pointer stops is recorded. The pointer is equally likely
to stop on any of the six sectors.
Eva will spin the pointer twice. The following tree diagram shows all the possible
outcomes.
Markscheme
1 1
× OR
3 3 3()
1 2
(M1)
Note: Award (M1) for multiplying correct probabilities.
1
(0.111, 0.111111…, 11.1%) (A1) (C2)
9
[2 marks]
6b. [3 marks]
Find the probability that at least one of the spins is yellow.
Markscheme
( 12 × 13 )+( 16 × 13 )+ 13 (M1)(M1)
Note: Award (M1) for ( × ) and ( × ) or equivalent, and (M1) for and adding only
1 1 1 1 1
2 3 6 3 3
the three correct probabilities.
OR
1−
3()
2 2
(M1)(M1)
2
Note: Award (M1) for seen and (M1) for subtracting
3
2 2
3 ()
from 1. This may be shown in a
tree diagram with “yellow” and “not yellow” branches.
5
(0.556, 0.555555…, 55.6%) (A1)(ft) (C3)
9
Note: Follow through marks may be awarded if their answer to part (a) is used in a correct
calculation.
[3 marks]
6c. [1 mark]
Write down the probability that the second spin is yellow, given that the first spin is blue.
Markscheme
1
(0.333, 0.333333…, 33.3%) (A1) (C1)
3
[1 mark]
7a. [2 marks]
A florist sells bouquets of roses. The florist recorded, in Table 1, the number of roses in
each bouquet sold to customers.
Table 1
The roses can be arranged into bouquets of size small, medium or large. The data from
Table 1 has been organized into a cumulative frequency table, Table 2.
Table 2
Markscheme
(A1)
(A1)(ft) (C2)
Note: Award (A1) for 10; (A1)(ft) for the last column all correct. Follow through from their
10 for their 50 in the last column.
[2 marks]
7b. [2 marks]
Write down the probability that a bouquet of roses sold is not small.
Markscheme
35
50 ( 7
)
0.7 , ,70 % (A1)(ft)(A1)(ft) (C2)
10
Note: Award (A1)(ft) for their numerator being 25 + their 10, and (A1)(ft) for their
denominator being their 50. Follow through from part (a).
[2 marks]
7c. [2 marks]
A customer buys a large bouquet.
Find the probability that there are 12 roses in this bouquet.
Markscheme
4
10 ( 2
)
0.4 , , 40 % (A1)(A1)(ft) (C2)
5
Note: Award (A1) for a numerator of 4 and (A1)(ft) for their 10 as denominator. Follow
through from part (a).
[2 marks]
8a. The random variable X is the number of emails Katherine receives during a day. X
satisfies a Poisson distribution with a mean of 30. Find the probability that on a particular
day she will receive less than 20 emails.
0.0219
8b. Find the probability that on a particular day she will receive more than 55 emails.
−5
1.43 ×10
8c. Find the probability that she will receive 200 emails or less in a 7 day week.
Y Po ( 210 )
P ( Y ≤ 200 )=0.258
The number of emails, R, that Jane receives during the day satisfies Poisson distribution
with mean 20.
8d. Find the probability that on a particular day the total number of emails received by
both Jane and Katherine together is 50 or less.
W =X+R
E(W)= E(X)+E(R) = 50
W Po (50 )
P(W<=50) = 0.538
9. It is known that the IQs of “Grandmasters” chess players are normally distributed with a
population mean of μ.
A sample of the IQs of ten grandmasters were tested at the tournament. The sample mean x
was 115 and the sample standard deviation sn was 4.
a. Find the 90% confidence interval for μ, giving your answer to one decimal place.
[112.68,117.32]
112.68 ≤ μ ≤ 117.32
b. State with a reason which distribution should have been used.
T – distribution because the population standard deviation is not given.
10. The time t, that competitors take to complete a 10000 m race is normally distributed
with mean μ=50minutes and standard deviation σ =5 minutes.
a. Sketch a diagram to represent this information with the numbers 50 and 5 indicated on
it.
b. Find the probability that a random competitor takes between 45 and 55 minutes.
0.683
c. Find the probability that a random competitor takes less than 40 minutes to complete the
race.
0.0228
d. The fastest 75% competitors receive a medal. Find the time that the race has to be
completed under in order for a competitor to receive a medal.
53.4 minutes
e. Given a competitor received a medal, find the probability that they completed the race in
less than 40 minutes.
P ( X< 40 ) ∩ P ( X <53.4 ) P ( X < 40 ) 0.0228
= = =0.0304
P ( X <53.4 ) P ( X< 53.4 ) 0.75
f. If 10000 competitors ran the race, estimate(to the nearest minute) how many
competitors completed the race in less than 33 minutes.
3 competitors
10000 x P(X<33)
11. Eleven students took a Math exam. Let h represent the number of hours that each
student spent revising, and let s represent the score(out of 100) they obtained.
The data for each student is shown in the following table:
a. Identify any outliers in the value of h . Justify your answer.
K
b. Calculate the Pearsons product moment correlation coefficient r , for this data.
r =0.816
c. Write down the equation of the line of best fit of s on h.
y=3.79 x +53.6
d. Hence estimate the score, to the nearest integer, of a twelfth student who spent 5.5
hours studying.
74
e. Rank the students from 1-11 according to the number of hours they spent revising.
Rank 1 should represent the most hours spent revising. In a similar way rank the
students according to the test score they obtained. Rank 1 should represent the
highest test score. Complete the table given below for each student’s ranking
13. Anna cycles to her new school. She records the times taken for the first ten days with
the following results (in minutes).
12.4 13.7 12.5 13.4 13.8 12.3 14.0 12.8 12.6 13.5
Assume that these times are a random sample from the N ( μ , σ 2 ) distribution.
σ 2=0.416
(b) Calculate a 95 % confidence interval for μ.
[12.7,13.5]
(c) Before Anna calculated the confidence interval she thought that the value of μ would
be 12.5. In order to check this, she sets up the null hypothesis H 0 : μ=12.5 .
(i) Use the above data to calculate the value of an appropriate test statistic. Find the
corresponding p-value using a two-tailed test.
(ii) Interpret your p-value at the 1 % level of significance, justifying your conclusion.
14. [13 marks]
The mean weight of a certain breed of bird is believed to be 2.5 kg. In order to test this
belief, it is planned to determine the weights x 1 , x 2 , x 3 ,… , x 16 (in kg) of sixteen of these
birds and then to calculate the sample mean x . You may assume that these weights are a
random sample from a normal distribution with standard deviation 0.1 kg.
(a) State suitable hypotheses for a two-tailed test.
(b) Find the critical region for x having a significance level of 5 %.
(c) Given that the mean weight of birds of this breed is actually 2.6 kg, find the
probability of making a Type II error.