Practice Problem 3 (Data Description) Sol
1. The following data give the number of text messages sent on 10 randomly selected days in
2013 by a USP student:
32.37 41
42 44 44 47 47 47 48
‘a. Compute the mean, median, mode and the standard deviation
Solution:
Mean=42.9, median=44, mode=47, SD=!
1
b. Calculate the values of three quartiles, 40th percentile and the percentile rank of 44.
Solution:
‘tet quartile
median
3rd quartile
4
44,00
47.00
40" percentile: =(40x10)/100=4, this implies the average of the 4" and 5"* terms,
(a2+44)/2=43
Percentile rank of 44=(4+0.5)/10*100=45
2. The following frequency distribution table gives the number of hours spent last week on cell
phones (making phone calls and texting) by all 100 paticnts in Suva Private Hospital.
a, Calculate the mean, modal group and variance
Hours per week | Frequency
OF 4
48 Is
8-12 2s
12-16 18
16-20 16
20-24 9
of the number of hours spent last week on
cell phone by the patients.
fifours per
week ffPemerey Tin [em [mee fie
04 7 z 8 4 56
ey 18 6 108 | 36 | oes
ez By 70 [250 | 100 [2500
12-16 is 74252 [196 [3528
16-20 16 18__[28s__—[324 [Sasa
20-24 9 2 [iss [see [4386
1124 16272ua Dati _ 124
1.24, modal group is 8-12 group.
N 100
. mip Sat) 6272-1124
N 100
b. Calculate the 70™ percentile of the distribution above.
3. An analysis of monthly wages paid to the workers of firm A and B belonging to the same
industry gives the following results:
‘Number of Workers Average monthly wage Variance of distribution of wages
Firm A 100 S196 S81
Firm B 200 $185 3144.
2. Which firm, A or B has a larger wage bill?
b, In which firm, A or B is there greater variability among individual wages?
Solution:
a) Firm B has a larger wage bill since it has to bill 200xS185-$37000, while Firm B has to
bill 100x$196=$19600 B.
b) In which firm, A or B is there greater variability among individual wages? Firm B has
greater variability.4. The following stem and leaf plot give the waiting times (in minutes) for 29 patients to see the
doctor in the CWM Hospital.
Stem | Leaf
o| 2355
1| 345588999
2| 0015789
3| 012259
4| 126
a. Find IQR and the percentile rank for the score 29.
29x25
29x75
):0= =21.8= 22nd term=31, Q,:0= =
2, Too fem: eee Tg
1-15=16
3 8thterm>15
number ofscores less than 29-+0.5
2
Percentile rank for 20 = 1940.5
100
%100=67%
b. Are there any outliers in the data? Use an appropriate numerical test involving the IQR to
identify outliers, if any, and clearly state your conclusion.
from above, we have
IQR = Q,—0, =31-15= 16then IORx1.5=1.5(19= 24
= the interval in whieh the outlicr lies outside is[15~ 2432+ 24]
= [9,56]
= hence no outlier since all the data lies within the interval.
c. Ifa data value is identified as an outlier, what should be done about it?
Ans:
‘The outlier should be investigated to see if there is an error or some other problem in the
data; then a decision whether to include or exclude it should be made based on the
particular situation. If it was a correct value then the data value should remain in the data
set. If there is a problem with this data value, then it should be corrected or removed from
the data. For example: If the data was recorded incorrectly (perhaps a 9 was miscoded and
the correct value was 6) then the data should be corrected. If it was an error but the correct
value is not known it should be removed from the data set.5. The number of teacher strikes over a 13-year period in Fiji is shown below.
20 18 7 13 7 14 5 9 9 9 10017) 15
a. Find the Five-Number summary
b. Construct a box-plot for the above data.
. Using the box-plot, what can you say about the distribution of the data?
Solution:
a. LV-5, QI-9, Q2=10, Q3-15, HV-20
b.
Teacher Strike
0 5 10 15 20 25
number of teacher
¢. The distribution is positively skewed as the median falls to the left of the box.