Professional Documents
Culture Documents
PDF Solutions Manual To Accompany Introductory Statistics 9Th Edition 0321691229 Online Ebook Full Chapter
PDF Solutions Manual To Accompany Introductory Statistics 9Th Edition 0321691229 Online Ebook Full Chapter
http://testbankbell.com/product/solutions-manual-to-accompany-
introductory-quantum-optics-1st-edition-9780521527354/
http://testbankbell.com/product/solutions-manual-to-accompany-
introductory-chemistry-essentials-4th-edition-9780321765802/
http://testbankbell.com/product/solutions-manual-to-accompany-
statistics-and-data-analysis-9780137444267/
http://testbankbell.com/product/solutions-manual-to-accompany-
introductory-mathematical-analysis-for-business-13rd-
edition-0321643720/
Solutions Manual to accompany Fundamentals of
Statistics 3rd edition 0321641876
http://testbankbell.com/product/solutions-manual-to-accompany-
fundamentals-of-statistics-3rd-edition-0321641876/
http://testbankbell.com/product/solutions-manual-to-accompany-
introduction-to-java-programming-9th-edition/
http://testbankbell.com/product/solutions-manual-to-accompany-
elementary-statistics-using-excel-4th-edition-0321564960/
http://testbankbell.com/product/solution-manual-for-introductory-
statistics-9th-by-mann/
http://testbankbell.com/product/solutions-manual-to-accompany-
statistics-for-business-decision-making-and-analysis-0321123913/
Solutions Manual to accompany Introductory
Statistics 9th edition 0321691229
282
CHAPTER 6 ANSWERS
Exercises 6.1
6.1 A density curve is a smooth curve that identifies the shape of a
distribution for a variable.
6.2 A density curve is always on or above the horizontal axis. Also, the area
under the density curve, and above the horizontal axis, is equal to one.
6.3 The percentage of all possible observations of a variable that lie within a
specified range is approximately equal to the corresponding area under its
density curve.
6.4 The percentage of all possible observations of the variable that lie between
7 and 12 equals the area under its density curer between 7 and 12, expressed
as a percentage.
6.5 The percentage of all possible observations of the variable that lie to the
right of 4 equals the area under its density curve to the right of 4,
expressed as a percentage.
6.6 (a) Since the area to the left of 10 is 0.654, 65.4% of all possible
observations of the variable are less than 10.
(b) Since the total area under the density curve is one and the area to the
left of 10 is 0.654, the area to the right of 10 must be 1 ± 0.654 =
0.346. Therefore, 34.6% of all possible observations of the variable
are at least 10.
6.7 (a) Since the area to the right of 15 is 0.324, 32.4% of all possible
observations of the variable exceed 15.
(b) Since the total area under the density curve is one and the area to the
right of 15 is 0.324, the area to the left of 15 must be 1 ± 0.324 =
0.676. Therefore, 67.6% of all possible observations of the variable
are at most 15.
6.8 Since the area under the curve that lies between 30 and 40 is 0.832 and the
total area under the density curve is one, the area not between 30 and 40 is
1 ± 0.832 = 0.168. The area not between 30 and 40 would be the area to the
left of 30 or the area to the right of 40. Therefore, 16.8% of all possible
observations of the variable are either less than 30 or greater than 40.
6.9 Since the area under the curve that lies between 15 and 20 is 0.414 and the
total area under the density curve is one, the area not between 15 and 20 is
1 ± 0.414 = 0.586. The area not between 15 and 20 would be the area to the
left of 15 or the area to the right of 20. Therefore, 58.6% of all possible
observations of the variable are either less than 15 or greater than 20.
6.10 (a) Since 33.6% of all possible observations of the variable exceed 8, the
area under the density curve that lies to the right of 8 is 0.336.
(b) Since the total area under the density curve is one and the area to the
right of 8 is 0.336, the area to the left of 8 must be 1 ± 0.336 =
0.664.
6.11 (a) Since 28.4% of all possible observations of the variable are less than
11, the area under the density curve that lies to the left of 11 is
0.284.
(b) Since the total area under the density curve is one and the area to the
left of 11 is 0.284, the area to the right of 11 must be 1 ± 0.284 =
0.716.
6.12 No, because the total area to the right and left of 4 should be 1, not 1.01.
6.13 No, because the total area to the right and left of 65 should be 1, not 0.9.
Copyright © 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
6.24
0.9
0.8
0.7
0.6
b
0.5
0.4
0.3
0.2
0.1
a c
0.0
-8 -6 -4 -2 0 2 4 6
x
6.31 (a)
(b) z = (x ± 18.14)/1.76
(c) z has a standard normal distribution (μ = 0 and ) = 1).
0.4
0.3
0.2
0.1
0.0
-3 -2 -1 0 1 2 3
z
(d) The percentage of adult males G. mollicoma that have carapace lengths
between 16 mm and 17 mm is equal to the area under the standard normal
curve between -1.22 and -0.65.
(e) The percentage of adult males G. mollicoma that have carapace lengths
exceeding 19 mm is equal to the area under the standard normal curve
that lies to the right of 0.49.
6.32 (a)
(b) z = (x - 206)/44.7
(c) z has a standard normal distribution (μ = 0 and ) = 1).
0.4
0.3
0.2
0.1
0.0
-3 -2 -1 0 1 2 3
z
(d) The percentage of U.S. females 20 years old or older that have a serum
total cholesterol level between 150 mg/dL and 250 mg/dL is equal to the
area under the standard normal curve between -1.25 and 0.98.
(e) The percentage of U.S. females 20 years old or older that have a serum
total cholesterol level below 220 mg/dL is equal to the area under the
standard normal curve that lies to the left of 0.31.
6.33 (a)
34 43 52 61 70 79 88
x
(b) z = (x - 61)/9
(c) z has a standard normal distribution (μ = 0 and ) = 1).
0.4
0.3
0.2
0.1
0.0
-3 -2 -1 0 1 2 3
z
(e) The percentage of finishers with times less than 75 minutes is equal to
the area under the standard normal curve that lies to the left of 1.56.
6.34 (a)
(b) z = (x ± 52.0)/17.2
(c) z has a standard normal distribution (μ = 0 and ) = 1).
0.4
0.3
0.2
0.1
0.0
-3 -2 -1 0 1 2 3
z
(d) The percentage of adult green sea urchins with weights between 50 g and
60 g is equal to the area under the standard normal curve between -0.12
and 0.47.
(e) The percentage of adult green sea urchins with weights above 40 g is
equal to the area under the standard normal curve that lies to the
right of -0.70.
6.35 (a)
0.30
0.25
Relative Frequency
0.20
0.15
0.10
0.05
0.00
12.5 17.5 22.5 27.5 32.5 37.5 42.5 47.5
Age
(b) Yes. The distribution appears to be slightly right skewed, but could
be approximated quite well by a normal distribution.
6.36 (a) Dividing each frequency by the total frequency of 51, we obtain the
relative frequencies that are shown in the following histogram.
0.35
0.30
Relative Frequency
0.25
0.20
0.15
0.10
0.05
0.00
10.5 11.5 12.5 13.5 14.5 15.5 16.5 17.5 18.5 19.5 20.5 21.5
Rate
(b) The data are somewhat right skewed, although the one birth rate between
21 and 22 accounts for much of that impression. The rest of the
distribution is closer to bell-shaped.
6.37 (a)
Cloudiness is Breslau
2000
1500
Frequency
1000
500
0
0 1 2 3 4 5 6 7 8 9 10
Degree
(b) The data is definitely not normal. The distribution instead looks
bimodal with the most frequent values of 0 and 10 occurring at the far
left and far right of the distribution. The rest of the distribution
looks evenly distributed or uniform.
6.38 (a)
Wrong Number
30
25
20
Percent
15
10
0
0 1 2 3 4 5 6 7 8
Wrong
(b) The data is not normal. The distribution of number of wrong numbers is
right skewed.
6.39 (a) Using Minitab, we obtain the data from the WeissStats CD and choose
Graph Histogram, select Simple from the first row, and click OK.
Then enter VERBAL and MATH in the Graphs variables text box and click
OK. The resulting graphs follow.
Histogram of VERBAL Histogram of MATH
80 80
70 70
60 60
50 50
Frequency
Frequency
40 40
30 30
20 20
10 10
0 0
240 320 400 480 560 640 720 320 400 480 560 640 720
VERBAL MATH
(a) Yes. The SAT verbal scores appearing in the left hand histogram have
roughly a bell shaped distribution.
(b) Yes. The SAT verbal scores appearing in the right hand histogram have
roughly a bell shaped distribution.
6.40 Using Minitab, we obtain the data from the WeissStats CD and choose Graph
Histogram, select Simple from the first row, and click OK. Then enter
FERTILITY in the Graphs variables text box and click OK. The resulting
graph follows.
50
40
30
Frequency
20
10
0
1 2 3 4 5 6 7
FERTILITY RATE
6.42 (a)
P(x)
0 2 4 6 8 10 12
x
1 2
/ 22 2
(b) y e ( x 5 )
2S 2
(c) Using Excel we enter x in cell A1 and P(x) in B1. Then below the x,
enter the numbers ±1, -0.75, -0.50, ..., 10.50, 10.75, 11.00. In cell
B2, enter the expression =(1/SQRT(2*PI()))*EXP(-((A2-5)^2)/4). Then
copy this expression to the cells below B2 for all of the values in
Column A. Use your mouse to highlight all of the data including the
titles and click on the Chart icon and select (XY) Scatter. Choose the
second option in the second row and click on Next twice. Click on the
Titles tab to enter any titles you wish to create and click on Next
again. Click on Finish.
The result is
0.45
0.40
0.35
0.30
0.25
P(x)
0.20
0.15
0.10
0.05
0.00
-1 1 3 5 7 9 11
x
(d) The curves are the same in parts (a) and (c). There may be minor
differences in the axis labeling, but the curves are the same.
column. Click on the Data view button and check only the Connect line
box. Click OK twice. The result is
0.025
0.020
0.015
P(X)
0.010
0.005
0.000
(b) Following the procedure in Example 6.2, we choose Calc Random Data
Normal..., type 1000 in the Generate rows of data text box, click in
the Store in column(s) text box and type DAYS, click in the Mean text
box and type 266, click in the Standard deviation text box and type 16,
and click OK.
(c) We would expect the sample mean and sample deviation to be
approximately equal to the mean and standard deviation of the
population, 266 and 16 respectively. This is because we expect the
sample to reflect approximately the characteristics of the population.
140
120
100
Frequency
80
60
40
20
6.44 (a)
(b) Following the procedure in Example 6.2, we choose Calc Random Data
Normal..., type 1500 in the Generate rows of data text box, click in
the Store in column(s) text box and type DAYS, click in the Mean text
box and type 53, click in the Standard deviation text box and type 3.4,
and click OK.
(c) We would expect the sample mean and standard deviation to be near 53
and 3.4 respectively since these the corresponding values for the
entire population.
(d) We choose Calc
Column Statistics..., click on the Mean button, click
in the Input variable text box and select DAYS, and click OK. Then
repeat the process, selecting the Standard deviation button. The
results are shown in the Session Window. The results we obtained were
Mean of DAYS = 52.9267 and Standard deviation of DAYS = 3.19167. Your
results will vary from ours.
(e) We would expect the histogram for the sample to be bell-shaped since it
should be similar to the normal distribution of the population.
(f) Using Minitab, we obtain the histogram below for the sample. Your
histogram will likely be similar to ours, but not identical.
200
150
Frequency
100
50
0
45 48 51 54 57 60 63
DAYS
Exercises 6.2
6.45 Finding areas under the standard normal curve is important because for any
normally distributed variable, we can obtain the percentage of all possible
observations that lie within any specified range by first converting x
values to z-scores and then finding the corresponding area under the
standard normal curve.
Copyright © 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
6.46 The standard normal curve is associated with the normal distribution with μ
= 0 and ) = 1
6.47 The total area under the curve is 1, and the standard normal curve is
symmetric about 0. Therefore, the area to the left of 0 is 0.5, and the
area to the right of 0 is 0.5.
6.48 The area under the standard normal curve to the right of 2.08 is the same as
the area to the left of -2.08, by symmetry. The area to the left of -2.08
is 0.0188. Therefore, the area to the right of 2.08 is 0.0188.
6.49 The area under the standard normal curve to the right of 0.43 is 1 - the
area to the left of 0.43. The area to the left of 0.43 is 0.6664.
Therefore, the area to the right of 0.43 is 1 - 0.6664 = 0.3336.
6.50 The area under the standard normal curve to the right of 1.96 is the same as
the area to the left of -1.96, by symmetry. The area to the right of 1.96
can be found by 1.0 - 0.975 = 0.025. Therefore, the area to the left of -
1.96 is 0.025.
6.51 The area to the left of z = 3.00 is 0.9987 and the area to the left of -3.00
is 0.0013. Therefore the area between -3.00 and 3.00 is 0.9987 - 0.0013 =
0.9974 (99.74%).
6.52 The standard normal curve is the associated normal curve for a standardized
normally distributed variable. The standardized variable is labeled with
WKHOHWWHU]KHQFHWKHQDPH³]-FXUYH´
6.53 (a) Locate the row (tenths digit) and column (hundredths digit) of the
specified z-score. The corresponding table entry is the area under the
standard normal curve that lies to the left of the z-score.
(b) The area that lies under the standard normal curve to the right of a
specified z-score is 1 - (area to the left of the z-score).
(c) The area that lies under the standard normal score between two
specified z-scores, say a and b, where a < b, is found by subtracting
the area to the left of a from the area to the left of b.
6.54 The area under the standard normal curve that lies to the left of a z-score
is always strictly between 0 and 1.
6.55 (a) (b)
0.4 0.4
0.3 0.3
P(z)
P(z)
0.0594
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
(c) (d)
0.4 0.4
0.3 0.3
P(z)
0.5000
P(z)
0.2 0.2
0.0000
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
0.4 0.4
0.3 0.3
P(z)
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
(c)
0.4
0.3
P(z)
0.2
1.0000
0.1
0.0
-4 -3 -2 -1 0 1 2 3 4
z
0.4 0.4
0.3 0.3
P(z)
P(z)
0.2 0.2
0.8577 0.2743
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
(c) (d)
0.4 0.4
0.3 0.3
P(z)
P(z)
0.2 0.2
0.5000 0.0000
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
0.4 0.4
0.3 0.3
0.7123
P(z)
P(z)
0.2 0.2
0.1 0.1
0.0217
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
(c)
0.4
0.3
P(z)
0.2
1.0000
0.1
0.0
-4 -3 -2 -1 0 1 2 3 4
z
0.4 0.4
0.3 0.3
0.9105
P(z)
P(z)
0.2 0.2
0.0440
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
0.4 0.4
0.3 0.3
P(z)
P(z)
0.2 0.2
0.2121 0.1357
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
0.4 0.4
0.3 0.3
P(z)
P(z)
0.2 0.2
0.7981
0.0166
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
0.4 0.4
0.3 0.3
0.8413
P(z)
P(z)
0.2 0.2
0.0661
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
0.4 0.4
0.3 0.3
P(z)
P(z)
0.2
0.0475 0.2
0.7357 0.0618
0.0170
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
6.62 (a)
0.4
0.3
P(z)
0.2
0.1587 0.0228
0.1
0.0
-4 -3 -2 -1 0 1 2 3 4
z
0.4
0.3 0.8413
p(z)
0.2
0.0060
0.1
0.0
-3 -2 -1 0 1 2 3
X
0.4 0.4
0.3 0.3
0.6826
P(z)
P(z)
0.2 0.2
0.9544
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
0.4
0.3
P(z)
0.2 0.9974
0.1
0.0
-4 -3 -2 -1 0 1 2 3 4
z
(c)
Area = 0.9987 ± 0.0013 = 0.9974
6.66 (a) and (b)
Percentage of
Region Area total area
- to -3 0.0013 0.13
-3 to -2 0.0228 - 0.0013 = 0.0215 2.15
-2 to -1 0.1587 - 0.0228 = 0.1359 13.59
-1 to 0 0.5000 - 0.1587 = 0.3413 34.13
0 to 1 0.8413 - 0.5000 = 0.3413 34.13
1 to 2 0.9772 - 0.8413 = 0.1359 13.59
2 to 3 0.9987 - 0.9722 = 0.0215 2.15
3 to 1.0000 - 0.9987 = 0.0013 0.13
1.0000 100.00
6.67 6.68
0.4 0.4
0.3 0.3
P(z)
P(z)
0.2 0.2
0.1
0.0250 z = -1.96 0.1
0.0100 z = -2.33
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
z = -1.96 z = -2.33
6.69 6.70
0.4 0.4
P(z)
0.2 0.2
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
z = 0.67 z = 0.84
6.71 6.72
0.4 0.4
0.3 0.3
0.9500 0.7000
P(z)
P(z)
0.2 0.2
0.1
0.0500 z = -1.645 0.1
0.3000
z = -0.52
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
z = -1.645 z = -0.52
Note: The area 0.0500 was exactly between
the area of two z-scores on the table.
Therefore, the two z-scores of -1.64 and
-1.65 were averaged.
Copyright © 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
6.73 6.74
0.4 0.4
0.3 0.3
0.0067 0.33 0.9850
P(z)
P(z)
0.2 0.2
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
z= 0.44 z = 2.17
6.75 (a) (b)
0.4 0.4
0.3 0.3
0.9700 0.9850
P(z)
P(z)
0.2 0.2
0.0050
0.1 z = 1.88 0.1 z = 2.575
0.0300
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
z = 1.88 z = 2.575
0.4
0.3
0.3
P(z)
0.2
0.9400 0.0600
P(z)
0.2
0.8000 0.2000 0.1 z = 1.5554
0.1 z = 0.84
0.0
-4 -3 -2 -1 0 1 2 3 4
0.0
-4 -3 -2 -1 0 1 2 3 4 z
z = 0.84 z = 1.555
6.77 6.78
0.4 0.4
0.3 0.3
0.9000 0.9900
P(z)
P(z)
0.2 0.2
z = 1.645 z = 2.575
0.1 0.1
0.0500 z = -1.645 0.0500 0.0050 z = -2.575 0.0050
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
Z = +1.645 Z = +2.575
6.79
z0.10 z0.05 z0.025 z0.01 z0.005
6.80 The area under any curve representing the distribution of a variable is
equivalent to probability. Since the total probability for all of the
values of any variable is 1, the total area under the curve representing the
probability distribution of any variable is also 1.
6.81 (a) z (b) -z (c) + z/2
(d)
For part (a): For part (b):
0.4 0.4
0.3
0.3
P(z)
P(z)
0.2
0.2
0.1 z 0.1
-z
0.0
0.0 -4 -3 -2 -1 0 1 2 3 4
-4 -3 -2 -1 0 1 2 3 4 z
z
For part (c):
0.4
0.3
1 ±
P(z)
0.2
0.1
/2 -z/2 z/2 /2
0.0
-4 -3 -2 -1 0 1 2 3 4
z
Exercises 6.3
6.82 Sketch the normal curve associated with the variable, shade the region of
interest and mark the delimiting x-values, compute the z-scores for the
delimiting x values, and use Table II to obtain the area under the standard
normal curve delimited by those z-scores.
6.83 The x values delimiting the interval within two standard deviations either
side of the mean are μ - 2) and μ + 2). When these are standardized using
z = (x - μ)/), the former becomes z = [(μ - 2)) - μ]/) = -2 and the latter
becomes z = [(μ + 2)) - μ]/) = +2.
6.84 The empirical rule says that for any variable having approximately a bell-
shaped distribution, regardless of whether the variable is normally
distributed, approximately 68.26% of the observations will lie within one
standard deviation either side of the mean, approximately 95.44% of the
observations will lie within two standard deviations either side of the
mean, and approximately 99.74% of the observations will lie within three
standard deviations either side of the mean.
6.85 (a) For the variable values 1 and 7, the z-values are
1 6 76
z 2.50 and z 0.50
2 2
The area to the left of z = -2.50 is 0.0062 and the area to the left of
z = 0.50 is 0.6915. Therefore the area between z = -2.50 and z = 0.50
is 0.6915 - 0.0062 = 0.6853. Thus the percentage of all possible
values of the variable that lie between 1 and 7 is 68.53%.
(b) For a variable value of 5, the z value is
56
z 0.50
2
The area to the left of z = -0.50 is 0.3085. The area to right of z =
-0.50 is 1 ± 0.3085 = 0.6915. Thus the percentage of values of the
variable exceeding 5 is 69.15%.
(c) For a variable value of 4, the z value is
46
z 1.00
2
The area to the left of z = -1.00 is 0.1587. Thus the percentage of
values of the variable that are less than 4 is 15.87%.
6.86 (a) For the variable values 73 and 80, the z-values are
73 68 80 68
z 0.50 and z 1.20
10 10
The area to the left of z = 0.50 is 0.6915 and the area to the left of
z = 1.20 is 0.8849. Therefore the area between z = 0.50 and z = 1.20
is 0.8849 - 0.6915 = 0.1934. Thus the percentage of all possible
values of the variable that lie between 73 and 80 is 19.34%.
(b) For a variable value of 75, the z value is
75 68
z 0.70
10
The area to the left of z = 0.70 is 0.7580. The area to right of z =
0.70 is 1 ± 0.7580 = 0.2420. Thus the percentage of values of the
Copyright © 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
6.89 (a) Using Table II, we find that an area of 0.2500 lies to left of
z = -0.67, an area of .5000 lies to the left of z = 0.00 and an area of
0.7500 lies to the left of 0.67. We convert each of these z-values to
x-values using x = μ + z). Thus 25% of the values of the variable lie
to the left of the first quartile, x = 6 + (-0.67)(2) = 4.66. Half
(50%) of the values of the variable lie to the left of the second
quartile (median), x = 6 + (0.00)(2) = 6. Finally, 75% of the values
of the variable lie to the left of the third quartile, x = 6 +
(0.67)(2) = 7.34.
(b) Using Table II, we find that an area of 0.8500 lies to the left of
z = 1.04. We convert this z-value to an x-value using x = μ + z).
Thus 85% of the values of the variable lie to the left of the 85 th
percentile, x = 6 + (1.04)(2) = 8.08.
(c) If 65% of the possible values exceed the value, then 35% of the
possible values are less than the value. Using Table II, we find that
an area of 0.3500 lies to the left of z = -0.39. We convert this z-
value to an x-value using x = μ + z). Thus 65% of the values of the
variable exceed x = 6 + (-0.39)(2) = 5.22.
(d) The two values that divide the area into a middle area of 0.95 and two
outside areas of 0.025 have 2.5% of the curve and 97.5% of the curve to
their left. Using Table II, we find that an area of 0.0250 lies to the
left of z = -1.96 and an area of 0.9750 lies to the left of z = 1.96.
We convert these z-values to x-values using x = μ + z). Thus 95% of
the variable lie between x = 6 + (-1.96)(2) = 2.08 and
x = 6 + (1.96)(2) = 9.92.
6.90 (a) Using Table II, we find that an area of 0.2500 lies to left of
z = -0.67, an area of .5000 lies to the left of z = 0.00 and an area of
0.7500 lies to the left of 0.67. We convert each of these z-values to
x-values using x = μ + z). Thus 25% of the values of the variable lie
to the left of the first quartile, x = 68 + (-0.67)(10) = 61.30. Half
(50%) of the values of the variable lie to the left of the second
quartile (median), x = 68 + (0.00)(10) = 68. Finally, 75% of the
values of the variable lie to the left of the third quartile, x = 68 +
(0.67)(10) = 74.70.
(b) Using Table II, we find that an area of 0.9900 lies to the left of
z = 2.33. We convert this z-value to an x-value using x = μ + z).
Thus 99% of the values of the variable lie to the left of the 99 th
percentile, x = 68 + (2.33)(10) = 91.30.
(c) If 85% of the possible values exceed the value, then 15% of the
possible values are less than the value. Using Table II, we find that
an area of 0.1500 lies to the left of z = -1.04. We convert this z-
value to an x-value using x = μ + z). Thus 85% of the values of the
variable exceed x = 68 + (-1.04)(10) = 57.60.
(d) The two values that divide the area into a middle area of 0.90 and two
outside areas of 0.05 have 5% of the curve and 95% of the curve to
their left. Using Table II, we find that an area of 0.0500 lies to the
left of z = -1.645 and an area of 0.9500 lies to the left of z = 1.645.
We convert these z-values to x-values using x = μ + z). Thus 90% of
the variable lie between x = 68 + (-1.645)(10) = 51.55 and
x = 68 + (1.645)(10) = 84.45.
6.91 (a) Using Table II, we find that an area of 0.2500 lies to left of
z = -0.67, an area of .5000 lies to the left of z = 0.00 and an area of
0.7500 lies to the left of 0.67. We convert each of these z-values to
x-values using x = μ + z). Thus 25% of the values of the variable lie
to the left of the first quartile, x = 10 + (-0.67)(3) = 7.99. Half
Copyright © 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
(50%) of the values of the variable lie to the left of the second
quartile (median), x = 10 + (0.00)(3) = 10. Finally, 75% of the values
of the variable lie to the left of the third quartile, x = 10 +
(0.67)(3) = 12.01.
(b) The seventh decile is the same as the 70th percentile. Using Table II,
we find that an area of 0.7000 lies to the left of z = 0.52. We
convert this z-value to an x-value using x = μ + z). Thus 70% of the
values of the variable lie to the left of the seventh decile, x = 10 +
(0.52)(3) = 11.56.
(c) If 35% of the possible values exceed the value, then 65% of the
possible values are less than the value. Using Table II, we find that
an area of 0.6500 lies to the left of z = 0.39. We convert this z-
value to an x-value using x = μ + z). Thus 35% of the values of the
variable exceed x = 10 + (0.39)(3) = 11.17.
(d) The two values that divide the area into a middle area of 0.99 and two
outside areas of 0.005 have 0.5% of the curve and 99.5% of the curve to
their left. Using Table II, we find that an area of 0.0050 lies to the
left of z = -2.575 and an area of 0.9950 lies to the left of z = 2.575.
We convert these z-values to x-values using x = μ + z). Thus 99% of
the variable lie between x = 10 + (-2.575)(3) = 2.275 and
x = 10 + (2.575)(3) = 17.725.
6.92 (a) Using Table II, we find that an area of 0.2500 lies to left of
z = -0.67, an area of .5000 lies to the left of z = 0.00 and an area of
0.7500 lies to the left of 0.67. We convert each of these z-values to
x-values using x = μ + z). Thus 25% of the values of the variable lie
to the left of the first quartile, x = 0 + (-0.67)(4) = -2.68. Half
(50%) of the values of the variable lie to the left of the second
quartile (median), x = 0 + (0.00)(4) = 0. Finally, 75% of the values
of the variable lie to the left of the third quartile, x = 0 +
(0.67)(4) = 2.68.
(b) The second decile is the same as the 20th percentile. Using Table II,
we find that an area of 0.2000 lies to the left of z = -0.84. We
convert this z-value to an x-value using x = μ + z). Thus 20% of the
values of the variable lie to the left of the second decile,
x = 0 + (-0.84)(4) = -3.36.
(c) If 15% of the possible values exceed the value, then 85% of the
possible values are less than the value. Using Table II, we find that
an area of 0.8500 lies to the left of z = 1.04. We convert this z-
value to an x-value using x = μ + z). Thus 15% of the values of the
variable exceed x = 0 + (1.04)(4) = 4.16.
(d) The two values that divide the area into a middle area of 0.80 and two
outside areas of 0.10 have 10% of the curve and 90% of the curve to
their left. Using Table II, we find that an area of 0.1000 lies to the
left of z = -1.28 and an area of 0.9000 lies to the left of z = 1.28.
We convert these z-values to x-values using x = μ + z). Thus 80% of
the variable lie between x = 0 + (-1.28)(4) = -5.12 and
x = 0 + (1.28)(4) = 5.12.
6.93 (a) For carapace lengths 16 mm and 17 mm, the z-values are
16 18.14 17 18.14
z 1.22 and z 0.65
1.76 1.76
The area to the left of z = -1.22 is 0.1112 and the area to the left of
z = -0.65 is 0.2578. Therefore the area between z = -1.22 and z = -
0.65 is 0.2578 - 0.1112 = 0.1466. Thus the percentage of carapace
Copyright © 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
The area to the left of z = -1.50 is 0.0668 and the area to the left of
z = 0.96 is 0.8315. Therefore the area between z = -1.50 and z = 0.96
is 0.8315 - 0.0668 = 0.7647. Thus the percentage of tee shots that
went between 260 and 280 yards is 76.47%.
(b) For x = 300, the z-value is
300 272.2
z 3.42 . The area to the left of z = 3.42 is 0.9997.
8.12
Thus the area to the right of z = 3.42 is 1 - 0.9997 = 0.0003, implying
that only 0.03% of tee shots went more than 300 yards.
6.98 (a) For tumors that are 1 and 2 cm long, the z values are
(d)
0.4
0.4 0.4
0.3
0.3 0.3
P(z)
P(z)
P(z)
0.2
0.2 0.2
0.9974
0.6826 0.9544 0.1
0.1 0.1
0.0
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4 z
z z 1.73
1.18 1.62 1.07
1.29 1.51
6.102 (a) 68.26% of all such children watch between 18.27 and 30.73 hours of TV
per week.
(b) 95.44% of all such children watch between 12.04 and 36.96 hours of TV
per week.
(c) 99.74% of all such children watch between 5.81 and 43.19 hours of TV
per week.
(d)
0.4 0.4 0.4
f(z)
f(z)
0.2 68.26% 0.2 95.44% 0.2 99.74%
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z z
18.27 30.73 x 12.04 36.96 x 5.81 43.19 x
6.103 (a) From Table 6.1, exactly 11.70% of the heights of female students are
between 62 and 63. For heights of 62 and 63 inches, the z values are
62 64.4 63 64.4
z 1.00 and z 0.58
2.4 2.4
The area to the left of z = -1.00 is 0.1587 and the area to the left of
z = -0.58 is 0.2810. Thus, the area between z = -1.00 and z = -0.58 is
0.2810 - 0.1587 = 0.1223 (12.23%). The two percentages are quite
close.
(b) From Table 6.1, exactly = 0.1575 + 0.1100 + 0.0735 + 0.0374 + 0.0199 =
0.3983 (39.83%) of the heights of female students are between 65 and 70
inches. For heights of 65 and 70 inches, the z values are
65 64.4 70 64.4
z 0.25 and z 2.33
2.4 2.4
The area to the left of z = 0.25 is 0.5987 and the area to the left of
z = 2.33 is 0.9901. Thus, the area between z = 0.25 and a = 2.33 is
0.9901 - 0.5987 = 0.3914 (39.14%). The two percentages are quite
close.
6.104 (a) A sketch of a normal curve with mean 9.58 inches and standard deviation
0.51 inches is as follows
0.8
0.7
0.6
0.5
Density
0.4
0.3
0.2
0.1
0.0
8.05 8.56 9.07 9.58 10.09 10.60 11.11
Foot Length (in)
5 6.1 7 6.1
P(5 X 7) P( Z ) P(0.85 Z 0.69)
1.3 1.3
0.7549 0.1977 0.5572
We interpret this to mean that about 55.72% of female polychaete worms
have a length that is between 5 and 7 mm.
5 4.66
6.106 (a) P( Y ! 5) P( Z ! ) P( Z ! 0.45) 1 0.6736 0.3264 . We interpret
0.75
this to mean that about 32.64% of the nests of the booted eagle of
western Europe are more than 5 km to the nearest marshland.
3 4.66 6 4.66
(b) P(3 d Y d 6) P( dZd ) P(2.21 d Z d 1.79)
0.75 0.75
0.9633 0.0136 0.9497
We interpret this to mean that about 95% of the nests of the booted
eagle of western Europe are between 3 and 6 km from the nearest
marshland.
6.107 (a) 95% of the population values lie within 1.96 standard deviations to
either side of the mean.
(b) 89.9% of the population values lie within 1.64 standard deviations to
either side of the mean.
6.108 (a) 99% of all possible observations lie within 2.575 standard deviations
to either side of the mean.
(b) 80% of all possible observations lie within 1.28 standard deviations to
either side of the mean.
6.109 If the times between patients were approximately normally distributed, then
approximately 15.87% of the inter-arrival times would be negative since
P(Time < 0) = P(z<(0-8.7)/8.7) = P(z < -1) = 0.1587. Thus about one out of
every six inter-arrival times would be negative. Since this is not
possible, the distribution of times must not be normal.
6.110 P(μ - z/2 ) < x < μ + z/2) = P(-z/2 ) < x - μ < z/2 ))
= P(-z/2 < (x - μ)/) < z/2 )
= P(-z/2 < z < z/2 ) = (1 - /2) - /2
= 1 -
6.111 (a) Q1 = μ - 0.67)
Q2 = μ
Q3 = μ + 0.67)
Pk = μ + z(1-k/100))
Exercises 6.4
6.112 A normal probability plot is particularly useful when sample sizes are
small; these are situations in which histograms, stem-and-leaf diagrams, and
dotplots are less useful in determining the shape of the distribution.
6.113 Decisions about whether a variable is normally distributed often are
important in subsequent analyses such as percentile calculations or in
determining the type of statistical inference procedure to be used.
6.114 A normal probability plot is a plot of the observed values of a variable
versus the normal scores ± the observations expected for a variable having a
standard normal distribution. If the variable is normally distributed, then
the normal probability plot should yield roughly a straight line. If the
plot is roughly linear, then it is accepted as reasonable that the variable
is approximately normally distributed. If it is not roughly linear, then
the variable is probably not normally distributed.
6.115 If the normal probability plot is roughly linear except for a small number
of points that lie well outside the overall pattern of the plot, it is
possible that some or all of those points are outliers. If there is
sufficient reason to remove the potential outliers from the sample and doing
so results in a plot that is linear without outliers, the analysis can often
be carried out with the remaining observations.
6.116 If two or more observations in a sample are equal, you can think of them as
slightly different from one another and use Table III to obtain their normal
scores. For example, if there are two observations of 6.23, think of them
as 6.225 and 6.235.
6.117 The variable is approximately normally distributed because the normal
probability plot is approximately linear.
6.118 The variable is approximately normally distributed because the normal
probability plot is approximately linear.
6.119 The variable is not normally distributed because the normal probability plot
is not linear and is instead curved.
6.120 The variable is not normally distributed because the normal probability plot
is not linear and is instead curved.
6.121 The variable is not normally distributed because the normal probability plot
is not linear and is instead curved.
6.122 The variable is not normally distributed because the normal probability plot
is not linear and is instead curved.
6.123 (a)
2.5
2.0
1.5
NORMAL SCORE
1.0
0.5
0.0
-0.5 20 40 60 80 100 120
-1.0
-1.5
-2.0
-2.5
SCORE
(b) Based on the probability plot, there appear to be two outliers in the
sample: 34 and 39.
(c) Based on the probability plot, the sample does not appear to come from
a normally distributed population.
6.124 (a)
2
NORMAL SCORE
0
0 50 100 150
-1
-2
SUBSCRIPTION RATES
(b) Based on the probability plot, there do not appear to be any outliers
in the sample.
(c) Based on the probability plot, the sample appears to be from an
approximately normally distributed population.
6.125 (a)
Times Normal Score
2.00 X y
1.50 93.37 -1.71
NORMAL SCORE
6.126 (a)
Expenditure Normal Score
X y
238 -1.64
246 -1.11
256 -0.79
302 -0.53
311 -0.31
320 -0.10
321 0.10
321 0.31
327 0.53
335 0.79
343 1.11
423 1.64
(b) Based on the probability plot, there appears to be one outlier in the
sample: 423.
(c) Based on the non-linear probability plot, the sample does not appear to
be from an approximately normally distributed population.
6.127 (a) :HHQWHUWKHGDWDLQ([FHOLQDFROXPQZKLFKZHQDPHG³[´DWWKHWRS
Highlight the name and the data with your mouse and then choose DDXL
Charts and Plots. Select Normal Probability Plot from the drop down
Function type box and enter x in both the Quantitative variable and
Label variable boxes. Click OK. The resulting plot is
(b) From the plot, it appears that there may be two outliers, 7.6 and 6.7.
(c) From the plot, the data do not appear to be normally distributed.
However, if the two outliers were excluded from the data, the remaining
data may be normally distributed.
6.130 (a) We enter the data iQ([FHOLQDFROXPQZKLFKZHQDPHG³%XULDOV´DWWKH
top. Highlight the name and the data with your mouse and then choose
DDXL Charts and Plots. Select Normal Probability Plot from the drop
down Function type box and enter Burials in both the Quantitative
variable and Label variable boxes. Click OK. The resulting plot is
(b) From the plot on the left, there is one potential outlier, 2484.
(c) We removed the outlier and replotted the data above at the right. Even
with the outlier removed, the plot is decidedly non-linear. We
conclude that the data are probably not normally distributed.
6.131 (a) UsiQJ0LQLWDEDQGDVVXPLQJWKDWWKHGDWDDUHLQDFROXPQQDPHGµ7(03¶
choose Graph
Histogram..., select the Simple plot and click OK.
Specify TEMP in the Graph Variables text box and click OK. The
resulting plot is
Histogram of TEMP
20
15
Frequency
10
0
97.0 97.5 98.0 98.5 99.0 99.5
TEMP
$VLGHIURPWKHXQXVXDOµVDZWRRWK¶SDWWHUQRIWKHOHIWVLGHRIWKH
graph, an assumption of normality for this data appears to be
reasonable.
(b) Choose Graph
Probability Plot..., select the Single plot and click
OK. Specify TEMP in the Graph Variables text box. Click on the
Distribution button, click the Data Display tab, select the Symbols
only option button from the Data Display list and click OK. Now click
the Scale button, click on the Y-Scale Type tab, select the Score
option button from the Y-Scale Type list, and click OK twice. The
resulting plot is
1
Score
-1
-2
-3
97.0 97.5 98.0 98.5 99.0 99.5
TEMP
(b) There are no apparent outliers and the graph is roughly linear, so it
seems reasonable to assume normality for these data.
(c) We arrived at the same conclusion from both graphs.
6.132 (a) Using Minitab and assuming that the data are in FROXPQVQDPHGµ9(*¶DQG
µ201,¶FKRRVHGraph Histogram..., select the Simple plot and click
OK. Specify VEG and OMNI in the Graph Variables text box and click OK.
The resulting plots are
12
12
10
10
Frequency
Frequency
8 8
6 6
4 4
2 2
0 0
0 20 40 60 80 10 20 30 40 50 60 70 80
VEG OMNI
2 2
1 1
Score
Score
0 0
-1 -1
-2 -2
0 10 20 30 40 50 60 70 80 90 0 10 20 30 40 50 60 70 80 90
VEG OMNI
Both plots are roughly linear with no apparent outliers. Thus the
assumption of normality for these data seems reasonable.
(c) Both probability plots are more linear than we expected after looking
at the histograms. This illustrates the difficulty of assessing
QRUPDOLW\ZLWKKLVWRJUDPVDQGIDLUO\VPDOOVDPSOHV7KHVRIWZDUH¶V
choice of classes for the histograms may also have affected our
impression of the data.
Score
0
-1
-2
Yes. There do not appear to be any outliers in these data. Since the plot
follows a fairly straight line, normality of the chips data seems to be a
reasonable assumption.
6.134 (a) Minitab has the capability of producing a histogram and a normal
probability plot from grouped data. Enter the midpoints from 9.5 to
13.4 in a column named LENGTH and enter the corresponding frequencies
in a column named FREQ. To create the histogram using these two
columns, select Graph
Histogram, select the Simple version and click
OK, then enter LENGTH in the Graph variables text box and click on the
Data options button and then on the Frequency tab. Click on the
Frequency variable(s) text box and enter FREQ. Click OK twice. The
graph is shown below and looks very much like a normal distribution.
700
600
500
Sum of FREQ
400
300
200
100
ML Estimates
Mean: 11.5457
99 StDev: 0.552487
95
90
80
70
Percent
60
50
40
30
20
10
5
Data
6.135 (a) Using Minitab, we will generate four columns of fifty observations each
by choosing Calc Random data
Normal..., entering 50 in the
Generate rows of data text box, clicking in the Store in column(s) text
box and typing C1-C4, clicking in the Mean text box and entering 266,
clicking in the Standard deviation text box and entering 16, and
clicking OK. Now click in the worksheet column title row and name the
four columns GEST1, GEST2, GEST3, and GEST4.
2 2
1 1
Score
Score
0 0
-1 -1
-2 -2
220 230 240 250 260 270 280 290 300 230 240 250 260 270 280 290 300 310
GEST1 GEST2
2 2
1 1
Score
Score
0 0
-1 -1
-2 -2
230 240 250 260 270 280 290 220 230 240 250 260 270 280 290 300
GEST3 GEST4
(c) Yes. Since the data were generated from a normal distribution, we
would expect the four plots to be roughly linear, as they are. Your
simulation will likely result in data and graphs that differ from ours.
6.136 (a) We will generate four columns of 75 observations each by choosing Calc
Random data
Exponential..., entering 75 in the Generate rows of
data text box, clicking in the Store in column(s) text box and entering
C1-C4, clicking in the Mean text box and typing 8.7, and clicking OK.
Now click in the worksheet column title row and name the four columns
TIME1, TIME2, TIME3, and TIME4.
Variables text box. Click OK. The resulting four graphs are shown
below.
2 2
1 1
Score
Score
0 0
-1 -1
-2 -2
-3 -3
0 10 20 30 40 0 10 20 30 40
TIME1 TIME2
2 2
1 1
Score
Score
0 0
-1 -1
-2 -2
-3 -3
0 10 20 30 40 0 10 20 30 40 50 60
TIME3 TIME4
(c) Yes. Since the data were generated from a non-normal reverse J-shaped
distribution, we should expect the plots to be non-linear.
Exercises 6.5
6.137 It is not practical to use the binomial probability formula when the number
of trials, n, is large.
6.138 A binomial distribution with p not equal to 0.5 is skewed. When n is small
and p is near 0 or 1, the skewness is enough to preclude using a normal
approximation. However, as n increases and/or as p nears 0.5, the skewness
subsides and the binomial distribution becomes sufficiently bell-shaped to
permit a normal approximation. In general, the binomial distribution is
sufficiently bell-shaped when np and n(1 ± p) are both 5 or greater. On the
other hand, a binomial distribution with p equal to 0.5 is symmetric,
regardless of the number of trials.
6.139 (a) (i) P(x = 4 or 5) = 0.2051 + 0.2461 = 0.4512
(ii) P(3 < x < 7) = 0.1172 + 0.2051 + 0.2461 + 0.2051 + 0.1172 = 0.8907
(b) (i) P(x = 4 or 5):
Step 1: n = 10; p = 0.5
Step 2: np = 5; n(1 - p) = 5. Since both np and n(1 - p) are at
least 5, the normal approximation can be used.
Step 3:
Copyright © 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
Px np 10(0.)
5 5
Vx np(1 p) 10(0.)(
5 0.)
5 1.58
6.143 (a) The mean is P np 200(0.65) 130 and V = np(1-p) 200(0.65)(0.35) 6.75.
Note that np = 130 and n(1 ± p) = 70, so it is reasonable to use the
normal approximation for binomial probabilities. If X represents the
number of preschool children living in poverty that have been exposed
to cigarette smoke at home, we want P(X > 125). With the continuity
correction, this becomes P(X > 124.5). For X = 124.5, we have z =
(124.5 ± 130)/6.75 = -0.81. The area to the left of z = -0.81 is
0.2090. Therefore P(X > 124.5) = 1 ± 0.2090 = 0.7910 (approximately).
Note that np = 83 and n(1 ± p) = 117. since both are greater than 5,it
is reasonable to use the normal approximation for binomial
probabilities. If X represents the number of naturalized citizens who
were originally from Asia, we want P(X < 75). With the continuity
correction, this becomes P(X < 74.5). For X = 74.5, we have z = (74.5
± 83)/6.97 = -1.22. The area to the left of z = -1.22 is 0.1112.
Therefore P(X < 74.5) = 0.1112.
(b) P(80 < X < 90) with continuity corrections becomes P(79.5 < X < 90.5)
For X = 79.5, we have z = (79.5 ± 83)/6.97 = -0.50, and for X = 90.5,
we have z = (90.5 ± 83)/6.97 = 1.08. The area to the left of ±0.50 is
0.3085, and the area to the left of 1.08 is 0.8599. Therefore the
probability that X is between 80 and 90, inclusive, is approximately
0.8599 - 0.3085 = 0.5514.
(c) P(X < 70 or X > 90) = P(X < 70) + P(X > 90). With continuity
corrections, this becomes P(X < 69.5) + P(X > 90.5). For X = 69.5, we
have z = (69.5 ± 83)/6.97 = -1.94, and for X = 90.5, we have z = (90.5
± 83)/6.97 = 1.08. The area to the left of ±1.94 is 0.0262. The area
to the left of 1.08 is 0.8599. Thus, The area to the right of 1.08 is
1.000 - 0.8599 = 0.1401. Finally, P(X < 70 or X > 90) = 0.0262 +
0.1401 = 0.1663 (approximately).
6.145 For parts (a), (b),and (c), steps 1-3 are as follows:
Step 1: n = 100; p = 0.316
Step 2: np = 31.6; n(1 - p) = 68.4. Since both np and n(1 - p) are at
least 5, the normal approximation can be used.
Step 3:
Px np 100(0.316) 31.6
Vx np(1 p) 100(0.316)(0.684) 4.65
(a) P(X = 32)
Step 4: For x = 31.5 and x = 32.5, the z-scores are
31.5 31.6 32.5 31.6
z 0.02 and z 0.19.
4.65 4.65
The area to the left of z = -0.02 is 0.4920 and the area to the left of
z = 0.19 is 0.5753. Therefore, the approximate probability that X = 32
is 0.5753 ± 0.4920 = 0.0833.
(b) P(30 < X < 35)
Step 4: For X = 29.5 and X = 35.5, the z-scores are
29.5 31.6 35.5 31.6
z 0.45 and z 0.84.
4.65 4.65
The area to the left of z = -0.45 is 0.3264 and the area to the left of
z = 0.84 is 0.7995. Therefore, the approximate probability that X is
between 30 and 35, inclusive, is 0.7995 ± 0.3264 = 0.4731.
(c) P(X > 25)
Step 4: For X = 24.5, the z-score is
24.5 31.6
z 1.53.
4.65
The area to the left of z = -1.53 is 0.0630. Therefore, the
Copyright © 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
(d) In Step 2, we found that np = 9.75. This meets the minimum requirement
of at least 5, but it is close to 5. The larger the values of np and
n(1-p), the better the normal approximation to the binomial
distribution. Since 9.75 is a smaller number, the normal approximation
is acceptable, but not the most accurate approximation.
6.149 For parts (a), (b), and (c), steps 1-3 are as follows:
Step 1: n = 250; p = 0.40
Step 2: np = 100; n(1 - p) = 150. Since both n(1 -p) and np are at
least 5, the normal approximation can be used.
Step 3:
Px np 250(0.40) 100
Vx np(1 p) 250(0.40)(0.60) 7.746
(a) Forty percent of 250 is 100. P(X = 100):
Step 4: For x = 99.5 and x = 100.5, the z scores are
99.5 100 100.5 100
z 0.06 and z 0.06.
7.746 7.746
The area to the left of z = -0.06 is 0.4761 and the area to
the left of z = 0.06 is 0.5239. Therefore, the approximate
probability that X is 100 is 0.5239 ± 0.4761 = 0.0478.
(b) Step 4: For x = 100.5, the z-score is
100.5 100
z 0.06.
7.746
The area to the left of z = 0.06 is 0.5239. Therefore, the
approximate probability that X exceeds 100 is 1.0000 ± 0.5239
= 0.4761.
(c) Step 4: For x =89.5, the z-score is
89.5 100
z 1.36.
7.746
The area to the left of z = -1.36 is 0.0869. Therefore, the
approximate probability that X is fewer than 90 is 0.0869.
6.150 For parts (a), (b), and (c), steps 1-3 are as follows:
Step 1: n = 200; p = 0.66
Step 2: np = 132; n(1 - p) = 68. Since both n(1 -p) and np are at
least 5, the normal approximation can be used.
Step 3:
Px np 200(0.66) 132
Vx np(1 p) 200(0.66)(0.34) 6.699
(a) 66% of 200 is 132. P(X = 132):
Step 4: For x = 131.5 and x = 132.5, the z-scores are
131.5 132 132.5 132
z 0.07 and z 0.07.
6.699 6.699
The area to the left of z = -0.07 is 0.4721 and the area to
the left of z = 0.07 is 0.5279. Therefore, the approximate
probability that X is 78 is 0.5279 - 0.4721 = 0.0558.
Copyright © 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
Step 3:
Px np 5000(0.47368) 2368.4
Vx np(1 p) 300(0.47368)(0.52632) 35.306
Step 4: For x = 2500.5, the z-score is
2500.5 2368.4
z 3.74.
35.306
The area to the left of z = 3.74 is 0.9999. Therefore, the
approximate probability that X is at least 2501 is 1.0000 ± 0.9999
= 0.0001.
6.152 Battery lifetime (x) is normally distributed with μ = 30 hours and ) = 5
hours. Also, n = 500. Initially, we need to find the "success" probability
associated with any one battery lasting longer than 25 hours. This success
probability is calculated as follows: The z-score for x = 25 is
25 30
z 1.00.
5
The area to the left of z = 1.00 is 0.1587, so the probability that a single
battery will last more than 25 hours is 1.0000 ± 0.1587 = 0.8413. This is
our success probability p.
We now calculate the probability that at least 80% of the batteries last
longer than 25 hours. We do this by using the normal approximation to the
binomial. Since 80% of 500 is 400, we can find P(x > 400) as follows:
Step 1: n = 500; p = 0.8413
Step 2: np = 420.65; n(1-p) = 79.35. Since both np and n(1-p) are at
least 5, the normal approximation can be used.
Step 3:
Px np 500(0.8413) 420.65
Vx np(1 p) 500(0.8413)(0.1587) 8.17
Step 4: For x = 399.5, the z-score is
399.5 420.65
z 2.59.
8.17
The area to the left of z = -2.59 is 0.0048, so the probability that at
least 80% will last more than 25 hours is 1.0000 ± 0.0048 = 0.9952.
6.153 (a) The expected number of males with Fragile X Syndrome is np =
10000(1/1500) = 6.67.
(b) To use the normal approximation to the binomial distribution, we first
need the standard deviation, which is
1 1499
Vx np(1 p) 10000( )( ) 2.581
1500 1500
To find the approximate probability that more than 7 males have Fragile
X syndrome, we want P(X > 7) or P(X > 8) or, approximating, we use P(X
> 7.5). For x = 7.5, the z-score is
7.5 6.67
z 0.32.
2.581
Copyright © 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
The area to the left of z = 0.32 is 0.6255. Thus, the probability that
more than 7 males have Fragile X Syndrome is 1.0000 - 0.6255 = 3745.
To find the approximate probability that at most 10 males have Fragile
X syndrome, we want P(X < 10) or, approximating, we use P(X < 10.5).
For x = 10.5, the z=score is
10.5 6.67
z 1.48.
2.581
The area to the left of z = 1.48 is 0.9306. Thus, the probability that
at most 10 males have Fragile X Syndrome is 0.9306.
(c) Although both np and n(1 - p) are at least 5, a binomial distribution
with p = 1/1500 is highly skewed to the right whereas the normal
distribution being used to approximate it is a symmetric distribution.
The Poisson distribution used in Exercise 5.93 reflects the skewness of
the binomial distribution and thus we would expect it to provide the
better approximation.
8. The percentages for a normally distributed variable and areas under the
corresponding normal curve (expressed as a percentage) are identical.
9. The distribution of the standardized version of a normally distributed
variable is the standard normal distribution, that is, a normal distribution
with a mean of 0 and standard deviation of 1.
10. (a) True. The mean completely determines the center of the distribution.
(b) True. The standard deviation completely determines the shape of the
distribution.
11. (a) The (second) curve with ) = 6.2 has the largest spread.
(b) The first and second curves are centered at μ = 1.5.
(c) The first and third curves have the same shape because ) = 3 for both.
(d) The third curve is centered farthest to the left because it has the
smallest value of μ = -2.7.
(e) The fourth curve is the standard normal curve because μ = 0 and ) = 1.
12. Key Fact 6.4.
13. (a) The table entry corresponding to the specified z-score is the area to
the left of that z-score.
(b) The area to the right of a specified z-score is found by subtracting
the table entry from 1.
(c) The area between two specified z-scores is found by subtracting the
table entry for the smaller z-score from the table entry for the larger
z-score.
14. (a) Find the table entry that is closest to the specified area. The z-
score determined by locating the corresponding marginal values is the
z-score that has the specified area to its left. If the specified area
is exactly between two entries, then the two z-scores should be
averaged.
(b) Subtract the specified area from 1. Find the entry in the table that
is closest to the result of the subtraction, averaging if the area is
exactly between two table entries. The z-score determined by locating
the corresponding marginal values is the z-score that has the specified
area to its right.
15. The value z is the z-score that has area to its right under the standard
normal curve.
16. The 68.26-95.44-99.74 rule states that for a normally distributed variable:
68.26% of all possible observations lie within one standard deviation to
either side of the mean; 95.44% of all possible observations lie within two
standard deviations to either side of the mean; 99.74% of all possible
observations lie within three standard deviation to either side of the mean.
17. The normal scores for a sample of observations are the observations we would
expect to get for a sample of the same size for a variable having the
standard normal distribution.
18. If we observe the values of a normally distributed variable for a sample,
then a normal probability plot should be roughly linear.
19.
0.8
0.7
0.6
c
0.5
0.4
C2
0.3
0.2
a b
0.1
0.0
-10 0 10
x
20. (a)
0.4
0.3
f(x)
0.2
0.1
0.0
15 16 17 18 19 20 21 22 23
x
(b) z = (x - 18.8)/1.1
(c) 0.4
0.3
f(z)
0.2
0.1
0.0
-4 -3 -2 -1 0 1 2 3 4
z
(d) P(17 < x < 20) = 0.8115
(e) The percentage of men who have forearm lengths less than 16 inches
equals the area under the standard normal curve that lies to the left
of -2.55.
21. (a) The area to the right of 1.05 is 1.0000 - 0.8531 = 0.1469.
(b) The area to the left of -1.05 is 0.1469 (by symmetry).
(c) The area between -1.05 and 1.05 is 0.8531 - 0.1469 = 0.7062.
0.4 0.4
0.3 0.3
f(z)
f(z)
0.2 0.2 0.2709
0.1 0.1
0.0013
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
(c) Area = 0.9970 - 0.8665 = 0.1305 (d) Area = 1.000 - 0.0197 = 0.9803
0.4 0.4
0.3 0.3
0.9803
f(z)
f(z)
0.2 0.2
0.1305
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
(e) Area = 0.0668 - 0.0000 = 0.0668 (f) Area = 0.8413 + (1 - 0.9987) = 0.8426
0.4 0.4
0.8413 + 0.0013
= 0.8426
0.3 0.3
f(z)
f(z)
0.2 0.2
0.0668
0.1 0.1
0.0 0.0
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
z z
24. (a)
40
30
Percent
20
10
0
0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000
Weight
(b) No. The histogram indicates that the data are left skewed.
25. (a) The z-score for x = 1.4 is (1.4 ± 1.3)/0.4 = 0.25. The area to the
left of z = 0.25 is 0.5987, so the percentage of patients receiving
total knee arthroplasty who have a knee hyaluronic acid concentration
below 1.4 mg/ml is 0.5987.
(b) The z-score for x = 1.0 is (1.0 ± 1.3)/0.4 = -0.75 and for x = 2.0,
z = (2.0 ± 1.3)/0.4 = 1.75. The area to the left of z = ±0.75 is
0.2266 and the area to the left of z = 1.75 is 0.9599. Therefore, the
probability that the percentage of patients receiving total knee
arthroplasty who have a knee hyaluronic acid concentration between 1
and 2 mg/ml is 0.9599 ± 0.2266 = 0.7333.
(c) The z-score for x = 2.1 is (2.1 ± 1.3)/0.4 = 2.00. The area to the
left of z = 2.00 is 0.9772, so the percentage of patients receiving
total knee arthroplasty who have a knee hyaluronic acid concentration
above 2.1 mg/ml is 1.0000 ± 0.9772 = 0.0228.
26. (a) An area of 0.2500 lies to the left of z = -0.67; an area of 0.5000 lies
to the left of z = 0.00; and an area of 0.7500 lies to the left of z =
0.67. To convert these z-scores to GRE scores (x), we use x = μ + z).
Thus Q1 = 462 + (-0.67)(119) = 382.27; the median (Q2) = 462 +
(0.00)(119) = 462; and Q3 = 462 + (0.67)(119) = 541.73. 25% of the GRE
scores will be less than 382.27; 50% of the scores will be less than
462; and 75% of the scores will be less than 541.73.
(b) An area of 0.9900 lies to the left of z = 2.33. To convert this z-
scores to a GRE score (x), we use x = μ + z). Thus the 99th percentile
is 462 + (2.33)(119) = 739.27. 99% of the GRE scores will be less than
739.27.
27. (a) 68.26% of students who took the verbal portion of the GRE scored
between 343 and 581.
(b) 95.44% of students who took the verbal portion of the GRE scored
between 224 and 700.
(c) 99.74% of students who took the verbal portion of the GRE scored
between 105 and 819.
28. (a) Order the prices from smallest to largest. Obtain the normal scores
for 12 observations from Table III. The result is the table below.
The normal probability plot is to the left of the table.
Normal
Price score
1.68 -1.64
1.69 -1.11
1.75 -0.79
1.76 -0.53
1.78 -0.31
1.79 -0.1
1.81 0.1
1.85 0.31
1.86 0.53
1.89 0.79
1.89 1.11
2.01 1.64
(b) There do not appear to be any outliers.
(c) The plot is roughly linear, so it seems reasonable to assume that the
data come from a normal distribution.
(d) Using Minitab, enter the data in a column named PRICE. Select Graph
Probability Plot, select Single and click OK. Enter PRICE in the Graph
variables, click on the Distributions button and the Data Display tab.
Check the Symbols only option and click OK. Then click on the Scale
button and the Y-Scale Type tab and select the Score option. Click OK
twice. The resulting graph is
1
Score
-1
-2
29. (a) Using Minitab, enter the data in a column named EMPLOYEES. Select
Graph Probability Plot, select Single and click OK. Enter EMPLOYEES
in the Graph variables, click on the Distributions button and the Data
Display tab. Check the Symbols only option and click OK. Then click
on the Scale button and the Y-Scale Type tab and select the Score
option. Click OK twice. The resulting graph is
Score
0
-1
-2
(b) There are five observations that are considerably greater than the
other 20. If the distribution were otherwise normally distributed,
these might be possible outliers.
(c) The graph is not close to being roughly linear. The data are very
right skewed and hence not normally distributed, so the five possible
outliers may be just part of the pattern of right skewness.
30. For parts (a), (b), and (c), steps 1-3 are as follows:
Step 1: n = 1500; p = 0.80
Step 2: np = 1200; n(1 - p) = 300. Since both np and n(1 - p) are at least
5, the normal approximation can be used.
Step 3:
Px np 1500(0.8) 1200
Vx np(1 p) 1500(0.8)(0.2) 15.49
(a) P(x = 1225):
Step 4: For x = 1224.5 and x = 1225.5, the z-scores are
1224.5 1200 1225.5 1200
z 1.58 and z 1.65
15.49 15.49
The area to the left of z = 1.58 is 0.9429 and the area to the left of
z = 1.65 is 0.9505. The probability that the vaccine will be effective
in exactly 1225 cases is approximately = 0.9505 - 0.9429 = 0.0076.
(b) P(x > 1175):
Step 4: For x = 1174.5, the z-score is
1174.5 1200
z 1.65.
15.49
The area to the left of z = -1.65 is 0.0495. The probability that the
vaccine will be effective in at least 1175 cases is approximately
1.0000 ± 0.0495 = 0.9505.
(c) P(1150 < x < 1250):
Step 4: For x = 1149.5 and x = 1250.5, the z-scores are
1149.5 1200 1250.5 1200
z 3.26 and z 3.26.
15.49 15.49
The area to the left of z = -3.26 is 0.0006 and the area to the left of
z = 3.26 is 0.9994. The probability that the vaccine will be effective
in 11509 to 1250 cases, inclusive, is approximately = 0.9994 - 0.0006 =
0.9988.
2 2
1 1
Score
Score
0 0
-1 -1
-2 -2
-3 -3
10 20 30 40 50 60 70 80 90 100 1.5 2.0 2.5 3.0 3.5 4.0
HSP GPA
2 2
1 1
Score
Score
0 0
-1 -1
-2 -2
-3 -3
20 24 28 32 36 0 20 40 60 80 100 120 140 160
AGE CREDITS
2 2
1 1
Score
Score
0 0
-1 -1
-2 -2
-3 -3
10 15 20 25 30 35 15 20 25 30 35
ENGLISH MATH
1
Score
-1
-2
-3
15.0 17.5 20.0 22.5 25.0 27.5 30.0 32.5
COMP
(b) Cumulative GPA, total earned credits, ACT English score, ACT Math score, and
ACT composite score appear to be approximately normally distributed. Total
HDUQHGFUHGLWVKDVDFRXSOHRIµMRJV¶LQWKHOLQHRQHDWFUHGLWVDQGRQH
around 136 credits. The latter is probably due to education students in the
sample whose credit requirements are greater than for most other students.
(c) Age appears to be far from normally distributed. It is skewed right with
several possible outliers. The plot for high school percentile is roughly
linear except for the right end where it turns upward due to the data being
restricted to values of 100 or less.
(d) Using Minitab with the data entered from the WeissStats CD for the Focus
Database, choose Graph
Histogram..., select the With fit version and
click OK, select HSP GPA AGE CREDITS ENGLISH MATH COMP in the Graph
variables text box and click OK. Our results follow.
300
250
250
200
Frequency
Frequency
200
150
150
100
100
50 50
0 0
0 16 32 48 64 80 96 112 1.0 1.5 2.0 2.5 3.0 3.5 4.0
HSP GPA
Histogram of AGE Histogram of CREDITS
Normal Normal
1800 800
1600 700
1400
600
1200
500
Frequency
Frequency
1000
400
800
300
600
200
400
200 100
0 0
16 20 24 28 32 36 40 0 30 60 90 120 150 180
AGE CREDITS
700 700
600 600
500 500
Frequency
Frequency
400 400
300 300
200 200
100 100
0 0
8 12 16 20 24 28 32 36 15 18 21 24 27 30 33 36
ENGLISH MATH
Histogram of COMP
Normal
900
800
700
600
Frequency
500
400
300
200
100
0
15 18 21 24 27 30 33
COMP
(e) The three ACT scores (ENGLISH, MATH, and COMP) are the closest to being
normally distributed. HSP and GPA have distributions that are truncated at
100 and 4.0 respectively. The result is that the left sides of the
GLVWULEXWLRQVVHHPWRIROORZWKHQRUPDOFXUYHEXWWKHGDWDµSLOHXS¶RQWKH
right side. AGE has a distribution that is right skewed, and CREDITS has a
tri-modal distribution with peaks around 30, 60, and 90 credits where you
expect them for students who have finished their Freshman, Sophomore, and
Junior years. Thus, the sample is representative of the population for most
of the variables. Credits is the variable with the greatest difference
between the sample and the population.
0.15
relative frequency
0.10
0.05
0.00
33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
Chest size
(b) A normal curve with mean 39.85 and standard deviation 2.07 is as follows.
Normal Curve
Normal, Mean=39.85, StDev=2.07
0.20
0.15
0.10
0.05
0.00
35.0 37.5 40.0 42.5 45.0 47.5
341
0.15
relative frequency
0.10
0.05
0.00
33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
Chest size
(b) A normal curve with mean 39.85 and standard deviation 2.07 is as follows.
Normal Curve
Normal, Mean=39.85, StDev=2.07
0.20
0.15
0.10
0.05
0.00
35.0 37.5 40.0 42.5 45.0 47.5
BY EDITH RICKERT
THE wide attention that lately has been attracted to the influence of the
fraternities in men’s colleges and universities, from the social and
democratic points of view, has suggested the treatment of the somewhat
similar societies in colleges for women. At our request, Miss Rickert has
made a comprehensive and impartial investigation, the results of which
are embodied in the present paper and in one to follow, entitled, “What
Can We Do About It?” Parents, college authorities, fraternity members,
and other students will be interested in the facts and conclusions here
presented.—THE EDITOR.
So far I have said little about the girl who is left out. To many of
the fraternity women she seems the most deplorable feature of the
system. Except in the college where the societies contain all but a
very few of the students, her case does not seem to me serious. To
be sure, she misses a great deal of fun and social training; but if she
is worth saving, she saves herself in the end; and if she gives up in
despair and goes home because she does not “make a frat,” she is
not of the stuff that the college needs. Probably some sensitive girls
are embittered for a long time by the slight, but the finer ones
conquer, even though the sting lingers throughout their course. A
large number in the great mass of the students are unaffected by the
fraternity problems. They do not expect or wish to get in, and they
have virtually very little to do with the fraternity girls except as they
work in the same student organizations. They have their own sets,
their own social life, averaging more good times, perhaps, than they
are credited with by their pitying superior sisters. What the university
fails to do to counteract the effects of poverty, ill breeding, bad
preparation, and inexperience is another thing altogether. Individual
tragedies settle themselves, and those who win come out stronger,
finer women for their victory over adverse conditions than any
fraternity girl for whom the way has been made smooth.
But the fraternity idea must be judged not so much by those
whom it shuts out from special privilege as by the results that it
produces in those whom it fosters.
The Pan Hellenic Society believes itself specially chosen and
trained for service. And what has it done? Aside from a vague and
general interest in alumnæ activities, this service is reduced to
scholarships, some isolated attempts in education and philanthropy,
a certain “dynamic force” upon the character of its members,
scarcely apparent to outsiders, and continued perfection of
organization, thus far for no more evident purpose than the reform of
its own body.