Professional Documents
Culture Documents
Full Statistics For Business and Economics 12Th Edition Mcclave Solutions Manual Online PDF All Chapter
Full Statistics For Business and Economics 12Th Edition Mcclave Solutions Manual Online PDF All Chapter
Full Statistics For Business and Economics 12Th Edition Mcclave Solutions Manual Online PDF All Chapter
https://testbankdeal.com/product/statistics-for-business-and-
economics-12th-edition-mcclave-test-bank/
https://testbankdeal.com/product/statistics-for-business-and-
economics-13th-edition-mcclave-solutions-manual/
https://testbankdeal.com/product/statistics-for-business-and-
economics-13th-edition-mcclave-test-bank/
https://testbankdeal.com/product/statistics-for-business-and-
economics-12th-edition-anderson-solutions-manual/
Statistics for Business and Economics Revised 12th
Edition Anderson Solutions Manual
https://testbankdeal.com/product/statistics-for-business-and-
economics-revised-12th-edition-anderson-solutions-manual/
https://testbankdeal.com/product/statistics-12th-edition-mcclave-
test-bank/
https://testbankdeal.com/product/first-course-in-statistics-12th-
edition-mcclave-solutions-manual/
https://testbankdeal.com/product/statistics-for-business-and-
economics-12th-edition-anderson-test-bank/
https://testbankdeal.com/product/statistics-for-business-and-
economics-revised-12th-edition-anderson-test-bank/
Chapter 10
Categorical Data Analysis
10.1 a. The rejection region requires .05 in the upper tail of the 2 distribution with df k 1 3 1 2 .
From Table IV, Appendix D, .05
2
5.99147 . The rejection region is 2 5.99147 .
b. The rejection region requires .10 in the upper tail of the 2 distribution with df k 1 5 1 4 .
From Table IV, Appendix D, .10
2
7.77944 . The rejection region is 2 7.77944 .
c. The rejection region requires .01 in the upper tail of the 2 distribution with df k 1 4 1 3 .
From Table IV, Appendix D, .01
2
11.3449 . The rejection region is 2 11.3449 .
The characteristics of the binomial are the same as those for the multinomial with k 2 .
10.3 The sample size n will be large enough so that, for every cell, the expected cell count, Ei, will be equal to 5
or more.
ni Ei
2
(78 80) 2 (60 80) 2 (182 160) 2
The test statistic is 2 8.075
Ei 80 80 160
The rejection region requires .05 in the upper tail of the 2 distribution with df k 1 3 1 2 . From
Table IV, Appendix B, .05
2
5.99147 . The rejection region is 2 5.99147 .
Since the observed value of the test statistic falls in the rejection region ( 2 8.075 5.99147) , H0 is
rejected. There is sufficient evidence to indicate that at least one of the probabilities differs from its
hypothesized value at .05 .
564
Copyright © 2014 Pearson Education, Inc.
Categorical Data Analysis 565
If the probabilities are the same, p1,0 p2,0 p3,0 p4,0 .25
H 0 : p1 p2 p3 p4 .25
H a : At lease one of the probabilities differs from .25
The rejection region requires .05 in the upper tail of the 2 distribution with df k 1 4 1 3 .
From Table IV, Appendix D, .05
2
7.81473 . The rejection region is 2 7.81473 .
Since the observed value of the test statistic does not fall in the rejection region
( 2 3.293 7.81473) , H0 is not rejected. There is insufficient evidence to indicate the multinomial
probabilities differ at .05 .
b. The Type I error is concluding the multinomial probabilities differ when, in fact, they do not.
The Type II error is concluding the multinomial probabilities are equal, when, in fact, they are not.
c. For confidence coefficient .95, .05 and / 2 .05 / 2 .025 . From Table II, Appendix D,
z.025 1.96 .
pˆ 3 59 / 205 .288
10.6 a. The data are categorical because they are measured using categories, not meaningful numbers. The
possible categories are legs only, wheels only, both legs and wheels, and neither legs nor wheels.
b. Let p1 proportion of social robots with legs only, p2 proportion of social robots with wheels only,
p3 proportion of social robots with both legs and wheels, and p4 proportion of social robots with
neither legs nor wheels. To determine if the design engineer’s claim is incorrect, we test:
ni Ei
2
(63 53) 2 (20 31.8) 2 (8 10.6) 2 (15 10.6) 2
d. The test statistic is 2 8.730
Ei 53 31.8 10.6 10.6
e. The rejection region requires .05 in the upper tail of the 2 distribution with df k 1 4 1 3 .
From Table IV, Appendix D, .05
2
7.81473 . The rejection region is 2 7.81473 .
Since the observed value of the test statistic falls in the rejection region ( 2 8.730 7.81473) , H0 is
rejected. There is sufficient evidence to indicate that at least one of the probabilities differs from its
hypothesized value at. .05 .
10.7 a. Let p1 proportion using total visitors, p2 proportion using paying visitors, p3 proportion using
big shows, p4 proportion using funds raised, and p5 proportion using members.
To determine if one performance measure is used more often than any of the others, we test:
H 0 : p1 p2 p3 p4 p5 .20
H a : At least one of the probabilities differs from the hypothesized value
From the printout, the test statistic is 2 1.66667 and the p-value is p 0.797 .
Since the p-value is not less than ( p .797 .10) , H0 is not rejected. There is insufficient evidence
to indicate that one performance measure is used more often than any of the others at .10 .
b. For confidence coefficient .90, .10 and / 2 .10 / 2 .05 . From Table II, Appendix D,
z.05 1.645 .
pˆ 1 8 / 30 .267
We are 90% confident that the proportion of museums world-wide that use total visitors as their
performance measure is between .134 and .400.
10.8 a. The categorical variable is the rating of the student exposure to social and environmental issues. It
has 5 levels: 1-star, 2-stars, 3-stars, 4-stars, and 5-stars.
b. If there were no difference in the category proportions, then each proportion should be
pi 1 / 5 .20 . There were a total of n = 30 business schools sampled. The expected number would
be: E1 E2 E3 E4 E5 npi ,0 30(.20) 6
c. To determine if there are differences in the star rating category proportions of all MBA programs, we
test:
H 0 : p1 p2 p3 p4 p5 .20
H a : At lease one of the probabilities differs from the hypothesized value
ni Ei 2 6 9 6 14 6 5 6 0 6
2 2 2 2 2 2
e. The rejection region requires .05 in the upper tail of the 2 distribution with df k 1 5 1 4 .
From Table IV, Appendix D, .05
2
9.48773 . The rejection region is 2 9.48773 .
f. Since the observed value of the test statistic falls in the rejection region ( 2 21 9.48773) , H0 is
rejected. There is sufficient evidence to indicate differences in the star rating category proportions of
all MBA programs at .05 .
x3 14
g. Some preliminary calculations are: pˆ 3 .467
n 30
For confidence coefficient .95, .05 and / 2 .05 / 2 .025 . From Table II, Appendix D,
z.025 1.96 . The 95% confidence interval is:
pˆ 3 qˆ3 .467(.533)
pˆ 3 z.025 .467 1.96 .467 .179 (.288, .646)
n 30
We are 95% confident that the proportion of all MBA programs that are ranked in the 3-star category
is between .288 and .646.
10.9 a. Since there are 10 income groups, we would expect 10% or 1, 072(.10) 107.2 givers in each of the
income categories.
b. The null hypothesis for testing whether the true proportions of charitable givers in each income group
are the same is:
H 0 : p1 p2 p10 .10
d. The rejection region requires .10 in the upper tail of the 2 distribution with df k 1 10 1 9 .
From Table IV, Appendix D, .10
2
14.6837 . The rejection region is 2 14.6837 .
e. Since the observed value of the test statistic falls in the rejection region ( 2 93.15 14.6837) , H0 is
rejected. There is sufficient evidence to indicate that the true proportions of charitable givers in each
income group are not all the same at .10 .
10.10 a. The qualitative variable is firm position on off-shoring. There are four levels: “currently off-shoring,”
“not currently off-shoring, but plan to do so,” “off-shored in the past, but no more,” and “off-shoring is
not applicable.”
b. Let p1 proportion of firms currently off-shoring, p2 proportion of firms not currently off-shoring,
but plan to do so, p3 proportion of firms off-shored in the past, but no more, and p4 proportion of
firms where off-shoring is not applicable.
Copyright © 2014 Pearson Education, Inc.
568 Chapter 10
To determine if the proportions of U.S. firms in the four off-shoring position categories is significantly
different, we test:
H 0 : p1 p2 p3 p4 .25
H a : At least one of the probabilities differs from the hypothesized value
The rejection region requires .05 in the upper tail of the 2 distribution with df k 1 4 1 3 .
From Table IV, Appendix D, .05
2
7.81473 . The rejection region is 2 7.81473 .
Since the observed value of the test statistic falls in the rejection region ( 2 468.96 7.81473) , H0
is rejected. There is sufficient evidence to indicate that at least one of the proportions of U.S. firms in
the four off-shoring position categories is significantly different at .05 .
c. For confidence coefficient .95, .05 and / 2 .05 / 2 .025 . From Table II, Appendix D,
z.025 1.96 .
We are 95% confident that the proportion of U.S. firms who are currently off-shoring is between .177
and .243.
10.11 Let p1 proportion users using both hands/both thumbs, p2 proportion of users using right hand/right
thumb, p3 proportion of users using left hand/left thumb, p4 proportion of users using both hands/right
index finger, p5 proportion of users using left hand/right index finger and p6 proportion of users using
other. Some preliminary calculations: E1 E2 E3 E4 E5 E6 npi,0 859 1/ 6 143.167 .
To determine if the proportions of mobile device users in the six texting style categories differ, we test:
H 0 : p1 p2 p3 p4 p5 p6 1 / 6
H a : At least one of the probabilities differs from the hypothesized value
2 i
Ei 143.167 143.167 143.167 143.167
(18 143.167) 2 (25 143.167) 2
756.436
143.167 143.167
The rejection region requires .10 in the upper tail of the 2 distribution with df k 1 6 1 5 . From
Table IV, Appendix D, .10
2
9.23635 . The rejection region is 2 9.23635 .
Since the observed value of the test statistic falls in the rejection region ( 2 756.436 9.23635) , H0 is
rejected. There is sufficient evidence to indicate that the proportions of mobile device users in the six
texting style categories differ at .10 .
10.12 Let p1 proportion of anchor tenants, p2 proportion of major space users, p3 proportion of large
standard tenants, p4 proportion of small standard tenants, and p5 proportion of small tenants. Some
preliminary calculations:
ni Ei
2
(14 18.21) 2 (61 91.05) 2 (216 182.1) 2 (711 728.4) 2
2
Ei 18.21 91.05 182.1 728.4
(819 801.24) 2
18.011
801.24
The rejection region requires .01 in the upper tail of the 2 distribution with df k 1 5 1 4 . From
Table IV, Appendix D, .01
2
13.2767 . The rejection region is 2 13.2767 .
Since the observed value of the test statistic falls in the rejection region ( 2 18.011 13.2767) , H0 is
rejected. There is sufficient evidence to indicate that the proportions of tenants in the five categories differ
from the developer’s belief at .01 .
10.13 a. The data come from a multinomial experiment because there are several possible categorical
responses to the question.
Test Contribution
Category Observed Proportion Expected to Chi-Sq
1 869 0.50 1059.50 34.252
2 339 0.10 211.90 76.236
3 338 0.10 211.90 75.041
4 127 0.10 211.90 34.016
5 85 0.10 211.90 75.996
6 128 0.05 105.95 4.589
7 233 0.05 105.95 152.352
N DF Chi-Sq P-Value
2119 6 452.483 0.000
The test statistic is 2 452.843 and the p-value is p 0.000 . Since the p-value is less that .01 ,
H0 is rejected. There is sufficient evidence to indicate that at least one of the proportions differs from
its hypothesized value at .01 .
To determine if the percentages of all adults falling into the four response categories changed after the
Enron scandal, we test:
2
Ei 910.35 708.05 303.45 101.15
149.096
The rejection region requires .01 in the upper tail of the 2 distribution with df k 1 4 1 3 . From
Table IV, Appendix D, .01
2
11.3449 . The rejection region is 2 11.3449 .
Since the observed value of the test statistic falls in the rejection region ( 2 149.096 11.3449) , H0 is
rejected. There is sufficient evidence to indicate the percentages of all adults falling into the four response
categories changed after the Enron scandal at .01 .
10.15 Let p1 proportion of mail only users, p2 proportion of Internet only users, and p3 proportion of both
mail and Internet. Some preliminary calculations:
H 0 : p1 p2 p3 1 / 3
H a : At least one of the probabilities differs from the hypothesized value
ni Ei
2
(262 146.667) 2 (43 146.667) 2 (135 146.667) 2
2 164.895
Ei 146.667 146.667 146.667
The rejection region requires .01 in the upper tail of the 2 distribution with df k 1 3 1 2 . From
Table IV, Appendix D, .01
2
9.21034 . The rejection region is 2 9.21034 .
Since the observed value of the test statistic falls in the rejection region ( 2 164.895 9.21034) , H0 is
rejected. There is sufficient evidence to indicate that the proportions mail only, Internet only, and both
mail and Internet users differ at .01 .
To determine if the data from the independent survey contradict the percentages reported by the CPS Cell
Phone Supplement, we test:
2
6.132
Ei 480.93 348.91 84.87 29.29
The rejections region requires .10 in the upper tail of the 2 distribution with df k 1 4 1 3 .
From Table IV, Appendix D, .10
2
6.25139. The rejection region is 2 6.25139 .
Since the test statistic does not fall in the rejection region ( 2 6.132 6.25139) , H0 is not rejected.
There is insufficient evidence to indicate the data from the independent survey contradict the percentages
reported by the CPS Cell Phone Supplement at .10 .
10.17 To determine if the number of overweight trucks per week is distributed over the 7 days of the week in
direct proportion to the volume of truck traffic, we test:
2
Ei 79.074 81.972 77.418 74.520
51 64.170 18 17.802 31 19.044
2 2 2
12.374
64.170 17.802 19.044
The rejection region requires .05 in the upper tail of the 2 distribution with df k 1 7 1 6 . From
Table IV, Appendix D, .05 12.5916 . The rejection region is 2 12.5916 .
2
Since the observed value of the test statistic does not fall in the rejection region ( 2 12.374 12.5916) , H0
is not rejected. There is insufficient evidence to indicate the number of overweight trucks per week is
distributed over the 7 days of the week is not in direct proportion to the volume of truck traffic at .05 .
To determine if the House of Representatives is not statistically representative of the religious affiliations
of their constituents, we test:
Since no value of was given, we will use .05 . The rejections region requires .05 in the upper tail
of the 2 distribution with df k 1 4 1 3 . From Table IV, Appendix D, .05
2
7.81473 . The
rejection region is 2 7.81473 .
Since the test statistic falls in the rejection region ( 2 174.169 7.81473) , Ho is rejected. There is sufficient
evidence to indicate the House of Representatives is not statistically representative of the religious
affiliations of their constituents at .05 .
[nij Eˆij ]2
The test statistic is
2
b. .
Eˆij
The rejection region requires .01 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (2 1)(3 1) 2 . From Table IV, Appendix D, .01
2
9.21034 . The rejection
region is 2 9.21034 .
Since the observed value of the test statistic does not fall in the rejection region
( 2 8.71 9.21034) , H0 is not rejected. There is insufficient evidence to indicate the row and
column classifications are dependent at .01 .
10.21 a. To convert the frequencies to percentages, divide the numbers in each column by the column total
and multiply by 100. Also, divide the row totals by the overall total and multiply by 100. The
column totals are 25, 64, and 78, while the row totals are 96 and 71. The overall sample size is 165.
The table of percentages are:
Column
1 2 3
Row 1 9 34 53 96
100 36% 100 53.1% 100 67.9% 100 57.5%
25 64 78 167
2 16 30 25 71
100 64% 100 46.9% 100 32.1% 100 42.5%
25 64 78 167
70
60
57.5
50
40
Percent
30
20
10
0
1 2 3
Column
c. If the rows and columns are independent, the row percentages in each column would be close to the
row total percentages. This pattern is not evident in the plot, implying the rows and columns are not
independent. In Exercise 10.20, we did not have enough evidence to say the rows and columns were
not independent. If the sample sizes were bigger, we would have been able to reject H0.
The rejection region requires .05 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (3 1)(3 1) 4 . From Table IV, Appendix D, .05
2
9.48773 . The rejection region is
2 9.48773 .
Since the observed value of the test statistic falls in the rejection region ( 2 12.36 9.48773) , H0 is
rejected. There is sufficient evidence to indicate the row and column classification are dependent at
.05 .
10.23 a-b. To convert the frequencies to percentages, divide the numbers in each column by the column total
and multiply by 100. Also, divide the row totals by the overall total and multiply by 100.
B
B1 B2 B3 Totals
A1 40 72 42 154
100 29.9% 100 44.2% 100 29.6% 100 35.1%
134 163 142 439
A2 63 53 70 186
Row 100 47.0% 100 32.5% 100 49.3% 100 42.4%
134 163 142 439
A3 31 38 30 99
100 23.1% 100 23.3% 100 21.1% 100 22.6%
134 163 142 439
50
40
35.1
30
Percent
20
10
0
1 2 3
B
The graph supports the conclusion that the rows and columns are not independent. If they were, then
the height of all the bars would be essentially the same.
50
42.4
40
30
Percent
20
10
0
1 2 3
B
The graph supports the conclusion that the rows and columns are not independent. If they were, then
the height of all the bars would be essentially the same.
25
22.6
20
15
Percent
10
0
1 2 3
B
The graph does not support the conclusion that the rows and columns are not independent. All the
bars would be essentially the same.
10.24 a. The two qualitative variables are model of Accord and injury (yes or no).
b. The contingency table is:
c. To determine if the injury rate for collision claims depends on Accord model, we test:
R C 50,132(5,501) RC 50,132(46,136)
Eˆ11 1 1 5,340.67 Eˆ12 1 2 44, 791.33
n 51, 637 n 51, 637
2
nij Eˆ ij
5,364 5,340.67 44, 768 44, 791.33
2 2
2
3.91
160.33 1,344.67
This agrees with the test statistic found on the XLSTAT printout.
f. The rejection region requires .05 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (2 1)(2 1) 1 . From Table IV, Appendix D, .05
2
3.84146 . The rejection
region is 2 3.84146 . This is the same critical value found on the XLSTAT printout.
g. Since the observed value of the test statistic falls in the rejection region ( 2 3.91 3.84146) , H0 is
rejected. There is sufficient evidence to indicate the injury rate for collision claims depends on
Accord model at .05 .
Since the p-value is less than ( p .0479 .05) , H0 is rejected. There is sufficient evidence to
indicate the injury rate for collision claims depends on Accord model at .05 .
x1 5,364 x2 137
pˆ1 .107 pˆ 2 .091
n1 50,132 n2 1,505
For confidence coefficient .95, .05 and / 2 .05 / 2 .025 . From Table II, Appendix D,
z.025 1.96 . The 95% confidence interval is:
Since the interval contains only positive numbers, the injury rate for hybrid Accords is less than the
injury rate for conventional Accords.
10.25 a. Yes, it appears that the male and female tourists differ in their responses to purchasing photographs,
postcards, and paintings. The values in the ‘Always’ and ‘Rarely or Never’ categories are quite
different. The percentages are insufficient to draw a conclusion because the sample sizes must be
taken into account.
b. The counts are found by changing the percentages to proportions and multiplying the proportions by
the sample sizes in each gender. The counts are:
c. To determine whether male and female tourists differ in their responses to purchasing photographs,
postcards, or paintings, we test:
e. Since the p-value is less than ( p .000 .01) , H0 is rejected. There is sufficient evidence to
indicate male and female tourists differ in their responses to purchasing photographs, postcards, or
paintings at .01 .
10.26 a. The sample proportion of negative tone news stories that are deceptive is 111 / 170 .653 .
b. The sample proportion of neutral tone news stories that are deceptive is 61 / 110 .555 .
c. The sample proportion of positive tone news stories that are deceptive is 11 / 31 .355 .
d. Yes, it appears that the proportion of news stories that are deceptive depends on the story tone. The
proportion that is deceptive for negative tone stories is .653, while the proportion that is deceptive for
positive tone stories is only .355. These proportions look much different.
Since the p-value is less than ( p .005 .05) , Ho is rejected. There is sufficient evidence to
indicate authenticity of a news story depends on tone at .05 .
10.27 a. To compare the two proportions, we could use either a test of hypothesis or a confidence interval. I
will use a 95% confidence interval.
xM 1 29 xF 1 89
pˆ M 1 .282 pˆ F 1 .511
nM 103 nF 174
For confidence coefficient .95, .05 and / 2 .05 / 2 .025 . From Table II, Appendix D,
z.025 1.96 . The 95% confidence interval is:
We are 95% confident that the difference in the proportions of male and female professionals who
believe their salaries are too low is between .343 and .115. Since 0 is not in this interval, there is
evidence that the two proportions are different.
xM 2 58 xF 2 64
pˆ M 2 .563 pˆ F 2 .368
nM 103 nF 174
For confidence coefficient .95, .05 and / 2 .05 / 2 .025 . From Table II, Appendix D,
z.025 1.96 . The 95% confidence interval is:
pˆ M 2 qˆ M 2 pˆ F 2 qˆ F 2 .563(.437) .368(.632)
pˆ M 2 pˆ F 2 z.025 .563 .368 1.96
nM nF 103 174
.195 .120 (.075, .315)
We are 95% confident that the difference in the proportions of male and female professionals who
believe their salaries are equitable/fair is between .075 and .315. Since 0 is not in this interval, there
is evidence that the two proportions are different.
xM 3 16 xF 3 21
pˆ M 3 .155 pˆ F 3 .121
nM 103 nF 174
For confidence coefficient .95, .05 and / 2 .05 / 2 .025 . From Table II, Appendix D,
z.025 1.96 . The 95% confidence interval is:
pˆ M 3 qˆ M 3 pˆ F 3 qˆ F 3 .155(.845) .121(.879)
pˆ M 3 pˆ F 3 z.025 .155 .121 1.96
nM nF 103 174
.034 .085 (.051, .119)
We are 95% confident that the difference in the proportions of male and female professionals who
believe they are well paid is between .051 and .119. Since 0 is in this interval, there is no evidence
that the two proportions are different.
d. Yes. Since there were differences between the proportions of males and females on 2 of the 3 levels,
there is evidence that the opinions of males and females are different.
R C 118(103) RC 118(174)
Eˆ11 1 1 43.877 Eˆ12 1 2 74.123
n 277 n 277
R C 122(103) RC 122(174)
Eˆ 21 2 1 45.365 Eˆ 22 2 2 76.635
n 277 n 277
R C 37(103) RC 37(174)
Eˆ 31 3 1 13.758 Eˆ 33 3 3 23.242
n 277 n 277
To determine if the opinion on the fairness of a travel professional’s salary differ for males and
females, we test:
2
Eˆ ij
43.877 74.123 45.365
14.214
76.635 13.758 23.242
The rejection region requires .10 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (3 1)(2 1) 2 . From Table IV, Appendix D, .10
2
4.60517 . The rejection
region is 2 4.60517 .
Since the observed value of the test statistic falls in the rejection region ( 2 14.214 4.60517) , H0
is rejected. There is sufficient evidence to indicate that the opinions on the fairness of a travel
professional’s salary differ for males and females at .10 .
f. For confidence coefficient .90, .10 and / 2 .10 / 2 .05 . From Table II, Appendix D,
z.05 1.645 . The 90% confidence interval is:
We are 90% confident that the difference in the proportions of male and female professionals who
believe their salaries are too low is between -.325 and -.133. Since 0 is not in this interval, there is
evidence that the two proportions are different.
10.28 a. Let p3 proportion of the 3-photos per page group who selected the target mugshot, p6 proportion
of the 6-photos per page group who selected the target mugshot, and p12 proportion of the 12-photos
per page group who selected the target mugshot.
19 19 15
pˆ 3 .594 , pˆ 6 .594 , pˆ12 .469
32 32 32
Target Target
Mugshot Mugshot not Total
selected selected
3-photos per page 19 13 32
3-photos per page 19 13 32
3-photos per page 15 17 32
Total 53 43 96
To determine if there are differences in the proportions who selected the target mugshot among the
three photo groups, we test:
2
nij Eˆ ij
19 17.667 13 14.333 19 17.667 13 14.333
2 2 2 2
2
1.348
17.667 14.333
The rejection region requires .10 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (3 1)(2 1) 2 . From Table IV, Appendix D, .10
2
4.60517 . The rejection
region is 2 4.60517 .
Since the observed value of the test statistic does not fall in the rejection region
( 2 1.348 4.60517) , H0 is not rejected. There is insufficient evidence to indicate that there are
differences in the proportions who selected the target mugshot among the three photo groups at
.10 .
Using frequencies in Fr
1 2 3 4 All
1 126 75 35 93 329
2 72 36 10 27 145
3 30 9 4 6 49
4 372 180 51 174 777
All 600 300 100 300 1300
From the printout, the test statistic is 2 21.242 and the p-value is p .012 . Since the p-value is less than
( p .012 .05) , H0 is rejected. There is sufficient evidence to indicate a firm’s position on off-shoring
depends on the firm’s nationality at .05 .
The rejection region requires .10 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (3 1)(2 1) 2 . From Table IV, Appendix D, .10
2
4.60517 . The rejection region is
2 4.60517 .
Since the observed value of the test statistic falls in the rejection region ( 2 72.234 4.60517) , H0 is
rejected. There is sufficient evidence to indicate that the option choice depends on emotion state at .10 .
To determine if the proportions of mobile device users in the six texting style categories depend on whether
a male or female are texting, we test:
2
nij Eˆ ij
161 154.435 235 241.565 14 15.250 4.209
2 2 2
2
Eˆ ij
154.435 241.565 15.250
The rejection region requires .10 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (6 1)(2 1) 5 . From Table IV, Appendix D, .10
2
9.23635 . The rejection region is
2 9.23635 .
Since the observed value of the test statistic does not fall in the rejection region ( 2 4.209 9.23635) , H0
is not rejected. There is insufficient evidence to indicate the proportions of mobile device users in the six
texting style categories depend on whether a male or female are texting at .10 .
R C 234(40) RC 234(397)
Eˆ11 1 1 21.419 Eˆ12 1 2 212.581
n 437 n 437
R C 203(40) RC 203(397)
Eˆ 21 2 1 18.581 Eˆ 22 2 2 184.419
n 437 n 437
To determine if the response rate of air traffic controllers to mid-air collision alarms differs for true and
false alerts, we test:
2
nij Eˆ ij
3 21.419 231 212.581 37 18.581 166 184.419 37.533
2 2 2 2
2
Eˆ ij
21.419 212.581 18.581 184.419
The rejection region requires .05 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (2 1)(2 1) 1 . From Table IV, Appendix D, .05
2
3.84146 . The rejection region is
2 3.84146 .
Since the observed value of the test statistic falls in the rejection region ( 2 37.533 3.84146) , H0 is
rejected. There is sufficient evidence to indicate the response rate of air traffic controllers to mid-air
collision alarms differs for true and false alerts at .05 .
2
nij Eˆ ij
15 10.667 17 21.333 14 10.667 18 21.333
2 2 2 2
2
Eˆ ij
10.667 21.333 10.667 21.333
3 10.667 29 21.333
2 2
12.469
10.667 21.333
The rejection region requires .01 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (3 1)(2 1) 2 . From Table IV, Appendix D, .01
2
9.21034 . The rejection region is
2 9.21034 .
Since the observed value of the test statistic falls in the rejection region ( 2 12.469 9.21034) , H0 is
rejected. There is sufficient evidence to indicate that the proportion of subjects who selected menus
consistent with the theory depends on goal condition at .01 .
10.34 Using MINITAB, the results of the table comparing type of coupon user and gender are:
The test statistic is 2 6.797 and the p-value is p .033 . Since the p-value is not less than
( p .033 .01) , H0 is not rejected. There is insufficient evidence to indicate type of coupon user
depends on gender at .01 .
Using MINITAB, the results of the table comparing type of coupon user and coupon usage satisfaction
level are:
To determine if type of coupon user depends on coupon usage satisfaction level, we test:
H 0 : Type of coupon user and coupon usage satisfaction level are independent
H a : Type of coupon user and coupon usage satisfaction level are dependent
The test statistic is 2 30.418 and the p-value is p .000 . Since the p-value is less than
( p .000 .01) , H0 is rejected. There is sufficient evidence to indicate type of coupon user depends on
coupon usage satisfaction level at .01 .
Cue 5 6 13 24
20.83 25.00 54.17 100.00
35.71 35.29 76.47 50.00
Pattern 9 11 4 24
37.50 45.83 16.67 100.00
64.29 64.71 23.53 50.00
All 14 17 17 48
29.17 35.42 35.42 100.00
100.00 100.00 100.00 100.00
From the printout, the test statistic is 2 7.378 and the p-value is p .025 .
Since the p-value is less than ( p .025 .05) , H0 is rejected. There is sufficient evidence to indicate the
choice of heuristic strategy depends on type of instruction at .05 .
Since the p-value is not less than ( p .025 .01) , H0 is not rejected. There is insufficient evidence to
indicate the choice of heuristic strategy depends on type of instruction at .01 .
10.36 a. Using MINITAB, the results for the First Trial are:
Switch
Boxes No Switch Total
1 10 17 27
6.50 20.50
1.885 0.598
2 3 24 27
6.50 20.50
1.885 0.598
3 5 22 27
6.50 20.50
0.346 0.110
4 8 19 27
6.50 20.50
0.346 0.110
Total 26 82 108
Chi-Sq = 5.876, DF = 3, P-Value = 0.118
To determine if the likelihood of switching boxes depends on condition for the first trial, we test:
H0: Likelihood of switching boxes and condition are independent
Ha: Likelihood of switching boxes and condition are dependent
From the printout above, the test statistic is 2 5.876 and the p-value is p 0.118 . Since the p-value
is not small, Ho is not rejected. There is insufficient evidence to indicate that the likelihood of
switching boxes depends on condition for the first trial for any value of .118 .
2 12 15 27
18.75 8.25
2.430 5.523
3 21 6 27
18.75 8.25
0.270 0.614
4 19 8 27
18.75 8.25
0.003 0.008
Total 75 33 108
Chi-Sq = 12.000, DF = 3, P-Value = 0.007
To determine if the likelihood of switching boxes depends on condition for the last trial, we test:
From the printout above, the test statistic is 2 12.00 and the p-value is p 0.007 . Since the p-value
is small, H0 is rejected. There is sufficient evidence to indicate that the likelihood of switching boxes
depends on condition for the last trial for any value of .007 .
Switch
Boxes No Switch Total
1 10 17 27
16.50 10.50
2.561 4.024
2 23 4 27
16.50 10.50
2.561 4.024
Total 33 21 54
Chi-Sq = 13.169, DF = 1, P-Value = 0.000
To determine if the likelihood of switching boxes depends on trial number for the Empty condition, we
test:
From the printout above, the test statistic is 2 13.169 and the p-value is p 0.000 . Since the p-
value is so small, H0 is rejected. There is sufficient evidence to indicate that the likelihood of
switching boxes depends on trial number for the Empty condition for any value of .000 .
Switch
Boxes No Switch Total
1 3 24 27
7.50 19.50
2.700 1.038
2 12 15 27
7.50 19.50
2.700 1.038
Total 15 39 54
Chi-Sq = 7.477, DF = 1, P-Value = 0.006
To determine if the likelihood of switching boxes depends on trial number for the Vanish condition,
we test:
From the printout above, the test statistic is 2 7.477 and the p-value is p 0.006 . Since the p-value
is so small, Ho is rejected. There is sufficient evidence to indicate that the likelihood of switching
boxes depends on trial number for the Vanish condition for any value of .006 .
2 21 6 27
13.00 14.00
4.923 4.571
Total 26 28 54
Chi-Sq = 18.989, DF = 1, P-Value = 0.000
To determine if the likelihood of switching boxes depends on trial number for the Steroids condition,
we test:
From the printout above, the test statistic is 2 18.989 and the p-value is p .000 . Since the p-value
is so small, H0 is rejected. There is sufficient evidence to indicate that the likelihood of switching
boxes depends on trial number for the Steroids condition for any value of .000 .
Switch
Boxes No Switch Total
1 8 19 27
13.50 13.50
2.241 2.241
2 19 8 27
13.50 13.50
2.241 2.241
Total 27 27 54
Chi-Sq = 8.963, DF = 1, P-Value = 0.003
To determine if the likelihood of switching boxes depends on trial number for the Steroids2 condition,
we test:
H0: Likelihood of switching boxes and trial number are independent
Ha: Likelihood of switching boxes and trial number are dependent
From the printout above, the test statistic is 2 8.963 and the p-value is p .003 . Since the p-value is
so small, H0 is rejected. There is sufficient evidence to indicate that the likelihood of switching boxes
depends on trial number for the Steroids2 condition for any value of .003 .
c. Of all the tests performed, only one was not significant. There was no evidence that the likelihood of
switching boxes depended on condition for the first trial. All other tests indicated that the variables
were dependent. Thus, both condition and trial number influence a subject to switch.
10.37 a. To determine if the vaccine is effective in treating the MN strain of HIV, we test:
H0: Vaccine status and MN strain are independent
Ha: Vaccine status and MN strain are dependent
From the printout the test statistic is 2 4.411 and the p-value is p 0.036 . Since the p-value is
less than ( p .036 .05) , H0 is rejected. There is sufficient evidence to indicate that the vaccine is
effective in treating the MN strain of HIV at .05 .
b. We must assume that we have a random sample from the population of interest. We cannot really
check this assumption. The second assumption is that all expected cell counts will be 5 or more. In
this case, since there are only 7 observations in the second row, there is no way that the expected cell
counts in that row will both be 5 or more (the sum of the expected cell counts in the row must sum to
the observed row total).
d. If vaccine status and MN are independent, then the proportion of those in each group that are positive
should be very similar. In these two additional tables, the proportion of positive results for the
unvaccinated group is increasing and the proportion of positive results for the vaccinated group is
decreasing.
Table 1:
Table 2:
e. The p-value is 04378 .00571 .00027 .04976 . Since the p-value is less than ( p .04976 .05) ,
H0 is rejected. There is sufficient evidence to indicate that the vaccine is effective in treating the MN
strain of HIV at .05 .
2
nij Eˆ ij
(20 10) (30 44) 54.14
2 2
The test statistic is
2
Eˆ ij 10 44
The rejection region requires .05 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (3 1)(3 1) 4 . From Table IV, Appendix D, .05
2
9.48773 . The rejection
region is 2 9.48773 .
Since the observed value of the test statistic falls in the rejection region ( 2 54.15 9.48773) , H0
is rejected. There is sufficient evidence to indicate a dependence between rows and columns at
.05 .
c. Yes, the assumptions differ. If the row and column totals are not fixed, then we assume that we take
a random sample form a multinomial distribution. If the row totals are fixed, then we assume that we
are taking k random samples from k multinomial populations.
Column
1 2 3 Totals
20 20 10 50
1 100% 40% 100% 22.2% 100% 9.1% 100% 20%
50 90 110 250
Row 10 20 70 100
2 100% 20% 100% 22.2% 100% 63.6% 100% 40%
50 90 110 250
20 50 30 100
3 100% 40% 100% 55.6% 100% 27.3% 100% 40%
50 90 110 250
40
30
Percent
20 20
10
0
1 2 3
Column
The graph supports the decision in part a. In part a, we rejected the null hypothesis and concluded
that the rows and columns were dependent. If they were independent, then we would expect the three
bars to be the same height. In this graph, they are not the same height.
10.39 a. If all the categories are equally likely, then p1,0 p2,0 p3,0 p4,0 p5,0 .2 .
E1 E2 E3 E4 E5 npi ,0 150(.20) 30
H 0 : p1 p2 p3 p4 p5 .2
H a : At lease one of the probabilities differs from .2
The rejection region requires .10 in the upper tail of the 2 distribution with df k 1 5 1 4 .
From Table IV, Appendix D, .10
2
7.77944 . The rejection region is 2 7.77944 .
Since the observed value of the test statistic does not fall in the rejection region
( 2 2.133 7.77944) , H0 is not rejected. There is insufficient evidence to indicate the categories
are not equally likely at .10 .
35
b. pˆ 2 .233
150
For confidence coefficient .90, .10 and / 2 .10 / 2 .05 . From Table II, Appendix D,
z.05 1.645 . The confidence interval is:
pˆ 2 qˆ2 .233(.767)
pˆ 2 z.05 .233 1.645 .233 .057 (.176, .290)
n2 150
10.40 a. The qualitative variable of interest is the location of professional sports stadiums and ballparks.
There are 3 levels or categories of this variable – downtown, central city, and suburban.
b. Let p1 proportion of major sports facilities located in downtown areas, p2 proportion of major
sports facilities located in central city areas, and p3 proportion of major sports facilities located in
suburban areas in 1997.
To determine if the proportions of major sports facilities in downtown, central city, and
suburban areas in 1997 are the different than in 1985, we test:
e. The degrees of freedom for the test statistic is df k 1 3 1 2 . The p-value is p P ( 2 6.174) .
Using MINITAB,
Chi-Square with 2 DF
x P( X <= x )
6.174 0.954361
Since the p-value is smaller than ( p .0456 .05) , H0 is rejected. There is sufficient evidence to
indicate the proportions of major sports facilities in downtown, central city, and suburban areas in
1997 are the different than in 1985.
10.41 a. The qualitative variable in this exercise is what “Made in the USA” means. There are 4 levels or
categories for this variable: 100% of labor and materials are produced in the US, 75-99% of labor
and materials are produced in the US, 50-74% of labor and materials are produced in the US, and less
than 50% of labor and materials are produced in the US.
n 64 20 18 4 106 .
e. The rejection region requires .10 in the upper tail of the 2 distribution with df k 1 4 1 3 .
From Table IV, Appendix D, .10
2
6.25139 . The rejection region is 2 6.25139 .
f. Since the observed value of the test statistic does not fall in the rejection region
( 2 4.68 6.25139) , H0 is not rejected. There is insufficient evidence to indicate the consumer
advocate group’s claim is incorrect at .10 .
n1 64
g. pˆ1 .604
n 106
For confidence coefficient .90, .10 and / 2 .10 / 2 .05 . From Table II, Appendix D,
z.05 1.645 . The 90% confidence interval is:
We are 90% confident that the proportion of all consumers who believe “Made in the USA” means
“100%” of labor and material are produced in the US” is between .526 and .682.
[nij Eˆij ]2 [691 575.7]2 [381 496.3]2 [794 909.3]2 [899 783.7]2
2 81.46
Eˆij
575.7 496.3 909.3 783.7
d. To determine if tax-motivation and itemize-deduction are related for charitable givers, we test:
The rejection region requires .05 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (2 1)(2 1) 1 . From Table IV, Appendix D, .05
2
3.84146 . The rejection
region is 2 3.84146 .
Since the observed value of the test statistic falls in the rejection region ( 2 81.46 3.84146) , H0 is
rejected. There is sufficient evidence to indicate that tax-motivation and itemize-deduction are
related for charitable givers at .05 .
e. To compute the bar graph, we first convert frequencies to percentages by dividing the numbers in
each column by the column total and multiplying by 100%. Also, divide the row totals by the overall
total and multiply by 100%.
50
40
38.8 %
30
Percent
20
10
0
Yes No
Itemize
The proportions range from .028 to .050. Since .050 is about twice as big as .028, there may be
evidence to conclude some of the proportions are different.
To determine if the proportions of censored measurements differ for the six tractor lines, we test:
2
Eˆ ij
5964.39 257.61 4497.74
187 198.49
2
48.0978
198.49
The rejection region requires .01 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (6 1)(2 1) 5 . From Table IV, Appendix D, .01
2
15.0863 . The rejection
region is 2 15.0863 .
Since the observed value of the test statistic falls in the rejection region ( 2 48.0978 15.0863) ,
H0 is rejected. There is sufficient evidence to indicate that the proportions of censored measurements
differ for the six tractor lines at .01 .
c. Even though there are differences in the proportions of censured data among the 6 tractor lines, these
proportions range from .028 to .050. In practice, there is very little difference between .028 and .050.
n1 146
10.44 a. Let p1 = proportion of abstainers with congestive heart failure. Then pˆ1 .163 .
n 896
n2 106
b. Let p2 = proportion of moderate drinkers with congestive heart failure. Then pˆ 2 .152 .
n 696
n3 29
c. Let p3 = proportion of heavy drinkers with congestive heart failure. Then pˆ 3 .090 .
n 321
d. The three sample proportions found in parts a, b, and c appear to be different. It appears that the
proportion of AMI patients with congestive heart failure depends on alcohol consumption.
e. To determine if the proportion of AMI patients with congestive heart failure depends on alcohol
consumption, we test:
H0: The proportion of AMI patients with congestive heart failure is independent of alcohol
consumption
f. H a: The proportion of AMI patients with congestive heart failure depends on alcohol consumption
Since the p-value is less than ( p .006 .05) , H0 is rejected. There is sufficient evidence to
indicate that the proportion of AMI patients with congestive heart failure depends on alcohol
consumption at .05 .
E1 np1,0 400 .30 120 E2 np2,0 400 .20 80 E3 np3,0 400 .20 80
ni Ei
2
(100 120) 2 (75 80) 2 (85 80) 2 (50 40) 2
2
Ei 120 80 80 40
(40 40) 2 (50 40) 2
8.958
40 40
c. To determine if the true percentages of the colors produced differ from the manufacturer’s stated
percentages, we test:
The rejection region requires .05 in the upper tail of the 2 distribution with df k 1 6 1 5 .
From Table IV, Appendix D, .05
2
11.0705 . The rejection region is 2 11.0705 .
Since the observed value of the test statistic does not fall in the rejection region
( 2 8.958 11.0705) , H0 is not rejected. There is insufficient evidence to indicate the true
percentages of the colors produced differ from the manufacturer’s stated percentages at .05 .
2 7.391
Ei 500 220 110 170
The rejection region requires .05 in the upper tail of the 2 distribution with
df k 1 4 1 3 . From Table IV, Appendix D, .05
2
7.81473 . The rejection region is
2 7.81473 .
Since the observed value of the test statistic does not fall in the rejection region
( 2 7.391 7.81473) , H0 is not rejected. There is insufficient evidence to indicate the percentages
disagree with the percentages reported by Nielson/NetRatings at .05 .
x1 487
pˆ1 .487
n 1000
For confidence coefficient .95, .05 and / 2 .05 / 2 .025 . From Table II, Appendix D,
z.025 1.96 . The 95% confidence interval is:
pˆ1qˆ1 .487(.513)
pˆ1 z.025 .487 1.96 .487 .031 (.456, .518)
n 1000
We are 95% confident that the percentage of all Internet searches that use the Google Search Engine
is between 45.6% and 51.8%.
R C 53(35) RC 53(35)
Eˆ11 1 1 26.5 Eˆ12 1 2 26.5
n 70 n 70
R C 17(35) RC 17(35)
Eˆ 21 2 1 8.5 Eˆ 22 2 2 8.5
n 70 n 70
To determine if the severity of the ethical issue influenced whether the issue was identified or not by
the auditors, we test:
The rejection region requires .05 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (2 1)(2 1) 1 . From Table IV, Appendix D, .05
2
3.84146 . The rejection
region is 2 3.84146 .
Since the observed value of the test statistic does not fall in the rejection region
( 2 .078 3.84146) , H0 is not rejected. There is insufficient evidence to indicate that the severity
of the ethical issue influenced whether the issue was identified or not by the auditors at .05 .
b. No. If there were 0 in the bottom cell of the column, then the expected count for that cell will be less
than 5. One of the assumptions necessary for the test statistic to have a 2 distribution will not hold.
Since the row and column totals are the same, the expected cell counts are the same as above.
2
nij Eˆ ij 2 2 2 2
2 (32 26.5) (21 26.5) (3 8.5) (14 8.5) 9.401
Eˆ ij 26.5 26.5 8.5 8.5
RC 42(144) RC 56(144)
Eˆ 32 3 2 23.08 Eˆ 42 4 2 30.78
n 262 n 262
To determine whether a pig farmer’s education level has an impact on the size of the pig farm, we test:
H0: Pig farmer’s education level and size of pig farm are independent
Ha: Pig farmer’s education level and size of pig farm are dependent
[nij Eˆ ij ]2 (42 42.79) 2 (53 52.21) 2 (27 31.08) 2 (42 37.92) 2 (22 18.92) 2
2
Eˆ ij 42.79 52.21 31.08 37.92 18.92
(20 23.08) 2 (27 25.22) 2 (29 30.78) 2
2.14
23.08 25.22 30.78
The rejection region requires .05 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (4 1)(2 1) 3 . From Table IV, Appendix D, .05
2
7.81473 . The rejection region is
2 7.81473 .
Since the observed value of the test statistic does not fall in the rejection region ( 2 2.14 7.81473) , H0
is not rejected. There is insufficient evidence to indicate that a pig farmer’s education level has an impact
on the size of the pig farm at .05 .
To compute the bar graph, we first convert frequencies to percentages by dividing the numbers in each row
by the row total and multiplying by 100%. Also, divide the column totals by the overall total and multiply
by 100%.
50
45
40
Percent
30
20
10
0
<1,000 1,000-2,000 2,000-5,000 >5,000
Farm Size
Since the bars are all similar in height, it supports the conclusion to the test above.
To determine if there are differences in the percentages of incidents in the four cause categories, we test:
H 0 : p1 = p2 p3 p4 .25
H a : At lease one of the probabilities differs from its hypothesized value
2 i
Ei 20.75 20.75 20.75 20.75
The rejection region requires .05 in the upper tail of the 2 distribution with df k 1 4 1 3 . From
Table IV, Appendix D, .05
2
7.81473 . The rejection region is 2 7.81473 .
Since the observed value of the test statistic falls in the rejection region ( 2 8.036 7.81473) , H0 is
rejected. There is sufficient evidence to indicate there are differences in the percentages of incidents in the
four cause categories at .05 .
10.50 a. The two qualitative variables are years (1990, 1991, . . . , 2000) and acquisition status (yes or no).
c. From the printout, the test statistic is 2 297.048 and the p-value is p 0.000 . Since the p-value is
less than ( p 0.000 .05) , H0 is rejected. There is sufficient evidence to indicate that year and
acquisition status are dependent at .05 .
Committee
Acceptable Rejected Totals
Acceptable 101 23 124
Inspector
Rejected 10 19 29
Totals 111 42 153
b. Yes. To plot the percentages, first convert frequencies to percentages by dividing the numbers in
each column by the column total and multiplying by 100. Also, divide the row totals by the overall
total and multiply by 100.
90
80 81.1
70
Percent 60
50
40
30
20
10
0
Acceptable Rejected
Committee
Since the heights of the bars are not similar, it appears there is a relationship.
R C 124(111) R C 124(42)
Eˆ11 1 1 89.691 Eˆ 21 1 2 34.039
n1 153 n1 153
R C 29(111) RC 29(42)
Eˆ 21 2 1 21.039 Eˆ 22 2 2 7.961
n1 153 n1 153
To determine if the inspector's classifications and the committee's classifications are related, we test:
2
nij Eˆ ij
(101 89.961) (23 34.039) (10 21.039) (19 7.961) 26.034
2 2 2 2
2
The rejection region requires .05 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (2 1)(2 1) 1 . From Table IV, Appendix D, .05
2
3.84146 . The rejection
region is 2 3.84146 .
Since the observed value of the test statistic falls in the rejection region ( 2 26.034 3.84146) , H0
is rejected. There is sufficient evidence to indicate the inspector's and committee's classifications are
related at .05 . This indicates that the inspector and committee tend to make the same decisions.
2
ni E i (32 22.1) 2 (26 25.5) 2 (15 9.35) 2 (6 11.9) 2 (6 16.15) 2
2
17.16
Ei 2 22.1 25.5 9.35 11.9 16.15
The rejection region requires .05 in the upper tail of the 2 distribution with df k 1 5 1 4 .
From Table IV, Appendix D, .05
2
9.48773 . The rejection region is 2 9.48773 .
Since the observed value of the test statistic falls in the rejection region ( 2 17.16 9.48773) ,
reject H0. There is sufficient evidence to indicate the probabilities differ from their hypothesized
values at .05 .
n1 32
b. pˆ1 .376
n 85
For confidence coefficient .95, .05 and / 2 .05 / 2 .025 . From Table II, Appendix D,
z.025 1.96 . The 95% confidence interval is:
c. The interval tells us that between 27.3% and 47.9% of the Avonex MS patients are exacerbation-free
during a two-year period. Since this interval is completely above the percentage of placebo patients
(26%), it seems that the Avonex patients are more likely to have no exacerbations than placebo
patients.
Flight Response
Altitude Low High Totals
< 300 85 105 190
300-600 77 121 198
600 17 59 76
Totals 179 285 464
R C 190(179) RC 190(285)
Eˆ11 1 1 73.297 Eˆ12 1 2 116.703
n 464 n 464
R C 198(179) RC 198(285)
Eˆ 21 2 1 76.384 Eˆ 22 2 2 121.616
n 464 n 464
RC 76(179) RC 76(285)
Eˆ 31 3 1 29.319 Eˆ 32 3 2 46.681
n 464 n 464
To determine if flight response of the geese depends on the altitude of the helicopter, we test:
2
Eˆ ij 73.297 116.703 76.384
11.477
121.616 29.319 46.681
The rejection region requires .01 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (3 1)(2 1) 2 . From Table IV, Appendix D, .01
2
9.21034 . The rejection
region is 2 9.21034 .
Since the observed value of the test statistic falls in the rejection region ( 2 11.477 9.21034) , H0
is rejected. There is sufficient evidence to indicate that the flight response of the geese depends on
the altitude of the helicopter at .01 .
Flight Response
Lateral
Distance Low High Totals
< 1000 37 243 280
1000-2000 68 37 105
2000-3000 44 4 48
3000 30 1 31
Totals 179 285 464
RC 280(179) RC 280(285)
Eˆ11 1 1 108.017 Eˆ12 1 2 171.983
n 464 n 464
R C 105(179) RC 105(285)
Eˆ 21 2 1 40.506 Eˆ 22 2 2 64.494
n 464 n 464
RC 48(179) RC 48(285)
Eˆ 31 3 1 18.517 Eˆ 32 3 2 29.483
n 464 n 464
R C 31(179) RC 31(285)
Eˆ 41 4 1 11.959 Eˆ 42 4 2 19.041
n 464 n 464
To determine if flight response of the geese depends on the lateral distance of the helicopter, we test:
H0: Flight response and Lateral distance of the helicopter are independent
Ha: Flight response and Lateral distance of the helicopter are dependent
2
Eˆ
ij
108.017 171.983 40.506 64.494
207.814
18.517 29.494 11.959 19.041
The rejection region requires .01 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (4 1)(2 1) 3 . From Table IV, Appendix D, .01
2
11.3449 . The rejection
region is 2 11.3449 .
Since the observed value of the test statistic falls in the rejection region ( 2 207.814 11.3449) , H0
is rejected. There is sufficient evidence to indicate that the flight response of the geese depends on
the lateral distance of the helicopter at .01 .
e. Using SAS, the contingency table for altitude by response with the column percents is:
ALTGRP RESPONSE
Frequency|
Percent |
Row Pct |
Col Pct |LOW |HIGH | Total
---------+--------+--------+
<300 | 85 | 105 | 190
| 18.32 | 22.63 | 40.95
| 44.74 | 55.26 |
| 47.49 | 36.84 |
---------+--------+--------+
300-600 | 77 | 121 | 198
| 16.59 | 26.08 | 42.67
| 38.89 | 61.11 |
| 43.02 | 42.46 |
---------+--------+--------+
600+ | 17 | 59 | 76
| 3.66 | 12.72 | 16.38
| 22.37 | 77.63 |
| 9.50 | 20.70 |
---------+--------+--------+
Total 179 285 464
38.58 61.42 100.00
From the row percents, it appears that the lower the plane, the lower the response. For altitude
<300m, 55.26% of the geese had a high response. For altitude 300-600m, 61.11% of the geese had a
high response. For altitude 600+m, 77.63% of the geese had a high response. Thus, instead of
setting a minimum altitude for the planes, we need to set a maximum altitude. For this data, the
lowest response is at an altitude of < 300 meters.
Using SAS, the contingency table for lateral distance by response with the column percents is:
The FREQ Procedure
LATGRP RESPONSE
Frequency |
Percent |
Row Pct |
Col Pct |LOW |HIGH | Total
----------+--------+--------+
<1000 | 37 | 243 | 280
| 7.97 | 52.37 | 60.34
| 13.21 | 86.79 |
| 20.67 | 85.26 |
----------+--------+--------+
1000-2000 | 68 | 37 | 105
| 14.66 | 7.97 | 22.63
| 64.76 | 35.24 |
| 37.99 | 12.98 |
----------+--------+--------+
2000-3000 | 44 | 4 | 48
| 9.48 | 0.86 | 10.34
| 91.67 | 8.33 |
| 24.58 | 1.40 |
----------+--------+--------+
3000+ | 30 | 1 | 31
| 6.47 | 0.22 | 6.68
| 96.77 | 3.23 |
| 16.76 | 0.35 |
----------+--------+--------+
Total 179 284 464
38.58 61.42 100.00
From the row percents, it appears that the greater the lateral distance, the lower the response. For a
lateral distance of 3000+m only 3.23% of the geese had a high response. Thus, the further away the
plane is laterally, the lower the response. For this data, the lowest response is when the plane is
further than 3000 meters.
Thus, the recommendation would be a maximum height of 300 m and a minimum lateral distance of
3000 m.
RC 200(140) 200(140)
Eˆ11 1 1 = 46.667 Eˆ 21 Eˆ 31 46.667
n 600 600
200(460)
Eˆ12 Eˆ 22 Eˆ 32 153.333
600
2
Eˆij
46.667 46.667 46.667 153.333
47.98
153.333 153.333
The rejection region requires .05 in the upper tail of the 2 distribution with
df ( r 1)(c 1) (3 1)(2 1) 2 . From Table IV, Appendix D, .05
2
5.99147 . The rejection
region is 2 5.99147 .
Since the observed value of the test statistic falls in the rejection region ( 2 47.98 5.99147) , H0
is rejected. There is sufficient evidence to indicate quality of filters and shift are related at .05 .
25
b. pˆ1 .125
200
For confidence coefficient .95, .05 and / 2 .05 / 2 .025 . From Table II, Appendix D,
z.025 1.96 . The 95% confidence interval is:
[ni Ei ]2 (26 23) 2 (146 136) 2 (361 341) 2 (143 136) 2 (13 23) 2
10.55 a. 2 9.647
Ei 23 136 341 136 23
b. From Table IV, Appendix D, with df 5 , .05 2
11.0705
c. No. Since the observed value of the test statistics does not fall in the rejection region
( 2 9.647 11.0705) , H0 is not rejected. There is insufficient evidence to indicate the salary
distribution is non-normal for .05 .
Chi-Square with 5 DF
x P( X <= x )
9.647 0.914122
Using frequencies in Fr
1 2 All
1 441 8 449
2 47 2 49
All 488 10 498
Since no level was given, we will use .05 . The rejection region requires .05 in the upper tail of
the 2 distribution with df ( r 1)(c 1) (2 1)(2 1) 1 . From Table IV, Appendix D, .05
2
3.84146 .
The rejection region is 2 3.84146 .
Since the observed value of the test statistic does not fall in the rejection region ( 2 1.188 3.84146) , H0
is not rejected. There is insufficient evidence to indicate that Defect and Pred_EVG are dependent at
.05 . If Defect and Pred_EVG are independent, then the Pred_EVG is no better predicting defects than
just guessing. I would not recommend the essential complexity algorithm be used as a predictor of
defective software modules.
CANDIDATE TIME
Frequency|
Col Pct | 1| 2| 3| 4| 5| 6| Total
---------+--------+--------+--------+--------+--------+--------+
SMITH | 208 | 208 | 451 | 392 | 351 | 410 | 2020
| 52.53 | 55.32 | 55.34 | 55.92 | 56.16 | 55.33 |
---------+--------+--------+--------+--------+--------+--------+
COPPIN | 55 | 51 | 109 | 98 | 88 | 104 | 505
| 13.89 | 13.56 | 13.37 | 13.98 | 14.08 | 14.04 |
---------+--------+--------+--------+--------+--------+--------+
MONTES | 133 | 117 | 255 | 211 | 186 | 227 | 1129
| 33.59 | 31.12 | 31.29 | 30.10 | 29.76 | 30.63 |
---------+--------+--------+--------+--------+--------+--------+
Total 396 376 815 701 625 741 3654
Since no value of was given, we will use .05 . The rejection region requires .05 in the upper tail
of the 2 distribution with df ( r 1)(c 1) (3 1)(6 1) 10 . From Table IV, Appendix D,
.05
2
18.3070 . The rejection region is 2 18.3070 .
Since the observed value of the test statistic does not fall in the rejection region ( 2 2.2839 18.3070) ,
H0 is not rejected. There is insufficient evidence to indicate Voting and Time period are dependent at
.05 . Thus, we can conclude that voting and time period are independent. This means that regardless
of time period, the percentage of votes received by each candidate is the same. In the table created by SAS,
the bottom number in each cell is the column percent. This is the percent of votes received by the
candidate in each time period. An inspection of these percents indicates that candidate Smith received
approximately 55.3% of the votes each time period, candidate Coppin received approximately 13.8% of the
vote, and candidate Montes received approximately 30.9% of the vote. All of this indicates that the
election was rigged