Professional Documents
Culture Documents
A Problem in Statistical Analysis Simultaneous Inference
A Problem in Statistical Analysis Simultaneous Inference
A Problem in Statistical Analysis Simultaneous Inference
The Condor93~1023-1025
0 The CooperOrnithologicalsociety I99 I
KATHLEEN G. BEAL
Department of Mathematics and Statistics, Wright State University,Dayton, OH 45435
HARRY J. KHAMIS
Department of Mathematics and Statisticsand Department of Community Health, Schoolof Medicine,
Wright State University,Dayton, OH 45435
Zar 1984:44 or Ott 1984:llO). When we perform P trol our experimentwisetype I error rate so that it does
tests of hypotheseson P responsevariables, the prob- not exceed0.05. Johnsonand Wichem (1988) discuss
ability of acceptingall null hypotheseswhen true is (1 theory and development of the Bonferroni method in
- a)’ if the testsare independent. For example, if P = multivariate analysis,however some of their formulas
3 then the probability of acceptingall three null hy- are in error (Johnson, pers. comm. with H.K.; corri-
pothesesif true is (1 - 0.05)’ or 0.8574 (Johnsonand genda can be obtained from H.K.).
Wichem 1988:187). The combined probability of a The use of the Bonferroni method, namely usingan
type I error for all hypotheses,called the experiment- alpha level of 0.01 for each t-test in this example, in-
wise type I error rate and denoted by (Y’, is then 00 = suresthat the probability of incorrectly rejecting any
(1 - 0.8574) or 0.1426, not 0.05. The experimentwise one or more of the five null hypothesesis bounded by
type I error rate increaseswith the number of simul- 0.05. If the Bonferroni method is not used, then each
taneoustests of hypotheses;for five tests it is 0.2262, t-test would be conducted at the 0.05 level of signifi-
for 10 tests it is 0.4013, etc. So, for 01= 0.05 and a canceleading to an experimentwisetype I error rate as
large number of tests, the chancesof rejecting a true high as 0.2262. So, null hypothesesmight be rejected
null hypothesisat least once, (Y’, becomesmuch larger in good faith when they should not be.
than 0.05. According to a formula called the Bonferroni The primary disadvantageof the Bonferroni method
inequality, the experimentwise type I error rate, OI’, is is that it may be more conservativethan a multivariate
always less than or equal to (P.,) regardlessof the procedure, that is, the actual experimentwise type I
correlation structureamong the P tests;that is error rate may be somewhat less than (Y.Rice (1989)
discussesthis problem and provides an adjustment,
00 5 P.a.
called the sequential Bonferroni technique, which in-
Consequently,in order to insure that the experiment- creasesthe power of the Bonferroni procedure. For
wise type I error rate, LY’,does not exceed, say 0.05, discrete data, Tarone (1990) has provided a modified
simply use a value of a equal to 0.05/P (see Morrison Bonferroni method.
1990:32-33). Finally, the Bonferronimethod is most effectivewhen
Researchersseem to be largely unaware of the prob- P, the number of tests, is small. As P increases,LY’=
lem regarding simultaneous conclusionsdrawn from a/P decreases.Since the Bonferroni method is conser-
multiple tests of hypothesesconcerning two or more vative when P is large, the probability of the type II
response variables. Introductory statistical texts and error (failing to reject the null hypothesis when it is
statistical references (such as Sokal and Rohlf 1981 false) becomes a concern. Generally, researchersare
and Snedecorand Cochran 1989) usedmost frequently urged to carefully consider the number of tests of hy-
by ornithologists restrict discussion of the experi- pothesesduringthe designphaseof their research.Many
mentwise type I error rate to multiple comparisontech- researchersappear to feel that the more tests of hy-
niauesfor ANOVA. An examination of the 1989 issues pothesesthe better. Eliminating redundant or unnec-
of-The Auk, The Condor, and The Wilson Bulletin essarytests of hypothesesand carefully choosing the
revealed at least forty instancesin which multiple uni- sets of tests to be Bonferroni adjusted reducesP, pre-
variate testswere used for a data set without regardto vents 01’from becoming smaller than necessary,and
inflated values of the experimentwise type I error rate. bolstersthe power of the tests.
We describe a univariate procedure that addresses The Bonferroni method provides a solution to a se-
this problem (simultaneousinference)which is simpler rious statisticalproblem, namely control of the exper-
than the multivariate procedures,but is relatively un- imentwise type I error rate for simultaneousinferences.
known among researchers.In fact, this univariate al- This solution is simple computationally and in terms
ternative, called the Bonferronimethod,is actually su- of interpretation, and it is superiorto other more com-
perior to multivariate methods when the number of plex multivariate techniqueswhen the number of hy-
hypothesesis small (see Johnson and Wichem 1988: pothesesis small. The Bonferroni method can be ap-
188-l 90). plied to any test procedure or confidenceinterval and
For a singletest of hypothesisusinga pooled sample is not restrictedto the useof t-tests (seeMorrison 1990
t-test (as described,for example, by Ott 1984, df = 10, for a variety of applications). It is a simple way of
and LY= 0.05) the null hypothesisis rejected in favor compensatingfor the multiplicity of significancetests
of a two-sided alternative if t is greaterthan the critical and requires no assumptionsother than those neces-
value of 2.228 or less than that of -2.228. If five sets sary for the validity of the individual significancetests
of hypothesesare testedsimultaneouslythe calculation (Miller 1981:8).
of the test statistics is typical of multiple t-tests but
interpretation of the observed level of significance(P
value) is adjustedaccordingto the Bonferroni method, LITERATURE CITED
a method that accountsfor the inflation of the exper-
imentwise type I error incurred by simultaneoustests. FINCH,D. M. 1991. House Wrens adjust laying dates
The Bonferroni method requiresus to divide the prob- and clutch size in relation to annual flooding. Wil-
ability of a type I error, here 0.05, by the number of son Bull. 103:25-43.
tests and to draw our conclusionsbased on that new JOHNSON, RICHARD A., ANDDEANW. WICHERN. 1988.
level of significance,namely (Y’= 0.05/5 = 0.01. The Applied multivariate statisticalanalysis.Prentice-
critical valuesoft are now + 3.169. more extreme than Hall, Englewood Cliffs, NJ.
those for the single test of hypothesis.This technique MILLER,R. 6., JR. 1981. Simultaneousstatisticalin-
allows us to use simple testsof hypothesesand to con- ference. Springer-Verlag,New York.
SHORT COMMUNICATIONS 1025
MORRISON,DONALDF. 1990. Multivariate statistical tistical methods. Iowa State Univ. Press, Ames,
methods. McGraw-Hill, New York. IA.
Orr, LYMAN. 1984. An introduction to statistical SOKAL,R. R., AND R. J. ROHLF. 1981. Biometry. W.
methods and data analysis. Duxbury Press, Bos- H. Freeman and Co., New York.
ton, MA. TARONE,R. E. 1990. A modified Bonferroni method
RICE, WILLIAM R. 1989. Analyzing tablesof statistical for discrete data. Biometrics 46:5 15-522.
tests. Evolution 43:223-224. ZAR, JERROLD H. 1984. Biostatisticalanalysis.Pren-
SNEDECOR, G. W., AND W. G. COCHRAN. 1989. Sta- tice-Hall, Englewood Cliffs, NJ.