Professional Documents
Culture Documents
1996 Vangel Print PDF
1996 Vangel Print PDF
1996 Vangel Print PDF
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
American Statistical Association is collaborating with JSTOR to digitize, preserve and extend access to The
American Statistician.
http://www.jstor.org
Confidence Intervals for a Normal Coefficient of Variation
Mark G. VANGEL
to easily calculate confidence intervals for r that attain very
nearly the nominal confidence level for any sample size.
This article presents an analysis of the small-sample dis- These calculations require only a table of quantiles of the
tribution of a class of approximate pivotal quantities for a X2 distribution.
normal coefficient of variation that contains the approxima- Let Y, denote a x2 random variable with v = n-1 de-
tions of McKay, David, the "naive" approximate interval grees of freedom, and define W- Yl/v. For aeE (0, 1) let
obtained by dividing the usual confidence interval on the 2 denote the 100crpercentile of the distribution of YL,
standard deviation by the sample mean, and a new interval and let t x= /Iv be the corresponding quantile of W,.
closely related to McKay. For any approximation in this Define the random variable
class, a series is given for e(t), the difference between the
cdf's of the approximate pivot and the reference distribu-
K2(1+0K2) (5)
tion. Let denote
i the population coefficient of variation. (I + HK2) K2
For McKay, David, and the "naive" interval e(t) = 0(,2), where 0 = O(v,ae) is a known function. If we choose 0 so
while for the new procedure e(t) = O(r04). that
KEY WORDS: Chi-squared approximation;Noncentral t
Pr(Q < t) -Pr(Wi, < t) (6)
distribution;McKay's approximation.
then, because the distribution of W, is known and free of
r, we can use Q as an approximate pivot for constructing
hypothesis tests and confidence intervals for K. We define
1. INTRODUCTION the accuracy of the approximation (6) to be e(t) _ p - a,
where p Pr(Q < t). Note that p is the actual confidence
If X is a normal random variable with mean ,u and vari- level of a one-sided confidence interval for r,2, based on Q,
ance -2, then the parameter having nominal confidence ae.In Section 2 we give a Taylor
series expansion for e(t) in powers of r,2, leaving the de-
O_
.N-- ~~~~~~~(1)
tails to the Appendix. We then consider four choices for 0:
At corresponding to the approximations of McKay (1932) and
is called the population coefficient of variation. Let Xi for David (1949), to the "naive" approximate interval obtained
i = 1, .. ., n be an independent random sample, with Xi by dividing the usual confidence interval on the standardde-
N(At,u2) for each i. In terms of the usual sample estimates viation by the sample mean, and to a new interval closely
of the normal parameters related to McKay (1932).
n
McKay (1932) proposed that Q and W, are approxi-
x = EXi/n
mately equal in distribution when 0 = v/(v + 1), but he
(2)
was unable to investigate the small-sample distribution of
i=l1
Q. Consequently, Fieller (1932) and Pearson (1932) per-
and formed numerical and simulation studies with satisfactory
n results. David (1949) proposed McKay's approximation
S2 = X _
(X-)2/(-)(3 with 0 = 1; this suggestion has received much less attention
i=l1 than McKay (1932). Much later, Iglewicz and Myers (1970)
compared selected quantiles of the approximate distribution
a point estimate of (1) is
for K, obtained from Q with McKay's choice of 0, with the
K_ S/X. (4) corresponding exact values obtained using the noncentral t
distribution. This numerical investigation demonstrated that
This statistic is widely calculated and interpreted, often for McKay's approximation is very good, at least for n > 10
very small n, usually without an accompanying confidence and 0 < K < .3. Instead of examining differences in quan-
interval. An exact method for confidence intervals on , tiles numerically, we will investigate differences in cdf's
based on the noncentral t distribution is available (Lehmann analytically, and thereby develop a deeper understanding of
1986, p. 352), but it is computationally cumbersome; hence the small-sample properties of these approximations.
the need for approximate intervals. In this article, we will McKay arrived at his approximation by first deriving an
investigate an approximate pivotal quantity that can be used expression for the density of S/ XI, where S2 = (n-
1)S2/n. He then reexpressed this density as a contour inte-
gral, to which he applied an approximation closely related
Mark G. Vangel is Mathematical Statistician, Statistical Engineering
Division, National Institute of Standards and Technology, Gaithersburg,
to the saddlepoint method (e.g., DeBruijn 1958, chap. 5)
MD 20899-0001. This work was completed while the author was with the in order to obtain an approximate density. Since McKay's
Materials Directorate, Army Research Laboratory,Watertown, MA. statistic is based on an asymptotic approximation, it can
?- 1996 American Statistical Association The American Statistician, February 1996, Vol. 15, No. 1 21
be expected to have good properties for large samples. It is For McKay's approximation 0 =(1) _ v/(v + 1), and (8)
likely that, had McKay performed the small-sample analysis becomes
of this article to Q(02), he would have proposed a slightly
different confidence interval that would not have required
verification for small samples by "the direct, but neverthe- 2
less arduous, method of numerical test" (McKay 1932, p. el (t) tH' (t) I + 2 + 9w + 2w2 - 3w3 + 2w4
695) (i.e., numerical integration and Monte Carlo simula-
tion), and the history of this problem as outlined in the
previous paragraphwould have been different.
-3vt + 9v2t + 5w3t - 7T4t - 5w2t2 1
2. A TAYLOR SERIES FOR e(t) - 2v3t2 + 9w4t2 - 5w4t3 + w4t4 4
+ 2(1+ v)3
Denote the distribution of W, by H, (.) so that, for 0
< ae < 1,H,(t) = a. Because u(x) x/(Ox + 1) is a
monotone function with inverse u-l (y) y/(l - Oy),
+ 0(Z6)W, (9)
[(l+2 2)
=Pr K( 2X + t < 2)
(1 - t)w - 11
e(t) OtH(t) [t-1 JI2 / -16t + 4Vt + 18v2t - 14w3t + 6t2
- 15wt2 - 6w2t2 + 18v3t2
+ 5Vt3 - 4w2t3 - 1OV3t3
+ (1-v + 2vt)(1-Ot)
I + v
* 0
o I ~
I~ ~~I70 ~ ~~~ ~ ~1 ~
I 1 1I
0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.1 0.2 0.3 0.4 0.5
Kappa Kappa
0.0L.O0.2 .3 0.40.5 ?0 0
AlphaK=.95a Alpha=.99
-Cocm
- ModifiedMcKay McKa
CZ0
.1-10
.001 .005 .025 .1 .25 .5 .75 .9 .975 .995 .999
Kappa = < 9.1 E-M07)/tX
2le(alpha)l Alpha (on logit scale)
b0 ,, McKayl
C\J
2 The(Exct)Appuace
Figue fo.25MKa n ModifiedMcKayMehd,aFucinofx,orn=5nd =.5nd2.
For most applications, will be small, so our plan is to + t(Ot - 1)2/2[2H1' (t) + tH (t)] K4 + Q(K6) (A.6)
let
H'(q)= H1(t) + t(Ot-1 )H (t)K2 + Q(K4) (A.7)
2)
a = a(Ei )-1+ ~~~t and
(1-St)K2 (A.1)
w+ 1 w+ 1
andto expandPr[(K/iS)2? q] in a Taylorseriesin ,S2, then
to expandeach termin this seriesagainin powersof ,S2, US- es2 +?O(^;4). (A.8)
ing (A.1). We assumethroughoutthatq iS nonnegative;this +(Ot-1)(w-1-2wt)
substituting (A.6), (A.7), and (A.8) into (A.5), and collecting Iglewicz, B., and Myers, R. H. (1970), "On the Percentage Points of the
Sample Coefficient of Variation,"Technometrics, 12, 166-169.
terms in e2 leads to (8).
Lehmann, E. L. (1986), TestingStatistical Hypotheses (2nd ed.), New York:
John Wiley.
[Received July 1993. Revised November 1994.] McKay, A. T. (1932), "Distribution of the Coefficient of Variation and the
Extended 't' Distribution," Journal of the Royal Statistical Society, 95,
695-698.
REFERENCES
Pearson, E. S. (1932), "Comparisonof A. T. McKay's Approximation with
David, F. N. (1949), "Note on the Application of Fisher's k-statistics," Experimental Sampling Results,"Journal of the Royal Statistical Society,
Biometrika, 36, 383-393. 95, 703-703.