Download as pdf or txt
Download as pdf or txt
You are on page 1of 18

JOURNAL OF QUALITY TECHNOLOGY

, VOL. , NO. , –


https://doi.org/./..

RESEARCH PAPER

Some perspectives on nonparametric statistical process control


Peihua Qiu
Department of Biostatistics, University of Florida, Gainesville, Florida

ABSTRACT KEYWORDS
Statistical process control (SPC) charts play a central role in quality control and management. Many data categorization;
conventional SPC charts are designed under the assumption that the related process distribution is distribution-free; log-linear
normal. In practice, the normality assumption is often invalid. In such cases, some articles show that modeling; multivariate
distribution; non-gaussian
certain conventional SPC charts are robust and can still be used as long as their parameters are prop-
data; nonparametric
erly chosen. Some other articles argue that results from such conventional SPC charts would not be procedures; normality;
reliable and that nonparametric SPC charts should be considered instead. In recent years, many non- ordering; ranking
parametric SPC charts have been proposed. Most of them are based on the ranking information in
process observations collected at different time points. Some of them are based on data categoriza-
tion and categorical data analysis. In this article, we give some perspectives on issues related to the
robustness of conventional SPC charts and to the strengths and limitations of various nonparametric
SPC charts.

1. Introduction
statistical theory has been developed for normally dis-
In our daily life, we are often concerned about the quality tributed random variables. An intuitive explanation for
of products because it is related directly to the quality of the reason why many continuous numerical quality vari-
our life. In quality control and management, statistical ables in our daily life roughly follow normal distributions
process control (SPC) plays a central role. SPC charts are can be given by using the central limit theorem (CLT).
mainly used for monitoring sequential processes (e.g., That is, a quality characteristic of a product is often
production lines, internet traffic, medical systems, social affected by many different factors, including the quality
or economic status of a population) to make sure that of raw material, labor, manufacturing facilities, proper
they work stably and satisfactorily. They have been widely operation in the manufacturing process, and so forth. So,
used in manufacturing and other industries. See books, by the CLT, its distribution should be roughly normal.
such as Hawkins and Olwell (1998), Montgomery (2012), In practice, however, there are many variables whose
and Qiu (2014), for related discussions. distributions are substantially different from normal
Since Walter A. Shewhart proposed the first control distributions. For instance, economic indices and other
chart in 1931, many control charts have been proposed in non-negative indices are often skewed to the right. The
the past eighty years, including different versions of the lifetimes of products can often be described reasonably
Shewhart chart, cumulative sum (CUSUM) chart, expo- well by Weibull distributions that could be substantially
nentially weighted moving average (EWMA) chart, and different from normal distributions. In multivariate cases,
the chart based on change-point detection (CPD). See, the normality assumption is especially difficult to satisfy
for instance, Champ and Woodall (1987), Crosier (1988), because this assumption implies that all individual vari-
Crowder (1989), Hawkins (1991), Hawkins, Qiu, and ables follow normal distributions and all subsets of them
Kang (2003), Lowry et al. (1992), Page (1954), Reynolds follow joint normal distributions. A direct conclusion of
and Lou (2010), Roberts (1959), Shewhart (1931), and the multivariate normality assumption is that the regression
Tracy, Young, and Mason (1992). A common feature of relationship between any two subsets of the individual vari-
the control charts discussed in these and many other ables must be linear, which is rarely valid in practice. When
articles is that the process distribution is assumed to the normality assumption is invalid, some people think
be normal. As pointed out in Subsection 2.3.1 of Qiu that the conventional control charts based on the normal-
(2014), normal distributions play an important role in ity assumption can still be used as long as their parameters
statistics because many continuous numerical variables are properly chosen because the charting statistics of these
roughly follow normal distributions in practice and much charts are often weighted sample averages of some process

CONTACT Peihua Qiu pqiu@ufl.edu Department of Biostatistics, University of Florida,  Mowry Road, Gainesville, FL , USA.
©  American Society for Quality
50 P. QIU

observations; thus, the CLT will guarantee that their dis- Stoumbos (2000), Steiner and MacKay (2000), among
tributions are close to normal (cf., Borror, Montgomery, many others. Also, we focus on the Phase II SPC problem,
and Runger 1999; Stoumbos and Sullivan 2002; Testik, and the Phase I SPC problem can be discussed similarly.
Runger, and Borror 2003). Some other people think that In Phase II SPC research, usually the parameters involved
nonparametric control charts should be considered in in the IC distribution are assumed known. In practice,
cases when no parametric distributions (including the however, they often need to be estimated from an IC
normal distributions) are appropriate for describing the data set. The impact of the sample size of the IC data
process distribution (cf., Capizzi 2015; Chakraborti, van set on the performance of control charts with estimated
der Laan, and Bakir 2001; Hackl and Ledolter 1991; Qiu IC parameters is out of the scope of this article. See Qiu
and Hawkins 2001; Qiu and Li 2011a, Qiu and Li2011b). (2014, Chapters 3–5) for related discussions.
Between the two different opinions about parametric In a process monitoring application, if a parametric
versus nonparametric control charts, which one is appro- distribution is available to describe the process distribu-
priate in which cases? We will provide some personal tion well, then a proper parametric control chart can be
perspectives on this issue in the current article. In recent constructed and used. For instance, if we know that the
years, many nonparametric control charts have been IC and out-of-control (OC) process distributions are con-
proposed in the literature. Some recent overviews of this tinuous numeric and their probability density (or mass)
topic can be found in Capizzi (2015), Chakraborti, van der functions are f0 and f1 , respectively, then the charting
Laan, and Bakir (2001), and Qiu (2014, Chapters 8 and statistic of the CUSUM control chart for detecting the
9), and some recent articles can be found in the special distributional shift from f0 to f1 based on a sequence of
issue on nonparametric statistical process control of the independent process observations {X1 , X2 , . . .} is
journal Quality and Reliability Engineering International
(Chakraborti, Qiu, and Mukherjee 2015). Most existing Gn = max(0, Gn−1 + log( f1 (Xn )/ f0 (Xn ))), for n ≥ 1,
methods are based on the ranking/ordering information
of process observations at different time points. Some of where G0 = 0. In the case when f0 and f1 are the densities
them are based on data categorization and on categorical of the familiar N(μ0 , σ 2 ) and N(μ1 , σ 2 ) distributions
data analysis. In this article, we will discuss the pros and with μ1 > μ0 , the above statistic becomes
cons of various different nonparametric control charts.
Cn+ = max(0, Cn−1+
+ (Xn − μ0 )/σ − k), for n ≥ 1,
The rest of the article is organized as follows. In
[1]
Section 2, the robustness property of the normality
where C0+ = 0 and k = (μ1 − μ0 )/(2σ ). It gives a sig-
assumption of certain representative conventional control
nal when Cn+ > hC , where hC > 0 is a control limit. In
charts is discussed, and their in-control (IC) performance
practice, the OC mean μ1 is often unknown. So the
is compared to the IC performance of certain representa-
value of the allowance constant k needs to be specified
tive nonparametric control charts in various cases when
beforehand. Then the control limit hC can be chosen such
the normality assumption is invalid. In Sections 3 and 4,
that a given IC average run length (ARL) value, denoted
the pros and cons of various univariate and multivariate
as ARL0 , is reached. It has been shown that the CUSUM
nonparametric control charts are discussed, respectively.
chart Eq. [1] has the optimality property that its OC ARL
In Section 5, some concluding remarks are given regard-
value, denoted as ARL1 , would be the shortest among all
ing certain future research topics on nonparametric SPC.
control charts with a given value of ARL0 , if the mean
shift size is δ = (μ1 − μ0 )/σ and k is chosen δ/2 (cf.,
2. Robustness of parametric control charts Moustakides 1986; Ritov 1990; Yashchin 1993).
Another popular control chart is the EWMA chart. If
In this section, we discuss whether conventional we are interested in detecting process mean shift from the
parametric SPC charts can still be used when a para- IC mean μ0 , as in Eq. [1], then the EWMA charting statis-
metric distribution specified beforehand is invalid. Our tic is
discussion focuses mainly on univariate cases when the
specified parametric distribution is a normal distribu- En = λ(Xn − μ0 )/σ + (1 − λ)En−1 , for n ≥ 1, [2]
tion for simplicity, although most conclusions made in
this section should also be appropriate for other uni- where E0 = 0, and λ ∈ (0, 1) is a weighting parameter.
variate parametric distributions and for multivariate The chart gives a signal of an upward mean shift if En >
cases as well. For control charts designed for monitoring hE , where hE > 0 is a control limit. In Eq. [2], the weight-
non-normal parametric distributions, such as binomial, ing parameter λ needs to be specified beforehand, and the
Poisson, Weibull, and log-normal distributions—see control limit hE is chosen such that a given value of ARL0
Gan (1993), Jiang, Shu, and Tsui (2011), Reynolds and is reached.
JOURNAL OF QUALITY TECHNOLOGY 51

(a) (b)

Figure . (a) Actual ARL0 values of the CUSUM chart Eq. [] in cases when the IC process distribution is assumed to be N(0, 1) but is actually
the standardized version with mean zero and standard deviation one of the χdf2 distribution (solid curve) or the tdf distribution (dashed
curve), where df is the degrees of freedom. (b) Corresponding results of the EWMA chart Eq. []. The dotted horizontal line in each plot
denotes the nominal ARL0 value of each chart.

Eqs. [1] and [2] are for detecting upward mean shifts CUSUM and EWMA charts in cases where the assumed
only. Their counterparts for detecting downward or normality assumption is invalid. In the CUSUM chart
arbitrary shifts can be defined similarly. In the case where Eq. [1], we choose k = 0.5 and hC = 4.389. In such cases,
the process distribution is assumed normal, their con- its nominal ARL0 is 500 when the IC process distribution
trol limits can be found from tables in text books, such is N(0, 1). Suppose that the true IC process distribution
as Tables 4.1 and 5.1 in Qiu (2014). They can also be is actually the standardized version with mean zero and
computed easily by R packages spc and qcc. standard deviation one of the χd2 f distribution, where d f
Besides the CUSUM and EWMA charts, the Shewhart is the degrees of freedom changing from 1 to 20. In such
and CPD charts are also commonly discussed in the cases, the actual ARL0 values of the CUSUM chart are
literature. Generally speaking, the CUSUM and EWMA shown by the solid curve in Figure 1(a). The dashed curve
charts are more effective than the Shewhart charts for in the plot shows the actual ARL0 values of the CUSUM
detecting persistent and relatively small shifts, while the chart when the true IC process distribution is the stan-
Shewhart charts are effective for detecting transient and dardized version with mean zero and standard deviation
relatively large shifts. Regarding the CPD charts, they one of the td f distribution, where d f is the degrees of
have the advantages that (i) they can provide a reasonably freedom changing from 3 to 20. The corresponding
good estimator of the shift position when they give a results for √the EWMA chart Eq. [2] with λ = 0.2 and
signal of shift and (ii) they do not need much prior hE = 2.962 λ/(2 − λ) are shown in Figure 1(b). From
information about the IC and OC parameters. However, the plots, we can see that the actual ARL0 values of the two
the computation involved in the CPD charts is relatively charts could be substantially smaller than the nominal
extensive. In this article, we mainly focus on the CUSUM ARL0 of 500 in most cases considered. These results imply
and EWMA charts. But most results presented in this that the related production process would be stopped
article should also apply to the Shewhart and CPD charts. unnecessarily often by the two charts. Thus, much
From the above brief description about the conven- time and effort would be wasted in trying to figure out
tional control charts, we can see that their construction the root causes of the false signals, and the efficiency of
(cf., Eq.[1]) and design (i.e., selection of parameters such the production process would be greatly compromised.
as hC and hE ) depend heavily on the assumed parametric The above example shows that the conventional
distributions f0 and f1 . In practice, however, the para- CUSUM and EWMA charts are inappropriate to use in
metric distributions f0 and f1 are often unavailable for cases where they are designed for normally distributed
describing the IC and OC process distributions, as dis- processes but the actual process distributions are not nor-
cussed in Section 1. Real-data examples, in which the con- mal. In such cases, the shift detection power of the charts
ventional normality assumption is invalid, were provided may be irrelevant because a good power could be due to
by several articles, including Hawkins and Deng (2010), an overly small ARL0 value. Also, it may not be realistic to
Qiu and Li (2011b), and Zou and Tsung (2010). Next, adjust the control limits of the charts so that their actual
let us investigate the IC performance of the conventional ARL0 values are all equal to the given nominal values and
52 P. QIU

then compare the OC performance of the charts because


it is often difficult to know the difference between the
actual and nominal ARL0 values of a chart in practical
settings. This is similar to the situation in hypothesis test-
ing, where we should never consider a testing procedure
whose actual significance level is larger than the nominal
significance level (Lehmann 1986, Chapter 3).
In the literature, there has been some discussion about
the robustness of the conventional control charts to the
normality assumption (e.g., Borror, Montgomery, and
Runger 1999; Humana, Kritzingera, and Chakrabortiborti
2011; Shiau and Hsu 2005; Testik, Runger, and Borror
2003). As discussed in Section 1, some authors think that
the conventional control charts should be robust to the
normality assumption while some others disagree with
them. To address this issue, let us consider the EWMA Figure . Actual ARL0 values of the two-sided EWMA chart Eq. []
chart Eq. [2]. Obviously its charting statistic can be writ- in cases where the actual IC process distribution is the standard-
ten as ized version with mean zero and standard deviation one of the χdf2
distribution. The dot-dashed horizontal line denotes the nominal

n   ARL0 value of the chart when the IC process distribution is assumed
Xi − μ0 to be N(0, 1).
En = λ (1 − λ) n−i
, for n ≥ 1. [3]
i=1
σ
transformed process observations. Transformations con-
sidered were either parametric or nonparametric. The
So, En is a weighted average of all process observations
nonparametric transformations were found inappropri-
up to the current time point n. In Example 5.2 of Qiu
ate to use because they could be regarded as parametric
(2014), we consider the two-sided version of this chart
transformations with an infinite number of parameters
√ gives a signal of process mean shift when |En | > hE =
that
and their estimators from an IC data set of a reasonable
ρ λ/(2 − λ). For this version of the chart, when (λ, ρ)
size would have large variability. The major conclusions
are chosen to be (0.05, 2.216), (0.1, 2.454), or (0.2, 2.635),
about parametric transformations were that they should
the ARL0 values of the chart are 200 in all three cases if
be used with care, and a parametric transformation
the IC process distribution is assumed to be N(0, 1). If
defined from the IC data could generally shrink the
the true IC process distribution is actually the standard-
difference between the actual and the nominal ARL0
ized version with mean zero and standard deviation one
values, especially when the IC data size was large. Yet it
of the χd2 f distribution, the actual ARL0 values of the chart
was difficult to find a parametric transformation that was
are shown in Figure 2. From the figure, it can be seen that
good for all cases with non-normal data. For a related
the actual ARL0 values are quite close to the nominal ARL0
discussion, see Qiu and Li (2011a).
value of 200 when λ is chosen small (e.g., 0.1 or 0.05) and
As a summary of our discussion in this section, in cases
df is not too small (i.e., the process distribution is not too
when the normality assumption is invalid, conventional
skewed). In practice, however, it is often hard to know how
control charts designed based on that assumption should
different the actual process distribution is from a normal
be used with care because their IC performance could
distribution. Therefore it is hard to determine how small
be substantially different from what we would expect.
λ should be chosen in a given application. Furthermore, if
The transformation approach is helpful in alleviating the
λ is chosen small, the corresponding chart would be inef-
problem, but it cannot solve the problem satisfactorily in
fective in detecting relatively large shifts, as demonstrated
many cases.
in the literature (cf., Figure 5.4 in Qiu 2014).
In cases where the true IC process distribution is
unknown but an IC data set is available, Qiu and Zhang
3. Univariate nonparametric control charts
(2015) discussed a possible approach to overcome the
difficulty due to the violation of the normality assump- To overcome the limitations of the parametric control
tion. By that approach, a transformation is first defined charts discussed in the previous sections, there have been
based on the IC data set so that the transformed process many nonparametric control charts developed in recent
observations roughly follow a normal distribution, then years that do not require specification of a parametric
the conventional control charts can be applied to the form for describing the process distribution. Some of
JOURNAL OF QUALITY TECHNOLOGY 53

them are even distribution-free in the sense that dis- process mean shifts; Bakir and Reynolds (1979) and Li,
tributions of their charting statistics do not depend on Tang, and Ng (2010) suggested two CUSUM charts; and
the true process distribution. In many articles, however, Graham, Chakraborti, and Human (2011) and Li, Tang,
people do not always make a clear distinction between and Ng (2010) discussed the related EWMA charts. In the
the terminologies of “nonparametric control charts” charts discussed by Li, Tang, and Ng (2010), a reference
and “distribution-free control charts.” Instead, both ter- sample is assumed to be available. When a reference sam-
minologies are used to refer to the charts that can be ple is available, Chakraborti, van der Laan, and van de
consulted when the process distribution does not have Wiel (2004) and Mukherjee, Graham, and Chakraborti
a parametric form. Some nonparametric control charts (2013) proposed the so-called precedence charts by com-
may not be distribution-free—in the sense that their paring the observations in a given batch of Phase II data
design may still depend on the process distribution— with the observations in the reference sample. Because
although a parametric form for the process distribution the IC distribution of ψn does not depend on the specific
is not required. In this section, we discuss some represen- form of the IC process distribution as long as the IC pro-
tative univariate nonparametric control charts and their cess distribution is symmetric, most charts based on ψn
major properties. For simplicity of presentation, our dis- are distribution-free in that sense. Besides ψn , some alter-
cussion focuses mainly on Phase II SPC, although Phase native nonparametric statistics are possible. For instance,
I SPC is also important and can be discussed similarly. Amin, Reynolds, and Bakir (1995) and Lu (2015) consid-
For Phase I univariate nonparametric control charts, see ered using the sign test statistic in their nonparametric
articles such as Capizzi and Masarotto (2013); Graham, control charts, Chowdhury, Mukherjee, and Chakraborti
Human, and Chakraborti (2010); Jones-Farmer, Jordan, (2014) used the Cucconi test statistic, and Liu, Tsung,
and Champ (2009); and Ning et al. (2015). and Zhang (2014) used sequential ranks. In cases where
Research on nonparametric SPC in univariate cases has there is only one observation at each time point, several
quite a long history. For some early articles, see Alloway nonparametric control charts have been proposed using
and Raghavachari (1991), Bakir and Reynolds (1979), the ranking information among observations at different
Hackl and Ledolter (1991), and Park and Reynolds (1987). time points. For instance, Zou and Tsung (2010) proposed
Many methods in this area are based on the ranking/ a nonparametric EWMA chart based on a nonparamet-
ordering information within different batches of the ric likelihood ratio test. Their chart is self-starting
observed batch data of a related process. Let the batch and capable of detecting all distributional shifts. Hawkins
of m independent and identically distributed process and Deng (2010) proposed a nonparametric CPD chart
observations at the current time point n be based on the nonparametric Mann-Whitney two-sample
test. Ross, Tasoulis, and Adams (2011) and Ross and
Xn1 , Xn2 , . . . , Xnm , for n ≥ 1. Adams (2012) proposed several nonparametric CPD
charts for detecting mean, variance, and other distribu-
Then, the sum of the Wilcoxon signed-ranks within the
tional shifts.
nth batch of observations is defined to be
All the nonparametric control charts mentioned above

m
are based on the ranking information among differ-
ψn = sign(Xn j − η0 )Rn j , [4]
ent process observations. Another type of nonparamet-
j=1
ric control charts takes an alternative approach by first
where η0 is the IC median of the process distribution, categorizing the original observations and then using
sign(u) = −1,0,1, respectively, when u < 0, = 0, > 0, tools of categorical data analysis for constructing con-
and Rn j is the rank of |Xn j − η0 | in the sequence trol charts (cf., Qiu 2008; Qiu and Li 2011b). Assume
{|Xn1 − η0 |, |Xn2 − η0 |, . . . , |Xnm − η0 |}. Obviously the that the process observations at the current time point
absolute value of ψn would be small if the process is IC n are Xn = (Xn1 , Xn2 , . . . , Xnm ) , where the batch size m
because the positive and negative values in the summa- can be one and the observations are numerical. First, we
tion of Eq. [4] will be roughly the same and they will categorize the original observations into the following
mostly be canceled out. If there is an upward mean shift c intervals: (−∞, ξ1 ], (ξ1 , ξ2 ], . . . , (ξc−1 , ∞). Let Yn jl =
before or at time n, then the value of ψn will tend to be I(Xn j ∈ (ξl−1 , ξl ]), for l = 1, 2, . . . , c, where ξ0 = −∞
positively large. Its value will be negatively large if there and ξc = ∞. Then, Yn j = (Yn j1 , Yn j2 , . . . , Yn jc ) has one
is a downward mean shift. So, ψn in Eq. [4] carries useful component equal to one, the remaining components are
information about potential mean shifts. Based on this all zero, and the index of the component “one” is the
statistic, Bakir (2004) and Chakraborti and Eryilmaz index of the interval that Xn j belongs to. The vector Yn j
(2007) proposed their Shewhart charts for detecting can be regarded as the categorized data of Xn j , and its
54 P. QIU

distribution can be described by f = ( f1 , f2 , . . . , fc ) , the upward CUSUM chart with the charting statistic
where fl denotes the probability that Xn j belongs to  
the lth interval, for l = 1, 2, . . . , c. When the pro- Cn+ = max 0, Cn−1
+
+ ψn − k , for n ≥ 1, [5]
cess is IC, the distribution of Yn j is denoted as f (0) = where C0+ = 0 and k = 9, and the chart gives a signal

( f1(0) , f2(0) , . . . , fc(0) ) . Then, gnl = mj=1 Yn jl denotes the when Cn+ > h, then the possible ARL0 values of the chart
observed count of original observations at the current are 38.40, 53.76, 72.98, 110.87, 155.36, and 249.32 when
time point that belong to the lth interval, and m fl(0) is h is chosen in (0, 12). In such cases the chart cannot
the corresponding expected  count. The Pearson’s Chi- reach the ARL0 value of 200, for instance. In some cases,
square statistic X n2 = p (gnl − m f (0) )2 /(m f (0) ) mea- we may prefer to use a specific ARL0 value (e.g., 200,
l=1 l l
sures their discrepancy. It has been justified that a mean 370), especially when we want to compare several control
shift in the original process distribution will result in a charts. In such cases, Qiu and Li (2011b) proposed a
shift in the distribution of Yn j under some mild condi- simple modification of a charting statistic so that its
tions. Then, based on X n2 , a CUSUM chart for detecting discreteness can be reduced dramatically without alter-
process mean shifts can be constructed in a similar way ing its OC properties. By this idea, the statistic Cn+ in
to that in Crosier (1988), and it was shown that this chart Eq. [5] can be modified in the way described as follows.
could also detect process variance shifts (cf., Qiu and Li Let bn1 , bn2 , . . . , bnm be i.i.d. random numbers from the
2011b). As a sidenote, it is obvious that (i) Shewhart and distribution N(0, ν 2 ), where ν > 0 is a small number
EWMA charts can also be constructed easily based on (e.g., 0.01). Then, ψn in Eq. [5] can be replaced by
n2 and (ii) these charts can be used for detecting distri-
X
butional shifts of processes with categorical observations 
m
ψn∗ = sign(Xn j − η0 )(Rn j + bn j ). [6]
by simply skipping the data categorization step. From the
j=1
above description, the IC properties of the charts based on
the discretized data Yn j depend on the c − 1 parameters From Eq. [6], when the process is IC, the mean of ψn∗

f1(0) , f2(0) , . . . , fc−1
(0)
(note: fc(0) = 1 − c−1 (0)
l=1 f l ). As dis-
is still zero, but ψn∗ can take more different values than
cussed in the categorical data analysis (cf., Agresti 2002), ψn . Thus it is less discrete than ψn . The discreteness of
the Pearson’s Chi-square statistic X n2 would be most pow- ψn∗ is controlled by the value of ν in the following sense:
erful for detecting distributional shifts in Yn j when we When ν is chosen smaller, ψn∗ can take less different
choose fl(0) = 1/c, for all l. So, these parameter values values and thus it is more discrete, and it is less discrete
are always recommended. In that sense, the charts based when ν is chosen larger. After the process becomes OC,
on the discretized data Yn j are distribution-free. However, and because ν is chosen small, the OC performance of
in practice, to categorize the observations we still need to ψn∗ is almost the same as that of ψn . Again, the difference
determine the cut points {ξl , l = 1, 2, . . . , c − 1}, which between the OC performance of ψn∗ and ψn is controlled
are the quantiles of the IC process distribution F0 (e.g., ξ1 by ν. When ν is chosen smaller, ψn∗ and ψn will be more
is the f1(0) quantile). In some cases these parameters can similar, and their OC performance is more different if ν is
be determined easily. For example, if we know that the chosen larger. In practice, ν should be chosen as small as
IC process distribution is symmetric and c = 2, then ξ1 possible, as long as a specific ARL0 value can be achieved
equals the IC process mean or median. But, in a general within a desired precision. To demonstrate these results,
case, they depend on F0 and need to be estimated from an let us consider the following example. Assume that the
IC data set, which is similar to the case with a conventional IC process distribution is the standardized version of the
CUSUM chart in which the IC mean and variance need to t3 distribution Eq. [5] with mean zero and variance one,
be estimated from an IC data set. In that sense, the charts and we consider using the modified version of with k = 9
based on Yn j are not distribution-free, although only and with ψn replaced by ψn∗ in Eq. [6]. If the given ARL0
some IC parameters need to be determined in advance. value is 200 and ν in the distribution of bn j is chosen
Qiu and Li (2011b) showed that the performance of the to be 0.01, then the actual ARL0 value is computed to
charts based on Yn j would be stable in cases when M ≥ be 199.936 when h = 7.992, based on 10,000 replicated
200 and m = 5, where M is the sample size of the IC data. simulations. We tried other commonly used ARL0 values
One common feature of most nonparametric control as well, such as 370 and 500, and they can all be reached
charts is that their charting statistics are discrete. A direct within a good precision too. Now, assume that the process
consequence of the discreteness is that these charts may mean changes from zero to δ, and δ varies from zero to
not be able to reach a specific ARL0 value. For instance, two with a step of 0.2. The ARL1 values of Eq. [5] with
when m = 5, the possible values of ψn in Eq. [4] are and without using the modification are presented in
{−15, −13, . . . , −1, 1, . . . , 13, 15}. If we consider using Figure 3 by the solid and dashed lines, respectively. It
JOURNAL OF QUALITY TECHNOLOGY 55

when m > 1, c can be chosen smaller than its value in


cases when m = 1.
Next, we compare the numerical performance of
the following representative control charts: (i) the non-
parametric CUSUM chart based on data categorization
with c = 5 (denoted as C-CUSUM), (ii) the Lepage-type
nonparametric control chart based on change-point
detection that was proposed in Ross, Tasoulis, and
Adams (2011) (denoted as LP-CPD), (iii) the conven-
tional two-sided CUSUM chart based on the normality
assumption (denoted as N-CUSUM), and (iv) the con-
ventional EWMA Eq. [2] (denoted as R-EWMA). In all
charts, ARL0 is chosen to be 500 and all results are com-
puted from batch data with batch size m = 5 based on
10,000 replicated simulations. The charts C-CUSUM and
Figure . The solid line denotes the ARL1 values of the modified LP-CPD represent two different types of nonparamet-
version of Eq. [] with k = 9, h = 7.992, and ν = 0.01 (ARL0 = ric control charts that are based on data categorization
200 in such cases) in cases where the IC process distribution is the
standardized version of the t3 distribution with mean zerp and vari-
and data ranking, respectively. The IC parameters in
ance one, the process has a mean shift of δ at the initial time point, C-CUSUM (i.e., {ξ1 , ξ2 , . . . , ξc−1 }) are estimated from
and δ changes from zero to two with a step of .. The dashed line an IC data of size M = 500. Once its allowance con-
denotes the ARL1 values of the unmodified version with k = 9 and stant (cf., k in Eq. [1)] is chosen, its control limit is
h = 7.992 in the same cases. The dotted line on top denotes the determined by a bootstrap procedure with a bootstrap
assumed ARL0 value of .
sample size of 10,000 from the IC data such that ARL0
reaches 500. The conventional charts N-CUSUM and
can be seen that the two sets of ARL1 values in the two R-EWMA are set up in a similar way; namely, their
cases are indeed close to each other, and the difference IC parameters (i.e., μ0 , σ , and η0 ) are estimated from
gets smaller when δ increases. One obvious limitation of the IC data, and their control limits are determined
the modification procedure discussed above is that the by the bootstrap procedure from the IC data such that
ARL0 value of the modified chart depends on the random ARL0 = 500. The chart LP-CPD can be implemented
numbers bn j and is thus random. However, as long as we in the R-package cpm. It does not need to estimate the
compute ARL0 based on a large number (e.g., 10,000) of IC parameters. Instead, it requires a “burn-in” sample.
replicated simulations, this randomness should be small. The default “burn-in” sample size is 20. In this example,
To use the nonparametric control charts, some infor- we consider both the default version and the version
mation in the original observations is lost and not used with a “burn-in” sample size of 120 (i.e., 120 ∗ 5 = 600
in process monitoring. This is true for all nonparametric observations), denoted as LP-CPD and LP-CPD1, respec-
control charts, including the ones based on data rank- tively. Each of the three charts C-CUSUM, N-CUSUM,
ing/ordering and the ones based on data categorization. and R-EWMA has a smoothing parameter involved
For the nonparametric control charts based on data (i.e., the allowance constant k in the two CUSUM charts
ranking/ordering, so far we have not found any method and the weighting parameter λ in the EWMA chart). If we
to control the amount of lost information. For the ones compare their performance by specifying their parameter
based on data categorization, the amount of lost informa- values in advance, then the comparison may not be mean-
tion is controlled by the number of intervals, c, used in ingful. For instance, the C-CUSUM chart with k = 0.1
data categorization. If c is chosen larger, then the amount may not be comparable with the R-EWMA chart with
of lost information will be smaller; the amount of lost λ = 0.1. Even the two CUSUM charts C-CUSUM and N-
information will be larger otherwise. If c is chosen larger, CUSUM with a same k value are not comparable because
however, more IC parameters (e.g., ξ1 , ξ2 , . . . , ξc−1 ) need the former is based the categorized data while the latter
to be estimated from the IC data. Qiu and Li (2011b) is based on the original data. The CUSUM and EWMA
discussed the selection of c. The major conclusions are charts are not comparable with the CPD charts LP-CPD
that (i) when m = 1 and the process distribution is quite and LP-CPD1 because the former have their smoothing
symmetric, c can be chosen as small as 5; (ii) when m = 1 parameters involved while the latter do not have such
and the process distribution is quite skewed, c should be parameters. To compare all charts fairly, in this article
chosen larger, but the performance of the related chart the related smoothing parameter is chosen such that
would hardly be improved for choosing c > 10; and (iii) the ARL1 value of each chart for detecting a given shift
56 P. QIU

Figure . Optimal OC ARL values of five control charts when the IC ARL is , m = 5, and the actual IC process distribution is the stan-
dardized version of N(0, 1) (plot (a)), t(4) (plot (b)), χ 2 (1) (plot (c)), and χ 2 (4) (plot (d)). Scale on the y-axis is in natural logarithm.

reaches the minimum. Namely, the optimal performance (e.g., the magnitude of the shift is larger than 0.8). In cases
of the charts is compared here, as it was in papers such as where the process distribution is symmetric with heay
Qiu (2008). The IC process distribution is chosen to be tails (plot (b)), the chart LP-CPD does not perform well,
the standardized version with mean zero and variance one the version LP-CPD1 is reasonably good when the shift
of the following four distributions: N(0, 1), t4 , χ12 , and size is large, the conventional chart N-CUSUM performs
χ42 . The t distribution represents symmetric distributions well only when the shift is small, the other conventional
with heavy tails, while the two Chi-square distributions chart R-EWMA is worse than N-CUSUM consistently,
represent skewed distributions with different skewness. and the chart C-CUSUM is the best almost all the time.
Then, for each IC process distribution, we consider 10 So, in this case, the conventional chart N-CUSUM is rec-
mean shifts from −1.0 to 1.0 with step 0.2. The calculated ommended only when the shift is expected to be small,
ARL1 values of the four charts are shown in Figure 4. and the chart C-CUSUM should instead be used. In cases
From Figure 4, we can reach the following conclusions. where the process distribution is skewed (plots (c) and
First, in cases where the normality assumption is valid (d)), the chart C-CUSUM still performs well in all cases,
(plot (a)), the N-CUSUM performs the best, as expected, and the other charts are not always effective. These results
and the R-EWMA chart also performs reasonably well, are intuitively reasonable for the following reasons. (i)
especially when the shift sizes are small. The chart LP- When the normality assumption is valid, the nonpara-
CPD is not good, but the version LP-CPD1 is reasonably metric charts LP-CPD, LP-CPD1, and C-CUSUM are not
good, especially when the shift size is large. In this as effective as the N-CUSUM chart because the former
case, the conventional charts N-CUSUM and R-EWMA would lose information by using ranks or data catego-
should be used, the nonparametric charts LP-CPD1 and rization. (ii) When the normality assumption is invalid,
C-CUSUM will lose some efficiency, but the loss of effi- the conventional charts N-CUSUM and R-EWMA would
ciency is quite small when the shift size is relatively large not be effective because they are constructed based on the
JOURNAL OF QUALITY TECHNOLOGY 57

normal distribution densities. (See the general formula CUSUM and EWMA charting statistics in Eqs. [1] and
for the CUSUM charting statistic immediately before [2] and the Hotelling’s statistic in multivariate cases,
Eq. [1]). Therefore, when the normality assumption is when the normality assumption is invalid. In such cases,
violated, its performance will not be good. because the conventional IC distributions of these statis-
Because the nonparametric control charts do not tics under the normality assumption are no longer valid,
use all information in the original data, should they be various bootstrap algorithms were developed for com-
avoided in practice? To answer this question, let us briefly puting their control limits (e.g., Noorossanaa and Ayoubi
discuss a similar issue in the context of hypothesis testing. 2011; Phaladiganon et al. 2011). Such control charts
In that context, the null hypothesis is usually protected would have good IC properties, but their OC properties
because it should have been tested extensively in the past. may not be as good as we would expect because their
A hypothesis testing procedure is acceptable only when charting statistics are constructed under the normality
its type-I error probability is smaller than or equal to a assumption or other related distributional assumptions
prespecified significance level α. In such cases, the smaller and have good properties only when such assumptions
its type-II error probabilities, the better. The degree of are valid. See some numerical results about N-CUSUM
protection of the null hypothesis is controlled by α. In and R-EWMA in Figure 4 and some related discussions
applications where the consequence of type-I error could in Jones and Woodall (1998).
be very serious (e.g., applications related to new drugs), α
should be chosen small (e.g., 0.01). Otherwise, α could be
4. Multivariate nonparametric control charts
chosen relatively large. In most applications, people use
the default value of α, which is 0.05. Now, in the context of Quality is a multifaceted concept. In most SPC
Phase II SPC, the IC status of the process at the beginning applications, we are concerned with multiple quality
of online monitoring is confirmed in Phase I analysis. So characteristics. In that sense, SPC research should focus on
it should be protected as well, in the sense that an online multivariate cases. When process observations are multi-
monitoring procedure is acceptable only when its actual variate, it is rare in practice that their distribution is multi-
ARL0 value is not smaller than the prespecified level. variate normal, as explained in Section 1. In the statistical
Otherwise the false signal rate could be higher than what literature, existing methods for describing multivariate
is expected. Of course, it is also not good if the actual non-normal data or transforming multivariate non-
ARL0 value of a chart is much larger than the prespecified normal data to multivariate normal data are limited. If a
level because this could result in more defective products control chart based on the normality assumption is used
than expected. Such a chart, however, is easier to figure in cases where the assumption is invalid, then the actual
out because its ARL1 values are usually also large and it is ARL0 value of the chart could be substantially different
in a disadvantageous status in comparison with compet- from the assumed one, as demonstrated by Figure 1 in
ing charts. In cases where the consequence of defective univariate cases. See the related discussion in Section 9.1
products is very serious, a small prespecified ARL0 value of Qiu (2014). Therefore, development of multivariate
should be used. Otherwise, the prespecified ARL0 value nonparametric control charts is exceptionally important.
can be chosen relatively large. As shown in Figure 1, In this section, we discuss some existing methods for
when the assumed parametric distribution is invalid, the handling this problem and discuss their major proper-
actual ARL0 values of the conventional parametric control ties. A small comparison study among some of them
charts could be substantially smaller than the prespecified is also presented. It should be pointed out that multi-
ARL0 value. In such cases, they should be avoided and the variate nonparametric SPC is much more challenging
nonparametric control charts should be used. The loss of than its univariate counterpart, partly because (i) the
information is just the price to pay for the nonparametric multiple quality characteristics could have a complicated
control charts to reach the prespecified ARL0 level without correlation structure among them and (ii) potential
specifying a parametric form for describing the IC process shifts in the process distribution could have infinitely
distribution. One important future research topic is to many possible directions, and so forth. Thus, it requires
minimize the lost information while keeping the favorable much future research to solve this problem properly in
properties of the nonparametric control charts. different scenarios and compare different methods in a
At the end of this section, we would like to mention comprehensive manner.
that the IC distributions of nonparametric charting As in univariate cases, one natural idea to handle the
statistics are often difficult to derive analytically. So, the multivariate nonparametric SPC (MNSPC) problem is
bootstrap and other numerical approaches are often used to use the ranking/ordering information in the process
for this purpose (e.g., Ambartsoumian and Jeske 2015; observations. In multivariate cases, there are two types of
Chatterjee and Qiu 2009). Some authors also considered ranking information. The longitudinal ranking refers to
using some conventional charting statistics, such as the the one among observations at different time points, and
58 P. QIU

the cross-component ranking refers to the ranking across Boone and Chakraborti (2012) proposed a MNSPC
different components of a multivariate observation at a Shewhart chart based on component-wise signs,
given time point. The first MNSPC method based on lon- described as follows. At the current time point n, assume
gitudinal ranking might be the one by Liu (1995) using the that we observe a batch of m independent and identically
concept of data depth (cf., Liu and Singh 1993; Liu, Singh, distributed process observations
and Teng 2004). One fundamental difference between
Xn1 , Xn2 , . . . , Xnm , for n ≥ 1,
univariate and multivariate cases is that process obser-
vations in univariate cases are naturally ordered on the where Xn j = (Xn j1 , Xn j2 , . . . , Xn jp ) , for j = 1, 2, . . . , m.
number line and such ordering is not well defined in Then, for the lth component, the sign statistic is defined
multivariate cases. One major purpose of data depth as
is to establish an order among multivariate observa- 
m
tions. There are several different definitions of data ξnl = sign(Xn jl −  μ0l ),
depth in the literature. The Mahalanobis depth of a j=1

p-dimensional point y with respect to a p-dimensional where sign(u) = −1, 0, 1, respectively, when u < 0, =
distribution F0 is proportional to the reciprocal of the 0, > 0, and  μ0 = ( μ01 ,  μ0p ) is the IC process
μ02 , . . . , 
Mahalanobis distance (y − μ0 ) 0 (y − μ0 ), where μ0 median vector. Let ξ n = (ξn1 , ξn2 , . . . , ξnp ) . Then, the
and 0 are the mean vector and covariance matrix of Shewhart charting statistic is ξ n ξ ξ n , where ξ is an
n n
F0 . Then, p-dimensional observations can be ordered estimator of the covariance matrix of ξ n . This chart is
by their Mahalanobis distances: the smaller the Maha- difficult to use in cases when m = 1 (i.e., a single obser-
lanobis distance, the closer to the center of the dis- vation is obtained at each time point), due to the discrete-
tribution F0 . One obvious limitation of this depth is ness of its charting statistic. Also, although Boone and
that it ignores the shape of F0 completely. To over- Chakraborti (2012) determined the control limits of the
come this limitation, Liu (1990) defined the simpli- Shewhart chart using the χ p2 distribution, this distribu-
cial depth of y as P(y ∈ S (Y1 , Y2 , . . . , Y p+1 )), where tion is appropriate for describing the IC distribution of
(Y1 , Y2 , . . . , Y p+1 ) is a simple random sample from F0 , ξ ξ n only when m is large and
ξ n ξ is close to ξ . In
n n n
S (Y1 , Y2 , . . . , Y p+1 ) is an open simplex with vertices practice, however, m is usually small (e.g., m = 5) and the
at Y1 , Y2 , . . . , Y p+1 , and P(·) is the probability under components of ξ n are correlated. Thus, the IC distribution
F0 . In practice, F0 is often unknown. Instead, we may ξ ξ n needs to be estimated from an IC data set using
of ξ n n
have a reference sample (Y1 , Y2 , . . . , YM ) from F0 . In a numerical approach (e.g., a bootstrap algorithm). In that
such cases, Liu (1990) suggested replacing the probability sense, this Shewhart chart is no longer distribution-free.
in the definition of simplicial depth by the propor- Qiu (2014, Section 9.2) generalized it to a multivariate

tion [1/( M p + 1 )] I(y ∈ S (Yi1 , Yi2 , . . . , Yi p+1 )), where EWMA chart as follows. First, we define
(Yi1 , Yi2 , . . . , Yi p+1 ) is a subset of (Y1 , Y2 , . . . , YM ). This
En = λξ n + (1 − λ)En−1 , for n ≥ 1, [7]
idea is similar to replacing F0 by its empirical estimator.
By the concept of data depth, some Shewhart, CUSUM, where E0 = 0 and λ ∈ (0, 1) is a weighting parameter.
EWMA, and other types of charts have been constructed Then, the chart signals a mean shift when
in the literature (cf., Li, Zhang, and Jeske 2013; Li, Dai,
En E−1 En > h, [8]
and Wang 2014; Liu 1995). Based on extensive numerical n

studies, Bush et al. (2010), Stoumbos et al. (2001), Stoum- where h > 0 is a control limit. Boone and Chakraborti
bos and Jones (2000), and some others have pointed out (2012) proposed another MNSPC Shewhart chart by gen-
that control charts based on data depth are generally inef- eralizing the Wilcoxon signed-rank sum statistic in Eq. [4]
fective, especially in cases where a large reference sample into multivariate cases. In this chart, the signed-rank sum
is unavailable. This conclusion is not a surprise because in statistic was computed for each component of process
most charts based on data depth, the nonparametric func- observations, and the charting statistic takes a quadratic
tion F0 is estimated from a reference sample. While F0 can form of the vector of these componentwise signed-rank
be regarded as a parametric function with infinite num- sums. A similar MNSPC chart is discussed in Bush et al.
ber of parameters, its empirical estimators would have (2010). An EWMA chart based on the Wilcoxon signed-
a large variation, especially in cases where the dimen- rank sum statistic was proposed recently by Chen, Zi, and
sionality p is relatively large. Consequently, the related Zou (2016).
charting statistics would have large variation as well, The charts discussed in the previous paragraph use
resulting in ineffective detection of process distributional component-wise ordering information that ignores the
shifts. ordering among different components. To overcome this
JOURNAL OF QUALITY TECHNOLOGY 59

limitation, Zou and Tsung (2011) and Zou, Wang, and antiranks have the following properties. The first antirank
Tsung (2012) proposed two EWMA charts using the is particularly sensitive to downward mean shifts in a
so-called spatial sign and spatial rank that were discussed small number of components of Xn , and the last antirank
extensively in the nonparametric statistics literature is particularly sensitive to upward mean shifts in a small
(cf., Oja 2010). Assume that a p-dimensional random number of components of Xn . If we do not know the direc-
vector X has the mean μ0 . Then, its spatial sign is defined tion of a shift, then the combination of the first and last
to be S(X) = (X − μ0 )/X − μ0  when X = μ0 , and antiranks should be sensitive to the shift. In other words,
zero otherwise. For a sample (X1 , X2 , . . . , Xn ) from the we can reduce the dimension of the SPC problem from p
distribution of X, the spatial rank of Xi is defined to be to two without losing much efficiency if the antiranks are
1 n
ri = n j=1 S(Xi − X j ), for i = 1, 2, . . . , n. Assume that used. Qiu and Hawkins then suggested a nonparametric
the IC mean and the IC covariance matrix of X are μ0 CUSUM chart based on the antiranks for detecting mean
and 0 , respectively, and the “most robust” measure of shifts violating H0(1) . To detect (2)
 p mean shifts violating H0 ,
scatter of the distribution of X defined by Tyler (1987) a regular CUSUM using j=1 Xn j should be effective.
is A0 , which is an upper triangular p × p matrix with However, the above monitoring scheme has the obvious
positive diagonal elements. Then, the EWMA charting drawback that two separate control charts need to be
statistic suggested by Zou and Tsung (2011) is defined as used for detecting mean shifts violating H0(1) and H0(2) ,
[(2 − λ)p/λ]En En , where respectively, making it inconvenient to use. To overcome
this drawback, Qiu and Hawkins (2003) proposed a mod-
En = (1 − λ)En−1 + λS(A0 Xn ), for n ≥ 1. ification using the antiranks of (Xn1 , Xn2 , . . . , Xnp, 0).
This modified CUSUM chart was shown to be effective in
Zou and Tsung (2011) pointed out that this chart was detecting arbitrary mean shifts. From the above descrip-
distribution-free only in some special cases and its con- tion, it can be seen that the IC properties of the nonpara-
trol limit needed to be estimated from an IC data set in metric control charts based on antiranks are determined
a general case. A CUSUM chart based on spatial sign was completely by the IC distribution of the antiranks. In
discussed by Li, Zhang, and Jeske (2013). The EWMA cases where the IC process distribution is exchangable in
chart suggested by Zou, Wang, and Tsung (2012) replaced the sense that the distribution function is unchanged if
the spatial sign S(Xn ) in the above expression by a prop- the order of measurement components is changed, then
erly defined spatial rank of Xn . Holland and Hawkins the IC distribution of the antiranks would be uniform in
(2014) proposed a CPD chart based on spatial rank. That its domain. Thus, the IC properties of the related control
chart can be accomplished using the R-package NPMVCP. charts would not change in such cases. However, in a
The control charts discussed above are all based general case, parameters in the IC distribution of the
on longitudinal ranking of the observed data. Qiu and antiranks should still be estimated from an IC data set.
Hawkins (2001) suggested an alternative strategy for non- In cases where Xn is multivariate normally distributed,
parametric SPC based on cross-component ranking of the we know that all marginal distributions are normal and
data. Let {Xn = (Xn1 , Xn2 , . . . , Xnp ) , n ≥ 1} be the Phase the relationship between any two subsets of its p compo-
II observations of a p-dimensional process with the IC nents is linear (i.e., the regression function of one subset
mean vector μ0 and the IC covariance matrix 0 , and μ = on the other is always linear). In cases where the joint
(μ1 , μ2 , . . . , μ p ) be the true mean of Xn . Without loss of distribution of Xn is not normal, its marginal distributions
generality, assume that μ0 = 0 and 0 = Ip×p (otherwise, and the relationship between a pair of two subsets of the
consider transformed observations  Xn = 0−1/2 (Xn − components of Xn could be complicated, which explains the
μ0 )). Qiu and Hawkins noticed that any mean shift main reason why multivariate non-Gaussian distributions
violates either H0(1) : μ1 = μ2 = · · · = μ p or H0(2) : are difficult to describe. However, if all components of
p (1)
j=1 μ j = 0, where H0 is related to the ranking of the p Xn are categorical, this difficulty disappears because the
components of Xn and H0(2) is related to their magnitudes. log-linear modeling approach is effective in describing
To detect mean shifts violating H0(1) , Qiu and Hawkins the relationship among categorical variables (e.g., Agresti
suggested using the antiranks (also called inverse ranks) 2002). Based on this consideration, Qiu (2008) proposed a
of the p components of Xn . The first antirank An1 is general scheme to construct nonparametric multivariate
defined to be the index in (1, 2, . . . , p) of the smallest SPC charts by first categorizing the original components
component, the last antirank Anp is the index of the largest of Xn (if some components are already categorical, then
component, and so forth. While the p conventional ranks this step can be skipped for these components) and then
of (Xn1 , Xn2 , . . . , Xnp ) are equally important in detecting describing the joint distribution of the categorized data
mean shifts when no prior information is available regard- using a log-linear model. Then a control chart can be
ing which component(s) of Xn would have mean shift, the constructed accordingly by comparing the empirical and
60 P. QIU

IC distributions of the categorical data, as in univariate IndT5 (3): The five components of Xn are independent and
cases discussed in Section 3. In the chart by Qiu (2008), each of them has the standardized version with mean
only two categories (i.e., ≥ or < the IC median) are zero and variance one of a t3 distribution.
used for each continous component of Xn , for simplicity. IndChisq5 (3): The five components of Xn are independent
Its performance might be improved if more categories and each of them has the standardized version with
are used. Like other multivariate nonparametric control mean zero and variance one of a χ32 distribution.
charts discussed above, the control charts based on data Normal5 (0, CS05 ): Xn ∼ N5 (0, CS05 ), where CS05 is a
categorization are distribution-free only in some special covariance matrix with all diagonal elements being 1
cases (e.g., the process distribution is exchangable and and all off-diagonal elements being 0.5 (i.e., a com-
symmetrical). In a general case, some parameters for pound symmetry case).
describing the IC distribution of the categorized data Mixture: Xn1 ∼ N(0, 1), Xn2 has the standardized ver-
should be estimated from an IC data set by the log-linear sion with mean zero and variance one of a t3 distribu-
modeling. tion, Xn3 has the standardized version with mean zero
Next, we present a numerical study to compare sev- and √variance one of a χ32 distribution,
√ Xn4 ∼ (Xn2 +
eral representative MNSPC charts described above. The η1 )/ 2, and Xn5 ∼ (Xn3 + η2 )/ 2, where η1 and η2
MNSPC charts considered here include (i) the EWMA are two independent random numbers generated from
chart in Eqs. ([7]) and ([8]) based on the component- N(0, 1).
wise sign statistic ξ n , denoted as SIGN-EWMA, (ii) the
CPD chart by Holland and Hawkins (2014) that is based The first two cases IndT5 (3) and IndChisq5 (3) repre-
on the spatial rank, denoted as SR-CPD, (iii) the mod- sent different scenarios where the components of Xn are
ified antirank-based chart by Qiu and Hawkins (2003), indendent, while the last two cases represent scenarios
denoted as ANTIRANK, and (iv) the control chart based where they are correlated. At time point τ , assume that
on data categorization that was proposed by Qiu (2008), the process mean has one of the following five shifts:
denoted as CATEGORIZE. In SR-CPD, the control lim-
M1 : (−0.5, 0, 0, 0, 0), M2 : (−1, 0, 0, 0, 0),
its provided in Holland and Hawkins (2014) are used,
where a “burn-in” sample of 32 IC observations is used. M3 : (−1, 1, 0, 0, 0),
In ANTIRANK, only the first and last antiranks are con- M4 : (−0.5, −0.5, −0.5, 0.5, 0.5),
sidered. These four charts are chosen mainly because M5 : (0.5, 0.5, 0.5, 0.5, 0.5).
of the following three considerations: (i) they represent
four types of charts that use the sign statistic, spatial Because the SR-CPD chart requires a “burn-in” sample
rank, cross-component antirank, and data categorization, and Holland and Hawkins (2014) recommended the size
respectively, (ii) all of them are either EWMA, CPD, or of that sample to be 33 or more, we first consider the case
CUSUM charts that are good at detecting persistent mean where τ = 50. To use the four control charts for process
shifts, and (iii) we have their software. monitoring, we need to choose their procedure parame-
In the numerical study, we assume that there is a ters (e.g., λ and h in SIGN-EWMA) properly. To make the
single observation at each time point, observations at comparison among the four charts as fair as possible, in
different time points are independent, the IC distribution this article we choose the parameters of each chart so that
is known, p = 5, and ARL0 = 200. So, the impact of the (i) the actual ARL0 = 200 and (ii) the ARL1 value reaches
estimation error of the IC distribution from an IC data the minimum when detecting a given shift. Namely, we
and the data autocorrelation are not considered here. compare the optimal performace of the four charts in this
However, it should be pointed out that one important study, as we did in univariate cases (cf., Figure 4).
property of the CPD chart SR-CPD is that it does not Based on 10,000 replicated simulations, the calculated
need any prior information about the IC process dis- ARL1 values of the five charts are shown in Figure 5 in
tribution, besides a “burn-in” sample, in order to run the log scale, along with their actual ARL0 values. From
that chart. Thus, by assuming the IC distribution to be the plot, we can see that (i) all five methods are reliable
known, we may have put it in a disadvantageous status in to use in the four cases considered in the sense that their
the comparison. To account for this, besides the original actual ARL0 values are close to the specified value 200, (ii)
CPD chart SR-CPD, we also consider the SR-CPD chart no single method is unanimously better than the remain-
that uses a large “burn-in” sample of 532 observations, ing methods, (iii) the chart SR-CPD1 is unanimously
denoted as SR-CPD1. To use a large “burn-in” sample, better than SR-CPD, as expected, and is also better than
the SR-CPD1 chart could learn enough information the other charts in most cases, and (iv) it seems that
about the IC distribution from the “burn-in” sample. The the charts SR-CPD1, ANTIRANK, and CATEGORIZE
following four IC distributions are considered: are better in detecting small shifts (e.g., M1) when the
JOURNAL OF QUALITY TECHNOLOGY 61

Figure . Calculated ARL0 and ARL1 values of the four MNSPC charts for detecting shifts M–M when τ = 50, where M = (, , , , ).
(a) IndT5 (3); (b) IndChisq5 (3); (c) Normal5 (0, CS05 ); and (d) Mixture. The y–axes are in the log scale.

components of Xn are correlated (cf., plots (c) and (d)). in the observed data for process monitoring. Intuitively,
The corresponding results when τ = 0 (i.e., zero-state a more efficient solution to the MNSPC problem is to
run length performance) are shown in Figure 6. The combine the longitudinal ranking and cross-component
charts SR-CPD and SR-CPD1 are not included in such ranking information in the observed data. But it is still
cases because their requirement of a “burn-in” sample unknown to us how to combine the two types of ranking
cannot be satisfied. From the plots, it can be seen that information effectively. In the method based on data
(i) the chart SIGN-EWMA is unanimously worse than categorization, it has been shown in Qiu and Li (2011b)
the other two charts, (ii) the two charts ANTIRANK that its performance can be improved if more than
and CATEGORIZE perform similarly well, and (iii) two categories are used in univariate cases, and some
it seems that CATEGORIZE performs slightly better practical guidelines about the selection of the number
than ANTIRANK when detecting small shifts (i.e., of categories are given in the penultimate paragraph in
M1). We also tried cases where τ is between 0 and 50, Section 3. In multivariate cases, it is unknown whether
and the results are between those shown in Figure 5 similar guidelines are still valid. It has been pointed out
and Figure 6. several times in the article that some information on the
Although there have been some MNSPC charts pro- observed data will be lost by using either the ranking
posed in the literature as discussed above, there are still information or the data categorization. This is especially
some fundamental issues related to the MNSPC problem true in multivariate cases. So, a natural research question
that need to be addressed. For instance, most existing is how to minimize the lost information in MNSPC. All
MNSPC methods are based on either the longitudinal these research questions require much future research
ranking or the cross-component ranking information effort.
62 P. QIU

Figure . Calculated ARL0 and ARL1 values of the three MNSPC charts for detecting shifts M–M when τ = 0, where M = (, , , , ).
(a) IndT5 (3); (b) IndChisq5 (3); (c) Normal5 (0, CS05 ); and (d) Mixture. The y-axes are in the log scale.

5. Some concluding remarks factors (Zhang et al. 2015). Therefore new SPC methods
are needed for such applications, and the new methods
In this big-data era, SPC has found more and more
should be able to accommodate the complicated data
applications in engineering, environmental research,
structure properly.
medicine, public health, and other disciplines and areas
(Qiu 2017, 2018) because data streams are common in
these applications and it is often a fundamental task to
About the authors
monitor online the properties of the related data streams.
When the data structure becomes more complicated, Peihua Qiu received his Ph.D. in statistics from the Statistics
conventional SPC charts need to be modified or gen- Department at University of Wisconsin at Madison in 1996.
He worked as a senior research consulting statistician of the
eralized properly to still be useful. In this article, we
Biostatistics Center at Ohio State University during 1996-1998.
only focus on cases where the conventional normality He then worked as an assistant professor (1998-2002), an asso-
assumption is invalid. In practice, other assumptions, ciate professor (2002-2007), and a full professor (2007-2013)
such as the independent observations and the identical at the School of Statistics at University of Minnesota. He is
IC distribution, could all be violated. For instance, when an elected fellow of the American Statistical Association, an
monitoring the spatial-temporal pattern of the incidence elected fellow of the Institute of Mathematical Statistics, an
elected member of the International Statistical Institute, a senior
rate of an infectious disease across the united state and member of the American Society for Quality, and a lifetime
over time, observed incidence rates could be spatially and member of the International Chinese Statistical Association. He
temporally correlated. Furthermore, even when there is served as associate editor for Journal of the American Statistical
no disease outbreak in a given region and within a given Association (2006-2012), Biometrics (2011-2012), Technomet-
time period, the distribution of the disease incidence rics (2007-2012), and Statistical Papers (2011-2012), and guest
co-editor for Multimedia Tools and Applications, and Quality
rate could still change in the given region and the given
and Reliability Engineering International. He was the editor-
time period due to seasonality and other environmental elect (2013) and editor (2014-2016) of Technometrics, and is
JOURNAL OF QUALITY TECHNOLOGY 63

currently the founding chair of the Department of Biostatistics charts based on a linkage ranking algorithm. Quality and
at University of Florida, started since July 1, 2013. Reliability Engineering International 26:663–75.
Capizzi, G. 2015. Recent advances in process monitoring: Non-
Peihua Qiu has made substantial contributions in the areas of
parametric and variable-selection methods for phase I and
jump regression analysis, image processing, statistical process
phase II. Quality Engineering 27:44–67.
control, survival analysis and disease screening and surveil-
Capizzi, G., and G. Masarotto. 2013. Phase I distribution-free
lance. So far, he has published over 100 research papers, many
analysis of univariate data. Journal of Quality Technology
of which appeared in top journals, including Technometrics,
45:273–84.
Journal of the American Statistical Association, Annals of
Chakraborti, S., and S. Eryilmaz. 2007. A nonparametric
Statistics, Annals of Applied Statistics, Journal of the Royal Sta-
Shewhart-type signed-rank control chart based on runs.
tistical Society (Series B), Biometrika, Biometrics, IEEE Trans-
Communications in Statistics—Simulation and Computation
actions on Pattern Analysis and Machine Intelligence, and IIE
36:335–56.
Transactions. His research monograph titled, Image Processing
Chakraborti, S., P. Qiu, and A. Mukherjee. 2015. Editorial to
and Jump Regression Analysis, (2005, Wiley) won the inaugu-
the special issue: Nonparametric statistical process con-
ral Ziegel prize in 2007 for its contribution in bridging the gap
trol charts. Quality and Reliability Engineering International
between jump regression analysis in statistics and image pro-
31:1–2.
cessing in computer science. His second book titled, Introduc-
Chakraborti, S., P. van der Laan, and S. T. Bakir. 2001. Nonpara-
tion to Statistical Process Control, was published in 2014 by
metric control charts: An overview and some results. Jour-
Chapman & Hall/CRC.
nal of Quality Technology 33:304–15.
Chakraborti, S., P. van der Laan, and M. van de Wiel. 2004. A
class of distribution-free control charts. Journal of the Royal
Acknowledgments Statistical Society, Series C 53:443–62.
Champ, C. W., and W. H. Woodall. 1987. Exact results for She-
We thank the editor and two referees for many constructive whart control charts with supplementary runs rules. Tech-
comments and suggestions that, greatly improved the quality of nometrics 29:393–9.
the article. Chatterjee, S., and P. Qiu. 2009. Distribution-free cumulative
sum control charts using bootstrap-based control limits.
The Annals of Applied Statistics 3:349–69.
Funding Chen, N., X. Zi, and C. Zou. 2016. A distribution-free mul-
tivariate control chart. Technometrics 58 (4): 448–459.
This research is supported in part by a National Science Foun- doi:10.1080/00401706.2015.1049750.
dation grant. Chowdhury, S., A. Mukherjee, and S. Chakraborti. 2014. A
new distribution-free control chart for joint monitoring of
unknown location and scale parameters of continuous dis-
tributions. Quality and Reliability Engineering International
References 30:191–204.
Crosier, R. B. 1988. Multivariate generalizations of cumulative
Agresti, A. 2002. Categorical data analysis. 2nd ed. New York,
sum quality-control schemes. Technometrics 30:291–303.
NY: John Wiley & Sons.
Crowder, S. V. 1989. Design of exponentially weighted moving
Alloway, Jr., J. A., and M. Raghavachari. 1991. Control chart
average schemes. Journal of Quality Technology 21:155–62.
based on the Hodges-Lehmann estimator. Journal of Quality
Gan, F. F. 1993. An optimal design of CUSUMcontrol charts for
Technology 23:336–47.
binomial counts. Journal of Applied Statistics 20:445–60.
Ambartsoumian, T., and D. R. Jeske. 2015. Nonparametric
Graham, M. A., S. Chakraborti, and S. W. Human. 2011. A non-
CUSUM control charts and their use in two-stage SPC
parametric exponentially weighted moving average signed-
applications. Journal of Quality Technology 47:264–77.
rank chart for monitoring location. Computational Statistics
Amin, R., M. R. Reynolds, Jr, and S. T. Bakir. 1995. Non-
and Data Analysis 55:2490–503.
parametric quality control charts based on the sign statis-
Graham, M. A., S. W. Human, and S. Chakraborti. 2010. A phase
tic. Communications in Statistics—Theory and Methods
I nonparametric Shewhart-type control chart based on the
24:1597–623.
median. Journal of Applied Statistics 37:1795–813.
Bakir, S. T. 2004. A distribution-free Shewhart quality con-
Hackl, P., and J. Ledolter. 1991. A control chart based on ranks.
trol chart based on signed-ranks. Quality Engineering 16:
Journal of Quality Technology 23:117–24.
611–21.
Hawkins, D. M. 1991. Multivariate quality control based on
Bakir, S. T., and M. R. Reynolds, Jr., 1979. A nonparametric pro-
regression-adjusted variables. Technometrics 33:61–75.
cedure for process control based on within group ranking.
Hawkins, D. M., and Q. Deng. 2010. A nonparametric change-
Technometrics 21:175–83.
point control chart. Journal of Quality Technology 42:
Boone, J. M., and S. Chakraborti. 2012. Two simple Shewhart-
165–73.
type multivariate nonparametric control charts. Applied
Hawkins, D. M., and D. H. Olwell. 1998. Cumulative sum charts
Stochastic Models in Business and Industry 28:130–40.
and charting for quality improvement. New York: Springer-
Borror, C. M., D. C. Montgomery, and G. C. Runger. 1999.
Verlag.
Robustness of the EWMA control chart to non-normality.
Hawkins, D. M., P. Qiu, and C. W. Kang. 2003. The changepoint
Journal of Quality Technology 31:309–16.
model for statistical process control. Journal of Quality Tech-
Bush, H. M., P. Chongfuangprinya, V. C. P. Chen, T. Sukchotrat,
nology 35:355–66.
and S. B. Kim. 2010. Nonparametric multivariate control
64 P. QIU

Holland, M. D., and D. M. Hawkins. 2014. A control chart based Noorossanaa, R., and M. Ayoubi. 2011. Profile monitoring
on a nonparametric multivariate change-point model. Jour- using nonparametric bootstrap T 2 control chart. Com-
nal of Quality Technology 46:63–77. munications in Statistics—Simulation and Computation 41:
Humana, S. W., P. Kritzingera, and S. Chakrabortiborti. 2011. 302–15.
Robustness of the EWMA control chart for individual Oja, H. 2010. Multivariate nonparametric methods with R. New
observations. Journal of Applied Statistics 38:2071–87. York, NY: Springer-Verlag.
Jiang, W., L. Shu, and K.-L. Tsui. 2011. Weighted CUSUM con- Page, E. S. 1954. Continuous inspection scheme. Biometrika
trol charts for monitoring Poisson processes with varying 41:100–15.
sample sizes. Journal of Quality Technology 43:346–62. Park, C., and M. R. Reynolds, Jr. 1987. Nonparametric
Jones, L. A., and W. H. Woodall. 1998. The Performance of procedures for monitoring a location parameter based
Bootstrap Control Charts. Journal of Quality Technology on linear placement statistics. Sequential Analysis 6:
30(4):362–375. 303–23.
Jones-Farmer, L. A., V. Jordan, and C. W. Champ. 2009. Phaladiganon, P., S. B. Kim, V. C. Chen, J. Baek, and S. K.
Distribution-free phase I control charts for subgroup loca- Park. 2011. Bootstrap-based T 2 multivariate control charts.
tion. Journal of Quality Technology 41:304–17. Communications in Statistics—Simulation and Computation
Lehmann, E. L. 1986. Testing statistical hypotheses. New York, 40:645–62.
NY: John Wiley & Sons. Qiu, P. 2008. Distribution-free multivariate process control
Li, J., X. Zhang, and D. R. Jeske. 2013. Nonparametric multivari- based on log-linear modeling. IIE Transactions 40:664–77.
ate CUSUM control charts for location and scale changes. Qiu, P. 2014. Introduction to statistical process control. Boca
Journal of Nonparametric Statistics 25:1–20. Raton, FL: Chapman Hall/CRC.
Li, S. Y., L. C. Tang, and S. H. Ng. 2010. Nonparametric CUSUM Qiu, P. 2017. Statistical process control charts as a tool for ana-
and EWMA control charts for detecting mean shifts. Journal lyzing big data. In Big and complex data analysis: Statisti-
of Quality Technology 42:209–26. cal methodologies and applications, ed. E. Ahmed, 123–138.
Li, Z., Y. Dai, and Z. Wang. 2014. Multivariate change point con- New York, NY: Springer-Verlag.
trol chart based on data depth for phase I analysis. Com- Qiu, P. 2018. Jump regression, image processing, and quality
munications in Statistics-Simulation and Computation 43: control (with discussions). Quality Engineering, 30 (1): 137–
1490–507. 153.
Liu, L., F. Tsung, and J. Zhang. 2014. Adaptive nonparametric Qiu, P., and D. M. Hawkins. 2001. A rank based multivariate
CUSUM scheme for detecting unknown shifts in location. CUSUM procedure. Technometrics 43:120–32.
International Journal of Production Research 52:1592–606. Qiu, P., and D. M. Hawkins. 2003. A nonparametric multivari-
Liu, R. Y. 1990. On a notion of data depth based on random ate CUSUM procedure for detecting shifts in all directions.
simplices. Annals of Statistics 18:405–14. JRSS-D (The Statistician) 52:151–64.
Liu, R. Y. 1995. Control charts for multivariate processes. Qiu, P., and Z. Li. 2011a. Distribution-free monitoring of
Journal of the American Statistical Association 90: univariate processes. Statistics and Probability Letters 81:
1380–87. 1833–40.
Liu, R. Y., and K. Singh, 1993. A quality index based on data Qiu, P., and Z. Li. 2011b. On nonparametric statistical pro-
depth and multivariate rank tests. Journal of the American cess control of univariate processes. Technometrics 53:
Statistical Association 88:257–60. 390–405.
Liu, R. Y., K. Singh, and J. H. Teng. 2004. DDMA-charts: Qiu, P., and J. Zhang. 2015. On phase II SPC in cases when nor-
Nonparametric multivariate moving average control charts mality is invalid. Quality and Reliability Engineering Inter-
based on data depth. Allgemeines Statisches Archiv 88: national 31:27–35.
235–58. Reynolds, Jr, M. R., and J. Lou. 2010. An evaluation of a GLR
Lowry, C. A., W. H. Woodall, C. W. Champ, and S. E. Rigdon. control chart for monitoring the process mean. Journal of
1992. A multivariate exponentially weighted moving aver- Quality Technology 42:287–310.
age control chart. Technometrics 34:46–53. Reynolds, Jr, M. R., and Z. G. Stoumbos. 2000. A general
Lu, S. L. 2015. An extended nonparametric exponentially approach to modeling CUSUM charts for a proportion. IIE
weighted moving average sign control chart. Quality and Transactions 32:515–35.
Reliability Engineering International 31:3–13. Ritov, Y. 1990. Decision theoretic optimality of the CUSUM
Montgomery, D. C. 2012. Introduction to statistical quality con- procedure. Annals of Statistics 18:1464–69.
trol. New York, NY: John Wiley & Sons. Roberts, S. V. 1959. Control chart tests based on geometric mov-
Moustakides, G. V. 1986. Optimal stopping times for detect- ing averages. Technometrics 1:239–50.
ing changes in distributions. Annals of Statistics 14: Ross, G. J., and N. M. Adams. 2012. Two nonparametric control
1379–87. charts for detecting arbitrary distribution changes. Journal
Mukherjee, A., M. A. Graham, and S. Chakraborti. 2013. of Quality Technology 44:102–16.
Distribution-free exceedance CUSUM control charts for Ross, G. J., D. K. Tasoulis, and N. M. Adams. 2011. Nonpara-
location. Communications in Statistics—Simulation and metric monitoring of data streams for changes in location
Computation 42:1153–87. and scale. Technometrics 53:379–89.
Ning, W., A. B. Yeh, X. Wu, and B. Wang. 2015. A nonparamet- Shewhart, W. A. 1931. Economic control of quality of manufac-
ric phase I control chart for individual observations based tured product. New York, NY: D. Van Nostrand Company.
on empirical likelihood ratio. Quality and Reliability Engi- Shiau, J. J., and Y. C. Hsu. 2005. Robustness of the EWMA
neering International 31:37–55. control chart to non-normality for autocorrelated
JOURNAL OF QUALITY TECHNOLOGY 65

processes. Quality Technology and Quantitative Man- Tracy, N. D., J. C. Young, and R. L. Mason. 1992. Multivariate
agement 2:125–46. control charts for individual observations. Journal of Quality
Steiner, S. H., and R. J. MacKay. 2000. Monitoring processes Technology 24:88–95.
with highly censored data. Journal of Quality Technology Tyler, D. E. 1987. A distribution-free M-estimator of multivari-
32:199–208. ate scatter. Annals of Statistics 15:234–51.
Stoumbos, Z. G., and L. A. Jones. 2000. On the properties Yashchin, E. 1993. Statistical control schemes: Methods, appli-
and design of individual control charts based on simplicial cations, and generalizations. International Statistical Review
depth. Nonlinear Studies 7:147–78. 61:41–66.
Stoumbos, Z. G., L. A. Jones, W. H. Woodall, and M. R. Zhang, J., Y. Kang, Y. Yang, and P. Qiu. 2015. Statistical monitor-
Reynolds, Jr. 2001. On nonparametric multivariate control ing of the hand, foot, and mouth disease in China. Biomet-
charts based on data depth. Frontiers in Statistical Quality rics 71:841–50.
Control 6:207–27. Zou, C., and F. Tsung. 2010. Likelihood ratio-based
Stoumbos, Z. G., and J. H. Sullivan. 2002. Robustness to non- distribution-free EWMA control charts. Journal of Quality
normality of the multivariate EWMA control chart. Journal Technology 42:1–23.
of Quality Technology 34:260–76. Zou, C., and F. Tsung. 2011. A multivariate sign EWMA control
Testik, M. C., G. C. Runger, and C. M. Borror. 2003. chart. Technometrics 53:84–97.
Robustness properties of multivariate EWMA control Zou, C., Z. Wang, and F. Tsung. 2012. A spatial rank-based
charts. Quality and Reliability Engineering International 19: multivariate EWMA control chart. Naval Research Logistics
31–38. 59:91–110.
Copyright of Journal of Quality Technology is the property of American Society for Quality,
Inc. and its content may not be copied or emailed to multiple sites or posted to a listserv
without the copyright holder's express written permission. However, users may print,
download, or email articles for individual use.

You might also like