Download as pdf or txt
Download as pdf or txt
You are on page 1of 11

CUTHBERT DANIEL

1 16 Pinehurst Ave., New York 33, N. Y.

Application of Statistical Methods


in Chemical Engineering
1 The backlog of statistical knowledge ready to be applied to chemical
engineering problems is large
b Best immediate prospect is that some engineers will study this research
and decide to use some of it

MOST chemical engineers spend


part of their time interpreting data.
tion. There are, however, many cases in
the experience of every statistician, and
must be added the further adverse cir-
cumstance that the numerical work re-
Kearly all data require interpretation; quite a few in the literature, in which quired to extract the little information
only rarely do the numbers force a single random error has been strikingly reduced present is expensive and complex, usually
indubitable conclusion on the inter- by statistical means. Some examples requiring large-scale computing ma-
preter. Even when “the data” consist appear later. chinery.
of a single number, the experienced A major contribution of statistics to When the effects of a considerable
engineer knows that another measure- chemical engineering lies in giving engi- number of independently variable fac-
ment taken under essentially the same neers a means of measuring the random tors are to be studied, factorial designs
conditions would not always agree ex- variability of their processes, together and their fractional replications are
actly with the number a t hand. Such with a means of partitioning the vari- available. I t is the writer’s opinion
discrepancies are often shrugged off ability into parts allocable to different that the greatest gains from current
with some unverified statement: “Condi- sources. Thus, if the engineer could statistical methods applied to chemical
tions have changed” or “We thought actually say “80% of our observed prod- engineering problems will be made in
that carload was poor” or “The assay uct variability comes from raw-material this area. The practicability of study-
lab again.” variation, 159& from process variation, ing the simultaneous impact of several
The fact that every system possesses and 5% from assay uncertainty,” he factors (five to fifteen are oftcn man-
some intrinsic random variability is not would take entirely different action from ageable) is a windfall unsuspected by
easily accepted by chemical engineers. that needed if the percentages were re- most engineers.
Perhaps part of the reason for this re- spectively 20. 40, and 40. This sort of By far the commonest application of
jection is the engineer‘s lack of knowl- partitioning (discussed later) is called statistics to the proccss industries is in
edge of how to be objective in drawing components-of-variance analysis. the use of some of the techniques of sta-
conclusions from uncertain data. He When large masses of data on pres- tistical quality control. Simple plotting
can hardly be blamed for this since none sures, temperatures, concentrations, and of the averages of small adjacent groups
of the standard texts-e.g., Walker, other operating conditions have already of readings gives many operating engi-
Lewis, McAdams, and Gilliland‘s “Prin- been accumulated, it is natural to be- neers a chance to visualize the operation
ciples of Chemical Engineering” (22), lieve that they must contain a great deal of the plant more effectively than the
Perry’s “Chemical Engineers’ Hand- of information about the process. The usual columns of hourly or shift figures.
book” (76), Sherwood and Reed‘s analysis of these data is usually handled Parallel plotting of the scatter (range)
“Applied Mathematics in Chemical by the statistical device called “multiple within the small sets gives one a feeling
Engineering” (20) makes even a passing regression” analysis. The yield of in- for how large the variability of neighbor-
reference to modern statistical methods. formation from these data, per dollar ing points is and, even more important,
Random variation and the uncer- spent, is generally low. They contain a sense of how stable thisvariability is.
tainty it brings with it cannot be entirely much less information than data taken If the papers published in the Trans-
eliminated by mathematical manipula- under planned conditions. T o this actions of the National Conocntions of the
American Society for Quality Control are
typical, then improvements in stability
can often be produced even with very
little attention given to such tiresome
details as assumptions. The simple
attention-calling of the range and mean
charts appears sufficient to induce im-
provement in many cases.
More careful appIication of the serious
ideas behind statistical quality control
can hardly fail to produce even better
results. The idea of verifying the sta-
bility of the population sampled (estab-
lishing a “state of statistical control,”

1 392 INDUSTRIAL AND ENGINEERING CHEMISTRY


in W. A. Shewhart’s phrase) is a serious ating engineers, and those connected
one, cutting deeply into the whole set with assay and control laboratories, - =
of ideas and practices called scientific find quality-control charts of direct and
method. T h e idea of rational sub- easy use, both in judging process sta-
groups gives in many cases a real hand- bility and in evaluating the analyses 92 high temperature
hold in the separation of measurement reported by the laboratories. The re-
error from process error. T h e idea of cently developed methods of acceptance
examining long sequences of process sampling by variables have found ap-
measurements in rational groups, to plication in plants which receive raw
materials in batches, or in lots that have 84
watch for trends, jumps, mavericks, and Results at
mistakes, is the third first-rate idea of serious heterogeneity-e.g., iron ore- 82 . low temperature
statistical quality control. and which therefore require some sophis- 80
Perhaps the principal use of statistical tication in sampling methods.
-0 1 2
quality control for chemical engineers T o return to the general discipline TIMU scale
will turn out to be as an educational step called here “the statistical part of ex-
Solid lines show no interaction between
in the development of men who will perimental design,” developed origi-
temperature and time. Lower solid
want to make more thoroughgoing nally by Fisher (72),it appears that some
line and upper dotted line illustrate
applications of statistics. Statistical parts of this field apply directly when-
interaction between temperature and
quality control as it may be applied in ever more or less variable measure-
time
the process industries is discussed further ments of the effects of several factors are
in a later s-ction. to be made. The general problem of so
The statistical devices mentioned so placing experimental points that they should be on the desk of any engineer
far (componen ts-of-variance analysis, de- give maximum information about the using statistical methods.
sign of experiments, multiple regression, effects of several independent variables “Statistical Analysis in Chemistry and
and statistical quality control), even has been solved for many common situ- the Chemical Industry,” by Bennett
though modern, are standard and so to and Franklin ( Z ) , is a one-volume text
speak ready-made. \%‘henever the as- planned to take a patient engineer most
sumptions made in their derivations of the way from the very beginning to
appear justified, efficient use may be medium competence in application.
made of these tools. The bulk of this Its balance between theory and appli-
paper is directed to discussing these cation is excellent, especially as its au-
methods and some others Lvhich have thors show frequent recognition of the
already attained wide use. fact that it is only theory that is applied.
Some statisticians feel that these ready- No experienced engineer will expect a
made tools are not likely to remain the single text to answer all his questions,
major contributions of statistics, that or to give a clear summary of all the
-“ , latest developments. Some develop-
they are not modern (the main prin- -3r -‘d 0 +e 72” + 0

ciples were clearly understood twenty ments that were already well along when
The normal law of error the book was being written do not ap-
years ago), and that new statistical tools
should be forged, more closely fitted to pear. S e w work on error rates and
chemical engineering situations. These ations. T h e work of Box and his associ- allowances-mainly by Tukey and his
contentions may well be true. Much ates (3, 6, 8 ) is finding increasing appli- associates (27)-on fractional replication
can be learned, however, by using the cation, especially in bench and pilot of factorial designs (7, 14) and, on the
available and tested methods. Their plant work. fitting of response surfaces (3-6) are in
lack of modernity does not mean that O n the other hand plant scale tests the writer’s opinion not fully treated
chemical engineers have used them with their great expense, limited range in this volume. But it is quite safe to
widely; indeed their age has been at- of variation of factors, and high vari- say that engineers who have mastered
tained in occasional and cautious ap- ability, are usually best run as “fac- the contents of this single book will gen-
plication. torial designs.” An unexpected ad- erally know what to read next, and they
Some parts of statistics apply with vantage of these designs is that they can will actually know more about the
about the same cogency to laboratory usually be arranged so that no out-of- field than most engineers who have
scale engineering research, to pilot plant specification material is produced. been brought up on less important texts.
work, to semiworks process development, Statistical Literature. There are now Probably the very next book to be
and to full scale plant tests. This degree available several works written expressly read by the engineer who has studied
of generality is claimed for regression for chemists and chemical engineers by Bennett and Franklin will be “Design
analysis and for the related area- statisticians of rank. Two of these de- and Analysis of Industrial Experiments,”
statistical design of experiments. Oper- serve immediate reference since thev edited by Davies (9). Many new ex-

An experimental plan which uses 1/2 (figure a t left), 1/4 (center), 1/8 (right) of the treatment combinations from a com-
plete factorial design with seven factors, each a t two levels. (Letters denote the factors, subscripts the levels of factors,
shaded squares the combinations to b e used)

VOL. 48, NO. 9 SEPTEMBER 1956 1393


Millions and millions of digits-indispensable building blocks in statistical design of many experiments

amplrs of factorial designs are given In many universities chemical engineer- to add to these the departments at Vir-
and a n excellent summary of the main ing students can take courses in statistics ginia Polytechnic Institute and a t the
points in Box’s surface-fitting program given in other departments. The range University of Illinois.)
takes up a full chapter. The research or of excellence of these courses is so wide Statistical and Scientific Methods.
development engineer who has used or that it lvould seem safer to plan self- There is a feeling among some engineers
read both these works will be able to teaching until some more serious system and scientists that statistical methods
decide responsibly on the extent to which of accrediting is in wide use. are put forward as a substitute for other
statistical design can aid him in his work. Something can be judged by the methods usually referred to as “scien-
Some will wish to extend their insight quality of texrs used in statistics courses tific.” This feeling is admittedly re-
into the designs proposed by reading for engineers. If either of the two longer enforced by the scientific limitations
some of the historically important papers works praised above or Hald‘s book (73) of some of the proponents of statistical
and books. The leading papers are by is used. and if the instructor has himself methods. The defective integration of
Fisher and by his colleagues, especially some experience beyond the field of qual- modern staristical ideas with the general
Yates. Forty-three of Fisher‘s papers ity control in applying statistical meth- scientific program has resulted in the
are available in ( 7 7 ) Yates’s pamphlet ods to chemical engineering problems, more-or-less ingroxvn development of a
“Design and Analysis of Factorial then the course is in all likelihood a sizable bolus of material that now must
Experiments” (24), readable by chemi- recommendable one. But it appears- be accepted or rejected as a whole.
cal engineers with average mathematical a t least in the East and Midwest-that There is little content to the argument
background. A good summary of the these conditions are not met by many of often heard from scientists bvho are not
underlying distribution theory is to be the courses offered. statisticians to the effect that they prefer
found in Kempthorne’s book ( 7 4 ) . An interesting interim effort to bridgr exact methods and hence will have
Some engineers and chemists, finding the gap between demand and supply of nothing to do with statistics.
the ideas of statistics unfamiliar and serious statistical teaching is being made The real decisions to be made are :
abstruse, may wish to start their reading by some schools in the form of short a. Between archaic and modern sta-
,,.
with Youden’s little book (25). Written “intensive” courses. These courses tend tistics
by a chemist who has himself made to become propaganda sessions if they 6. Between less and more efficient
major contributions to statistical meth- are shorter than a week, and are some- methods of collecting data
ods, this work has been criticized only times merely extended exhortations when c. Betlveen judging by fashion and
as being perhaps too persuasive. longer. The minimum course from current practice and judging by taking
An optimist might report cheerfully which something can be gained by a the time to make a more logical ap-
praisal
on the prospects, the trends and the op- willing chemical engineer is: in this
portunities in statistical education, but ivriter‘s opinion. a 6-week course a t one Each of these choices has been stated
the record of current activities is short. of the major statistical centers of the ro make clear which is the desirable alter-
A term of statistics is now required in the country. (hlost statisticians would judge native. Scientists can hardly maintain
course work of undergraduates in chemi- the statistical departments of the State the convention that whatever they are
cal engineering a t Cornell. As far as College of North Carolina, of Iowa Stare noiv doing is scientific and that all elsr
the writer knows, no other school has University, and of Princeton. Columbia, is not scientific.
such a requirement. A similar elective Stanford. and the University of California Many introductory- texts in physical
course a t Princeton is well attended. as in this category. hlan)- ivould ivant chemistry. elementary physics, and ai)-

1 394 INDUSTRIAL AND ENGINEERING CHEMISTRY


plied mathematics contain a section on (1, 3, 4), and (2, 3, 4 ) . Direct com- ment-versus-standard method. I t is aIsu
“errors of measurement.” This section is parisons of, say, 1 with 2 are possible in possible to extract some information
usually a n oversimplified presentation the first two of the sets. An indirect from these 18 “partially balanced in-
of Gauss‘s propagation-of-error equa- comparison is possible for the latter two complete blocks” abouE the random
tions, making the assumption (usually sets since 1 is compared with the average error made inside blocks.
unstated) that if there is random vari- of 3 and 4 in the third set, and 2 with the Exactly analogous remarks hold for
ation, it must be in the measuring oper- average of 3 and 4 in the fourth set. The the other eight treatments, and so this
ation. I t violates the scientists‘ deter- proper means of weighting these three design will in general give more precise
ministic preconceptions to consider that comparisons will not be detailed here: results than the usual plan. Of course?
random variation may be present in the but it is worth remarking that informa- if the standard is still needed for com-
process itself. This objection persists tion from all four blocks is used in draw- parison purposes, one of the niny num-
in spite of the successes of random ing conclusions about every difference. bers used must be allocated to it.
(“stochastic”) models in many parts of Instead of using some standard catalyst Drift and Time Trends During
pure science, from the kinetic gas theory in every block, which would consume Experimentation. Since drift is usu-
and statistical mechanics to particle one third of the test time, lve can use ally less between adjacent units or ron-
physics and radiation chemistry. The pairs of catalysts as go-betweens for secutive times, the ideas reviewed in thr.
preference for exact methods is entirely making indirect comparisons. preceding section have also direct use
natural and such methods will al\vays be T h e variety of such balanced block de- here. Youden’s work. some of i t ahead!.
striven for. but it seems safe to say that signs is great and has been very com- known to chemical engineers through
there \vi11 for some time remain a n area pletely worked out. -An introductory his columns in Industrial and Engineering
of scientific endeavor in which such exact discussion appears in Youden (76). Chemistry (1954-5). is especially note-
methods are not a t hand. The graphs More detailed treatments appear in worthy because of its combination oi‘
which appear in every issue of every Kempthorne ( 7 I): in Cochran and Cox ingenuity and fitness. His paper (76)
chemical engineering journal-and (8),and in Davies ( 9 ) . shoFvs hoiv to arrange measurements in
ivhich shoiv the data points-support Controls Generalized. As suggested one sequence so as to have all compari-
this contention. above, the scientific dogma of “adequate sons practically unaffected by instru-
For many readers, it is not necessary controls’‘ is capable of considerable mental drift.
to add the comment that statistical generalization and strengthening. es- A more exact but more difficult
methods cannot be substituted for tech- pecially \\-hen small blocks of experi- scheme, allowing for elimination of an)-
nical competence or for creative scien- mental material tend to be more homo- polynomial trend (of kno\vn degree)
tific thought. They are only offered as geneous than larger ones. Suppose that is discussed by Box (..I, 5). This method
efficient means of testing scientific ideas, only two treatments (one comparison) is appropriate for estimating the first-
of measuring properties, of screening can be arranged inside the natural block. order effects of any number of inde-
large numbers of alternative possibili- The block may be a day, a shift. a batch pendently variable factors. First-order
ties, of aiding in making the decisions of raw material: a pair of reactors, or multifactor designs arc also discussed
that plague all technical men, especially xvhatever other natural grouping is ex- later.
chemical engineers. A clearer exposi- pected to be homogeneous. To com- Extra Measurements. It is evident
tion of this point of vierv is to be found in pare, say? nine treatments using a con- from the experimental plans outlinrd
“Introduction to Research,” by TVilson trol each time means that half of all the thus far, that all possess a high degree of
(.??). work done is given over to the control. internal balance and symmetry. It
Suppose each pair is done in duplica- follows that any later additions, fur
tion. Eighteen blocks are required and example. a n extra set of measurements
Improvements in Precision a n d each condition or treatment is looked a t a n intermediate temperature level,
Accuracy of Measurements a t twice. But different treatments can \vi11 not ordinarily have the built-in
be compared only through their (match- richness of comparability that the earlier
Paired Comparisons Versus Ab-
ing) standards or controls, rvhich brings sets shoLved. I n many cases it is possiblc
solute Measurements. For some pur-
in some error each time. to reserve some of the several batches of
poses so-called “absolute” measurements
Consider in contrast to the nine treat- raw material so that, if further com-
are required, but more often than not it
ments, each paired \vith a standard in parisons are needed, material \vi11 bc
is the comparisons between several con-
duplicate, the following set of 18 pairs available for linking the new treatments
ditions that are of major interest. Such
direct comparisons must usually be made taken from Kempthorne ( 7 1 , p. 554): with the old. Xgain Youden has mad?
”close together.” Color matching, flavor (1: 21, (1, 31, (1: 41, (1: a> ( 2 , 31, (2: 3, the earliest contribution.
preferencing, and scent grading all have (2, 81,(33 61, (3, 91, (4: 3, (4: 61, (4: 71, A discussion of successive blocks in a
their more objective counterparcs in (5, 6 ) . (5: 8), ( 6 : 91%( 7 >8): ( 7 >91, (8,9). multifactor experiment. to be run onc
chemical engineering. It is usually better Each of the nine treatments or condi- after another when improved sensitivity
to test the effect of raising some tempera- tions can be compared \vith each other is desired, is given in the section on ap-
ture in adjacent time periods, and with but not all comparisons are direct. For plications.
the same lot of raw material, if that is pos- example, 1 is compared with 2: 3. 4, Carrying this idea to the point wherc,
sible. Of course, the effect \vi11 have and 7 directly. These comparisons are the decision to make each measurement
to be checked a t other (adjacent pairs exactly as precise as the duplicate com- rests on the results of the last measure-
of) time periods, and n i t h other lots of parisons through a standard considered ment gives us the devices called se-
raw material. But it is generally-and above. But in addition, 1 may be com- quential plans. These are also dis-
rightly-believed that comparisons that pared with 2 by matching (1, 3) Lvith cussed under “applications.”
are more closely grouped are likely to (2, 3), also by matching ( l ! 4), (4: 5) Control of Laboratory Precision and
be more sensitive indicators of the dif- and (2, 5): and finally by matching Accuracy. Although many analytical
ference to be estimated. (1, 7), (7, 8) and (2, 8 ) . The proper chemists incline to consider such action
Suppose four different catalysts are weighting to give the three sorts of com- as impugning their professional com-
to be compared, and that only three parison is not shown here) but it is clear petence, engineers often send in duplicate
can be tested in one day. The obvious that the comparison using all these bits samples for assay, ivith different code
subdivision to make is into all possible of information is considerably more pre- letters. I t is no secret that such dupli-
scts of 3-namely, (1, 2, 3), ( l ? 2; 4), cise than by the classical each-treat- cates do not in general agree so \vel1 as

VOL. 48, NO. 9 SEPTEMBER 1956 1395


,the parallel pairs that chemists tradi- a rather generalized example. A cor- casionally a series approximation can be
tionally run. relator is confronted with a volume of developed to bring this part of assumption
A more justifiable course would be for plant data which gives the daily values a back into force.
the assay laboratory to publish regularly of siveral outcomes (yields, various If assumption b requires, as it often
its precision for each type of assay. These properties of a product being made, etc.) does, that a large part of the data be re-
precisions would have to be determined along with the measured values of a moved from the study, this is mainly an
by “sleepers” (duplicate samples uni- number of plant operating conditions and advantage, since it serves to emphasize
dentifiable by the analysts, but known some repeatedly measured properties of that voluminous data do not necessarily
to the Iaboratory director). If some of incoming streams. contain a large amount of information.
the sleepers are also made up as stand- The outcomes-of which J are re- If certain fixed sets of x , are present in
ards, then a running pair of plots-one corded-are to be called y, (where j = subgroupings of the original collection,
of the ranges of duplicate pairs, the other 1, 2, 3. , . . , J ) . The independent vari- then each may be analyzed separately.
of the averages of the pairs-can be used ables will be named xi, where i = 1, 2 , But if, as is more usual, there is little
to keep track of the analyst’s perform- 3, . . . , I. There are then I “independ- uniformity in rhe names of the missing
ance both as to the stability of his pre- ent” factors. The main assumptions xi, then only the “complete” points can
cision (as measured by the ranges that must be satisfied for the simplest or be analyzed. (A recent case in the
behaving stably), and of his accuracy classical (least-square) fit of a set of writer’s experience saw the reduction of
(as judged by the pair averages hover- equations of the form Y , = f , ( x , ) are as 1300 datum cards to 300, in order to
ing around the true or standard value, follows : meet this restriction.)
with no long runs on either side). Na- Assumption c is sometimes mistakenly
turally such a chart system should be a. An equation connecting each y j ignored. Each x , that is measured with
set up with the aid of a quality control with the x i is known as to form, but has error comparable with the scatter of xi
engineer. Naturally too, as stability several unknown constants in it. The in the data will generally be judged to
over considerable periods of time is equations are (for the simplest case) have no influence, or to have one that is
demonstrated, relaxation in the fre- linear in the unknown constants, but not a n underestimate of its true influence.
quency of sampling should be arranged. necessarily in the xi. Thus it is as-
However, if the observed values of
sumed that
some x , cluster around a few target- or
specification-values from which they
Applications of Statistics differ because of control difficulties (not
to Chemical Engineering Research measurement errors), then “unbiased”
and Development estimates of the true bk may bc found.
where the form of all the ppl~(xJare known The “error in y , for fixed x,,” sometimes
Use of Data Already Available. and only the constants b k ( k = 0, 1, 2, called the “error of estimate,” is usually
Statistical advice is usually sought by . , . , K ) are to be determined. Each assumed independent of the x, when a
the chemical engineer who has collected p may be a function of several X I . It least-squares fit is made. If the error
a mass of data which resists “ordinary” is not judged necessary here to adjoin of estimate varies widely from point to
the subscriptj to every term on the right. point in x , space, then unbiased estimates
interpretation. The data may have b. Each datum point is complete, con-
been accumulated over a term of plant sisting of a t least one measurement of of the 61,may still be obtained but the
operation or may be the result of a num- each x1 and one of each y,; a “datum precision of these estimates may be
ber of laboratory experiments or suc- point” is then a set o f 1 f J numbers. very low. The alternative is to weight
cessive pilot plant runs. Indeed such c. Each x, is quite precisely measured, the datum points inversely by their local
collections are sometimes made the compared with its actual spread in the scatter. Such a “weighted regression
occasion for “trials” of statistical methods. data under study. analysis” is not easy, and is most fruitful
I t should be said at once that such trials d. The scatter of each y j for fixed x i , when the weights are rather well known
although unknown, is about the same or vary very widely.
are usually failures. A common reason
regardless of the set of values a t which Once these assumptions are shown to
for failure is that the data are inhomo- the x1 were fixed.
geneous. Inhomogeneity as used here e. No pair of x i shows very close apply tolerably well for a particular set of
means that some conditions (not measured linear correlation. (If such linear cor- data, then the work of estimating the
and often not even namable) have relation is found, it is easy to reinstate constants, bk: and their uncertainties,
changed (slowly or suddenly), so that this assumption by dropping one of the is not too onerous. The estimation of
different sections of the data are actually correlated pair.) thirty such values, with the precision of
samples from quite different populations. each, would have been considered quite
Perhaps almost as frequent a cause of If assumption a (known form of equa- unmanageable twenty years ago, but is
failure is the personal one. the self- tion) is not a safe one, it may be prac- now commonplace, thanks to the de-
styled statistician not having been careful ticable to test the data for the existence velopment of automatic computing ma-
to master the assumptions and limitations of some connection simply by seeing chinery.
of the data-reduction methods he uses. whether “near duplicates”-i.e., y, values One useful way to present the con-
The so-called multiple-regression taken for the (small) sets of nearly match- clusions of such a numerical analysis is
methods are by far the most useful in ing x,, are more closely equal than non- by means of two equations for each de-
reducing massive data to a few manage- neighbors-Le., y 1 values taken for widely pendent variable. The first is a pre-
able equations. In the hands of com- differing sets of xi values. If near- diction equation, which gives the best
petent workers aided by modern com- duplicate y, do in fact match better than prediction of E’, values (Y, being the true
puting machinery, such data can some- nonneighbors (exact statistical tests are value of whichj, is an observation). given
times be made to yield valuable informa- not available), then there must be some any set of x , in the range studied. The
tion. Even in the contrary cases, where dependence of y , on the x , , and full en- second equation shows how well the first
“no significant regression” is found, the gineering and technical effort would be one predicts. In terms of the p K , this
analysis may pin the possible effects of brought to bear to derive the form of the equation is that of a hyperboloid of two
some factors into a small enough range relation, thus reinstating assumption a. sheets. I t is usually displayed as a table
so that useful statements may be made If the form of equation relatingyj to the of values showing the error of prediction
about the lack ofinfluence of these factors. xi is known but is not linear in the un- for various favorable and unfavorable
Many of the statistical contributions to known constants, then laborious trial- combinations of the x,. This form of
data correlation can be indicated through and-error methods must be used. Oc- judgment of the quality of prediction (or

1 396 INDUSTRIAL AND ENGINEERING CHEMISTRY


of representation of the data used) is more The paragraphs just preceding are not where ub is the the standard error of the
intelligible to engineers than is a large planned to make their readers better slope
collection of statements about the “sig- analysts, but only to indicate some rather u is the standard deviation of dupli-
nificance” of the various constants in the common situations which are now cate y values
prediction equation. manageable by fairly well understood -
S ( x 2)z is the sum of squares of
Many engineering readers will have methods. These methods, while not deviations of the chosen x values from
long since asked themselves, What is so simple as to be mastered overnight, their average, 3
wrong with current common sense are not so complicated that chemical T h e measurements may be equally
methods? The common sense method engineers need despair. It is fortunate spaced over the interval X I to XII, but it
usually starts by winnowing the available that a t least two first-rate expositions seems likely that the points near the
collection of numbers, retaining only by engineers and chemists are now avail- middle of the x range are giving little
those that are thought by experienced able (2, 73). information about the slope. Equation
men to have some relevance to the The engineer who is approaching 4 confirms this conjecture. At the other
current problem. No one objects to this multiple regression for the first time will extreme the data may all be taken with x
phase of the “c.s. method.” probably want to have both books a t a t x I or a t xII, and again it seems reason-
T h e second step, sometimes used even his desk, since different aspects of nomen- able to take iV/2 observations a t each
as part of the winnowing process, is that clature, order of presentation, and even end. Using only Equation 4,one easily
of graphing some of the important in- subject matter of examples, will appeal finds that the ratio of the variance of
dependent variables against one (or to different readers. A word of warn- slope for the equally spaced case to the
more) of the dependent ones. A serious ing should be given to the engineer who variance of slope for the equally bunched
error is sometimes made here. To take has read some book on data fitting and -
case is 3(N 1)/(N + 1). (The square
a very simple example, suppose that the who feels that the methods he is familiar of a standard deviation or standard error
true relation is with are much simpler and probably is called a variance.)
nearly as good. Many results of great Thus for any tolerably large A’ the
usefulness were given by Gauss. These equally bunched arrangement is in this
were partly rediscovered and greatly important sense three times as effective
and that in the data a t hand XI and x2 extended by Fisher. Perhaps a fair as equal spacing. Put the other way,
are in fact linearly correlated so that touchstone of the responsibility of recom- almost three times as many equally
mendations on curve and surface-fitting spaced measurements will be required
would be the extent of reference to Gauss to reach any given precision of slope
and to Fisher. An excellent set of supple- estimation as will be needed if the
The constants bo, b l , 6 2 , G and d are not mentary readings on regression is given measurements can be equally bunched
known. If a single scatter diagram of in (73). Of use to the engineer con- over the same x range. The new
y versus X I is now attempted, it is clear fronted with a multivariate problem property of a set of data that is utilized
that the rate of change of Y with x I for (uncertainty in several variables in a n here is its arrangement. The extensions
fixed xp that will be estimated is ( b l+ bzd) equation) is the book by Rao (78), but and generalizations of this motion are
a large part of the field, unfamiliar to
and not bl. I t is entirely possible for this work is far more difficult than those
the former quantity to be widely different hitherto named. Research in multi- most engineers, called the statistical
from the latter. Such confusions are variate analysis is proceeding rapidly, design of experiments. Some of these
eliminated by the multiple-regression but the services of a statistician active in developments will be discussed later.
type of analysis. the field will be required for up-to- The connection between Equation 4
Put more generally, it is a pleasing and date application. and the apparent paucity of information
quite general property of least-squares The results of a careful multiple-re- in much plant and operating data is
analysis (when the assumptions given gression analysis are often discouraging. easy to make. Inspection of such data,
above are all roughly satisfied) that they Great masses of data are fed into com- classified by each X I in turn, usually re-
give unbiased estimates of the true (un- plicated calculations only to find that a veals that most of the data are grouped
known) constants. The term unbiased few slopes are estimated with about rather closely around the average value
refers to a long range property. In a 4=507, precision. I t violates our sense for that X I . T h a t is just the position that
particular case some of the estimated of fitness to find so little issue from such gives least information about the rate of
slopes will be larger than their true a voluminous substrate. A more ob- change of y with X I .
values, and some smaller. jective sense of the proportionality factors It is quite common to find that pairs of
The writer has occasionally en- relating amount of data to amount of x I vary together (or in opposition) so that
countered enterprising engineers who information can be developed by con- it is not easy to separate the effects of the
have decided on their own to tackle sidering a few elementary examples. two. Such correlations, which can occur
cases \\.here there is heavy error in one or for many pairs of X I in a single set of
Suppose that the relation between some
more x , as well as in some of the y,. data, further weaken the informative
variable Y measured with precision U, and
They have responded to this by deriving value of the set with respect to the
a variable x , which can be measured with
equations that minimize, say, the per- “effects” (constant rates of change in the
negligible error, is known to be linear.
pendicular distances of the observed
points from the fitting plane. This plan
Thus: Y = bo + blx. The true value of
simplest case) of X I on y.
If the writer’s experience is typical,
the slope, 61,is not known. The value of then it can be said that the prospects of
is usually of little use, since it will have
parameters that are not scale invariant. x can be set a t any value inside some finding a good prediction equation by
Thus changing the scale of one of the x1 range, XI to X I I , and then y can be multiple-regression techniques by use
will give a different plane. Even using measured. Since each Y is measured of plant data are poor. However, ex-
five squares per unit for XI on the graph with uncertainty, the slope is also esti- ceptions have occurred and many hints
paper chosen instead of ten will then give mated with uncertainty. A’ measure- about the relations among a number of
a different predicting equation! A ments are to be made, and they may be factors have been garnered by this
sounder approach to such a problem spaced as the experimenter sees fit. The means. The testing of derived predic-
would start with the services of a uncertainty in slope is related to the un- tion equations and the using of the hints
statistician who is familiar with the certainty in the y valucs by the equation just mentioned are both carried forward
“identifiability problem” and with so- most effectively by the methods to be
called multivariate analysis. U*P = u”/s(x - Q)Z (4) described.

VOL. 48, NO. 9 SEPTEMBER 1956 1397


Figure 1 a. Region of permissible Figure 1 b. “Classical” experiment Figure 1 c. Experiment covering
experimentation larger region of permissible experi-
mentation

These devices, most of them by no The claim made here is only that such where the constants 63, 61,and bs are to
means new, are discussed because so variation is necessary, not that it is be estimated from the data. I t is further
few chemical engineers are aware of their sufficient. assumed that there is only a certain
advantages. The improvements are Perhaps the simplest example of (known) range of X I and x2 in which ex-
aimed a t the deliberate elimination of the statistical design of a multifactor experi- perimentation is practicable. The ex-
two disadvantages of ‘.unplanned” data ment can be constructed to answer the perimental conditions (but not the results
just outlined. Data will be taken, then, (rhetorical) question, if we vary more of the experiments) can be plotted on
where they d o the most good for detecting than one factor a t a time, say XI and x ? , an XI, x2 grid (Figure la), inside or on the
influences or effects and so that there hoxv can we tell Lvhich produced the practicability boundaries.
are no correlations (or minimal ones) be- observed variation in J ? It is curious The “classicist,“ varying only one
tween the independent variables. that this question \vas not raised in factor a t a time, will d o the threr
Balanced Schedules of Experimental connection with the gleaning of in- runs indicated by the three dots in
Runs. Deliberate variation of factors formation from planr data since it is Figure I b . Comparing this “design”
(independent variables) is more costly only the rarest of accidents to find single with that indicated in Figure I C suggests
than is the unselective taking of data. and independent variation of two factors that the latter might be somewhat better
This commonplace observation should under normal plant operation. 41- since it seems to span the available ex-
he followed by the more important though the question asked above is perimental region a little better. Cal-
statement that it is only by the deliberate usually not meant to be answered, it is culation confirms this suspicion, thr
variation of factors that the engineer exactly in its careful answer that we find slope 6 1 being determined just as well b?
can become sure that observed correla- new gains in the power of variable data the symmetrical design, but 6 2 being de-
tions are in fact in causal relation to one to give information about effects. termined with somewhat better pre-
another. The more experienced and Suppose that the “response surface” cision-namely, with three fourths as
pessimistic operating engineer would is a plane to a close enough approxima- large a variance. This gain, though
want to add several more comments, one tion over the practicable range of varia- small, becomes proportionately greatei
being that even after such deliberate tion of 1 1 and xz. The equation is then as more factors, and more constants.
variation we are often not sure enough. 1- = 6n + 61.~1 + 62x2 (5) are included in the prediction equation.
It need not be overemphasized that
the three points in the second design are

with a more fully balanced design.


I Figure 2a gives relative positions of the
I I points in “factor space” for the former
design; Figure 2b for the latter. The

kx2
/ y3
-----
J
/
/ kx27 ----- --_
4
crampedness that was only just notice-
able in the two-factor case is here rathei
painful. No one would claim that the
four points of Figure 2a cover the per-

1 398 INDUSTRIAL AND ENGINEERING CHEMISTRY


face of the cube. Such a difference has
in it a component of uncertainty from
each measurement. If y precision: u.
is constant a t all x , : then the standard
8
error of the difference 0 . 3 - y l ) is 4%.
Less obvious is the fact that the J dif-
ference in Figure 2b will he measured by
the difference betlveen the average of
the txvo y values a t the higher level of X I
and the average of the tivo a t the lower
level. Everyone feels that the average
of a pair of independent measurements is
more reliable than a single number.
The standard error of the relevant dif- A
ference here, namely of 0 3 + J,) 2 -
+
(y1 12) ;2>is U: which is notably smaller
than the d2ufound above. Figure 3a. 2 ? factorial design Figure 3b. Five point design
The same advantage holds for the
estimates o f t h e effects of var!-ing .Y? and
~ 3 . Put another xvay, the four points of
Figure 2a xvould have to be run in dupli-
cate. making eight runs in all: to gain
as precise results as the four points of
Figure 2b. The gain in economy of ef- B. If four measuremenrs are taken. nom‘ the biquadratic equation in x1 and
fort. a t least in those cases \\.here the cost the consistency of effect of d a t tivo levels x2:
of experimentation is heavily influenced
by the number of runs, increases steadily
of B can be judged. The four measure- I- = bo + 61x1 + 62x2 4-
with the number of factors to be varied.
ments required are indicated as the ver-
texes of a rectangle in Figure 3a. The
611x12 + b.nx? + hl?xl.rr (6)
Similar designs exist for any number of corresponding literal notation is given The coefficients b l and b ? are estimated
factors. They are exceptionally easy to beside each point. by the “main effects” of x1 and x t , each
write when the number of runs is a The design of Figure 3a is the first divided by the change in the respective
power of 2 and the number of factors is shoivn here of the type called factorial, independent variable. The coefficient
one less. Two (equivalent) nomencla- meaning that all levels chosen for each b12 is estimated by the correspondins
tures are given in Table I for the t\vo factor are rested at all levels of every interaction effect. The sum of the t\vo
designs just discussed. The three in-
dependent variables XI. .??, and .x3 are
other factor. I t is in fact the factorial pure quadratic coefficients-Le., bll +
design called the 22, the exponent mean- 6 2 2 , is estimated by the difference be-
here renamed A . B, and C. ing that two factors are being varied tween the average response of the four
and the base that each is a t t\vo levels. outside points and the response a t the
Such a design gives three results, each center. (For simplicity, the ranges of
using all the data and each being of maxi- variation of x1 and x 2 are here taken as
mum precision. The three results are unity.) If the response surface is simply
Table I. Numerical and Literal Nota- an estimate of the average effect of in- curved, being uniformly convex or uni-
tions for Multifactor Experiments creasing factor A (or 21) from the low formly concave over the region of experi-
Classical Design Balanced Desigri level chosen to the hither level; a simi- mentation, then the design given is effi-
One-at-a- Time Two-at-a- Time lar estimate for the effect of B,and finally cient. If the response is possibly saddle
\-il- .Yll- a judgment of the consistency, or addi- shaped in the region studied, then this
meii- mei l - tivity, of these ttvo effects. This meas- arrangement of points may fail since the
cal Liteial cal Litri a
I
ure of additivity (or of nonadditivity) is trde value of (b11 + 622) may then bc
Run ABC ABC found as the agreement (or otherwise) zero or very small, even though 611 and
1 000 (11 000 (11 of the tt5-o “effects” (ab - b ) and [a - bs2 are large.
2 100 a 011 bc ( l ) ] and so is given by combining the The five-point design just given is one
3
4
010
001
b
C
101
110
ac
ab
results of the runs as follows: [ab + that would rarely he used because the
(1) - a - b ] . If this quantity is of precision of the various effects and of
considerable magnitude, one then speaks their sum as assembled in Equation 6 is
of A and B as interacting. The effect of not good enough.
varying A from its low level to its high A few simple rules are needed to show
A listing of the designs for all numbers level is then not the same when B is low as how the precision of a quantity (let us
of factors up to 100 divisible by 4 has when B is high. The average effects of call it y ) measured with uncertainty is
been given by Plackett and Burman (77). A and B no longer have simple meaning. related to the precision of functions of
Box (4) has shokvn that all the inter- The effects of the two factors can no that quantity. In the first place, if tm’o
mediate designs also exist. They are longer be spoken of as additive. Regard- independent measurements of y are
called by Box multifactors designs of first ing the four y values obtained as points made, the variance (standard deviation
order. on a surface, there is some twist a t right squared) of the sum or difference of the
It is not always safe to speak about the angles to the direction of motion as one two measurements is the sum of the
“effect of factor A” without further proceeds from the two points at low B variances of the single measurements.
qualification as to the levels of other to the two a t high ,4. Thus the uncertainty (in variance units)
factors. It may well be that the effect Since only t\vo levels of A and of B of the total weight of some material that
of factor ‘4, by Tvhich is meant the ef- are available in the 22. no measure of must be weighed in two lots is the sum of
fect on some dependent variable of curvature in the direction of motion is the uncertainties (still in variance units)
changing factor A from one level. 0, to possible. But by adding one point, as of the Lveights of the tlvo parts. The
another level 1, is not the same at one in Figure 3 b , it is possible to get some net weight: estimated as the difference be-
level of factor B as it is at another level of idea of the “total” curvature. Consider tlveen a gross and a tare Lveighing, also

VOL. 48, NO. 9 0 SEPTEMBER 1956 1399


has a variance equal to the sum (not the duplicate assays are made on the results
difference) of the variances of the individ- of each run, and if, as can generally be
ual weighing. Generalizing to A‘ assumed, run variabirity is not corre-
From Equation 15 it is seen that the lated with assay variability, then the
independent measurements on y, y l ,
standard error of a difference between
y z , . . . y, . . . y N yields reliability of the average of all N,iZ:,
two equally precise means is 4 i i u v , measurements is given by the equation
rrhen N independent measurements in
all have been taken. This relation plays
a n important part in industrial experi-
mentation.
I t is the fashion among chemical en- Obviously there is a sharp limit to the
gineers to attribute to others the random improvement in precision to be attained
variation that appears in their data. by increasing N , only.
The assay laboratory cannot check its I t is a common occurrence that im-
where Var { } symbolizes the variance own results, the analytical chemists make provements in yield (or whatever other
of the quantity in curly braces. all kinds of errors or, turning in the dependent variable is being measured)
The y i need not all be measurements other direction, the raw material we have about as large as UR are of industrial im-
of the same true Y but are required only to use varies all over the place, no two portance. In planning experimental
to be independent measurements. If batches are ever the same? and so on. It work, then, it is well to be quite sure that
the variances of all the y , are the same, is quite feasible to get quantitative esti- effects as large as UR are detected and not
then the sum of the variances shown in mates of all these sources of variability by missed. To ensure this outcome, a
Equation 8 may be written as Nu: taking data in a balanced, nested ar- minimal requirement might be that the
where uv is the standard deviation of rangement. observed effect ( 7 2 - j 1 in the simplest
duplicate y measurements. As a simplified example, assume that cases) be assuredIy within f U ~ / 1 2of the
Since standard deviations have the two separate carloads of raw material true effect. If “assuredly” be translated
same units as the random variables to can be followed through the process, modestly “with 95% assurance,” then
which they refer, it comes as no surprise that two runs can be made on each car- the true value will assuredly be covered
to learn that the standard deviation of load, and that duplicate, but differently by an interval centered on the observed
CJ is cu,, for any constant c. Similarly, coded, samples from each run can be effect (D = 5 2 - j J and of length four
sent to the assay laboratory. Eight re- times the standard error of the observed
VUY ( c y ] = c2uy* (9) sults will be produced. The four dif- D. Thus
and in particular ferences between duplicate assays prop-
erly averaged (by averaging their
squares) can be used to estimate the assay
variance completely separated from the From this equation hiR,the total num-
where c has been set a t l/A’. other sources of variallility under study ber of runs required (half at each level
Equations 8 and 10 may be combined here. The two differences between the of the factor whose effect is being studied),
to give a n expression for the variance of averages of the pairs of runs from the emerges as 64.
the commonest of all linear functions of a same carload estimate run-to-run varia- Most engineers will shy at this number,
set of measurements, their average: bility with some contamination of assay feeling that if this were a correct line of
variability but with none from raw- reasoning hardly anything \vould ever
material carload-to-carload scatter. The have been discovered. I t should suffice
run variation can? by subtraction, be to remind them that effects of magni-
where the )I] are repeated independent estimated clear of assay variation. tude 2uR can be measured within XICUR
measurements ori y . Using Equations 8 Finally, the single difference between the with only 16 runs (eight under each
and 10 above gives two carload averages gives an estimate of condition of the independent varia-
the raw-material variability. T h e repe- ble) and that effects of the order of 4 u ~
tition of this whole cycle of “hierarchal” can be estimated with the same relative
measurements, including if necessary precision by four runs. I t is these factors
further splits, can be carried far enough to whose effects are discovered first: and
Equation 12, possibly the most im- specify, with desired precision, just what little or no statistical sophistication is re-
portant equation in our field! justifies and proportion of the total variability is being quired to interpret data obtained by
gives quantitative expression to the conlributed from each source. varying factors of such influence.
widely held feeling that a n average value But the purpose in criticizing here the But the principle of simultaneous and
is more reliable than a single measure- engineers’ attitude toward assay error is balanced variation of several factors can
ment. Taking the square root of both a different one. I t is to emphasize that now be brought to bear, with impressive
sides of Equation 12 yields most engineers do not have a clear idea gains in effectiveness per run.
of the order of magnitude of the error As a first step a full 26 factorial experi-
ment can be made in 64 runs (25 = 64)
standard deviation of their own results.
The other equation needed here gives and thus the average effects of six sepa-
I t is to emphasize, further, that without
the standard deviation (sometimes called rately variable factors can be estimated,
a n accurate estimate of the “run stand- each one as precisely as if the whole 64
the standard error) of the difference be- ard deviation” one knows neither how
tween two independent averages : runs were devoted to studying only the
many duplicate runs to make or hohv effect of a single factor. I t is worth
VUT ( j z - j l ] = VUT{ji) + VU^ (uz] (13) many duplicate assays to request. I t is
obvious that if the duplicate-run standard
adding that all the fifteen (6 X 5,’2) two-
factor interactions among six factors are
deviation (which we will call U R ) is much also estimable and one can see whether
larger than the duplicate-assay standard the “average effects” observed are in
Equation 14 follows from 13 when the deviation, u,, no gain in precision can fact roughly the same a t high and low
error in y l and in y2 measurements is the be expected by increasing the number of levels of other factors or whether they
same. If, as is often convenient, N 1 = duplicate assays. are quite disparate.
itT*= N / 2 , then If AT, duplicate runs are made and X,,, I t will have been noted that only 6 +
1 400 INDUSTRIAL AND ENGINEERING CHEMISTRY
15: or 21, conclusions have been drawn nevertheless suffice to pick out any very domization, invented by Fisher (as was
from the 64 runs imagined above. The influential factors in a smaller number of the principle of symmetry and balance or
remaining 38 conclusions can also be runs. T h e fractions of fractions are orthogonalization described above), must
drawn, but they estimate such abtruse 26-1-1 28-8 -2 211-4-3 and 215-7-3, requiring, often be carried through several times
properties of the data as 3-, 4-? 5-, and then 16, 16, 16, and 32 runs respec- independently in one experimental se-
6-factor interactions. tively. They can be easily derived by quence. As a n example, the material
Just as it was shown above to be pos- any statistician experienced in this field. produced in a set of randomized runs
sible to answer some questions about the I t will sometimes be practical to aug- should usually not be sent to the analyti-
effects of three factors in only one half ment the 2P-a fractional factorials with cal laboratory in that order, but rather
the 23, so it is also possible to fractionate some additional measurements taken a t should be held a t least until some major
the full factorials in still larger numbers the center of the design to permit a n fraction of the total is available and then
of factors without serious loss of informa- over-all estimation of the curvature of the sent in a new random order.
tion about main effects and t\vo-factor (p-dimensional) response surface. T h e A too-brief summary of the recom-
interactions. Indeed, if three-factor and contrast which measures over-all curva- mendations just made might run as fol-
higher order interactions can be assumed ture is the difference between the aver- lows. Block the experiment into sections
negligible (this assumption is usually a age of the “outside” points and the aver- judged more homogeneous than the
safe one), then eight factors and their 28 age response a t the center. If one third whole; orthogonalize (balance) inside
two-factor interactions can all be esti- of 2p-q points are taken a t the center, the the blocks with respect to all factors and
mated in 64 runs. Since 64 is one variance of the curvature contrast will two-factor interactions thought to influ-
fourth of 28, or 256, such a design is be the same as that of the effect con- ence the outcome; randomize with re-
called a “quarter replicate of a 28,’’ or trasts. A less precise judgment is not spect to all conditions which might influ-
more compactly, a P 2 . The actual likely to be useful; indeed, more precision ence the results but which cannot be
combinations of runs that ivould be will sometimes be required. controlled.
needed are given in (7, 8, 74) and in The augmentation of the two-level Returning now to our strictures against
many other places. They can be gen- factorials brings with it a further advan- the analysis of unplanned data, it will
erated immediately from the six sets of tage in that a n unbiased estimate of UZ be seen that the following points have
letters ab, adf, bc, de, fg, and ph. Each of can be obtained from the duplicated been covered.
these sets represents the conditions of a center points. This estimate can be
run. The product of each pair of these compared with that obtained from the a. Deliberate variation in random order
(dropping all squared letters) gives comparisons due to combinations of fac- of the conditions under which runs are
made guarantees that observed effects,
another f i n . I n fact, all possible prod- tors that can be assumed roughly equiva- if consistent, are due to the factors varied.
ucts of the six sets given, plus the “low lent. (Many such comparisons are to 6 . Simultaneous variation of several
set”-with all factors a t their low levels- be found in any large experiment. Their factors permits the use of each run in
give the 64 runs required. Thirty-six unsupported use as error estimates is drawing several conclusions.
conclusions can be drawn, each one using likely to err on the conservative side; 6. Balanced variation of the factors
all the data. that is, the error is likely to be somewhat permits adequate precision in estimat-
Other fractional replicates are of course overestimated.) ing the effect of each factor and removes
possible, the 2+’, 28-2, 211-4,and 2l5-7 Only one other experimental principle undesirable intercorrelations between
being the most efficient in the sense that derived from statistics awaits discussion ; factors.
d. Extreme variation of the factors,
each permits drawing about half as this is the principle of randomization. measurements of y being taken a t x
many conclusions as there are runs. Of the 64 runs in the 28-2 just discussed, values as far apart as possible, maximizes
Put another way, the four designs re- 32 are a t “low A” and 32 a t “high L4,’’ the sensitivity of the experiment com-
quire 32, 64, 128, and 256 runs respec- I t would be an elementary scientific error pared Lvith the relative insensitivity of
tively: if one of these numbers of runs to run off the 32 runs a t either level of A poorly placed, unplanned data.
has been chosen for precision reasons, first, simply on the grounds that such a e. Fractionating the fractional factorial
then the corresponding number of fac- grouping was convenient. If some fac- design alloivs early discovery of influen-
tors (6, 8, 11 or 15) should be marshaled tor not under control, e.g.. cooling-Mater tial factors, thus making redesign prac-
ticable lvhen necessary.
and built into the sequence of runs. Put temperature (c.w.t ), were to increase
a third way? if only seven factors are gradually during the sequence of runs, Questions of analysis and interpreta-
thought to be important, then 64 runs then obviously the runs a t one level of A tion are discussed elsewhere-e.g., in ( 9 ) .
must be made in any case. since the 27-1 would have been carried through a t a Suffice it to conclude this section with a
is the smallest fraction having all tbvo- higher cooling-water temperature than reassurance. The calculations required
factor interactions separately estimable. those a t the other level. If c.w.t. in to analyze even the largest of the designs
It would then be a pity not to add another fact has quite a n influence on the out- discussed above are much simpler than
factor, since the 28-2 also requires only 64 come. then the effect reported as “due the corresponding multiple regression cal-
runs.. to A” is actually “due to A + c.w.t.” culations and usually do not require the
The run specifications for all these de- There are a multitude of such uncon- use of computing machinery.
signs can be found in (7). Efficient trolled variables operative in most ex- In some of those cases in which curva-
methods of analysis and interpretation perimental situations. I t is imperative ture is detected, the development en-
can be found in (2, 9, 74). to evade as far as possible the effects not gineer will want to study the response
Even though factors are being varied only of linear trends in these uncontrolled surface in more detail, just in the region
over ranges that are expected to produce factors, but of any other changes or com- already covered. O n the other hand,
effects of magnitude uR, it will of course binations of changes. This neutraliza- the responses found may suggest to him,
happen that some factor or factors may tion can only take the form of spreading by their magnitudes and directions, that
produce much larger effects. I t will be the trends as equally as possible across all further tests should be run a t new values
necessary to wait until all N R runs are in the levels of the factors being varied. of the independent variables. In the
to find this out if the designs just given are This can be carried through by doing the latter case rough approximations to
used in toto. But balanced fractions of runs in a n objectively randomized order. steepest ascent (or descent) directions will
these fractions are available which, while Tables of random digits available in be desirable. I n the former case effi-
they measure each main effect along many statistical texts (9, 70)are used for cient designs for studying curvatures will
with several two-factor interactions, d o this purpose. The practice of ran- be wanted. One scheme for accomplish-

VOL. 48, NO. 9 * SEPTEMBER 1956 1401


ing these ends is that developed by Box the largest with the smallest (8:page 68). applied to chemical engineering problems
and his associates (3,6 ) . I t is compactly In fact, the t value calculated from the is quite large. The best immediate
described in ( 9 ) .of which Box is coauthor. two extreme means of three will exceed prospect is that some engineers will
the tabled 0.05 value 13% of the time study this body of research and will decide
when there are no true differences. This to use some of it. First-rate texts are
Improvements in the Art of mistake is of course made \verse \vhen the available, the journal literature is grow-
Statistical Inference
extremes of a still larger set are chosen. ing, and more and more research statis-
Tests of Significance and Confidence Thus for six means? use of the 0.05 t-value ticians are becoming interested in
Intervals. T h e familiar reporting of on the largest versus the smallest mean chemical engineering problems.
some result of experimentation as “very will give apparent significance 40y0 of
highly significant” has probably misled the time when no difference exists,
more engineering readers than has any Tukey has shown (21) how to use Literature Cited
other single piece of statistical jargon. more recently derived multipliers that
If the statistician had written instead will operate with the desired over-all (1) Bechhofer. R. E.; Sobel, X I . . Dunnett,
C . LV., Bionietrzka 41, 170 (1954).
that “as divergent a result as this would probability of error and give narroiv- (2) Bennett, C. A.: Franklin, N. L.,
only very rarely be observed if there \cere est possible intervals when only pair- ”Statistical Analysis in Chemistry
no real effect,” then the engineer reader wise comparisons are to be made. and the Chemical Industry,” Wiley,
Lvould be less impressed. T h e engineer The required multipliers, called “student- New York. 1954.
( 3 ) Box, G. E. P.: Biomdrics 10, 16 (1954).
would then probably further translate ized ranges” are tabulated in Dixon ( 4 ) Box, G. F,. P., Biometrika 39, 49
the statement as, “ H e means that the and Massey‘s text ( 7 0 ) , in Bennett (1952).
true value is very likely on this (or that) and Franklin (2). and in Pearson and ( 5 ) Box? G. E. P.: Hay, FY. A.: Biometrics
Hartley‘s tables (75). 9. 304 11953).
side of zero.” I n this form the conclu- ( 6 ) Box; G. E. P.. iVilson, K. B.! J. Roy.
sion is hardly ne\vs. and rarely useful. \\‘hen other comparisons are also to Statistical Soc. B13, 1 (1951).
Indeed it had better be admitted he made-for example, in a set of seven ( 7 ) Brownlee, K. A.: Kelley. B. K.,
straight off that tests of significance, while means. it may be that the loivest four Loraine, P. K., BiometriXa 35, 268
representing an important stage in the are to he compared with the upper t\vo- 11948).
( 8 ) Cochran. W. G.? Cox, G., “Ex-
recent history of statistics, correspond to then the studentized range does not give perimental Designs,‘’ LViley, Sew
a quite primitive state of the engineering the shortest intervals (though it may be York, 1950.
field to which the statistics are being used a t the given level of confidence). ( 9 ) Davies, 0. L., ed., “Design and Anal-
Many alternatives appear. depending on ysis of Industrial Experiments,”
applied. Hafner, New York, 1954.
Kearly always the chemical engineer how one wants to distribute one‘s errors, (10) Dixon. W. J., Massey, F., “Intro-
would like to knoiv uithin what range of on ho\v much freedom is demanded to duction to Statistical Analysis,”
value he can be quite sure: on the basis mull over the data after they are col- McGraw-Hill, New York. 1951.
lected, and on hoxv much calculation (11) Fisher, R. A.; “Contributions tu
of these data, that the true value lies. hlathematical Statistics,” Wiley,
Since many significance tests (or tests of one is willing to carry out. A mono- New York, 1950.
hypotheses) can be recast in this form. graph in this whole matter by Tukey is in (12) Fisher, R. X.: “Design of Experi-
and since actually much more informa- press (27). A paper by H . Scheff6 (7s) ments,“ 6th ed., Oliver and Boyd,
tion is given by this means, the so-called compares Tukey’s range method for London, 1951.
(13) Hald, A , : “Statistical Theory with
“estimation by confidence interval” pro- pairs with the (infinite) set of contrasts Engineering Applications,” Wiley,
cedures are in general recommended. that is equivalent to the usual “analysis- Xew York, 1952.
More information is given by saying, for of-variance test.” (14) Kempthorne, O., ”Design and Anal-
Another approach to the multiple- ysis of Experiments,” FViley, New
example, “The true effect on yield of York, 1952.
varying the temperature from T I to Tn comparison problem is exemplified in (15) Pearson, E. S., Hartley, H. 0..
lies, with 99% certainty, in the range 0.2 the recent work of Bechhofer. Sobel, and “Biometrika Tables for Statisti-
to 4.0,” than is given by saying, “There Dunnett (7). Picking the largest of I: cians.” vol. 1, Cambridge L. P.,
is a highly significant positive effect of varieties, each measured with uncertainty New York. 1934.
Perry, J. H., “Chemical Engineer’s
raising the temperature from T I to T2.“ is easier the greater the distance betiveen Handbook.” 3rd ed., McGraw-Hill,
Multiple Comparisons. The stand- the largest and its closest contender. New York, 1950.
ard tests of significance and confidence- Bechhofer and coworkers have shown Plackett. R. L., Burman. J. P.,
Biomeirika 33, 405 (1946).
interval statements are derived in such hoiv to decide the number of measure- (18) Rao, C. R., “Advanced Statistical
a way that the long-run relative fre- ments that must he taken under each of Methods in Biometric Research,”
quency of their giving incorrect results- the I.; conditions, so as to guarantee with Wiley, New York, 1952.
Le., of reporting significant differences (19) Scheffe: H., Biometrika 40, 87 (1953).
a predetermined probability that rvhen (20) Sherwood, T. K., Reed, C. E.,
among a set of I; means when in fact the the largest has some predetermined de- “Applied Mathematics in Chemical
true means are identical, has the stated gree of superiority. it will be in fact Engineering,” McGraw-Hill. New
value provided only one significant chosen. York, 1939.
judgment or confidence-interval state- (21) Tukey, J. W., “The Problem of Mul-
Prospects. T h e field of research in tiple Comparisons,” in press.
ment is made by use of the particular (22) Walker, W. H., Lewis, M’. K.,
problems of chance variation is groiving
average values needed for that test. McAdams, W. H.: Gilliland,
rapidly. T h e relations between sub-
T h e average greedy engineer can E. R.? “Principles of Chemical
hardly be content with one use of each fields are becoming somelvhat clearer. EnTineering,” 3rd ed. McGraw-
Research in decision-function theory. Hill, New York, 1937.
average value: he will want a t least to (23) Wilson, E. B., Jr., “Introduction to
make all possible pair-wise comparisons in sequential-approximation procedures,
Scientific Research,” McGraw-Hill.
among, say, K means. in time-series spectral analysis: in sta- New York, 1952.
T h e resulting K ( K - 1)/2 comparisons tistical linear programming, in multi- (24) Yates, F.. Commonwealth Bur. Soil Sci.
variate analysis. and in other areas is Tech. Commun. No. 35 (1937).
are by no means independent, nor does (25) Youden. W. J., “Statistical Methods
making all of them conform to the occasionally seen to approach applica- for Chemists,” Wiley, New York,
derivation of the significance tests men- bility. S o space need be taken here to 1951.
tioned above. Even when only three summarize or judge this lvork. The (26) Youden. W. J.: Science 120, 627
average values are produced, say as the greatest immediate prospects are, in the (1954).
result of testing one factor at three levels, writer’s judgment, elsewhere. T h e back- RECEIVEDfor review hlarch 6, 1956
it is not safe to use the t test for comparing log of statistical knowledge ready to be ACCEPTEDMay 8, 1956

1 402 INDUSTRIAL AND ENGINEERING CHEMISTRY

You might also like