Professional Documents
Culture Documents
ESGC6110 - Lecture 5 Multifactor Designs (Chapter 8, PG 221)
ESGC6110 - Lecture 5 Multifactor Designs (Chapter 8, PG 221)
ESGC6110 - Lecture 5 Multifactor Designs (Chapter 8, PG 221)
5.1 Introduction
When more than 2 factors are under study, the number of possible treatment
combinations grows exponentially. E.g. 3 factors at 5 level each, there are
53 125 possible combinations. It is rare to carry out an experiment with 125
different treatment combination, because the management needed and the
money required would be great.
Managing the collection of data is also an issue. E.g. gathering sales data for a
supermarket, the assistant may forget to count how many particular items were
sold that day!
There are several ways to manage the size of experiments involving several
factors. To design both effectively and efficiently an experiment that has many
factors, we must make informed compromises.
For any of the 27 cell, the row designates the level of A, the column designates
the level of B, and the subscripts inside the cells designate the level of C. For
eg, the cell in the fourth row and third column has A at level 1, B at level 3 and
C at level 2.
A – row factor; B – column factor; C – “inside” factor. there is no advantage
or disadvantage to being a row, column or inside factor.
Table 5.2 is Latin Square design. It is balanced: each factor is at each level the
same number of times (three); each level of a factor is used in combination with
each other level of a factor the same number of times – once.
The balance in the set of nine treatment combination in Table 5.2 guarantees the
unbiasedness of the main effects of the factors.
- When we look at differences in the row means (i.e. the mean for each
level of A), row one includes exactly one data value at each level of B,
and exactly one data value at each level of C. The same holds for rows 2
& 3. each row mean is on equal footing with respect to the levels of
factors B & C.
- The differences among the row means can legitimately be examined in
the traditional F-test, ANOVA way.
Table 5.3: data values at A1 all had factor C at C1 , and all date values at A2 all
had factor C at C2 and so on, row differences could not be attributed solely on
the impact of factor A. Table 5.3 is poor design even though it does provide an
unbiased evaluation of the effect of factor B.
Table 5.3: differences in the row means can be due to the level of factor A or to
the level of factor C; the two effects cannot be separated. When the impact of
one factor can’t be separated from that of a second, the effects are confounded.
For each level of each factor to be on equal footing in a design (Table 5.2), we
must be willing to assume that there is no interaction among the factors.
If there is interaction among the factors, the value of the main effects
determined may not be valid. A1 included a data value that had levels (B1, C1),
but A2 and A3 did not include such value. It (B1, C1) combination greatly
increases the yield beyond the average effect of B1 alone plus the average effect
of C1 alone, then the mean of row one may be much higher than the other row
means solely due to including the (B1, C1) combination, and not due at all to the
levels of A being A1.
However, the latin square design of three factors at three levels each, with only
9 combinations – the same number needed for just 2 factors at three level each –
1/3 replicate, only run 9 out of 27 combinations – cut costs of running the
experiment.
Yijk i j k ijk
i,j and k take on values 1,2,3,…,m. i.e. we have 3 factors, each at m levels.
2
Error By subtraction (m-1)(m-2)
Total Y... )
2 m2-1
i j k (Yijk ..
None of the 3 factors are significant at = 0.05. This indicates that service
policies, hours open, and amenity levels do not affect sales much. However,
amenities, with a p-value of 0.099 would be significant at = 0.10.
If the Latin-square design has replication, the model is similar, except that there
is an additional, explicit term for error.
ANOVA Table for m-level Latin Square with n replicates
nm2 – 3m + 2
Total Y... )
2 nm2-1
i j k (Yijk ..
Factors:
- Cost of the valet-parking service (row)
- Quantity of handicapped parking spaces in the parking lot closest to the
clinic entrance (column)
- Number of valet-parking attendants on duty (inside factor, indirect
indicator of waiting time)
The most significant factor is the number of handicapped spaces; as this number
decreases from ten to four, the average demand increases. As the number of
attendants increases from one to four, the average demand increases from 40.5
to 48.5, to 51.75, to 66.25. The results indicate the importance of the number of
handicapped spaces as well as of the number of valet parking attendants.
Table
Number of handicapped Number of Attendants
space
4 6 8 10 1 2 3 4
Mean number of 87.5 59 31 29.5 40.5 48.5 51.75 66.25
patients using
valet parking
The same holds for factor B, C and D – all assuming that there are no
interaction effects among the factors.
If m=3, (m+1)=4. In the Table 5.3.1, the nine treatment combinations represent
a one-ninth replicate of the possible 34 = 81 treatment combinations.
The flaws of a complete Graeco-Latin Square design: All the df have been used
to estimate the effects and none are left, in an unreplicated design, to allow the
assessment of error. That is, if there are (m+1) factors each “using up” (m-1) df,
all (m2-1) df are utilized; the SSW would come out zero, and the MSW would
be 0 0 (its df), or, an “indeterminate form”, not possible to perform significance
testing.
The ANOVA table illustrates the testing of the four hypotheses related to the
significance of each of factors A, B, C, D. The value of SSW is determined by
subtracting the other SSB terms from the TSS. The SSB for each factor has 4
df, corresponding to 5 levels.
Exercises: