Professional Documents
Culture Documents
Sampling Theory
Sampling Theory
Sampling Theory
Unit - I
Fundamental principles of experiments – randomization, replication and local control. Size of
experimental units. Analysis of variance- one-way and two-way classifications.
Unit - II
Analysis of Variance and Basic Designs: Concept of Cochran’s Theorem. Completely randomized
design(CRD)- Randomized block design(RBD) - Latin square design(LSD) and their analysis -
Missing plot techniques in RBD and LSD.
Unit - III
Post ANOVA Tests: Multiple range test; Newman-Keul’s test-Duncan’s multiple range test-
Tukey’s test. Analysis of Covariance technique for RBD with one concomitant variable.
Unit - IV
Factorial experiments: 22, 23 and 2n factorial experiments. Definitions and their analyses.
Unit - V
Principles of confounding –partial and complete confounding in 23– balanced incomplete block
design(BIBD)– parametric relationship of BIBD.
REFERENCE BOOKS:
1. Das, M.N. and Giri,N.C. (1988) Design and Analysis of Experiments(2 nd Edition).
New Age International, New Delhi.
2. Douglas,C. Montgomery(2012) Design and Analysis of Experiemnts. John Wiley &
sons, New York.
3. Goon A. M., Gupta, S.C. and Dasgupta, (2002)B. Fundamentals of Statistics, Vol.II,
World Press, Kolkata.
4. Gupta, S. C. and V. K. Kapoor (1999) Fundamentals of Applied Statistics (Third
Edition), Sultan Chand & Sons, New Delhi.
5. Dean, A and Voss (2006) Design and Analysis of Experiments. Springer India Private
Limited, New Delhi.
1
Unit -I
DESIGN 0F EXPERIMENTS
1.1 Introduction
In 1935 sir Ronald A. Fisher laid the foundation for the subject which
has come to be known by the title of his book „ The Design of Experiments‟.
Since then the theory of experimental design has been considerably developed
and extended. Applications of this theory are found today laboratories and
research in natural sciences, engineering, and nearly all branches of social
science.
Definition
2
v) Hypothesis to be tested.
3) Analysis phase,
i) Data collection and processing.
ii) Computation of test statistics.
iii) Interpretation of results for the experiment.
1.2 Definitions:
1. Experiment
An experiment is a device or a means of getting an answer to the
problem under consideration.
Experiment can be classified into two categories;
i) Absolute
ii) Comparative
i) Absolute experiment
Absolute experiments consist in determining the absolute value of
some characteristics like,
a) Obtaining average intelligence quotient (I.Q) of a group of people.
b) Finding the correlation co-efficient between two variables in a bivariate
distribution etc.
3
i) In field experiments the plot of land is the experimental unit. In other
experiments, unit may be a patient in a hospital, a lump of dough or a batch of
seeds.
4. Blocks
In agricultural experiments, most of the times we divide the whole
experimental unit (field) into relatively homogeneous sub groups or strata.
These strata which are more uniform amongst themselves than the field as a
whole are known as blocks.
5. Yield
The measurement of the variable under study on different
experimental units are termed as yields.
6. Experimental error
Let us suppose that a large homogeneous field is divided into
different plots (of equal shape and size) and different treatments are applied to
these plots. If the yields from some of the treatments are more than those of
others, the experimenter is faced with the problem of deciding if the observed
differences are really due to treatment effects or they are due to chance
(uncontrolled) factors. In field experimentation, it is a common experience that
the fertility gradient of the soil does not follow any systematic pattern but
behaves in an erratic fashion. Experience tells us that even if the same is used
in all the plots, the yields would still vary due to the differences in soil fertility.
Such variation from plot to plot, which is due to random factors beyond human
control, is spoken of as experimental error.
7. Replication
Replication means the execution of an treatments more than once.
In other words, the repetition of treatments under investigation is known as
replication.
8. Precision
The reciprocal of the variance of the mean is termed as the precision.
Thus for an experiment replicated r times is given by.
1
r 2
var( x )
Where σ2 is the error variance per unit.
9. Efficiency of a Design
Consider the designs D1 and D2 with error variances per unit 12
and 22 and replications r1 and r2 respectively. Then the variance of the
difference between two treatment means is given by
2 12 r1 and 2 22 r2 for D1 and D2 respectively. Then the ratio
4
2 22 r r r
E 1 2 12 22 is termed as efficiency of design D1 w.r.t D2.
r2 2 1 1 2
10. Uniformity Trials
The fertility of the soil does not increase or decrease uniformity in
any direction but is distributed over the entire field in an erratic manner.
Uniformity trails enable us to have an idea about the fertility variation of the
field. By uniformity trail, we mean a trail in which the field (experimental
material) is divided into small units (plots) and the same treatment is applied
on each of the units and their yields are recorded.
5
We have seen that replication will provide an estimate of experimental
error. For valid conclusions about our experimental results, we should have
not merely an estimate of experimental error but it should be an unbiased
estimate. Also, if our conclusions are to be valid, the treatment means and also
differences among treatment means should be estimated without any bias. For
the purpose we use the technique of randomization.
When all the treatments have equal chances of being allocated to
different experimental units it is known as randomization.
The following are the main objectives of randomization.
i) The validity of the statistical test of the Significance.
i.e.) t-test for testing the significance of the difference of two means. F-
test for testing the homogeneity of variance.
ii) The purpose of randomness is to assure that the source of variation,
not controlled in the experiment operate randomly. Randomization eliminates
bias in any form.
3) Local control
We know that the estimate of experimental error is based on the
variations from experimental unit to experimental unit. In other words, the
error in an experiment is a measure of “ within block” variation. This suggests
that if we group the homogeneous experimental units into blocks, the
experimental error will be reduced considerably. If the experimental material,
say field for agriculture experimentation is heterogeneous and different
treatment are allocated to various units at random over the entire field the soil
heterogeneous will also enter the uncontrolled factors and thus increase the
experimented error. It is desirable to reduce the experimental error as for as
practicable without unduly increasing the number of replications, so that even
smaller difference between treatments can be detected as significant.
The process of reducing the experimental error by dividing relatively
heterogeneous experimental area (field) into homogeneous blocks is known as
local control.
Remarks:
1. Local control, by reducing the experimental error, increases the efficiency
of the design.
2. Various forms of arranging the units(plots) into homogeneous
groups(blocks) have so far been evolved and are known as experimental
designs, e.g., Randomised Block Design, Latin Square Design etc.,
1.4 Analysis of Variance
6
inherent in nature. The total variation in any set of numerical data is due to
number of causes which may be classified as: (i) Assignable causes, and (ii)
Chance causes.
(ii) Parent population from which observations are taken is normal, and
7
Table 1.1: ONE-WAY CLASSFIED DATA
The total variation in the observation yij can be split into the following
two components:
(i) The variation between the classes or the variation due to different
bases of classification, commonly known as treatments.
(ii) The variation within the classes, i.e, the inherent variation of the
random variable within the observations of a class.
The firs type of variation is due to assignable causes which are beyond
the control of human hand.
Let yij denote the jth observations in the ith level of a factor A and let yij be
corresponding random variable. Let the mathematical model for one way
classified data
E(yij)= µ+ti
E ee y ij E y ij
2 2
ij
y ( ti )
2
= ij
ij
= y ij t i
2
ij
E ee
0
2 yij t i (1) 0
ij
y
ij
ij ti 0
yij t i 0
ij ij ij
9
y
ij
ij ti
ij ij
y
ij
ij G =Grand Total
G ni ni t i
i i
n
i
i n
G n ni t i …(1)
i
(ee)
0
t i
2 yij t i (1) 0
j
y
j
ij ti 0
yij t i 0
j j j
y
j
ij ti
j j
y
j
ij Ti
Ti ni ni t i …(2)
We assume that
n t
i
i i 0
G=nµ
G
̂
n
10
From equation (2)
Ti ni ni t i
G
Ti ni ni t i
n
G
Ti ni ni t i
n
Ti G
tˆi
ni n
E ee ( y ij ˆ tˆi ) 2
2
ij
( yij t i )( yij t i )
ij
yij2 ˆyij tˆi yij
ij
G 2 Ti 2 G 2
yij2
ij n i ni n
G 2 Ti 2 G 2
= yij
2
n n
ij n i i
11
Table 1.2: Anova Table for One –way Classified Data
Under the null hypothesis, H 0 t1 t 2 t k against the alternative that all t‟s
st2
are not equal, the test statistic F Fk 1, n k
s E2
Problem 1.1.
Strains of typhoid A B C D E
No. of mice, ni 10 6 8 11 5
12
Solution.
i.e.,
Ho:µ𝐴 = µ𝐵 = µ𝐶 = µ𝐷 =
µ𝐸 𝐻1 : 𝐴𝑡 𝑙𝑒𝑎𝑠𝑡 𝑡𝑤𝑜 𝑜𝑓 𝑡ℎ𝑒 𝑚𝑒𝑎𝑛𝑠 𝑎𝑟𝑒 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑡
ni ni
1
Also ⇒ si2
ni
yij2 yi2 ⇒ y 2
ij ni si2 yi2
j 1 j 1
𝐺 2 (482.2)2
𝐶. 𝐹 = = = 5,812.92
𝑁 40
Ti 2
C.F .
i ni
(109)2 (81)2 (92)2 (123.2)2 (77)2
= + + + + − 5812.92
10 6 8 11 5
13
𝐸𝑟𝑟𝑜𝑟 𝑆. 𝑆. = 𝑇. 𝑆. 𝑆. −𝑆. 𝑆. 𝑑𝑢𝑒 𝑡𝑜 𝑎𝑡𝑟𝑎𝑖𝑛 = 2,424.81 − 92.32 = 2,332.49
Total 39 2,424.81
14
For example, the yield of milk may be affected by differences in
treatments, (i.e, rations as well as the differences in variety, i.e., breed and
stock of the cows. Let us suppose that the n cows are divided into h different
groups or classes according to their breed and rations given at random to
cows in each group) on the yield of milk.
Let yij = [Yield of milk from the cow of jth breed or stock, fed on the ration i]; i=
1, 2, ..., k; j= 1, 2, ..., h
Note that the suffix i refers to the treatments (rations) and the suffix j
refers to the breed and stock of the cow. The yield can be expressed as
variable values in the following k × h two-way. One factor of variation, say,
varieties (breed and stock of cows) is represented along the columns and the
other factor of variation, say, treatments (rations) is represented along the
rows of the table.
15
Column T1. T.2 ... T.j ... T.h G=
Totals yij
Column
means
y.1 y.2 ... y.j ... y.h
=( yij ) 𝑘
i
Where yij is the yield of the (i,j)th element which is in the ith row and jth
column with p levels and q levels, where µ is the general mean effect
E eij yij i j
ij ij
E eij2 y ij i j …(1)
2
ij ij
E E E
0; 0; 0
i j
Differentiate w.r.to µ
2 yij i j (1) 0
ij
16
y
ij
ij i j 0
y
ij
ij
ij j i
i
ij
j 0
where y
ij
ij G
2 yij i j (1) 0
j
yij i j 0
j
= yij i j 0
j j j j
where yj
ij Ti
Ti q q i j …(3)
j
2 yij i j (1) 0
i
yij i j 0
i
= yij i j 0
i i i i
where y
i
ij Tj
T j p i p j … (4)
i
17
We assume that i
i 0,
j
j 0
G=pqµ
G
̂
pq
Ti = qµ+qαi
Ti -qµ = qαi
Ti q
ˆ i
q
Ti G
̂ i
q pq
T j p p j
Tj G
̂ j
p pq
E ( yij i j ) 2
ij
E ( yij i j )( yij i j )
ij
G T G Tj G
yij2 yij yij i yij
ij pq ij ij q pq ij p pq
18
Where yij
ij G, yij Ti , yij T j
j i
Ti 2 T j
2
G 2
G 2
G2
y ij
2 i j
ij pq q pq p pq
Under the null hypothesis H0t = t1=t2=…=tp against the alternative that all t‟s
are not equal the test statistic is :
M .S . A
F A ~ F p 1, p 1q 1
M .S .E
19
treatments differ significantly. If Ft is less than tabulated value then FA is not
significant and we conclude that the data do not provide any evidence against
the null hypothesis which may be accepted.
M .S .B
FB ~ Fq 1, p 1q 1
M .S .E
Problem 1.2.
Method
Analyst M1 M2 M3
Solution:
20
Here two factors of variation are, say,
Null hypothesis :
Alternative Hypothesis :
yi = response of the ith analysts and the jth methods (i = 1,2, … ,5 ; j = 1,2,3)
21.6 466.56
21
5 6.9 7.4 7.1
2,294.21
3,827.45
= 765.67
𝐺2 (107.1)2 11,470.41
Correction Factor = = =
𝑁 15 15
Ti.2 CF
1
S.S.A. = S.S. due to factor A (Analysis) = h
i
2,294.21
= − 764.694 = 764.737 − 764.694 = 0.043
3
T. 2j CF
1
S.S.B. = S.S. due to factor B (methods) = k
i
3,827.45
= − 764.694 = 765.49 − 764.694 = 0.796
5
22
S.S. due to Error (S.S.E.) = T.S.S. – S.S.A. – S.S.B. = 0.976 – 0.043 –
0.796 = 0.137
Factor B
(Methods) h–1=3–1=2 0.796 0.3980
0.3980
FA = = 23.27∗
Error 4×2=8 0.0171
OR 0.137 0.0171
14 - (4+2) = 8
Since the calculated value FA< 1, it is not significant and we fail to reject
HOA. Hence, there is no significant difference between the analysis.
23
Unit –II
Let X1, X2, ....,Xn, denote a random sample from normal population
𝑁 (0, 𝜎 ). Let the sum of the squares of these values be written in the form:
2
n
X i2 = Q1 + Q2 + ⋯ . +Qk
i 1
Where Qj is a quadratic from in X1, X2, ....,Xn, with rank (degrees of
freedom) rj: j= 1, 2, ...., k. Then the random variables Q1, Q2, ...., Qk are
mutually independent and 𝑄𝑗 𝜎 2 is 𝜒 2 -variate with rj degrees of freedom if and
only if
rj n .
j 1
For example
24
Let there be five treatments each to be replicated four times. There are,
therefore, 20 plots. Let these plots be numbered from 1 to 20 conveniently.
When a coin is tossed, there are two events, that is, either the head comes up,
or the tail. We denote the “head” by H and the “tail” by T.
Layout of CRD
1 2 3 4
A C A D
5 6 7 8
B D B D
9 10 11 12
C B C A
13 14 15 16
B D A C
Advantages of CRD
25
iii) This makes the design less efficient and results in less sensitivity in
detecting significant effects.
Applications
The model is
yij ( ti )
2
ij
(ee)
0
2 yij t i (1) 0
ij
2 yij t i 0
ij
y ti
0
0
2
ij
ij
Where y
i
ij G , G= Grand total
26
G ni ni t i … (1)
i i
(ee)
0
t i
y ti
0
0
2
ij
j
y t
j
ij
j j
i 0
Where y j
ij Ti
Ti ni ni t i 0 …(2)
n t
i
i i 0, n
i
i n
G nˆ 0
G
̂
n
Ti ni ̂ ni t i
Ti ni G
tˆi
ni ni n
Ti G
tˆi
ni n
27
Error Sum of Squares
E ee y ij t i
2 2
ij
y
ij
ij t i yij t i
y y
ij
ij ij t i other terms are vanished
y
ij
2
ij ˆ yij tˆ i yij
G T TG
= yij2 G i i
ij n i ni n
G 2 Ti 2 G 2
= ( y 2
n i ni n
ij
ij
Where y j
ij Ti
G2
Where is the correction factor
n
28
Error n-k By Er.S .S
MSSE=
subtraction nk
E.S.S=T.S.S-
Tr.S.S
Total n-1 G2
yij2
ij n
Problem 2.1 :
A 55 49 42 21 52 219
B 61 112 30 89 63 355
C 42 97 81 95 92 407
D 169 137169 85 154 714
Figures in antique in the Table are not given in the original data. They are a
part of the calculations for analysis.
Weight gain of baby chicks fed on different feeding materials composed of
tropical feed stuffs is given in Table.
29
Solution:
Null hypothesis, Hₒ: tA = tB = tC = tD
i.e., the treatment effects are same. In other words, all the treatments (A, B, C,
D) are alike as regards their effect on increase in weight.
Alternative hypothesis, H₁: At least two of tᵢ‟s are different.
Raw S.S. (R.S.S.) = yi j
2
ij = 55² + 49² +……..+ 85² + 154² = 1, 81,445
Test statistic: FT ̴ F(3,16), Tabulated Fₒ.ₒ₅ (3, 16) = 3.06. Hence FT is highly
significant and we rejected Hₒ at 5% level of significance and conclude that the
treatments A, B, C and D differ significantly.
Advantages of RBD
(i) Accuracy:
(ii) Flexibility:
30
In R.B.D no restriction are placed on the number of treatments
or the number of replicates. In general, at least two replicates are required to
carry out the test of significance (factorial design is an exception). In addition,
control (check) or some other treatments may be included more than once
without complications in the analysis.
Disadvantages of RBD
Layout of RBD: -
BlockI A B C D E
BlockII B C D E A
BlockIII C D E A B
BlockIV D E A B C
Lay out:
31
means total
y11 y12 .... y1r y1. T1.
y 21 y 22 .... y 2 r y 2. T2.
.. ... ... ... ... ..
yi1 yi 2 .... yij y ir yi. Ti.
... ... ... ... .... ....
... ... ... .... .... .....
y t1 yt 2 .... y tr y t . Tt .
means y .1 y.2 .... y.r
total T.1 T.2 .... T.r G
Let us assume that yij is the response of the yield of experiment unit from
ith treatment jth block.
The model is
y ij t i b j eij ; i 1, 2, , t ; j 1, 2,, r
Where yij is the response or the yield of the experimental unit receiving the ith
treatment in the jth block;
t r
Where µ , ti and bj are constants so that ti 0
i 1
and b
j 1
j 0
If we write y
i j
ij G Grand Total
y
j
ij Ti = Total for ith treatment
y
i
ij B j = Total for jth block
32
µ , ti and bjare estimated by the method of least squares
E eij2 y ij t i b j
2
… (1)
i j i j
E
0
2 ( yij t i b j ) 0
i j
0
( y
i j
ij ti b j )
2
0
y t
i j
ij
i j i j
i b j ) 0
i j
i
y
i j
ij tr r t i t b j 0
i j
Where y
i j
ij G
G tr r t i t b j 0 … (2)
i j
E
0
t i
2 ( yij t i b j )(1) 0
j
2 ( yij t i b j ) 0
j
0
(y
j
ij ti b j )
2
0
33
y t b ) 0
j
ij
j j
i
j
j
Where y j
ij Ti
Ti r rt i b j …(3)
j
E
0
b j
2 ( yij t i b j )(1) 0
i
2 ( yij t i b j ) 0
i
0
(y
i
ij ti b j )
2
0
y t b ) 0
i
ij
i i
i
i
j
Where y i
ij Bj
B j t ti tb j
i ….(4)
t r
ti 0
i 1
and b
j 1
j 0
G tr
G
ˆ
tr
34
Ti r̂ rt i
Ti r̂ rt i
Ti G
tˆi
r tr
B j tˆ tb j
Bj G ˆ
bj
t tr
E y ij t i b j
2
i j
E ( yij t i b j )( yij t i b j )
i j
G T G Bj G
y ij2 G y ij ( i ) y ij ( )
i j tr i j r tr i j t tr
Where yi j
ij G; y
j
ij Ti ; yij B j
i
Ti 2 Bj
2
G i G j
2 2
G2
= ( y ij
2
)
i j tr r tr t tr
G2
Where, correction factor=
tr
35
G2
Total Sum of Square = y 2
ij
i j tr
T i
2
G2
Treatment Sum of Square = ST2 = i
r tr
B 2
j
G2
Block Sum of Square = SB2=
j
t tr
Total rt-1
Under the null hypothesis H0t = t1=t2=…=ti against the alternative that all t‟s
are not equal the test statistic is :
S T2
FT ~ Ft 1,t 1r 1
S E2
36
S T2
FT 2 ~ Ft 1,t 1r 1
SE
Problem 3.3
Table 2.3
Solution:
Null hypothesis:
Alternative hypothesis:
H₁t: At least two τᵢ‟s are different. ; H₁b: At least two bᵢ‟s are different.
For finding the various S.S., we rearrange the above table as follows:
37
Table 2.4
Block
(Bᴊ)
Tᵢ²
14,161.00 14,376.01 14,908.41 4,212.01 4,395.69 9,273.69
Average
29.75 30.0 30.5 16.2 16.6 24.1
38
Block 3 219.43 s²B = 73.14 Fb = 73.14/15.31 = 4.7
Total 23 1.350.25
Tabulated F₃, ₁₅, (0.05) = 5.42 and F₅, ₁₅ (0.05) = 4.5 .Since under Hₒt Ft ̴ F (5,
15) and under Hₒb, Fb ̴ F (3, 15), we see that Ft is significant while Fb is not
significant at 5% level of significance. Hence, Ft is rejected at 5% level of
significance and we conclude that treatment effects are not alike. On the other
hand, Hb may be retained at 5% level of significance and we may conclude that
the blocks are homogeneous.
Let the observation yij = x (say) in the jth block and receiving the ith treatment be
missing, as given in table 3.7 Table 3.7
Treatments
1 2 … I … t
… …
where
y i . is total of known observations getting ith treatment
y. j is total of known observations in jth block and
39
y.. is total of all known observations
G 2 G x
2
Correction factor =
tr tr
ij
i j tr tr
2
y . i y i .. x
Sum of square due to treatment (S.S.Tr) = i
C.F i
C.F
r r
y y x
2 2
.j
C.F C.F
.j
Block (S.S.B) =
j
t r
i j r t
=
2
x 2 cons tan t terms independen t of x
G x 2 i.
y x
G x
2
y. j x 2 G x 2
tr r tr t tr
y x
2
tr r tr t tr
y x
2
y. j x G x
2 2
x 2 cons tan t terms independen t of x
i.
r t tr
40
S .S .E
0
x
2x 2
yi. x 2 y. j x 2 G x 0 0
r t tr 2
y x y x
x
i.
ij G x 0
r t tr
trx t ( yi. x) r y. j x G x
0
tr
trx tyi. tx ry . j rx (G x) 0
x(tr t r 1) tyi. ry . j G 0
x(tr t r 1) tyi. ry . j G
x((t 1)(r 1)) tyi. ry . j G
tyi. ry . j G
x
(r 1)(t 1)
Problem 3.3
Suppose that the value for treatment 2 is missing in replication III. The data
will then be as presented in the table below.
Replication
Treatment Total
I II III IV
41
2 29.5 30.4 X 29.6 89.5
= 397.7/12
= 33.1
= [135.1 – 4(33.1)]²/(5)(4)
= 7.29/20
= 0.3645
= 19946.9725 – 19425.1445
= 521.8280
42
= 521.4635
Source of variation
df SS MS F
Total 18 938.9610
2……. l…………m……………. K
1……
1 y₁₁ y₁₂ B₁
2 y₂₁ y₂₂ B₂
. .
. .
J x B’ᴊ+x
43
. .
. .
I y B’ᵢ+y
. .
. .
r Bᵣ
G 2 G x y
2
Correction factor =
tr tr
yij2
i j tr
x 2 y 2 cons tan t terms independent of x and y
tr
2
y . i y i .. x y
Sum of square due to treatment (S.S.Tr) = i
C.F i
C.F
r r
y y x
2
y
2
.j
C.F C.F
.j
Block (S.S.B) =
j
t r
S.S.E=T.S.S – S.S.Tr-S.S.B
x 2
y 2 cons tan t terms indepent of x and y C.F
2
= Ti x Tm B j x Bi
2 2 2
y y
C.F C.F
r r t t
44
T x
2
T y
2
2 i m
x y cons tan t terms indepent of x and y .C.F
2
r r
B x
2
B y
2
C.F C.F
j i
t t
T x T B x B
2 2 2 2
y y
G x y
2
x2 y2 m i
i j
...(1)
r r t t tr
.S .S .E
0
x
2 B j x
2x
2(Ti x)
2(G x y) 0
r t tr
B x
(T x) j
x i (G x y) 0 0
r t tr 2
xtr t Ti x r B j x G x y
0
tr
xtr t Ti x r ( B j x) G x y 0 tr 0
xtr tTi tx rB j rx G x y 0
x(tr t r 1) tTi rB j G y
45
tTi rB j G y
x
(t 1)(r 1)
.S.S.E
0
y
2 Bi y
2y
2(Tm y)
2(G x y) 0
r t tr
B y
(T y ) i
y m (G x y) 0 0
r t tr 2
ytr t Tm y r Bi y G x y
0
tr
ytr t Tm y r ( Bi y ) G x y 0 tr 0
ytr tTm ty rBi ry G x y 0
y(tr t r 1) tTm rBi G x
tTm rBi G x
y
(t 1)(r 1)
Problem 3.4
E C D B A Total
46
26 42 39 37 24 168
A D E C B
24 33 21 (X) 38 116
D B A E C
47 45 31 29 31 183
B A C D E
38 24 36 41 34 173
C E B A D
41 24 (X) 26 30 121
The means for second row and fourth column in which C is missing are
116/4 = 29.0 and 133/4 = 33.25, respectively. Hence the first estimate for C is
= 2030/12 – 1584.24/12
= 169.17 – 132.02
= 37.15
47
G₂ = 5(116 + 133 + 150) – 2(798.15)/12
= 1995/12 – 1596.3/12
= 166.25 – 133.03
= 33.22
G‟ = 761 + 33.22
= 794.22
B² = 169.17 – 2(794.3)/12
= 36.8
It can be seen that the estimated values for B are same and that for C are
very close. Hence we stop the iteration process at third cycle. The final
estimates for B and C for the missing plots are 36.8 and 33.3 respectively.
The column total, row total, etc., with respect to the missing plots are
modified by adding the estimated values. Thus we have,
CF = (831.1)²/25
= 27629.0884
48
Treatment SS = 28448.586 – CF = 819.4976
Error SS = 278.0288
Now ignoring the treatment classification the missing values are estimated
as in the case of RBD. The estimate of the second row, fourth column missing
value is 28.5; and that of fifth row, third column is 28.2. After substituting the
estimated values and analyzing the data as RBD, we get the error sum of
squares as 1031.5856. Then we have,
= 1031.5856 – 278.0288
= 753.5568
Source of
variation
df SS MS F
Total 22 1273.0416
LSD is defined for eliminating the variation of two factors called row and
column in this design. The number of treatments is equal to the number of
replications.
Layout of design
2x2 layouts
A B
B A
3x3 layouts
A B C
B C A
C A B
4x4 layouts
A B C D
B C D A
C D A B
D A B C
5X5 layouts
A B C D
E
B C D E
A
C D E A
B
50
D E A B
C
E A B C D
Example:
A Latin in which the treatments say A, B, C etc occur in the first row
and first column in alphabetical order is called standard Latin square.
Example:
A B
B A
Advantages of LSD
1. With two way grouping LSD controls more of the variation than
CRD or RBD.
2. The two way elimination of variation as a result of cross grouping
often results in small error mean sum of squares.
3. LSD is an incomplete 3-way layout. Its advantage over the complete
3-way layout is that instead of m³ experimental units only m² units
are needed. Thus, a 4x4 LSD results in saving of m³ = 4³ - 4² = 64-
16 = 48 observations over a complete 3-way layout.
4. The statistical analysis is simple though slightly complicated than
for RBD. Even 1 or 2 missing observations the analysis remains
relatively simple.
5. More than one factor can be investigated simultaneously.
Disadvantages of LSD
1. LSD is suitable for the number of treatments between 5 and 10 and
for more than 10 to 12 treatments the design is seldom used. Since
51
in that case, the square becomes too large and does not remain
homogeneous.
2. In case of missing plots the statistical analysis becomes quite
complex.
Let yijk (i, j, k=1,2,…,m)denote the response from the unit in the ith row, jth
column and receiving the kth treatment.
The model is
y ijk ri c j t k eijk ; i, j , k 1, 2, , m
Where µ is the constant mean effect; ri, cj and tk due to the ith row, jth column
and kth treatment respectively and eijk is error effect due to random component
assumed to be normally distributed with mean zero and variance
e2 i.e., eijk ~ N (0, e2 )
If we write
ijk
E E E E
0, 0, 0, 0
ri c j t k
52
Differentiate with respect to µ in equation (1)
E
2 yijk ri c j t k (1) 0
ijk
y
ijk
ijk ri c j t k 0
ijk ijk ijk ijk
Where y
ijk
ijk G , I,j,k=m2, I,j=m, I,k=m
G m 2 m ri m c j m t k 0 … (2)
i j k
E
2 yijk ri c j t k (1) 0
ri jk
y
jk
ijk ri c j t k 0
jk jk jk jk
Where y
jk
ijk Ri , I,j,k=m2, I,j=m, I,k=m
Ri m mri m c j m t k 0 … (3)
j k
E
2 y ijk ri c j t k (1) 0
c j ik
y
ik
ijk ri c j t k 0
ik ik ik ik
Where y
ik
ijk C j , I,j,k=m2, I,j=m, I,k=m
C j m m ri mc j m t k 0 … (4)
i k
E
2 yijk ri c j t k (1) 0
t k ij
53
y
ij
ijk ri c j t k 0
ij ij ij ij
Where y
ij
ijk Tk , I,j,k=m2, I,j=m, I,k=m
Tk m m ri m c j mt k 0 … (5)
i j
We assume that, r
i
i 0, c
j
j 0 and t
k
k 0
G m2
G m2
G
̂
m2
Ri m mri 0
Ri m̂ mri
Ri mG
rˆi
m m m2
Ri G
2 rˆi
m m
C j m mc j 0
C j m̂ mc j
Cj mG
cˆ j
m m.m 2
54
Cj G
cˆ j
m m2
Tk m mt k 0
Tk m̂ mt k
Tk mG ˆ
tk
m mm 2
Tk G
tˆk
m m2
E eijk ( yijk ri c j t k ) 2
2
ijk
= ( yijk ri c j t k )( yijk ri c j t k )
ijk
G R G Cj G T G
yijk2 y ijk yijk i 2 yijk 2 yijk k 2
ijk m2 ijk ijk m m ijk m m ijk m m
Ri2 C j Tk2
2
G i
2
G
2 G 2
k G2
( y ijk2 2 ) 2 2 2
j
ijk m m m m m m m
G2
Total Sum of Square = y 2
ijk 2
ijk m
R i
2
G2
Row Sum of Square = S R2 = i
m m2
55
C 2
j
G2
Column Sum of Square= S C2 =
j
m m2
T k
2
G2
Treatment Sum of Square= S T2 = k
m m2
Rows m-1 S R2 s R2 S R2 (m 1) FR s R2 s E2
Error (m-1)(m-2) S E2 s E2 S E2 (m 1) (m 2)
Total m2-1
Alternative Hypotheses
56
Similarly, we can test for H0c and H0t.
Problem 3
An experiment was carried out to determine the effect of claying the ground
on the field of barley grains; amount of clay used were as follows:
A: No clay
The yields were in plots of 8 meters by 8 meters and are given in table.
I D B C A 83.1
II C A D B 66.9
III A D B C 105.2
IV B C A D 105.0
Perform the ANOVA and calculate the critical difference for the treatment
mean yields.
57
Solution:
The four treatment totals are:
A: 30.8, B:86.9, C:124.5, D:118.0
Grand total G = 360,2, N = 16.
C.F. = (360.2)²/16 = 8109.0025
Raw S.S. = (29.1)² + (18.9)² +……..+ (9.5)² + (28.9)² = 10,052.08
Total S.S. = 10,052.08 – 8,109.0025 = 1,943.0775
S.S.R. = ¼ [(83.1)² + (66.9)² + (105.0)² + (105.0)²] – 8,109.0025
= 33,473.26/4 – 8,109.0025 = 259.3125
S.S.C. = ¼ [(75.8)² + (109.6)² + (84.1)² + (90.7)²] – 8,109.0025
= 33057.10/4 – 8109.0025 = 155.2725
S.S.T. = ¼ [(30.8)² + (86.9)² + (124.5)² + (118.0)²] – 8,109.0025
= 37924.50/4 – 8109.0025 = 1372.1225
Error S.S. = T.S.S. – S.S.R. – S.S.C. – S.S.T. = 156.3700
ANOVA TABLE FOR L.S.D.
Source of
variation S.S. Variance Ratio
(1) d.f. (3) M.S.S.
(4) = (3) ÷
(2) (2)
Rows 259.5375 86.4375 FR = 86.4375/26.0616 =
Columns 3 155.2725 51.7575 3.32<4.76
1,372.1225 457.3742 Fc = 51.7576/26.0616 = 1.98
Treatments 3 156.3700 26.0616 <4.76
Error FT = 457.3742/26.0616 =
3 17.55 > 4.76
6
Total 1,943.0775
15
Hence we conclude that the variation due to rows and columns is not
significant but the treatments, i.e., different levels of clay, have significant
effect on the yield.
58
2.5.2 One Missing observation in LSD
Let us suppose that in m×m Latin Square, the observation occurring in the ith
row , jth column and receiving the kth treatment is missing. Let us assume that
its value is x, i.e., yijk=x
G = grand total.
R x
G x 2 i G x
2
x 2 cons tan t terms independen t of x
m2 m m2
2
2
C j x 2 Tk x 2
G x
G x
m m2 m m2
R x C x
2 2
i
j
Tk x 2 G x 2
x cons tan t terms independen t of x
2
2 2
m m m m
Differentiate w. r.to x
R x
x
i
C j x Tk x 2G x 0 0
m m m m2 2
m Ri x mC x
mTk x 2G x
2
m2 x
0
j
2 2
m m m m2 m2
m 2 x mRi mx mC j mx mTk 2G 2 x 0
59
m 2 x mx mx mx 2 x mRi mC j mTk 2G
x(m 2 m m m 2) mRi mC j mTk 2G
x((m 1)(m 2) mRi mC j mTk 2G
m( Ri C j Tk 2G
x
(m 1)(m 2)
Unit -III
60
In this formula t is the critical (table) value of t for a specified level of
significance and error degrees of freedom rᵢ and rᴊ for the number of
replications for the ith and jth treatment respectively, the formula for t-test is
Yi Y j
t
1 1
S
ri r j
The two treatment means are declared significantly different at specified
level of significance.
If the difference exceeds the calculated CD value, otherwise they are not
significant CD value.
61
Present the results by using either the line notation (or) the alphabet notation
to indicate which treatment pair which are significantly different from each
other.
3.2.3 Tukey’s range test:
Tukey‟s range test is also known as Tukey‟s test, Tukey‟s HSD (Honest
significance difference) test. It can be used on raw data or in cons unction with
an ANOVA (post-hoc analysis) to find means that are significantly different
from each other. Tukey‟s test compares the means of every treatment to the
means of every other treatment.
The test statistic:
Tukey‟s test is based on a formula very similar to that of the t-test. In fact,
Tukey‟s test is essentially a t-test, except that is corrects for experiment wise
error rate.
Formula to,
Y YB
qs A
S.E
Where YA is a larger of the two means being compared. YB is the smaller of
the two means being compared. S.E is the standard error.
This qs value can then be compared to a q value from the studentized range
distribution. If the qs value is larger than the q critical value obtained from the
distribution. The two means are said to significantly different.
The studentized range distribution:
( y y min )
q max
S 2n
3.2.4 Student – Newman Keuls (SNK) test
The Newman-Keuls (or) student Newman Keuls (SNK) method is a stepwise
multiple comparison. Procedure used to identify sample means that are
significantly different from each other. It was named after student (1927) D.
Newman and M. Keuls,
Procedure:
1. The Newman Keuls method employs stepwise approach when comparing
sample means.
2. Prior to any mean comparison, all sample means are rank ordered in
ascending or descending order there by producing an ordered range (p) of
sample means.
3. A comparison is then made between the largest and smallest sample means
within the largest range.
4. Assuming that the largest range is four means (p=4) a significant difference
between the largest and smallest means as revealed by the Newman-Keuls
62
method would result in a reflection of the null hypothesis for that specific
range of means.
5. The next largest comparison of two sample means would then be made within
a smaller range of three means (p=3).
6. Continue this process until a final comparison is made.
7. It there is no significant difference between the two sample means. Then all
the null hypothesis within that range would be retained and no further
comparisons within smaller ranges are necessary.
63
2
…(3.2)
( SSE )
0 2 yij i j ( xij x.. )
i j
y x
i j
ij
i j i j
i
i j
j
i j
ij x.. 0
…(3.3)
( SSE )
0 2 ( yij i j ( xij x..)
i j
y ( x
j
ij
j j
i
j
j
j
ij x..) 0
y r ( x
j
ij
j
i
j
j
j
ij x..) 0 …(3.4)
( SSE )
0 2 ( yij i j ( xij x..)
j i
y ( x
i
ij
i i
i
i
j
i
ij x..) 0
y
i
ij
i i
i v j ( xij x..) 0
i
…(3.5)
( SSE )
2 [{ yij i j ( xij x.. )}( xij x.. )] …(3.6)
i j
64
From equation (3. 2)
y ij
ˆ y..
i j
rv
( xij x..) 0
i j
y
i
ij r ( ˆ ˆ i ) ˆ ( xij x.. ) 0
j
0
y i. ( y.. ˆ i ) ˆ .( xi. x.. ) 0
r
y
i
ij v v j vˆ ( x. j x.. ) 0
0
y. j ( j ) ˆ ( x. j x.. ) 0
v
y. j j ) ˆ ( x. j x.. ) 0
65
0 [{ yij y.. { yi. y.. ˆ ( xi. x.. )} { y. j y.. ˆ ( x. j x.. )} ˆ ( xij x )}(xij x.. )]
i j
0 [{( yij yi. y. j y.. ) ˆ ( xij xi. x. j x.. )}( xij x.. )
i j
[{( y
i j
ij yi. y. j y.. ) ˆ ( xij xi. x. j x.. )}
( y
i j
ij yi. . y. j y.. )( xij xi. x. j x.. ) ˆ ( xij xi. x. j x.. ) 2 0
i j
(the product terms will be zero since algebraic sum of deviations from mean is
zero)
( x
i j
ij xi. x. j x.. ) 2
Let us write:
E xy
Then, ˆ
E xx
Substituting the values of ˆ , ˆ i ,ˆ j and ˆ in (3.2), the unrestricted error sum of
squares for model (3.1) becomes:
2
i j i 1 j 1
66
[( yij yi. y. j y.. ) ˆ ( xij xi x. j x.. )]2 (on simplification)
i j
[( yij yi. y. j y.. ) 2 ˆ 2 ( xij xi x. j x.. ) 2 2ˆ ( xij xi. x. j x.. )( yij yi. y. j y.. )
i j i j i j
2
E xy E
E yy .E xx 2 xy .E xy
E xx E xx
E xy2
E yy E yy ̂E xy
E xx
E xy2
Error S. S for y in RBD
E xx
d.f for SSE = Total d.f.-(d.f. due to treatments) – (d.f. due to blocks) – (d.f. due
to β )
=(rv-1)-(v-1)-(r-1)-1 = (r-1)(v-1)-1
2
( SSE ) *
0 2 y ij i j ( xij x.. )
i j
67
y ij rv r i v j xij x.. 0 …(3.8)
i j i j i j
( SSE ) *
0 2 ( y ij i j ( xij x..)
i j
y ( x
j
ij
j j
i
j
j
j
ij x..) 0
y r ( x
j
ij
j
i
j
j
j
ij x..) 0 …(3.9)
( SSE )
0 2 ( yij i j ( xij x..)
j i
y ( x
i
ij
i i
i
i
j
i
ij x..) 0
y
i
ij
i i
i v j ( xij x..) 0
i
…(3.10)
( SSE ) *
2 [{ y ij i j ( xij x.. )}( xij x.. )] …(3.11)
i j
y ij
ˆ y..
i j
rv
( xij x..) 0
i j
68
y
i
ij r ( ˆ ˆ i ) ˆ ( xij x.. ) 0
j
0
y i. ( y.. ˆ i ) ˆ ( xi. x.. ) 0
r
y
i
ij v v j vˆ ( x. j x.. ) 0
0
y. j ( j ) ˆ ( x. j x.. ) 0
v
y. j j ˆ ( x. j x.. ) 0
[(x
i j
ij x.. ){ yij y.. ( y. j y.. ) ˆ ( x. j x.. ) ˆ ( xij x.. )}] 0
[ x
i j
ij x.. ){ y ij y. j ) ̂
69
[(x
i j
ij x. j x. j x.. ){( y ij y. j ) ˆ ( xij x. j )}] 0
( x
i j
ij x. j )( y ij y. j ) ˆ ( xij x. j ) 2 0, the other product terms are zero
( x ij x. j )( y ij y. j )
ˆ
i j
( x
i j
ij x. j ) 2
Let us define :
E xx ( xij x. j ) 2 ; E yy ( y ij y. j ) 2 ; E xy ( xij x. j )( y ij y. j )
i j i j i j
E xy
̂
E xx
Hence, under H0, the restricted error sum of squares is given by:
(SSE)* =minimum
value of error S. S
yij ˆ ˆ j ˆ ( xij x.. )
2
i j
( y y. j ) ˆ ( xij x. j )
2
ij
i j
( y
i j
ij y. j ) 2 ˆ 2 ( xij x. j ) 2 2 ˆ ( xij x. j )( yij y.. )
i j i j
E
2
E xy
E xx 2
xy
E yy ( E xy )
E E
xx xx
2
E xy
E yy … 3.12
E xx
=(vr-1)-(r-1)-1=vr-r-1=r(v-1)-1
70
(SST = St2)=(SSE)* -(SSE)
Where (SSE) and (SSE)* are given in (3.1) and (6.12) respectively.
=[r(v-1)-1]-[(r-1)(v-1)-1]=v-1
Hence the test statistic for testing H0: α1=α2= …=αv, is given by
SSyy = Eyy+Tyy+Byy
Where Tyy r ( yi. y.. ) 2 is the treatment S.S for y for RBD
i
Similarly we have,
SSxx=Exx+Txx+Bxx
SPxy=Exy+Txy+Bxy
Where Txy r ( xi. x.. )( yi. y.. ) and Bxy v( x. j x.. )( y. j y.. )
i j
71
E xy ( xij xi. x. j x.. )( yij yi. y. j y.. )
i j
Using the above notations, the above statistical analysis can be elegantly
expressed in the ANOCOVA table 3.1:
SSE- v-1
SSE*
72
Unit -IV
So far we have discussed the experiments in which the effects of a single set of
treatments were estimated and compared. In these experiments most of the
other variable factors were kept constant. In practice, the response of biological
organisms to the factor of interest is expected to differ under different levels of
other factors. For example, the yield of paddy varieties may differ under
different rates of fertilizer application, spacing and irrigation schedules. Hence,
in agricultural research we frequently wish to what happens with a range of
combination of factors. When several factors are investigated simultaneously in
a single experiment, such experiments are known as factorial experiments.
1. It increases the scope of the experiment and its inductive value and it
does so mainly by giving information not only on the main factors but on
their interactions.
2. The various levels of one factor constitute replications of other factors
and increase the amount of information obtained on all factors.
3. When there are no interactions, the factorial design gives the maximum
efficiency in the estimate of the effects.
4. When interactions exist, their nature being unknown a factorial design is
necessary to avoid misleading conclusions.
5. In the factorial design the effect of a factor is estimated at several levels
of other factors and the conclusions hold over a wide range of conditions.
Basic Ideas and Notations in the 2n Factorial Experiment
Let us first consider the design of the of the form 2n in which there are n
factors, each at two levels. Levels may be quite qualitative alternatives
like two species of a plant. In some cases one level is simply the control
group, ie., the absence of the factor and the other is its presence.
In order to develop extended notation to present the analysis of the
design in a concise form, let us start, for simplicity with a 22 – factorial
design.
Here we have two factors each at two levels (0,1), say, so that there are
2×2=4 treatment combinations in all. Following the notations due to
Yates, let the capital letters A and B indicates the names of the two
factors under study and let the small letters a and b denote one of the
73
two levels of each of the corresponding factors and this will be called the
second level. The first level of A and B is generally expressed by the
absence of the corresponding letter in the treatment combinations. The
four treatment combinations can be enumerated as follows:
a0b0 or 1 : Factors A and B, both at first level.
a1b0 or a : A at second level and B at first level
a0b1 or b : A at first level and B at second level
a1b1 or ab : A and B both at second level
Let [ab]: Total yield of the r units receiving the treatment ab.
[1]
(1): the mean yields of the r units receiving the treatment 1.
r
[ a]
(a): the mean yields of the r units receiving the treatment a.
r
[b]
(b): the mean yields of the r units receiving the treatment b.
r
[ ab]
(ab): the mean yields of the r units receiving the treatment ab.
r
74
The effect of A at the first level b0 of B = (a1b0 ) (a0 b0 )
= (a)-(1) --------> 1
= (ab)-(b) -------> 2
= (b)-(1) ------> 3
1
(2)+ (1) => A= [(ab)-(b) + (a)-(1)]
2
1
= [b (a-1) +1(a-1)]
2
1
= [(a-1) + (b+1)]
2
1
(4)+ (3) => B= [(b)-(1) + (ab)-(a)]
2
1
= [(b-1) +a (b-1)]
2
1
= [(a+1) (b-1)]
2
1
(2)- (1) => AB= [(ab)-(b)-{(a)-(1)}]
2
1
= [b (a-1)-1(a-1)]
2
75
1
= [(a-1) (b-1)] ---------- > 5
2
1
(4)- (3) => BA= [(ab)-(a)-{(b)-(1)}]
2
1
= [a (b-1)-1(b-1)]
2
1
= [(a-1) (b-1)] ----------- > 6
2
The equation (5) and (6) are same hence the interaction effect AB as same as
BA.
The sum of squares due to any Factorial effect is obtained by multiplying the
1
square of the effect total by the Factor . Where r is the common replicate
4r
[ A] 2
number sum of square due to main effect of A= = S A2
4r
[ B]2
Similarly S.S due to main effects of B = = S B2
4r
76
[ AB] 2
S.S due to interaction effect of AB= = S AB
2
4r
G2
Correction factor (C.F) =
N
Block S.S =
B 2j C.F S 2
R
4
S.S Treatment =
Ti 2
C.F S 2 t
r
Block S R2 r-1 S R2 S R2
S R2 FR
r 1 S E2
Treatment S t2 3 S t2 S t2
S t2 Ft
3 S E2
Main Factor S A2 1 S A2 S A2
A S A2 FA
1 S E2
Main Factor S B2 1 S B2 S B2
B S B2 FB
1 S E2
77
Interaction 2
S AB 1 2
S AB 2
S AB
effect AB S AB
2
FAB
1 S E2
Error S E2 3(r-1) S E2
S E2
3( r 1)
( 2 2 -1)(r-1)
Total S T2 4r-1
( 2 2 r-1)
Table value:
Conclusion:
(i) FR < FR* we need not reject and we conclude that there is no
significance difference between replicates.
(ii) Ft < Ft* we need not reject and we conclude that there is no
significance difference between Treatment.
78
(iv) FB < FB* we need not reject and we conclude that there is no
significance difference main effect B.
(v) FAB < F * we need not reject and we conclude that there is no
significance difference interaction effect AB.
Problem 1:
Find out ,the effect and interaction effect in the following 2 Factorial
experiment and write dawn the analysis of variance table;-
00 10 01 11
I 64 25 30 10
II 25 14 50 33
79
III 76 12 41 17
IV 75 33 25 10
Solution:
Null hypothesis H ob :
(108) 2
b 146 -156 -108 729
16
(80) 2
400
ab 70 -76 80 16
G 2 (540) 2
C.F = 18225
N 16
80
C.F = 18225
Total sum of square = yij C..F (64) 2 (25) 2 ......(25) 2 (10) 2 C..F
ijs
= 25460- 18225
= 7235
73290
= 18225
4
S.SB = 97.5
S.S. Tr = 4493
= 7235- (97.5+4493)
S.S.E = 2633.5
81
Main effect B 729 1 729 2.48 FB
Total 7235 15 - -
F- Table value:
Conclusion:
(v) FAB FB* We need not reject H0 and we conclude that there is no
significance difference between interaction effect AB.
83
b0 c1 a1b0c1 - a0b0c1 = (ac)-(c)
b1
c1 a1b1c1 - a0b1c1 = (abc)-
(bc)
Simple effect of A
S.E. of A =
1
a 1b 1c 1
4
Simple effect of B:
84
a1
Simple effect of B
S.E. of B =
1
b 1a 1c 1
4
85
a0 b0 a0b0c1 - a0b0c0 = (c)-(1)
a1
b1 a1b1c1 - a1b1c0 = (abc)-(ab)
Simple effect of C
S.E. of C =
1
a 1b 1c 1
4
AB =
1
abc bc ab b ac c a 1
4
86
=
1
abc bc ab b ac c a 1
4
1
bca 1 ba 1 ca 1 1a 1
4
a 1bc b c 1
1
4
a 1bc 1 1c 1
1
4
a 1b 1c 1
1
4
I.E of B =
1
a 1b 1c 1
4
AC =
1
abc bc ab b ac c a 1
4
1
abc bc ac c ab b a 1
4
1
bca 1 ca 1 ba 1 1a 1
4
1
a 1bc c b 1
4
1
a 1cb 1 1b 1
4
1
a 1b 1c 1
4
I.E. of AC =
1
a 1b 1c 1
4
Interaction effect of BC
BC =
1
abc ac bc c ab a b 1
4
87
(In main effect B we are add c1 terms and subtract c0 terms)
1
abc ac bc c ab a b 1
4
1
ac 1 cb 1 ab 1 1b 1
4
1
b 1ac c a 1
4
1
b 1ca 1 1a 1
4
1
b 1a 1c 1
4
Interaction effect of AC =
1
a 1b 1c 1
4
Interaction effect of AB at C0 of C =
1
a1b1 a0 b1 c0 a1b0 a0 b0 c0
2
Interaction effect of AB at C1 0f C =
1
a1b1 a0 b1 c1 a1b0 a0 b0 c1
2
1
bca 1 ca 1 ba 1 1a 1
4
a 1bc c b 1
1
4
a 1cb 1 1a 1
1
4
a 1b 1c 1
1
4
I.E of ABC =
1
a1b 1c 1
4
(αβ)ij is the interaction effect due to i and jth level of Treatment AB.
(αγ)ik is the interaction effect due to i and kth level of Treatment AC.
(αβγ)ijk is the interaction effect due to i,j and kth level of Treatment ABC.
Ʃαi=0, Ʃ(αβ)ij=0
Ʃβj=0, Ʃ(βα)ij=0
Ʃ(αβγ)ijk=0, Ʃ(aγ)ik=0
Ʃρl=0
Null Hypothesis:
[ac]
ac [ab]-[b] [bc]+[abc]-[c]-[ac] [ac]-[c]+[abc]-[bc]-[a]+[1]-[ab]+[b] [AC]
90
bc [bc] [ac]-[c] [ab]-[b]-[a]+[1] [bc]+[abc]-[c]-[ac]-[b]-[ab]+[1]+[a] [BC]
[abc]
S R2 S R2
Replication S R2 r-1 S R2 FR
r 1 S E2
S t2 S t2
Treatment S t2 7 S t2 Ft
7 S E2
S A2 S A2
Main effect A S A2 1 S A2 FA
1 S E2
S B2 S B2
Main effect B S B2 1 S B2 FB
1 S E2
2 2
S AB S AB
Int. effect AB 2
S AB 1 S AB
2
FAB
1 S E2
S C2 S C2
main effect C S C2 1 S C2 FC
1 S E2
2 2
S AC S AC
Int. Effect AC 2
S AC 1 S AC
2
FAC
1 S E2
2 2
S BC S BC
Int. Effect BC 2
S BC 1 S BC
2
RBC
r 1 S E2
91
2 2
S ABC S ABC
Int. Effect 2
S ABC 1 S ABC
2
R ABC
r 1 S E2
ABC
S E2
Error S E2 7r-1) S E2 -
7r 1
Total S T2 8r-1 - -
Table value:
Conclusion:
(i) FR < F1 we need not reject the null hypothesis H0 and we conclude
that there is no significance difference between replicates.
(ii) Ft< F2 we need not reject the null hypothesis H0 and we conclude
that there is no significance difference between Treatments.
(iii) FA< F3 we need not reject the null hypothesis H0 and we conclude
that there is no Factorial effect in main effect A.
(iv) FB< F3 we need not reject the null hypothesis H0 and we conclude
that there is no Factorial effect in main effect B
(v) FAB< F3 we need not reject the null hypothesis H0 and we conclude
that there is no Factorial effect in interaction effect AB.
92
(vi) FC< F3 we need not reject the null hypothesis H0 and we conclude
that there is no Factorial effect in main effect C.
The results and the notations of 22 and 23 can be generalized to the case of 2n
experiment. g Here we consider n factors each at 2 levels. Suppose A, B, C,
D,…, K are the factors each at two levels(0, 1). Corresponding small letters a, b,
c, d,…,k denote the corresponding factors at the second level, the first level of
any factor being signified by the absence of the corresponding small letter. The
treatment combinations, in standard order can be written as:
1, a, b, ab, c, ac, bc, abc, d, ad, bd, abd, cd, acd, bcd, abcd, etc.
93
nC1+nC2+…+nCn=[nC0+nC1+…+nCn]-1
=(1+1)n-1=2n-1
As in the case of 22 and 23- experiment the results for the main effects and
interactions can be generalized to the case 2n experiment. Thus, for n factors A,
B, C, D, …,K the main effects and interactions are given by the expression.
1
[(a 1)(b 1)(c 1)(d 1) (k 1)]
2 n 1
It will be seen that all the factorial effects (main and interaction) are mutually
orthogonal contrasts of treatment totals. Hence, having obtained the factorial
effect totals by Yates technique, the S.S due to each factorial effect is given by
2
2
2n
r.2 n
r.1
i 1
2
S R2
s R2
S R2
B 2
j
C.F
r-1
r 1
n
2
S T2
s n
T
2
Treatments 2
2 1
n
2 1
t
S T2 C.F
i
94
s A2 S A2
Main effect A S A2 =[A]2/r.2n 1
s B2 S B2
Main effect B S =[B]2/r.2n
2
B
1
s K2 S K2
Main effect K S 2
K =[K]2/r.2n 1
Two-factor
Interactions
2
s AB S AB
2
AB 2
S AB = 1
[AB]2/r.2n
2
s AC S AC
2
AC S 2
AC = [AC]2/r.2n 1
2
s BC S BC
2
BC S 2
BC = [BC]2/r.2n 1
Three-factor
Interactions
2
s ABC S ABC
2
ABC S 2
ABC = 1
[ABC]2/r.2n
2
s ACD S ACD
2
ACD S 2
ACD = 1
[ACD]2/r.2n
95
... K S AB ... K
2 2
s AB
n-factor S 2
ABK = 1
interactions [AB…K]2/r.2n
ABCD…K
S E2
Error S = 2
By (r-1)(2n-1) s
2
(r 1)(2 n 1)
E
E
subtraction
The block effects and the factorial effects ( main and interactions) can be tested
for significance by comparing their means S.S with error S.S.
UNIT- V
5.1 Confounding
When only the portions of treatment combinations are allotted to block within a
replication, the comparison between blocks in a replication represents some treatment
comparison, either a main effect or on interaction. In such cases it is not possible to distinguish
treatment comparisons from block comparisons. Such a mix up is termed as confounding.
The only and the greatest advantage of confounding scheme lies in the fact that it reduces
the experimental error considerably by stratifying the experimental material into homogeneous
sub sets or sub groups. The removal of the variation among incomplete blocks within replicates
often results in smaller error mean square as compared with a randomised complete block
design, thus making the comparisons among some treatments more precise.
1. The confounded contrasts are replicated fewer times than are the other contrasts and as
such there is loss of information on them and they can be estimated with a lower degree
of precisions as the number of replications for them is reduced.
96
2. The algebraic calculations are usually more difficult and the statistical analysis is complex,
specifically when some of the units are missing.
3. A number of problems arise if the treatments interact with blocks.
The same interactions are confounding in each replication (or) different sets of interaction
are confounded in different replications. Both the procedures are practised if the same set of the
interactions is confounded in all the replications confounding are called complete confounding.
For example:Let us consider confounding the highest order interactions ABC we know that
the interaction effect ABC is given by
ABC
1
abc bc ac c ab b a 1
4
1
abc a b c ab bc ac 1
4
Replicate
Block 1 1 (ab)
97
Block 4 (c) (abc)
Table 5.1
This confound component contain in the (2r-1) d.f. The ANOVA table will be as follows.
Blocks S b2 2r-1 S b2 S b2
F1
S E2
Treatment S t2 6 S t2 S t2
F2
S E2
98
A S A2 1 S A2 S A2
F3
S E2
B S B2 1 S B2 S B2
F4
S E2
AB 2
S AB 1 2
S AB 2
S AB
F5
S E2
C S C2 1 S C2 S C2
F6
S E2
AC 2
S AC 1 2
S AC 2
S AC
F7
S E2
BC 2
S BC 1 2
S BC 2
S BC
F8
S E2
Error S E2 6(r-1) S E2 -
Total S T2 8r-1 - -
Null Hypothesis: -
Inference:
99
If the calculated F value is less than the tabulated F* value we accept H0. Otherwise we
reject H0.
The analysis of 23 partially confounded design differs from that of the ordinary 23
factorial experiment replicated 4 times only in the calculation of the partially confounded
interactions. Each interaction being estimated only from the three replicates in which the given
interaction is not confounded.
Analysis of 23 partially confounded design with four replications and ‘r’ such
replications. Let us suppose that a number of repetitions say ‘r’, of the above pattern or layout
are performed such the positions of the replication, Blocks within replications and Blocks within
Blocks are randomised then the structure of the Anova table will be as follows.
Table 5.3
Blocks 8r-1 1 2 G2
Ʃ(total of Blocks) -
4 32r
Treatment 7 S A2 S B2 SC2 S AB
2
S BC
2
S AC
2
S ABC
2
St2
A 1 [A]2/32 r
B 1 [B]2/32 r
C 1 [C]2/32 r
AB 1 [AB]2/24 r
100
AC 1 [AC]2/24 r
BC 1 [BC]2/24 r
ABC 1 [ABC]2/24 r
It has already been explained that sum of square for confounded effects are to be
obtained from those replications only in which the given effect is not confounded from practical
point of view these sum of square can be obtain from the table of Yates method for all the 4
replications by appling sum adjusting factor (A.F) for any confounded effects is computed as
follows
(ii) Note the sign of (1) in the corresponding algebraic expression of the effect to be
confounded.
A.F = T1 – T2 (say)
A.F = T2 – T1
101
This adjusting factor will be subtracted from the Factorial effects total of the
confounded effects obtained from Yates’s method for all replicates.
If in a block the number of experimental units or plots is smaller than the number of treatments,
then the block is said to be incomplete and a design constituted of such blocks is called an incomplete
block design.
Balanced incomplete block designs which were developed for experiments in plant breeding and
agriculture selection comparisons among pairs of treatments is made with equal precision.
Definitions
An incomplete block design is one having v treatments and b blocks each of size k such that each
of the treatments is replicated r times and each pair of treatments occurs once and only once in the
same blocks v, b, r, and k are known as the parameters of the I.B.D
Parameters of BIBD
The integers v,r,b,k and λ are called the parameters of the BIBD., where
λ= number of blocks in which any pair of treatments occurs together or number of times any two
treatments occur together in a block. The following parametric relations serve as a necessary condition
for the existence of a BIBD.
Theorem 5.1
Vr = bk
102
Proof:
Since there are v treatments each replicated r times, total number of plots in the design is vr. Further
since there are b blocks each of size k, there are bk plots in all.
Hence vr=bk
Incidence Matrix: Associated with any design D is the incidence matrix N = (nij), (i = 1,2,…,v; j = 1,2,…,b),
where nij denotes the number of times the ith treatment occurs in the jth block. Thus by the definition of
a BIBD
Where nij =1, if ith treatment occurs in the jth block …(5.2)
=0, otherwise
b b … (5.3)
nij nij2 r; (i 1, 2, ...,v)
j 1 j 1
… (5.4)
v v
n n
i 1
ij
i 1
2
ij k ; ( j 1, 2,...,v) …(5.5)
b
n n
j 1
ij ij ; (i l 1, 2,..,v)
Since nijnlj=1 if and only if ith and lth treatments occur together in the jth block otherwise it is zero and
they occur together in λ blocks
103
n12j n 1j n2 j n nvj
1j
j j j
n2 j n1 j n j n2 j nvj
2
2j
NN j j
nvj n1 j n vj n2 j j nvj2
j j
r
r
…. (5.6)
r vv
Proof
Let us denote by Emn the m×n matrix all of whose elements are unity. From (5.6), we get
r 1
r 1
NN Ev1
r vv 1 v1
1
1
= r v 1
1
NN Ev1 N ( N Ev1 )
From (5.4)
n1b n2b nvb 1
104
ni1
i k
n
i 2
k
= N i = N N
n
k
ib
i
From (5.3)
nv1 nv 2 nvb vb 1 b1
n1 j
j r
2 j
n r
k j = k k …(5. 9)
r v1
nvj
j
[r+λ(v-1)]Ev1 = kr Ev1
r+λ(v-1) = kr
r
r
Proof. From (5.6) the determinant of the matrix |NN’|
r vv
Adding 2nd , 3rd , …, vth columns to the first column and taking *r+(v-1)] common from the first column,
we get
105
1
1 r
NN [r (v 1) ]1 r
1 r
1
0 (r ) 0 0
r (v 1) 0 0 (r ) 0
0 0 0 (r )
v 1 k 1 vk
Indicating that the design reduces to randomized block design. Hence, NN is non singular and
consequently Rank( NN ) v …(5.11)
Rank ( NN ) Rank ( N )
Theorem 5.4
(i) r≥k
(ii) b ≥ v+r-k
proof
b
(i) we have vr = bk r k
v
106
since b ≥ v, we get r ≥ k
(ii) we have v-k ≥ 0 and r-k ≥ 0
v
(v k )(r k ) 0 1(r k ) 0
k
v
i.e., ( r k ) ( r k ) 0
k
vr
v r k bvrk vr bk
k
Symmetric BIBD
Theorem 5.5 In a symmetric BIBD, the number of treatments common between any two
blocks is λ
Proof.
We have
r
r
NN
r
r 0 0 0
0 r 0 0
0 0 0 r
=(r-λ)Iv+λEvv
( NN ) 1 I v 2 Evv
r
1
( N ) 1 N 1 I v 2 Evv
r r
107
1
Premultiplying by (N’), we get N 1 N 2 N Evv …(5.14)
r r
N’Evv=NEvv=rEvv=k Evv
N
E vv E vv
r
1
Substituting in (5.14), we get N 1 N Evv
r r
1
Iv N N NE vv
1
N N Evv
r r r
N’N=(r-λ)Iv+λEvv …(5.15)
N’N=N’N
Thus, the inner product of any two rows of N is equal to the inner product of any two columns of N, i.e.,
λ.
Hence, in case of a symmetric BIBD, any two blocks have λ treatments in common.
Reference Books
Das, M.N. and Giri, N.C. (1988) Design and Analysis of Experiments( 2nd Edition). New Age International,
New Delhi.
Douglas,C. Montgomenry (2012). Design and Analysis of Experiments. John Wiley and sons, New York.
Gupta, S.C. and V.K. Kapoor(1999). Fundamentals of Applied Statistics (Third Edition), Sultan Chand and
Sons, New Delhi.
108