Analysis of Variance and Design of Experiments

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 23

Analysis of Variance and Design of

Experiments
Experimental Designs and Their Analysis
:::
Lecture 18
Completely Randomized Design

Shalabh
Department of Mathematics and Statistics
Indian Institute of Technology Kanpur

Slides can be downloaded from http://home.iitk.ac.in/~shalab/sp1


Completely randomized design (CRD)
The CRD is the simplest design. Suppose there are v treatments
to be compared.
• All experimental units are considered the same and no
division or grouping among them exist.
• In CRD, the v treatments are allocated randomly to the whole
set of experimental units, without making any effort to group
the experimental units in any way for more homogeneity.
• Design is entirely flexible in the sense that any number of
treatments or replications may be used.

2
Completely randomized design (CRD)
• The number of replications for different treatments need not
be equal and may vary from treatment to treatment
depending on the knowledge (if any) on the variability of the
observations on individual treatments as well as on the
accuracy required for the estimate of individual treatment
effect.

3
Completely randomized design (CRD)
Example:
Suppose there are 4 treatments and 20 experimental units, then
 the treatment 1 is replicated, say 3 times and is given to 3
experimental units,
 the treatment 2 is replicated, say 5 times and is given to 5
experimental units,
 the treatment 3 is replicated, say 6 times and is given to 6
experimental units and
 finally, the treatment 4 is replicated [20 ‐ (6 + 5 + 3) =]6 times
and is given to the remaining 6 experimental units.
4
Completely randomized design (CRD)
• All the variability among the experimental units goes into
experimented error.

• CRD is used when the experimental material is homogeneous.

• CRD is often inefficient.

• CRD is more useful when the experiments are conducted


inside the lab.

• CRD is well suited for the small number of treatments and for
the homogeneous experimental material.

5
Completely randomized design (CRD): Layout of CRD
Following steps are needed to design a CRD:

 Divide the entire experimental material or area into a number of


experimental units, say n.

 Fix the number of replications for different treatments in


advance (for given total number of available experimental
units).

 No local control measure is provided as such except that the


error variance can be reduced by choosing a homogeneous set of
experimental units.

6
Completely randomized design (CRD) : Procedure
Let the v treatments are numbered from 1,2,...,v and
ni be the number of replications required for ith treatment such
v
that  n  n.
i 1
i

 Select n1 units out of n units randomly and apply treatment 1


to these n1 units.
(Note: This is how the randomization principle is utilized is CRD.)

 Select n2 units out of (n ‐ n1 ) units randomly and apply


treatment 2 to these n2 units.
7
Completely randomized design (CRD) : Procedure
 Continue with this procedure until all the treatments have
been utilized.

 Generally, the equal number of treatments are allocated to all


the experimental units unless no practical limitation dictates
or some treatments are more variable or/and of more
interest.

8
Completely randomized design (CRD): Analysis
There is only one factor which is affecting the outcome –
treatment effect. So the set‐up of one‐way analysis of variance is
to be used.
yij : Individual measurement of jth experimental units for ith
treatment i = 1,2,...,v , j = 1,2,...,ni.
yij : Independently distributed following N (    i ,  2 )
v

with  ni i  0.
i 1

 : overall mean
 i : ith treatment effect

9
Completely randomized design (CRD): Analysis
H 0 : 1   2  ...   v  0 Treatments
1 2 ... v
H1 : All  s are not equal.
i
'

_____________
The data set is arranged as follows: y11 y21 ... yv1
y12 y22 ... yv 2
   
y1n1 y2n2 ... yvnv
_____________
T1 T2 ... Tv
ni
where Ti   yij is the treatment total due to ith effect,
j 1
v v ni
G   Ti   yij is the grand total of all the observations.
i 1 i 1 j 1
10
Completely randomized design (CRD): Analysis
In order to derive the test for H0, we can use either the likelihood
ratio test or the principle of least squares.

Since the likelihood ratio test has already been derived earlier, so
we choose to demonstrate the use of the least‐squares principle.

The linear model is yij     i   ij , i  1, 2,..., v , j  1, 2,..., ni

where  ij ' s are identically and independently distributed random


errors with mean 0 and variance  2.
The normality assumption of  s is not needed for the estimation
of parameters but will be needed for deriving the distribution of
various involved statistics and in deriving the test statistics. 11
Completely randomized design (CRD): Analysis
Let v ni v ni
S     ( yij     i )2 .
2
ij
i 1 j 1 i 1 j 1

Minimizing S with respect to  and  i, we get the normal equations

S v ni


 0  2  (y
i 1 j 1
ij    i )  0

v v ni
or n   ni i   yij
i 1 i 1 j 1

S ni
 0  2 ( yij     i )  0, i  1, 2,..., v
 i j 1
ni
or ni   ni i   yij
j 1

12
Completely randomized design (CRD): Analysis
v
Solving them using n
i 1
i i  0 , we get

ˆ  yoo
ˆi  yio  yoo
where
1 ni
yio 
ni
y
j 1
ij is the mean of observation receiving the ith treatment
and
1 v ni
yoo   yij is the mean of all the observations.
n i 1 j 1

13
Completely randomized design (CRD): Analysis
The fitted model is obtained after substituting the estimate ̂ and
ˆi in the linear model, we get

yij  yoo  ( yio  yoo )  ( yij  yio )


or ( yij  yoo )  ( yio  yoo )  ( yij  yio ).

Squaring both sides and summing over all the observation, we have
v ni v v ni

 ij oo  i io oo  ij io
( y  y
i 1 j 1
)  n ( 2
y  y )
i 1
 ( y  y ) 2
2

i 1 j 1

  
 Sum of squares 
 Total sum     Sum of squares 
or  of squares  =  due to treatment  +  due to error 
  effects  
 
or TSS  SSTr  SSE 14
Completely randomized design (CRD): Analysis
v ni
 Since  ( y
i 1 j 1
ij  yoo )  0, so TSS is based on the sum of (n ‐1)
squared quantities. Thus TSS carries only (n ‐ 1) degrees of freedom.
v
 Since  ni ( yio  yoo )  0, so SSTr is based only on the sum of (v ‐1)
i 1
squared quantities. Thus SSTr carries only (v ‐ 1) degrees of
freedom.
ni
 Since n (y
i 1
i ij  yio )  0 for all i = 1,2,...,v, so SSE is based on the

sum of squaring n quantities like ( yij  yio ) with v constraints


ni

(y
j 1
ij  yio )  0, so SSE carries (n – v) degrees of freedom.

15
Completely randomized design (CRD): Analysis
• Using the Fisher‐Cochran theorem,
TSS = SSTr + SSE with degrees of freedom partitioned as
(n – 1) = (v ‐ 1) + (n – v).

Moreover, equality in TSS = SSTr + SSE has to hold exactly.

To ensure that the equality holds exactly, we find one of the sums
of squares through subtraction.

Generally, it is recommended to find SSE by subtraction as


SSE = TSS ‐ SSTr . 16
Completely randomized design (CRD):Analysis
v ni
TSS   ( yij  yio ) 2
i 1 j 1

v 2 ni
G
  yij2 
i 1 j 1 n
v ni
G2
G   yij : Grand total, : correction factor
i 1 j 1 n
ni
SSTr   ni ( yio  yoo ) 2
j 1

 Ti 2  G 2
v
   
i 1  ni  n
ni
Ti   yij Treatment total.
j 1
17
Completely randomized design (CRD): Analysis
Now under H 0 : 1   2  ...   v  0 , the model become
Yij     ij ,
v ni
and minimizing S   ij
 2

i 1 j 1

S G

with respect to gives  0  ˆ   yoo .
 n
v ni
The SSE under H0 becomes SSE   ( yij  yoo )2
i 1 j 1

and thus TSS = SSE. This TSS under H0 contains the variation only
due to the random error whereas the earlier TSS = SSTr + SSE
contains the variation due to treatments and errors both.
18
Completely randomized design (CRD): Analysis
This TSS under TSS = SSE contains the variation only due to the
random error whereas the earlier TSS = SSTr + SSE contains the
variation due to treatments and errors both.

The difference between the two will provides the effect of


treatments in terms of sum of squares as
v
SSTr   ni ( yi  yoo ) 2
i 1

19
Completely randomized design (CRD): Distributions and
decision rules:
Using the normal distribution property of  ij ' s, we find that
yij’s are also normal as they are the linear combination of  ij ' s.
Under H0  SSTr ~  2 (v  1)
2
SSE
 ~  2 (n  v).
2
SSTr and SSE are independently distributed.
MStr
Under H0 , ~ F (v  1, n  v)
MSE
Reject H0 at  * level of significance if F  F *, 1,n .
[Note: We denote the level of significance here by  * because
 has been used for denoting the factor] 20
Completely randomized design (CRD): ANOVA table

The analysis of variance table for CRD is

Source of Degrees of Sum of squares Mean squares F ‐ value


variation freedom
Between
v–1 SSTr MSTr
treatments
MSTr
Within
MSE
n–v SSE MSE
populations

Total n–1 TSS

21
Completely randomized design (CRD): Expectations
v ni
E ( SSE )   E ( yij  yio ) 2
i 1 j 1
v ni
  E ( ij   io ) 2
i 1 j 1
v ni v
  E ( )   i io. )
i 1 j 1
n E ( 2 2
ij
i 1
v
2
 n   ni 2

i 1 ni
 (n  v) 2
 SSE 
E ( MSE )  E     2

 nv 

22
Completely randomized design (CRD): Expectations
v
E ( SSTr )   ni E ( yio  yoo ) 2
i 1
v
  ni E ( i   io   oo ) 2
i 1
v
 v 2 
  ni    ni E ( io )  nE ( oo ) 
i
2 2

i 1  i 1 
v
 v
 2
 2

  ni i    ni
2
n 
i 1  i 1 ni n 
v
  ni i2  (v  1) 2
i 1

 SStr  1 v
E ( MSTr )  E    
 v  1  v  1 i 1
n 
i i
2
  2
.

In general E  MSTr   
2

but under H 0 , all  i  0 and so E ( MSTr )   2 23

You might also like