Professional Documents
Culture Documents
Experimental Design
Experimental Design
Experimental Design
Dr Yunita Ismail
Environmental Engineering Study Program
Engineering Faculty
President University
What is Experimental Design?
Because of variability
We wouldn’t need a science of experimental design if
3
Examples of Environmental Projects
4
Goals of Experimental Design
Without
randomization,
Confounding variable
the confounding
variable differs
among
treatments
Confounding variable Experimental units Treatments
Experimental units Treatments
With
randomization,
Confounding variable
the confounding
variable does
not differ among
treatments
Blinding
• Blinding is the concealment of information from
the participants and/or researchers about which
subjects are receiving which treatments
• Single blind: subjects are unaware of treatments
• Double blind: subjects and researchers are unaware
of treatments
Blinding
• Example: testing heart medication
• Two treatments: drug and placebo
• Single blind: the patients don’t know which group
they are in, but the doctors do
• Double blind: neither the patients nor the doctors
administering the drug know which group the
patients are in
Goals of Experimental Design
• Avoid experimental artifacts
• Eliminate bias
1. Use a simultaneous control group
2. Randomization
3. Blinding
• Reduce sampling error
1. Replication
2. Balance
3. Blocking
Replication
• Experimental unit: the individual unit to which
treatments are assigned
Experiment 1
Experiment 2
Tank 1 Tank 2
Experiment 3
All separate tanks
Replication
• Experimental unit: the individual unit to which
treatments are assigned
2 Experimental
Experiment 1
Units
2 Experimental
Experiment 2
Units
Tank 1 Tank 2
8 Experimental
Units Experiment 3
All separate tanks
Replication
• Experimental unit: the individual unit to which
treatments are assigned
2 Experimental
Experiment 1
Units
2 Experimental
Units Pseudoreplication Experiment 2
Tank 1 Tank 2
8 Experimental
Units Experiment 3
All separate tanks
Why is pseudoreplication bad?
Experiment 2
Tank 1 Tank 2
s
SE Y
n
Larger n Smaller SE
Balance
• In a balanced experimental design, all treatments
have equal sample size
Better than
Balanced Unbalanced
Balance
• In a balanced experimental design, all treatments
have equal sample size
• This maximizes power
• Also makes tests more robust to violating
assumptions
Blocking
• Blocking is the grouping of experimental units that
have similar properties
• Within each block, treatments are randomly
assigned to experimental treatments
• Randomized block design
Randomized Block Design
Randomized Block Design
• Example: cattle tanks in a field
Very sunny
Not So Sunny
Block 1
Block 2
Block 3
Block 4
What good is blocking?
• Blocking allows you to remove extraneous variation
from the data
• Like replicating the whole experiment multiple
times, once in each block
• Paired design is an example of blocking
Basic Ideas of Design:
Independent Variables (Factors)
Remember:
You can randomly assign treatment levels but not blocks
Basic Ideas of Design (Crossing)
Example
The Tennessee class size experiment assigned students to one
of three class size conditions. All three treatment conditions
occurred within each of the participating schools
Example
The Tennessee class size experiment actually assigned
classrooms to one of three class size conditions. Each
classroom occurred in only one treatment condition
T1 T2 … T1
T1 T2 T1
…
T2 T1 T2
T1 T2 T1
…
T2 T1 T2
Blocks
1 2 … n
Treatment 1
…
Treatment 2
Consider possible sources of bias
Think About These Designs
Individual 1 Individual 1
Individual 2 Individual 2
…
…
Individual nT Individual nC
The Randomized Block Design
Block 1 … Block m
Individual 1 Individual 1
…
…
…
Treatment 1
Individual n1 Individual nm
Individual n1 +1 Individual nm + 1
…
…
Treatment 2 …
Treatment Control
…
Individual n1 Individual nm Individual nm+1 Individual n2m
Randomization Procedures
You may want to use a table of random numbers, but be sure to pick
an arbitrary start point!
• Hierarchical designs
Experimental Designs
For each design we will look at
• Structural Model for data (and what it means)
• Two inference models
• What does ‘treatment effect’ mean in principle
• What is the estimate of treatment effect
• How do we deal with context effects
• Two statistical analysis procedures
• How do we estimate and test treatment effects
• How do we estimate and test context effects
• What is the sensitivity of the tests
The Randomized Block Design
The population (the sampling frame)
Treatment 1 2 … m
1 …
2 …
The Randomized Block Design
School 1
Schools
Treatment 1 2 … m
1 …
2 …
The Conceptual Model
The statistical model for the observation on the kth
person in the jth school in the ith treatment is
Yijk = μ +αi + βj + αβij + εijk
where
μ is the grand mean,
αi is the average effect of being in treatment i,
βj is the average effect of being in school j,
αβij is the difference between the average effect of
treatment i and the effect of that treatment in school
j,
εijk is a residual
Effect of Context
Yijk i j ij ijk
Context Effect
Two-level Randomized Block Design
With No Covariates (HLM Notation)
Level 1 (individual level)
α1 + αβ1j
α2 + αβ2j
α1 + αβ1j + ε1j●
α2 + αβ2j + ε2j●
Effects and Estimates
The comparative treatment effect in any given school j is
(α1 – α2)
The term (ε1●● – ε2●●) depends on the students in the schools in the
sample
Both (ε1●● – ε2●●) and (αβ1● – αβ2●) are random and average to 0
across students and schools, respectively
Conditional Inference
(Schools Fixed)
The estimate of the mean treatment effect in the experiment is still
The term (ε1●● – ε2●●) depends on the students in the schools in the
sample
The term (αβ1● – αβ2●) depends on the schools in sample, but the
treatment effect in the sample of schools is the effect we want to
estimate
Expected Mean Squares
Randomized Block Design
(Two Levels, Schools Random)
Source df E{MS}
FT = MST/MSTxS with (m – 1) df
Source df E{MS}
2σW2 /mn
Fixed Effects Procedures
The test for treatment effects:
α1 α 2 α 1• α 2 •
n
with m(n – 1) df
Comparing Fixed and Mixed Effects Statistical
Procedures
(Randomized Block Design)
Fixed Mixed
Inference
Model Conditional Unconditional
Operational α1 α 2 α 1• α 2 • α1 α 2 n
n
Effect Size 1 ( nω S 1) ρ
df 2m(n – 1) (m – 1)
Schools
2
Treatment 1 2 … m m +1 m +2 …
m
2
The Hierarchical Design
Treatment 1 schools
Schools
Treatment 1 2 … m m +1 m+2 … 2m
2
The Hierarchical Design
Treatment 2 schools
Schools
2
The Conceptual Model
The statistical model for the observation on the kth person in the jth school in the
ith treatment is
αβij is the difference between the average effect of treatment i and the effect of
that treatment in school j,
εijk is a residual
Or βj(i) = βi + αβij is a term for the combined effect of schools within treatments
The Conceptual Model
The statistical model for the observation on the kth person in the jth school in the
ith treatment is
or βj(i) = βi + αβij is a term for the combined effect of schools within treatments
Two-level Hierarchical Design
With No Covariates (HLM Notation)
Level 1 (individual level)
But we cannot estimate the treatment effect in a single school because each
school gets only one treatment
(α1 – α2)
Note that this treatment effect is not quite the same as in the
randomized block design, where we estimate
(α1 – α2) + (αβ1● – αβ2●)
Statistical Analysis Procedures
Two kinds of statistical analysis procedures
(as in the randomized block design)
Source df E{MS}
FT = MST/MSBS with (m – 2) df
α1 α 2 n
2 1 2( n 1) ρ2
where ρ = σS /(σS2 + σW ) = σS /σ
2
Expected Mean Squares
Hierarchical Design
(Two Levels, Schools Fixed)
Source df E{MS}
Note that the school effects are confounded with treatment effects
and m(n – 1) df
Comparing Fixed and Mixed Effects Procedures
(Hierarchical Design)
Fixed Mixed
Inference
Conditional Unconditional
Model
Estimand (α1 – α2) + (β●(1) – β●(2)) (α1 – α2)
• practically difficult
• politically infeasible
• theoretically impossible
SE 2is
The precision N
Precision of the Estimated Treatment Effect
SE increases as ρ increases
Statistical Power
Power in simple (simple random sample) designs depends on:
• Significance level
• Effect size
• Sample size
d
n 0.10 0.20 … 0.80 1.00 1.20 1.40
8 05 07 … 31 46 60 73
9 06 07 … 35 51 65 79
10 06 07 … 39 56 71 84
11 06 07 … 43 63 76 87
Computing Statistical Power
Power in complex (clustered sample) designs depends on:
• Significance level
• Effect size δ
This is the same table that you use for simple designs
Power in Two-level Hierarchical Design
With No Covariates
Basic Idea:
Operational Effect Size = (Effect Size) x (Design Effect)
n
T
1 n 1 ρ
ΔT = δ x (Design Effect)
1 n 1 ρ
SE decreases as m increases
SE deceases as n increases, but only up to point
SE increases as ρ increases
SE decreases as RW2 and RS2 increase
Power in Two-level Hierarchical Design
With Covariates
Basic Idea:
ΔT = δ x (Design Effect)
1 n 1 ρ RW n R S RW
A 2 2 2
1 n 1 ρ RW n R S RW
2 2 2
where
μ is the grand mean,
αi is the average effect of being in treatment i,
βj(i) is the average effect of being in school j, in treatment i
γk(ij) is the average effect of being in class k in treatment i, in school
j,
εijkl is a residual
Three-level Hierarchical Design
With No Covariates (HLM Notation)
Level 1 (individual level)
Yijkl = β0jk + εijkl ε ~ N(0, σW2)
2 1 pn 1 ρ S ( n 1) C
SE T
m pn
SE decreases as m increases
ΔT = δ x (Design Effect)
1 ( pn 1) S n 1 ρ C
1 S
1
p
C
Thus power only increases up to some limit as n increases
Three-level Hierarchical Design
With Covariates (HLM Notation)
Level 1 (individual level)
2
SE T
m
1 ( pn 1) S n 1 ρ RW pnR S RW S nR C RW C
2 2 2 2 2
pn
SE decreases as m increases
SE increases as ρ increases
ΔT = δ x (Design Effect)
1 ( pn 1) S n 1 ρ RW pnR S RW S nR C RW C
2 2 2 2 2
1 1 R S S
1 R
2 1 2
p C C
Thus power only increases up to some limit as n increases
Randomized Block Designs
Two-level Randomized Block Design
With No Covariates (HLM Notation)
Level 1 (individual level)
Thus
ωS = σTxS2/σS2
Precision in Two-level Randomized Block Design
With No Covariates
SE increases as ρ increases
SE increases as ωS = σTxS2/σS2 increases
Power in Two-level Randomized Block Design
With No Covariates
Basic Idea:
Operational Effect Size = (Effect Size) x (Design Effect)
n
T
1 n 1 ρ
ΔT = δ x (Design Effect)
1 n S 1 ρ
SE increases as ρ increases
SE increases as ωS = σTxS2/σS2 increases
Basic Idea:
ΔT = δ x (Design Effect)
1 n S 1 ρ RW n S R S RW
A 2 2 2
Thus
ωS = σTxS2/σS2
Precision in Three-level Randomized Block Design
With No Covariates
2 1 pn S 1 ρ S ( n 1) C
SE T
m pn
SE decreases as m increases
SE increases as ωS increases
Basic Idea:
ΔT = δ x (Design Effect)
1 ( pn S 1) S n 1 ρ C
1 2 S S 1
p
C
Thus power only increases up to some limit as n increases
Power in Three-level Randomized Block Design
With Covariates
2
SE T
m
1 ( pn S 1) S n 1 ρ RW pn S R S RW S nR C RW C
2 2 2 2 2
pn
SE decreases as m increases
Basic Idea:
Operational Effect Size = (Effect Size) x (Design Effect)
ΔT = δ x (Design Effect)
pn / 2
1 ( pn S 1) S n 1 ρ RW pn S R S RW S nR C RW C
2 2 2 2 2
The operational sample size is the number of schools
Power in Three-level Randomized Block Design
With Covariates
1 2 1 R S S S
1 R
2 1 2
p C C
Thus power only increases up to some limit as n increases
What Unit Should Be Randomized?
(Schools, Classrooms, or Students)
Experiments cannot estimate the causal effect on any
individual
Assign Schools 1 pn 1 ρ S ( n 1) C
(Hierarchical Design)
pn