Professional Documents
Culture Documents
Sequential Test
Sequential Test
Sequential Test
Goals
P(Making an error) = α
Null Alternative
True True Total
Not Called
U T m-R
Significant
Called
V S R
Significant
m0 m-m0 m
p˜ j = min[mp j , 1]
• The general null hypothesis (that all the null hypotheses are
true) is rarely of interest
p˜ j = min[(m " j + 1) • p j , 1]
The point here is that we don’t multiply every pi by the same factor m
• For
!example, when m = 10000:
p˜1 = 10000 • p1, p˜ 2 = 9999 • p2 ,..., p˜ m = 1• pm
Who Cares About Not Making ANY
Type I Errors?
• FWER is appropriate when you want to guard against
ANY false positives
V V
FDR = FPR =
R m0
Benjamini and Hochberg FDR
• To control FDR at level δ:
2. Then find the test with the highest rank, j, for which the p
value, pj, is less than or equal to (j/m) x δ
j
p( j) " #
m
B&H FDR Example
Controlling the FDR at δ = 0.05
"V %
Storey : pFDR = E $ | R > 0'
#R &
!
20000
10000
0
P-values
Estimating The Proportion of Truly Null
Tests
• Under the alternative hypothesis p-values are skewed towards 0
1500
1000
Frequency
500
0
P-value
Estimating The Proportion of Truly Null
Tests
• Combined distribution is a mixture of p-values from the null and
alternative hypotheses
5000
4000
3000
Frequency
2000
1000
0
P-value
Estimating The Proportion of Truly Null
Tests
• For p-values greater than say 0.5, we can assume they mostly
represent observations from the null hypothesis
5000
4000
3000
Frequency
2000
1000
0
P-value
Definition of π0
• "ˆ 0 is the proportion of truly null tests:
#{ pi > #;i = 1,2,...,m}
"ˆ 0 ( #) =
m(1$ #)
!
5000
4000
!
3000
Frequency
2000
1000
0