Hypothesis Testing and Sample Size Calculation: Po Chyou, Ph. D. Director, BBC
Hypothesis Testing on
• Population mean(s)
• Population median(s)
• Population proportion(s)
• Population variance(s)
• Population correlation(s)
• Association based on contingency table(s)
• Coefficients based on regression model
• Odds ratio
• Relative risk
• Trend analysis
• Survival distribution(s) / curve(s)
• Goodness of fit
Hypothesis Testing
1. Definition of a Hypothesis
An assumption made for the sake of argument
2. Establishing Hypothesis
Null hypothesis - H0
Alternative hypothesis - Ha
3. Testing Hypotheses
Is H0 true or not?
Hypothesis Testing
4. Type I and Type II Errors
Type I error: we reject H0 but H0 is true
α = Pr(reject H0 | H0 is true) = Pr(Type I error)
= Level of significance in hypothesis testing
[Figure: standard normal curve centered at 0. Reject H0 if Z < –Z α/2 or Z > Z α/2 (area α/2 in each tail); do not reject H0 otherwise.]
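The two-sided rejection rule in the figure can be sketched in a few lines of Python. This is a minimal illustration, not part of the original slides: the function names `two_sided_critical_value` and `reject_h0` are hypothetical, and the sketch assumes the test statistic follows a standard normal distribution under H0.

```python
from statistics import NormalDist


def two_sided_critical_value(alpha: float) -> float:
    """Return Z_{alpha/2}: the point with area alpha/2 in the upper tail of N(0, 1)."""
    return NormalDist().inv_cdf(1.0 - alpha / 2.0)


def reject_h0(z: float, alpha: float = 0.05) -> bool:
    """Two-sided decision rule: reject H0 when |Z| exceeds Z_{alpha/2}."""
    return abs(z) > two_sided_critical_value(alpha)


if __name__ == "__main__":
    # For alpha = .05 the critical value rounds to 1.96.
    print(round(two_sided_critical_value(0.05), 2))
    # A statistic of 2.5 falls in the rejection region; 1.0 does not.
    print(reject_h0(2.5), reject_h0(1.0))
```

Lowering α moves the critical value outward, so the rejection region shrinks and a Type I error becomes less likely.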
Hypothesis Testing
6. An Example
A random sample of 400 persons included 240 smokers and 160 non-
smokers. Of the smokers, 192 had CHD, while only 32 non-smokers
had CHD.
Could a health insurance company claim the proportion of smokers
having CHD differs from the proportion of non-smokers having
CHD?
              CHD    No CHD     Total
Smokers       x1     n1 - x1    n1
Non-Smokers   x2     n2 - x2    n2
Total                           n = n1 + n2

              CHD    No CHD     Total
Smokers       192    48         240
Non-Smokers   32     128        160
Total                           400
Hypothesis Testing
Example (continued)
Let P1 = the true proportion of smokers having CHD
P2= the true proportion of non-smokers having CHD
- Step 1 H0 : P1 = P2
- Step 2 Ha : P1 ≠ P2
[Figure: standard normal curve for the two-sided test at α = .05. Reject H0 if Z < –Z.025 = –1.96 or Z > Z.025 = 1.96.]
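As a check on the example, the Z statistic for the smoker data can be computed directly. The slides do not show which form of the statistic is used, so this sketch assumes the usual pooled two-proportion z-test; the function name `two_proportion_z` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z(x1: int, n1: int, x2: int, n2: int) -> float:
    """Pooled two-proportion z statistic for H0: P1 = P2 (assumed test form)."""
    p1_hat = x1 / n1
    p2_hat = x2 / n2
    pooled = (x1 + x2) / (n1 + n2)  # common proportion estimated under H0
    standard_error = sqrt(pooled * (1.0 - pooled) * (1.0 / n1 + 1.0 / n2))
    return (p1_hat - p2_hat) / standard_error


if __name__ == "__main__":
    # Counts from the example: 192 of 240 smokers and 32 of 160 non-smokers had CHD.
    z = two_proportion_z(192, 240, 32, 160)
    z_critical = NormalDist().inv_cdf(0.975)  # about 1.96 for alpha = .05
    print(f"Z = {z:.2f}, critical value = {z_critical:.2f}")
    print("Reject H0" if abs(z) > z_critical else "Do not reject H0")
```

With these counts the sample proportions are 0.80 and 0.20, and Z is about 11.84, far beyond 1.96, so under this assumed test H0 is rejected at α = .05.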
[Figure: chi-square distribution with 1 degree of freedom at α = .05. Reject H0 if χ² > χ².05, 1 = 3.841.]
$$\chi^2 = \sum_{i,j} \frac{(O_{ij} - E_{ij})^2}{E_{ij}}$$
For i = 1, 2 and j = 1, 2
Observed counts (Oij)
              CHD    No CHD    Total
Smokers       192    48        240
Non-Smokers   32     128       160
Total         224    176       400

Expected counts (Eij)
              CHD    No CHD
Smokers       E11    E12
Non-Smokers   E21    E22
Hypothesis Testing
Example (continued) : Same as before
- Step 5 (continued)
E11 = (n1 * m1) / n = (240 * 224) / 400 = 134.4
E12 = n1 - (n1 * m1) / n = 240 - 134.4 = 105.6
E21 = (n2 * m1) / n = (160 * 224) / 400 = 89.6
E22 = n2 - (n2 * m1) / n = 160 - 89.6 = 70.4
where m1 = 224 is the CHD column total.

Expected counts
              CHD      No CHD
Smokers       134.4    105.6
Non-Smokers   89.6     70.4
Hypothesis Testing
Example (continued) : Same as before
- Step 5 (continued)
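The expected counts and the chi-square statistic for this example can be reproduced with a short script. This is a sketch using only the observed table and the critical value 3.841 given earlier; the helper names `expected_counts` and `chi_square_statistic` are hypothetical, and no continuity correction is applied (an assumption, since the slides do not mention one).

```python
def expected_counts(observed):
    """Expected count for each cell: (row total * column total) / n."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def chi_square_statistic(observed):
    """Sum of (O - E)^2 / E over all cells, with no continuity correction."""
    expected = expected_counts(observed)
    return sum(
        (o - e) ** 2 / e
        for o_row, e_row in zip(observed, expected)
        for o, e in zip(o_row, e_row)
    )


# Rows: smokers, non-smokers. Columns: CHD, no CHD.
OBSERVED = [[192, 48], [32, 128]]
CRITICAL_VALUE = 3.841  # chi-square critical value for alpha = .05, 1 df

if __name__ == "__main__":
    print(expected_counts(OBSERVED))
    statistic = chi_square_statistic(OBSERVED)
    print(f"chi-square = {statistic:.2f}")
    print("Reject H0" if statistic > CRITICAL_VALUE else "Do not reject H0")
```

The statistic comes to about 140.26, well above 3.841, so H0 is rejected at α = .05. For a 2 x 2 table this value equals the square of the pooled two-proportion Z statistic computed from the same counts.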
Definition of Power
Recall :
β = Pr(accept H0 | H0 is false) = Pr(Type II error)
Power = 1 - β = Pr(reject H0 | H0 is false)
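To make the definition concrete, power can be written as a function of how far the truth lies from H0. The sketch below assumes a two-sided z-test whose statistic is normal with unit variance and mean `shift` (the true difference divided by its standard error); `two_sided_z_power` and `shift` are hypothetical names introduced for illustration.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def two_sided_z_power(shift: float, alpha: float = 0.05) -> float:
    """Pr(reject H0) when the z statistic is distributed N(shift, 1)."""
    z_critical = _STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    upper_tail = _STD_NORMAL.cdf(shift - z_critical)   # Pr(Z > z_critical)
    lower_tail = _STD_NORMAL.cdf(-shift - z_critical)  # Pr(Z < -z_critical)
    return upper_tail + lower_tail


if __name__ == "__main__":
    for shift in (0.0, 1.0, 2.0, 2.8, 4.0):
        power = two_sided_z_power(shift)
        print(f"shift = {shift:.1f}  power = {power:.3f}  beta = {1.0 - power:.3f}")
```

When `shift` is 0, H0 is true and the rejection probability equals α; as `shift` grows, β falls and power rises toward 1.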
Sample Size Estimation for Intervention on Tick Bites Among Campers
Assumptions
1. The proportion (PCON) of women who are obese at baseline (i.e., the
control group) is assumed to be constant. There are a total of 840 women
in the control group. Based on our preliminary data analysis,
approximately 50% of these 840 women are obese at baseline (BMI
>= 27.3).
2. The proportion (PINT) of women who are obese in the intervention group
is assumed to be reduced by 5% or more compared with that of the
control group after the intervention has been implemented. A total of
680 women have been newly recruited. Based on our preliminary data
analysis, 50% of these 680 newly recruited women are obese. Assuming
that 60% of these obese women agree to participate, we will have about
200 women (0.60 × 340 = 204) to target for the intervention.
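A power calculation under these assumptions might look like the following sketch. The source's own calculation is not shown, so several choices here are assumptions: a two-sided two-proportion z-test at α = .05 with a normal approximation, and the "5%" reduction read as 5 percentage points (PINT = 0.45 against PCON = 0.50). The function name `two_proportion_power` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_power(p1: float, n1: int, p2: float, n2: int,
                         alpha: float = 0.05) -> float:
    """Approximate power of a two-sided two-proportion z-test (normal approximation)."""
    std_normal = NormalDist()
    z_critical = std_normal.inv_cdf(1.0 - alpha / 2.0)
    # Standard error under H0 uses the pooled proportion.
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    se_null = sqrt(pooled * (1.0 - pooled) * (1.0 / n1 + 1.0 / n2))
    # Standard error under the alternative uses each group's own proportion.
    se_alt = sqrt(p1 * (1.0 - p1) / n1 + p2 * (1.0 - p2) / n2)
    difference = abs(p1 - p2)
    upper_tail = std_normal.cdf((difference - z_critical * se_null) / se_alt)
    lower_tail = std_normal.cdf((-difference - z_critical * se_null) / se_alt)
    return upper_tail + lower_tail


if __name__ == "__main__":
    # ASSUMPTIONS: PCON = 0.50 with 840 control women (from the text);
    # PINT = 0.45 with 200 intervention women (5 points lower, assumed reading).
    power = two_proportion_power(p1=0.50, n1=840, p2=0.45, n2=200)
    print(f"Approximate power: {power:.3f}")
```

Reading the reduction as relative rather than absolute, or choosing a one-sided test, would change the result, so the printed value only illustrates the method.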
Statistical Power Calculation for Intervention on Obesity of Women in MESA (continued)
Assumptions
Jacob Cohen
OR