Professional Documents
Culture Documents
Hypothesis Testing and Comparison of Two Populations
Hypothesis Testing and Comparison of Two Populations
Hypothesis Testing and Comparison of Two Populations
Dr. Burton
If the heights of male teenagers are normally distributed with a mean of 60 inches and standard deviation of 10, And the sample size was 25, what percentage of boys heights in inches would be: Between 57 and 63
Lass than 58
61 or larger
7.2a
%
Height Z x-
57 57 - 60 10 / 25 63 - 60 10 / 25
60 0
63
Z=
s/ n
7.2b
%
Height Z Z= x- s/ n
58 60 -1.0 0 58 - 60 10 / 25
7.2c
%
Height Z Z= x- s/ n 61 - 60 10 / 25
60 0
61
0.5
Z = 0.50
Hypothesis Testing
Hypothesis: A statement of belief Null Hypothesis, H0: there is no difference between the population mean and the hypothesized value 0. Alternative Hypothesis, Ha: reject the null hypothesis and accept that there is a difference between the population mean and the hypothesized value 0.
Correct results 1-
Type I Error
b Type II
Error
c Reject H0
Correct results 1-
Differences
H0 True = statistically insignificant H0 False = statistically significant Accept H0 = statistically insignificant Reject H0 = statistically significant
http://en.wikipedia.org/wiki/False_positive
= 0.05
0.025
1.96 SE
-2 -1 0 1 2
0.025
-3
1.645 SE
-3 -2 -1 0 1 2
= 0.05
3
t-Tests
Students t-test is used if:
two samples come from two different groups. e.g. A group of students and a group of professors
t-Test
(Two Tailed)
x A - xB - 0
t =
Sp [ ( 1/NA ) + ( 1/NB) ]
df = NA + NB - 2
A2
NA
B2
Population NB
s A2
NA
s B2
Sample
NB
1
Sp NA
NB
Sp =
s2(nA-1) + s2 (nB 1) n A+ nB - 2
12.14 (6) + 34.67 (5)
Sp = 7+ 6 - 2
22.38 = 4.73
t =
Sp [ ( 1/NA ) + ( 1/NB) ]
=
4.73
d f = N E + N C - 2 = 11
Paired t-test
d = D/N
t paired = t p =
d - 0
Standard error of d
d - 0 = ------------Sd2 N
S d2 = df = N - 1
d2/ N-1
d 2 = D 2 ( D) 2 / N
6
7 8 9 10
22
12 30 5 14
26
13 47 16 9
4
1 17 11 -5
16
1 289 121 25
Total
171
208
D = 37
D2 = 511
N = 10 d - 0 = ------------Sd2 N
t paired = t p =
d - 0 Standard error of d
d = D/N
= 37/10 = 3.7
= 3.7 / 41.5667 / 10
= 3.7 / 4. 15667
d 2 = D 2 ( D) 2 / N
= 511 - 1369/10 = 374.1 S d2 = d 2 / N - 1 = 374.1 / 10 1 = 41.5667
= 3.7 / 2.0387
= 1.815 df = N 1 = 9 0.05 > 1.833
Correct results 1-
Type I Error
Reject H0
Differences H0 True = statistically insignificant H0 False = statistically significant Accept H0 = statistically insignificant Reject H0 = statistically significant
Standard 2 X 2 table
Disease status Present Absent Risk Present Factor Status Absent Total Total
a c a+c
b d b+d
a = subjects with both the risk factor and the disease b = subjects with the risk factor but not the disease c = subjects with the disease but not the risk factor d = subjects with neither the risk factor nor the disease a + b = all subjects with the risk factor c + d = all subjects without the risk factor a + c = all subjects with the disease b + d = all subjects without the disease a + b + c + d = all study subjects
Standard 2 X 2 table
Disease status Present Absent Risk Present Factor Status Absent Total Total
a c a+c
b d b+d
5 1 6
13 81 94
18 82 100
Sensitivity = a/a+c = 100 X 5/6 = 83.3% (16.7% false neg.) Specificity = d/b+d = 100 X 81/94 = 86.2%(13.8% false pos.)