Professional Documents
Culture Documents
DR - Dinesh BIOSTAT Tests of Significance 1 Min
DR - Dinesh BIOSTAT Tests of Significance 1 Min
Dr M. Dinesh Dhamodhar
Reader
Department of Public Health Dentistry
Contents
• Introduction
• Tests of significance
• Parametric test
• Conclusion
• Bibliography
• VARIABLE
– A general term for any feature of the unit
which is observed or measured.
• FREQUENCY DISTRIBUTION
– Distribution showing the number of
observations or frequencies at different values or
within certain range of values of the variable.
Tests of significance
• Population
• is any finite collection of elements
I.e – individuals, items, observations etc,.
✓Sample –
✓is a part or subset of the population
✓Parameter –
✓ is a constant describing a population
✓Statistic –
✓is a quantity describing a sample, namely a function
of observations
Statistic Parameter
(Greek) (Latin)
Mean x
Standard s
Deviation
Variance 2
2
s
Correlation r
coefficient
Number of n N
subjects
Hypothesis testing
• Hypothesis H
• is an assumption about the status of a phenomenon
• Type I error =
• Type II error =
tolerable probability
✓Thus α is the maximum
of type I error
• Difference b/n level of significance & P-value -
LOS P-value
1) Maximum tolerable 1) Actual probability of
chance of type I error type I error
2) α is fixed in advance 2) calculated on basis of data
following procedures
Zone of Zone of
Rejection H0 Rejection H0
1/2
Zone of 1/2
Acceptance H0
x x +1.96SD
x −1.96SD
Confidence interval
Confidence limits – 95%
TESTS IN TEST OF SIGNIFICANCE
Parametric
Non-parametric
(normal distribution &
(not follow
Normal curve )
normal distribution)
where SD =
(n1 − 1)SD12 + (n2 − 1)SD22
n1 + n2 − 2
SE( x1 − x2 ) =
(n1 − 1)SD12 + (n2 − 1)SD22 1 1
n1 + n2 − 2 n + n
1 2
4) Calculate degree of freedom
df = (n1 −1) + (n2 −1) = n1 + n2 −1
5) Compare the calculated value &
the table value
6) Draw conclusions
• Example – difference b/n caries experience of high
& low socioeconomic group
Sl Details High socio Low socio
no economic group economic group
I Sample size n1 = 15 n2 = 10
x1 − x2 0.65
t= = = 6.34, df = 23
SE( x1 − x2 ) 0.1027
t0.001 = 3.76 tc t0.001
There is a significant difference
Other applications
II) Difference b/n sample mean & population mean
x−
t= df = n − 1
SE = SD
n
III) Difference b/n two sample proportions
p1 − p2 n1 p1 + n2 p2
where P =
t= n1 + n2
1 1
PQ + Q = 1− P
n1 n2
df = n1 + n2 − 2
Paired ‘t’ test
SD
SE =
n
x −0 x
4. Test criterion t= =
SE(d ) SD(x )
n
5. Degree of freedom df = n − 1
6. Refer ‘t’ table & find the probability
of calculated value
7. Draw conclusions
• Example – to find out if there is any significant improvement in DAI
scores before and after orthodontic treatment
Sl no DAI before DAI after Difference Squares
1 30 24 6 36
2 26 23 3 9
3 27 24 3 9
4 35 25 10 100
5 25 23 2 4
Total 20 158
Mean x =
x 20
= =4
n 5
sum of squares, (x − x ) = (6 − 4) + (3 − 4) + (3 − 4) + (10 − 4) + (2 − 4)
2 2 2 2 2 2
= 4 + 1 + 1 + 36 + 4
= 46
( x − x )2
46
SD = = = 11.5 = 3.391
n −1 4
SD 3.391
SE = = = 1.5179
n 5
x 4
tc = = = 2.6352 df = n − 1 = 4
SE 1.5179
but t 0.5 = 2.78
tc t 0.5
Hence not significant
Z test (Normal test)
• Similar to ‘t’ test in all aspect except that the sample size should be >
30
• In case of normal distribution, the tabulated value of Z at -
Sum of squares S2 Si Sk
T1
Mean of values x1 = x2 xi xk
n
ANOVA table
2
k − 1 i (xi − x ) = i xi2 − 2 i (xi − x )
I Between 2 T 2
S B2
(k − 1, N − k )
N SB =
Groups
k −1
SW2
II With in Ti 2
groups n − k (x − xi )
2 T
= i j xij2 −i i 2
i j ij i ni
2
x 2
−
ni SW =
i j ij
N −k
2
III Total T x −
2 T
n − 1 (x − x ) = i j xij2 −
2
i j ij
2
ST2 =
i j ij N
N N −1
Example- see whether there is a difference in number of patients
seen in a given period by practitioners in three group practice
Practice A B C
Individual values 268 387 161
349 264 346
328 423 324
209 254 293
292 239
Calculate
No of observations (n) 5 4 5
Sum of x values 1441 1328 1363
Sum of squares 426899 462910 393583
Mean of values 288.2 332.0 272.6
Between group sum of squares
=
( x ) ( x ) ( x ) ( x + x + x )
+A
2
+ − B
2
C
2
A B C
2
nA nB nC n A + nB + nC
= 8215.71
n A + nB + nC
A B C
= 63861.71
2 =
(O − E ) 2 O – observed frequency
E – expected frequency
E
• Steps
1. State null & alternate hypothesis
2. Make contingency table of the data
( )
r c frequency by
3. Determine expected
rc
E=
N (total frequency
4. Calculate chi-square of each by-)
2
=
(O − E)
2
E
5. calculate degree of freedom
df = (c − 1)(r − 1)
6. Sum all the chi-square of each cell – this gives
chi-square value of the data
=
2 (O − E )2
E
7. Compare the calculated value with the table
value at any LOS
8. Draw conclusions
• Chi square test only tells the presence or absence of
association
n1 (n1 + 1)
U = n1n2 + − R1 or R2
2
Where, n1n2 are sample sizes
R1 R2 are sum of ranks assigned to I & II group
Comparison of birth weights of children born to 15 non
smokers with those of children born to 14 heavy smokers
NS 3.9 3.7 3.6 3.7 3.2 4.2 4.0 3.6 3.8 3.3 4.1 3.2 3.5 3.5 2.7
HS 3.1 2.8 2.9 3.2 3.8 3.5 3.2 2.7 3.6 3.7 3.6 2.3 2.3 3.6
Ranks assignments
R1 26 23 16 21 8 29 27 17 24 12 28 10 15 13 03
R2 7 5 6 11 25 14 9 4 20 22 19 2 1 18
Sum of R1= 272 and Sum of R2=163
Difference T=R1 – R2 is 109