Professional Documents
Culture Documents
Week 3 Selecting Proper Stat Tests
Week 3 Selecting Proper Stat Tests
Overview:
60 A
A
A AA A
Cognitive
A
50 A
A A A
A A
function
A
A A A
A
AA AA
A AA
score at 3 A
A A
A A AA
A A
Pearson’s Correlation
40
P=0.10 (NS)
A
months
AA
AA A
A
after ICU
A A
A
A
P=0.03 (Significant)
20 A
10
0 250 500
Biomarker (S100)
1
MED101x
Introduction to Applied Biostatistics
Pearson’s Correlation
P=0.539 (NS)
NIH Research Funds ($ billions)
Student’s T-test
P=0.405 (NS)
Total delirium days
Mann-Whitney U test /
2
Wilcoxon rank sum test
P=0.012 (Significant)
Absen t Presen t
APO-E4
APO-E
2
MED101x
Introduction to Applied Biostatistics
What should we think about a doctor who uses the wrong treatment,
either willfully or through ignorance, or who uses the right treatment
wrongly (such as by giving the wrong dose of a drug)? Most people
would agree that such behavior was unprofessional, arguably
unethical, and certainly unacceptable.
What then would we think about researchers who use the wrong
techniques (either willfully or in ignorance), use the right techniques
wrongly, misinterpret their results, report their results selectively, cite
the literature selectively, and draw unjustified conclusions? We
should be appalled. Yet numerous studies of the medical literature,
in both general and specialist journals, have shown that all of the
above phenomena are common. This is surely a scandal.
3
MED101x
Introduction to Applied Biostatistics
Q1 Q2 Q3
D ependent
(paired) C ontinuous ( Q4,5
N on-norm al)
O rdered categorical
/ Q6
2
>2
Q7
M ixed effect R egression
W ilcoxon signed-rank test
Friedm an test
N om inal 2 M cN em ar's test
C ontinuous ( N orm al) P earson's correlation (r)
C orrelation C ontinuous ( N on-norm al)/ordered S pearm an's correlation (rs)
N om inal(2 levels) 2 S pearm an/Kappa (A grreem ent)
C ontinuous ( N orm alresidulas) Linear R egression
*
Indepdende C ontinuous ( N on-norm alresidulas) Linear R egression
nt O rdered categorical O rdered Logistic R egression
(un-paired) N om inal (2 levels) B inary Logistic R egression
(>2) M ultinom ial Logistic R egression
M ultivariable
Tim e to Event C ox P roportional H azard R egression
C ontinuous ( N orm alresidulas) Linear M ixed Effect R egression
D ependent C ontinuous ( N on-norm alresidulas) Linear M ixed Effect R egression*
(paired) O rdered categorical G eneralized Estim ation Equation (G EE)
N om inal(2 levels) G eneralized Estim ation Equation (G EE)
*Transform outcom e variables for norm alizing residuals
Created based on Publishing Your Medical Research Paper, by Daniel Byrne, Williams and Wilkins (1998)
Q2,
Q3, Q4, Q5
Q1,Univariate Difference Q6, No. of Q7,sampl
Paired /
/Mutivariable /Correlatio Type of outcome (Normality) groups e size Valid Tests
related
n
Continuous (Normal) 2 Student's t-test
>2 One-way ANOVA
Continuous (Non-normal)/ 2 Mann-Whitney U test
Indepdende
Ordered categorical >2 Kruskal-Wallis H test
nt
(un-paired) Nominal 2 <20 Fisher's exact test
≧2 ≧20 Chi-square test
Difference Time to Event Log-Rank test(Kaplan-Meier plot)
Created based on Publishing Your Medical Research Paper, by Daniel Byrne, Williams and Wilkins (1998)
4
MED101x
Introduction to Applied Biostatistics
Question 1 – Univariate?
Which type of test do you need:
Univariate or Multivariable?
5
MED101x
Introduction to Applied Biostatistics
Population?
Patients with an echo for possible coronary disease
Exposure?
Use of Aspirin at the baseline visit
Control?
No use of Aspirin at the baseline visit
Outcome?
Long term mortality
(median FU of 3.1 years)
Results
6
MED101x
Introduction to Applied Biostatistics
7
MED101x
Introduction to Applied Biostatistics
Question 2 -Difference?
• Do you want to test for a difference
between groups or want to test for
correlation between variables?
60 A
A
A AA A
Cognitive
A
50 A
A A A
A A
function
A
A A A
A
AA AA
A AA
score at 3 A
A A
A A AA
A A
Pearson’s Correlation
40
P=0.10 (Not Significant)
A
months
AA
AA A
A
after ICU
A A
A
A
P=0.03 (Significant)
20 A
10
0 250 500
Biomarker (S100)
8
MED101x
Introduction to Applied Biostatistics
Comparing Difference
6 Examples:
Student’s T-test
Comparing 2 means
Total delirium days
Mann-Whitney U test /
2
Wilcoxon rank sum test
Comparing 2 medians
0
Absen t Presen t
APO-E4
APO-E
Question 3 - Paired?
• Were the groups paired or unpaired /
(dependent or independent)?
Examples:
Student t-test comparing 2 independent means.
(Comparing outcome between intervention and control groups)
9
MED101x
Introduction to Applied Biostatistics
Independent VS
Paired VS
- Discrete/Categorical/Factor?
-Nominal? 2 levels (Binary, dichotomous) ex. Died / Survived
>2 levels. Ex. Disease Type (cancer, DM, cardiovascular)
-Ordinal?
> 2 levels. Ex. Disease severity (1: Mild, 2: Moderate, 3: Severe)
Disease score (0: normal, 10: abnormal)
10
MED101x
Introduction to Applied Biostatistics
Question 5 – Normality?
80.0 80.0
Count
Count
40.0 40.0
0.0 0.0
0.0 0.5 0.9 1.4 1.0 2.0 3.0 4.0
log_Valsal VALSAL_1
6
6
] 5
4 4
] 3
2 2
0 Normal
Non-normal
0
Use Parametric
Use Non-parametric
Absent Present Absent Present
APO-E4 APO-E4
11
MED101x
Introduction to Applied Biostatistics
APO-E4 Absent
Absent
APO-E4 Present
Present
30
Count
20
10
0
0 10 20 30 0 10 20 30
12
MED101x
Introduction to Applied Biostatistics
Question 6 - #groups?
• How many groups are there for the
independent (predictor) variable?
- 2 levels ?
- 3 or More?
Examples:
Student t-test comparing 2 group means
ANOVA comparing 3 or more group means
13
MED101x
Introduction to Applied Biostatistics
Examples:
Greater than total N=20, use Chi-square test
Greater than 20 and less than 40 and
an expected # in a cell < 5,
use Fisher’s exact test
Observed Expected
died survived died survived
Control Control
14
MED101x
Introduction to Applied Biostatistics
Selection of Regression
15
MED101x
Introduction to Applied Biostatistics
Example 1
• Comparing ventilator free days between patients
who were randomized to daily awakening and
breathing trial vs daily breathing trial among
ventilated patients in medical ICU: A prospective
randomized study.
Q1: (Univariate?) Univariate Mutivariable Linear regression
Q2: (Difference?) Difference
Q3: (Paired?) Unpaired
Q4: (Type?) Continuous
Q5: (Normality?) Normal
Non-Normal
Q6: (#groups?) 2
Q7: (sample size?) > 30 in each group
16
MED101x
Introduction to Applied Biostatistics
Example 2
• Cytokine responses of peripheral blood mononuclear cells (PBMC)
from HIVseronegative adults with prior extra pulmonary TB were
compared with responses from persons with prior pulmonary
tuberculosis and latent M. tuberculosis infection in a case-control
study. Antas, Journal of Allergy and Clinical Immunology. 2006.
Kruskal-Wallis H Test
1-way ANOVA
Example 3
• We want to estimate the relationship between two
numerical measures: Bio-marker value for S100 and
patient’s cognitive scores measured at 3 months after
ICU discharge among patients in medical ICU.
Q1: (Univariate?) Univariate Multivariable Linear regression
Q2: (Difference?) Correlation
Q3: (Paired?) NA
Q4: (Type?) Continuous
Q5: (Normality?) Normal Non-Normal
Q6: (#groups?) 1 group
Q7: (sample size?) > 30 in each group
Pearson’s r Spearman’s ρ
Correlation coeffient Rank Correlation coefficient
17
MED101x
Introduction to Applied Biostatistics
Example 4
• Martinez-Picado et. al. compared proportion of patients with
HIV infection who had viral surge between alternation of
antiretroviral drug regimens and standard regimens. A
Randomized, Controlled Trial. Annals of Internal Medicine.
2003
Q1: (Univariate?) Univariate Multivariable Logistic regression
Q2: (Difference?) Difference
Q3: (Paired?) Unpaired
Q4: (Type?) Nominal
Q5: (Normality?) NA
Q6: (#groups?) 2
Q7: (sample size?) > 20 < 20
Example 5
• A researcher wants to evaluate the effect of a new diet on
weight loss by comparing patient’s weight before and after
the diet program.
18