Professional Documents
Culture Documents
Assessing Data Quality - Bias (SR)
Assessing Data Quality - Bias (SR)
Bias
Bi
D St
Dr. h i R
Stephanie Rollll
Institute for Social Medicine, Epidemiology and Health Economics
Charité University Medical Center, Berlin, Germany
Learning objectives
• Assessing the quality of study data
Study question
What is the prevalence of
tuberculosis (TB) in Cambodia?
The perfect study
• Includes all Cambodians (ca. 14 mil.).
• Assesses TB status objectively,
j y equally,
q y at same
time.
Result (invented)
Persons with TB: 140 000 (1%)
What happened?
prevalence of TB: 2%?
What happened?
• Only 32 of 50 villages participated.
Village participated
„selection bias“
Bias (systematic error)
1) Is the sample
representative?
studyy sample
p
Selection bias II
total population
Group A
studyy sample
p
Group B
3 possible explanations for a
result
Association between exposure and outcome
measurement bias
regression to mean
• selection
l ti off cases: patients
ti t ini hospital
h it l
A good idea?
a) yes
b) no
Selection bias in case-control
study
Association of mobile phone use and brain tumors
no phone cases
phone (brain tumor)
controls
phone
(no brain tumor)
Attrition bias
diarrhea diarrhea
(N=60, mean age 8.6 y) (N=85, mean age 6.5 y)
Two main types of bias
with many categories
non-respondent
d t bi
bias self referral bias
giving consent bias .... many more...
sampling bias
selection bias missing data bias
attrition bias
lost to follow up bias
measurement bias
regression to mean
Differential
• iff misclassification
f off exposure is related to disease
• if misclassification of disease is related to exposure
effect:
ff t bias
bi can go in i either
ith direction
di ti from
f the
th null;
ll it can
inflate or attenuate your effect estimates (OR and RR)
Diagnostic suspicion bias
K
Knowledge
l d about
b t subject‘s
bj t‘ exposure leads
l d tto more
thorough
g search for the outcome than for an
unexposed individual.
Exposed subjects are more likely to have the
disease diagnosed than the nonexposed
nonexposed.
• Study Design
• Study Execution
• Data collecion
All stages!
• Data analysis
• Interpretation of Results
• Publication
Biases in
in...
- Foreign language
• Literature Review
exclusion bias
• Study Design - Literature search bias
- One
One-sided
sided reference bias
• Study Execution - Rhetoric bias
• Data collecion
• Data analysis
• Interpretation of Results
• Publication
Selection bias
Biases in
in...
Sampling frame bias
Berkson (admission rate) bias
• Literature Review Centripetal bias
Diagnostic access bias
Diagnostic purity bias
• Study Design Hospital access bias
Migrator bias
• Study Design
- wrong control bias
• Study Execution - contamination bias
(controls also receive
• Data collecion treatment/are exposed)
• Interpretation of Results
• Publication
Biases in
in...
- Instrument bias
C
Case d fi iti bi
definition bias
Diagnostic vogue bias
Forced choice bias
• Literature Review Framing bias
Insensitive
I iti measure bias bi
Juxtaposed scale bias
• Study Design Laboratory data bias
Questionnaire bias
S l fformatt bias
Scale bi
• Study Execution Sensitive question bias
Stage bias
Unacceptability bias
U d l i /
Underlying/contributing
t ib ti cause off d
death
th bias
bi
• Data collecion Voluntary reporting bias
- Data source bias
Competing death bias
• Data analysis Famil history
Family histor bias
Hospital discharge bias
Spatial bias
- Observer bias
• Interpretation of Results Diagnostic suspicion bias
Exposure suspicion bias
Expectation bias
• Publication Interviewer bias
Therapeutic personality bias
Biases in
in...
- Subject bias
Apprehension bias
Attention bias (Hawthorne effect)
• Literature Review Culture bias
End-aversion bias
(end-of-scale/central tendency bias)
Faking bad bias
• Study Design Faking good bias
Family information bias
Interview setting bias
• Study Execution Obsequiousness bias
Positive satisfaction bias
Proxy respondent bias
- Recall bias
• Data collecion Reporting bias
Response fatigue bias
Unacceptable disease bias
• Data analysis Unacceptable exposure bias
Underlying cause (rumination bias)
Yes-saying bias
• Interpretation of Results - Data handling bias
Data capture error
Data entry bias
Data merging error
• Publication Digit preference bias
Record linkage bias
- Confounding bias
Biases in
in... Latency bias
Multiple exposure bias
• Literature Review Nonrandom sampling bias
Standard population bias
• Study Design Spectrum bias
- Analysis
y strategy
gy bias
• Study Execution Distribution assumption bias
Enquiry unit bias
• Data collecion E ti t bias
Estimator bi
Missing data handling bias
Outlier handling bias
• Data analysis
Overmatching bias
Scale degradation bias
• Interpretation of Results - Post
P t hoc
h analysis
l i bibias
Data dredging bias
• Publication Post hoc significance
g bias
Repeated peeks bias
Biases in
in...
• Literature Review
• Study Design
• Study Execution
• Study Design
• Study Execution
• Data collecion
• Data analysis
Possible reasons
• Variability of chosen sample from underlying
population
• Outcome or risk factor incorrectly assessed
(independent of group)
How to deal with random errors?
• Use
U bibig sample
l size
i
e) get bigger