Professional Documents
Culture Documents
Study Designs
Study Designs
1
Study design
2
Types of Epidemiologic design strategies
3
1. Descriptive Epidemiology
4
Time
• Information organized by time easily shows the trend of the
disease over time and establishes the usual occurrence of the
disease in the population which is essential in identifying excess
occurrence (epidemics).
• It can also be used to predict seasonal and secular (long-term)
trends.
5
Cont….
6
Place
7
Cont….
9
Types of descriptive studies
Correlational /ecological
Cross-sectional
10
A. Case Report
11
Example
12
B. Case Series
13
Example
14
Strength
16
Cont…
3. Detailed and complete risk factor information is difficult to
obtain for all cases from records.
4. Studies are prone to atomistic fallacy the forces that cause
or prevent disease at an individual level are different from
those that work at societal level.
17
Example
18
C. Correlational or Ecological
19
Cont…
Cause could not be ascertained/established.
Correlation coefficient (r) is the measure of association
in Correlational studies.
It is important to note that positive association does
not necessarily imply a valid statistical association.
20
Cont…
21
Examples
Hypertension rates and average per capita salt consumption
compared between two communities.
Alemu et.al., Climatic variables and malaria transmission dynamics
in Jimma town. Parasites & Vectors 2011, 4:30
A study was initiated to see the relationship between
meteorological factors (monthly minimum, maximum and mean
temperature, total rainfall and relative humidity) and malaria case
occurrence over the last decade in Jimma town, Southwest Ethiopia.
22
strength
Can be done quickly and inexpensively, often using available
data.
Limitation
1. Inability to link exposure with disease
Data on exposure and outcome are not linked at the
individual level; association found with aggregate data may
not apply to individuals (this is referred as ecological
fallacy/Bias).
23
Cont…
24
Cont…
2. Lack of ability to control for effects of potential
confounding factors.
There may be other things that are the true cause.
• For example, often people with high salt consumption
also have high meat consumption.
25
D. Cross Sectional Studies
26
Cross-sectional Design
factor present
No Disease
factor absent
Study
population
factor present
Disease
factor absent
time
28
Cont…
29
Cont…
30
Types of cross-sectional studies
1. Single cross-sectional studies
• Determine single proportion or mean in a single population at a
single point in time.
2. Comparative cross-sectional studies
• Determine two proportions or means in two populations at a
single point in time
3. Time series cross-sectional studies
• Determine a single proportion or mean in a single population at
multiple points in time
31
Comparative cross-sectional studies
32
Cont…
You can independently fit the model for each groups to see
the effect of factors
You can fit the model to assess the overall factors of outcome
variable
33
Advantages
34
Limitation
1. “Chicken or egg" dilemma
35
Example
In the study of knowledge of modern contraceptive, and use
of contraception, you may show that women who know
about modern contraception are more likely to use it.
• So you may want to educate women about it, believing that
this will lead to higher rate of use.
• The problem is, did the women know about it and then start
to use it, or did they learn about it because they were using
it?
36
Cont…
• The temporal inference problem can be avoided if an inalterable
characteristic is the focus of the investigation.
37
3. Healthy worker survivor effect
• In studies conducted in occupational settings, because these studies
include only current and not former workers, the results may be
influenced by the selective departure of sick individuals from the
workforce.
Those who remain employed tend to be healthier than those who
leave employment.
This phenomenon known as the “healthy worker survivor effect,”
generally attenuates an adverse effect of an exposure
38
Analysis
Either compare prevalence rate of the outcome in exposed
Vs non-exposed, or
Compare prevalence rate of the exposure in those with and
without the outcome
Timing of the subdivision of the study population into
comparison groups distinguishes cross-sectional studies
from other observational analytic studies
39
Cont…
40
Group Exercise
41
2. Analytic Studies
• This session gives overview of the purposes of analytic
epidemiology, the common features of analytic studies and
the types of analytic studies.
42
Purposes of analytic epidemiology
43
We use analytic epidemiology
• To quantify the association between exposures and
outcomes and
• To test hypotheses about causal relationships.
44
Analytic studies are broadly classified in to two types of
studies to understand causes and effects:
1. Observational studies Both types use
• Case control study "control group", the
use of control group
• Cohort study is the main
• Cross sectional study distinguishing feature
of analytic studies
2. Experimental studies/
interventional
45
2.1. Observational studies
46
A. Case-control study design
Definition
• A case-control study is one in which persons with a
condition ("cases") and suitable comparison subjects
("controls") are identified, and then the two groups are
compared with respect to prior exposure.
Subjects are sampled by their outcome status.
47
Overview of design
49
Selection of cases
• Set clear definition of cases
Depending on the certainty of the diagnosis, and the
amount of information available, it is often useful to
perform analyses separately for cases classified as definite,
probable, or possible.
50 groups.
Incident Vs Prevalent cases
Prevalent Cases
• Increase sample size available for rare disease
• More feasible
• When records are available they can be conducted using
secondary data alone.
Due to their feasibility, they are the more commonly used ones.
BUT
• Difficult to establish temporal sequence between exposure and
outcome – reverse causation.
E.g. Physical inactivity and CHD
51
Incident cases
•Helpful to establish temporal relationship between exposure
and outcome.
control study
53
General population
• Referred as population-based case control study
• Involves locating and obtaining data from all affected
individuals or a random sample from a defined population
• It avoids bias arising from whatever selection factors lead
affected individual to utilize a particular health care
facility or physician
54
Cont…
• Allows the description of the entire picture of the disease in
that population
• Are not routinely used because of the logistic and cost
considerations
55
Selection of controls
Considerations:
• Avoiding selection bias.
• Avoiding information (‘recall’) bias
There is no control group that is optimal for all situations.
56
Sources of controls
A. Hospital Controls
Advantages
• Easily identified and readily available in sufficient number
with reduced cost.
• More likely than healthy individuals to be aware of
antecedent exposures or events (minimize recall bias).
57
Cont…
58
Disadvantages
59
Cont…
• Danger of altering the direction of association or masking a
true association between exposure and outcome of interest
For example, in studying the association of cigarette
smoking and lung Cancer, individuals with other
respiratory illnesses could not be taken as controls, since
smoking is also known to have some association with
other respiratory illnesses.
60
B. General population controls
Advantages:
• Generalizable
• Good when cases are selected to represent affected
individuals in a defined population.
61
Disadvantages:
• Costly and time consuming
• Recall bias - controls may not recall exposures with the
same level of accuracy.
• People might be less motivated to participate for the same
reason given above, which increases non-response rate, i.e.,
selection bias.
62
C. Special controls
63
Advantages
- They are healthy.
- More likely to be cooperative than members of the general
population, because of their interest in the cases.
- Offer a degree of control over some confounding factors,
such as ethnicity, socioeconomic status, or environment
64
Disadvantage/limitations
- If the study factor is likely to be similar to the cases, an
underestimate of the true effect of the exposure of interest
may result.
- E.g. if the study factor is diet, it will be similar for both
cases and controls, if controls are siblings.
65
Case control: numbers and ratio
66
Ascertainment of disease and exposure status
67
Sources of information for disease status
• Review of death certificates, case registries that maintain
ongoing surveillance.
• Office records of physicians
• Hospital admission or discharge records
• Pathology department log books
68
Sources of information about the exposure
• From study subjects themselves, by either interview or mail
questionnaire
• From a surrogate, such as spouses of participants or
mothers of children.
• From records (e.g medical records).
69
Strengths and weaknesses of case-control studies
Advantages
• Is relatively quick and inexpensive compared with other
analytic designs
• Is particularly well suited to the evaluation of diseases
with long latent periods
• Is optimal for the evaluation of rare diseases
• Can examine multiple etiologic factors for a single disease.
70
Disadvantages/limitations
• Inefficient for the evaluation of rare exposures
• Can not directly compute incidence rates of disease in
exposed and non exposed individuals, unless study is
population based
71
Cont…
72
Small group work
• Suppose you are interested to study whether khat chewing
causes depression.
• How will you conduct case control study?
73
B. Cohort studies
Learning Objectives
74
Defi nition
75
Overview of design
• Subjects are selected by exposure, or determinants of
interest from a given population, and followed to see if they
develop the disease or outcome of interest.
• Then, the subsequent development of outcome assessed
and the rate of outcome is compared between the exposed
and non-exposed.
• The direction of inquiry about outcome is always forwards
in time
76
Cont…
• However, the actual data collection can be carried out in either
retrospective or prospective manner as described in the next
section which is about types of cohort studies
• The two groups should be free of the study outcome.
• One of the main functions of cohort study is to provide
information on the incidence and to describe natural history
of disease.
77
Figure 3.: overview of design of cohort studies
Source: LSI Training
78
Example
• Association between cigarette smoking and coronary heart disease
(CHD)
In this study, people who smoke cigarettes are considered as
exposed and those who do not smoke as non-exposed.
Both groups are followed for a period of time and compared
with regard to frequency of development of CHD.
The finding of higher frequency of CHD in smokers as
compared to non-smokers would suggest that smoking is
possibly a cause of CHD.
79
Types of cohort studies
80
Prospective cohort
• The relevant exposures may or may not have occurred at the
time the study is begun.
• At the beginning of the study the outcome has not yet
occurred.
• Participants must be followed into the future to assess
incidence rates of the disease.
- Regarded as more reliable than the retrospective, if the
sample size is large and follow-up complete.
81 Unless specified, cohort study refers to prospective cohort
Strengths
82
Limitations
• Is inefficient for the evaluation of rare diseases, unless the
attributable-risk percent is high.
• Extremely expensive and time consuming.
• Validity of the results can be seriously affected by loses to
follow-up.
83
Retrospective cohort
84
Advantages
85
Disadvantages
86
Selection of exposed group
Selection of exposed group should consider scientific and
feasibility issues which include:
-The frequency of the exposure of interest in the study
population.
-The need to obtain complete and accurate exposure and
outcome information on all study subjects. Example: the use
of physicians or nurses permits longer and fairly complete
follow up.
87
Cont…
88
Selection of controls
89
Exposure ascertainment
Advantages
• Can make available information for high proportion of cohort.
90
Disadvantages
- Information on exposure level may be insufficient.
- Such records frequently do not contain data on potential
confounding variables
91
2) Information supplied by the study subjects themselves
• Particularly useful for collecting information on exposures
that are not routinely recorded.
• A potential for bias always exists in the use of such data
since it cannot be obtained as objectively as from
preexisting records.
• Stigma associated with certain exposures may influence a
respondent’s answer
Disadvantages
92 • Potential for information bias, particularly recall.
3)Direct physical examination or testing
• For some exposures or characteristics direct physical
examination and/or blood testing may be necessary
• Provide an objective and unbiased means of classifying
study subjects with respect to exposure.
93
Outcome ascertainment
• With adequate consideration to the resources available for
the study, the aim is to obtain complete, comparable and
unbiased information on the subsequent health experience
of every study subject.
94
One or a combination of the following sources could be
used:
1. Death certificate
2. Hospital records
3. Directly from the study participants
-For those who report an event of interest, additional
information such as hospital records can be obtained to
confirm the diagnosis
95
Cont…
5. Autopsy records
96
Follow-up
97
Cont…
98
Analysis
99
Small group work
100
Measure of association
101
Objectives
102
Introduction
103
Why Estimate Comparisons?
• Overall rate of disease in an exposed group says nothing
about whether exposure is a risk factor for or causes a
disease.
• This can only be evaluated by comparing disease
occurrence in an exposed group to another group that is
usually not exposed.
• The latter group is usually called the comparison or
reference group.
104
Association
• Statistical relationship between exposure and disease.
• An association is said to exist between two variables when a
change in one variable parallels or coincides with a change in
another variable.
• Requires comparing two groups:
Exposed Vs Unexposed
Cases Vs non cases/controls.
105
Cont….
• Variables can be related or unrelated to one another.
106
Cont…
107
Cont…
108
Relative Risk [RR]
109
Cont…..
110
Table 1: a 2 by 2 table indicating findings of a cohort study
111
From the above table the RR is calculated as:
112
Example-1
Table 2: Data from a cohort study of oral contraceptive (OC) use and
bacteriuria among women aged 15-49 years
113
Calculate RR?
114
Interpretation
• RR=1
– Risk in exposed = risk in non-exposed
– No association
• RR>1
– Risk in exposed > risk in non-exposed
• RR<1
– Risk in exposed < risk in non-exposed
116
Guideline for strength of association
1.0 = No association
1.1-1.3 = Weak
1.4-1.7 = Mild
1.8-3.0 = Moderate
3.0-8.0 =Strong
117
Cont….
118
Exercise
119
Odds Ratio (OR)
121
Table 5: Indicating findings of case-control studies
122
Cont…
123
Cont...
124
Example-1: Data from a case-control study of current oral
contraceptive (OC) use and MI in pre-menopausal female
nurses (Table 3)
125
Interpretation: the odds of having MI is 1.6 times higher
among OCP users compared to that of the non OCP users
126
Exercise
127
Interpretation of OR
Odds ratio = 1
• The odds of disease in the exposed and non exposed is the same, No
association.
Odds ratio > 1
• The odds of disease is greater in the exposed
• Implies that the odds of exposure is x times higher among cases than
controls
• The factor may be a risk factor.
128
Cont…
129
Measure of impact
• Comparing disease occurrence among the exposed with the
disease occurrence among the unexposed comparison
group by subtracting one from the other.
• It is Absolute Comparisons
130
Gives information on:
– The absolute effect of exposure on disease occurrence
– The excess disease risk, or disease burden, in the
exposed group compared to the unexposed group
– The public health impact of an exposure, that is, how
much disease would be prevented if the exposure were
removed
131
Cont…
132
A. Attributable Risk(AR)
• More precisely called prevalence difference, cumulative incidence
difference, and incidence rate difference
• It is also known as risk difference or excess risk among exposed groups,
rate difference, attributable rate.
Quantifies the excess risk in the exposed that can be attributable to
the exposure.
134
Concept of Attributable risk
135
Fig 1.Pictorial illustration of AR calculation
136
Example: 1
Consider the hypothetical cohort study conducted to assess
association between malaria during pregnancy and low birth weight.
Let‘s calculate AR from findings of the study indicated in the table
below.
137
Cont…
138
B. Attributable Risk Percent (AR%)
139
Fig 2. Pictorial illustration of how to compute AR%
140
Example-1
141
Preventive Fraction
142
Example: vaccine efficacy
143
C. Population Attributable Risk (PAR)
144
Cont…
145
How to Calculate?
• PAR = It – Io or
• PAR = CIt – CIo
Where
• It= incidence rate in the total population
• Io =incidence rate in the non exposed
146
Population Attribuable risk fraction (PAR%)
147
Example -2: Fast driving and Automobile Deaths
148
Association and cause effect /Evaluation
of Evidences
149
Learning objectives
151
Judgment of causality
152
Alternative explanations for the observed
association other than cause-effect relationships:
153
To show a Valid Statistical Association
Rule out the following as alternative explanation:
– Chance (random error) : How likely is it that what we
found is a true finding
– Bias: Whether systematic error has been built into the
study design
– Confounding: Whether an extraneous factor is related to
both the disease and the exposure
154
Chance
To control/minimize chance:
1. Designing phase
o Increase the sample size (increase the power of the
study)
2. Analysis phase
o Hypothesis testing, P-value’s role
o Confidence interval determination
155
Role of confounding factors
o Confounding refers to the mixing of the effect of an
extraneous variable with the effect of the exposure and
disease of interest
o Can overestimate /underestimate the true association
156
Criteria for Confounding Factors
o Must be an independent predictor of disease with or
without exposure
o Must be associated (correlated) with exposure
157
Confounding
Smoking
158
Confounding
/mediator?
Diet CHD
Cholesterol
159
Evaluating & controlling of confounding
160
Role of confounding factors…
o Distinguish confounding from effect modification (interaction)
• By stratification analysis
o Distinguish confounding from mediation
161
Confounding
Imagine you have stratified your dataset for smoking status in the alcohol
- lung cancer association study. Would the odds ratios differ in the two
strata?
The alcohol association would yield the similar odds ratio in both strata
and would be close to unity. In confounding, the stratum-specific odds
ratios should be similar and different from the crude odds ratio by at
least 15%. Stratification is one way of identifying confounding at the time
of analysis.
Sex
163
Read more on
• Stratified analysis
• Matched analysis
164
Role of Bias
It is any systematic error in an epidemiologic study that
results in an incorrect estimate of the association between
exposure and outcome.
Bias may result the design, collection, recording, analysis or
interpretation of data
It describes an error arising from the design, execution or
analysis, interpretation and dissemination phase
o Two most common sources of bias are selection bias and
information bias
165
Role of Bias…
o We should minimize introduction of Bias by:
166
Establishing a Causal Association
168
Process of establishing causation
Develop Hypothesis 1. Developing a hypothesis
171
Cont…
2. Consistency of relationship;
172
3. Specificity of the association
o If a particular exposure increases the risk of a certain disease, but
does not increase the risk of other diseases, this may be taken as
evidence in favour of a cause–effect relationship (Germ theory)
Single Exposure Single Outcome
Plasmodium Malaria
HIV AIDS
173
4. Temporality
o Temporality refers to the necessity that the cause precede the onset of
the disease (effect) in time
o This is usually problematic in cross-sectional and case-control studies
Exposure Disease
174
Temporality…
o Any claim of causation must involve the cause preceding in
time the presumed effect
o Easier to establish in certain study designs
– Prospective cohort and experimental studies
Normal Cancer
lung
o This assumes that the more intense the exposure, the greater the risk
of disease development
o This intensity can be measured by dose, or by duration of exposure
o The risk of disease increases with increasing exposure to a causal agent
o Duration (frequency) of exposure to risk factor
o However, there may be threshold level to cause disease
E.g. Cigarette smoking dose response
176
Dose-response relationship…
Dose-response (‘biological gradient’)
– the relationship between the amount of exposure (dose) to a substance
and the resulting changes in outcome (response)
If an increase in the level of exposure increases the risk of the
outcome
– this strengthens the argument for causality
R
R I
R I S
I K
R
S
S
I
S
K
K K
0 cigs/day < 5 cigs/day 5 - 20 cigs/day > 20 cigs/day
177
Cont…
6. Biological Plausibility:
Hypothesis should be coherent with what is known about
the disease; both biologically and using laboratory.
Knowledge about physiology, biology and pathology
should support the cause-effect relationship
178
Cont…
7. Study design;
It is most important to consider.
179
Cont…
8. Reversibility/Experimental evidence
Removal of a possible cause results in a reduced disease
risk
180
causality.
Cont…
Judging the evidence
There are no completely reliable criteria for determining whether
an association is causal or not.
In judging the different aspects of causation,
The correct temporal relationship is essential,
• Once this has been found, weight should be given to
– Plausibility,
– Consistency, and
– dose-response relationship
181
THANK YOU!
182