Cohort Assignment

Cohort Assignment 5: Effect Measure Modification
(100 Points)
Recommended Reading
Rothman, Greenland and Lash (Modern Epidemiology, 4th Edition page numbers in parentheses): 71-
83, 298-300 (619-49)
Rothman, “Measuring Interaction” 198-211
Ward JB, Gartner DR, Keyes KM, Fliss MD, McClure ES, Robinson WR. How do we assess a racial
disparity in health? Distribution, interaction, and interpretation in epidemiological studies. Annals of
Epidemiology. 2019;29:1-7. doi:10.1016/j.annepidem.2018.09.007
Resources: Output covariance matrix and parameter estimates from PROC GEMMOD (to estimate
ICR variance)
Introduction
For this assignment, you will conduct an analysis relevant to the second specific aim of our cohort
study, which is to determine whether the estimated association between early prenatal care and
preterm birth differs among major racial/ethnic populations that might be targeted for state funded
early prenatal care programs.
For the purposes of this assignment, race will be dichotomized as African American (AA, the index
category) or non-AA (the reference category, including all NC birth certificate race categories other
than AA combined) using a new variable, raceaa, created from race2:
raceaa = 0 if non-AA (White and other combined)

raceaa = 1 if AA
Notation and terminology

Values of pnc5 and raceaa are represented by subscripts i and j, respectively. There are four
possible combinations of the two variables, as shown below. Births with reference values for both
variables are “jointly unexposed”, and births with index values for both variables are “jointly exposed”.
Four risks (Rij) may be estimated for each combination of pnc5 and raceaa based on model 1:
pnc5 raceaa Rij Estimated value
0 0 no early care, non-AA R00 0 doubly unexposed
1 0 had early care, non-AA R10 0 + 1 exposed to care only
0 1 no early care, AA R01 0 + 2 “exposed” to race only
1 1 had early care, AA R11 0 + 1 + 2 doubly exposed
Risk difference homogeneity = additive risks (when averaged across the population)
A linear risk model with two dichotomous covariates and no interaction terms forces the estimated risk
associated with exposure to both covariates to equal the sum of the risks associated with each
exposure alone, as shown for model 1 below (where pnc5 indicates early prenatal care status and
raceaa indicates race):
Risk(preterm) = 0 + 1(pnc5) + 2(raceaa) model 1
This model forces risk difference homogeneity, such that the estimated RD for early care vs. no early
care for non-AA births (RD10 vs. 00) is identical to RD for early care vs. no early care among African
Americans (RD11 vs. 01):
1
RD10 vs. 00 = (0 + 1(1) + 2(0)) – (0 + 1(0) + 2(0)) = 1
RD11 vs. 01 = (0 + 1(1) + 2(1)) – (0 + 1(0) + 2(1)) = 1
Similarly, the RD for AA vs. non-AA race is identical for births with and without early care:
RD01 vs. 00 = (0 + 1(0) + 2(1)) – (0 + 1(0) + 2(0)) = 2
RD11 vs. 10 = (0 + 1(1) + 2(1)) – (0 + 1(1) + 2(0)) = 2
In addition, the RD contrasting the 17-week risk of preterm for “jointly exposed” births (had early care
and AA) and “jointly unexposed” (no early care, not AA) is equal to the sum of the average risks
associated with exposure to each individual exposure alone:
RD11 vs. 00 = (0 + 1(1) + 2(1)) – (0 + 1(0) + 2(0)) = 1 + 2
Observed versus expected risks

The estimated average risk of preterm associated with “joint exposure” to early care and AA race (R 11)
from model 1 may be written in terms of estimated risks for the other 3 exposure strata:
R11 = 0 + 1 + 2
= (0 + 1) + (0 + 2) - 0
= R10 + R01 – R00
This relation will always hold for model 1 because model 1 does not allow departure from additive
risks (i.e., it forces RD homogeneity). To assess whether or not the assumption of additive risks is
valid we must fit a second model that allows departures from additivity (i.e., RD heterogeneity), for
example:
Risk(preterm) = 0 + 1(pnc5) + 2(raceaa) + 3(pnc5*raceaa) model 2
As with model 1, 4 risks may be estimated:

pnc5 raceaa Rij Estimated value
0 0 no early care, non-AA R00 0
1 0 had early care, non-AA R10 0 + 1
0 1 no early care, AA R01 0 + 2
1 1 had early care, AA R11 0 + 1 + 2 + 3
In contrast with model 1, model 2 allows the estimated average risk in the “jointly exposed” (0 + 1 +
2 + 3) to be more or less than the sum of the average risks estimated in association with exposure to
each factor individually. In addition, it allows RD to depart from the homogeneity assumption:
RD10 vs. 00 = 1
RD11 vs. 01 = 1+ 3
Therefore, the coefficient for the model 2 product interaction term (3) indicates the extent to which
observed average risks in the jointly exposed differ from the average risks expected assuming
additivity. In other words, it is an estimate of the extent to which the observed RD 11 vs. 00 departs from
the RD11 vs. 00 expected under homogeneity.
If the homogeneity assumption is correct, 3 = 0 so that

R11 = 0 + 1 + 2 + 3
= 0 + 1 + 2 + 0
= (0 + 1) + (0 + 2) - 0
= R10 + R01 – R00
and
RD10 vs. 00 = RD11 vs. 01
2
Therefore, the expected value of R11 under the assumption of RD homogeneity can be written as
expected(R11) = R10 + R01 – R00
so that when the homogeneity assumption is true,

R11 = expected(R11) = R10 + R01 – R00
This relation also can be written in terms of RD with a common reference group (R00) and the
corresponding model 2 coefficients:
expected(R11 – R00) = (R10 – R00) + (R01 – R00) - (R00 – R00)
expected(RD11 vs. 00) = RD10 vs. 00 + RD01 vs. 00
= 1 + 2
Estimating RD modification using the Interaction Contrast (IC)
The interaction contrast (IC) is a measure of the difference between the observed R 11 estimate and
the R11 expected under the assumption of additive risks:
IC = R11 - expected(R11)
= R11 – (R10 + R01 – R00)
= R11 + R00 - R10 - R01
where IC = 0 when the estimated risk associated with joint exposure is equal to the sum of the
estimated “independent” risks for each exposure when averaged across the population. The IC also
can be written in terms of RD:
IC = RD11 vs 00 - expected(RD11 vs 00)
= RD11 vs. 00 - (RD10 vs. 00 + RD01 vs. 00)
Rewriting the IC in terms of the coefficients from model 2 we can also see that:
IC = (0 + 1 + 2 + 3) + (0) - (0 + 1) - (0 + 2)
IC = 3
Important point: The IC will be <0 or >0 when the average estimated risk associated with joint
exposure differs from the sum of the average risks for the “independent” exposures (i.e., the
independent risks are “non-additive”). IC= 0 when there is no RD modification, but IC = 0 does not
indicate the absence of causal interaction because we only observe “net” effects of synergism and
antagonism averaged across the population. Therefore, IC = 0 also can occur when the sum of sub-
additive risks among response types susceptible to antagonistic causal interactions is equivalent to
the sum of super-additive risks among response types that are susceptible to synergistic causal
interactions (i.e the response types cancel each other out; see Rothman, Greenland & Lash for
additional information on this topic).
Interpreting Interaction Contrasts

When the independent effect estimates for two exposures are in opposite directions (as is the case
with early care and AA race/ethnicity, it is easiest to describe RD modification of one exposure (e.g.,
our “main” exposure early prenatal care versus no early care) by a second exposure (e.g., the “effect
modifier” AA versus non-AA race/ethnicity) :
For a “main exposure” that is positively associated with the outcome (main effects RD >0):
 IC < 0 indicates a net reduction in the positive association with joint exposure to the effect
modifier (antagonism).
 IC > 0 indicates a net increase in the positive association with joint exposure to the effect
modifier (synergism).
For a “main exposure” that is inversely associated with the outcome (main effects RD <0):
 IC < 0 indicates a net increase in the inverse association with joint exposure to the effect
modifier (synergism).
3
 IC > 0 indicates a net reduction in the inverse association with joint exposure to the effect
modifier (antagonism).
For example, an estimated IC <0 for early prenatal care vs. no early care (an exposure that is
inversely associated with preterm) and AA vs. no-AA race/ethnicity (which is positively associated with
preterm birth) would suggest the inverse (negative) association of early prenatal care on preterm was
increased (i.e., was more negative) in association with AA race/ethnicity than non-AA race/ethnicity.
 Conversely, an estimated IC<0 would also indicate that the positive association between AA
versus non-AA race/ethnicity and preterm was reduced (i.e., the RD was closer to 0 or possibly
less than zero) in association with early prenatal care versus no early care.
Estimating RD modification using Interaction Contrast Ratios (ICRs)

Interaction contrast ratios based on ratio effect estimate measures (RR and OR) may also be used to
assess departures from the assumption of additive risks; therefore, RD modification can be evaluated
even when risks cannot be estimated (e.g., using case-control study data.)
As noted previously:
expected(R11) = R10 + R01 – R00
This statement can be written in terms of RR by dividing through by R00
expected(R11/R00) = R10/R00 + R01/R00 – R00/R00

expected(RR11 vs. 00) = RR10 vs. 00 + RR01 vs. 00 - 1
The ICR is used to estimate the difference between the observed RR11 vs. 00 and the RR11 vs. 00 expected
under the assumption of RD homogeneity:
ICR = RR11 vs. 00 - expected(RR11 vs. 00)

= RR11 vs. 00 - (RR10 vs. 00 + RR01 vs. 00 – 1)
= RR11 vs. 00 - RR10 vs. 00 - RR01 vs. 00 + 1
The ICR can also be written as:

= IC / R00
In addition, when the OR is a valid estimate of the RR:
exp(OR11 vs. 00) = OR10 vs. 00 + OR01 vs. 00 - 1

ICR = OR11 vs. 00 - expected(OR11 vs. 00)
= OR11 vs. 00 - OR10 vs. 00 - OR01 vs. 00 + 1
A. Contingency Table Analyses

Joint and independent effect measures estimated for pnc5 and raceaa using a 4 x 2 contingency
table will be equivalent to estimates generated by a model that does not force homogeneity, since the
risk of preterm in each exposure stratum is independent of the risks in the other exposure strata. In
other words, contingency table analyses do not force homogeneity.
A1. Frequencies, risks, expected risks and IC

Enter the following where indicated in or below table A1. For simplicity, you do not need to estimate
95% CI.
 The number of preterm births and the total number of births in each of the four exposure
strata defined by pnc5 and raceaa
4
 Estimated risks for each combination of pnc5 and raceaa
 The expected R11 assuming additive “independent” risks
expected(R11) = R10 + R01 – R00
 The IC based on the estimated risks
IC = R11 - expected(R11)
A2. Common Referent Estimates

Enter the following where indicated in table A2. For simplicity, you do not need to estimate 95% CI.
Note: this step is best performed in Excel or by hand.
 Risk differences (RD), risk ratios (RR) and incidence odds ratios (OR) for each combination of
pnc5 and raceaa relative to the “jointly unexposed” (no early prenatal care, non-AA)
 Expected joint RD11 vs. 00, RR11 vs. 00 and OR11 vs. 00 assuming no risk difference modification
expected(RD11 vs. 00) = RD10 vs. 00 + RD01 vs. 00

expected(RR11 vs. 00) = RR10 vs. 00 + RR01 vs. 00 – 1
expected(OR11 vs. 00) = OR10 vs. 00 + OR01 vs. 00 - 1
 The interaction contrast (IC) based on estimated RD and interaction contrast ratio (ICR)
based on RR, and OR
ICRD = RD11 vs 00 - expected(RD11 vs 00)

ICRRR = RR11 vs. 00 - expected(RR11 vs. 00)
ICROR = OR11 vs. 00 - expected(OR11 vs. 00)
A3. Stratum-Specific Estimates

Enter the following where indicated in table A3. For simplicity, you do not need to estimate 95% CI.
 RD, RR and OR for early care vs. no early care among non-African Americans (e.g. RD10 vs. 00)
and early care vs. no early care among African Americans (e.g. RD11 vs. 01)
 RD, RR and OR for AA vs. non-AA race among those with early care (e.g. RD 11 vs. 10) and AA
vs. non-AA race among those without early prenatal care (e.g. RD01 vs. 00).
B. Interaction term models and RD modification
B1. Create a product interaction term (multiplicative variable) pncXrace where

pncXrace = pnc5 * raceaa
Linear Risk Model:

Risk(preterm) = 0 + 1(pnc5) + 2(raceaa) +3(pncXrace) model 3
Log Risk Model:

lnRisk(preterm) = 0 + 1(pnc5) + 2(raceaa) +3(pncXrace) model 4
B2. Use model 3 to estimate the following, and enter results where indicated:
 RD and 95% CI for each combination of pnc5 and raceaa relative to the common referent
group of “jointly unexposed” births (no early care & non-AA) (table B1)
 The interaction contrast (IC) and its 95% CI (table B1)
 The expected RD11 vs. 00 assuming no RD modification (table B1, no CI)
 RD and 95% CI for early care vs. no early care according to maternal race (table B2)
 RD and 95% CI for AA vs. non-AA maternal race according to prenatal care (table B2)
5
 The product interaction term coefficient, its 95% CI and its p-value (below table B2)
 The LR test statistic and its p-value comparing model 3 to a reduced linear risk model with
pnc5 and raceaa only (i.e., model 1) (below table B2)
 Take time to note the difference between the estimates you enter in Tables B1 and B2. In Table
B1, you have a common referent group: no early care/non-AA. For Table 2, you are estimating RDs
for preterm birth within strata of race and then within strata of care.
How to perform likelihood ratio test (LRT)
To perform a likelihood ratio test (LRT) by hand, subtract the log likelihood (on the first page of the
Genmod output under “Criteria for Assessing Goodness of Fit”) from the reduced model (i.e. the
model without pncXrace) from the log likelihood in the full model (i.e. the model with pncXrace) and
multiply by two:
LRT =(LogLikelihoodFull – LogLikelihoodReduced)*2
The LRT is a chi-square test statistic with the degrees of freedom equal to the difference in the
number of parameters estimated in the reduced versus the full model. Note: LRT tests are only valid
for nested models (i.e. the “full” model includes all of the information included in the “reduced” model.)
For example, model 1 is nested in model 3.
You can calculate the P-value for the LRT in Excel or SAS.
Excel: chidist(test_statistic, degrees_of_freedom)
SAS: data lrt_log; *Note: Your output will be in the dataset lrt_log, not your output;
p_log = 1 - probchi(test_statistic,degrees_freedom);
put p_log; run;
For model 3:
IC = (0 + 1 + 2 + 3) + (0) - (0 + 1) - (0 + 2)
= 3
expected(R11) = R10 + R01 – R00

= (0 + 1) + (0 + 2) - 0
= 0 + 1 + 2
B3. Use model 4 to estimate the following, and enter results where indicated:
 RR for each combination of pnc5 and raceaa relative to the common referent group of “jointly
unexposed” births (no early care & non-AA) (table B1)
 The interaction contrast ratio (ICR) (table B1)
 The expected RR11 vs. 00 assuming no RD modification (table B1, no CI)
For model 4:
ICR = RR11 vs. 00 - expected(RR11 vs. 00)
= RR11 vs. 00 - (RR10 vs. 00 + RR01 vs. 00 – 1)
= RR11 vs. 00 - RR10 vs. 00 - RR01 vs. 00 + 1
expected(RR11 vs. 00) = RR10 vs. 00 + RR01 vs. 00 - 1

C. Indicator term models and RD modification
6
Here, you will use single referent models (sometimes referred to as a “joint and separate effects”
models) to generate coefficients that directly estimate the “joint” effect estimate measure (for the two
covariates in combination) and the “separate” effect measures (for each covariate alone) relative to a
common reference group (the “jointly unexposed”). In this section, pay attention to any similarities (or
differences) in estimates of interaction using the different approaches. Also, note that indicator term
models are generally used for the assessment of additive interaction using ICRs when risks cannot be
directly estimated (e.g. case-control studies.)
C1. Indicator terms

Create a variable pncrace that combines prenatal care (pnc5) and race (raceaa) into a single
variable where:
pncrace = 0 if no early prenatal care & non-AA
pncrace = 1 if had early prenatal care & non-AA
pncrace = 2 if no early prenatal care & AA
pncrace = 3 if had early prenatal care & AA
Then create the indicator terms pncrace1, pncrace2 and pncrace3:

Indicator Variables
Rij pnc5 raceaa pncrace pncrace pncrace pncrace
1 2 3
R00 no early care, non-AA 0 0 0 0 0 0
R10 had early care, non-AA 1 0 1 1 0 0
R01 no early care, AA 0 1 2 0 1 0
R11 had early care, AA 1 1 3 0 0 1
Risk(preterm|pncrace) = 0 + 1(pncrace1) + 2(pncrace2) +3(pncrace3) model 5
When model 5 is used to estimate joint and independent effect measures:

 The coefficient for pncrace1 from model 5 is the estimated RD for the separate or
“independent” association between preterm and early care vs. no early care among non-AA
births (i.e., births “unexposed” to race)
 The coefficient for pncrace2 from model 5 is the estimated RD for the separate or
“independent” association between preterm and AA vs. non-AA race among births that are
“unexposed” to early prenatal care
 The coefficient for pncrace3 from model 5 is the estimated RD for the “joint” association
between preterm and early care and AA race versus no early care and non-AA race.
To estimate corresponding lnRRs and lnORs use models 6 and 7, respectively (noting that the lnOdds
model is provided for completeness; you do not have to estimate the lnOdds model in this section):
lnRisk(preterm|pncrace) = 0 + 1(pncrace1) + 2(pncrace2) +3(pncrace3) model 6
lnOdds(preterm|pncrace) = 0 + 1(pncrace1) + 2(pncrace2) +3(pncrace3) model 7
C2. Use model 5 to estimate the following, and enter results where indicated:
 RD and 95% CI for each combination of pnc5 and raceaa relative to the common referent
group of “jointly unexposed” births (no early care & non-AA) (table C1)
 The interaction contrast (IC, no CI) (table C1)
 The expected RD11 vs. 00 assuming no RD modification (table C1, no CI)
 RD and 95% CI for early prenatal care vs. no early care according to maternal race (table C2)
7
 RD and 95% CI for AA vs. non-AA maternal race according to prenatal care (table C2)
 The LR test statistic and its p-value comparing model 5 to a reduced model with pnc5 and
raceaa only (i.e., model 1) (below table C2)
*For model 5:
IC = R11 – expected(R11)
= R11 – (R10 + R01 – R00)
= (0 + 3) – ((0 + 1) + (0 + 2) - 0)
= 3 – (1 + 2)
C3. Use model 6 to estimate the following, and enter results where indicated:
 RR for each combination of pnc5 and raceaa relative to the common referent group of “jointly
unexposed” births (no early care & non-AA) (table C1)
 The interaction contrast ratio (ICR) and 95% CI (see instructions below for 95% CI) (table C1)
 The expected RR11 vs. 00 assuming no RD modification (table C1, no CI)
95% Confidence Interval Estimates for the ICR

We will use a method to estimate the variance of the ICR from Hosmer and Lemeshow (1992).
There also are other methods for estimating the ICR variance, including a bootstrap method that
may provide a more accurate estimate under some scenarios than the “delta method” we are using
(see Assmann et al, 1996).
1. Access the variance-covariance matrix after fitting model 6 to estimate the variance for the ICR
using the variance formula below (from Hosmer & Lemeshow).
Var(ICR) = (RR10 vs. 002 * Var(1)) + (RR01 vs. 002 * Var(2)) + (RR11 vs. 002 * Var(3)) + (RR10 vs. 00 * RR01 vs. 00 *
2 * Cov(1,2)) + (-RR10 vs. 00 * RR11 vs. 00 * 2 * Cov(1,3)) + (-RR01 vs. 00 * RR11 vs. 00 * 2 * Cov(2,3))
2. Estimate the 95% confidence limits for the ICR as shown below:
Lower 95% CI for the ICR = ICR – 1.96 * Var(ICR)1/2
Upper 95% CI for the ICR = ICR + 1.96 * Var(ICR)1/2
 Round intermediate values to six decimal places for calculations.

 This formula can be used to estimate the Var(ICR) when RD modification is modeled using
indicator terms (model 6) , but is not valid when RD modification is modeled using product-
interaction terms (model 4). Other methods to estimate the Var(ICR) are also available.
 OR and corresponding variance estimate from logistic regression models with indicator terms
may be substituted in the formula above when ICRs are estimated based on OR.
Note: The original Hosmer and Lemeshow paper shows how to set up a spreadsheet to estimate
the ICR variance based on this method. There’s also a SAS macro (see Lundberg 1996).
References
1. Hosmer D, Lemeshow S. Confidence interval estimation of interaction. Epidemiol, 3: 452-456, 1992.
2. Lundberg M, Fredlund P, Hallqvist J, Diderichsen F. A SAS program calculating three measures of
interaction with confidence intervals. Epidemiology, 7: 655-6., 1996.
3. Assmann SF, Hosmer DW, Lemeshow S, Mundt KA. Confidence intervals for measures of interaction.
Epidemiology, 7: 286-90., 1996.
D. Assessing effect measure modification for modifiers with >2 categories
Methods and models to assess effect measure modification can be extended to accommodate
interactions between covariates with more than 2 categories. For example, effect measure
modification of the relation between early prenatal care (vs. no early care) and the 17 week risk of
8
preterm birth by race/ethnicity (raceth2) can be assessed using indicator term models with 7 indicator
terms for the 8 possible combinations of pnc5 (0, 1) and raceth2 (0, 1, 2, 3) as shown below (model
8):
indicator variables for pncraceth

Rij pnc5 raceth2 pncraceth Z1 Z2 Z3 Z4 Z5 Z6 Z7
R00 no early care, 0 0 0 0 0 0 0 0 0 0
white non-Hispanic
R10 had early care, 1 0 1 1 0 0 0 0 0 0
white non-Hispanic
R01 no early care, 0 1 2 0 1 0 0 0 0 0
white Hispanic
R11 had early care, 1 1 3 0 0 1 0 0 0 0
white Hispanic
R02 no early care, AA 0 2 4 0 0 0 1 0 0 0
R12 had early care, AA 1 2 5 0 0 0 0 1 0 0
R03 no early care, 0 3 6 0 0 0 0 0 1 0
other race
R13 had early care, 1 3 7 0 0 0 0 0 0 1
other race
Risk(preterm) = 0 + 1(Z1) + 2(Z2) +3(Z3) +4(Z4) +5(Z5) +6(Z6) +7(Z7) model 8
Alternatively, you can evaluate RD modification using product term interaction models. To do this,
create 3 indicator terms for the 4 categories of raceth2 (racethwh, racethb, racetho, as for
assignment 2) and include each in a product interaction term with pnc5 (model 9)
Risk(preterm) = 0 + 1(pnc5) + 2(racethwh) +3(racethb) +4(racetho) +5(pnc5 X racethwh)

+6(pnc5 x racethb) +7(pnc5 X racetho) model 9
As for the models used to assess effect measure modification between dichotomous covariates, the
indicator term and product term models shown above are equivalent with regard to their assumptions
and maximum likelihood values.
D1. Assessing RD modification based on RD for modifiers with >2 categories
a. Use a product term or indicator term interaction model to estimate RD and 95% CI for each stratum
of prenatal care (pnc5) and race/ethnicity (raceth2) relative to White non-Hispanic births with no early
prenatal care. Report risks, RD and CI in table D1.
b. Use an LR test to compare the fit of the interaction model with a main effects model (model 10)
Risk(preterm) = 0 + 1(pnc5) + 2(racethwh) +3(racethb) +4(racetho) model 10
D2. Stratum specific RD and NNT for modifiers with >2 categories
a. Use a product term or indicator term interaction model to estimate stratum-specific RD and 95% CI
for early care vs. no early care according to race/ethnicity (raceth2). Report risks, RD and CI in table
D2.
b. Estimate the number of white non-Hispanic, white Hispanic, African American and other
race/ethnicity births that would need to receive early prenatal care in order for the number of preterm
births to decrease by one (i.e., the number needed to treat, or NNT) and report results in table D2.
9
D3. Assessing RD modification based on RR for modifiers with >2 categories
a. Use a single referent indicator term log-risk model to evaluate RD modification based on estimated
RR, and enter results in table D2. To do this you will need to estimate 3 expected RRs (assuming no
RD modification) and 3 ICRs.
LnRisk(preterm) = 0 + 1(Z1) + 2(Z2) +3(Z3) +4(Z4) +5(Z5) +6(Z6) +7(Z7) model 11
 ICR11 indicates whether the estimated joint effect measure for early care and white Hispanic
race/ethnicity (the observed RR11) is consistent with additive risks for early care (vs. no early
care) and white Hispanic race/ethnicity (vs. white non-Hispanic race/ethnicity)
 ICR12 indicates whether the estimated joint effect measure for early care and AA race/ethnicity
(observed RR12) is consistent with additive risks for early care (vs. no early care) and AA (vs.
white non-Hispanic) race/ethnicity
 ICR13 indicates whether the estimated joint effect measure for early care and other race/ethnicity
(observed RR13) is consistent with additive risks for early care (vs. no early care) and Other
race/ethnicity (vs. white non-Hispanic race/ethnicity)
Index group Referent group

RR10 vs. 00 1 had early care, white no early care, white
non-Hispanic non-Hispanic
RR01 vs. 00 2 no early care, white no early care, white
Hispanic non-Hispanic
RR11 vs. 00 3 had early care, white no early care, white Expected11 vs. 00 =RR10 vs. 00 + RR01 vs. 00 -1
Hispanic non-Hispanic ICR11 =RR11 vs. 00 –RR10 vs. 00 –RR01 vs. 00 +1
RR02 vs. 00 4 no early care, AA no early care, white
non-Hispanic
RR12 vs. 00 5 had early care, AA no early care, white Expected12 vs. 00 =RR10 vs. 00 + RR02 vs. 00 -1
non-Hispanic ICR12 =RR12 vs. 00 –RR10 vs. 00– RR02 vs. 00 +1
RR03 vs. 00 6 no early care, other no early care, white
non-Hispanic
RR13 vs. 00 7 had early care, other no early care, white Expected13 vs. 00 =RR10 vs. 00 + RR03 vs. 00 -1
non-Hispanic ICR13 =RR13 vs. 00 –RR10 vs. 00 –RR03 vs. 00 +1
10
Written Assignment (100 points total; 95 points content, 5 points style)
Please provide brief answers to the following questions.
1. Which model will produce perfectly homogenous risk difference estimates for early care vs. no
early care across strata of race (AA or non-AA)? Why? (2 points)
2. Briefly explain the similarities or differences in RD measures for early care and AA race derived
from contingency tables (section A), model 3 and model 5. (2 points)
3. Would a Likelihood Ratio Test comparing model 6 to a log-risk model with pnc5 and raceaa
alone be relevant to an analysis of biologic interaction between early prenatal care and
race/ethnicity? Briefly justify your answer. (2 points)
4. List one advantage and one disadvantage of dichotomizing race as African American or non-
African American. How might conclusions about the presence or absence of health disparities be
affected by dichotomizing race into two groups? (4 points)
5. Describe the analyses of risk difference modification by race/ethnicity from part D as you would
for the results section of a publication. Address the following in your description (10 points):
 Which group had the highest estimated 17-week risk of preterm? Which had the lowest
estimated risk?
 Do estimated risks associated with early prenatal care differ according to race/ethnicity? If so,
how?
 Which group or groups appeared to have the greatest reduction in preterm birth associated
with early prenatal care vs. no early care?
 Are the joint effects of early prenatal care and race/ethnicity different from what you would
expect assuming additive risks?
 Be brief and note estimates that support your answers. You may also refer to specific results
tables as appropriate.
6. Would the results of your analysis support targeting early prenatal care programs to specific
race/ethnicity groups in order to reduce the incidence of preterm birth in North Carolina? Why or
why not? Consider the frequency of exposure, incidence of outcome, and results from your
analysis of effect measure modification. Be brief, but justify your conclusions (10 points)
11
Table A1. Contingency Table Analyses (5 points)
Early prenatal Race ij Preterm Total Risk
care (N) (N)
No early care Non-AA 00
Had early care Non-AA 10
No early care AA 01
Had early care AA 11
*Round risk estimates to three significant digits. You do not need to show 95% CI.
expectedR11= ________________________ IC based on risks= ___________________________
Table A2. Contingency Table Analyses: Common Referent Estimates (5 points)*

RD RR OR
no early care/non-AA vs no early 00 vs 00 0 1.0 1.0

care/non-AA
early care/non-AA vs no early 10 vs 00
care/non-AA
no early care/AA vs no early care/non- 01 vs 00
AA
early care/AA vs no early care/non-AA 11 vs 00
Round all estimates to three significant digits. You do not need to show 95% CI for estimates in or
below table A2.
expectedRD11 vs 00* = _______________________ IC based on RD = ________________________
expectedRR11 vs 00* = _______________________ ICR based on RR = _______________________
expectedOR11 vs 00* = _______________________ ICR based on OR = _______________________
*expected RD, RR and OR assuming RD homogeneity
Table A3. Contingency Table Analyses: Stratified Estimates (4 points)*

RD RR OR
Effect estimate measure for care vs.

no care by race
Early care vs. no care if non-AA 10 vs 00
Early care vs. no care if AA 11 vs 01
Effect estimate measure for AA vs.

non-AA race by early care
AA vs. non-AA if no early care 01 vs 00
AA vs. non-AA if had early care 11 vs 10
Round all estimates to three significant digits. You do not need to show 95% CI
12
Table B1. Product term interaction models: common referent analyses (6 points)
linear risk log-risk
RD 95% CI RR 95% CI
no early care/non-AA vs no early 00 vs 00
care/non-AA
early care/non-AA vs no early care/non- 10 vs 00
AA
AA
Round all values to three significant digits.
expectedRD11 vs 00* = _______________________ IC (95% CI) = _________________________
expectedRR11 vs 00* = _______________________ ICR (no CI) = _______________________
*expected RD and RR assuming RD homogeneity
Table B2. Product term interaction models: stratified analyses (5 points)

linear risk
RD 95% CI
RD for care vs. no care by race
Early care vs. no early care if non-AA 10 vs 00
Early care vs. no early care if AA 11 vs 01
RD for AA vs. non-AA by early care

*Round all values in table B2 to three significant digits. Round p-values to 1 significant figure.
Likelihood ratio test

chi2 = df = p=
Coefficient for pncXrace
Coefficient and 95% CI = p=
13
Table C1. Indicator term models: common referent analyses (6 points)
linear risk log-risk
RD 95% CI RR 95% CI
no early care/non-AA vs no early 00 vs 00
care/non-AA
early care/non-AA vs no early care/non- 10 vs 00
AA
AA
expectedRD11 vs 00* = _______________________ IC (no CI) = _________________________
expectedRR11 vs 00* = _______________________ ICR (95% CI) = _______________________
*expected RD and RR assuming RD homogeneity
Table C2. Indicator term models: stratified analyses (3 points)

linear risk
RD 95% CI
RD for care vs. no care by race
Early care vs. no early care if non-AA 10 vs 00
Early care vs. no early care if AA 11 vs 01
RD for AA vs. non-AA by early care

Likelihood Ratio Test

chi2 = df = p=
14
D1. Assessing RD modification based on RD for modifiers with >2 categories (14 points)
Preterm Total Risk RD 95% CI
births (N) births (N)
White non-Hispanic
no early care
had early care
White Hispanic
no early care
had early care
African American
no early care
had early care
Other race
no early care
had early care
Round risks, RD and 95% CI to three significant digits.
Likelihood ratio test chi2 = df = p=
D2. Stratum-specific RD and NNT for modifiers with >2 categories (4 points)
RD 95% CI NNT*
Early care vs. no early care by race/ethnicity
White non-Hispanic 10 vs 00
White Hispanic 11 vs 01
AA 12 vs 02
Other 13 vs 03
D3. Assessing RD modification based on RR for modifiers with >2 categories (13 points)
Preterm Total RR 95% CI Expected ICR 95% CI
births (N) births (N) joint RR*
no early care,
white non-Hispanic
had early care,
white non-Hispanic
no early care,
white Hispanic
had early care,
white Hispanic
no early care,
AA
had early care, AA
no early care,
other
had early care,
other
Round all values to three significant digits. *Expected joint RR assuming no RD modification
15

Cohort Assignment

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Cohort Assignment

Uploaded by

Copyright:

Available Formats

Cohort Assignment 5: Effect Measure Modification

raceaa = 0 if non-AA (White and other combined)

Notation and terminology

Risk(preterm) = 0 + 1(pnc5) + 2(raceaa) model 1

Observed versus expected risks

As with model 1, 4 risks may be estimated:

If the homogeneity assumption is correct, 3 = 0 so that

so that when the homogeneity assumption is true,

Interpreting Interaction Contrasts

Estimating RD modification using Interaction Contrast Ratios (ICRs)

This statement can be written in terms of RR by dividing through by R00

expected(R11/R00) = R10/R00 + R01/R00 – R00/R00

ICR = RR11 vs. 00 - expected(RR11 vs. 00)

The ICR can also be written as:

In addition, when the OR is a valid estimate of the RR:

exp(OR11 vs. 00) = OR10 vs. 00 + OR01 vs. 00 - 1

A. Contingency Table Analyses

A1. Frequencies, risks, expected risks and IC

expected(R11) = R10 + R01 – R00

 The IC based on the estimated risks

A2. Common Referent Estimates

expected(RD11 vs. 00) = RD10 vs. 00 + RD01 vs. 00

ICRD = RD11 vs 00 - expected(RD11 vs 00)

A3. Stratum-Specific Estimates

B. Interaction term models and RD modification

B1. Create a product interaction term (multiplicative variable) pncXrace where

Linear Risk Model:

Log Risk Model:

How to perform likelihood ratio test (LRT)

Excel: chidist(test_statistic, degrees_of_freedom)

expected(R11) = R10 + R01 – R00

expected(RR11 vs. 00) = RR10 vs. 00 + RR01 vs. 00 - 1

C1. Indicator terms

Then create the indicator terms pncrace1, pncrace2 and pncrace3:

Risk(preterm|pncrace) = 0 + 1(pncrace1) + 2(pncrace2) +3(pncrace3) model 5

When model 5 is used to estimate joint and independent effect measures:

lnRisk(preterm|pncrace) = 0 + 1(pncrace1) + 2(pncrace2) +3(pncrace3) model 6

lnOdds(preterm|pncrace) = 0 + 1(pncrace1) + 2(pncrace2) +3(pncrace3) model 7

95% Confidence Interval Estimates for the ICR

 Round intermediate values to six decimal places for calculations.

indicator variables for pncraceth

Risk(preterm) = 0 + 1(Z1) + 2(Z2) +3(Z3) +4(Z4) +5(Z5) +6(Z6) +7(Z7) model 8

Risk(preterm) = 0 + 1(pnc5) + 2(racethwh) +3(racethb) +4(racetho) +5(pnc5 X racethwh)

D1. Assessing RD modification based on RD for modifiers with >2 categories

Risk(preterm) = 0 + 1(pnc5) + 2(racethwh) +3(racethb) +4(racetho) model 10

LnRisk(preterm) = 0 + 1(Z1) + 2(Z2) +3(Z3) +4(Z4) +5(Z5) +6(Z6) +7(Z7) model 11

Index group Referent group

Please provide brief answers to the following questions.

expectedR11= ________________________ IC based on risks= ___________________________

Table A2. Contingency Table Analyses: Common Referent Estimates (5 points)*

no early care/non-AA vs no early 00 vs 00 0 1.0 1.0

expectedRD11 vs 00* = _______________________ IC based on RD = ________________________

expectedRR11 vs 00* = _______________________ ICR based on RR = _______________________

expectedOR11 vs 00* = _______________________ ICR based on OR = _______________________

*expected RD, RR and OR assuming RD homogeneity

Table A3. Contingency Table Analyses: Stratified Estimates (4 points)*

Effect estimate measure for care vs.

Early care vs. no care if AA 11 vs 01

Effect estimate measure for AA vs.

AA vs. non-AA if had early care 11 vs 10

Round all values to three significant digits.

expectedRD11 vs 00* = _______________________ IC (95% CI) = _________________________

expectedR11= IC based on risks= ___

expectedRD11 vs 00* = _ IC based on RD = __

expectedRR11 vs 00* = _ ICR based on RR = _

expectedOR11 vs 00* = _ ICR based on OR = _

expectedRD11 vs 00* = _ IC (95% CI) = ___

expectedRR11 vs 00* = _ ICR (no CI) = _

expectedRD11 vs 00* = _ IC (no CI) = ___

expectedRR11 vs 00* = _ ICR (95% CI) = _