Professional Documents
Culture Documents
Assessing Sensitivity To Unmeasured Confounding Using A Simulated Potential Confounder
Assessing Sensitivity To Unmeasured Confounding Using A Simulated Potential Confounder
To cite this article: Nicole Bohme Carnegie, Masataka Harada & Jennifer L. Hill (2016) Assessing
Sensitivity to Unmeasured Confounding Using a Simulated Potential Confounder, Journal of
Research on Educational Effectiveness, 9:3, 395-420, DOI: 10.1080/19345747.2015.1078862
METHODOLOGICAL STUDIES
ABSTRACT KEYWORDS
A major obstacle to developing evidenced-based policy is the difficulty of causal inference
implementing randomized experiments to answer all causal questions of sensitivity analysis
interest. When using a nonexperimental study, it is critical to assess how unmeasured confounder
much the results could be affected by unmeasured confounding. We hidden bias
omitted variable
present a set of graphical and numeric tools to explore the sensitivity of
causal estimates to the presence of an unmeasured confounder. We
characterize the confounder through two parameters that describe the
relationships between (a) the confounder and the treatment assignment
and (b) the confounder and the outcome variable. Our approach has two
primary advantages over similar approaches that are currently
implemented in standard software. First, it can be applied to both
continuous and binary treatment variables. Second, our method for
binary treatment variables allows the researcher to specify three possible
estimands (average treatment effect, effect of the treatment on the
treated, effect of the treatment on the controls). These options are all
implemented in an R package called treatSens. We demonstrate the
efficacy of the method through simulations. We illustrate its potential
usefulness in practice in the context of two policy applications.
One of the biggest struggles in policy analysis is the inability to directly study many ques-
tions of interest using randomized experiments. Attempts to infer causality using nonexperi-
mental studies are vulnerable to violations of the assumption that requires, colloquially
speaking, that all confounders have been measured. Rather than abandoning the goal of
causal inference using nonexperimental data, methods that allow researchers to explore sen-
sitivity of their inferences to violations of this assumption can act as a middle ground. These
approaches help to build confidence in the results of nonexperimental studies by differentiat-
ing between studies whose results are relatively immune to potential unmeasured confound-
ers versus those whose results might change drastically.
This article presents a set of graphical and numeric tools to investigate the sensitivity of
causal estimates in nonexperimental studies to the presence of an unmeasured confounder.1
CONTACT Nicole Bohme Carnegie carnegin@uwm.edu Joseph J. Zilber School of Public Health, University of
Wisconsin-Milwaukee, P.O. Box 413, Milwaukee, WI 53201-0413, USA.
a
University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
b
National Graduate Institute for Policy Studies, Tokyo, Japan
c
New York University, New York, New York, USA
1
Throughout the article we use the term “unmeasured confounder” rather than using terms such as “omitted variable” or
“unobserved covariate” for the sake of consistency and clarity. In practice it is possible that this variable was measured but
was simply omitted from the analysis.
© Nicole Bohme Carnegie, Masataka Harada, and Jennifer L. Hill.
This is an Open Access article. Non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly
attributed, cited, and is not altered, transformed, or built upon in any way, is permitted. The moral rights of the named author(s) have been
asserted.
We characterize the confounder through two parameters which describe (a) the relationship
between the confounder and the treatment assignment and (b) the relationship between the
confounder and the outcome variable. Our approach to assessing sensitivity to an unmea-
sured confounder has two primary advantages over similar approaches (with two sensitivity
parameters) that are currently implemented in standard software. First, it can be applied to
both continuous and binary treatment variables. Second, our method for binary treatment
variables allows the researcher to specify three possible estimands (average treatment effect,
effect of the treatment on the treated, effect of the treatment on the controls). All tools
described in this article are available in an R package called treatSens.
2
Ichino, Mealli, and Nannicini (2008) developed a computational approach along this line; sensatt (Nannicini, 2007) performs
their sensitivity analysis in Stata.
3
In Stata, rbounds (DiPrete & Gangl, 2004) performs the sensitivity analysis of Rosenbaum (2002b) for continuous outcome var-
iables, while mhbounds (Becker & Caliendo, 2007) performs the counterpart for binary outcome variables. In R, rbounds (Keele,
2010) performs the sensitivity analysis of Rosenbaum (2002b).
4
Some of these methods are nonetheless categorized as a semiparametric due to their reliance on a parametric model to
estimate propensity scores used to match or stratify.
5
However, recent work by Rosenbaum (2011) presents a promising alternative statistic.
6
Some methods in this realm (McCandless, Gustafson, & Levy, 2007; Rosenbaum & Rubin, 1983 for instance,) rely on still more
sensitivity parameters, which we find to be prohibitively complicated for standard usage. A few strategies, such as the one
suggested by Altonji et al. (2005) require only one sensitivity parameter but this restriction unfortunately makes it quite diffi-
cult to interpret.
7
isa (Harada, 2011) performs the sensitivity analysis of Imbens (2003) in Stata.
8
gsa (Harada, 2012) performs the sensitivity analysis of Harada (2013) in Stata.
Y ðz Þ ? ZjX 8 z 2 Z (1)
9
and is referred to as the ignorability assumption. This is a critical assumption, because if it is
satisfied, we can identify causal effects simply by appropriately adjusting for the covariates in
X (for instance by an appropriate model for Y conditional on the covariates and Z or by a
propensity score approach).
It is rare, however, that researchers have confidence that this assumption holds. As a strat-
egy for exploring sensitivity to this assumption, we posit an unmeasured variable, U, that, if
included, would serve to satisfy ignorability. That is, we assume that our set of potential out-
comes, Y(Z), is conditionally independent of the treatment assignment, Z, given both the
vector of observed covariates, X, and an additional unmeasured confounder U. Thus, the
ignorability assumption would relax to the following version,
Y ðz Þ ? ZjX; U 8 z 2 Z (2)
We illustrate the basic structure of our sensitivity analysis first assuming that our data
comprise independent, identically distributed units. In addition, we make modeling assump-
tions. Specifically, we assume that, if ignorability is satisfied given X and U, the following
equation holds
9
Ignorability is typically referred to as “selection on observables” in economics and “all confounders measured” or
“exchangeability” in the epidemiology literature.
where by and bz represent the vectors of regression coefficients for the covariate matrix X,
and t is the population-level treatment effect. It is natural to think of U as being normally
distributed if we conceptualize U as a linear combination of all unmeasured confounders,
which is approximately normally distributed for a sufficient number of confounders or if the
10
We avoid creating a separate set of models for binary outcomes due to the issues with noncollapsibility inherent in such
models (Greenland, Robins, & Pearl, 1999).
where z~; y~; s 2z~ and s 2y~ denote the realized residuals and their respective variances from
regressions run for Z conditional on X and for Y conditional on Z and X, respectively.
We can draw U_ from an estimated version of this conditional distribution for any valid
combination of values for the sensitivity parameters (discussed in the “General Algorithm”
section) by plugging these values into the formula along with estimates of the required
residual variances.
U » Bernoulliðpu Þ (9)
where F denotes the probit link. We posit a binary U in this scenario for reasons of mathe-
matical convenience.
The distribution of U conditional on the sensitivity parameters and observed data corre-
sponding to this complete data likelihood is not as readily available as in the multivariate
normal case. Thus we rely on a computational trick to draw from the correct distribution.
This strategy capitalizes on the fact that if the parameters from the complete data likelihood
were known, the conditional distribution of U would be straightforward to define as a
ASSESSING SENSITIVITY TO UNMEASURED CONFOUNDING 401
Bernoulli distribution with probabilities equal to the ratio of appropriate likelihoods.
y;z;x;u D 1
p
UjY; Z; X » Bernoulli (10)
py;z;x
where the numerator (representing the joint likelihood of the variables with U set equal to 1)
and denominator (representing the likelihood after marginalizing over U) can be calculated
using the following formulas
!
¡ 1=2 ðY ¡ Xby ¡ zy ¡ tZ Þ2
py;z;x;u D 1 D 2ps 2y ¢xuz exp ¡
2s 2y ¢xuz (11)
ð1 ¡ Z Þ Z u
¢ð1 ¡ FðXb C z ÞÞ
z z
ðFðXb C z ÞÞ p
z z
and
!
¡ 1=2 ðY ¡ Xby ¡ tZ Þ2
py;z;x D 2ps 2y ¢xuz exp ¡
2s 2y ¢xuz
¢ð1 ¡ FðXbz ÞÞð1 ¡ Z Þ ðFðXbz ÞÞZ ð1 ¡ pu Þ
! (12)
¡ 1=2 ðY ¡ Xb y
¡ z y
¡ tZ Þ2
C 2ps 2y ¢xuz exp ¡
2s 2y ¢xuz
¢ð1 ¡ FðXbz C zz ÞÞð1 ¡ ZÞ ðFðXbz C zz ÞÞZ pu
The challenge in making use of this distribution is that we not only do not know the true
parameters needed to calculate the probabilities for this conditional distribution, we cannot
even estimate all of them unbiasedly. In order to properly estimate by, t, and s 2y ¢xuz , we
would need to know U. This motivates an iterative estimation strategy that is a form of Sto-
chastic EM (expectation-maximization) algorithm (Neilsen, 2000). We iterate between two
steps. In the one step we estimate the model parameters in Equation 7 and Equation 8 condi-
tional on the observed data and a draw of the vector U. In the other step we draw U_ from its
conditional distribution (as in Equation 10) conditional on the data and the parameter value
estimates from the previous step. We repeat these steps enough times that we are confident
that we are drawing it from the correct distribution; in our applications this typically took
less than 10 steps.
11
This is analagous to the variance formula used for multiple imputation estimates (Rubin, 1987).
Calibration
As other scholars have before us (e.g., Imbens, 2003), we suggest calibrating the
strength of these sensitivity parameters by using estimates of the regression coefficients
for the observed covariates, X, in the data (from the regression of Y on Z and X and
from the regression of Z on X) as a benchmark. Any confounder with a negative partial
association with the outcome is reverse-scaled so that this coefficient estimate will be
positive; these estimates are tagged by displaying them as 5. To create a common scale
for these coefficients (among themselves and relative to the coefficients that serve as
our sensitivity parameters) we first standardize all continuous covariates and the out-
come to have mean of zero and standard deviation of one. As an additional calibration
404 N. B. CARNEGIE ET AL.
Figure 1. Each contour on the plot reflects the combinations of sensitivity parameters that would lead to
the treatment effect estimate used as the label for that contour. The lines labeled “N.S.” correspond to the
treatment effect estimates that forfeit statistical significance. The pale contour corresponds to the effect
estimate that would be yielded by an unmeasured confounder with properties equivalent to that of the
observed confounder that is farthest from the origin. The plus signs and triangles give the coefficient
estimates for the observed covariates. The 5 sign reflects a covariate whose coefficient for the outcome
model was negative, prompting a rescaling of the variable by ¡ 1.
aid, we plot a gray contour corresponding to the treatment effect estimate obtained
with sensitivity parameters equivalent to the pair of observed coefficients that are far-
thest from the origin.12 The display of the coefficients for the observed confounders in
this example allows us to see that, in order for either no treatment effect or a
nonsignificant treatment effect to represent the truth, the unmeasured confounder
would have to have considerably greater predictive power than the observed
confounders.
12
It may seem odd that this gray line will not necessarily intersect the symbol for the covariate with the most extreme set of
coefficients. This is driven by two considerations. The first is the fact that the plot function provides contours that represent a
smoothed fit to the estimated treatment effect estimates at our specified grid points. The second is that the estimates of the
coefficients on the observed covariates from the Y model will not be precisely equivalent to estimates from the by in the
model formulation that includes U.
Simulation Study
We present a simulation study to demonstrate the performance of our approach in the more
complex setting, binary treatment with weighted estimators.14 We can use this setting to
investigate whether our algorithm is able to recover estimands such as ATE and ATT in the
setting where covariate-driven treatment effect heterogeneity exists.
Data-Generating Process
We adopt a fairly simple simulation setup that nonetheless captures the key features of the
problem. We begin by assuming four independent covariates, X1, …, X4, each distributed as
an independent standard normal variable. We also posit a covariate M that follows a Ber-
noulli distribution with Pr ðM D 1Þ D :5. We distinguish this covariate because of its role as
moderator of the treatment effect (made explicit below). Finally we assume an unmeasured
covariate, U, that follows a Bernoulli distribution with Pr ðU D 1Þ D :5.
Because this simulation is designed to verify that the algorithms behave as expected, we
generate Z and Y according to a slight variation on our specified complete-data likelihood
described in Equation 7,
YjZ; U; X; M » N ay C Xby C uy M C zy U C tZ C tint MZ; s 2y ¢xuz
(13)
ZjX; U » BernoulliðFðaz C Xbz C uz M C zz U ÞÞ;
with ay D ¡1.5, by D (.2, .4, .6, .8), uy D 1, t D ¡3, tint D 6, s 2y ¢xuz D 4, az D ¡1.5, bz D (.25,
.25, .25, .25), and uz D 1. This specification allows for treatment effects to vary based on the
level of observed covariate M; in particular, observations with M D 1 have a treatment effect
13
As far as we know the combination of sensitivity analysis and IPTW has only been used in one applied article (Frank et al.,
2008); that article did not discuss the specific assumptions that accompany this technique.
14
Simulations for the continuous treatment scenario are not shown given that the mathematical derivations for that scenario
are straightforward.
Estimands
There are several estimands of interest in this simulation. One set of estimands repre-
sents different ways of conceptualizing the average treatment effect. One such estimand
is the population average treatment effect (PATE; which is 0). We also consider the
model-based sample approximation to this, which is the estimated coefficient on Z
from a regression of Y on Z, X, M, and U for our sample; in other words, this estimand
represents the coefficient we would have estimated had we had access to U as an
observed covariate. We refer to this estimand as the regression average treatment effect
(RATE).
A second set of estimands represents ways of conceptualizing the average effect of
the treatment on the treated, which differs from PATE because of the treatment effect
modification. This includes the population average treatment effect for the treated
(PATT). PATT varies by zz; when zz D (¡2, ¡1, 0, 1, 2), PATT D (.71, .94, 1.04, .94,
.71). We also consider the model-based sample approximation to these, which is the
estimated coefficient on Z from a weighted regression of Y on Z, X, M, and U for our
sample, with weights corresponding to those described in the “Use of Modification
Weights to Target Specific Estimands” section. This is the coefficient we would have
estimated using a standard inverse probability of treatment weighting (IPTW) approach
had we had access to U as an observed covariate; we call it the regression average
treatment effect for the treated (RATT).15
Results
Figures 2 and 3 present the simulation results that demonstrate the performance of our
approach targeting two different types of estimands, ATE and ATT, respectively. Simula-
tions are performed across 15 different combinations of the sensitivity parameters, which
are presented in the top (zz) and the right (zy) of each figure. For each distinct combination
of zz and zy, we present three box plots. The box plot in the middle is a summary of our sen-
sitivity analysis estimates, b
t ðzz ; zy Þ, over 1,000 simulations. The box plot on the left is a
summary of the estimated treatment effects from a regression of Y on Z, X, M, and U, had
we had access to U, RATE in Figure 2 and RATT in Figure 3. The box plots to the right are
the counterparts from the regression of Y on Z, X, and M assuming no access to the unmea-
sured confounder; that is, the regression estimate based purely on observed data. Thus, the
left box plots can be thought of as the best possible results, while the right box plots are the
worst case scenario. Finally, PATE and PATT, displayed as red solid line segments in
Figure 2 and Figure 3, respectively, represent the true population level parameters.
15
We choose not to present results for PATC and RATC for the sake of parsimony since, due to the nature of the problem
(e.g., PATE is just a weighted average of PATT and PATC) they provide no additional information in this setup.
Figure 3. Sensitivity analysis results from the simulation for estimates targeting ATT estimands (PATT,
RATT). Each box plot displays the distribution of estimates/estimands across 1,000 simulations. The left
box plot in each cell displays RATT. The middle box plot displays the sensitivity analysis results. The right
box plot displays regression estimates that exclude U. Each set of three box plots varies across combina-
tions of the sensitivity parameters (labels on the top and to the right).
16
Even if we could randomize some families to receive center-based care, we could not force them to participate in this care or
force those in the control group to not participate in center-based care.
Results
Figure 4 contains an analysis of the sensitivity of the estimated coefficient of the continuous
treatment to potential unmeasured confounding.19 Most of the observed covariates have a
17
When a child’s behavior was evaluated by multiple caregivers, the average of their evaluations was used.
18
Because of a seven-month gap between the Phase I and Phase II studies and the fact that hours per week in care are
recorded everyfourP34 months in Phase II, the variable for average hours in care is created with the following formula:
care 24 » 50 D 271 m D 24 carem C 8 ¢care42 C 4 ¢care46 C 4 ¢care50 . Because we observed excessively large entries for this
continuous treatment in some cases, we capped the maximum value of time in care at 70 hours.
19
We use a 10 by 20 grid with 100 draws of U_ per cell.
negative association with externalizing behavior and have been rescaled (multiplied by ¡1)
so that the estimated coefficients will be positive; these are represented by 5 on the plot. All
continuous covariates have also been standardized to have a mean of 0 and standard devia-
tion of 1 to facilitate comparisons of their magnitude to that of the sensitivity parameters.
This figure shows that, over a range of plausible sensitivity parameters, the estimated
treatment effects range from roughly ¡.5 to .8, with the majority of the range consistent
with a significant, positive treatment effect. The blue reference lines indicate that the
sensitivity parameters required for an unmeasured confounder to drive the estimate to non-
significance are larger than the coefficients for the observed confounders by about 50%. To
drive the estimate to zero (red reference line), these parameters would have to be about
200% larger than the largest coefficient estimate for the observed covariates. Such values are
not implausible. For instance, consider a confounder with zz D .5 and zy D .4. This corre-
sponds to a continuous confounder with the following property: when comparing two
groups that differ by one standard deviation on this confounder they would also differ by .5
standard deviations on the treatment variable (which corresponds to about 6 hours per week
of nonparental care). This confounder would also have the property that when comparing
two groups that differ by one standard deviation on this confounder they would also differ
by .4 standard deviations on the outcome variable (which corresponds to 4 on the externaliz-
ing behavior index).
20
Notably, removal of 1974 earnings is particularly problematic for these data most probably due to the so-called “Ashenfelter
dip” (due to Ashenfelter, 1978), which describes the phenomenon that those who enroll in job training programs are more
likely to have experienced job loss right before the participation (implying that conditioning on only one period of
pretreatment earnings is likely not sufficient for establishing ignorability).
Results
Figure 5 shows the results of the sensitivity analysis that examines the sensitivity to unmea-
sured confounding for each of two estimates of the effect of participation in the NSW job
training program. The left panel does not use weights and thus targets an average treatment
effect (ATE). The right panel uses modification weights to target the ATT. We use standard-
ized coefficients to facilitate comparisons between the coefficients on observed confounders
and the sensitivity parameters for our posited binary unmeasured confounder. These bench-
marking coefficients are more strongly predictive of the treatment assignment in the
unweighted sample than in the weighted sample, which is not surprising since the weighted
sample creates greater balance between the treatment and (weighted) control groups. With
one exception, the observed covariates have either similar or reduced power to predict the
outcome variable in the weighted sample as well.
The left panel suggests that absent unmeasured confounding the effect estimate would be
about 0. It further suggests that it would take a reasonably strong unmeasured confounder
that was negatively associated with treatment but positively associated with the outcome to
yield a significant positive effect (such effects correspond to the upper-left corner of the
plot). One such confounder does exist in the observed data, however. On the other hand, if
the unmeasured confounder were strongly positively associated with both treatment and
outcome this would suggest a significant negative treatment effect estimate (upper-right cor-
ner of the plot).
Figure 5. Sensitivity analysis results from the NSW. Left panel displays results from algorithm that uses no
weights. Right panel displays results from algorithm that uses modification weights to target the effect of
the treatment on the treated.
Discussion
This article describes a general approach to sensitivity analysis with specific implementations
for models with continuous and binary treatment variables. Our approach makes use of the
conditional distribution of the unmeasured confounder implied by each of the two models
to generate candidate realizations of the unmeasured confounder. This allows us to estimate
the distribution of treatment effect estimates that correspond to an analysis that conditions
on that confounder. Conveniently, the unmeasured confounder is characterized by sensitiv-
ity parameters that can be interpreted as regression coefficients (from the regression of the
outcome on the treatment, observed covariates, and unmeasured covariate and from the
regression of the treatment on the observed covariates and the unmeasured covariate). These
coefficients are interpretable to applied researchers who have substantial experience in inter-
preting regression coefficients. In addition, this specification allows us to benchmark the
magnitude of the sensitivity parameters relative to the corresponding coefficients for
observed confounders in our data.
An important contribution of our approach is the extension of the binary treatment variable
implementation to allow for targeting of the effect of two causal estimands that are often of
strong interest for policy and practice: the effect of the treatment on the treated and the effect of
the treatment on the controls. Given the complexity of the algorithm for the binary treatment,
particularly combined with the weighted estimation needed for targeting ATT and ATC, we
have performed simulations to verify the properties of the algorithm. Across a wide range of
combinations of sensitivity parameters, the algorithm reliably recovers a distribution nearly
equivalent to that of the regression estimates in a regression controlling for the corresponding
unmeasured confounder (i.e., it mimics the sampling distributions of RATE or RATT). The
NSW example highlights the potential practical implications of this feature of our strategy.
414 N. B. CARNEGIE ET AL.
This approach to assessing sensitivity to unmeasured confounding described in this article
is a useful complement to any observational study for which it is unclear if ignorability has
been satisfied. Software that implements these algorithms is currently available in the treat-
Sens package on the CRAN archive (for the R software program).
Funding
This research was partially supported by Institute of Education Sciences grant R305D110037.
ARTICLE HISTORY
Received 16 December 2013
Revised 17 December 2014
Accepted 29 July 2015
EDITORS
This article was reviewed and accepted under the editorship of Carol McDonald Connor and Spyros
Konstantopoulos.
References
Altonji, J. G., Elder, T. E., & Taber, C. R. (2005). Selection on observed and unobserved variables:
Assessing the effectiveness of Catholic schools. Journal of Political Economy, 113, 151–184.
Angrist, J., & Pischke, J. (2008). Mostly harmless econometrics: An empiricist’s companion. Princeton,
NJ: Princeton University Press.
Anme, T., & Segal, U. A. (2004). Implications for the development of children in over 11 hours of cen-
tre-based care. Child: Care, health and development, 30, 345–352.
Ashenfelter, O. (1978). Estimating the effects of training programs on earnings. Review of Economics
and Statistics, 60, 47–57.
Bacharach, V. R., & Baumeister, A. A. (2003). Child care and severe externalizing behavior in kinder-
garten children. Journal of Applied Developmental Psychology, 23, 527–537.
Bates, J. E., Marvinney, D., Kelly, T., Dodge, K. A., Bennett, D. S., & Pettit, G. S. (1994). Child care his-
tory and kindergarten adjustment. Developmental Psychology, 30, 690–700.
Baydar, N., & Brooks-Gunn, J. (1991). Effects of maternal employment and child-care arrangements
on preschoolers’ cognitive and behavioral outcomes: Evidence from the Children of the National
Longitudinal Survey of Youth. Developmental psychology, 27, 932–945.
Becker, S. O., & Caliendo, M. (2007). Sensitivity analysis for average treatment effects. Stata Journal, 7,
71–83.
Belsky, J. (1999). Quantity of nonmaternal care and boys’ problem behavior/adjustment at ages 3 and
5: Exploring the mediating role of parenting. Psychiatry: Interpersonal and Biological Processes, 62,
1–20.
Bross, I. D. (1966). Spurious effects from an extraneous variable. Journal of Chronic Diseases, 19,
637–647.
Bross, I. D. (1967). Pertinency of an extraneous variable. Journal of Chronic Diseases, 20, 487–495.
Buckley, J., & Schneider, M. (2007). Charter schools: Hype or hope?. Princeton, NJ: Princeton Univer-
sity Press.
Campbell, S. B. (2002). Behavior problems in preschool children: Clinical and developmental issues.
New York, NY: Guilford Press.
Copas, J. B., & Li, H. G. (1997). Inference for non-random samples. Journal of the Royal Statistical
Society. Series B (Methodological), 59, 55–95.
ASSESSING SENSITIVITY TO UNMEASURED CONFOUNDING 415
Cornfield, J., Haenszel, W., Hammond, E. C., Lilienfeld, A. M., Shimkin, M. B., & Wynder, E. L.
(1959). Smoking and lung cancer: Recent evidence and a discussion of some questions. Journal of
the National Cancer Institute, 22, 173–203.
Crockenberg, S., & Litman, C. (1991). Effects of maternal employment on maternal and two-year-old
child behavior. Child Development, 62, 930–953.
Dehejia, R. H. (2005). Practical propensity score matching: A reply to Smith and Todd. Journal of
Econometrics, 120, 355–364.
Dehejia, R. H., & Wahba, S. (1999). Causal effects in nonexperimental studies: Reevaluating the evalu-
ation of training programs. JASA, 94, 1053–1062.
DiPrete, T. A., & Gangl, M. (2004). Assessing bias in the estimation of causal effects: Rosenbaum
bounds on matching estimators and instrumental variables estimation with imperfect instruments.
Sociological Methodology, 34, 271–310.
Egeland, B., & Hiester, M. (1995). The long-term consequences of infant day-care and mother-infant
attachment. Child Development, 66, 474–485.
Farmer, A. D., Bierman, K. L., & The Conduct Problems Prevention Research Group (2002). Predic-
tors and consequences of aggressive-withdrawn problem profiles in early grade school. Journal of
Clinical Child and Adolescent Psychology, 31, 299–311.
Field, T., Masi, W., Goldstein, S., & Perry, S. (1988). Infant day care facilitates preschool social behav-
ior. Early Childhood Research Quarterly, 3, 341–359.
Frank, K. A. (2000). Impact of a confounding variable on the inference of a regression coefficient.
Sociological Methods and Research, 29, 147–194.
Frank, K. A., Sykes, G., Anagnostopoulos, D., Cannata, M., Chard, L., Krause, A., & McCrory, R.
(2008). Extended influence: National board certified teachers as help providers. Education, Evalua-
tion, and Policy Analysis, 30, 3–30.
Gelman, A., & Hill, J. (2007). Data analysis using regression and multilevel/hierarchical models.
New York, NY: Cambridge University Press.
Greenland, S. (1996). Basic methods for sensitivity analysis of biases. International Journal of Epidemi-
ology, 25, 1107–1116.
Greenland, S., Robins, J. M., & Pearl, J. (1999). Confounding and collapsibility in causal inference. Sta-
tistical Science, 14, 29–46.
Greenstein, T. N. (1993). Maternal employment and child behavioral outcomes: A household econom-
ics analysis. Journal of Family Issues, 14, 323–354.
Gustafson, P., McCandless, L., Levy, A., & Richardson, S. (2010). Simplified Bayesian sensitivity analy-
sis for mismeasured and unobserved confounders. Biometrics, 66, 1129–1137.
Han, W.-J., Waldfogel, J., & Brooks-Gunn, J. (2001). The effects of early maternal employment on
later cognitive and behavioral outcomes. Journal of Marriage and Family, 63, 336–354.
Harada, M. (2011). ISA: Stata module to perform Imbens’ (2003) sensitivity analysis. Boston, MA:
Statistical Software Components, Boston College Department of Economics.
Harada, M. (2012). GSA: Stata module to perform generalized sensitivity analysis. Boston, MA: Statisti-
cal Software Components, Boston College Department of Economics.
Harada, M. (2013). Generalized sensitivity analysis (Technical report). New York, NY: New York
University.
Harvey, E. (1999). Short-term and long-term effects of early parental employment on children of the
National Longitudinal Survey of Youth. Developmental Psychology, 35, 445–459.
Haskins, R. (1985). Public school aggression among children with varying day-care experience. Child
Development, 56, 689–703.
Hill, J. L., Weiss, C., & Zhai, F. (2011). Challenges with propensity score strategies in a high-dimen-
sional setting and a potential alternative. Multivariate Behavioral Research, 46, 477–513.
Hofferth, S. (1999). Child care in the first three years of life and preschoolers’ language and behavior.
Paper presented at the biennial meeting of the Society for Research in Child Development,
Albuquerque, NM.
Howes, C. (1988). Relations between early child care and schooling. Developmental Psychology, 24, 53–57.
U » N 0; s 2u
ZjX; U » N bz x C zz u; s 2z ¢xu (A1)
YjX; U; Z » N by x C zy u C tz; s 2y ¢xuz
Varying s 2u only changes the range of possible coefficients, not the qualitative results
of the sensitivity analysis. The mean and variance of YjX, Z are modified equivalently
to defining sensitivity parameters zy and zz by setting z0 D s uz. Thus, the shape of the
distribution of bt will be unchanged, but varying s 2u has implications for benchmarking
according to coefficients of X, because changing the assumed variance will change the rela-
tionship to the observed X. For this reason, we take s 2u D 1 and standardize all variables
in the analysis.
418 N. B. CARNEGIE ET AL.
This yields a joint distribution
¡ 1=2 u2
pðU; Y; ZjX Þ D ð2pÞ exp ¡
2
¡ 1=2 ðz ¡ bz x ¡ zz uÞ2
¢ 2ps 2z ¢xu exp ¡ (A2)
2s 2z ¢xu
!
¡ 1=2 ðy ¡ by x ¡ zy u ¡ tz Þ2
¢ 2ps y ¢xuz
2
exp ¡
2s 2y ¢xuz
After factoring pðU; Y; ZjX Þ, we obtain the following conditional distributions for U and
(Y, Z):
UjX; Z; Y » N mu ¢xzy ; s 2u ¢xzy
s 2y ¢xuz zz ðz ¡ bz xÞ C s 2z ¢xu zy ðy ¡ by x ¡ tz Þ
mu ¢xzy D
s 2z ¢xu s 2y ¢xuz C s 2y ¢xuz zz 2 C s 2z ¢xu zy2 (A3)
s 2z ¢xu s 2y ¢xuz
s 2u ¢xzy D
s 2z ¢xu s 2y ¢xuz C s 2y ¢xuz zz 2 C s 2z ¢xu zy2
and
P
Y; ZjX » N myz ¢x ; yz ¢x
myz ¢x D ðby x C tz; bz xÞ
" 2 # (A4)
s y ¢xuz C zy2 zy z z
Syz ¢x D
z y zz s 2z ¢xu C zz2
We see that the distribution in (A3) is a function of the residuals, conditional variances,
and coefficients from the underlying true models for Y and Z including U. Because U is
unobservable, we would like to rewrite this distribution in terms of what we can observe in
our data, plus sensitivity parameters that the analyst can vary. To do so, we need the condi-
tional distributions of Y and Z without U. We can obtain the distribution of Z given X in a
similar manner to that of pðY; ZjX Þ, which yields
ZjX » N bz x; s 2z ¢xu C zz 2 (A5)
Thus, the observed residuals ~z have mean z ¡ bzx and variance s 2z~ D s 2z ¢xu C zz2 .
ASSESSING SENSITIVITY TO UNMEASURED CONFOUNDING 419
From the bivariate normal distribution of Y and Z given X in A4, we see that the distribu-
tion of Y given Z and X is
!
zy z z s 2z ¢xu s 2y ¢xuz C s 2y ¢xuz zz 2 C s 2z ¢xu zy2
YjX; Z » N b x C tz C 2
y
ðz ¡ b z
x Þ; (A6)
s z ¢xu C zz 2 s 2z ¢xu C zz 2
This implies that the residuals ~y from a linear model of Y as a function only of the observed
s 2z ¢xu s 2y ¢xuz C s 2y ¢xuz zz 2 C s 2z ¢xu zy 2
data (Z, X) have mean ðy ¡ by x C tz Þ ¡ c z~and variance s 2y~ D s 2z ¢xu C zz 2
. Note
that c also quantifies the bias in the estimate of t for given coefficients z and z , with denom- y z
inator given by s 2z~. Thus, in this limited case we can directly calculate the bias in b
t associated
with particular values of zz and zy as a function of the variance in ~z .
Given these distributions, we can now rewrite the conditional distribution of U in terms
of observables and the coefficients of U:
UjX; Z; Y » N mu ¢ xzy; s 2u ¢ xzy
s ~2y zz z~ C s ~2z ¡ zz2 zy y~
mu ¢ xzy D
s~2z s 2y~ (A7)
0 1
1 s 2
~
z s 2
y ¡ zy 2
C z z
z2 y2
s 2u ¢ xzy D 2 2 @ A s 2 ¡ zz2
z~
s ~z s ~y s ~z
2