Professional Documents
Culture Documents
accidental death
accidental death
DOI: 10.1002/da.22882
RESEARCH ARTICLE
Conclusions: RDoC symptom scores parsed from clinical documentation are associated with
suicide and illustrates that multiple domains contribute to risk in a time-varying fashion.
KEYWORDS
accidental death, electronic health records, natural language processing, Research Domain
Criteria, suicide, survival analysis
very large cohort of consecutive hospital discharges (McCoy, Castro, TA B L E 1 Examples of tokens (terms) loading on each RDoC domain
Roberson, Snapper, & Perlis, 2016) from two large general hospitals. RDoC domain Example terms
The present work extends that work to include specific RDoC-
Arousal and regulatory Vigilance, sexual abuse, reactivity, theta,
informed phenotypes, rather than narrative documentation in general. rem sleep
Moreover, it seeks to quantify risk by examining such phenotypes for Cognitive Impulse, attentive, attention, distracted,
the first time in a general medical-surgical hospital population (McCoy forget
et al., 2015; McCoy, Yu, et al., 2018). In particular, we hypothesized Negative Depressive disorder, panic attack, nervous,
palpitation, worried
that multiple RDoC domains, and particularly positive and negative
Positive Drink, intoxication, gamble, with alcohol,
valence, would be associated with differential risk of suicide and
heroin
accidental death in this large general hospital cohort.
Social Misunderstand, inappropriate, threaten,
engaged, empathic
RDoC, Research Domain Criteria.
2.3 Outcome
3 RESULTS
For all individuals in the cohort and identified as deceased on the basis
of the Federal social security registry, we queried public death cer- The cohort as a whole included 815,457 admissions for 444,317 indi-
tificate data available from the Massachusetts Department of Public viduals. Median follow-up was 1,793 days (interquartile range 963–
Health in order to determine reported (coded) cause of death in terms 2,760), representing a total of 2,262,588 person-years of follow-up.
of international classification of disease codes. Consistent with prior There were a total of 1,982 suicides or accidental deaths during
MCCOY ET AL . 3
TA B L E 2 Characterization of study cohort overall and stratified by primary outcome (suicide or accidental death)
follow-up, including 513 within 180 days of discharge, 1,137 within 4 DISCUSSION
2 years, and 1,758 within 5 years of discharge. Characteristics of the
cohort, including those who did or did not experience the endpoint, are In this investigation of 444,317 hospitalized individuals across two
summarized in Table 2. The outcome was observed most often in those large academic medical centers, we identified modest but significant
who were male, those who were white, and those with public insurance associations between each of the five eRDoC scores and risk of suicide
(Table 2). or accidental death over more than 2.2 million person-years of follow-
Kaplan–Meier survival curves for the cohort, comparing top versus up. Among these, greatest magnitude of risk was associated with posi-
bottom quintile for each eRDoC domain, are illustrated in Figure 1A–E, tive valence, particularly during the first 1.5 years of follow-up; while in
with shading indicating 95% confidence intervals. Importantly, to facil- general risk peaked in the first 90 days and subsequently decayed for
itate readability, the y-axis of each of these is restricted, reflecting the all domains, negative valence represented an exception to this pattern,
modest absolute risk for suicide in the cohort as a whole. Quintiles dif- with a gradual increase in risk after 1 year that plateaued by 2 years.
fered significantly (corrected P < 0.001) for all five eRDoC domains in In a prior meta-analysis of suicide risk factors categorized according
analyses stratified for age, sex, and race. Plots of instantaneous hazard to RDoC domains, the majority were associated with negative valence
of the outcome in time-varying models adjusted for age, sex, and race (Glenn et al., 2018), although—consistent with our results—effect sizes
are shown over the first year of follow-up in Figure 2A–E. Early haz- for other domains were generally similar. Our results provide further
ard predominates in all five eRDoC domains but the highest and low- support for the importance of negative valence, but also indicate the
est risk quintiles hazard remains separated in longer term follow-up potential utility of further investigation of additional RDoC domains.
(Figure 2). More broadly, these results indicate the feasibility of characterizing
To facilitate comparison of the size of effect associated with strati- these domains in nonpsychiatric populations; although prediction of
fication into the top versus the bottom quintile, we compared the ratio risk in known psychiatric cohorts is also important, the majority of sui-
of cumulative events in the top versus the bottom quintile of each cide attempts occur among individuals who recently sought contact
domain over the follow-up period (Figure 3A) with bootstrap 95% con- with nonpsychiatrist caregivers (Luoma et al., 2002), and investigation
fidence intervals at selected time points (Figure 3B). The magnitude of of such cohorts has been less extensive (Ursano et al., 2018).
effect is greatest for positive valence, although by 2 years similar risk is Perhaps less evident, but equally notable, we identified differential
observed for negative valence. In aggregate, while Figure 1 illustrates effects of these domains over time, particularly for negative valence.
modest absolute change in risk, Figure 3 illustrates substantial relative Although the basic notion that short- and long-term suicide risk fac-
change in risk, with three- to five-fold greater hazard in the top com- tors may be different is frequently discussed, our results indicate the
pared to the bottom quintile for each domain. complexity of this relationship. Although in general the risk associated
In sensitivity analysis (not shown), we also examined a more with each of these domains diminishes with time, likely indicating that
restricted outcome, the 232 explicitly documented suicides during these measures are state characteristics rather than trait characteris-
follow-up. In this more limited analysis, the direction of effect was sim- tics, the rate of decay varies. This complexity may help to guide future
ilar to the primary analysis; however, the difference between highest studies and interventions, with an effort to match assessments and
and lowest risk quintile of the cognitive domain and arousal and reg- interventions during the postdischarge period. That is, recognizing that
ulatory domain was no longer statistically significant. In all five cases, the greatest contributors to risk change over time, multiple strategies
by inspection, the period of greatest hazard was shifted earlier in the to reduce suicide risk may be required depending on whether short,
follow-up period. Likewise, a sensitivity analysis examining quartiles intermediate, or longer term risk is being considered.
rather than quintiles led to similar results (see Supporting Information We note multiple important considerations in interpreting our
figure). result. First, the extent to which natural language processing efforts
4 MCCOY ET AL .
F I G U R E 1 Kaplan–Meier curves contrasting highest and lowest quintile of arousal and regulatory (a), cognitive (b), negative (c), positive (d), and
social (e) eRDoC domain scores time to primary outcome. Note Y-axis is truncated for clarity.
generalize across health systems remains to be established, and rep- tion (Filannino et al., 2017). Such templated text could also intro-
resents a challenge for all such studies. The eRDoC algorithm used duce bias if associated with particular clinicians caring for higher risk
does not incorporate negation (see, for example, Chapman, Bridewell, patients; however, in this academic medical center-based cohort, nar-
Hanbury, Cooper, & Buchanan, 2001), so “psychosis” and “absence rative notes are nearly always written by house officers who rotate
of psychosis” would have the same impact on eRDoC score. Nega- on and off service, leading to essentially random distribution. Sec-
tion was not included in the original validation of the eRDoC algo- ond, it bears emphasis that the estimated domain scores are not prox-
rithm (McCoy, Yu, et al., 2018). Negation is an interesting avenue ies or substitutes for actual assessment of individual symptoms or
for future research; however, experience with manual narrative note RDoC experimental paradigms; it is entirely possible that more focused
annotation suggests the majority of negated concepts appear in tem- assessment (for example, of cognitive symptoms) would identify larger
plated text such as review of systems and added little to classifica- effects. Indeed, the present study represents a starting point for
MCCOY ET AL . 5
F I G U R E 2 Hazard curves from Royston–Parmar time-varying parametric survival model contrasting highest and lowest quintile of arousal and
regulatory (a), cognitive (b), negative (c), positive (d), and social (e) eRDoC domain scores
6 MCCOY ET AL .
F I G U R E 3 Comparison between eRDoC domains (color) ratio of within domain ratio of cumulative event in the highest and lowest quintile of
eRDoC domain score over time with LOESS curve (a) and within bootstrap 95% confidence intervals (b)
improvement. Third, our outcome—suicide and accidental death—is catchment areas. Although these are “open” health systems insofar as
drawn from death indices that may be prone to misclassification. participants may get care elsewhere, the mortality outcome is estab-
Indeed, we combine these two groups because in prior work we have lished by death certificates and national death index data. Second,
observed substantial elevation in risk of both outcomes among indi- the clinical measures are drawn from routine clinical care and do
viduals with psychiatric diagnoses (McCoy et al., 2016). On the other not require any additional administration of research measures or
hand, we would expect that such misclassification would lead us to paradigms, as such these scores could be delivered in a clinical setting
underestimate effect sizes by introducing greater heterogeneity (i.e., without changes to clinical workflow. Finally, temporal dependency of
if true suicides are classed as other causes of death, and vice versa). results and the ability to target potentially modifiable eRDoC symp-
Finally, while these models do improve risk stratification, they toms offer appealing avenues for future study as compared to fixed
require further work before clinical deployment is likely to be useful: demographic traits.
the goal of this report is to highlight the usefulness of natural language
processing and RDoC domains, not to present an actionable risk model.
Although suicide remains among the most studied outcome of psychi- 5 CONCLUSION
atric illness, and its contribution to mortality continues to increase,
the absolute rate of suicide in this clinical population remains low. These results indicate that estimates of dimensional psychopathology
On the other hand, this analysis also has relevant strengths. First, it derived from natural language processing applied to hospital discharge
reflects a large and highly generalizable pair of cohorts drawn from two documentation are associated with suicide and accidental death.
different general hospital systems with distinct clinical cultures and They suggest the utility of pursuing more sensitive measures of such
MCCOY ET AL . 7
dimensions in order to better understand and predict this most feared neuropsychiatric clinical records: Overview of 2016 CEGS N-GRID
psychiatric outcome. More broadly, they add to a growing literature shared tasks Track 2. Journal of Biomedical Informatics, 75S, S62–S70.
https://doi.org/10.1016/j.jbi.2017.04.017
supporting the use of narrative clinical notes to capture essential
Franklin, J. C., Ribeiro, J. D., Fox, K. R., Bentley, K. H., Kleiman, E. M., Huang,
clinical data not reflected in standard administrative data.
X., … Nock, M. K. (2017). Risk factors for suicidal thoughts and behav-
iors: A meta-analysis of 50 years of research. Psychological Bulletin,
ACKNOWLEDGMENTS 143(2), 187–232. https://doi.org/10.1037/bul0000084
This study was funded by the National Institute of Mental Health (grant Glenn, C. R., Kleiman, E. M., Cha, C. B., Deming, C. A., Franklin, J. C., & Nock,
M. K. (2018). Understanding suicide risk within the Research Domain
numbers R56MH115187; R01MH106577) and the Brain and Behavior
Criteria (RDoC) framework: A meta-analytic review. Depression and Anx-
Foundation. The sponsors had no role in study design, writing of the iety, 35(1), 65–88. https://doi.org/10.1002/da.22686
report, or data collection, analysis, or interpretation.
Glenn, C. R., & Nock, M. K. (2014). Improving the short-term pre-
diction of suicidal behavior. American Journal of Preventive Medicine,
AUTHOR CONTRIBUTIONS 47(3 Suppl 2), S176–S180. https://doi.org/10.1016/j.amepre.2014.06.
004
T.H.M. drafted and revised the manuscript, planned the experiments,
Gvion, Y., & Levi-Belz, Y. (2018). Serious suicide attempts: Systematic
developed the original software, and analyzed data. A.M.P. assisted
review of psychological risk factors. Frontiers in Psychiatry, 9, 56.
in manuscript revision and submission. R.H.P. drafted and revised the https://doi.org/10.3389/fpsyt.2018.00056
manuscript, planned the experiments, oversaw data collection and Hawton, K., Casañas I Comabella, C., Haw, C., & Saunders, K. (2013).
characterization, and advised on analysis. Risk factors for suicide in individuals with depression: A systematic
review. Journal of Affective Disorders, 147(1–3), 17–28. https://doi.org/
10.1016/j.jad.2013.01.004
CONFLICT OF INTERESTS
Heron, M. (2018). Deaths: Leading causes for 2016. National Vital Statis-
T.H.M. receives research funding from The Stanley Center at the Broad tics Reports: From the Centers for Disease Control and Prevention,
Institute, Brain and Behavior Foundation, and Telefonica Alfa. R.H.P. National Center for Health Statistics, National Vital Statistics System, 67(6),
holds equity in Psy Therapeutics, serves on the scientific advisory 1–77.
board of Genomind and Psy Therapeutics, and consults to RID Ven- Insel, T. R., & Cuthbert, B. N. (2015). Medicine. Brain disorders? Pre-
cisely. Science, 348(6234), 499–500. https://doi.org/10.1126/
tures. R.H.P. receives research funding from NIMH, NHLBI, NHGRI,
science.aab2358
and Telefonica Alfa. A.M.P. has no disclosures to report.
Insel, T. R., Cuthbert, B. N., Garvey, M., Heinssen, R., Pine, D. S., Quinn,
K., … Wang, P. (2010). Research Domain Criteria (RDoC): Toward
ORCID a new classification framework for research on mental disorders.
The American Journal of Psychiatry, 167(7), 748–751. https://doi.org/
Thomas H. McCoy Jr. https://orcid.org/0000-0002-5624-0439 10.1176/appi.ajp.2010.09091379
Amelia M. Pellegrini https://orcid.org/0000-0001-9312-7673
Le-Niculescu, H., Levey, D. F., Ayalew, M., Palmer, L., Gavrin, L. M., Jain, N.,
Roy H. Perlis https://orcid.org/0000-0002-5862-6757 … Niculescu, A. B. (2013). Discovery and validation of blood biomarkers
for suicidality. Molecular Psychiatry, 18(12), 1249–1264. https://doi.org/
REFERENCES 10.1038/mp.2013.95
Barak-Corren, Y., Castro, V. M., Javitt, S., Hoffnagle, A. G., Dai, Y., Perlis, R. H., Luoma, J. B., Martin, C. E., & Pearson, J. L. (2002). Contact with men-
… Reis, B. Y. (2017). Predicting suicidal behavior from longitudinal elec- tal health and primary care providers before suicide: A review of
tronic health records. The American Journal of Psychiatry, 174(2), 154– the evidence. The American Journal of Psychiatry, 159(6), 909–916.
162. https://doi.org/10.1176/appi.ajp.2016.16010077 https://doi.org/10.1176/appi.ajp.159.6.909
Chapman, W. W., Bridewell, W., Hanbury, P., Cooper, G. F., & Buchanan, B. McCoy, Jr., T. H., Castro, V. M., Hart, K. L., Pellegrini, A. M., Yu, S.,
G. (2001). A simple algorithm for identifying negated findings and dis- Cai, T., & Perlis, R. H. (2018). Genome-wide association study
eases in discharge summaries. Journal of Biomedical Informatics, 34(5), of dimensional psychopathology using electronic health records.
301–310. https://doi.org/10.1006/jbin.2001.1029 Biological Psychiatry, 83(12), 1005–1011. https://doi.org/10.1016/
Chung, D. T., Ryan, C. J., Hadzi-Pavlovic, D., Singh, S. P., Stanton, C., & Large, j.biopsych.2017.12.004
M. M. (2017). Suicide rates after discharge from psychiatric facilities: McCoy, Jr., T. H., Castro, V. M., Roberson, A. M., Snapper, L. A., &
A systematic review and meta-analysis. JAMA Psychiatry, 74(7), 694. Perlis, R. H. (2016). Improving prediction of suicide and acciden-
https://doi.org/10.1001/jamapsychiatry.2017.1044 tal death after discharge from general hospitals with natural lan-
Cuthbert, B. N. (2014). The RDoC framework: Facilitating transition guage processing. JAMA Psychiatry, 73(10), 1064–1071. https://doi.org/
from ICD/DSM to dimensional approaches that integrate neuroscience 10.1001/jamapsychiatry.2016.2172
and psychopathology. World Psychiatry, 13(1), 28–35. https://doi.org/ McCoy, Jr., T. H., Castro, V. M., Rosenfield, H. R., Cagan, A., Kohane,
10.1002/wps.20087 I. S., & Perlis, R. H. (2015). A clinical perspective on the rele-
Fanelli, G., & Serretti, A. (2019). The influence of the serotonin transporter vance of Research Domain Criteria in electronic health records.
gene 5-HTTLPR polymorphism on suicidal behaviors: A meta-analysis. The American Journal of Psychiatry, 172(4), 316–320. https://doi.org/
Progress in Neuro-Psychopharmacology & Biological Psychiatry, 88, 375– 10.1176/appi.ajp.2014.14091177
387. https://doi.org/10.1016/j.pnpbp.2018.08.007 McCoy, Jr., T. H., Yu, S., Cai, T., & Perlis, R. H. (n.d.). CQH Dimen-
Filannino, M., Stubbs, A., McCoy, T. H., Rosenquist, J. N., Brown, H. E., sional Phenotyper. Retrieved from https://github.com/thmccoy/
Perlis, R. H., & Uzuner, Ö (2017). Symptom severity prediction from CQH-Dimensional-Phenotyper
8 MCCOY ET AL .
McCoy, Jr., T. H., Yu, S., Hart, K. L., Castro, V. M., Brown, H. E., Rosen- Schoenfeld, D. (1980). Chi-squared goodness-of-fit tests for the pro-
quist, J. N., … Perlis, R. H. (2018). High throughput phenotyping portional hazards regression model. Biometrika, 67(1), 145–153.
for dimensional psychopathology in electronic health records. Biolog- https://doi.org/10.2307/2335327
ical Psychiatry, 83(12), 997–1004. https://doi.org/10.1016/j.biopsych. Stone, D. M., Simon, T. R., Fowler, K. A., Kegler, S. R., Yuan, K., Holland, K.
2018.01.011 M., … Crosby, A. E. (2018). Vital signs: Trends in state suicide rates—
Murphy, S. N., Mendis, M., Hackett, K., Kuttan, R., Pan, W., Phillips, L. C., … United States, 1999–2016 and circumstances contributing to suicide—
Chueh, H. C. (2007). Architecture of the open-source clinical research 27 States, 2015. MMWR Morbidity and Mortality Weekly Report, 67(22),
chart from Informatics for Integrating Biology and the Bedside. AMIA 617–624. https://doi.org/10.15585/mmwr.mm6722a1
Annual Symposium Proceedings, 2007, 548–552. Turecki, G., & Brent, D. (2015). Suicide and suicidal behavior.
Murphy, S. N., Weber, G., Mendis, M., Gainer, V., Chueh, H. C., Churchill, The Lancet, 387(10024), 1227–1239. https://doi.org/10.1016/
S., & Kohane, I. (2010). Serving the enterprise and beyond with S0140-6736(15)00234-2
informatics for integrating biology and the bedside (i2b2). Journal of Ursano, R. J., Kessler, R. C., Naifeh, J. A., Herberman Mash, H. B.,
the American Medical Informatics Association: JAMIA, 17(2), 124–130. Nock, M. K., Aliaga, P. A., … Stein, M. B. (2018). Risk factors associ-
https://doi.org/10.1136/jamia.2009.000893 ated with attempted suicide among US Army soldiers without a his-
Niculescu, A. B., Levey, D., Le-Niculescu, H., Niculescu, E., Kurian, S., & tory of mental health diagnosis. JAMA Psychiatry, 75(10), 1022–1032.
Salomon, D. (2015). Psychiatric blood biomarkers: Avoiding jumping to https://doi.org/10.1001/jamapsychiatry.2018.2069
premature negative or positive conclusions. Molecular Psychiatry, 20(3),
286–288.
Rossen, L. M., Hedegaard, H., Khan, D., & Warner, M. (2018). County-
level trends in suicide rates in the U.S., 2005–2015. American Jour- SUPPORTING INFORMATION
nal of Preventive Medicine, 55(1), 72–79. https://doi.org/10.1016/
Additional supporting information may be found online in the Support-
j.amepre.2018.03.020
ing Information section at the end of the article.
Royston, P., & Parmar, M. K. B. (2002). Flexible parametric proportional-
hazards and proportional-odds models for censored survival
data, with application to prognostic modelling and estimation How to cite this article: McCoy TH, Jr, Pellegrini AM, Perlis
of treatment effects. Statistics in Medicine, 21(15), 2175–2197.
RH. Research Domain Criteria scores estimated through
https://doi.org/10.1002/sim.1203
natural language processing are associated with risk for
Runeson, B., Odeberg, J., Pettersson, A., Edbom, T., Jildevik Adamsson, I., &
Waern, M. (2017). Instruments for the assessment of suicide risk: A sys- suicide and accidental death. Depress Anxiety. 2019;1–8.
tematic review evaluating the certainty of the evidence. PLoS One, 12(7), https://doi.org/10.1002/da.22882
e0180292. https://doi.org/10.1371/journal.pone.0180292