Professional Documents
Culture Documents
Are Prediction Models For Vaginal Birth After Cesarean Accurate. Am J Obstet Gynecol 2019
Are Prediction Models For Vaginal Birth After Cesarean Accurate. Am J Obstet Gynecol 2019
org
OBSTETRICS
Are prediction models for vaginal birth after
cesarean accurate?
Benjamin S. Harris, MD, MPH; R. Phillips Heine, MD; Jinyoung Park, MA; Keturah R. Faurot, MPH, PhD; Maeve K. Hopkins, MD;
Andrew J. Rivara, MD; Hanna R. Kemeny, BS; Chad A. Grotegut, MD, MSc, MBA; J. Eric Jelovsek, MD, MMEd
BACKGROUND: The use of trial of labor after cesarean delivery cal- range of threshold probabilities for which the best prediction model would
culators in the prediction of successful vaginal birth after cesarean delivery be of clinical value.
gives physicians an evidence-based tool to assist with patient counseling RESULTS: Four hundred four women met the eligibility criteria. The
and risk stratification. Before deployment of prediction models for routine observed rate of successful vaginal birth after cesarean delivery was 75%
care at an institutional level, it is recommended to test their performance (305/404). Concordance indices were 0.717 (95% confidence interval,
initially in the institution’s target population. This allows the institution to 0.659e0.778), 0.703 (95% confidence interval, 0.647e0.758), and
understand not only the overall accuracy of the model for the intended 0.727 (95% confidence interval, 0.669e0.779), respectively. Brier scores
population but also to comprehend where the accuracy of the model is were 0.172, 0.205, and 0.179, respectively. Calibration demonstrated
most limited when predicting across the range of predictions (calibration). that Grobman 2007 and Metz vaginal birth after cesarean delivery models
OBJECTIVE: The purpose of this study was to compare 3 models that were most accurate when predicted probabilities were >60% and were
predict successful vaginal birth after cesarean delivery with the use of a beneficial for counseling women who did not desire to have vaginal birth
single tertiary referral cohort before continuous model deployment in the after cesarean delivery but had a predicted success rates of 60e90%. The
electronic medical record. models underpredicted actual probabilities when predicting success at
STUDY DESIGN: All cesarean births for failed trial of labor after ce- <60%. The Grobman 2007 and Metz vaginal birth after cesarean delivery
sarean delivery and successful vaginal birth after cesarean delivery at an models provided greatest net benefit between threshold probabilities of
academic health system between May 2013 and March 2016 were 60e90% but did not provide a net benefit with lower predicted proba-
reviewed. Women with a history of 1 previous cesarean birth who un- bilities of success compared with a strategy of recommending vaginal birth
derwent a trial of labor with a term (37 weeks gestation), cephalic, and after cesarean delivery for all women .
singleton gestation were included. Women with antepartum intrauterine CONCLUSION: When 3 commonly used vaginal birth after cesarean
fetal death or fetal anomalies were excluded. The probability of successful delivery prediction models are compared in the same population, there are
vaginal birth after cesarean delivery was calculated with the use of 3 differences in performance that may affect an institution’s choice of which
prediction models: Grobman 2007, Grobman 2009, and Metz 2013 and model to use.
compared with actual vaginal birth after cesarean delivery success. Each
model’s performance was measured with the use of concordance indices, Key words: calculator, calibration, decision curve analysis, prediction
Brier scores, and calibration plots. Decision curve analysis identified the model, risk, TOLAC, validation, VBAC model
TABLE 2
Comparison of model performance in the validation cohort
Published concordance index Validation concordance
Model N (95% confidence interval) index (95% confidence interval) Validation Brier score
9
Grobman et al, 2007 7666 0.754 (0.742e0.755) 0.717 (0.659e0.778) 0.172
Grobman et al, 200910 9616 0.774 (0.764e0.784) 0.703 (0.647e0.758) 0.205
1
Metz et al, 2013 1170 0.80 (0.76e0.84) 0.727 (0.669e0.779) 0.179
Harris et al. External validation of VBAC prediction models. Am J Obstet Gynecol 2019.
Results
Four hundred four women met eligi-
bility criteria. The observed rate of suc-
cessful VBAC was 75% (305/404).
Women in the successful VBAC cohort
were predominantly Hispanic (37.1%)
and had normal body mass indexes
(38.3%). Fifty-four percent of the par-
ticipants with successful VBAC had a
history of vaginal delivery (Table 1).
Twenty-nine percent of those with suc-
There was poor calibration of the model in the study cohort. Most of the predictions were >80% cessful VBAC had recurrent indication
predicted probability with few predictions below this threshold. Additionally, predicted probabilities for cesarean delivery compared with
from 0.6e1.0 were lower than actual proportions of successful vaginal birth after cesarean delivery. 52% among patients with failed TOLAC
The rug plot allows for visually quantitating the relative number of persons with and without the (P<.001). Among those women who
outcome across the range of predictions.
were admitted for a TOLAC, the rate of
VBAC, vaginal birth after cesarean delivery.
successful VBAC was higher among
Harris et al. External validation of VBAC prediction models. Am J Obstet Gynecol 2019.
women who were in labor compared
Comment
Main findings
This study demonstrated the value of
Most of the predicted probabilities were lower than the actual proportions in women with successful understanding and comparing the per-
vaginal birth after cesarean delivery. The predictions were less accurate at lower predicted prob- formances of 3 commonly used VBAC
abilities (0.1e0.4 and 0.5e0.7) compared with higher probabilities (>0.7). The rug plot allows for models in an institutional cohort before
visually quantitating the relative number of persons with and without the outcome across the range of routine deployment. Although these
predictions.
findings are specific to a single health
VBAC, vaginal birth after cesarean delivery.
Harris et al. External validation of VBAC prediction models. Am J Obstet Gynecol 2019.
system, we recommend other in-
stitutions undertake similar institution-
specific studies before widely applying
with those who were induced (225/285 depicted in Figure 2. There was poor any prediction model for routine care.
[79%] vs 79/119 [66%]; P¼.006). calibration of the model in the study These results led our institution to
The Grobman 2007 model validation cohort. The majority of predictions were conclude the following: (1) We can
concordance index was 0.717 (95% >80% predicted probability, with few reassure clinicians and our institutional
confidence interval [CI], 0.659e0.778), predictions below this threshold. Addi- leadership that the Grobman 2007 and
and Brier score was 0.172.9 The tionally, predicted probabilities of Metz VBAC models are accurate when
discriminatory ability was similar to the 0.6e1.0) were lower than actual pro- probabilities are >60% and that prob-
published concordance index of 0.754 portions of successful VBAC (Figure 2). abilities in this range can be commu-
(95% CI, 0.742e0.755; Table 2). The The Metz 2013 model sample nicated confidently to patients and their
calibration curve is depicted in Figure 1. concordance index was 0.727 (95% CI, care providers to inform decisions. This
The predicted probabilities were lower 0.669e0.779), and Brier score was provides useful information to patients
than the actual proportions of women 0.179.1 This discriminatory ability was who do not desire to attempt VBAC but
with successful VBAC among all pre- similar to the published concordance have predicted success rates of
dictions. The predictions were less ac- index of 0.800 (95% CI, 0.760e0.840; 60e90%. (2) None of the models were
curate at lower predicted probabilities Table 2). The calibration curve is depic- accurate when probabilities were
(0.2e0.6) compared with higher prob- ted in Figure 3. Performance was similar <60%. For example, providing a pre-
abilities (>0.6; Figure 1). For example, at to the Grobman 2007 model. The ma- dicted risk of successful VBAC of 20%
a predicted probability of successful jority of predicted probabilities were may not be much different from a
VBAC of 50%, the observed VBAC fre- lower than the actual proportions in predicted probability of 50%. There-
quency was 60% in our population. women with successful VBAC. The pre- fore, it would not be beneficial and
The Grobman 2009 model sample dictions were less accurate at lower pre- could potentially be harmful if these
concordance index was 0.703 (95% CI, dicted probabilities (0.1e0.4 and probabilities were used to make
0.647e0.758), and Brier score was 0.5e0.7) compared with higher proba- threshold decisions in this lower success
0.205.10 This discriminatory ability was bilities (>0.7) range. We plan to modify the displayed
lower than the published concordance Figure 4 allows comparison of the 3 result of the calculator to report that
index of 0.774 (95% CI, 0.764e0.784; decision curves for each model. There “this patient’s predicted probability is
Table 2). The calibration curve is are 5 curves: (1) allow everyone to <60% and the model is not accurate in
this range.” We suggest other in- Implications of findings Strengths and limitations
stitutions study the model’s calibration Model performance at our institution The strengths of our study are the com-
in their population to understand and represented the natural history of model parison of commonly used models with
potentially avoid areas of miscalibra- application in our population. That is, the use of the same population and the
tion.3 The Grobman 2009 model had patients were not counseled routinely use of decision curve analysis to aid in
unacceptable performance in our pop- regarding TOLAC with the use of any of counseling patients as they approach
ulation, and we do recommend its use the published prediction models. At Duke decisions regarding TOLAC. Decision
at our institution. Similar studies at University during the study time period, curve analysis is a useful method to
other institutions may benefit users of the average number of annual deliveries compare prediction models when using
these models. was 3250. Our counseling practice is to the same validation cohort. It are also
useful in quantitating the overall net limitations, we believe that this process is predicting successful VBAC? Evid Based Pract
benefit to patients when decisions are a similar and a feasible way for in- 2015;18:5–6.
6. Warren JB, Hamilton A, Rugge B. What’s the
made with the use of a range of predicted stitutions to validate models developed best way to predict the success of a trial of labor
thresholds12 because clinicians and pa- by other researchers before deployment after a previous C-section? J Fam Pract
tients at different institutions are likely to at their own institution. 2015;64:E3–7.
have different thresholds. The Grobman 7. Van Buuren S. Multiple imputation of discrete
2007 model and Metz model added net Conclusion/future research and continuous data by fully conditional speci-
fication. Stat Methods Med Res 2007;16:
benefit when decisions are made using direction 219–42.
the model by a clinician or patient with In conclusion, institutions should use 8. Buuren SV, Groothuis-Oudshoorn K. MICE
predicted probabilities of 60e95%. similar methods to validate a prediction multivariate imputation by chained equations in
Therefore, predictions from both model’s performance before routine use in R. J Stat Soft 2011;45:1–76.
models are most useful for the patient their population. When multiple models 9. Grobman WA, Lai Y, Landon MB, et al.
Development of a nomogram for prediction of
who is predicted to have a >60% chance exist, this process can be used to choose vaginal birth after cesarean delivery. Obstet
of success but is unsure about attempting which model to deploy, establish baseline Gynecol 2007;109:806–12.
VBAC. In this case, the model could overall accuracy, and understand areas of 10. Grobman WA, Lai Y, Landon MB, et al. Does
reassure the patient about success. model miscalibration in their patient information available at admission for delivery
However, use of these model predictions population. We acknowledge that this re- improve prediction of vaginal birth after cesar-
ean? Am J Perinatol 2009;26:693–701.
below threshold probability of 60% do quires an institution to have expertise in 11. Rufibach K. Use of Brier score to assess
not provide any net benefit over allowing model recalibration. In our health system, binary predictions. J Clin Epidemiol 2010;63:
all patients to attempt VBAC. the Grobman 2007 and Metz VBAC 938–9.
This study has limitations. The vali- models were accurate when probabilities 12. Vickers AJ. Decision analysis for the evalu-
dation was retrospective by design, had a are >60%, potentially providing useful ation of diagnostic tests, prediction models and
molecular markers. Am Stat 2008;62:314–20.
smaller cohort sample compared with information to patients who do not desire 13. Van Calster B, Vickers AJ. Calibration of risk
the original validation studies, and relied to VBAC but have predicted success rates prediction models: impact on decision-analytic
on electronic medical record data entry of 60e90%. The 2 models are not accurate performance. Med Decis Making 2015;35:
by varying providers to best reflect when predicted success rates were <60%. 162–9.
routine clinical care. The data set does Clinicians are encouraged to test their 14. Metz TD, Allshouse AA, Faucett AM,
Grobman WA. Validation of a vaginal birth after
not include women who were eligible for VBAC model’s performance in the target cesarean delivery prediction model in women
a TOLAC but who declined and elected population and encourage systematic with two prior cesarean deliveries. Obstet
to proceed with an elective repeat ce- routine timepoints for checking the Gynecol 2015;125:948–52.
sarean delivery. Because of this, we are model’s performance over time. n 15. Chaillet N, Bujold E, Dubé E, Grobman WA.
not able to calculate the proportion of Validation of a prediction model for vaginal birth
after caesarean. J Obstet Gynaecol Can
eligible women at Duke University References 2013;35:119–24.
Health System who elected to attempt a 1. Metz TD, Stoddard GJ, Henry E, Jackson M, 16. Maykin MM, Mularz AJ, Lee LK,
trial of labor. Next, a power analysis was Holmgren C, Esplin S. Simple, validated vaginal Valderramos SG. Validation of a prediction
not conducted because, unfortunately, birth after cesarean delivery prediction model for model for vaginal birth after cesarean delivery
there are no generally accepted ap- use at the time of admission. Obstet Gynecol reveals unexpected success in a diverse Amer-
2013;122:571–8. ican population. AJP Rep 2017;7:e31–8.
proaches to estimate the sample size re- 2. Collins GS, Reitsma JB, Altman DG,
quirements for validation studies of Moons KG. Transparent reporting of a multi-
prediction models.2 It is possible that variable prediction model for individual prog- Author and article information
poor performance of the Grobman 2009 nosis or diagnosis (TRIPOD): the TRIPOD From the Department of Obstetrics and Gynecology, Duke
statement. Ann Intern Med 2015;162:55–63. University Health System (Drs Harris, Heine, Hopkins,
model was due to differences in how
3. Fagerberg MC, Marsal K, Kallen K. Predicting Rivara, Grotegut, and Jelovsek) and Duke University
cervical examination data were collected the chance of vaginal delivery after one cesarean School of Medicine (Ms Kemeny), Durham, NC; the
from the electronic health record in our section: validation and elaboration of a published Department of Physical Medicine and Rehabilitation,
cohort with respect to the original pub- prediction model. Eur J Obstet Gynecol Reprod University of North Carolina, Chapel Hill, NC (Dr Faurot
lication. Similar to previous studies, we Biol 2015;188:88–94. and Ms Park).
4. Dresang LT, Hampton A. Vaginal birth after Received Sept. 26, 2018; revised Jan. 5, 2019;
are also not able to account for the effect
cesarean (VBAC) calculator risks. Birth 2015;42: accepted Jan. 28, 2019.
of counseling, patient preference, or 379–80. The authors report no conflict of interest.
individual labor management styles 5. Edgoose J, Hampton A, Schrager S. Corresponding author: Benjamin S. Harris, MD, MPH.
on delivery outcomes.1 Despite these How good are online TOLAC calculators for benjamin.harris@duke.edu