Professional Documents
Culture Documents
The Accuracy of Substituted Judgments in Patients With Terminal Diagnoses
The Accuracy of Substituted Judgments in Patients With Terminal Diagnoses
Terminal Diagnoses
Daniel P. Sulmasy, OFM, MD, PhD; Peter B. Terry, MD, MA; Carol S. Weisman, PhD;
Deborah J. Miller, RN, PhD; Rebecca Y. Stallings, MHS; Margaret A. Vettese, RN, PhD;
and Karen B. Haller, RN, PhD
Background: Patients' loved ones often make end-of-life Ann Intern Med. 1998;128:621-629.
treatment decisions, but the accuracy of their substituted
From Georgetown University Medical Center, Washington, D.C.;
judgments and the factors associated w i t h accuracy are and the Johns Hopkins Medical Institutions, Baltimore, Mary-
poorly understood. land. For current author addresses, see end of text.
Objective: To assess the accuracy of judgments made by
surrogate decision makers; ascertain the beliefs, practices,
and clinical and sociodemographic factors associated w i t h
W hen patients face end-of-life decisions and
are unable to speak for themselves, loved
ones are often asked to make substituted judg-
accuracy of surrogates' decisions; assess the preferences of
patients for life-sustaining treatments; and compare dif- ments. Ethicists have argued that this is an impor-
ferences in accuracy across diagnoses. tant way of respecting patient autonomy (1, 2), but
preliminary studies indicate that substituted judg-
Design: Cross-sectional paired interviews.
ments may not be very accurate (3-11). This con-
Setting: Outpatient practices of three university hospitals. cern has led some observers to question the useful-
Patients: 250 patients w i t h terminal diagnoses of conges- ness of asking for substituted judgments (8, 12).
tive heart failure, AIDS, amyotrophic lateral sclerosis, lung If the concept of substituted judgment is to re-
cancer, and chronic obstructive pulmonary disease (50 pa- main clinically useful, health care professionals need
tient-surrogate pairs in each group) and 50 general med- a better sense of the circumstances under which
ical patients and their surrogates.
substituted judgments made by surrogates are accu-
Measurements: The accuracy of surrogate predictions rate or inaccurate. Such data may alert clinicians to
was measured by using scales based on 10 potential treat- instances in which patients are at risk for having
ments in each of three hypothetical clinical scenarios. their wishes misrepresented and may aid in the
Results: Preferences varied according t o mode of treat- design of education campaigns to improve the ac-
ment and scenario. On average, surrogates made correct curacy of substituted judgments.
predictions in 66% of instances. Accuracy was better for Little is known about the factors associated with
the permanent coma scenario than for the scenarios of accurate substituted judgments, especially among
severe dementia or coma w i t h a small chance of recovery
patients with terminal diagnoses (8, 11). In a pilot
{P < 0.001). In a binary logit model, the accuracy of substi-
tuted judgments was positively associated w i t h the patient
study of 50 general medical patients (13), we found
having spoken w i t h the surrogate about end-of-life issues that accuracy was positively associated with discus-
(odds ratio [OR], 1.9 [95% CI, 1.6 t o 2.3]), the patient sions between patients and surrogates and nega-
having private insurance (OR, 1.4 [CI, 1.1 t o 1.7]), the tively associated with religiosity. Building on this
surrogate's level of education (OR, 1.5 [CI, 1.2 to 1.9]), and experience, we studied 250 terminally ill patients
the patient's level of education (OR, 1.7 [CI, 1.4 t o 2.2]). and their legal surrogates and a comparison group
Accuracy was negatively associated w i t h the patient's be- of 50 general medical patients and their surrogates.
lief that he or she would live longer than 10 years (OR, 0.6 Our objectives were to 1) assess the accuracy of the
[CI, 0.5 t o 0.7]), surrogate experience w i t h life-sustaining substituted judgments made by the surrogates; 2)
treatment (OR, 0.4 [CI, 0.3 t o 0.5]), surrogate participation
ascertain the beliefs, practices, and clinical and so-
in religious services (OR, 0.67 [CI, 0.50 t o 0.91]), and a
diagnosis of heart failure (OR, 0.6 [CI, 0.5 t o 0.8]). Age,
ciodemographic factors associated with accuracy; 3)
ethnicity, marital status, religion, and advance directives assess the preferences of patients for life-sustaining
were not associated w i t h accuracy. treatments; and 4) compare differences in accuracy
across diagnoses.
Conclusions: The accuracy of substituted judgments is
associated w i t h multiple clinically apparent patient and
surrogate factors. This information can help clinicians Methods
identify conditions under which substituted judgments are
likely t o be accurate or inaccurate and can help target Patients
populations for education designed t o improve the accu- Between November 1993 and February 1996, pa-
racy of surrogate decision making.
tients were recruited from the outpatient practices
of Johns Hopkins Hospital and the Hopkins Bay-
This paper is also available at http://www.acponline.org. view Medical Center, Baltimore, Maryland, and
© 1998 American College of Physicians 621
Georgetown University Medical Center, Washing- or after the patient interview. Patients and surro-
ton, D.C. Eligible patients were older than 17 years gates were asked not to speak with each other about
of age, spoke English, and had one of the following the interviews until both interviews were completed.
conditions known to be associated with a less than Informed consent was obtained from all partici-
50% chance of surviving for 2 years: New York pants. The study was approved by the institutional
Heart Association class III or IV congestive heart review boards of the Johns Hopkins Medical Insti-
failure with no possibility of transplantation (14), tutions and Georgetown University Medical Center.
advanced HIV infection according to the Centers
for Disease Control and Prevention case definition Interview Schedule
for AIDS (before the use of protease inhibitors) Structured interviews consisted of closed-ended
(15), amyotrophic lateral sclerosis with respiratory questions about sociodemographic factors, previous
compromise (16), unresectable non-small-cell lung discussions about end-of-life decisions, advance di-
cancer (17), or oxygen-dependent chronic obstruc- rectives, and past experiences. Patients were asked
tive pulmonary disease with dyspnea at rest (18, 19). about their treatment preferences, and surrogates
Treating physicians were asked to exclude patients were asked to predict patient preferences. Sociode-
whom they thought they were likely to live longer mographic information included age, sex, ethnic
than 2 years and patients who were cognitively in- group, exposure category for HIV infection, level of
capable of participating. The comparison group education, socioeconomic factors, insurance status,
comprised general medical patients who were older and religious beliefs and practices.
than 64 years of age, spoke English, and were ex- We elicited preferences for various medical in-
pected to live longer than 2 years. terventions in the setting of three hypothetical clin-
To have a power of 0.80 to detect a 15% differ- ical scenarios that would render patients unable to
ence between patient preferences and surrogate pre- make decisions for themselves: permanent coma,
dictions at a = 0.05, we needed 263 patient-surrogate coma with a small chance of recovery (the "small
pairs. Rounding up to 300 gave a power of 0.75 to chance" scenario), and severe dementia (Table 1).
detect a 25% difference in agreement between di- For each scenario, the patient was asked to report
agnostic groupings at a = 0.05. his or her preferences for intensive care, intubation,
Surrogates were chosen according to Maryland's cardiopulmonary resuscitation, feeding tube place-
legal hierarchy of surrogates for incapacitated pa- ment, nasotracheal suction, surgery, hemodialysis,
tients: durable power of attorney, guardian, spouse, esophagogastroduodenoscopy, phlebotomy, and chest
adult child, parent, sibling, other relative, or friend roentgenography. All scenarios and interventions
(20). For uniformity, surrogates of patients recruited were described briefly and simply to ensure under-
at Georgetown University Medical Center were also standing. For example, the feeding tube question
chosen according to the Maryland hierarchy. was worded, "Would you want to be fed by a feed-
Attending physicians were contacted 2 weeks be- ing tube (that is, a tube put through your nose or
fore scheduled patient visits. Patients were initially through the skin over your stomach)?"
informed of the general nature of the study and Requests for a limited trial of an intervention
asked to supply the phone number of the surrogate were interpreted as "yes" responses. Patients and
who was highest in Maryland's legal hierarchy. At surrogates were urged to give their best judgment.
the time of the appointment, details of the study "Unsure" responses were recoded as "yes" re-
were described and informed consent was obtained. sponses because they would be clinically interpreted
Surrogates were interviewed within 48 hours before that way. The interview schedules had been previ-
ously developed and tested for reliability (13) and
were based on the Medical Directive of Emanuel
Table 1 . Clinical Scenarios
and Emanuel (21). Slight modifications were made
Please imagine that each situation happens to you in addition to your for our protocol; the revised instruments showed
present medical condition. high internal consistency, with Kuder-Richardson
Scenario 1: Imagine that you are permanently unconscious and, in the
opinion of your physicians, have no hope of waking up and can feel statistics of 0.97 for the permanent coma scenario,
no pain. 0.94 for the small chance scenario, and 0.93 for the
Scenario 2: Imagine that you are in a coma and can feel no pain. The
physicians think that with aggressive treatment: severe dementia scenario. The complete set of in-
You have a high chance of dying (4 in 5). struments is available from the authors on request.
You have a small chance of coming out of the coma but not being able
to speak or understand others (1 in 5).
You have a very small chance of coming out of the coma and being Surrogate Accuracy in Matching Patient
able to speak and understand others (less than 1 in 100). Preferences Scale Scores
Scenario 3: Imagine that you are not in a coma, but you develop a brain
disease (such as Alzheimer disease) that cannot be cured and renders We constructed a Surrogate Accuracy in Match-
you unable to recognize people or to speak. Assume that you can
feel pain. ing Patient Preferences Scale (SAMPPS) for each of
the three scenarios (SAMPPS-1 for permanent
622 15 April 1998 • Annals of Internal Medicine • Volume 128 • Number 8
coma, SAMPPS-2 for small chance, and SAMPPS-3 the SAMPPS-1 model to predict the SAMPPS-2
for severe dementia). This scale assigns a score of 1 and SAMPPS-3 scores.
when the surrogate's prediction correctly matches
the patient's preference for each scenario-treatment
pair. Mismatches are assigned a score of 0. The sum
Results
of the correct answers constitutes the SAMPPS
score. For each of the three scenarios, a perfect score
Participants
is 10 and a complete mismatch has a score of 0.
Interviews of both patient and surrogate were
Statistical Analysis completed for 75% of pairs that met study inclusion
Categorical variables were analyzed by using the criteria. Of interviews that were not completed,
chi-square test or the Fisher exact test. Two-tailed t- 51% were not completed because of patient refusal;
tests were used for paired continuous variables. 25% were not completed because of patient refusal
Analysis of variance was used to compare continu- to allow surrogate contact; 15% were not completed
ous variables across multiple groups. Differences be- because of surrogate refusal; and 9% were not com-
tween surrogate and patient demographic character- pleted for other reasons, chiefly our inability to
istics were examined by using the McNemar test for arrange for a surrogate interview within 48 hours of
paired comparisons. The McNemar test was also the patient interview. Compared with participants,
used to assess any pattern in mismatches between nonparticipants were older (68 and 60 years of age;
patient and surrogate preferences. Differences in P< 0.001), more likely to be female (65% and
patient preferences for treatment according to mo- 39%; P < 0.001), more likely to receive Medicare
dality or scenario were assessed by using the Coch- (24% and 10%; P < 0.001), and more likely to be in
ran Q test. the chronic obstructive pulmonary disease or gen-
Although K scores have often been reported in eral medicine groups (P < 0.001). Participants and
similar studies, we do not report them because of nonparticipants did not differ significantly by ethnic
concern about the appropriateness of K as a test of group. Patients and surrogates were interviewed on
surrogate accuracy (11, 22), and the "paradox of the same day in all but 31 cases.
kappa" (23). We presented elsewhere an alternative Surrogates were younger, better educated, more
to K for measuring surrogate accuracy (24). likely to be married, and more likely to be female
than patients (Table 2). They were likely to be
Factors Associated with the Accuracy of spouses (57%) or adult children (22%). The surro-
Substituted Judgments gate that we interviewed, chosen according to Mary-
A skewed distribution was found for SAMPPS land's legal hierarchy, was the person the patient
scores. The Friedman two-way analysis of variance would have chosen (the patient's preferred surro-
was used to compare SAMPPS scores across scenar- gate) in 90% of cases. Patients with AIDS were the
ios, and the Kruskal-Wallis one-way analysis of vari- youngest, the most likely to be of minority ethnicity,
ance was used to compare SAMPPS-1 scores across the most likely to have not graduated from high
diagnoses. school, and the most likely to lack private insurance
A binary logit model (25) was constructed using (Table 3).
the 10 binary items (correct/incorrect) per patient-
surrogate pair of the SAMPPS-1 score. Indepen- Substituted Judgments
dent variables were screened for association with Surrogate accuracy did not differ significantly ac-
SAMPPS-1 score by using the Spearman rank cor- cording to whether patients were interviewed on the
relation and Kruskal-Wallis tests, as appropriate. same day (SAMPPS-1 score, 7.4) or on different
Variables that did not contribute significantly to the days (SAMPPS-1 score, 7.0) (P > 0.2). Overall, sur-
multivariate model were eliminated in a stepwise rogates correctly predicted the wishes of patients for
manner. When diagnostic categories were stratified particular treatments in specific scenarios in 66% of
by age (>65 years or <65 years), the log odds instances. Surrogates were better at predicting pa-
differed significantly by age group for patients with tients' wishes for more invasive procedures. For ex-
amyotrophic lateral sclerosis, but no such interac- ample, in the permanent coma scenario, surrogates
tion was found between education and insurance. were correct about ventilator care in 84% of in-
Model diagnostics to identify influential observa- stances and about cardiopulmonary resuscitation in
tions were examined (26), revealing 9 outliers that 79% of instances compared with 57% of instances
were subsequently removed from the final model. for phlebotomy and 62% of instances for chest
Thus, the final model reports on only 291 patient- roentgenography (P < 0.001).
surrogate pairs. The model was validated by using As shown in Figure 1, accuracy scores did not
the independent variables found to be significant in differ significantly according to patient diagnosis,
15 April 1998 • Annals of Internal Medicine • Volume 128 • Number 8 623
Table 2. Characteristics of 300 Study Patients and thought that the patient would want the treatment,
300 Surrogates*
and only 36% of these patients did want it.
Characteristic Patient Surrogate P Value
Mistaken Substituted Judgments
Mean age (range), y 60(22-84) 55(18-83) <0.001t
Women, % 39.0 73.3 <0.001* For most modes of treatment, when the surro-
White persons, % 72.7 72.3 NS* gates were wrong, they were no more likely (by the
Marital status, %
Married 58.7 75.3 McNemar test) to state that the patient would want
Widowed, separated, or divorced 26.7 12.0 treatment when the patient did not want it than
Never married or living with
significant other 14.7 12.7 <0.001* they were to state that the patient would not want
Education, % treatment when the patient did want it. The only
Less than high school 22.0 14.7
High school graduate or some exceptions across all three scenarios were the less
college 50.3 53.7 invasive procedures of nasotracheal suctioning,
College graduate or more than
college 27.7 31.7 <0.05* phlebotomy, and chest roentgenography. In the per-
Religion, % manent coma scenario, for example, the McNemar
Protestant 56.7 58.7
Catholic 27.0 28.3 statistic was significant only for suctioning (P =
Jewish, other, or none 16.3 13.3 NS* 0.02), phlebotomy (P < 0.001), and chest roentgen-
Type of insurance, %
Private insurance, health ography (P < 0.001). For these treatments, the sur-
maintenance organization, rogate's tendency was to state that the patient would
or other 74.3
Medicare only 9.7 want treatment when the patient did not want it.
Medicaid only 16.0
* Analysis of variance,
t Chi-square test.
Discussion