Professional Documents
Culture Documents
Use of An Artificial Neural Network For The Diagnosis of Myocardial Infarction
Use of An Artificial Neural Network For The Diagnosis of Myocardial Infarction
Myocardial Infarction
William G. Baxt, MD
• Objective: To validate prospectively the use of an Jilectronic data processing technologies have been
artificial neural network to identify myocardial infarction used to aid in clinical diagnosis for more than 20 years
in patients presenting to an emergency department (1). The acceptance of such technologies has not oc-
with anterior chest pain. curred (2). The reasons for this are varied but can be
• Design: Prospective, blinded testing. traced to the fact that their use is time-consuming and
• Setting: Tertiary university teaching center. inconvenient and because most have not been able to
• Patients: A total of 331 consecutive adult patients perform consistently better than physicians (2, 3). Arti-
presenting with anterior chest pain. ficial neural network-based pattern recognition has been
• Measurements: Diagnostic sensitivity and speci- successfully used in several complex environments,
ficity with regard to the diagnosis of acute myocardial which suggests that this technology may, potentially, be
infarction. applied to the clinical setting (4-7). Unlike most other
• Main Results: An artificial neural network was forms of artificial intelligence, the artificial neural net-
trained on clinical pattern sets retrospectively derived work can improve performance by training. The net-
from the cases of 351 patients hospitalized with a high works can also define relationships among input data
likelihood of having myocardial infarction. It was pro- that are inapparent using other approaches, and they
spectively tested on 331 consecutive patients present- can use these relationships to improve accuracy. This
ing to an emergency department with anterior chest technology has been applied to the analysis of clinical
pain. The ability of the network to distinguish patients data (8-15); however, the network's diagnostic accuracy
with from those without acute myocardial infarction has not been substantially better than that of physicians
was compared with that of physicians caring for the (16-18).
same patients. The physicians had a diagnostic sensi- An artificial neural network has been trained to diag-
tivity of 77.7% (95% CI, 77.0% to 82.9%) and a diag- nose, with a high degree of accuracy, acute myocardial
nostic specificity of 84.7% (CI, 84.0% to 86.4%). The infarction in patients presenting to an emergency de-
artificial neural network had a sensitivity of 97.2% (CI, partment (19). Data sets that were retrospectively ob-
97.2% to 97.5%; P = 0.033) and a specificity of 96.2% tained from 356 patients suspected of having had a
(CI, 96.2% to 96.4%; P< 0.001). myocardial infarction were used to train and test the
• Conclusion: An artificial neural network trained to network. The network was trained on one half of the
identify myocardial infarction in adult patients present- patients and was tested on the remaining half to which
ing to an emergency department may be a valuable aid it had never been exposed. The process was reversed
to the clinical diagnosis of myocardial infarction; how- and the results pooled in order to test all patients. The
ever, this possibility must be confirmed through pro- network performed with a sensitivity of 92% and a
spective testing on a larger patient sample. specificity of 96% (19). These figures were substantially
better than those reported previously for either physi-
cians (sensitivity, 88%; specificity, 71%) or any comput-
er-based technology (sensitivity, 88%; specificity, 74%)
(20). The network had a sensitivity of 80% and a spec-
ificity of 96% when all patients with electrocardio-
graphic evidence of myocardial infarction were removed
from the study sets. This work has recently been inde-
pendently corroborated (21).
Because the initial study was retrospective, the net-
work's performance must be prospectively validated be-
fore this network can be considered a legitimate aid to
clinical diagnosis. We report the results of the first
phase of this validation, the prospective testing of the
network on a group of 331 adult patients presenting to
an emergency department with anterior chest pain.
Methods
Annals of Internal Medicine. 1991;115:843-848.
Description of Artificial Neural Network
From the University of California, San Diego Medical Center,
San Diego, California. For the current author address, see end An artificial neural network is a group of interconnected
of text. mathematic equations that accept input data and that calculate
extended to other clinical settings could be explored. where fi(netpi) is the derivative of the activation function with
Finally, potentially one of the most important aspects respect to a change in the net input to the unit.
of these findings may be that the network may be able The delta for the hidden units is calculated in terms of the
units to which they project and the weights on those projec-
to identify relationships between clinical patient varia- tions:
bles that have not been appreciated by clinicians in the
past. Strategies have already been developed to define n
these relationships (21, 25). Not only does this possibil- 4. SPi=fi(netpi) 2 SpkWki
ity have important implications about the limits of clin- k= Q
ical reasoning, but it also may suggest that such tech- Weights and biases are updated by the calculation of the
nologies as reported here may potentially be used in delta weight:
further elucidating disease processes in general. Such
conclusions await ultimate validation of these observa- 5. Awj = awj + ((1 - a)(e8jaj))
tions and the successful extension of these technologies where alpha is termed network momentum and epsilon is
to other disease states. termed the learning rate parameter. Weights are updated by
adding the delta weight to the old weight.
Acknowledgments: The author thanks Dr. David Zipser for technical Training of the network is followed by calculating the
help and Ms. Kathleen James for manuscript preparation. summed square of the error, represented by E, across the
entire pattern set:
Requests for Reprints: William G. Baxt, MD, Department of Emer-
gency Medicine, UCSD Medical Center, 225 Dickinson Street, #8676, 6. E = E E (tPi ~ Opi)2
San Diego, CA 92103-8676.
P i
Current Author Address: Dr. Baxt: Department of Emergency Medi- where the index p ranges over the set of input patterns P, i
cine, UCSD Medical Center, 225 Dickinson Street, #8676, San Diego,
CA 92103-8676. ranges over the set of output units, tpi is the target of pattern
p, Opi is the network output for pattern p. When the summed
square of the error has ceased diminishing or has reached 0,
Appendix the network has been trained. If no relationship between the
pattern sets and their target value exists, this value will not
The mathematical operation of the network can best be diminish.
appreciated by again referring to Figure 1. This aspect of
network application is independent of its specific use and can
be viewed as generic. The network functions by the application
of binary or analog coded data comprising the pattern set to References
the 20 input units (see Table 1). This signal is then multiplied 1. Reggia JA, Tuhrim S; eds. Computer-Assisted Medical Decision
by the initially random weights on the projections between Making. Computers and Medicine, v. 2. New York: Springer-Ver-
each input unit and the first layer hidden units: lag; 1985.
2. Szolovits P, Patil RS, Schwartz WB. Artificial intelligence in medical
n diagnosis. Ann Intern Med. 1988;108:80-7.
3. McDonald CJ. Protocol-based computer reminders, the quality of
1. netpi = ZJ wuapj + biasj care and the non-perfectibility of man. N Engl J Med. 1976;295:
j=o 1351-5.
4. Widrow G, Hoff ME. 1960 Adaptive Switching Circuits Institute of
where netpi equals the net input of the unit for pattern p, w is Radio Engineering Western Electronic Show and Convention. Con-
a random weight, a is the input value applied to the unit, j vention Record; part 4:96-104.
represents the input or presynaptic units, / represents the first 5. Rumelhart DE, Hinton GE, Williams RJ. Learning internal repre-
layer hidden unit or postsynaptic unit, and bias is a modifiable sentations by error propagation. In: Rumelhart DE, McClelland JL;
weight that is multiplied by an input that is always equal to 1. eds. Parallel Distributed Processing: Explorations in the Microstruc-
The net activation of the hidden unit is calculated by: ture of Cognition. Cambridge, Massachusetts: MIT Press; 1986:318-
64.
1 6. McClelland JL, Rumelhart DE. Training hidden units. In: McClel-
2 a [ =
land JL, Rumelhart DE; eds. Explorations in Parallel Distributed
' P J + e~netpi Processing. Cambridge, Massachusetts: MIT Press; 1988:121-60.