Professional Documents
Culture Documents
8d083e3e409247598c7bc38c33682067 (1)
8d083e3e409247598c7bc38c33682067 (1)
appLications
of
artificiaL
inteLLigence
MedicaL
appLications
of
artificiaL
inteLLigence
edited by
arvin agah
MATLAB® is a trademark of The MathWorks, Inc. and is used with permission. The MathWorks does not warrant the
accuracy of the text or exercises in this book. This book’s use or discussion of MATLAB® software or related products
does not constitute endorsement or sponsorship by The MathWorks of a particular pedagogical approach or particular
use of the MATLAB® software.
CRC Press
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487-2742
© 2014 by Taylor & Francis Group, LLC
CRC Press is an imprint of Taylor & Francis Group, an Informa business
This book contains information obtained from authentic and highly regarded sources. Reasonable efforts have been
made to publish reliable data and information, but the author and publisher cannot assume responsibility for the valid-
ity of all materials or the consequences of their use. The authors and publishers have attempted to trace the copyright
holders of all material reproduced in this publication and apologize to copyright holders if permission to publish in this
form has not been obtained. If any copyright material has not been acknowledged please write and let us know so we may
rectify in any future reprint.
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmitted, or uti-
lized in any form by any electronic, mechanical, or other means, now known or hereafter invented, including photocopy-
ing, microfilming, and recording, or in any information storage or retrieval system, without written permission from the
publishers.
For permission to photocopy or use material electronically from this work, please access www.copyright.com (http://
www.copyright.com/) or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood Drive, Danvers, MA 01923,
978-750-8400. CCC is a not-for-profit organization that provides licenses and registration for a variety of users. For
organizations that have been granted a photocopy license by the CCC, a separate system of payment has been arranged.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used only for
identification and explanation without intent to infringe.
Visit the Taylor & Francis Web site at
http://www.taylorandfrancis.com
and the CRC Press Web site at
http://www.crcpress.com
Contents
Preface...............................................................................................................................................ix
Editor................................................................................................................................................xi
Contributors.................................................................................................................................. xiii
v
vi Contents
13. Deep Learning for the Semiautomated Analysis of Pap Smears.............................. 193
Kriti Chakdar and Brian Potetz
17. Fuzzy Naïve Bayesian Approach for Medical Decision Support.............................. 267
Kavishwar B. Wagholikar and Ashok W. Deshpande
21. Intelligent Light Therapy for Older Adults: Ambient Assisted Living...................343
Joost van Hoof, Eveline J. M. Wouters, Björn Schräder, Harold T. G. Weffers,
Mariëlle P. J. Aarts, Myriam B. C. Aries, and Adriana C. Westerlaken
27. Artificial Intelligence Approaches for Drug Safety Surveillance and Analysis... 431
Mei Liu, Yong Hu, Michael E. Matheny, Lian Duan, and Hua Xu
As tools and techniques in artificial intelligence are being further enhanced, their applica-
tions to medicine are expanding. This book is an attempt to capture the breadth and depth
of medical applications of artificial intelligence.
This book is composed of 28 chapters, written by 82 authors, with 50 unique affiliations,
from 17 countries. The first five chapters provide a general overview of artificial intel-
ligence, followed by 22 chapters that focus on projects that apply artificial intelligence to
the medical domain. The final chapter provides a list of pertinent resources on artificial
intelligence. Over 1200 entries are provided in the index.
The editor thanks all the authors for contributing to this book.
The editor acknowledges the efforts of the staff at CRC Press/Taylor & Francis in help-
ing with this project from the initial concept to the printed book. Special thanks go to
T. Michael Slaughter (Executive Editor), Laurie Schlags (Project Coordinator), Rachael
Panthier (Project Editor), Michele Smith (Editorial Assistant—Engineering), and Amor
Nanas (Manila Typesetting Company).
ix
Editor
Dr. Arvin Agah is associate dean for research and graduate programs in the School of
Engineering and professor of electrical engineering and computer science at the Univer
sity of Kansas. His research interests include applied artificial intelligence and autono-
mous robots. He has published more than 150 refereed articles in these areas. Dr. Agah has
been a co-investigator on projects that represent more than $32 million in research fund-
ing. He has supervised 13 PhD students and 40 master’s degree students and has received
multiple honors for his teaching excellence. Prior to joining the University of Kansas, he
spent two years at the Bio-Robotics Division of Mechanical Engineering Laboratory in
Tsukuba, Japan. He has been a researcher at the IBM Los Angeles Scientific Center, and
Xerox Research Center in Rochester, New York. In addition to teaching courses at the
University of Kansas, Dr. Agah has taught at the Department of Engineering Systems at
University of Tsukuba in Japan, the Center of Excellence in Remote Sensing Education
and Research at Elizabeth City State University in North Carolina, and the Department of
Mechanical Engineering at the University of Canterbury in New Zealand. He is a senior
member of IEEE and ACM.
Dr. Agah received his bachelor’s degree in computer science with highest honors from
the University of Texas at Austin, master’s degree in computer science from Purdue
University, and one in biomedical engineering and PhD in computer science from the
University of Southern California.
xi
Contributors
and
xiii
xiv Contributors
Arvin Agah
CONTENTS
1.1 Introduction............................................................................................................................. 1
1.2 A Bit of History....................................................................................................................... 2
1.3 Chapters in This Book............................................................................................................ 4
References.......................................................................................................................................... 6
1.1 Introduction
Artificial intelligence (AI) concepts, techniques, and tools have been utilized in medical
applications for over four decades. The overall goal has been to benefit health care by
assisting health care professionals in improving their effectiveness, productivity, and con-
sistency. Improvements in accuracy and efficiency of AI techniques have steadily increased
AI’s viability as a choice for tackling problems in medicine. The availability of AI software
has played a significant role in the further adoption of AI for medical applications.
AI techniques have been successfully applied to the medical domain, and AI systems
are being integrated into health care. Applications of AI span a wide and diverse range of
fields in medicine. In addition to more traditional medical applications of AI (diagnosis,
therapy, automatic classification, rehabilitation), more recent applications include disease
genes, wearable computing, hospital scheduling, visualization, medical robotics, surgery
simulation, artificial consciousness, and much more.
Medical applications of AI are so prevalent that literature surveys have been conducted
in order to study the use of AI techniques in medical applications. One such study reported
that neural networks were the most commonly used analytical tool (Ramesh et al. 2004).
Another study designated fuzzy logic–neural networks as the most often used AI tech-
nique, stating that there have been intensive interests in AI techniques in genetics, cardiol-
ogy, radiology, and so forth (Yardimci 2007).
There are numerous published works (books, journals, conference proceedings) on med-
ical applications of AI, covering a very wide range, from nutrition (Buisson 2008), to data
mining (Ciosa and Moore 2002), to traditional Chinese medicine (Feng et al. 2006). In addi-
tion to publications that provide a general coverage of different AI techniques for a variety
of medical applications, the more specific publications on medical applications of AI can
be categorized into two classes. The first set focuses on a specific field in AI, as it is applied
to different areas in medicine. The second set concentrates on an explicit area in medicine
and covers the utilization of a number of AI techniques in that area. Examples of books
1
2 Medical Applications of Artificial Intelligence
in the first set include those with foci on fuzzy logic (Barro and Marin 2010), expert sys-
tems (Fieschi 1990), software agents (Moreno and Nealon 2004), and genetic computation
(Smith and Cagnoni 2011). Examples of books in the second set include medical informat-
ics (Kelemen et al. 2008; Yoshida et al. 2010), medical imaging (Schaefer et al. 2009), medical
diagnosis (Schmitt et al. 2010), and book series on computational intelligence in health care
(Springer 2007–2011).
Many survey papers provide a strong starting point for those interested in AI in medi-
cine. Examples of survey papers in the first set include AI applications in the intensive care
unit (Hanson and Marshall 2001), medical applications of case-based reasoning (Holt et
al. 2006), fuzzy logic in medicine (Torres and Nieto 2006), wearable computing (Lukowicz
2008), evolutionary computation (Pena-Reyes and Sipper 2000), and the use of smart and
adaptive systems in different areas (Abbod et al. 2002), comprising emergency and inten-
sive care, general and surgical medicine, pathology, and medical imaging. Examples of
survey papers in the second set include a survey of medical documents’ summarization
(Afantenos 2005), hospital scheduling (Spyropoulos 2000), bioinformatics (Valentini 2009),
and brain pathology (Hemanth et al. 2009).
Research highlights for the first decade of AIM were reported in the work of Clancey
and Shortliffe (1984), where AIM is stated to be focused on AI programs for diagnosis
and therapy recommendations. The AIM design features that physicians would consider
important were discussed, including explaining the diagnostic and treatment decisions,
being portable and flexible, improving cost efficiency, and autonomously learning from
medical experts. In addition, a number of AIM challenges were identified by Clancey and
Shortliffe (1984), which still hold true: methods to acquire and process data; knowledge
acquisition and representation; capability to provide explanation; and integrations of AIM
systems into the working environment of health care professionals, including the human–
computer interface.
In the first decade of AI in medicine, the proper evaluation of the techniques was given
consideration; this was needed in order to establish the quality of work in AI in medi-
cine. One such evaluation (Chandrasekaran 1983) identified a number of challenges in
evaluating AI systems: focusing only on the final results, instead of intermediate stages;
comparing results against absolute standards of correctness, instead of work of other
clinicians; not knowing the correct answer as clinicians may disagree; and difficulty in
determining the ability to scale. Such issues are still impacting the evaluation of medical
AI systems.
The adolescence of AI in medicine is discussed in the work of Shortliffe (1993), during the
second decade of AI in medicine. The coming-of-age of AI in medicine is proposed to have
started in 2009, that is, the third decade, offering perspectives from a number of experts in
AIM (Patel et al. 2009). It was argued that the AIM field is robust, although less visible than
AI’s heyday. Progress has been made by researchers, and additional research grant fund-
ing would be beneficial. Goals are identified for a better understanding of the errors and
risk taking, that is, the resiliency of AIM systems (Patel et al. 2009). Challenges identified
included improved data capture and processing, reliable mechanisms for patients’ confi-
dentiality, and better modeling techniques. Some argue that AIM systems for discovery
should not attempt to be autonomous and instead should be semiautonomous support sys-
tems for discovery. The following are identified to be important topics in AIM: knowledge
representation, systems modeling, effective use of information in decision making, data
analysis, and interdisciplinary education programs (Patel et al. 2009).
AIM was strong in its third decade, with numerous research efforts. It even garnered
enough attention to result in online courses, namely, the medical artificial intelligence
course, which was offered in 2005 (MIT OpenCourseWare 2005).
We are now in the fourth decade of AI in medicine. The research is strong and is con-
tinuing. Many books have been written to showcase the results, for instance, the work of
Miller (2011). The advantages offered by AI in medicine are that such systems can offer
accuracy (computers are less likely to make mistakes), cost and efficiency (no fatigue),
and replication (ability to make numerous copies) (Butler 2011). Overall, AI technologies
have not been integrated into medicine, as it was once predicted. Major problems have
been unrealistic expectations and usability (Bond 2010). AI researchers have focused on
building effective and usable software tools that can be used for medical applications.
However, most AI systems in medicine are still semiautonomous, as some form of human
supervision is still required to ensure proper diagnosis and treatment. There is interest
from health care professionals to have systems that can assist them, not replace them. AI
systems can monitor patients, make suggestions, and help mine the data. The challenge is
in developing hybrid systems that can effectively and efficiently combine the experience
of medical care professionals with attributes that AI software can provide. The interaction
challenge requires more attention.
4 Medical Applications of Artificial Intelligence
AI, including support vector machines, neural networks, naïve Bayesian classifiers, hidden
Markov models, k-means clustering, and principal component analysis.
Prominent machine learning and data mining methods are explained in Chapter 3,
along with their medical applications. The techniques include logistic regression, k-nearest
neighbor, K*, decision trees, random forests, rule learners, neural networks, and evolution-
ary algorithms. The concept of hybrid systems is also introduced where an ensemble of
classifiers or a multi-classifier is built using a number of classifiers. Chapter 4 provides
another review of AI techniques with medical applications, covering fuzzy logic, genetic
algorithm, and neural networks. Chapter 5 examines the information retrieval aspects of
AI, drawing comparisons between medicine and law.
Chapters 6–27 provide in-depth coverage of medical applications of AI. Chapters 6–10
focus on classification problems in medicine. Image processing is the common theme in
Chapters 11–14. Chapter 15–17 emphasize medical decision making. Use of AI in diagnosis
is covered in Chapters 18–19, and assisted living is included in Chapters 20–21. Chapters
22–27 present a variety of medical applications of AI.
In Chapter 6, genetic algorithms are utilized to characterize soft tissue in terms of pre-
dicting the viscoelastic response. The experimental results are shown to be accurate and
time efficient compared with other existing methods. The application of support vector
machines and wavelet transform to classification of electroencephalogram (EEG) signals
is the subject of Chapter 7. The work considers developing compact feature vectors in order
to make the process more efficient.
The applications of naïve Bayesian classifiers are described in Chapter 8, with emphasis
on two problematic characteristics of data sets, namely, high dimensionality and small
size. The application domain is microarray gene expression for different diseases. Chapter
9 focuses on prediction of chemical toxicity profiles, using random forest and naïve Bayes
methods. The performance of the computational techniques are analyzed and compared.
The theme of classification continues in Chapter 10, where neural networks are used for
cancer prediction. The classifier and dominant gene prediction methodologies are used for
predicting tumors.
Chapter 11 presents the application of data mining for melanoma diagnosis. Algorithms
are evaluated based on their dermatoscopic score, using basic melanoma data sets. AI
applications to optical coherence tomography images are discussed in Chapter 12. The
main layers of the retina are extracted, and 2-D and 3-D surfaces are reconstructed. Support
vector machines and deep belief networks are applied to the analysis of pap smears in
Chapter 13. The machine learning techniques are used to identify discriminative visual
features that can be predictive of cancer and precancer grading. Chapter 14 discusses a
fuzzy clustering approach to MRI segmentation, forming clusters of similar-intensity sets
in images. MRI segmentation is used for detection in clinical diagnosis.
Chapter 15 presents a therapeutic decision-making approach, quantifying risk and
uncertainty for optimal treatment. The methodology is applied to hyperglycemic manage-
ment of critically ill patients. Clinical decision making is surveyed in Chapter 16, describ-
ing the practices and challenges. A narrative of a mock patient is used to contextualize and
categorize the process. Clinical decision making is further explored in Chapter 17, based
on the fuzzy naïve Bayesian approach. Other fuzzy set theoretic approaches are presented.
Chapter 18 emphasizes clinical diagnostics, namely, methodologies in metabolomic
studies. AI methodologies of neural networks, genetic algorithms, and self-organizing
maps are applied to spectrometry-like instruments to aid patient care and disease diagno-
sis and management. The amblyopia (lazy eye) vision disorder is the subject of Chapter 19.
6 Medical Applications of Artificial Intelligence
References
AAAI (2013). AITopics. aitopics.org.
Abbod, M.F., Linkens, D.A., Mahfouf, M., and Dounias, G. (2002). Survey on the use of smart and
adaptive engineering systems in medicine. Artificial Intelligence in Medicine, Vol. 26, 179–209.
Afantenos, S., Karkaletsis, V., and Stamatopoulos, P. (2005). Summarization from medical docu-
ments: a survey. Artificial Intelligence in Medicine, Vol. 33, 157–177.
Artificial Intelligence in Medicine Society. (2013). Conference on Artificial Intelligence in Medicine.
aimedicine.info/aime.
Ax, A.F. (1960). Computers and psychophysiology in medical diagnosis. IRE Transactions on Medical
Electronics, Vol. ME–7, Iss. 4, 263–264.
Barro, S. and Marin, R. (2010). Fuzzy Logic in Medicine (Studies in Fuzziness and Soft Computing).
Physica-Verlag, Heidelberg, Germany.
Bond, A. (2010). Reality checkup: Medical artificial intelligence still a hard sell in the clinic. Scientific
American, http://www.scientificamerican.com/article.cfm?id=artificial-intelligence-medical-tests-
software-diagnosis, January 12.
Buisson, J.-C. (2008). Nutri-Educ, a nutrition software application for balancing meals, using fuzzy
arithmetic and heuristic search algorithms. Artificial Intelligence in Medicine, Vol. 42, 213–227.
Introduction to Medical Applications of Artificial Intelligence 7
Butler, L.M. (2011). How Artificial Intelligence Doctors Will Change Medicine. EzineArticles.com.
Chandrasekaran, B. (1983). On evaluating AI systems for medical diagnosis. AI Magazine, Vol. 4,
No. 2, 34–48.
Ciosa, K.J. and Moore, G.W. (2002). Uniqueness of medical data mining. Artificial Intelligence in
Medicine, Vol. 26, 1–24.
Clancey, W.J. and Shortliffe, E.H. (1984). Readings in Medical Artificial Intelligence: The First Decade.
Addison Wesley, Reading, MA.
Elsevier. (2013). Artificial Intelligence in Medicine Journal. www.journals.elsevier.com/artificial-intelli
gence-in-medicine.
Feng, Y., Wu, Z., Zhou, X., Hou, Z., and Fan, W. (2006). Knowledge discovery in traditional Chinese
medicine: state of the art and perspectives. Artificial Intelligence in Medicine, Vol. 38, 219–236.
Fieschi, M. (1990). Artificial Intelligence in Medicine: Expert Systems. Chapman & Hall, London.
Hanson, C.W. and Marshall, B.E. (2001). Artificial intelligence applications in the intensive care unit.
Critical Care Medicine, Vol. 29, No. 2, 427–435.
Healthinformatics. (2013). Artificial Intelligence in Medicine. http://healthinformatics.wikispaces.
com/Artificial+Intelligence+in+Medicine.
Hemanth, D.J., Vijila, C.K.S., and Anitha, J. (2009). A survey on artificial intelligence based brain
pathology identification techniques in magnetic resonance images. International Journal of
Reviews in Computing, Vol. 4, 30–45.
Holt, A., Bichindaritz, I., Schmidt, R., and Perner P. (2006). Medical applications in case-based reason-
ing. The Knowledge Engineering Review, Vol. 20, No. 3, 289–292.
Holtzman, W.H. (1960). Can the computer supplant the clinician? Journal of Clinical Psychology,
Vol. 16, Iss. 2, 119–122.
Hunt, E. (1968). Computer simulation: artificial intelligence studies and their relevance to psychol-
ogy. Annual Review of Psychology. Vol. 19, 135–168.
InTouch Technologies. (2013). RP-VITA Remote Presence Robot. www.intouchhealth.com.
iRobot Corporation. (2013). RP-VITA Remote Presence Robot. www.irobot.com.
Kelemen, A.A., Abraham, A., and Liang, Y. (2008). Computational Intelligence in Medical Informatics
(Studies in Computational Intelligence). Springer-Verlag, Berlin, Germany.
Ledley, R.S. and Lusted, L.B. (1960). The use of electronic computers in medical data processing: aids
in diagnosis, current information retrieval, and medical record keeping. IRE Transactions on
Medical Electronics, Vol. ME–7, 31–47.
Lukowicz, P. (2008). Wearable computing and artificial intelligence for healthcare applications.
Artificial Intelligence in Medicine, Vol. 42, 95–98.
Miller, P.L. (2011). Selected Topics in Medical Artificial Intelligence (Computers and Medicine). Springer-Verlag,
New York.
MIT OpenCourseWare. (2005). MIT Course Number: HST.947 Medical Artificial Intelligence. ocw.mit.
edu/courses/health-sciences-and-technology/hst-947-medical-artificial-intelligence-spring-2005
Moreno, A. and Nealon, J.L. (2004). Applications of Software Agent Technology in the Health Care Domain.
Birkhäuser.
Nokia. (2013). Nokia Sensing X challenge. www.nokiasensingxchallenge.org.
Nordyke, R.A., Kulikowski, C.A., and Kulikowski C.W. (1971). A comparison of methods for the
automated diagnosis of thyroid dysfunction. Computers and Biomedical Research, Vol. 4, 374–389.
OpenClinical. (2013). Artificial intelligence in medicine: an introduction. www.openclinical.org/aiin-
medicine.html.
Patel, V.L., Shortliffe, E.H., Stefanelli, M., Szolovits, P., Berthold, M.R., Bellazzi, R., and Abu-Hanna, A.
(2009). The coming of age of artificial intelligence in medicine. Artificial Intelligence in Medicine,
Vol. 46, 5–17.
Pena-Reyes, C.A. and Sipper, M. (2000). Evolutionary computation in medicine: an overview.
Artificial Intelligence in Medicine, Vol. 19, 1–23.
Qualcomm. (2013). Qualcomm Tricorder X Prize. www.qualcommtricorderxprize.org.
Ramesh, A.N., Kambhampati, C., Monson, J.R., and Drew, P.J. (2004). Artificial intelligence in medi-
cine. Annals of the Royal College of Surgeons of England. Vol. 86, No. 5, 334–338.
8 Medical Applications of Artificial Intelligence
Schaefer, G., Hassanien, A.E., and Jiang, J. (2009). Computational Intelligence in Medical Imaging:
Techniques and Applications. Chapman and Hall/CRC, Boca Raton, Florida.
Schmitt, M., Teodorescu, H.-N., Jain, A., Jain, A., and Jain, S. (2010). Computational Intelligence Processing
in Medical Diagnosis (Studies in Fuzziness and Soft Computing). Physica-Verlag, Heidelberg, Germany.
Schwartz, W.B. (1970). Medicine and the computer: the promise and problems of change. New
England Journal of Medicine, Vol. 283, No. 23, 1257–1264.
Shortliffe, E.H. (1993). The adolescence of AI in medicine: will the field come of age in the 90’s?
Artificial Intelligence in Medicine, Vol. 5, 93–106.
Shortliffe, E.H., Axline, S.G., Buchanan, B.G., Merigan, T.C., and Cohen, S.N. (1973). Computers and
Biomedical Research, Vol. 6, Iss. 6, 544–560.
Slack, W.V. and Van Cura, L.J. (1968). Patient reaction to computer-based medical interviewing.
Computers and Biomedical Research, Vol. 1, Iss. 5, 527–531.
Smith, S.L. and Cagnoni, S. (2011). Genetic and Evolutionary Computation: Medical Applications. John
Wiley & Sons, Ltd., West Sussex, UK.
Springer. (2007–2011). Advanced Computational Intelligence Paradigms in Healthcare – Series (1 to 6)
(Studies in Computational Intelligence). Springer-Verlag, Berlin, Germany.
Spyropoulos, C.D. (2000). AI planning and scheduling in the medical hospital environment. Artificial
Intelligence in Medicine, Vol. 20, 101–111.
Szolovits, P. (1982). Artificial Intelligence in Medicine. Westview Press, Inc., Boulder, Colorado.
Torres, A. and Nieto, J.J. (2006). Fuzzy logic in medicine and bioinformatics. Journal of Biomedicine and
Biotechnology, Vol. 2006, 1–7.
University of Waikato. (2013). Weka (Waikato Environment for Knowledge Analysis). www.
cs.waikato.ac.nz/ml/weka.
U.S. National Library of Medicine. (2013). PubMed. www.ncbi.nlm.nih.gov/pubmed.
Valentini, G. (2009). Computational intelligence and machine learning in bioinformatics. Artificial
Intelligence in Medicine, Vol. 45, 91–96.
XPrize. (2013). X PRIZE Foundation. www.xprize.org.
Yardimci, A. (2007). A survey on use of soft computing methods in medicine. In Proceedings of the 17th
International Conference on Artificial Neural Networks, Porto, Portugal, 69–79.
2
Overview of Artificial Intelligence
David O. Johnson
CONTENTS
2.1 Introduction to Artificial Intelligence..................................................................................9
2.2 Machine Learning.................................................................................................................. 9
2.3 Support Vector Machines.................................................................................................... 11
2.4 Neural Networks.................................................................................................................. 12
2.5 Naïve Bayesian Classifier..................................................................................................... 13
2.6 Hidden Markov Models....................................................................................................... 17
2.7 k-Means Clustering............................................................................................................... 21
2.7.1 k-Means Clustering When k is Known.................................................................. 21
2.7.2 k-Means Clustering When k is Unknown.............................................................. 23
2.7.3 Reconstruction Error................................................................................................ 23
2.7.4 Peakedness................................................................................................................. 23
2.8 Principal Component Analysis........................................................................................... 25
References........................................................................................................................................ 27
2.2 Machine Learning
Machine learning classifiers take as input empirical data and predict the features of the
data. As an example, consider anticancer drug design. One method of anticancer drug
design is to create compounds and then test how well they kill cancer cells in laboratory
petri dishes. Thousands of compounds may be created and tested before one is found that
kills cancer cells. Clearly, the process of creating the compound and then testing it is a
lengthy one. Machine learning has been applied to speed this process up by identifying
9
10 Medical Applications of Artificial Intelligence
the compounds that are more likely to kill cancer cells. Those compounds are then created
and tested.
How is this done? Before a compound is created, the chemical attributes of it can be
determined (e.g., is it an acid or a base?). What the attributes are does not really matter. The
idea is to find those compounds that have attributes similar to those of compounds known
to kill cancer. Thus, the empirical data that is the input to the machine learning classifier
are the attributes of the compounds, and the feature that the machine learning classifier
predicts is whether the compound will kill cancer cells or not. This type of machine learn-
ing is called “supervised learning” because there is a set of data (e.g., the known cancer-
killing compounds) that we can use to “train” the machine learning classifier to recognize
which attributes are important in predicting whether a compound is cancer killing or not.
There is another type of machine learning called “unsupervised learning,” which we will
talk about later in the chapter.
There are a number of metrics for measuring how well a particular machine learning
classifier works. Everyone who proposes a new machine learning classifier can use these
metrics to compare their classifier with other classifiers. All of these metrics are based
on a “confusion matrix.” The best way to explain a confusion matrix is with an example.
Assume we have 100 compounds, and we are predicting whether they kill cancer or not.
Suppose 30 of the compounds actually kill cancer, and 70 do not. Now suppose that the
machine learning classifier correctly predicts that 20 of the cancer-killing compounds are
cancer killing but incorrectly predicts that 10 of them are not cancer-killing compounds.
Also suppose that the machine learning classifier correctly predicts that 60 of the non-
cancer-killing compounds are non-cancer killing but incorrectly predicts that 10 of the
non-cancer-killing compounds are cancer killing. We can represent this in a confusion
matrix as shown in Table 2.1.
Confusion matrices are always shown in this way, and each cell has a generic name, as
shown in Table 2.2.
The values in these cells are then used to calculate the metrics that are used to compare
machine learning classifiers. There are many metrics that are used, but the most common
TABLE 2.1
Example of a Confusion Matrix for a Machine Learning Classifier That Predicts Whether a
Compound Is Cancer Killing or Not
Actually Kill Cancer Actually Do Not Kill Cancer
Predicted to Kill Cancer 20 10
Predicted Not to Kill Cancer 10 60
TABLE 2.2
Standard Confusion Matrix Nomenclature
Actually Kill Cancer Actually Do Not Kill Cancer
Predicted to Kill Cancer True positive (TP) False positive (FP)
Predicted Not to Kill Cancer False negative (FN) True negative (TN)
Overview of Artificial Intelligence 11
are precision (P), recall (R), true positive rate (TPR), and false positive rate (FPR). These
metrics are calculated as follows:
P = TP/(TP + FP)
R = TP/(TP + FN)
In the example above, the TPR (and the R) of the machine learning classifier would be
calculated as follows:
y L1
L2 L3
FIGURE 2.1
Support vector machine (SVM) example.