Download as pdf or txt
Download as pdf or txt
You are on page 1of 12

ISSN: 2455-8826

Asian Journal of Innovative Research in Science, Engineering and Technology


(AJIRSET)
Available online at: www.engineeringjournal.info

RESEARCH ARTICLE

An Intelligent Medical Decision Support System


Ogbonnaya S. Success, LN Onyejegbu*

University of Port-Harcourt, Rivers State, Nigeria.

*Corresponding Author: Email: nneka2k@yahoo.com

Abstract

Continuous interpretations of numerous health examination results of patients are tedious and
monotonous for clinicians; that are more often than not burdened by the overwhelming and seemingly
intimidating number of patients they need to attend to routinely. This has culminated in a lot of fatal
errors on the part of the medical practitioners which has led to the loss of innocent lives. Medical
diagnosis is an important but complicated task that should be performed accurately and efficiently and
its automation would be very useful. In this work, we developed an intelligent medical decision support
system that would aid medical personnel in the accurate diagnosis and prediction of hepatitis A and B
using C5.0 algorithms. A comparison was made between using the proposed C 5.0 Algorithm, and
existing C4.5 algorithm. The simulation result shows that the Mean Absolute Error for C5.0 is 0.04 while
that of C4.5 is 0.09%. The time complexity of the process for C5.0 is 0.03 seconds while that of C4.5 is
0.05 seconds. The correctly classified instance of C 5.0 is 98.05% while that of C4.5 is 94.81%. The result
shows that C5.0 algorithm performs better than C4.5, in the diagnosis and prediction of hepatitis A and
B. C5.0 machine learning algorithms was used to train the historical datasets of the hepatitis virus.
WEKA simulation software was used to compute and simulate C5.0 algorithm. The methodology adopted
for this work, is Object Oriented Analysis and Design Methodology. PHP programming language, HTML
and MySQL database was used for the implementation.

Keywords: Hepatitis, C4.5 and C5.0 Algorithm, WEKA, Decision Tree, Medical decision Support system.

Introduction
Health examinations produces full and Brause [5] opined that most health
detailed information about the health practitioners are posed with lots of task one
condition of patients, the obtained of which is learning to diagnose in their early
information is needed for the management of year of study. For them, the highest task in
individual and public issues of health. health evaluation is generating a
Examining health conditions does not only comprehensive report after interpreting the
give health practitioners needed information result gotten from the examination. The
to aid early diagnosis of diseases but also results gathered from distributed sources of
supplies the health examination receivers’ test are simultaneously and entirely
proper recommendation for their self-health interpreted to get the actual picture of health
management. Both health practitioners and conditions of every health evaluation
receivers of health examination derive receiver. However, the detailed
benefit from the comprehensive information interpretation of multiple examination items
supplied, to obtain meaningful result from is complex and error-prone for a busy
the examination process. clinician.

A detailed health examination package may Continuous interpretations of numerous


compose of several examination items such as health evaluation results of patients are
one on one physical examination, tedious and monotonous for health
endoscopies, laboratory experiments and test, practitioners. Health practitioners have to
radiological studies, and others. The task of spend much effort in screening plenty of
health science is to diagnose and prevent health evaluation results to detect relatively
disease; the focus here is on diagnosis. few clinically significant findings. It is most

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 1


Available online at: www.engineeringjournal.info

likely to have a reduction in the number and may partition the problem space and may
value in terms of quality of health evaluation likely be a meaningful approximation to a
result interpretations at the situation of candidate solution - to low level facets which
human fatigue. Under this circumstance, may involve a more local inference that may
health practitioners tend to simplify the explain one-or two findings and would no
procedures such as following their own advanced the problem solving process to the
experience rather than clinical guidelines to same extent.
interpret clinical data and make decision. All
these conditions potentially affect the most Oguntimilehin A et. al. [14] discoursed that
important issues that concerns health clinical diagnosis is basically categorization
evaluations and quality control of all the which permits physician to forecast using
health examination reports. medical conditions and to decide suitable
cause. This is a difficult decision procedure
In a need of obtaining a practical, more that encompasses large volume of ambiguity
accurate and effective computer-assistance and also uncertainty organization mostly
for health evaluations impels the when dealing with disease of multiple
development process of the medical decision symptoms. This experience builds a system
support system (MDSS) proposed in our that powerfully maps the connections,
research. This research developed an observations and corresponding diagnoses.
intelligent model for medical decision support
system (MDSS) that would aid human Dragulescuand Albu [8] implemented a
medical personnel in the accurate diagnosis system that is made of three important parts.
and proper prediction of Hepatitis cases A First of logical inference is used to decide
and B. what type of Hepatitis virus is present for a
new patient. The possibilities are B, B+D and
This was be achieved by developing an C. After the second portion of the program
intelligent rules from hepatitis data set using was employed to understand what will the
SEE5.0/C5.0 decision tree algorithm, a model type be and the grade of hepatitis B (if the
for intelligent medical decision support sick person is diseased with the virus,
system (DSS) was developed using the hepatitis B). This branch of the application is
developed rules, the system was developed using methods from statistical
implemented using PHP, HTML and MySQL inference. The third\ section of the
as the database. The data for this research application consist of the patients ill with C
was gotten from University of California virus and it predicts the biological
Irvine (UCI) Machine Learning Repository parameter; evolution during the treatment
database. using AI Network.
Review of Related Work
The literature review measures the extent to Kahn R. et. al. [11] proposed neutrosophic
which work has been done in relation to classifier which employs Neutrosophic logic
intelligent medical decision support system. and is an addition of normally fuzzy classifier
Evan DA [9] describe four various levels used. They compared it using the resulting
whereby medical knowledge are structured in parameters: like sum rules and the nature of
a clinical problem solving framework. They associated functions. They proved that
identified that observations are entities of lengthy fuzzy classifier: Neutrosophic
evidence that are known as possibly classifier; optimizes the believed parameters
pertinent in a difficult solving situation. No in comparison to fuzzy equivalent. Finally
useful clinically facts were constituted. they established that Neutrosophic logic still
in its emerging stage quiet holds the probable
Facets reflect general pathological to be further experimented using different
descriptions such as aortic insufficiency, or fields.
categorical descriptions such as endocrine
problem. They are also interim hypotheses Ahmed [1] used neutrosophic and fuzzy C/
that divide the information in the problem means gathering for enhancing CT liver
into manageable sub-problems and suggest appearance. Abdominal CT imaging was
possible solutions. Facets vary in their level chosen and a segmentation approach was
of abstraction - from high level facets which given to get their capability and accuracy.

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 2


Available online at: www.engineeringjournal.info

The image is transmuted into Neutrosophic drawing inferences was employed to infer the
domain, this is designated with three data from the rules developed. The/ de-
subsets. The ratio of fact in the subset T, fuzzification method engaged is centroid
ratio of indeterminacy using subset 1, the approach. The result obtained resembles
ratio of inaccuracy using subset F. threshold human decision making with its capability to
of subset T, I, F was adapted using fuzzy C work from approximate reasoning and
mean algorithm. ultimately find a precise solution. It has 85%
exact diagnosis.
He concluded that Neutrosophy can reduce
over segmentation and gives a better Nicolae V et. al. [13] presented an approach
presentation on deafening and irregular that was ment to integrate computational
images gotten using other approach since the intelligence with clinical investigation
technic can handle ambiguity and methods and knowledge for glaucoma
indeterminacy better. Ming R et. al. [12] diagnosis. The knowledge acquired is
applied neutrosophy to image processing by embedded in a fuzzy logic inference system.
defining a-neutrosophic domain, which is The resulting neuro fuzzy glaucoma
termed by three subsets T, 1, and F. They diagnosis and prediction system is expected
then employ watershed algorithm to perform to lower the effort, difficulties and risk cost to
segmentation of the shape in the this disease. Voula and [6] presented a new
ncutrosophic domain. The experiments hybrid modeling methodology suitable for
showed that the plan method got better complex decision making procedures. Their
results when compared with the existing work extended previous work on competitive
method. fuzzy cognition maps for a medical decision
support system by complementing them with
Anil [3] in their, work titled "A soft case based reasoning methods.
computing paradigm for a medical data
mining tool to predict risk of coronary heart
events", they proposed a neuro-fuzzy soft The synergy of these methodologies is
computing data mining tool which was used accomplished by the planned decision
for the prediction of the severity of risk in support application that is suitable to handle
coronary heart events. The proposed system situations where decisions are not clearly
was based on medical data obtained from distinct. The methodology developed was
clinical observations performed on cardiac applied successfully to model and test two
patients. decision support systems, one a differential
diagnosis problem from speech pathology
area for diagnosis of language impairments
They used the concept of decision tree and the other decision making choices in
founded on 1D3 algorithm to classify the external beam radiation therapy.
feature that contributes towards the
diagnosis. The outcome is transformed into a Chrysostomos [10] presented an expert
crisp if then rules and then transformed into system for differential diagnosiserythemato-
fuzzy rule base. A neuro fuzzy method is squamous disease incorporating decisions
given to improve the fuzzy membership made by three classification algorithm:
function which/ outcomes in the improvement nearest neighbour classifier, naive Bayesian
of the old system. The efficiency of the newly classifier and voting feature interval 5. Their
generated expert system, prediction accuracy proposed system enables doctors to
is presented against the real life which was differentiate six types Erythemato-squamous
established to be operational. diseases using clinical and histopathological
parameters obtained from a patient-The
systein also gave explanation for the
Djain and Kimbi [7] proposed a medical classification of each classifier.
diagnostic support system for the
management of hypertension (MEDIAG).
Their proposed system diagnoses the Gt Venira [4] presented a comparison
possibility of the ailment and its severity by between Backpropagation and Naive Baye
applying fuzzy logic approach. MEDIAG Classifiers to diagnose Hepatitis disease. The
applied forward chaining method in making performance of proposed methods was
inferences and the root sum squares of selected for each classification tasks of

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 3


Available online at: www.engineeringjournal.info

hepatitis disease. The overall accuracy of that the present hospital systems existing in
diagnosis systems were 98% and 97% Portuguese hospitals is deficient of a reliable
respectively. decision support system factor to get better
solution. A standardization and
Sonawane [15], system ascertained that centralization of planning procedures which
program based Medical Decision Support progresses reliability of the operating theater
System (MDSS) will be very helpful for the and handles the fragile situation of the
medical professional due to its accurate and waiting lists for surgery was developed using
speedy decision making. They discoursed optimization methods and data mining.
that forecasting the presence of heart ailment
accurately, effects in lives saving. Sequential Schizophrenia spectrum disorder was
least optimization technique was use in diagnosis by a decision system created by
support of vector machine (SVM). Database Kahn R [11].The system have four-stages:
that exists at the Cleveland heart disease knowledge acquisition and organization, the
gotten from UCI repository was trained using development of computer-aided model, and
SMO methods. The result shown made us to evaluation of the overall system’s
know that the MDSS is able to diagnosis performance. The required knowledge is
faster the heart disease and produce a more mined from an professional via interviews.
correct result; it was also show excellent The knowledge was modeled by algorithms
capability to predict large dataset. and transmitted to a reckoning model formed
using covering approach. The performance
According to Amin SU [2] medical errors are valuation is based on comparison of several
harmful and costly. It leads to thousands of diagnoses of eighty one pieces clinical
mortality yearly. Hence, CDSS would give between DSS and an expert. The outcomes
chances to decrease medical errors so as to showed a comparatively small rate of
enhanced patient safety. The proponents misclassification (18-34%) also an excellent
started a reasonable examination of the performance or behaviour by the DSS with
behaviour and performance of the six CDSS 66-82% accuracy.
programs having diverse data mining
methods for the diagnosis of heart disease. It Shankar F et. al. [16] proposes a prediction
was concluded based on results that the system for liver disease using intelligent
system cannot identify treatment or machine learning. Researches provided
management options for patients with heart various data to identify the causes for
disease. hepatitis. Here, Decision tree technique was
applied to determine the structural
Sonawane [15] emphasized that decision information of tissues.C4.5 algorithm was
methods integral in operating theatre body used to generate decision tree that
are a lot subjected to investigation, which concentrates on some attributes such like
occasionally indicate a far results from age, sex, steroids, antivirals, etc for the
optimal. They additional uphold that the diagnosis of the disease. These features
pending surgery lists had constantly been a helped in defining the abnormalities of the
general problem, partly owing to the point sick person with Heart ailment.

Figure 1: overall prototype Of CDSSs (Source: Lincoln 2001)

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 4


Available online at: www.engineeringjournal.info

C5.0 combines several firsthand abilities was obtained from University of California
such as adaptable misclassification costs. In Irvine (UCI) Machine Learning Repository
C4.5, all faults are classified as alike; database.
practically some classification faults are more
severe than others. C5.0 allows a distinct This work developed a model for an
charge to be distinct for each real class pair; intelligent medical decision support system
if this choice is used, Then C5.0 makes that would aid human medical personnel in
classifiers to minimize anticipated accurate diagnosis and prediction of hepatitis
misclassification rates slightly than error B using C5.0 algorithm. It will develop a
occurrence. reliable guide for human physician and
medical personnel decision support system.
In an application that categorizes individuals
The proposed system will produce intelligent
as probable or not probable to "churn," C5.0
decision support system with ability to
efforts to minimize the weighted projected
decrease cost and the time of uncertainty
error rate.
diagnosing hepatitis problems.
C5.0 is very easy to use. Several possibilities
have been extended and simplified -- to The system provides efficient decision process
support sampling and cross-validation, that involves lots of vagueness and
example -- and C4.5's programs used in uncertainty management mostly in a
generating rule sets have been fused into a situation of multiple symptoms. Some of the
distinct program. benefits of the system includes that; it
The Windows form, See5 is very user-friendly minimizes the time constrain for data
with nice graphic interface. classification (It optimizes the system’s time
complexity), the proposed system uses a
C4.5 has a standard rule that induces command of memory magnitude less than
classification rules in the form of a decision C4.5 during rule set construction, the C5.0
tree. As an addition of ID3, usual criteria of rule sets formation have noticeably lower
selecting splitting attributes in C4.5 are error on unseen cases of various datasets,
information gain ratio, in place of using C5.0 expresses new processes for creating
information gain as that in ID3, information rule sets, and the development is substantial.
gain ratio avoids the bias of selecting
attributes with numerous values. The use of appropriate method enhances
Material and Method effectiveness and the efficiency in all
research work. We adopted the object
In this research an intensive discussion of a Oriented Analysis and Design Methodology
more accurate intelligent medical prediction in analysis of this intelligent system for
and decision support system model for hepatitis diagnoses and treatment. This
diagnosis and prediction of Hepatitis by methodology features the unified modeling
using signs and the symptoms of patients as language (UMLs): Case diagram, Activity
variable predictors and also making use of diagram, High Level Model and the
SEE 5.0 decision tree algorithm for architectural design of the newly introduces
classification. The model was well system. The used case diagram can be seen
implemented using real life sample data of in Figure 2
Hepatitis disease. The data for this research

Figure 2: The use case diagram of the proposed system

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 5


Available online at: www.engineeringjournal.info

The use case diagram of the proposed system with other nodes; will be help to properly
can be seen in figures 2 and consist of the diagnose hepatitis symptoms and
following components: treatment.
 Likelihood of Hepatitis issues: the
 C5.0 Algorithm: split the sample
intelligent rules developed are used to
established on the area that gives the
predict hepatitis cases and treatment.
highest information gain. Each subsample
distinguished by the first split is then split  Treatment of Hepatitis: the developed
again, usual; Based on a different field and intelligent rules for hepatitis diagnoses was
the process continues reputedly till the adopt to interpret the symptoms from the
subsample cannot be split anymore. client and the identify the treatment and
the type of hepatitis case such as hepatitis
 Training hepatitis dataset: Training
B, C, D or acute Hepatitis.
hepatitis dataset are the theoretical time
series information of various symptoms and  Intelligent decision provision system: the
the treatment of hepatitis diseases. outcomes of our proposed model will
provide intelligent guides for efficient
 Intelligent rules for hepatitis diagnoses:
medical decision for hepatitis treatment
Rule sets proposed in this work hold most
cases.
of significant information in C5.0 decision
tree however with less difficult model. The o The activity diagram of the proposed
rules obtain with C5.0 algorithm in union system can be seen in Fig. 3.

Figure 3: Activity diagram of the proposed system

The activity diagram of the planned system attributes and symptoms of hepatitis dataset.
as display in figure 3.3 gives the individual The entropy and information again of the
presentation of the varying component of the features were compute and the dataset were
proposed system. The raw hepatitis dataset been split using the dataset gain ratios. After
is gotten from the database. Data mining the training all the features of trained
model is used to process, integrate dataset, an intelligent rule based were
transformed and cleans the extracted raw develop to diagnose hepatitis cases for
hepatitis dataset. Then the C5.0 algorithm efficient treatment and decision quick
were adapt to thoroughly train the dataset to making. The architecture of the proposed
formulate rules using the obtained system is shown in Figure 4

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 6


Available online at: www.engineeringjournal.info

Figure 4: The architecture of the proposed system

The components of architecture of the been used to intelligently diagnose various


proposed system as seen in figure 4 are as cases of hepatitis cases.
follow:
 Ruled Based System: developed with C5.0
algorithm are used by the system to
 Hepatitis Symptoms: the hepatitis
efficiently treat any type hepatitis case
symptoms were used as input to test and
such as hepatitis A, B, C and acute
compute the proposed model.
hepatitis. This rules make prediction for
 C4.5 Algorithm: is a machine learning individual records. Rue sets are derived
algorithm were used to develop and from decision trees and, in a way, shows a
efficient intelligent system to control and simplified or distilled version of the
treat hepatitis cases. information seen in the decision tree.
 Validated by Domain Expert: the proposed  Display Advice to End User: The outcomes
intelligent system were been integrated of intelligent result for hepatitis treatment
with other component of the system by the were displayed to the client
experts.
 Knowledge Base: this contains the The high level model is shown in Figure 5
information store in the database that is

Figure 5: High level model of the proposed system

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 7


Available online at: www.engineeringjournal.info

The High level model of the proposed system mining Model”, “Data warehouse”, and “Data
as depicted in figure 5 comprises of “Data mining techniques”, “Knowledge Discoveries”
gathering from Central Database”, “Data and “Decision Support System”.

Table 1: Training dataset for hepatitis symptoms (Source University of California Irvine
(UCI) Machine Learning Repository database)
Clas AGE SEX STEROIDANTIVIRALS FATIGUE MALAISEANOREXIALIVER_BIG LIVER_FISPLRMEEN_PALPABLSPIEDERS ASCITES VARICES BILIRUBIN ALK_PHOSPHATE SGOT ALBUMIN PROTIME HISTOLOGY Die 54 Male yes no yes yes no no no no no yes no 1.2 85 92 3.1 66 no
Die 50 Male no no yes no no yes no no no no no 0.9 135 42 3.5 ? yes Live 57 Male yes no yes yes no no no no yes yes no 4.6 82 55 3.3 30 no
Die 54 Male no no no no no no no no no no no 1 85 30 4.5 0 no
Die 78 Male yes yes yes no no no no no no no no 0.7 96 32 4 ? yes Live 31 Male yes no yes yes yes no no yes no no no 8 ? 101 2.2 ? no
Die 31 Male no yes no no no no no no no no no 0.7 46 52 4 80 yes Die 48 Male no no yes yes yes no yes no yes no no 2 158 278 3.8 ? no
Die 34 Male yes no no no no no no no no no no 1 ? 200 4 ? yes Die 72 Male no yes yes no no no yes no no no no 1 115 52 3.4 50 no
Die 34 Male no no no no no no no no no no no 0.9 95 28 4 75 yes Live 38 Male yes no no no no no yes no no no no 0.4 243 49 3.8 90 no
Die 25 Male no no yes no no yes yes yes yes yes yes 1.3 181 181 4.5 57 no
Live 51 Male yes no yes no yes no no yes yes no no ? ? ? ? ? yes Die 51 Male no no no no no yes yes no yes no no 0.8 ? 33 4.5 ? no
Die 23 Male no no no no no no no no no no no 1 ? ? ? ? yes Die 38 Male no no no no no no yes no yes no yes 1.6 130 140 3.5 56 no
Die 39 Male no no yes no no no yes no no no no 0.7 ? 48 4.4 ? yes Live 47 Male no no yes yes no no yes no yes yes yes 1 166 30 2.6 31 no
Die 30 Male no no no no no no no no no no no 1 ? 120 3.9 ? yes Die 45 Male no yes no no no no no no no no no 1.3 85 44 4.2 85 no
Die 39 Male yes yes no no no yes yes no no no no 1.3 78 30 4.4 85 yes Die 36 Male yes no yes yes yes yes yes no yes no yes 1.7 295 60 2.7 ? no
Live 54 Male yes no yes yes no no yes yes no yes no 3.9 120 28 3.5 43 no
Die 32 Male no yes yes no no no yes no yes no no 1 59 249 3.7 54 yes Die 51 Male no no yes no no no yes yes yes no yes 1 ? 20 3 63 no
Die 41 Male no yes yes no no no yes no no no no 0.9 81 60 3.9 52 yes Live 49 Male yes no yes yes no no no yes yes no no 1.4 85 70 3.5 35 no
Die 30 Male no no yes no no no yes no no no no 2.2 57 144 4.9 78 yes Live 45 Male no no yes yes yes no no no yes yes no 1.9 ? 114 2.4 ? no
Die 47 Male yes yes no no no no no no no no no ? ? 60 ? ? yes Die 31 Male yes no yes no no no no no no no no 1.2 75 173 4.2 54 no
Live 41 Male no no yes no no no yes yes yes no yes 4.2 65 120 3.4 ? no
Die 38 Male yes no yes yes yes no no no no yes no 2 72 89 2.9 46 yes Live 70 Male yes no yes yes yes no no no no no no 1.7 109 528 2.8 35 no
Die 66 Male no no yes no no no no no no no no 1.2 102 53 4.3 ? yes Die 20 Male yes no no no no no no no no no no 0.9 89 152 4 ? no
Die 40 Male yes no yes no no no yes no no no no 0.6 62 166 4 63 yes Die 36 Male no no no no no no no no no no no 0.6 120 30 4 ? no
Die 38 Male yes no no no no no no no no no no 0.7 53 42 4.1 85 no Live 46 Male no no yes yes yes no no no yes yes yes 7.6 ? 242 3.3 50 no
Die 44 Male no no yes no no no yes no no no no 0.9 126 142 4.3 ? no
Die 38 Male yes yes no no no yes yes no no no no 0.7 70 28 4.2 62 yes
Die 61 Male yes no yes yes no yes yes no yes no no 0.8 75 20 4.1 ? no
Die 22 female no yes yes no no no no no no no no 0.9 48 20 4.2 64 yes Die 53 female yes no yes no no no no yes yes no yes 1.5 81 19 4.1 48 no
Die 27 Male no no yes yes yes yes yes yes yes no no 1.2 133 98 4.1 39 yes Live 43 Male no no yes no no no no yes yes yes no 1.2 100 19 3.1 42 no

The WEKA simulation result for Information Gain Attribute Ratio is shown in Figure 6.

Figure 6: Information gain attribute ratio

C5.0 model works by splitting the sample information gain from WEKA computation.
based on the field that provides the least The information gain value field is displayed
information gain as shown in Figure 7. in Figure 7.

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 8


Available online at: www.engineeringjournal.info

Figure 7: Information gain value field

The C5.0 algorithm slits training data examine the lowest level split, those sample
samples on basis of the highest information subsets that don’t have remarkable
gain value field. The sample subset that is contribution to the model will be rejected.
get from the former split will be split The knowledge denoted with decision tree
afterwards. The process will continue until could be mined and presented using IF
the sample subject cannot be split and is THEN rules. The decision rules for hepatitis
usually according to other field. Finally, dataset classification is shown in Figure 8.

Figure 8: C5.0 Decision tree for hepatitis classification and control

The intelligence of our proposed system is This application have been run and tested
well defined for proper diagnoses. successfully.
Experiments Carried Out
Rulebase for hepatitis classification.

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 9


Available online at: www.engineeringjournal.info

Table 2: Reveals the comparison of correctly categorized instances of C5.0 and C4.5

Table 2 shows correct classification of instances of C5.0 is 98.05% and C4.5 is


hepatitis datasets. It shows the comparison 94.81%, this implies that C5.0 is more
of correctly classified instances for C4.5, and resourceful than C4.5 algorithm. The results
C5.0 Algorithms. The correctly classified are signified in Figure 9.

Figure 9: Graphical representation of comparison of correctly classified instances for C4.5,


and C5.0.

Table 3: Comparison of time constrain of the process

Table 3 shows the comparison of time seconds and that of C4.5 is 0.05 seconds, this
constrains of process for C4.5, and C5.0 entails that C5.0 algorithm has minimum
machine learning algorithm. The time time complexity than C4.5. The results were
complexity of the process forC5.0 is 0.03 graphically shown in Figure 11.

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 10


Available online at: www.engineeringjournal.info

Figure 10: Graphical representation of the comparison of Time constrain of the process for
C5.0, and C4.5.

Table 4: Comparison of mean absolute error for C5.0 and C4.5 algorithm classification
process.

Table 4 shows the comparison of Mean 0.09%.This means C5.0 algorithm has the
Absolute Error for C5.0 and C4.5decision tree minimum mean absolute error of
categorization process. The Mean Absolute classification than C4.5 algorithms. The
Error for C5.0 is 0.04while that ofC4.5 is results were represented graphically in
Figure 11.

Figure 11: Comparison of mean absolute error for classification

Discussion of Results
In this work WEKA (simulator software for The intelligent system was developed to
data mining) was used to simulate C5.0 to interpret the symptoms of the hepatitis
develop a decision support system for diagnoses and treatment. Using the
hepatitis diagnoses and treatment. The simulator (WEKA) we obtain the attribute of
outcomes of the simulation were then used to a specific tree node using the C5.0
create an intelligent system (decision information pain, split information and gain
support) for treatment of hepatitis cases. ration for C5.0. The simulation result shows
100% of the training data was used with C5.0 that the Mean Absolute Error for C5.0 is 0.04
algorithm to design rules and pattern to while that of C4.5 is 0.09% (see Table 4). This
control various complicated cases of hepatitis implies that C5.0 algorithm has the
problems. minimum mean

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 11


Available online at: www.engineeringjournal.info

absolute error of classification than C4.5 Conclusion


algorithms. The time complexity of the
This work have successfully designed an
process for C5.0 is 0.03 seconds while that of
intelligent Medical decision support system
C4.5 is 0.05 seconds. This implies that C5.0
using C5.0 machine learning algorithm that
algorithm has minimum time complexity
will help to improve treatment of various
than C4.5 (see Table 3). The correctly
cases of Hepatitis challenges. The developed
classified instances of C5.0 is 98.05% while
intelligent system improved the uncertainty
C4.5 is 94.81%, this implies that C5.0 is more
in the estimation of the treatment of various
efficient than C4.5 algorithm (see Table 2)
cases of hepatitis. It provides a reliable guide
for hepatitis A and B treatment for
physicians and medical personnel.

References
1 Ahmed (2014) Neutrosophic and fuzzy C/ means Evans and V. Petal (Eds.), Cognitive Science
gathering for enhancing CT liver appearance. and Medicine: Biomedical Modeling, 11-255.
IEEE Transactions of Information Technology MIT Press, Cambridge, Massachusetts.
in Biomedicine, 11(4):101-113
10 Gt Venira, Emeksiz (2010) presented an expert
2 Amin SU, Agarwal K, Beg R (2013) Data system for differential diagnosis’, American
mining in clinical decision support system’. Journal of Computer Sciences and
International Journal of Advanced Research in Mathematics, 129(2):73-85
Computer Engineering & Technology
11 Kahn R, Perkins D, Lieberman J (2013)
(IJARCET), 2(1):56-67.
Predictors of treatment response in patients
3 Anil, Anithia (2015) "A soft computing with first-episode prostate cancer disorder. The
paradigm for a medical data mining tool to British Journal of Gynecology, 185(1):18-24.
predict risk of coronary heart events", The
12 Ming, R, Pale T (2010) Applied neutrosophy to
British Journal of Medical Science, 121(1):21-
image processing. International Journal of
32.
Advanced Research in Computer Engineering &
4 Bekir (2011) Comparison between Back Technology (IJARCET), 1(3):106-119.
propagation and Naive Baye Classifiers to
13 Nicolae V, Lath D (2012) an approach to
diagnose Hepatitis disease’, International
integrate computational intelligence with
Journal Computer Engineering and applied
clinical investigation methods and knowledge
Physics. 60(2):97-118
for glaucoma diagnosis’.American Journal of
5 Brause (2008) Physicians Roll in Clinical Applied Sciences, 179(2), 98-112
Decision Support Systems: State of Art.
14 Oguntimilehin A, Adetunmbi AO, Abiola OB
Rockville: AHRQ Publication 10(4):50-70.
(2015) “A Machine Learning Approach to
6 Chrysostomos (2008) A new hybrid modeling Clinical Diagnosis of Typhoic Fever”
methodology for complex decision making International Jounal of Computer and
procedures’. American Journal of Computer Information Technology, 2(4).
Sciences and Mathematics, 89(3):68-82
15 Sonawane, Vadicherla (2013) Program based
7 Djain and Kimbi (2011) proposed a medical Medical Decision Support System (MDSS).
diagnostic support system for the management IEEE Transactions of Information Technology
of hypertension’International Journal of in Biomedicine, 9(7):205-215
Advanced Research in Computer Engineering &
16 Shankar F, Gomes C, Brito A (2014) A
Technology (IJARCET), 1(5):126-147.
prediction system for liver disease using
8 Dragulescuand Albu (2007) Logical Inference intelligent machine learning Automation
System For detection of Hepatitis virus’Jounal Science and Engineering, IEEE Transactions on
of Computer and Information Technology, 1(3). Robotics & Control Systems, 99-112.
9 Evan DA, Gadd CS (2005) Managing coherence
and context in medical problem solving. In D.

LN Onyejegbu et. al.|June 2018| Vol.03 | Issue 06 |01-12 12

You might also like