Professional Documents
Culture Documents
Camera Ready
Camera Ready
Camera Ready
Algorithm in Weka
Dr.Neeraj Bhargava
School of system sc. &
engg.
professor
MDS UniversityAjmer,
India
profneerajbhargava@g
mail.com
Sonia Dayma
School of system sc. &
engg.
MDS UniversityAjmer,
India
soniadayma786@gmai
l.com
DM,
Algo,
Predictor,Weka
tool
EASE OF USE
INTRODICTION
Abishek Kumar
School of system sc. &
engg.
MDS University Ajmer
India
ap481998@gmail.com
III.
Pramod Singh
School of system sc. &
engg.
MDS University Ajmer
India
pramodrathore88@gm
ail.com
MTEODOLOGY
Age
Chest Pain
Nominal
Rest b press
Blood Sugar
Numeric
Nominal
Rest Electro
Nominal
Nominal
Min=20-40
Avg=41-60
Upper=61-80
Asympt
atyp_angina
non_anginal
typ_angina
Good=120/80 mm
Average=129-139
Poor=179/90
F=False, T=True
Normal,
left_vent_hyper,
st_t_wave_abnorm
ality
A<=100
B>100&<=150
C>150
Nominal
Yes, no
Nominal
Positive
negative
CART
Figure1. AGE
Figure 2. CHEST_PAIN
0.08
167
42
64.1148%
IV.
Figure5. Rest_electro
Figure7. Exercice_angina
Figure 4. Blood_sugar
Figure 6. max_heart_rate
Figure 8. Disease
Value
RESULT
A. Final Result
When algorithm is applied to dataset, then the result is
produced, i.e. shown in figure 3.1. It consist the information of
dataset analysis such as information about total instances,
classified and unclassified instances, classification accuracy
measures, detailed accuracy measures and confusion matrix
etc.
=== Run information ===
Scheme:weka.classifiers.trees.SimpleCart -S 1 -M 2.0 -N 5 -C
1.0
Relation: heart_disease_male
Instances: 209
Attributes: 8
| exercice_angina=(no)
age
| | age=(Avg)|(Min)
chest_pain
| | | max_heart_rate=(A)|(B): negative(17.0/10.0)
rest_bpress
| | | max_heart_rate!=(A)|(B)
blood_sugar
| | | | rest_bpress=(Poor): negative(3.0/1.0)
rest_electro
| | | | rest_bpress!=(Poor): positive(7.0/1.0)
max_heart_rate
| | age!=(Avg)|(Min): positive(7.0/2.0)
exercice_angina
| exercice_angina!=(no): positive(54.0/6.0)
disease
Number of Leaf Nodes: 6
Test mode:10-fold cross-validation
Size of the Tree: 11
=== Classifier model (full training set) ===
CART Decision Tree
Time taken to build model: 0.08 seconds
chest_pain=(atyp_angina)|(non_anginal): negative(88.0/13.0)
=== Stratified cross-validation ===
chest_pain!=(atyp_angina)|(non_anginal)
=== Summary ===
Correctly Classified Instances
167
79.9043 %
Incorrectly Classified Instances
42
20.0957 %
Kappa statistic
0.5913
Mean absolute error
0.2779
Figure9.
Root Result
mean squared error
0.3869
Relativefigure9
absoluteshows
error the relation
56.3624
%
The above
information
including
Root
relative
squared
error
77.9345
% cross validation
name, instances and attributes. The 10 fold
209 equally and then the
meansTotal
theNumber
datasetofisInstances
split into 10 slices
Negative
FP rate = FP/ (FP+TN)
=>Positive
TP=70
TN=97
* Recall= TP rate
FN=22
FP=20
B. Stratified Cross-Validation
Stratified Cross-Validation 3)
is consisting of some
essential classification accuracy measures like kappa statistic,
MAE (Mean Absolute Error), RMSE (Root Mean Squared
error) etc. These measures show the accuracy factors of
algorithm.
These rules are generated by JRIP rules and they are helpful to
take final decision by the machine. These are not final or exact
result, these may vary on other machine or as dataset changes.
The decision tree generated by the classifier is depended on
these rules.
The Rule 1: If (Exercice_angina=yes) Then Disease=
Positive.
Rule
3:
If
(Chest_pain=typ_angina)
Disease=Positive (in Some Cases)
Then
Rule
4:
If
(Chest_pain=typ_angina)
Disease=Negative (in Many Cases).
Then
F. Tree Visualization
In above section, all the rules are interprated. These rules are
visualized in the tree form. The tree visualization is the
simplest method to understand the conditions and their results.
[2]