Summary:

In evaluating the Iris dataset, I found it useful to refer to graphs of the samples. With the
Species column set to null, the samples can be clustered with k-means in R. In addition, using
the rattle package in RStudio, I computed the correlation between samples and the error matrix
for each model, i.e. decision tree, linear, and SVM. Finally, the results of the k-means
clustering are reported.
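
The clustering step described above can be sketched as follows. This is a minimal illustration, not the exact code behind the report: it assumes R's built-in iris data frame (whose columns are named Sepal.Length and so on, rather than the SepalLengthCm style of the Kaggle file), three clusters to match the three species, and an arbitrary random seed.

```r
# Built-in iris data; assumed to hold the same measurements as the Kaggle CSV,
# which additionally has an Id column and different column names.
data(iris)

# Leave Species out so the clustering sees only the four measurements.
features <- iris[, c("Sepal.Length", "Sepal.Width",
                     "Petal.Length", "Petal.Width")]

set.seed(42)  # arbitrary seed (assumption), only for reproducibility
km <- kmeans(features, centers = 3, nstart = 20)  # centers = 3 is an assumption

# Compare the clusters with the species labels that were held back.
print(table(Species = iris$Species, Cluster = km$cluster))
```

The cross-tabulation shows how closely the unsupervised clusters line up with the actual species. Cluster numbers are arbitrary labels, so only the grouping matters.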

Data set link: https://www.kaggle.com/uciml/iris/data


Attributes: 6 (Id, SepalLengthCm, SepalWidthCm, PetalLengthCm, PetalWidthCm, Species)

Samples: 150

Model accuracy/error rate:

K-means
Related work:

Hierarchical clustering
Decision tree
SVM

S.no   Model           Error rate (%)
1      Decision tree   4.5
2      Linear          0
3      SVM             4.5

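
An error rate like those in the table can be derived from a model's error matrix. The sketch below is an illustration under stated assumptions, not the rattle workflow used for the report: it uses the built-in iris data, the rpart package for the decision tree, a 70/30 train/test split, and an arbitrary seed, so its numbers need not match the table.

```r
library(rpart)  # decision-tree package bundled with standard R distributions

data(iris)
set.seed(42)  # arbitrary seed (assumption)

# Assumed 70/30 split into training and test rows.
train_idx <- sample(nrow(iris), size = round(0.7 * nrow(iris)))
train <- iris[train_idx, ]
test  <- iris[-train_idx, ]

# Fit a classification tree and predict the species of the test rows.
fit  <- rpart(Species ~ ., data = train, method = "class")
pred <- predict(fit, newdata = test, type = "class")

# Error matrix: rows are actual species, columns are predicted species.
error_matrix <- table(Actual = test$Species, Predicted = pred)
print(error_matrix)

# Overall error rate: share of off-diagonal counts, as a percentage.
error_rate <- 100 * (1 - sum(diag(error_matrix)) / sum(error_matrix))
print(round(error_rate, 1))
```

The diagonal of the error matrix counts correct predictions, so everything off the diagonal is a misclassification.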
Decisional Attributes:

PetalLengthCm, PetalWidthCm, SepalWidthCm

Accuracy of Decisional Attributes:

PetalLengthCm – 95.2%

PetalWidthCm – 98.5%

SepalWidthCm – 96.9%
