Welcome to Scribd!

# Author - Jyotiraditya Ghatage # Date - 25th Aug 2021 # Title - Decision Tree (Gini Index)

Uploaded by

0% found this document useful (0 votes)

9 views3 pages

The document describes building a decision tree classifier model to predict diabetes using patient data on age and blood pressure. It loads data, separates features from labels, trains a model on 80% of the data and tests it on the remaining 20%. The model achieves a 91% accuracy on the test data as measured by confusion matrix analysis.

Original Description:

Original Title

B25_Expt-3_DT (1)

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

9 views3 pages

# Author - Jyotiraditya Ghatage # Date - 25th Aug 2021 # Title - Decision Tree (Gini Index)

Uploaded by

Jyotiraditya Ghatage

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 3

Search inside document

9/10/2021 DMDW_Expt-3_DT.

ipynb - Colaboratory

# Author - jyotiraditya ghatage

# Date - 25th Aug 2021
# Title -Decision Tree (Gini Index)

# Step 1 :Load the libraries

import numpy as np
import pandas as pd
import seaborn as sns

from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier # model name with Camel Case
from sklearn import metrics

# Step 2 Load labelled data
#(input feature x=2 (age, bp); output label y: 1(diabetes))

df = pd.read_csv("/content/Decision-Tree-Classification-Data.csv")

# understand dataset (No. of samples: 0 to 986 = 987 )
# 100% - 80%(training) , 20%(testing)
# 987 - 789(training), 198(testing)

df.head()

age bp diabetes

0 65 65 1

1 45 82 0

2 35 73 1

3 45 90 0

4 50 68 1

df.tail()

age bp diabetes

982 45 87 0

983 40 83 0

984 40 83 0

985 40 60 1

986 45 82 0

# seperate features(x: age, bp) from labels (y: diabetes)
# x -
https://colab.research.google.com/drive/1ZOq3uBmMQ1TtJ6vHMdAH2TgXgp4xVsHs?authuser=1#scrollTo=r5s1AWxcl1qU&printMode=true 1/3
9/10/2021 DMDW_Expt-3_DT.ipynb - Colaboratory
# x
x = df.drop("diabetes",axis = 1) # age, bp
y = df.diabetes # diabetes

x.head()

age bp

0 65 65

1 45 82

2 35 73

3 45 90

4 50 68

y.head()

0 1

1 0

2 1

3 0

4 1

Name: diabetes, dtype: int64

# Adequate model fitting (80%,20%) avoid overfiiting, underfitting
x_train, x_test,y_train, y_test = train_test_split(x,y,test_size=0.20,random_state=15)
# random_state(to shuffle the data), test_size(percent of test cases)

x_train.shape # training : 789

(789, 2)

x_test.shape # testing : 198

(198, 2)

# Model building/model training/model creation

model = DecisionTreeClassifier()
model.fit(x_train,y_train)

DecisionTreeClassifier(ccp_alpha=0.0, class_weight=None, criterion='gini',

max_depth=None, max_features=None, max_leaf_nodes=None,

min_impurity_decrease=0.0, min_impurity_split=None,

min_samples_leaf=1, min_samples_split=2,

min_weight_fraction_leaf=0.0, presort='deprecated',

random_state=None, splitter='best')

# Model testing
y_predict=model.predict(x_test)

https://colab.research.google.com/drive/1ZOq3uBmMQ1TtJ6vHMdAH2TgXgp4xVsHs?authuser=1#scrollTo=r5s1AWxcl1qU&printMode=true 2/3
9/10/2021 DMDW_Expt-3_DT.ipynb - Colaboratory

accuracy = (metrics.accuracy_score(y_test,y_predict))*100

print(accuracy)

91.41414141414141

from sklearn.metrics import confusion_matrix
cm = confusion_matrix(y_test,y_predict)
print(cm)
sns.heatmap(cm,annot=True)

[[93 4]

[13 88]]

<matplotlib.axes._subplots.AxesSubplot at 0x7fb2ff5211d0>

check 0s completed at 8:32 PM

https://colab.research.google.com/drive/1ZOq3uBmMQ1TtJ6vHMdAH2TgXgp4xVsHs?authuser=1#scrollTo=r5s1AWxcl1qU&printMode=true 3/3

GCMS Reference Manual 28 May 2015
Document445 pages
GCMS Reference Manual 28 May 2015
Bom
100% (2)
Statisitics Project 6
Document48 pages
Statisitics Project 6
AMAN PRAKASH
100% (2)
935 Robert Bosch Interview Questions in C A Micro Controllers PDF
Document2 pages
935 Robert Bosch Interview Questions in C A Micro Controllers PDF
dhapra
0% (1)
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
Document8 pages
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
vince.lachica
No ratings yet
Logistic - Ipynb - Colaboratory
Document6 pages
Logistic - Ipynb - Colaboratory
Akansha Uniyal
No ratings yet
Charmi Shah 20bcp299 Lab2
Document7 pages
Charmi Shah 20bcp299 Lab2
Princy
100% (1)
Logistic Regression
Document10 pages
Logistic Regression
C T
No ratings yet
Tensorflow Logistic Regression
Document10 pages
Tensorflow Logistic Regression
C T
No ratings yet
Generative AI Binary Classification
Document7 pages
Generative AI Binary Classification
Cyborg Ultra
No ratings yet
SVM - RF - Diabetes - CSV - 26 - 6 - 2023.ipynb - Colaboratory
Document8 pages
SVM - RF - Diabetes - CSV - 26 - 6 - 2023.ipynb - Colaboratory
utsavarora1912
No ratings yet
Heart: Our "Goal" Predict The Presence of Heart Disease in The Patient
Document73 pages
Heart: Our "Goal" Predict The Presence of Heart Disease in The Patient
aditya b
100% (1)
Importing Packages: Id Label Tweet 0 1 2 3 4
Document8 pages
Importing Packages: Id Label Tweet 0 1 2 3 4
rajat raina
No ratings yet
Linear Regression
Document10 pages
Linear Regression
WONDYE DESTA
No ratings yet
Assignment: Name: Md. Nasim Uddin ID: 15162103276 Intake: 32 Section: 07
Document8 pages
Assignment: Name: Md. Nasim Uddin ID: 15162103276 Intake: 32 Section: 07
Md Nasim
No ratings yet
Basri 14002164 Tugas2-Binary PDF
Document4 pages
Basri 14002164 Tugas2-Binary PDF
Basri
100% (1)
NF Assighment4
Document5 pages
NF Assighment4
Abdul Moaid
No ratings yet
Name: MANOGNA GV Email Id: Major Project: Diabetes Prediction Let's Import Required Libraries!
Document4 pages
Name: MANOGNA GV Email Id: Major Project: Diabetes Prediction Let's Import Required Libraries!
Manogna Gv
No ratings yet
Haberman Datasets Analysis - Ipynb - Colaboratory
Document13 pages
Haberman Datasets Analysis - Ipynb - Colaboratory
Shyamal Hazarika
No ratings yet
Saurabh Verma 9919102005
Document11 pages
Saurabh Verma 9919102005
Yogendra pratap Singh
100% (1)
Diabetic Prediction Using LogicalRegression
Document9 pages
Diabetic Prediction Using LogicalRegression
Yagnesh Vyas
No ratings yet
IRIS BPNN - Ipynb - Colaboratory
Document4 pages
IRIS BPNN - Ipynb - Colaboratory
rwn data
100% (1)
4-10 Aiml
Document25 pages
4-10 Aiml
Guna Seelan
No ratings yet
Assignment 1
Document6 pages
Assignment 1
Abhineet Kumar mm22m006
No ratings yet
Machine Learning Splitting Data To Train Test
Document2 pages
Machine Learning Splitting Data To Train Test
Chanchal jain
No ratings yet
Mla - 2 (Cia - 3) - 20221013
Document21 pages
Mla - 2 (Cia - 3) - 20221013
JEFFREY WILLIAMS P M 20221013
No ratings yet
C2M4 - Assignment: 1 Cox Proportional Hazards and Random Survival Forests
Document18 pages
C2M4 - Assignment: 1 Cox Proportional Hazards and Random Survival Forests
Sarah Mendes
No ratings yet
ML Lab6.Ipynb - Colaboratory
Document5 pages
ML Lab6.Ipynb - Colaboratory
Avi Srivastava
100% (1)
E21CSEU0770 Lab4
Document4 pages
E21CSEU0770 Lab4
kumar.nayan26
No ratings yet
Dsbda 4
Document4 pages
Dsbda 4
Arbaz Shaikh
No ratings yet
Ii Avaliação Parcial - Ia - 25.0-Gabarito
Document9 pages
Ii Avaliação Parcial - Ia - 25.0-Gabarito
Pedro Carvalho
No ratings yet
BTVN1 - Colaboratory
Document4 pages
BTVN1 - Colaboratory
Tam Nguyen Thi
No ratings yet
Au953721103009 Font
Document26 pages
Au953721103009 Font
tommyshelby.gr1
No ratings yet
LAB4
Document5 pages
LAB4
dam huu khoa
No ratings yet
ML Assigment 4
Document6 pages
ML Assigment 4
Talha Khan
No ratings yet
Laboratorio Regresión Logística - Colaboratory Grupo 2
Document7 pages
Laboratorio Regresión Logística - Colaboratory Grupo 2
Priscila Flores
No ratings yet
Brain Tumor Classification
Document12 pages
Brain Tumor Classification
Ultra Bloch
100% (1)
Diabetes Case Study - Jupyter Notebook
Document10 pages
Diabetes Case Study - Jupyter Notebook
Abhising
100% (1)
EDA Assignment
Document15 pages
EDA Assignment
degaci
No ratings yet
Linear - Regression - Ipynb - Colaboratory
Document4 pages
Linear - Regression - Ipynb - Colaboratory
avnimote121
No ratings yet
ANANYAA GUPTA 20BCT0177 ML MTT 24/11/21 Q3 Breast Cancer Dataset
Document4 pages
ANANYAA GUPTA 20BCT0177 ML MTT 24/11/21 Q3 Breast Cancer Dataset
Ananyaa Gupta
No ratings yet
4 Exploratory Data Analysis.
Document1 page
4 Exploratory Data Analysis.
Shubham Tagalpallewar
No ratings yet
Logistic Regression
Document8 pages
Logistic Regression
Nipuni
No ratings yet
Parcial2-Javier Cardenas
Document7 pages
Parcial2-Javier Cardenas
Javier Cardenas
No ratings yet
Haberman Data Set Ed A
Document10 pages
Haberman Data Set Ed A
Varun Akuthota
No ratings yet
Lab10 Regression Evaluation Methods
Document5 pages
Lab10 Regression Evaluation Methods
iffi khan
No ratings yet
ML 7
Document6 pages
ML 7
pratikn1406
No ratings yet
Group Work Assignment Supervised and Unsupervised Learning
Document10 pages
Group Work Assignment Supervised and Unsupervised Learning
Daren Walace
No ratings yet
Adipose - Tissue - Prediction - Jupyter Notebook
Document8 pages
Adipose - Tissue - Prediction - Jupyter Notebook
Vrushali Vishwasrao
No ratings yet
R Project 1
Document36 pages
R Project 1
AlvinBurhani
No ratings yet
Practical Machine Learning
Document11 pages
Practical Machine Learning
minhajur rahman
No ratings yet
Lab 3. Linear Regression 230223
Document7 pages
Lab 3. Linear Regression 230223
ruso
100% (1)
Vinay Kumar Kannegala Siddalingappa Marks: 43/52
Document12 pages
Vinay Kumar Kannegala Siddalingappa Marks: 43/52
vinay kumar
No ratings yet
Logistic Multiclass Classification
Document2 pages
Logistic Multiclass Classification
jaymehta1444
No ratings yet
Labpractice 2
Document29 pages
Labpractice 2
Rajashree Das
100% (2)
20AI16 - ML Record
Document24 pages
20AI16 - ML Record
Menma
No ratings yet
All All: % (A) Construct Side-By-Side Stem-And-Leaf Plots
Document34 pages
All All: % (A) Construct Side-By-Side Stem-And-Leaf Plots
JASHWIN GAUTAM
No ratings yet
Diabetes Model
Document44 pages
Diabetes Model
sasda
100% (1)
Logistic Regression For Binary Classification With Core APIs - TensorFlow Core
Document22 pages
Logistic Regression For Binary Classification With Core APIs - TensorFlow Core
zwd.slmn
No ratings yet
Naive Bayes Project
Document5 pages
Naive Bayes Project
Night Music
No ratings yet
Estiven - Hurtado.Santos - Regresión Con Varios Algoritmos
Document16 pages
Estiven - Hurtado.Santos - Regresión Con Varios Algoritmos
Estiven Hurtado Santos
No ratings yet
Linear and Multilinear Regression
Document5 pages
Linear and Multilinear Regression
Harisankar R N R
No ratings yet
Image Processing And Acquisition Using Python
From Everand
Image Processing And Acquisition Using Python
successkpk
No ratings yet
10T SRAM Computing-in-Memory Macros For Binary and
Document15 pages
10T SRAM Computing-in-Memory Macros For Binary and
그랬구나
No ratings yet
Whatman Price Catalog: GE Healthcare Life Sciences
Document94 pages
Whatman Price Catalog: GE Healthcare Life Sciences
Gayan Karunasena Konara
No ratings yet
DC RG
Document16 pages
DC RG
John Rojas
No ratings yet
Assignment Co Operations
Document20 pages
Assignment Co Operations
yasirism
100% (2)
ArmCAD 2005 Handbuch Englisch
Document521 pages
ArmCAD 2005 Handbuch Englisch
Goran Mrkela
No ratings yet
Introduction To Matlab
Document45 pages
Introduction To Matlab
SureshCool
No ratings yet
Command Reference: Optima/Econo DMC-2xxx Series
Document263 pages
Command Reference: Optima/Econo DMC-2xxx Series
Animesh Ghosh
No ratings yet
Land Development Banks PDF
Document9 pages
Land Development Banks PDF
Tapesh Awasthi
No ratings yet
Baker Bill Rosa 1973 Mexico PDF
Document21 pages
Baker Bill Rosa 1973 Mexico PDF
the missions network
No ratings yet
Dotnet - Interview Question and Answer's
Document395 pages
Dotnet - Interview Question and Answer's
Prashanth Vamani
No ratings yet
Mandago - 2018 - Green Reward Compensation - Environmental Sustainability
Document2 pages
Mandago - 2018 - Green Reward Compensation - Environmental Sustainability
Amelia
No ratings yet
Palawan Branches
Document28 pages
Palawan Branches
xdmhundz999
0% (1)
Reso Sympathy
Document2 pages
Reso Sympathy
sangguniang
No ratings yet
Use To Show An Exact Time: - Two O'clock - Midnight / Noon - The Moment, Etc
Document3 pages
Use To Show An Exact Time: - Two O'clock - Midnight / Noon - The Moment, Etc
Kasira Pammpers
No ratings yet
Bam 200 Sas #19
Document7 pages
Bam 200 Sas #19
allia Lopez
No ratings yet
Statement of Purpose
Document1 page
Statement of Purpose
Engr Mubashir Mukhtar
No ratings yet
Consolidated Invoice - Nexa Equity - Project Scuba
Document3 pages
Consolidated Invoice - Nexa Equity - Project Scuba
rhenke
No ratings yet
ABLE Contract Approval.
Document5 pages
ABLE Contract Approval.
Ferris Ferris
No ratings yet
KP53V85 Tech Manual
Document106 pages
KP53V85 Tech Manual
richiegran
No ratings yet
Business Plan Hindi Pa Final
Document10 pages
Business Plan Hindi Pa Final
Maria Theresa Cortez Mendoza
No ratings yet
Part 1 - Clinical Manual - January 2018 - Version 8.0
Document260 pages
Part 1 - Clinical Manual - January 2018 - Version 8.0
Abhishek
No ratings yet
Current Social Issues in The Philippines
Document1 page
Current Social Issues in The Philippines
Mr. Fifth
No ratings yet
Hcil - Honda Cars Interview Call Letter
Document3 pages
Hcil - Honda Cars Interview Call Letter
Neha Sharma
No ratings yet
Research Paper On Emotional Stability
Document8 pages
Research Paper On Emotional Stability
egw48xp5
100% (1)
Case Study BRI
Document44 pages
Case Study BRI
iambadass
No ratings yet
Omega - 8500 8501 8900 8901 - E Technical Guide
Document34 pages
Omega - 8500 8501 8900 8901 - E Technical Guide
Hugo Beraldo
No ratings yet
Speculative Application PHD CH
Document2 pages
Speculative Application PHD CH
Kalki kk
No ratings yet