
MNIST Handwritten Digit Recognition

Importing libraries
In [1]:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
#%matplotlib inline

Loading the MNIST datasets


In [2]:
data_df = pd.read_csv("data.csv")
#test_df = pd.read_csv("test.csv")
In [3]:
data_df.head()
Out[3]:
   label  pixel0  pixel1  pixel2  ...  pixel781  pixel782  pixel783
0      1       0       0       0  ...         0         0         0
1      0       0       0       0  ...         0         0         0
2      1       0       0       0  ...         0         0         0
3      4       0       0       0  ...         0         0         0
4      0       0       0       0  ...         0         0         0

5 rows × 785 columns

In [4]:
#test_df.head()

Using train.csv as the complete data

For both training and testing we will use train.csv (read above as data.csv
into data_df), treating it as the complete dataset and splitting it into
train and test sets ourselves later.
In [5]:
data_df.shape
Out[5]:
(42000, 785)
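Each row holds one image: a label column followed by 784 pixel columns, since the 28×28 images are flattened row by row. A minimal sanity-check sketch (the column names assume the standard Kaggle MNIST CSV layout):

# one label column plus 28*28 flattened pixel columns
assert data_df.shape == (42000, 28 * 28 + 1)
assert data_df.columns[0] == "label"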

Data Preparation for Model Building


In [6]:
y=data_df['label']
x=data_df.drop('label',axis=1)
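Pixel intensities range from 0 to 255. Dividing every feature by the same constant does not change KNN's neighbor ordering, but scaling to [0, 1] is a common convention if other models are tried later; a minimal optional sketch (not applied in this notebook):

# optional: scale pixel intensities from [0, 255] to [0, 1];
# uniform scaling leaves KNN's nearest-neighbor ordering unchanged
x_scaled = x / 255.0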
In [7]:
#x_for_test_data=test_df[:]
In [8]:
type(x)
Out[8]:
pandas.core.frame.DataFrame
In [9]:
plt.figure(figsize=(7,7))
some_digit = 1266                                   # index of an example image
some_digit_image = x.iloc[some_digit].to_numpy()    # flattened 784-pixel row
plt.imshow(np.reshape(some_digit_image, (28, 28)))  # reshape back to 28x28
print(y[some_digit])                                # its label
6
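To inspect more than one example at a time, a small grid of random digits can be plotted; a minimal sketch using only the libraries already imported (the grid size and random seed are arbitrary choices):

# plot a 3x3 grid of randomly chosen digits with their labels
fig, axes = plt.subplots(3, 3, figsize=(6, 6))
rng = np.random.default_rng(0)
for ax, idx in zip(axes.ravel(), rng.choice(len(x), size=9, replace=False)):
    ax.imshow(x.iloc[idx].to_numpy().reshape(28, 28), cmap="gray")
    ax.set_title(int(y.iloc[idx]))
    ax.axis("off")
plt.tight_layout()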
In [10]:
sns.countplot( x='label', data=data_df)
Out[10]:
<AxesSubplot:xlabel='label', ylabel='count'>
The counts per digit are roughly equal, so we can conclude that our dataset is balanced.
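The balance can also be checked numerically; a minimal sketch:

# class proportions; a balanced dataset shows roughly 0.1 per digit
print(data_df["label"].value_counts(normalize=True).sort_index())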

Splitting the train data into train and test


In [11]:
from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size = 0.30,
random_state = 40)
In [12]:
x_train.shape,y_train.shape,x_test.shape,y_test.shape
Out[12]:
((29400, 784), (29400,), (12600, 784), (12600,))
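Because the classes are balanced, a plain random split works well here. For skewed data, the stratify argument keeps the digit proportions identical in both splits; a minimal alternative sketch:

# stratified variant: each digit appears in x_train and x_test
# in the same proportion as in the full dataset
x_train, x_test, y_train, y_test = train_test_split(
    x, y, test_size=0.30, random_state=40, stratify=y
)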

Models
KNN
In [13]:
#from sklearn.preprocessing import StandardScaler
#scaler = StandardScaler()
#scaler.fit(x_train,y_train)
#x_train = scaler.transform(x_train)
#x_train.shape
k = 3  # number of neighbors used in the classifier below
In [14]:
from sklearn.neighbors import KNeighborsClassifier
classifier = KNeighborsClassifier(n_neighbors = 3)
classifier.fit(x_train, y_train)
Out[14]:
KNeighborsClassifier(n_neighbors=3)
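The choice k = 3 is fixed by hand above; a small cross-validated search could confirm it. A minimal sketch (KNN is slow on 29,400 samples, so a 5,000-sample subset is used; the subset size and candidate list are arbitrary assumptions):

from sklearn.model_selection import cross_val_score

# compare a few candidate k values on a subset for speed
x_sub, y_sub = x_train.iloc[:5000], y_train.iloc[:5000]
for k in (1, 3, 5, 7):
    scores = cross_val_score(KNeighborsClassifier(n_neighbors=k), x_sub, y_sub, cv=3)
    print(k, scores.mean())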
In [15]:
y_pred = classifier.predict(x_test)
y_pred
C:\ProgramData\Anaconda3\lib\site-packages\sklearn\neighbors\_classification.py:228:
FutureWarning: Unlike other reduction functions (e.g. `skew`, `kurtosis`), the
default behavior of `mode` typically preserves the axis it acts along. In SciPy
1.11.0, this behavior will change: the default value of `keepdims` will become
False, the `axis` over which the statistic is taken will be eliminated, and the
value None will no longer be accepted. Set `keepdims` to True or False to avoid
this warning.
  mode, _ = stats.mode(_y[neigh_ind, k], axis=1)
Out[15]:
array([0, 2, 1, ..., 2, 4, 7], dtype=int64)
In [16]:
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
print(accuracy_score(y_test, y_pred))
0.9636507936507936
In [17]:
print(classification_report(y_test, y_pred))
              precision    recall  f1-score   support

           0       0.97      0.99      0.98      1236
           1       0.96      1.00      0.98      1370
           2       0.98      0.96      0.97      1252
           3       0.95      0.96      0.95      1369
           4       0.97      0.96      0.97      1215
           5       0.95      0.95      0.95      1132
           6       0.97      0.99      0.98      1216
           7       0.96      0.96      0.96      1326
           8       0.98      0.92      0.95      1197
           9       0.94      0.94      0.94      1287

    accuracy                           0.96     12600
   macro avg       0.96      0.96      0.96     12600
weighted avg       0.96      0.96      0.96     12600

In [18]:
print(confusion_matrix(y_test, y_pred))
[[1224 0 2 0 0 1 6 0 1 2]
[ 0 1364 0 0 0 0 2 2 1 1]
[ 5 10 1204 6 1 1 2 18 2 3]
[ 3 4 6 1315 0 22 1 7 9 2]
[ 2 12 1 0 1165 0 5 1 0 29]
[ 3 1 1 27 2 1075 16 0 2 5]
[ 10 1 0 0 1 3 1201 0 0 0]
[ 1 17 5 0 0 0 0 1278 1 24]
[ 5 8 7 25 11 21 2 4 1102 12]
[ 7 5 1 12 18 6 0 21 3 1214]]
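The raw matrix is easier to read as a heatmap; a minimal sketch reusing the seaborn import from the top of the notebook:

# heatmap of the confusion matrix; off-diagonal cells are misclassifications
cm = confusion_matrix(y_test, y_pred)
plt.figure(figsize=(8, 6))
sns.heatmap(cm, annot=True, fmt="d", cmap="Blues")
plt.xlabel("predicted label")
plt.ylabel("true label")
plt.show()

The largest confusions are 5 vs 3 (27 and 22 errors), 4 predicted as 9 (29), and 7 predicted as 9 (24), which fits the visual similarity of those digit shapes.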
In [19]:
#y_pred_on_test_data = classifier.predict(x_for_test_data)
#y_pred_on_test_data
