
Deep Learning

Lecture-4
Dr. Abdul Jaleel
Associate Professor
Handwritten digits classification
using neural network
Handwritten Digits

MNIST Handwritten Digit Classification Dataset


Handwritten Digits Classification:
A Simple Neural Network
Handwritten Digits as Input
Handwritten Digits as Input: Array Flattening
Handwritten Digits as Input: 28x28 Array
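The titles above refer to flattening: each 28x28 grid of pixel values is stretched into a single 784-element vector before it is fed to a fully connected network. A minimal NumPy sketch of that reshaping (the random image here is only a stand-in for a real MNIST sample):

import numpy as np

# A single handwritten digit is a 28x28 grid of pixel intensities (0-255).
image = np.random.randint(0, 256, size=(28, 28), dtype=np.uint8)  # stand-in for X_train[0]

# Flatten the grid into a 784-element vector for a Dense input layer.
flat = image.reshape(-1)
print(image.shape, flat.shape)  # (28, 28) (784,)

# For a whole dataset of shape (n_samples, 28, 28):
# X_flat = X_train.reshape(len(X_train), 28 * 28)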
Handwritten digits classification
using neural network

(Python Implementation)

1) We will first classify handwritten digits using a simple neural network
that has only input and output layers.

2) We will then add a hidden layer and
see how the performance of the model improves.
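A minimal sketch of loading and preparing the data with Keras (this uses the standard keras.datasets.mnist loader; the variable names are my own, not necessarily those used in the lecture):

from tensorflow import keras

# Load the MNIST handwritten-digit dataset: 60,000 training and 10,000 test images.
(X_train, y_train), (X_test, y_test) = keras.datasets.mnist.load_data()

# Pixel values are 0-255; scale them to 0-1 so training converges more easily.
X_train = X_train / 255.0
X_test = X_test / 255.0

print(X_train.shape)  # (60000, 28, 28)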
A sample training image viewed as a NumPy array: a 28x28 grid of dtype uint8 pixel intensities, where 0 is background and values up to 255 trace the strokes of the digit. (Full 28x28 array output omitted.)
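To confirm that such an array really encodes a digit, one sample can be displayed as an image. A small sketch, assuming X_train and y_train from the loading step above:

import matplotlib.pyplot as plt

# Render one training image and print its label.
plt.matshow(X_train[0])
plt.show()
print(y_train[0])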
Very simple neural network with no hidden layers
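A minimal sketch of this no-hidden-layer model, continuing from the loading step above (the layer size and sigmoid activation follow the slide's description; the exact settings used in the lecture may differ):

from tensorflow import keras

# Flatten the 28x28 images into 784-element vectors for the Dense layer.
X_train_flat = X_train.reshape(len(X_train), 28 * 28)
X_test_flat = X_test.reshape(len(X_test), 28 * 28)

# One Dense layer maps the 784 inputs directly to 10 output neurons (digits 0-9).
model = keras.Sequential([
    keras.layers.Dense(10, input_shape=(784,), activation='sigmoid')
])
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
model.fit(X_train_flat, y_train, epochs=5)
model.evaluate(X_test_flat, y_test)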
Neural Network Using a Hidden Layer
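A sketch of the same model with one hidden layer added (the 100-neuron ReLU hidden layer is an assumption; the evaluation output below comes from the original slides, not from running this sketch):

# Insert a hidden layer of 100 ReLU neurons between the inputs and the output layer.
model = keras.Sequential([
    keras.layers.Dense(100, input_shape=(784,), activation='relu'),
    keras.layers.Dense(10, activation='sigmoid')
])
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
model.fit(X_train_flat, y_train, epochs=10)
model.evaluate(X_test_flat, y_test)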
313/313 [==============================] - 0s 1ms/step - loss: 0.0966 - accuracy: 0.9716
[0.09658893942832947, 0.9715999960899353]
Using a Flatten layer so that we don't have to call .reshape on the input dataset
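A sketch of the same network with a Flatten layer in front, so the raw 28x28 images can be passed in without reshaping them ourselves (hidden-layer size again assumed):

# Flatten reshapes each 28x28 image to a 784-vector inside the model itself.
model = keras.Sequential([
    keras.layers.Flatten(input_shape=(28, 28)),
    keras.layers.Dense(100, activation='relu'),
    keras.layers.Dense(10, activation='sigmoid')
])
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
model.fit(X_train, y_train, epochs=10)
model.evaluate(X_test, y_test)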

313/313 [==============================] - 0s 1ms/step - loss: 0.0813 - accuracy: 0.9779

[0.08133944123983383, 0.9779000282287598]

Epoch 1/10
1875/1875 [==============================] - 3s 2ms/step - loss: 0.2959 - accuracy: 0.9185
Epoch 2/10
1875/1875 [==============================] - 3s 2ms/step - loss: 0.1368 - accuracy: 0.9603
Epoch 3/10
1875/1875 [==============================] - 3s 2ms/step - loss: 0.0995 - accuracy: 0.9703
Epoch 4/10
1875/1875 [==============================] - 3s 2ms/step - loss: 0.0771 - accuracy: 0.9772
Epoch 5/10
1875/1875 [==============================] - 3s 2ms/step - loss: 0.0628 - accuracy: 0.9806
Epoch 6/10
1875/1875 [==============================] - 3s 2ms/step - loss: 0.0519 - accuracy: 0.9841
Epoch 7/10
1875/1875 [==============================] - 3s 2ms/step - loss: 0.0442 - accuracy: 0.9865
Epoch 8/10
1875/1875 [==============================] - 3s 2ms/step - loss: 0.0369 - accuracy: 0.9886
Epoch 9/10
1875/1875 [==============================] - 3s 2ms/step - loss: 0.0300 - accuracy: 0.9910
Epoch 10/10
1875/1875 [==============================] - 3s 2ms/step - loss: 0.0264 - accuracy: 0.9917
Out[59]:
<tensorflow.python.keras.callbacks.History at 0x1fe24629e80>
House Price Prediction
using Neural Networks
Batch Gradient Descent

- What if the dataset has 200 features?

- What if the deep learning network has 5,000 weights?

- Too many computations!

Stochastic Gradient Descent
Mini-Batch Gradient Descent
Implementation of batch gradient descent
and stochastic gradient descent in Python

We will use a very simple home prices dataset to implement batch and
stochastic gradient descent in Python.

 Batch gradient descent uses all training samples in the forward pass to calculate the cumulative error, and
then we adjust the weights using derivatives.

 In stochastic GD, we randomly pick one training sample, perform a forward pass, compute the error,
and immediately adjust the weights.
Data import for batch gradient descent in Python
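A minimal sketch of the import step (the file name here is hypothetical; use whatever the lecture's home-prices CSV is actually called):

import pandas as pd

# Load the home-prices dataset; "homeprices.csv" is an assumed file name.
df = pd.read_csv("homeprices.csv")
print(df.head())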
Data Preprocessing/Scaling:
Since our columns are on different scales, it is important to perform scaling on them.

Next, convert the target column (i.e. price) into a one-dimensional array. It became two-dimensional due to the scaling we did above, so now we change it back to 1D.

 https://towardsdatascience.com/what-and-why-behind-fit-transform-vs-transform-in-scikit-learn-78f915cf96fe

 https://towardsdatascience.com/get-into-shape-14637fe1cd32

 https://www.analyticsvidhya.com/blog/2021/04/difference-between-fit-transform-fit_transform-methods-in-scikit-learn-with-python-code

https://pandas.pydata.org/pandas-docs/version/0.23/

https://pandas.pydata.org/pandas-docs/version/0.23/generated/pandas.DataFrame.transform.html
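A sketch of the scaling and reshaping described above, using scikit-learn's MinMaxScaler (the feature column names are assumptions):

from sklearn.preprocessing import MinMaxScaler

sx = MinMaxScaler()
sy = MinMaxScaler()

# Scale features and target to the 0-1 range so no single column dominates the gradients.
scaled_X = sx.fit_transform(df[['area', 'bedrooms']])           # assumed feature columns
scaled_y = sy.fit_transform(df['price'].values.reshape(-1, 1))  # scaler expects a 2D array

# Scaling returned a 2D column vector; flatten the target back to 1D for training.
scaled_y = scaled_y.reshape(-1)
print(scaled_X.shape, scaled_y.shape)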
Gradient descent allows you to find the weights
(w1, w2, w3) and bias in the following linear equation for
housing price prediction:
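Written out, with x1, x2, x3 standing in for the (scaled) input features, the model is just a weighted sum plus a bias; gradient descent's job is to find the w's and the bias that minimize the prediction error:

def predict_price(x1, x2, x3, w1, w2, w3, bias):
    # Linear model from the slide: weighted sum of the features plus a bias term.
    return w1 * x1 + w2 * x2 + w3 * x3 + bias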
Batch Gradient Descent
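A minimal NumPy sketch of batch gradient descent for this linear model. It is a plausible implementation consistent with the slides, not necessarily the exact lecture code, and it assumes scaled_X and scaled_y from the preprocessing step:

import numpy as np

def batch_gradient_descent(X, y_true, epochs, learning_rate=0.01):
    # Batch GD: every epoch uses ALL samples to compute one cumulative gradient update.
    n_samples, n_features = X.shape
    w = np.ones(n_features)
    b = 0.0
    cost_list, epoch_list = [], []

    for epoch in range(epochs):
        y_pred = X @ w + b                          # forward pass over the whole dataset
        error = y_pred - y_true

        w_grad = (2 / n_samples) * (X.T @ error)    # d(MSE)/dw
        b_grad = (2 / n_samples) * np.sum(error)    # d(MSE)/db

        w -= learning_rate * w_grad                 # adjust weights once per epoch
        b -= learning_rate * b_grad

        cost_list.append(np.mean(error ** 2))       # MSE over all samples
        epoch_list.append(epoch)

    return w, b, cost_list, epoch_list

# Example: w, b, cost, epochs_seen = batch_gradient_descent(scaled_X, scaled_y, epochs=500)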
Stochastic Gradient Descent Implementation
Stochastic GD uses a single, randomly picked training sample to calculate the error, and using
this error we backpropagate to adjust the weights.
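A matching sketch of stochastic gradient descent, again a hedged illustration rather than the lecture's exact code:

import random
import numpy as np

def stochastic_gradient_descent(X, y_true, epochs, learning_rate=0.01):
    # SGD: each step uses ONE randomly picked sample and updates the weights immediately.
    n_samples, n_features = X.shape
    w = np.ones(n_features)
    b = 0.0
    cost_list, epoch_list = [], []

    for epoch in range(epochs):
        i = random.randint(0, n_samples - 1)   # pick one training sample at random
        xi, yi = X[i], y_true[i]

        y_pred = xi @ w + b                    # forward pass for that single sample
        error = y_pred - yi

        w -= learning_rate * 2 * xi * error    # adjust the weights right away
        b -= learning_rate * 2 * error

        cost_list.append(error ** 2)
        epoch_list.append(epoch)

    return w, b, cost_list, epoch_list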
Exercise
Implement mini-batch gradient descent in Python and plot the cost vs. epoch graph.

Mini-batch GD is an intermediate version of batch GD and stochastic GD.

In stochastic GD we used one randomly picked training sample; in mini-batch gradient descent you will use a batch
of samples in each iteration.

For example, if you have 50 training samples in total, you can take a batch of 10 samples, calculate the
cumulative error for those 10 samples, and then adjust the weights.

In SGD we adjust the weights after every single sample.

In batch GD we adjust the weights after going through all samples, but in mini-batch we do so after every m
samples (where m is the batch size, 0 < m < n, and n is the total number of samples). One possible outline of this loop is sketched below.
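As a hedged starting point for the exercise (not a reference solution), the loop could look like this, again assuming scaled_X and scaled_y from earlier:

import numpy as np

def mini_batch_gradient_descent(X, y_true, epochs, batch_size=10, learning_rate=0.01):
    # Mini-batch GD: each update uses a batch of m samples, where 0 < m < n.
    n_samples, n_features = X.shape
    w, b = np.ones(n_features), 0.0
    cost_list, epoch_list = [], []

    for epoch in range(epochs):
        idx = np.random.permutation(n_samples)        # reshuffle the samples every epoch
        X_shuf, y_shuf = X[idx], y_true[idx]

        for start in range(0, n_samples, batch_size):
            Xb = X_shuf[start:start + batch_size]
            yb = y_shuf[start:start + batch_size]

            error = Xb @ w + b - yb
            w -= learning_rate * (2 / len(Xb)) * (Xb.T @ error)   # adjust after every batch
            b -= learning_rate * 2 * np.mean(error)

        cost_list.append(np.mean((X @ w + b - y_true) ** 2))      # record the cost once per epoch
        epoch_list.append(epoch)

    return w, b, cost_list, epoch_list

# plt.plot(epoch_list, cost_list) gives the requested cost vs. epoch graph.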
