"Hello World" of Deep Learning: Disclaimer: This PPT Is Modified Based On Dr. Hung-Yi Lee
"Hello World" of Deep Learning: Disclaimer: This PPT Is Modified Based On Dr. Hung-Yi Lee
of deep learning
Disclaimer:
This PPT is modified based on
Dr. Hung-yi Lee
http://speech.ee.ntu.edu.tw/~tlkagk/courses_ML17.h
tml
Introduction
• If you want to learn Theano:
http://speech.ee.ntu.edu.tw/~tlkagk/courses/MLDS_2015_2/Lecture/Theano%20DNN.ecm.mp4/index.html
http://speech.ee.ntu.edu.tw/~tlkagk/courses/MLDS_2015_2/Lecture/RNN%20training%20(v6).ecm.mp4/index.html
• Keras runs on top of TensorFlow or Theano, which are very flexible but need some effort to learn.
Install TensorFlow
• Mac: open a Terminal window
• $ pip install tensorflow
import keras
print(keras.__version__)
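If the import succeeds and a version number is printed, the installation works. Keras also ships inside TensorFlow as tf.keras; a minimal sketch of the same check through TensorFlow (assuming TensorFlow 2.x):

import tensorflow as tf
print(tf.__version__)        # TensorFlow version
print(tf.keras.__version__)  # version of the bundled Keras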
“Hello world”
• Handwriting Digit Recognition (the MNIST task)
• Input: a 28 x 28 image → Machine → output: the digit, e.g. “1”
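A minimal sketch of a Keras network for this task (the layer sizes and activations are illustrative choices, not fixed by the slide):

from keras.models import Sequential
from keras.layers import Dense

model = Sequential()
# input: the 28 x 28 image, flattened into a 784-dimensional vector
model.add(Dense(units=500, activation='relu', input_dim=28 * 28))
model.add(Dense(units=500, activation='relu'))
# output: 10 units, one per digit 0-9; softmax turns them into probabilities
model.add(Dense(units=10, activation='softmax'))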
Mini-batch
• Pick the 1st batch. Each example in it is fed through the NN (e.g. $x^1 \to y^1$, $x^{31} \to y^{31}$), and the loss of each output against its target $\hat{y}$ is $l^1$, $l^{31}$, .... The batch loss is $L' = l^1 + l^{31} + \cdots$; update the parameters once.
• Pick the 2nd batch (e.g. $x^2$, $x^{16}$, ...), with batch loss $L'' = l^2 + l^{16} + \cdots$; update the parameters once.
• ...
• Until all mini-batches have been picked: that is one epoch.
• Repeat 20 times (i.e. 20 epochs).
• Batch size = 1 is exactly stochastic gradient descent.
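In Keras, this whole training loop is one call. A sketch, assuming x_train has shape (N, 784) and y_train holds one-hot labels of shape (N, 10) (the variable names and optimizer are illustrative):

# configure the loss to minimize and the update rule
model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy'])
# batch_size sets the mini-batch size (batch_size=1 gives stochastic gradient descent);
# epochs=20 sweeps over all mini-batches 20 times
model.fit(x_train, y_train, batch_size=100, epochs=20)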
Speed
• Very large batch size can yield worse performance.
(Network diagram: inputs $x_1, \ldots, x_N$; layer activations $a^1, a^2, \ldots$; outputs $y_1, \ldots, y_M$.)
Forward pass (the backward pass is similar):
$y = f(x) = \sigma\big(W^L \cdots \sigma\big(W^2\, \sigma(W^1 x + b^1) + b^2\big) \cdots + b^L\big)$
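A minimal numpy sketch of this forward pass (the sigmoid stands in for the generic activation $\sigma$; all shapes are assumptions):

import numpy as np

def sigma(z):
    # sigmoid activation, one common choice for the sigma in the formula
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, weights, biases):
    # computes sigma(W^L ... sigma(W^2 sigma(W^1 x + b^1) + b^2) ... + b^L)
    a = x
    for W, b in zip(weights, biases):
        a = sigma(W @ a + b)
    return a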
Speed - Matrix Operation
• Why is mini-batch faster than stochastic gradient descent?
• Stochastic gradient descent: one matrix-vector product per example, $z^1 = W^1 x$ for the first example, then $z^1 = W^1 x$ again for the next, ...
• Mini-batch: stack the examples as the columns of a matrix and compute them together in a single matrix-matrix product: $\begin{bmatrix} z^1 & z^1 \end{bmatrix} = W^1 \begin{bmatrix} x & x \end{bmatrix}$
• Practically, which one is faster? The batched matrix product: hardware (a GPU in particular) parallelizes across the stacked examples.
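A quick way to see the effect on a CPU with numpy (the sizes are made up, and the gap is far larger on a GPU):

import numpy as np
import time

W = np.random.randn(500, 784)   # weight matrix W^1
X = np.random.randn(784, 100)   # 100 examples, one per column

t0 = time.time()
for i in range(X.shape[1]):     # SGD style: one matrix-vector product per example
    z = W @ X[:, i]
t1 = time.time()
Z = W @ X                       # mini-batch style: a single matrix-matrix product
t2 = time.time()
print(f"one by one: {t1 - t0:.5f}s, batched: {t2 - t1:.5f}s")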
Keras
• case 1:
• case 2:
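A hedged sketch of the two usual ways to use a trained Keras model, which is plausibly what the two cases above refer to (that mapping is an assumption): case 1 scores the model on labeled test data, case 2 returns raw predictions.

# case 1 (assumed): evaluate on labeled test data -> [loss, accuracy]
score = model.evaluate(x_test, y_test)
print('Total loss on testing set:', score[0])
print('Accuracy of testing set:', score[1])

# case 2 (assumed): obtain the network outputs for new inputs
result = model.predict(x_test)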
Example 1