Professional Documents
Culture Documents
Improving The Neural Networks
Improving The Neural Networks
Improving The Neural Networks
Neural Networks
Network Weights
Activation Functions Dropout
Initialization
Input
Output
Multiple Hidden
Layers
Deep Neural Network
Hidden Layers
Input
Output
Input
Output
Input
Output
Gradient
becomes close to
zero
If Weights are initialized with big numbers
o11 = 0.89
W111 = 0.5 2.1
1.05
W132 = -0.3 o13 = 0.741