
Batch Normalization

Introduction
• Normalization - bringing numerical data onto a common scale without distorting its shape
• Reason - the neural network processes the data more easily and generalizes better
• Neural networks process data not as individual samples but as batches (see the short sketch below)
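
As a minimal illustration (a NumPy sketch, not taken from the slides; the feature values are made up), standardizing a batch of features to zero mean and unit variance looks like this:

import numpy as np

# A small batch of 4 samples with 3 features on very different scales
X = np.array([[1.0, 200.0, 0.001],
              [2.0, 180.0, 0.002],
              [3.0, 220.0, 0.003],
              [4.0, 210.0, 0.004]])

# Standardize each feature: subtract its mean, divide by its standard deviation
X_norm = (X - X.mean(axis=0)) / X.std(axis=0)

print(X_norm.mean(axis=0))  # approximately 0 for every feature
print(X_norm.std(axis=0))   # approximately 1 for every feature

The shape of each feature's distribution is preserved; only its location and scale change.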
Why Batch Normalization
• Initially, the input X is normalized before entering the neural network
• But as the data passes through the layers, the activations at the later layers are no longer on the same scale
• This happens because applying the activation function to the data at each layer leads to an internal covariate shift in the data (illustrated in the sketch below)
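
A small sketch (illustrative only; the layer sizes, initialization, and ReLU activation are assumptions, not from the slides) showing how the scale of the activations can drift as a normalized batch passes through several layers:

import numpy as np

rng = np.random.default_rng(0)

x = rng.normal(0.0, 1.0, size=(32, 64))    # normalized input batch: mean ~0, std ~1

# Pass the batch through a few randomly initialized layers with ReLU activations
for layer in range(4):
    W = rng.normal(0.0, 0.5, size=(64, 64))
    x = np.maximum(0.0, x @ W)              # linear transform followed by ReLU
    print(f"layer {layer + 1}: mean = {x.mean():.2f}, std = {x.std():.2f}")

# The printed means and standard deviations drift further from 0 and 1 at each
# layer, so the deeper activations are no longer on the input's normalized scale.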
Internal Covariate Shift
• Suppose a model classifies images into two classes: dog or not dog
• Example: the training set contains only white-dog images
• These images will have a certain distribution
• So the model parameters are trained for that distribution
• If we then get non-white dog images, these have a different distribution
• So the model needs to change its parameters accordingly
• Hence the distribution of the hidden activations also needs to change
• This change in the hidden activations is known as internal covariate shift
• Data distribution - the arrangement of the data points within the dataset
• Internal covariate shift - in deep learning, the target each layer is trying to fit keeps changing during training due to the continuous updates of the weights and biases
• Batch normalization helps us stabilize this moving target, making our
task easier.
How Batch Normalization Works
• It works by normalizing the output of a previous activation layer by
subtracting the batch mean and dividing by the batch standard
deviation.
• However, forcing the outputs to zero mean and unit variance may not match the distribution the layer actually needs to represent.
• To tackle this, batch normalization introduces two learnable
parameters, gamma and beta, which can shift and scale the
normalized values.
• Two-step process:
• Step 1 - the input is normalized
• Step 2 - scaling and offsetting are performed
• Step 1
• Normalization of the input data, so that:
• Mean = 0
• SD = 1
• In this step we take the batch of activations coming from layer h and first calculate the mean of these hidden activations
• m is the number of neurons at layer h
• The next step is to calculate the standard deviation of the hidden activations
• Using μ and σ, we can then normalize the hidden activation values (the formulas are written out below)
• ε (epsilon) - a smoothing term that ensures numerical stability by preventing division by zero
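
Written out in LaTeX (using the slides' notation, where h_i are the hidden activations of layer h and m is the number of neurons in that layer), the Step 1 formulas are the standard batch-normalization statistics:

\mu = \frac{1}{m} \sum_{i=1}^{m} h_i

\sigma = \sqrt{\frac{1}{m} \sum_{i=1}^{m} (h_i - \mu)^2}

\hat{h}_i = \frac{h_i - \mu}{\sqrt{\sigma^2 + \epsilon}}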
Rescaling and Offsetting (Step 2)
• Two components, γ (gamma) and β (beta), are used
• These are learnable parameters that let the network scale and shift the normalized values appropriately for each batch (see the equation and sketch below)
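
In equation form (continuing the notation above), Step 2 applies the learnable scale and shift to each normalized activation:

y_i = \gamma \hat{h}_i + \beta

Putting both steps together, here is a minimal NumPy sketch of the transform described in these slides (not a reference implementation; it computes the statistics per hidden unit over the batch, which is the usual convention, and initializes gamma to 1 and beta to 0 as is typically done before training):

import numpy as np

def batch_norm(h, gamma, beta, eps=1e-5):
    """Two-step batch normalization of a batch of hidden activations h."""
    # Step 1: normalize to zero mean and unit variance
    mu = h.mean(axis=0)
    var = h.var(axis=0)
    h_hat = (h - mu) / np.sqrt(var + eps)   # eps prevents division by zero
    # Step 2: rescale and offset with the learnable parameters
    return gamma * h_hat + beta

# Example: a batch of 8 samples with 4 hidden units, not on a common scale
h = np.random.randn(8, 4) * 5.0 + 3.0
gamma = np.ones(4)    # learnable scale, initialized to 1
beta = np.zeros(4)    # learnable shift, initialized to 0

out = batch_norm(h, gamma, beta)
print(out.mean(axis=0))  # ~0 per unit
print(out.std(axis=0))   # ~1 per unit (since gamma = 1 and beta = 0)

During training, gamma and beta are updated by backpropagation along with the other weights, so the network can recover whatever scale and offset works best for each layer.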
Benefits
• Speeds up learning: By reducing internal covariate shift, it helps the
model train faster.
• Regularizes the model: It adds a little noise to your model, and in some
cases, you might not even need to use dropout or other regularization
techniques.
• Allows higher learning rates: Gradient descent usually requires small
learning rates for the network to converge. Batch normalization helps us
use much larger learning rates, speeding up the training process.
• Overall, training becomes faster and more stable (a short usage sketch follows)
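
As a practical illustration (this uses PyTorch, which the slides do not mention; the layer sizes and learning rate are arbitrary), batch normalization is typically inserted between a linear layer and its activation:

import torch
import torch.nn as nn

# A small classifier with batch normalization after the hidden linear layer
model = nn.Sequential(
    nn.Linear(784, 256),
    nn.BatchNorm1d(256),   # normalizes the 256 hidden activations over the batch
    nn.ReLU(),
    nn.Linear(256, 10),
)

# Thanks to batch normalization, a relatively large learning rate is often usable
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)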
