
Regularization of Neural Networks

Dinesh K. Vishwakarma, Ph.D.


PROFESSOR, DEPARTMENT OF INFORMATION TECHNOLOGY

DELHI TECHNOLOGICAL UNIVERSITY, DELHI.


Webpage: http://www.dtu.ac.in/Web/Departments/InformationTechnology/faculty/dkvishwakarma.php
Regularization
▪ Regularization is done to improve the generalization performance of a neural network (NN).
▪ An NN may perform incredibly well on the training set, but not nearly as well on the test set.
▪ Such an NN has very high variance and cannot generalize well to data it has not been trained on.
▪ These are the signs of overfitting.
Solution of Overfitting
▪ Get more data
▪ Use regularization
✔ Getting more data is sometimes impossible, and other times very expensive.
✔ Therefore, regularization is a common method to reduce overfitting and consequently improve the model’s performance.



Solution of Overfitting…
▪ The two most common regularization approaches for NNs are:
▪ L2 regularization
▪ Dropout



L2 regularization
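A standard form of the L2-regularized cost (reconstructed here as an assumption; it is consistent with the lambda and Frobenius-norm description on the next slide):

J_{regularized} = J + \frac{\lambda}{2m} \sum_{l=1}^{L} \lVert W^{[l]} \rVert_F^{2},
\qquad \lVert W^{[l]} \rVert_F^{2} = \sum_{i} \sum_{j} \left( w_{ij}^{[l]} \right)^{2}

where J is the unregularized cost, m is the number of training examples, L is the number of layers, and lambda is the regularization parameter.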



L2 regularization…
▪ Lambda is the regularization parameter. The added term is the squared Frobenius norm of the weight matrices, denoted by the subscript F.
▪ Lambda is a hyperparameter that can be tuned:
✔ Larger weight values are penalized more heavily when the value of lambda is large.
✔ Similarly, for a smaller value of lambda, the regularization effect is smaller.
▪ This makes sense, because the cost function must be minimized: by adding the squared norm of the weight matrices multiplied by the regularization parameter, large weights are driven down in order to minimize the cost function (see the code sketch after this list).
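A minimal NumPy sketch of this idea, assuming a base cost value `cross_entropy` and a list of per-layer weight matrices `weights` (both hypothetical names, not from the slides):

import numpy as np

def l2_regularized_cost(cross_entropy, weights, lam, m):
    """Add the L2 (squared Frobenius norm) penalty to a base cost.

    cross_entropy : unregularized cost on the current batch
    weights       : list of weight matrices W[l], one per layer
    lam           : regularization parameter lambda
    m             : number of training examples
    """
    # Sum of squared Frobenius norms of all weight matrices
    l2_penalty = sum(np.sum(np.square(W)) for W in weights)
    return cross_entropy + (lam / (2 * m)) * l2_penalty

With a larger `lam` the penalty term dominates, so gradient descent drives the weights toward smaller values; with `lam = 0` the cost reduces to the unregularized case.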
How Regularization Works?



Dropout Regularization
▪ Dropout involves going over all the layers in a neural network and setting, for each node, a probability of keeping it or not.
▪ The input layer and the output layer are kept the same.
▪ Which nodes are kept is decided at random; only a threshold is fixed: a value that determines whether a node is kept or not.
▪ For example, if you set the threshold to 0.8, each node has a 20% probability of being removed from the network.
▪ Therefore, this results in a much smaller and simpler neural network (see the sketch after this list).
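A minimal sketch of the inverted-dropout variant applied to one hidden layer's activations during training, assuming NumPy; `a` and `keep_prob` are hypothetical names, not from the slides:

import numpy as np

def dropout_forward(a, keep_prob=0.8):
    """Apply inverted dropout to a layer's activations during training.

    a         : activations of the hidden layer, shape (units, batch_size)
    keep_prob : probability of keeping each node (0.8 -> 20% dropped)
    """
    # Random binary mask: 1 with probability keep_prob, 0 otherwise
    mask = (np.random.rand(*a.shape) < keep_prob).astype(a.dtype)
    # Zero out dropped nodes and rescale the survivors so the
    # expected magnitude of the activations stays the same
    return (a * mask) / keep_prob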
Dropout Regularization…

▪ Dropout means that the NN cannot rely on any one input node, since each has a random probability of being removed. Therefore, the NN will be reluctant to give high weights to certain features, because they might disappear.
▪ Consequently, the weights are spread across all features, making them smaller. This effectively shrinks the model and regularizes it.
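At test time dropout is typically switched off; with the inverted-dropout sketch above, no extra scaling is needed at inference because the division by `keep_prob` during training already preserves the expected activations. A brief usage sketch, reusing the hypothetical `dropout_forward` from above:

# Training: apply the random mask (this is what regularizes the network)
a_train = dropout_forward(a, keep_prob=0.8)

# Inference: use all nodes; no mask and no rescaling are applied
a_test = a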



▪ https://towardsdatascience.com/how-to-improve-a-neural-network-with-regularization-8a18ecda9fe3



When to use Deep Learning?
▪ Data size is large
▪ High-end infrastructure
▪ Lack of domain understanding
▪ Complex problems such as image classification, speech recognition, etc.

[Figure: performance vs. amount of data, comparing deep learning with traditional machine learning. "Fuel of deep learning is the big data" (Andrew Ng)]
Limitations of Deep Learning
▪ Very slow to train
▪ Models are very complex, with a lot of parameters to optimize:
✔ Initialization of weights
✔ Layer-wise training algorithm
✔ Neural architecture
• Number of layers
• Size of layers
• Type: regular, pooling, max pooling, softmax
✔ Fine-tuning of weights using backpropagation



Thank you!
dinesh@dtu.ac.in


