Backpropagation, ReLU and Gradient Descent
Gradient descent is an optimization algorithm used when training a machine learning
model. It assumes a convex (or locally convex) loss function and tweaks the model's
parameters iteratively to drive that function down to its local minimum.
For gradient descent to reach the local minimum, we must set the learning rate to an
appropriate value, neither too low nor too high. This matters because if the steps it takes
are too big, it may never reach the local minimum: it overshoots and bounces back and
forth across the valley of the convex function. If we set the learning rate to a very small
value, gradient descent will eventually reach the local minimum, but it may take a long time.
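The effect of the learning rate described above can be sketched with a few lines of Python. This is a minimal illustration on the convex function f(x) = x², whose derivative is 2x; the starting point, step count, and learning-rate values are illustrative assumptions, not values from the text.

```python
def gradient_descent(lr, x0=5.0, steps=50):
    """Run `steps` updates x <- x - lr * f'(x) for f(x) = x**2 and return the final x."""
    x = x0
    for _ in range(steps):
        x = x - lr * 2 * x  # f'(x) = 2x
    return x

# An appropriate learning rate converges close to the minimum at x = 0.
print(gradient_descent(lr=0.1))
# A very small learning rate also heads toward the minimum, but much more slowly:
# after 50 steps it is still far from 0.
print(gradient_descent(lr=0.001))
# Too large a step overshoots the minimum and bounces back and forth with
# growing magnitude, so the iterates diverge.
print(gradient_descent(lr=1.1))
```

Each update multiplies x by (1 - 2·lr), so convergence requires that factor to have magnitude below 1; lr = 1.1 gives a factor of -1.2, which is exactly the back-and-forth divergence the text describes.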
ReLU is a non-linear activation function used in multi-layer and deep neural
networks. Traditionally, non-linear activation functions such as the sigmoid (logistic)
function and the hyperbolic tangent were used in neural networks to compute the activation
value of each neuron. More recently, the ReLU function has been used instead to
calculate the activation values in both traditional and deep neural network architectures.
The reasons for replacing sigmoid and hyperbolic tangent with ReLU include:
1. Computational savings - the ReLU function accelerates the training of deep
neural networks compared to traditional activation functions, since the derivative of ReLU
is simply 1 for any positive input. Because this derivative is a constant, deep neural
networks do not need additional time to compute the activation gradients in the error
terms during the training phase.
2. Solving the vanishing gradient problem - the ReLU function does not trigger the
vanishing gradient problem as the number of layers grows. This is because the
function does not saturate: its output is unbounded above, and its gradient does not
shrink toward zero for positive inputs. Thus, the earliest layer (the first hidden layer)
can still receive meaningful error signals from the last layers and adjust all the weights
between layers. By contrast, a traditional activation function like sigmoid is
bounded between 0 and 1, and its gradient is at most 0.25, so the backpropagated errors
become vanishingly small by the time they reach the first hidden layer. This
scenario leads to a poorly trained neural network.
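The two points above can be made concrete with a short sketch. Backpropagation multiplies one activation derivative per layer (the chain rule), so the size of those derivatives determines how much error signal reaches the first hidden layer. The 10-layer depth and the input values below are illustrative assumptions.

```python
import math

def relu_grad(x):
    """Derivative of ReLU: 1 for positive input, 0 otherwise."""
    return 1.0 if x > 0 else 0.0

def sigmoid_grad(x):
    """Derivative of sigmoid s(x) = 1/(1+e^-x), namely s(x)*(1-s(x)); at most 0.25."""
    s = 1.0 / (1.0 + math.exp(-x))
    return s * (1.0 - s)

layers = 10
sigmoid_signal = 1.0
relu_signal = 1.0
for _ in range(layers):
    # Even at its maximum (x = 0), the sigmoid derivative is only 0.25,
    # so the product shrinks by at least 4x per layer.
    sigmoid_signal *= sigmoid_grad(0.0)
    # The ReLU derivative is exactly 1 for any positive input,
    # so the error passes through undiminished.
    relu_signal *= relu_grad(1.0)

print(sigmoid_signal)  # 0.25**10: the error reaching the first layer has vanished
print(relu_signal)     # 1.0: the error arrives intact
```

This is the best case for sigmoid; with typical pre-activations away from 0, its derivative is even smaller and the signal vanishes faster.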
-Nishita Verma
BTBM/18/120
Section D
Sem 5