Gated Recurrent Units (GRUs) are a type of recurrent neural network (RNN) architecture similar to Long Short-Term Memory (LSTM) networks. GRUs were introduced as a simpler alternative to LSTMs while achieving comparable performance on sequential tasks. Here's an explanation of how GRUs work:

1. **Gating Mechanisms**: Like LSTMs, GRUs use gating mechanisms to control the flow of information within the network. However, GRUs incorporate only two gates: an update gate (\(z_t\)) and a reset gate (\(r_t\)).

2. **Reset Gate**: The reset gate (\(r_t\)) determines how much of the past information should be forgotten. It is computed from the current input (\(x_t\)) and the previous hidden state (\(h_{t-1}\)) using a sigmoid activation function. The reset gate decides which parts of the past hidden state should be considered when computing the candidate activation.

3. **Update Gate**: The update gate (\(z_t\)) decides how much of the new candidate activation should flow into the updated hidden state. It is computed in the same way as the reset gate (with its own weights) and controls the trade-off between the new candidate activation (\(\tilde{h}_t\)) and the previous hidden state (\(h_{t-1}\)).

4. **Candidate Activation**: The candidate activation (\(\tilde{h}_t\)) is a proposed update to the hidden state at the current timestep. It is computed using the current input (\(x_t\)) and a reset-gate-modulated version of the previous hidden state (\(h_{t-1}\)). This candidate activation is then combined with the previous hidden state to produce the updated hidden state (\(h_t\)).

5. **Mathematical Formulation**: The computations in a GRU cell can be summarized as follows (see the code sketch after this list):

- Reset Gate: \(r_t = \sigma(W_r \cdot [h_{t-1}, x_t] + b_r)\)
- Update Gate: \(z_t = \sigma(W_z \cdot [h_{t-1}, x_t] + b_z)\)
- Candidate Activation: \(\tilde{h}_t = \tanh(W_h \cdot [r_t \odot h_{t-1}, x_t] + b_h)\)
- Updated Hidden State: \(h_t = (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t\)

Here \(W_r\), \(W_z\), and \(W_h\) are weight matrices, \(b_r\), \(b_z\), and \(b_h\) are bias vectors, \(\sigma\) represents the sigmoid function, \([\cdot,\cdot]\) denotes concatenation, and \(\odot\) denotes element-wise multiplication.

6. **Training**: GRUs are trained using gradient-based optimization algorithms such as stochastic
gradient descent (SGD) or Adam. The parameters of the GRU cells, including the weights and biases,
are updated iteratively to minimize a loss function that measures the discrepancy between the
predicted output and the ground truth.
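
To make the formulation concrete, here is a minimal NumPy sketch of a single GRU step that follows the equations in point 5 directly. The function name `gru_cell`, the parameter shapes, and the random initialization are illustrative assumptions, not the API of any particular library.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_cell(x_t, h_prev, W_r, W_z, W_h, b_r, b_z, b_h):
    """One GRU step, following the equations in point 5.

    x_t    : current input, shape (input_size,)
    h_prev : previous hidden state h_{t-1}, shape (hidden_size,)
    W_*    : weight matrices, shape (hidden_size, hidden_size + input_size)
    b_*    : bias vectors, shape (hidden_size,)
    """
    concat = np.concatenate([h_prev, x_t])               # [h_{t-1}, x_t]
    r_t = sigmoid(W_r @ concat + b_r)                    # reset gate
    z_t = sigmoid(W_z @ concat + b_z)                    # update gate
    concat_reset = np.concatenate([r_t * h_prev, x_t])   # [r_t ⊙ h_{t-1}, x_t]
    h_tilde = np.tanh(W_h @ concat_reset + b_h)          # candidate activation
    h_t = (1.0 - z_t) * h_prev + z_t * h_tilde           # blend old state and candidate
    return h_t

# Illustrative usage with randomly initialized parameters.
input_size, hidden_size = 4, 3
rng = np.random.default_rng(0)
W_r = 0.1 * rng.standard_normal((hidden_size, hidden_size + input_size))
W_z = 0.1 * rng.standard_normal((hidden_size, hidden_size + input_size))
W_h = 0.1 * rng.standard_normal((hidden_size, hidden_size + input_size))
b_r = b_z = b_h = np.zeros(hidden_size)

h_t = gru_cell(rng.standard_normal(input_size), np.zeros(hidden_size),
               W_r, W_z, W_h, b_r, b_z, b_h)
print(h_t.shape)  # (3,)
```

Processing a full sequence simply means calling `gru_cell` once per timestep, feeding each returned \(h_t\) back in as `h_prev`.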

GRUs have fewer parameters compared to LSTMs due to their simpler architecture, which makes
them faster to train and more computationally efficient. They have been successfully applied in
various sequential tasks, including natural language processing, speech recognition, and time series
prediction. However, the choice between using GRUs or LSTMs often depends on the specific
requirements of the task at hand and empirical performance comparisons.
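
As a rough illustration of the parameter difference, the sketch below compares `torch.nn.GRU` and `torch.nn.LSTM` layers of the same size and runs a single Adam update on dummy data, as described in point 6. It assumes PyTorch is installed; the layer sizes, dummy tensors, and mean-squared-error loss are arbitrary choices made for the example.

```python
import torch
import torch.nn as nn

input_size, hidden_size = 64, 128
gru = nn.GRU(input_size, hidden_size, batch_first=True)
lstm = nn.LSTM(input_size, hidden_size, batch_first=True)

def n_params(module):
    return sum(p.numel() for p in module.parameters())

print("GRU parameters: ", n_params(gru))   # three gate/candidate weight blocks
print("LSTM parameters:", n_params(lstm))  # four weight blocks, so more parameters

# One gradient-based update with Adam on dummy data.
x = torch.randn(8, 20, input_size)          # (batch, time, features)
target = torch.randn(8, 20, hidden_size)    # dummy regression target
optimizer = torch.optim.Adam(gru.parameters(), lr=1e-3)

output, _ = gru(x)                          # output: (batch, time, hidden_size)
loss = nn.functional.mse_loss(output, target)
optimizer.zero_grad()
loss.backward()
optimizer.step()
print("loss:", loss.item())
```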
