
Recurrent neural networks
AZMI HAIDER
MUHAMMAD SALAMAH
RNN: Process sequences
One to one: “vanilla” neural network

• Inputs are unrelated to one another
• Inputs/outputs are of fixed size
RNN: Process sequences
One to many: image captioning
RNN: Process sequences
Many to one: sentiment classification
RNN: Process sequences
Many to many: machine translation
RNN: Process sequences
Many to many: video classification on frame level
Recurrent neural network
RNN has a hidden state updated with each input x:
1. Receive input x
2. Update the hidden state
3. Produce output y
Recurrent neural network
The new state is a function of the old state and the current input:
(Vanilla) Recurrent neural network
The hidden state is a single vector h:
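A minimal NumPy sketch of that vanilla step (not the authors' code), with weight names following the W_xh / W_hh / W_hy convention used later in the deck:

```python
import numpy as np

def rnn_step(x, h_prev, W_xh, W_hh, W_hy, b_h, b_y):
    """One vanilla-RNN time step: new state from old state and input."""
    h = np.tanh(W_xh @ x + W_hh @ h_prev + b_h)  # h_t = tanh(W_xh x_t + W_hh h_{t-1} + b)
    y = W_hy @ h + b_y                           # y_t = W_hy h_t
    return h, y
```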
RNN: computational graph
Another way to look at an RNN is as a computational graph unrolled over time:
RNN: computational graph
Many to many:
RNN: computational graph
Many to one:
RNN: computational graph
One to many:
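Across these layouts the underlying loop is the same; here is a minimal sketch of the unrolled graph, reusing the vanilla step from the earlier sketch (shapes and names are assumptions, not from the slides):

```python
import numpy as np

def rnn_forward(xs, h0, W_xh, W_hh, W_hy, b_h, b_y):
    """Unroll the recurrence over a whole sequence (many-to-many layout)."""
    h, hs, ys = h0, [], []
    for x in xs:                      # one graph node per time step, shared weights
        h = np.tanh(W_xh @ x + W_hh @ h + b_h)
        hs.append(h)
        ys.append(W_hy @ h + b_y)
    # many-to-one: keep only ys[-1]; one-to-many: feed x only at the first step
    return hs, ys
```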
Example:
Character-level language model:

Vocabulary = [h, e, l, o]

Example:
Training the sequence “hello”
Example:
Each input is a character from the training sequence
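One way to encode this (the details are an assumption, not taken from the slides) is to turn each character into a one-hot vector over the four-character vocabulary and use the next character of “hello” as the training target:

```python
import numpy as np

vocab = ['h', 'e', 'l', 'o']
char_to_ix = {ch: i for i, ch in enumerate(vocab)}

def one_hot(ch):
    v = np.zeros(len(vocab))
    v[char_to_ix[ch]] = 1.0
    return v

text = "hello"
inputs  = [one_hot(ch) for ch in text[:-1]]        # h, e, l, l
targets = [char_to_ix[ch] for ch in text[1:]]      # e, l, l, o (indices)
```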
Example: test time
At test time, the output at each time step is fed back as the input to the next step.
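A minimal sketch of that test-time loop, reusing the hypothetical rnn_step, one_hot and vocab helpers from the earlier sketches:

```python
import numpy as np

def sample(seed_char, h, params, n_chars):
    """Generate characters by feeding each sampled output back in as the next input."""
    W_xh, W_hh, W_hy, b_h, b_y = params
    x, out = one_hot(seed_char), [seed_char]
    for _ in range(n_chars):
        h, y = rnn_step(x, h, W_xh, W_hh, W_hy, b_h, b_y)
        p = np.exp(y) / np.sum(np.exp(y))        # softmax over the 4-char vocabulary
        ix = np.random.choice(len(vocab), p=p)   # sample the next character
        x = one_hot(vocab[ix])
        out.append(vocab[ix])
    return ''.join(out)
```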
Image captioning
An example from the computer vision world:
Image captioning at test time
We’ve seen Wxh, Whh before… v is the output of the CNN:
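A sketch of the modified recurrence the slide is pointing at: the CNN feature v enters the hidden-state update through an extra weight matrix (named W_ih here purely for illustration):

```python
import numpy as np

def caption_rnn_step(x, h_prev, v, W_xh, W_hh, W_ih, W_hy, b_h, b_y):
    """Vanilla step conditioned on the CNN image feature v."""
    h = np.tanh(W_xh @ x + W_hh @ h_prev + W_ih @ v + b_h)
    y = W_hy @ h + b_y        # scores over the word vocabulary
    return h, y
```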
Image captioning: in action
Image captioning: failure
Image captioning with attention
Focus on different parts of the image:
The CNN outputs L feature vectors, one for each spatial location in the image,
instead of a single vector for the entire image.
Image captioning with attention
In addition to the distribution over the vocabulary, the RNN now produces an output
that indicates where to give more ATTENTION in the image (that is, which of the
L locations to weight most heavily).
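A sketch of soft attention over the L locations; the dot-product scoring function below is an assumption, since the slides do not spell out how the attention weights are computed:

```python
import numpy as np

def soft_attention(h, features):
    """features: (L, D) array, one D-dim CNN vector per spatial location."""
    scores = features @ h                          # (L,) one score per location
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                       # softmax: where to attend
    context = weights @ features                   # (D,) weighted sum over locations
    return context, weights
```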
Image captioning with attention
You can see the attention weights over the image locations change at each step.
Another use of RNNs with attention
Visual question answering (http://www.visualqa.org):
Vanilla RNN Gradient Flow
Backpropagating from h_t to h_{t-1} multiplies the gradient by W_hh (transposed), so flowing back through many time steps means repeated multiplication by the same matrix: the gradient tends to explode when the largest singular value of W_hh is greater than 1 and to vanish when it is less than 1.
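A toy illustration (not from the slides) of that behaviour: after T steps the gradient norm grows or shrinks roughly like the largest singular value of W_hh raised to the power T:

```python
import numpy as np

rng = np.random.default_rng(0)
H, T = 64, 50
grad = rng.normal(size=H)

for scale in (0.5, 1.5):                       # largest singular value below / above 1
    W_hh = scale * np.linalg.qr(rng.normal(size=(H, H)))[0]  # scaled orthogonal matrix
    g = grad.copy()
    for _ in range(T):
        g = W_hh.T @ g                         # ignoring the tanh factor for clarity
    print(scale, np.linalg.norm(g))            # norm ~ scale**50: vanishes vs explodes
```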
Long Short-Term Memory (LSTM)
• Forget gate layer
• Input gate layer
• The cell state update
• Output gate layer
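A sketch of one LSTM step wiring those four pieces together, in the standard formulation (variable names here are illustrative, not taken from the slides):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h_prev, c_prev, W, b):
    """W maps [h_prev; x] to the stacked gate pre-activations (4H rows)."""
    z = W @ np.concatenate([h_prev, x]) + b
    H = h_prev.shape[0]
    f = sigmoid(z[0:H])          # forget gate: what to erase from the cell
    i = sigmoid(z[H:2*H])        # input gate: what to write
    g = np.tanh(z[2*H:3*H])      # candidate cell values
    o = sigmoid(z[3*H:4*H])      # output gate: what to reveal
    c = f * c_prev + i * g       # additive cell-state update
    h = o * np.tanh(c)
    return h, c
```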
Long Short-Term Memory (LSTM)
Long Short-Term Memory (LSTM): Gradient Flow
The additive update of the cell state gives the gradient an uninterrupted path backwards through time, which is what improves gradient flow compared to the vanilla RNN.
Summary
• RNNs allow a lot of flexibility in architecture design
• Vanilla RNNs are simple but don’t work very well
• Common to use LSTM or GRU: their additive interactions improve gradient flow
• Backward flow of gradients in an RNN can explode or vanish. Exploding is controlled with gradient clipping; vanishing is controlled with additive interactions (LSTM)
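A minimal sketch of the gradient clipping mentioned above: rescale the gradient whenever its global norm exceeds a chosen threshold (the threshold value here is arbitrary):

```python
import numpy as np

def clip_gradients(grads, max_norm=5.0):
    """Rescale a list of gradient arrays so their global norm is at most max_norm."""
    total_norm = np.sqrt(sum(np.sum(g ** 2) for g in grads))
    if total_norm > max_norm:
        grads = [g * (max_norm / total_norm) for g in grads]
    return grads
```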
