Additive - Attention Example

Additive attention, introduced by Bahdanau et al. in 2015, is a mechanism used in sequence-to-sequence models to improve the alignment between input and output sequences. It computes attention scores by first applying a feedforward neural network to combine the decoder's previous hidden state with each encoder hidden state. The combined vector is then passed through a non-linear activation function (typically tanh), followed by a linear layer to produce a scalar score for each encoder hidden sta

Uploaded by

l228296

Additive Attention Mechanism Calculation

Encoder Embedding Vectors

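$h_{\text{turn}} = [0.1,\ 0.2,\ 0.3,\ 0.4]$

$h_{\text{off}} = [0.5,\ 0.6,\ 0.7,\ 0.8]$

$h_{\text{the}} = [0.9,\ 1.0,\ 1.1,\ 1.2]$

$h_{\text{light}} = [1.3,\ 1.4,\ 1.5,\ 1.6]$

Decoder Hidden State

$h_{\text{decoder}_1} = [0.0,\ 0.4,\ 1.0,\ 0.3]$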

Step 1: Concatenate Decoder Hidden State with Each Encoder Hidden State
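For $h_{\text{turn}}$:
$\text{concat}(h_{\text{decoder}_1}, h_{\text{turn}}) = [0.0,\ 0.4,\ 1.0,\ 0.3,\ 0.1,\ 0.2,\ 0.3,\ 0.4]$
For $h_{\text{off}}$:
$\text{concat}(h_{\text{decoder}_1}, h_{\text{off}}) = [0.0,\ 0.4,\ 1.0,\ 0.3,\ 0.5,\ 0.6,\ 0.7,\ 0.8]$
For $h_{\text{the}}$:
$\text{concat}(h_{\text{decoder}_1}, h_{\text{the}}) = [0.0,\ 0.4,\ 1.0,\ 0.3,\ 0.9,\ 1.0,\ 1.1,\ 1.2]$
For $h_{\text{light}}$:
$\text{concat}(h_{\text{decoder}_1}, h_{\text{light}}) = [0.0,\ 0.4,\ 1.0,\ 0.3,\ 1.3,\ 1.4,\ 1.5,\ 1.6]$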
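
The concatenation step can be sketched in a few lines of NumPy. This is only an illustration of the arithmetic above; the variable names (`encoder_states`, `h_decoder1`, `concatenated`) are chosen here and do not come from the original document.

```python
import numpy as np

# Encoder hidden states and decoder hidden state copied from the example.
encoder_states = {
    "turn": np.array([0.1, 0.2, 0.3, 0.4]),
    "off": np.array([0.5, 0.6, 0.7, 0.8]),
    "the": np.array([0.9, 1.0, 1.1, 1.2]),
    "light": np.array([1.3, 1.4, 1.5, 1.6]),
}
h_decoder1 = np.array([0.0, 0.4, 1.0, 0.3])

# Place the decoder state in front of each encoder state (8 values per word).
concatenated = {
    word: np.concatenate([h_decoder1, h_enc])
    for word, h_enc in encoder_states.items()
}

for word, vec in concatenated.items():
    print(word, vec)
```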

Step 2: Apply Weight Matrix W

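Assume $W$ is a weight matrix of appropriate dimensions. For demonstration purposes, let $W$ be the $8 \times 8$ identity matrix, so each product below simply returns the concatenated vector unchanged.
For $h_{\text{turn}}$:

$W \cdot \text{concat}(h_{\text{decoder}_1}, h_{\text{turn}}) = [0.0,\ 0.4,\ 1.0,\ 0.3,\ 0.1,\ 0.2,\ 0.3,\ 0.4]$

For $h_{\text{off}}$:

$W \cdot \text{concat}(h_{\text{decoder}_1}, h_{\text{off}}) = [0.0,\ 0.4,\ 1.0,\ 0.3,\ 0.5,\ 0.6,\ 0.7,\ 0.8]$

For $h_{\text{the}}$:

$W \cdot \text{concat}(h_{\text{decoder}_1}, h_{\text{the}}) = [0.0,\ 0.4,\ 1.0,\ 0.3,\ 0.9,\ 1.0,\ 1.1,\ 1.2]$

For $h_{\text{light}}$:

$W \cdot \text{concat}(h_{\text{decoder}_1}, h_{\text{light}}) = [0.0,\ 0.4,\ 1.0,\ 0.3,\ 1.3,\ 1.4,\ 1.5,\ 1.6]$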
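
As a quick check of this step, the sketch below applies an identity matrix to one concatenated vector. The names `x`, `W`, and `projected` are illustrative. In a trained model `W` would be a learned matrix and generally not the identity; the identity is used only because the example assumes it.

```python
import numpy as np

h_decoder1 = np.array([0.0, 0.4, 1.0, 0.3])
h_turn = np.array([0.1, 0.2, 0.3, 0.4])

# Concatenated input for the word "turn" (from Step 1).
x = np.concatenate([h_decoder1, h_turn])

# Identity weight matrix, as assumed in the example (not a learned matrix).
W = np.eye(8)

# Matrix-vector product; with the identity this returns x unchanged.
projected = W @ x
print(projected)
```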

Step 3: Apply v Vector and tanh Activation

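Assume $v$ is a vector of size 8. For simplicity, let $v$ be a vector of ones: $v = [1,\ 1,\ 1,\ 1,\ 1,\ 1,\ 1,\ 1]$. Because $v$ is all ones, each score reduces to the sum of the element-wise tanh values.
For $h_{\text{turn}}$:

$$\text{score}(h_{\text{decoder}_1}, h_{\text{turn}}) = v \cdot \tanh\big(W \cdot \text{concat}(h_{\text{decoder}_1}, h_{\text{turn}})\big)$$

$$= \tanh(0.0) + \tanh(0.4) + \tanh(1.0) + \tanh(0.3) + \tanh(0.1) + \tanh(0.2) + \tanh(0.3) + \tanh(0.4)$$

$$\approx 0.0 + 0.3799 + 0.7616 + 0.2913 + 0.0997 + 0.1974 + 0.2913 + 0.3799 = 2.4011$$

For $h_{\text{off}}$:

$$\text{score}(h_{\text{decoder}_1}, h_{\text{off}}) = v \cdot \tanh\big(W \cdot \text{concat}(h_{\text{decoder}_1}, h_{\text{off}})\big)$$

$$= \tanh(0.0) + \tanh(0.4) + \tanh(1.0) + \tanh(0.3) + \tanh(0.5) + \tanh(0.6) + \tanh(0.7) + \tanh(0.8)$$

$$\approx 0.0 + 0.3799 + 0.7616 + 0.2913 + 0.4621 + 0.5370 + 0.6044 + 0.6640 = 3.7003$$

For $h_{\text{the}}$:

$$\text{score}(h_{\text{decoder}_1}, h_{\text{the}}) = v \cdot \tanh\big(W \cdot \text{concat}(h_{\text{decoder}_1}, h_{\text{the}})\big)$$

$$= \tanh(0.0) + \tanh(0.4) + \tanh(1.0) + \tanh(0.3) + \tanh(0.9) + \tanh(1.0) + \tanh(1.1) + \tanh(1.2)$$

$$\approx 0.0 + 0.3799 + 0.7616 + 0.2913 + 0.7163 + 0.7616 + 0.8005 + 0.8337 = 4.5449$$

For $h_{\text{light}}$:

$$\text{score}(h_{\text{decoder}_1}, h_{\text{light}}) = v \cdot \tanh\big(W \cdot \text{concat}(h_{\text{decoder}_1}, h_{\text{light}})\big)$$

$$= \tanh(0.0) + \tanh(0.4) + \tanh(1.0) + \tanh(0.3) + \tanh(1.3) + \tanh(1.4) + \tanh(1.5) + \tanh(1.6)$$

$$\approx 0.0 + 0.3799 + 0.7616 + 0.2913 + 0.8617 + 0.8854 + 0.9051 + 0.9216 = 5.0066$$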
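
The four scores can be reproduced with a short NumPy sketch. The helper name `additive_score` is hypothetical and introduced only for this illustration; `W` and `v` are the identity matrix and ones vector assumed in the example. Because the code does not round the individual tanh values, its results may differ from the hand-computed sums in the fourth decimal place.

```python
import numpy as np

def additive_score(h_dec, h_enc, W, v):
    """Return v . tanh(W @ concat(h_dec, h_enc)) as a Python float."""
    return float(v @ np.tanh(W @ np.concatenate([h_dec, h_enc])))

encoder_states = {
    "turn": np.array([0.1, 0.2, 0.3, 0.4]),
    "off": np.array([0.5, 0.6, 0.7, 0.8]),
    "the": np.array([0.9, 1.0, 1.1, 1.2]),
    "light": np.array([1.3, 1.4, 1.5, 1.6]),
}
h_decoder1 = np.array([0.0, 0.4, 1.0, 0.3])

W = np.eye(8)   # identity weight matrix assumed in Step 2
v = np.ones(8)  # vector of ones assumed in Step 3

# One scalar score per encoder hidden state.
scores = {
    word: additive_score(h_decoder1, h_enc, W, v)
    for word, h_enc in encoder_states.items()
}

for word, s in scores.items():
    print(f"{word}: {s:.4f}")
```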

Step 4: Apply Softmax to Scores

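$$\text{softmax}(\text{score}_i) = \frac{\exp(\text{score}_i)}{\sum_j \exp(\text{score}_j)}$$

Calculate the exponential values:

$$\exp(2.4011) \approx 11.0353, \quad \exp(3.7003) \approx 40.4594, \quad \exp(4.5449) \approx 94.1510, \quad \exp(5.0066) \approx 149.3959$$

Sum of exponentials:

$$11.0353 + 40.4594 + 94.1510 + 149.3959 = 295.0416$$

Calculate the softmax values:

$$\alpha_{\text{turn}} = \frac{11.0353}{295.0416} \approx 0.0374$$

$$\alpha_{\text{off}} = \frac{40.4594}{295.0416} \approx 0.1371$$

$$\alpha_{\text{the}} = \frac{94.1510}{295.0416} \approx 0.3191$$

$$\alpha_{\text{light}} = \frac{149.3959}{295.0416} \approx 0.5064$$

Step 5: Calculate Context Vector $c_t$

$$c_t = \alpha_{\text{turn}} \cdot h_{\text{turn}} + \alpha_{\text{off}} \cdot h_{\text{off}} + \alpha_{\text{the}} \cdot h_{\text{the}} + \alpha_{\text{light}} \cdot h_{\text{light}}$$

$$c_t = 0.0374 \cdot [0.1,\ 0.2,\ 0.3,\ 0.4] + 0.1371 \cdot [0.5,\ 0.6,\ 0.7,\ 0.8] + 0.3191 \cdot [0.9,\ 1.0,\ 1.1,\ 1.2] + 0.5064 \cdot [1.3,\ 1.4,\ 1.5,\ 1.6]$$

$$= [0.00374,\ 0.00748,\ 0.01122,\ 0.01496] + [0.06855,\ 0.08226,\ 0.09597,\ 0.10968] + [0.28719,\ 0.31910,\ 0.35101,\ 0.38292] + [0.65832,\ 0.70896,\ 0.75960,\ 0.81024]$$

$$\approx [1.0178,\ 1.1178,\ 1.2178,\ 1.3178]$$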
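
For checking the softmax and context-vector arithmetic, a minimal end-to-end sketch in NumPy follows. It assumes the same identity `W` and all-ones `v` as the example; the array names (`H`, `concat`, `weights`, `context`) are illustrative, and subtracting the maximum score before exponentiating is a common numerical-stability habit rather than something taken from the original document. It prints the weights and the context vector so they can be compared with the hand calculation.

```python
import numpy as np

# Rows are the encoder hidden states for "turn", "off", "the", "light".
H = np.array([
    [0.1, 0.2, 0.3, 0.4],
    [0.5, 0.6, 0.7, 0.8],
    [0.9, 1.0, 1.1, 1.2],
    [1.3, 1.4, 1.5, 1.6],
])
h_decoder1 = np.array([0.0, 0.4, 1.0, 0.3])

W = np.eye(8)   # identity weight matrix assumed in the example
v = np.ones(8)  # vector of ones assumed in the example

# Steps 1-3: concatenate, project, apply tanh, and reduce with v.
concat = np.hstack([np.tile(h_decoder1, (len(H), 1)), H])  # shape (4, 8)
scores = np.tanh(concat @ W.T) @ v                          # shape (4,)

# Step 4: softmax over the scores (shifted by the max for stability).
exp_scores = np.exp(scores - scores.max())
weights = exp_scores / exp_scores.sum()

# Step 5: context vector as the attention-weighted sum of encoder states.
context = weights @ H

print("attention weights:", np.round(weights, 4))
print("context vector:  ", np.round(context, 4))
```

Since the weights are non-negative and sum to 1, each component of the context vector lies between the smallest and largest corresponding components of the encoder hidden states.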
