
[Figure 4 appears here: marginal error (blue) and total time in h plotted against learning rate, hidden size, and input noise standard deviation (columns) for TIMIT (Classification Error), IAM Online (Character Error Rate), and JSB Chorales (Negative Log Likelihood) (rows).]

Figure 4. Predicted marginal error (blue) and marginal time for different values of the learning rate, hidden size, and input noise (columns) for the test set of all three datasets (rows). The shaded area indicates the standard deviation between the tree-predicted marginals and thus the reliability of the predicted mean performance. Note that each plot is for the vanilla LSTM, but the curves for all variants that are not significantly worse look very similar.
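The caption refers to marginals predicted by an ensemble of regression trees and to the spread between those per-tree predictions. As a rough illustration of that idea (not the exact fANOVA tooling used in the paper), the following sketch computes a marginal curve and its standard deviation across the trees of a scikit-learn random forest; the function name, its arguments, and the clamp-and-average scheme are assumptions made for this example.

# Sketch: tree-predicted marginal of performance along one hyperparameter,
# with the standard deviation across trees (the shaded band in Figure 4).
# Illustrative only; not the paper's exact procedure.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

def tree_marginals(X, y, dim, grid, n_trees=100, seed=0):
    """X: (n_samples, n_hyperparams) sampled settings, y: observed test errors.
    dim: index of the hyperparameter of interest, grid: values to evaluate it at."""
    forest = RandomForestRegressor(n_estimators=n_trees, random_state=seed)
    forest.fit(X, y)

    per_tree = np.empty((n_trees, len(grid)))
    for t, tree in enumerate(forest.estimators_):
        for g, value in enumerate(grid):
            X_mod = X.copy()
            X_mod[:, dim] = value                         # clamp the hyperparameter of interest
            per_tree[t, g] = tree.predict(X_mod).mean()   # average out all the others

    # Mean curve and its spread across the individual trees.
    return per_tree.mean(axis=0), per_tree.std(axis=0)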

Figure 5. Pie charts showing which fraction of variance of the test set performance can be attributed to each of the hyperparameters. The percentage of variance that is due to interactions between multiple parameters is indicated as "higher order."
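As a rough illustration of the kind of decomposition behind Figure 5, the sketch below estimates how much of the predicted performance variance each hyperparameter accounts for on its own and lumps the remainder into a "higher order" share. It is a simplified stand-in for fANOVA, and the function name, the evaluation grid, and the use of a scikit-learn forest are assumptions for this example.

# Sketch: fraction of predicted variance attributable to each hyperparameter,
# a simplified stand-in for the fANOVA-style decomposition in Figure 5.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

def variance_shares(X, y, n_trees=100, grid_size=20, seed=0):
    forest = RandomForestRegressor(n_estimators=n_trees, random_state=seed)
    forest.fit(X, y)
    total_var = forest.predict(X).var()

    shares = {}
    for dim in range(X.shape[1]):
        grid = np.linspace(X[:, dim].min(), X[:, dim].max(), grid_size)
        marginal = []
        for value in grid:
            X_mod = X.copy()
            X_mod[:, dim] = value                        # clamp this hyperparameter ...
            marginal.append(forest.predict(X_mod).mean())  # ... average out the rest
        shares[dim] = np.var(marginal) / total_var       # single-parameter share

    # Whatever the single-parameter marginals do not explain is attributed
    # to interactions between multiple hyperparameters ("higher order").
    shares["higher order"] = max(0.0, 1.0 - sum(shares.values()))
    return shares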
For example, looking at the pair hidden size and learning rate on the left side for the TIMIT dataset, we can see that performance varies strongly along the x-axis (learning rate), first decreasing and then increasing again. This is what we would expect given the valley shape of the learning rate marginal in Figure 4. Along the y-axis (hidden size), performance seems to decrease slightly from top to bottom. Again, this is roughly what we would expect from the hidden size plot in Figure 4.

On the right side of Figure 6 we can see, for the same pair of hyperparameters, how their interaction differs from the case in which they are completely independent. This heat map exhibits less structure, and it may in fact be the case that we would need more samples to properly analyze the interplay between them. However, given our observations so far, this might not be worth the effort. In any case, it is clear from the plot on the left that varying the hidden size does not change the region of optimal learning rate.
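The two panels just described can be thought of as a joint marginal and what remains of it once the additive (independent) part is subtracted. The sketch below spells out that idea with the same forest-based marginals as above; it illustrates the general construction, not the paper's exact procedure, and all names are assumptions.

# Sketch: joint marginal over two hyperparameters (left-panel analogue) and
# the residual interaction after removing the additive part (right-panel analogue).
import numpy as np
from sklearn.ensemble import RandomForestRegressor

def interaction_map(X, y, dim_a, dim_b, grid_a, grid_b, n_trees=100, seed=0):
    forest = RandomForestRegressor(n_estimators=n_trees, random_state=seed)
    forest.fit(X, y)

    def marginal(dims, values):
        # Clamp the given hyperparameters and average the prediction over the rest.
        X_mod = X.copy()
        for d, v in zip(dims, values):
            X_mod[:, d] = v
        return forest.predict(X_mod).mean()

    mean_pred = forest.predict(X).mean()
    joint = np.array([[marginal((dim_a, dim_b), (a, b)) for b in grid_b]
                      for a in grid_a])
    marg_a = np.array([marginal((dim_a,), (a,)) for a in grid_a])
    marg_b = np.array([marginal((dim_b,), (b,)) for b in grid_b])

    # Interaction = joint marginal minus the two single-parameter effects,
    # adding the grand mean back so the additive part cancels exactly.
    interaction = joint - marg_a[:, None] - marg_b[None, :] + mean_pred
    return joint, interaction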
One clear interaction pattern can be observed in the IAM Online and JSB Chorales datasets between learning rate and input noise. Here it can be seen that for high learning rates (≳ 10−4) lower input noise (≲ 0.5) is better, as also observed in the marginals from Figure 4. This trend reverses for lower learning rates, where higher values of input noise are beneficial. Though interesting, this is of no practical relevance, because performance is generally bad in that region of low learning rates. Apart from this, however, it is difficult to discern any regularities in the analyzed hyperparameter interactions. We conclude that there is little practical value in attending to the interplay between hyperparameters. So for practical purposes, hyperparameters can be treated as approximately independent and thus optimized separately.
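One practical consequence of treating hyperparameters as approximately independent is that each can be tuned on its own grid while the others are held at their current best values. The sketch below illustrates such a one-at-a-time search; the function name, the evaluate callback, and the example grids are hypothetical and are not part of the study.

# Sketch: one-at-a-time hyperparameter search under the independence assumption.
def tune_independently(evaluate, grids, defaults):
    """evaluate: callable returning a validation error for a full setting (user-supplied).
    grids: dict name -> candidate values; defaults: dict name -> starting value."""
    best = dict(defaults)
    for name, candidates in grids.items():
        scores = {}
        for value in candidates:
            trial = dict(best, **{name: value})   # vary one hyperparameter only
            scores[value] = evaluate(**trial)
        best[name] = min(scores, key=scores.get)  # keep the value with the lowest error
    return best

# Hypothetical usage (train_and_score and the grids are placeholders):
# best = tune_independently(
#     evaluate=train_and_score,
#     grids={"learning_rate": [1e-6, 1e-5, 1e-4, 1e-3, 1e-2],
#            "hidden_size": [64, 128, 256, 512],
#            "input_noise_std": [0.0, 0.25, 0.5, 0.75, 1.0]},
#     defaults={"learning_rate": 1e-3, "hidden_size": 128, "input_noise_std": 0.0},
# )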
VI. CONCLUSION

This paper reports the results of a large scale study on variants of the LSTM architecture. We conclude that the most commonly used LSTM architecture (vanilla LSTM) performs reasonably well on various datasets. None of the eight investigated modifications significantly improves performance. However, certain modifications, such as coupling the input and forget gates (CIFG) or removing peephole connections (NP), simplified LSTMs in our experiments without significantly decreasing performance.
