
[MUSIC] The second question on stochastic gradient is: how do you pick the step size eta? This is a significant issue, just as it is with gradient. In both cases it's kind of annoying, a pain, to pick that coefficient eta, but it turns out that because of the oscillations of stochastic gradient, picking eta can be even more annoying, much more annoying. So, if we go back to the data set we've been using, I've shown you this blue curve many times; this was the best eta, the best step size, that I could find. Now, if I were to pick smaller step sizes, smaller etas, it would behave kind of like regular gradient: you see fewer oscillations and it eventually gets there, but it's much slower to converge. So, we worry about that a bit.

On the other hand, if instead of using the best step size we try a larger one, thinking we could make more progress more quickly, you see these crazy oscillations, and the oscillations are much worse than what you observe with gradient, which I showed you earlier. So, you have to be really careful: pick eta too large and things can behave extremely erratically. In fact, if you pick the step size very, very large, you can end up with behavior like this. This black line here was an eta that was "way too large," which is the technical term we like to use here. In this case, the solution is not even close to anything we got, even with the oscillating etas I showed you in the previous slide. There's a huge gap, so an eta that's too large leads to really bad behavior in stochastic gradient.
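To make concrete where eta enters the algorithm, here is a minimal sketch of stochastic gradient with a constant step size. It uses a simple squared-error objective and illustrative names rather than the exact model from this course, but the role of eta, scaling each noisy single-point update, is the same:

    import numpy as np

    def stochastic_gradient_descent(X, y, eta, n_passes=10, seed=0):
        """Plain SGD on squared error; eta is the constant step size."""
        rng = np.random.default_rng(seed)
        w = np.zeros(X.shape[1])
        for _ in range(n_passes):
            for i in rng.permutation(len(y)):            # visit one data point at a time
                gradient_i = (X[i] @ w - y[i]) * X[i]    # noisy gradient from a single point
                w -= eta * gradient_i                    # eta scales every update
        return w

A smaller eta gives smoother but slower progress; a larger eta makes bigger jumps per data point, which is exactly what produces the oscillations (and, when it's way too large, the divergence) seen in the plots.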
The rule of thumb we described for picking eta in gradient is basically the same as the one for picking the step size in stochastic gradient, but unfortunately it requires much more trial and error. So it's even more annoying: even though stochastic gradient can be a hundred times faster to converge, it's possible to spend a hundred times more effort trying to find the right step size, so just be prepared. We simply try several values exponentially spaced from each other, look for the range between an eta that's too small and an eta that's too big, and then find one that's just right, as in the sketch below.
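As a rough illustration of that trial-and-error procedure, here is a sketch that tries etas spaced by factors of ten and keeps the one with the lowest final error. It reuses the stochastic_gradient_descent sketch above and assumes X and y hold your data; the specific range of candidates is just an assumption you would adjust for your own problem:

    import numpy as np

    def mean_squared_error(X, y, w):
        return np.mean((X @ w - y) ** 2)

    # Candidate step sizes exponentially spaced from "too small" to "too big".
    candidate_etas = [10.0 ** k for k in range(-6, 1)]   # 1e-6, 1e-5, ..., 1e0

    results = {}
    for eta in candidate_etas:
        w = stochastic_gradient_descent(X, y, eta, n_passes=5)
        error = mean_squared_error(X, y, w)
        # A diverging run can produce inf/nan; treat it as "too big".
        results[eta] = error if np.isfinite(error) else np.inf

    best_eta = min(results, key=results.get)
    print("best eta found:", best_eta)

In practice you would evaluate each candidate on a validation set and rerun with a few random seeds, since a single stochastic run can be noisy.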
I mentioned this in the gradient section, but for stochastic gradient it's even more important: for those who end up exploring this further, there's an advanced technique where you make the step size decrease over iterations. So, for example, you might have an eta that depends on which iteration you're in, often set to some constant eta_0 divided by the iteration number t, that is, eta_t = eta_0 / t. This approach tends to reduce the noise and make things behave quite a bit better.
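Here is a sketch of that decreasing-step-size idea, again with illustrative names, where the step at update t is eta_0 / t as mentioned above (other decay schedules exist, but this is the one named here):

    import numpy as np

    def sgd_decreasing_step(X, y, eta0=0.5, n_passes=10, seed=0):
        """SGD where the step size shrinks as eta0 / t over updates."""
        rng = np.random.default_rng(seed)
        w = np.zeros(X.shape[1])
        t = 0
        for _ in range(n_passes):
            for i in rng.permutation(len(y)):
                t += 1
                eta_t = eta0 / t                          # decreasing step size schedule
                gradient_i = (X[i] @ w - y[i]) * X[i]
                w -= eta_t * gradient_i                   # later updates take smaller steps
        return w

Early updates still move quickly, while later updates take smaller and smaller steps, which is what damps the noise toward the end of the run.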
[MUSIC]
