ECS171: Machine Learning

Lecture 10: Neural Networks

Cho-Jui Hsieh
UC Davis

Feb 12, 2018


Neural Networks
Another way to introduce nonlinearity

How to generate this nonlinear hypothesis?

Combining multiple perceptrons to construct a nonlinear hypothesis!

Combining perceptrons

Example: h = (h1 OR h2):

h(x) = sign(1.5 + h1(x) + h2(x)),  where h1(x) = sign(w1^T x), h2(x) = sign(w2^T x)
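As a concrete check of this OR construction, here is a minimal NumPy sketch; the weight vectors w1, w2 and the test point are made up for illustration and are not from the lecture.

import numpy as np

def perceptron(w):
    """Return a perceptron hypothesis h(x) = sign(w^T x); x includes the bias term x0 = 1."""
    return lambda x: np.sign(w @ x)

# Two hypothetical linear separators in 2D, with a bias coordinate in position 0.
w1 = np.array([-0.5, 1.0, 0.0])   # h1(x) = sign(-0.5 + x1)
w2 = np.array([-0.5, 0.0, 1.0])   # h2(x) = sign(-0.5 + x2)
h1, h2 = perceptron(w1), perceptron(w2)

def h(x):
    # OR of two perceptrons: +1 whenever at least one of h1, h2 outputs +1.
    return np.sign(1.5 + h1(x) + h2(x))

x = np.array([1.0, 0.2, 0.9])     # [bias, x1, x2]
print(h1(x), h2(x), h(x))         # -1.0 1.0 1.0  -> the OR fires because h2 does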
Creating more layers

Creating more layers of perceptrons gives a feedforward network
Activation Function

Perceptron: the activation function is a "hard threshold"

h(x) = θ(w^T x),  θ(x) = sign(x)

θ: activation function
Non-differentiable, hard to optimize
Replace θ with a smoother, differentiable function (e.g., tanh)
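A quick numerical look at the hard threshold versus tanh, the smooth activation used in the rest of this lecture; the sample points are arbitrary.

import numpy as np

s = np.linspace(-3, 3, 7)

hard = np.sign(s)            # perceptron's hard threshold: non-differentiable at 0
soft = np.tanh(s)            # smooth replacement
dsoft = 1.0 - np.tanh(s)**2  # θ'(s) = 1 − θ(s)², which backpropagation relies on below

print(np.round(soft, 3))
print(np.round(dsoft, 3))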
Formal Definitions

1 ≤ l ≤ L : layers
w_{ij}^{(l)}, 0 ≤ i ≤ d^{(l−1)} : inputs
1 ≤ j ≤ d^{(l)} : outputs

j-th neuron in the l-th layer:

x_j^{(l)} = θ(s_j^{(l)}) = θ( Σ_{i=0}^{d^{(l−1)}} w_{ij}^{(l)} x_i^{(l−1)} )

Output:

h(x) = x_1^{(L)}
Forward propagation
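A minimal sketch of this forward pass with θ = tanh, handling the bias by prepending x_0 = 1 at each layer; the layer sizes, weight values, and input are made up for illustration.

import numpy as np

def forward(x, weights):
    """Forward propagation with θ = tanh.

    weights[l] has shape (d^{(l-1)} + 1, d^{(l)}); row 0 holds the bias weights,
    matching s_j^{(l)} = Σ_{i=0}^{d^{(l-1)}} w_{ij}^{(l)} x_i^{(l-1)} with x_0 = 1.
    Returns all layer outputs x^{(0)}, ..., x^{(L)} (kept for the backward pass later).
    """
    xs = [x]
    for W in weights:
        x_with_bias = np.concatenate(([1.0], xs[-1]))   # prepend x_0^{(l-1)} = 1
        s = x_with_bias @ W                              # s^{(l)}
        xs.append(np.tanh(s))                            # x^{(l)} = θ(s^{(l)})
    return xs

# Hypothetical 2-3-1 network with small random weights.
rng = np.random.default_rng(0)
weights = [rng.normal(scale=0.5, size=(3, 3)),   # layer 1: 2 inputs + bias -> 3 units
           rng.normal(scale=0.5, size=(4, 1))]   # layer 2: 3 units + bias -> 1 output
xs = forward(np.array([0.3, -1.2]), weights)
print(xs[-1])    # h(x) = x_1^{(L)}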
Stochastic Gradient Descent

All the weights W = {W_1, · · · , W_L} determine h(x)

Error on example (x_n, y_n) is

e(h(x_n), y_n) = e(W)

To implement SGD, we need the gradient

∇e(W) : { ∂e(W)/∂w_{ij}^{(l)} } for all i, j, l
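One (inefficient) way to obtain these partial derivatives is numerically: perturb each weight and difference the error. The sketch below does this for a made-up two-layer tanh network with square loss; the helper names, layer sizes, and values are illustrative. The next slides show how backpropagation gets the same gradient far more cheaply.

import numpy as np

def h(x, weights):
    # tanh feedforward network; bias weights stored as row 0 of each weight matrix.
    for W in weights:
        x = np.tanh(np.concatenate(([1.0], x)) @ W)
    return x

def e(weights, xn, yn):
    # squared error on a single example (x_n, y_n)
    return float((h(xn, weights)[0] - yn) ** 2)

# Estimate ∂e(W)/∂w_{ij}^{(l)} by central finite differences, one weight at a time.
rng = np.random.default_rng(1)
weights = [rng.normal(scale=0.5, size=(3, 2)), rng.normal(scale=0.5, size=(3, 1))]
xn, yn, eps = np.array([0.5, -0.4]), 1.0, 1e-6

grads = []
for W in weights:
    G = np.zeros_like(W)
    for i in range(W.shape[0]):
        for j in range(W.shape[1]):
            W[i, j] += eps
            e_plus = e(weights, xn, yn)
            W[i, j] -= 2 * eps
            e_minus = e(weights, xn, yn)
            W[i, j] += eps
            G[i, j] = (e_plus - e_minus) / (2 * eps)
    grads.append(G)

print(grads[0])   # ∂e(W)/∂w_{ij}^{(1)} for all i, j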
∂e(W )
Computing Gradient (l)
∂wij

Use chain rule:


(l)
∂e(W ) ∂e(W ) ∂sj
(l)
= (l)
× (l)
∂wij ∂sj ∂wij

(l) Pd (l−1) (l)


sj = i=1 xi wij
(l)
∂sj (l−1)
We have (l) = xi
∂w ij
∂e(W )
Computing Gradient (l)
∂wij

(l) ∂e(W )
Define δj := (l)
∂sj
Compute by layer-by-layer:

(l−1) ∂e(W )
δi = (l−1)
∂si
d (l) (l−1)
X ∂e(W ) ∂sj ∂xi
= × ×
j=1 ∂sj
(l) (l−1)
∂xi ∂sil−1
d
(l) (l) (l−1)
X
= δj × wij × θ0 (si ),
j=1

where θ0 (s) = 1 − θ2 (s) for tanh


(l−1) (l−1) 2 Pd (l) (l)
δi = (1 − (xi ) ) j=1 wij δj
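The same recursion, written as one vectorized backward step for θ = tanh; the shapes and numbers below are illustrative, with the bias weights stored as row 0 of the weight matrix as in the earlier sketches.

import numpy as np

def backward_step(delta_l, W, x_prev):
    # δ_i^{(l-1)} = (1 − (x_i^{(l-1)})²) Σ_j w_{ij}^{(l)} δ_j^{(l)}
    # W has shape (d^{(l-1)} + 1, d^{(l)}); W[0] is the bias row, which has no δ.
    return (1.0 - x_prev ** 2) * (W[1:] @ delta_l)

delta_l = np.array([0.2, -0.1])           # δ^{(l)}
W = np.array([[0.1, 0.3],
              [0.5, -0.2],
              [0.4, 0.6]])                # w^{(l)}
x_prev = np.array([0.7, -0.3])            # x^{(l-1)} (without the bias entry)
print(backward_step(delta_l, W, x_prev))  # δ^{(l-1)}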
Final layer

(Assume square loss)

e(W) = (x_1^{(L)} − y_n)²
x_1^{(L)} = θ(s_1^{(L)})

So,

δ_1^{(L)} = ∂e(W)/∂s_1^{(L)}
          = ∂e(W)/∂x_1^{(L)} × ∂x_1^{(L)}/∂s_1^{(L)}
          = 2(x_1^{(L)} − y_n) × θ'(s_1^{(L)})
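As a one-line sketch, this is the seed of the backward pass for the square loss with a tanh output unit (so θ'(s_1^{(L)}) = 1 − (x_1^{(L)})²); the sample values are arbitrary.

import numpy as np

def final_delta(x_L, y_n):
    # δ_1^{(L)} = 2 (x_1^{(L)} − y_n) (1 − (x_1^{(L)})²)
    return 2.0 * (x_L - y_n) * (1.0 - x_L ** 2)

print(final_delta(np.array([0.4]), 1.0))   # -> [-1.008]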
Backward propagation
Backpropagation

SGD for neural networks:

Initialize all weights w_{ij}^{(l)} at random
For iter = 0, 1, 2, · · ·
  Forward: compute all x_j^{(l)} from input to output
  Backward: compute all δ_j^{(l)} from output to input
  Update all the weights: w_{ij}^{(l)} ← w_{ij}^{(l)} − η x_i^{(l−1)} δ_j^{(l)}
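Putting the pieces together, a minimal NumPy sketch of this SGD loop with tanh activations, square loss, and the bias stored as row 0 of each weight matrix. The toy XOR-style data, layer sizes, step size, and iteration count are arbitrary choices, so the quality of the final fit will vary with the random seed.

import numpy as np

def forward(x, weights):
    """Forward pass (θ = tanh); returns all layer outputs x^{(0)}, ..., x^{(L)}."""
    xs = [x]
    for W in weights:
        xs.append(np.tanh(np.concatenate(([1.0], xs[-1])) @ W))
    return xs

def backward(xs, weights, y):
    """Backward pass: δ^{(L)} from the square loss, then the layer-by-layer recursion."""
    deltas = [2.0 * (xs[-1] - y) * (1.0 - xs[-1] ** 2)]                # δ^{(L)}
    for W, x_prev in zip(reversed(weights[1:]), reversed(xs[1:-1])):
        deltas.append((1.0 - x_prev ** 2) * (W[1:] @ deltas[-1]))      # δ^{(l-1)}
    return list(reversed(deltas))                                      # δ^{(1)}, ..., δ^{(L)}

def sgd(data, weights, eta=0.1, iters=1000, seed=0):
    rng = np.random.default_rng(seed)
    for _ in range(iters):
        xn, yn = data[rng.integers(len(data))]                         # pick one example
        xs = forward(xn, weights)
        deltas = backward(xs, weights, yn)
        for l, W in enumerate(weights):                                # w_ij -= η x_i^{(l-1)} δ_j^{(l)}
            x_with_bias = np.concatenate(([1.0], xs[l]))
            W -= eta * np.outer(x_with_bias, deltas[l])
    return weights

# Toy usage: fit XOR-style targets in {-1, +1} with a hypothetical 2-3-1 tanh network.
rng = np.random.default_rng(0)
weights = [rng.normal(scale=0.5, size=(3, 3)), rng.normal(scale=0.5, size=(4, 1))]
data = [(np.array([a, b]), np.array([1.0 if a != b else -1.0]))
        for a in (0.0, 1.0) for b in (0.0, 1.0)]
weights = sgd(data, weights, eta=0.5, iters=5000)
print([forward(x, weights)[-1].round(2) for x, _ in data])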
Backpropagation

Just an automatic way to apply the chain rule to compute gradients

Auto-differentiation (AD): as long as we define the derivative of each basic function, we can use AD to differentiate any of their compositions
Implemented in most deep learning packages (e.g., PyTorch, TensorFlow)
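For comparison, a minimal PyTorch sketch (assuming PyTorch is installed): the same kind of two-layer forward computation is written with differentiable primitives, and loss.backward() applies the chain rule automatically, filling each weight tensor's .grad field. The network shape, input, and target are made up for illustration.

import torch

W1 = torch.randn(3, 3, requires_grad=True)   # layer 1 (bias row included)
W2 = torch.randn(4, 1, requires_grad=True)   # layer 2

xn = torch.tensor([0.3, -1.2])
yn = torch.tensor([1.0])

x1 = torch.tanh(torch.cat([torch.ones(1), xn]) @ W1)    # hidden layer
out = torch.tanh(torch.cat([torch.ones(1), x1]) @ W2)   # h(x)
loss = (out - yn).pow(2).sum()                          # square loss

loss.backward()                       # backpropagation, done automatically
print(W1.grad.shape, W2.grad.shape)   # ∂e/∂W1 and ∂e/∂W2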
Conclusions

Next class: LFD 4.1

Questions?
