
Deep Learning Basics

Lecture 1: Feedforward
Princeton University COS 495
Instructor: Yingyu Liang
Motivation I: representation learning
Machine learning 1-2-3

• Collect data and extract features


• Build model: choose hypothesis class 𝓗 and loss function 𝑙
• Optimization: minimize the empirical loss
Features

Diagram: input 𝑥 → extract features 𝜙(𝑥) (e.g., a color histogram over the red, green, and blue channels) → build hypothesis 𝑦 = 𝑤ᵀ𝜙(𝑥)


Features: part of the model

Diagram: a nonlinear model = the nonlinear feature map 𝜙(𝑥) followed by the linear model 𝑦 = 𝑤ᵀ𝜙(𝑥)
Example: Polynomial kernel SVM
• Inputs 𝑥1, 𝑥2; prediction 𝑦 = sign(𝑤ᵀ𝜙(𝑥) + 𝑏) with a fixed feature map 𝜙(𝑥)
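
As a rough sketch of this fixed-feature setup (not from the slides): a hypothetical degree-2 polynomial feature map 𝜙 and a linear decision rule on top of it, in NumPy, with made-up weights standing in for a trained SVM.

import numpy as np

def poly2_features(x):
    # Fixed degree-2 polynomial feature map phi(x) for x = (x1, x2);
    # chosen ahead of time, not learned.
    x1, x2 = x
    return np.array([1.0, x1, x2, x1 * x2, x1 ** 2, x2 ** 2])

def svm_predict(x, w, b):
    # y = sign(w^T phi(x) + b)
    return np.sign(w @ poly2_features(x) + b)

# Hypothetical weights; in practice w, b come from SVM training.
w = np.array([0.0, 1.0, -1.0, 0.5, 0.2, -0.3])
b = 0.1
print(svm_predict(np.array([0.5, -1.0]), w, b))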
Motivation: representation learning
• Why don’t we also learn 𝜙(𝑥)?

Diagram: 𝑥 → learn 𝜙(𝑥) → learn 𝑤 → 𝑦 = 𝑤ᵀ𝜙(𝑥)
Feedforward networks
• View each dimension of 𝜙(𝑥) as something to be learned

Diagram: 𝑥 → 𝜙(𝑥) → 𝑦 = 𝑤ᵀ𝜙(𝑥)
Feedforward networks
• Linear functions 𝜙𝑖(𝑥) = 𝜃𝑖ᵀ𝑥 don’t work: the overall map 𝑤ᵀ𝜙(𝑥) stays linear in 𝑥, so we need some nonlinearity

Diagram: 𝑥 → 𝜙(𝑥) → 𝑦 = 𝑤ᵀ𝜙(𝑥)
Feedforward networks
• Typically, set 𝜙𝑖(𝑥) = 𝑟(𝜃𝑖ᵀ𝑥) where 𝑟(⋅) is some nonlinear function

Diagram: 𝑥 → 𝜙(𝑥) → 𝑦 = 𝑤ᵀ𝜙(𝑥)
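
A minimal sketch of this one-hidden-layer computation in NumPy, assuming ReLU as the nonlinearity 𝑟 and arbitrary shapes; all names and values here are illustrative, not part of the lecture.

import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def forward_one_layer(x, Theta, w):
    # phi_i(x) = r(theta_i^T x): each row of Theta is one theta_i.
    phi = relu(Theta @ x)
    # y = w^T phi(x)
    return w @ phi

rng = np.random.default_rng(0)
x = rng.normal(size=4)            # input vector
Theta = rng.normal(size=(8, 4))   # parameters of the learned features
w = rng.normal(size=8)            # output weights
print(forward_one_layer(x, Theta, w))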
Feedforward deep networks
• What if we go deeper?

Diagram: 𝑥 → ℎ1 → ℎ2 → …… → ℎ𝐿 → 𝑦
Figure from Deep Learning, by Goodfellow, Bengio, Courville. Dark boxes are things to be learned.
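
A sketch of the deeper version, assuming each hidden layer ℎ𝑙 is an affine map of the previous layer followed by ReLU (shapes and names are hypothetical):

import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def forward_deep(x, layers, w):
    # layers: list of (W_l, b_l); h_l = r(W_l h_{l-1} + b_l), with h_0 = x.
    h = x
    for W, b in layers:
        h = relu(W @ h + b)
    return w @ h  # final linear readout y = w^T h_L

rng = np.random.default_rng(0)
sizes = [4, 8, 8, 8]  # x -> h1 -> h2 -> h3
layers = [(rng.normal(size=(m, n)), np.zeros(m)) for n, m in zip(sizes, sizes[1:])]
w = rng.normal(size=sizes[-1])
print(forward_deep(rng.normal(size=4), layers, w))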
Motivation II: neurons
Motivation: neurons

Figure from Wikipedia.
Motivation: abstract neuron model
• A neuron is activated when the correlation between the input and a pattern 𝜃 exceeds some threshold 𝑏
• 𝑦 = threshold(𝜃ᵀ𝑥 − 𝑏) or 𝑦 = 𝑟(𝜃ᵀ𝑥 − 𝑏)
• 𝑟(⋅) is called the activation function

Diagram: inputs 𝑥1, 𝑥2, …, 𝑥𝑑 feed into a single unit that outputs 𝑦
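
A toy version of this abstract neuron, assuming a hard-threshold activation; the pattern 𝜃, threshold 𝑏, and input are made up for illustration.

import numpy as np

def neuron(x, theta, b, r=lambda z: float(z >= 0)):
    # y = r(theta^T x - b); the default r is the hard threshold.
    return r(theta @ x - b)

theta = np.array([0.5, -0.2, 1.0])  # the pattern the neuron responds to
x = np.array([1.0, 0.0, 0.8])
print(neuron(x, theta, b=0.9))      # 1.0: the correlation exceeds the threshold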
Motivation: artificial neural networks
• Put into layers: feedforward deep networks

Diagram: 𝑥 → ℎ1 → ℎ2 → …… → ℎ𝐿 → 𝑦
Components in Feedforward networks
Components
• Representations:
  • Input
  • Hidden variables
• Layers/weights:
  • Hidden layers
  • Output layer
Components

Diagram: input 𝑥 → first layer → hidden variables ℎ1, ℎ2, …, ℎ𝐿 → output layer → 𝑦


Input
• Represented as a vector
• Sometimes requires some preprocessing, e.g.:
  • Subtract the mean
  • Normalize to [-1, 1]
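
A small sketch of this kind of preprocessing, assuming the data is a NumPy array with one example per row; in practice the statistics would be computed on the training set only.

import numpy as np

def preprocess(X):
    # Subtract the per-feature mean, then scale each feature into [-1, 1].
    X = X - X.mean(axis=0)
    max_abs = np.abs(X).max(axis=0)
    max_abs[max_abs == 0] = 1.0  # avoid dividing by zero for constant features
    return X / max_abs

X = np.array([[1.0, 200.0], [2.0, 400.0], [3.0, 600.0]])
print(preprocess(X))  # each column now has zero mean and lies in [-1, 1]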
Output layers
• Regression: 𝑦 = 𝑤ᵀℎ + 𝑏
• Linear units: no nonlinearity


Output layers
• Multi-dimensional regression: 𝑦 = 𝑊ᵀℎ + 𝑏
• Linear units: no nonlinearity


Output layers
• Binary classification: 𝑦 = 𝜎(𝑤ᵀℎ + 𝑏)
• Corresponds to using logistic regression on ℎ


Output layers
• Multi-class classification: 𝑦 = softmax(𝑧) where 𝑧 = 𝑊ᵀℎ + 𝑏
• Corresponds to using multi-class logistic regression on ℎ
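
A sketch of these output layers on top of a hidden representation ℎ, in NumPy with a numerically stabilized softmax; the shapes and weights are illustrative.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max())  # subtracting max(z) avoids overflow; result unchanged
    return e / e.sum()

def linear_output(h, w, b):      # regression: y = w^T h + b
    return w @ h + b

def binary_output(h, w, b):      # binary classification: y = sigma(w^T h + b)
    return sigmoid(w @ h + b)

def multiclass_output(h, W, b):  # multi-class: y = softmax(W^T h + b)
    return softmax(W.T @ h + b)

rng = np.random.default_rng(0)
h = rng.normal(size=5)
W, b = rng.normal(size=(5, 3)), np.zeros(3)
print(multiclass_output(h, W, b))  # probabilities over 3 classes, summing to 1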


Hidden layers
• Each neuron takes a weighted linear combination of the previous layer
• So it can be thought of as outputting one value for the next layer

Diagram: layer ℎ𝑖 → layer ℎ𝑖+1
Hidden layers
• 𝑦 = 𝑟(𝑤ᵀ𝑥 + 𝑏)
• Typical activation functions 𝑟 (sketched in the example below):
  • Threshold: t(𝑧) = 𝕀[𝑧 ≥ 0]
  • Sigmoid: 𝜎(𝑧) = 1/(1 + exp(−𝑧))
  • Tanh: tanh(𝑧) = 2𝜎(2𝑧) − 1
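
A sketch of these three activations in NumPy; the tanh line uses the same 2𝜎(2𝑧) − 1 identity as the slide.

import numpy as np

def threshold(z):
    return (z >= 0).astype(float)        # t(z) = 1[z >= 0]

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))      # sigma(z) = 1 / (1 + exp(-z))

def tanh(z):
    return 2.0 * sigmoid(2.0 * z) - 1.0  # tanh(z) = 2*sigma(2z) - 1

z = np.linspace(-3, 3, 7)
print(threshold(z), sigmoid(z), tanh(z), sep="\n")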
Hidden layers
• Problem: saturation (for large |𝑧|, the sigmoid and tanh curves are nearly flat, so the gradient is too small)

Figure borrowed from Pattern Recognition and Machine Learning, Bishop.
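
A small numeric illustration of saturation, using the standard sigmoid derivative 𝜎′(𝑧) = 𝜎(𝑧)(1 − 𝜎(𝑧)); the sample points are arbitrary.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sigmoid_grad(z):
    s = sigmoid(z)
    return s * (1.0 - s)  # sigma'(z) = sigma(z) * (1 - sigma(z))

for z in [0.0, 2.0, 5.0, 10.0]:
    print(z, sigmoid_grad(z))  # shrinks from 0.25 toward about 4.5e-05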


Hidden layers
• Activation function ReLU (rectified linear unit)
• ReLU(𝑧) = max{𝑧, 0}

Figure from Deep Learning, by Goodfellow, Bengio, Courville.
Hidden layers
• Activation function ReLU (rectified linear unit)
• ReLU(𝑧) = max{𝑧, 0}: gradient 1 for 𝑧 > 0, gradient 0 for 𝑧 < 0
Hidden layers
• Generalizations of ReLU: gReLU(𝑧) = max{𝑧, 0} + 𝛼 min{𝑧, 0}
  • Leaky-ReLU(𝑧) = max{𝑧, 0} + 0.01 min{𝑧, 0}
  • Parametric-ReLU(𝑧): 𝛼 learnable
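
A sketch of the ReLU family as a single parameterized function, in NumPy; the parametric version would simply treat 𝛼 as a learnable parameter alongside the weights.

import numpy as np

def g_relu(z, alpha):
    # gReLU(z) = max{z, 0} + alpha * min{z, 0}
    return np.maximum(z, 0.0) + alpha * np.minimum(z, 0.0)

def relu(z):
    return g_relu(z, alpha=0.0)   # plain ReLU

def leaky_relu(z):
    return g_relu(z, alpha=0.01)  # fixed small slope for z < 0

z = np.array([-2.0, -0.5, 0.0, 1.5])
print(relu(z), leaky_relu(z), g_relu(z, alpha=0.1), sep="\n")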
