
Improving Learning: Feature Scaling
•  Idea: Ensure that features have similar scales
[Figure: two scatter plots, "Before Feature Scaling" and "After Feature Scaling"; after rescaling, both features span comparable ranges]
•  Makes gradient descent converge much faster

Feature Standardization
•  Rescales features to have zero mean and unit variance
   –  Let $\mu_j$ be the mean of feature $j$:  $\mu_j = \frac{1}{n} \sum_{i=1}^{n} x_j^{(i)}$
   –  Replace each value with:

$$x_j^{(i)} \leftarrow \frac{x_j^{(i)} - \mu_j}{s_j} \qquad \text{for } j = 1 \ldots d \text{ (not } x_0\text{!)}$$

•  $s_j$ is the standard deviation of feature $j$
•  Could also use the range of feature $j$ ($\max_j - \min_j$) for $s_j$
•  Must apply the same transformation to instances for both training and prediction, as in the sketch below
•  Outliers can cause problems
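A minimal NumPy sketch of this procedure (function and variable names are illustrative, not from the slides; the bias feature $x_0 = 1$ is assumed to be appended after scaling):

```python
import numpy as np

def fit_standardizer(X_train):
    """Compute the per-feature mean and standard deviation on the training set only."""
    mu = X_train.mean(axis=0)
    s = X_train.std(axis=0)
    return mu, s

def standardize(X, mu, s):
    """Apply the training-set statistics to any instances (training or prediction time)."""
    return (X - mu) / s

# Toy usage: the same mu and s are reused at prediction time, never recomputed
X_train = np.array([[1.0, 200.0], [2.0, 300.0], [3.0, 400.0]])
X_new = np.array([[1.5, 250.0]])
mu, s = fit_standardizer(X_train)
X_train_std = standardize(X_train, mu, s)
X_new_std = standardize(X_new, mu, s)  # same transformation as for training
```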
Quality of Fit

[Figure: three Price vs. Size fits: underfitting (high bias), correct fit, and overfitting (high variance)]

Overfitting:
•  The learned hypothesis may fit the training set very well ($J(\theta) \approx 0$)
•  ...but fails to generalize to new examples

Based on example by Andrew Ng


Regularization
•  A method for automatically controlling the complexity of the learned hypothesis
•  Idea: penalize large values of $\theta_j$
   –  Can incorporate into the cost function
   –  Works well when we have a lot of features, each of which contributes a bit to predicting the label
•  Can also address overfitting by eliminating features (either manually or via model selection)

Regularization
•  Linear regression objective function

$$J(\theta) = \underbrace{\frac{1}{2n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right)^2}_{\text{model fit to data}} + \underbrace{\frac{\lambda}{2} \sum_{j=1}^{d} \theta_j^2}_{\text{regularization}}$$

   –  $\lambda$ is the regularization parameter ($\lambda \ge 0$)
   –  No regularization on $\theta_0$!
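As a sketch, the objective above could be computed in NumPy as follows (assuming $X$ already includes the $x_0 = 1$ column; names are illustrative, not from the slides):

```python
import numpy as np

def regularized_cost(theta, X, y, lam):
    """J(theta) = (1/2n) * sum_i (h_theta(x_i) - y_i)^2 + (lam/2) * sum_{j>=1} theta_j^2."""
    n = len(y)
    residuals = X @ theta - y                  # h_theta(x^(i)) - y^(i) for all i
    fit = (residuals @ residuals) / (2 * n)    # model fit to data
    reg = (lam / 2) * np.sum(theta[1:] ** 2)   # regularization; theta_0 is excluded
    return fit + reg
```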

Understanding Regularization

$$J(\theta) = \frac{1}{2n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right)^2 + \frac{\lambda}{2} \sum_{j=1}^{d} \theta_j^2$$

•  Note that $\sum_{j=1}^{d} \theta_j^2 = \|\theta_{1:d}\|_2^2$
   –  This is the squared magnitude of the feature coefficient vector!
•  We can also think of this as: $\sum_{j=1}^{d} (\theta_j - 0)^2 = \|\theta_{1:d} - \vec{0}\|_2^2$
•  L2 regularization pulls coefficients toward 0
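A one-line sanity check of this identity in NumPy (coefficient values invented for illustration):

```python
import numpy as np

theta = np.array([0.5, -1.2, 3.0, 0.7])    # [theta_0, theta_1, ..., theta_d]
penalty = np.sum(theta[1:] ** 2)           # sum_{j=1}^{d} theta_j^2
norm_sq = np.linalg.norm(theta[1:]) ** 2   # ||theta_{1:d}||_2^2
assert np.isclose(penalty, norm_sq)
```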
Understanding Regularization

$$J(\theta) = \frac{1}{2n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right)^2 + \frac{\lambda}{2} \sum_{j=1}^{d} \theta_j^2$$

•  What happens if we set $\lambda$ to be huge (e.g., $10^{10}$)?

[Figure: Price vs. Size with a high-degree polynomial fit]

Based on example by Andrew Ng


Understanding Regularization

$$J(\theta) = \frac{1}{2n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right)^2 + \frac{\lambda}{2} \sum_{j=1}^{d} \theta_j^2$$

•  What happens if we set $\lambda$ to be huge (e.g., $10^{10}$)? The penalty dominates, so $\theta_1, \ldots, \theta_d$ are all driven to $\approx 0$ and the hypothesis reduces to the constant $h_\theta(x) \approx \theta_0$, which underfits.

[Figure: Price vs. Size with a flat fit at $\theta_0$; the penalized coefficients $\theta_1, \ldots, \theta_d$ are all $\approx 0$]

Based on example by Andrew Ng
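To see this numerically, here is a sketch that uses the closed-form minimizer of the objective above rather than gradient descent (toy data and all names invented for illustration):

```python
import numpy as np

# Made-up (size, price) data and degree-4 polynomial features with a bias column
size = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
price = np.array([2.1, 3.9, 6.2, 8.1, 9.8])
X = np.vander(size, N=5, increasing=True)   # columns: 1, x, x^2, x^3, x^4

def ridge_fit(X, y, lam):
    """Closed-form minimizer of (1/2n)*||X theta - y||^2 + (lam/2)*sum_{j>=1} theta_j^2.
    Setting the gradient to zero gives (X^T X + n*lam*I') theta = X^T y,
    where I' is the identity with a 0 in position (0, 0) so theta_0 is unpenalized."""
    n = len(y)
    I = np.eye(X.shape[1])
    I[0, 0] = 0.0
    return np.linalg.solve(X.T @ X + n * lam * I, X.T @ y)

theta = ridge_fit(X, price, lam=1e10)
print(theta)   # theta_1..theta_4 are ~0; the fit collapses to the flat line theta_0
```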


Regularized Linear Regression
•  Cost Function

$$J(\theta) = \frac{1}{2n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right)^2 + \frac{\lambda}{2} \sum_{j=1}^{d} \theta_j^2$$

•  Fit by solving $\min_\theta J(\theta)$
•  Gradient update:

$$\frac{\partial}{\partial \theta_0} J(\theta): \qquad \theta_0 \leftarrow \theta_0 - \alpha \frac{1}{n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right)$$

$$\frac{\partial}{\partial \theta_j} J(\theta): \qquad \theta_j \leftarrow \theta_j - \alpha \frac{1}{n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right) x_j^{(i)} - \alpha \lambda \theta_j$$

(the final $-\alpha \lambda \theta_j$ term is the regularization)
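A compact NumPy sketch of one such update (assuming $X$ contains the $x_0 = 1$ column; names are illustrative, not from the slides):

```python
import numpy as np

def gradient_step(theta, X, y, alpha, lam):
    """One regularized gradient-descent update; theta[0] is the unpenalized intercept."""
    n = len(y)
    residuals = X @ theta - y        # h_theta(x^(i)) - y^(i)
    grad = (X.T @ residuals) / n     # (1/n) * sum_i (...) * x_j^(i), for every j
    grad[1:] += lam * theta[1:]      # + lambda * theta_j, for j >= 1 only
    return theta - alpha * grad

# Toy usage
X = np.array([[1.0, -1.0], [1.0, 0.0], [1.0, 1.0]])   # ones column plus one feature
y = np.array([1.0, 2.0, 3.0])
theta = np.zeros(2)
for _ in range(1000):
    theta = gradient_step(theta, X, y, alpha=0.1, lam=0.01)
```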
Regularized Linear Regression

$$J(\theta) = \frac{1}{2n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right)^2 + \frac{\lambda}{2} \sum_{j=1}^{d} \theta_j^2$$

$$\theta_0 \leftarrow \theta_0 - \alpha \frac{1}{n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right)$$

$$\theta_j \leftarrow \theta_j - \alpha \frac{1}{n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right) x_j^{(i)} - \alpha \lambda \theta_j$$

•  We can rewrite the gradient step as:

$$\theta_j \leftarrow \theta_j (1 - \alpha \lambda) - \alpha \frac{1}{n} \sum_{i=1}^{n} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right) x_j^{(i)}$$

•  Since $0 < 1 - \alpha \lambda < 1$ for typical choices of $\alpha$ and $\lambda$, each step first shrinks $\theta_j$ slightly toward 0 and then applies the ordinary (unregularized) gradient update.