
Overfitting & Regularization

Overfitting:- The model learns the underlying information plus the noise in the training data.
Common ways to address overfitting:
• cross-validation sampling (see the sketch after this list)
• reducing the number of features
• pruning
• Regularization:- adds a penalty as model complexity increases.
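A minimal sketch of the cross-validation point, assuming scikit-learn and a synthetic dataset (the data and model are illustrative, not taken from these slides): it compares the training score with the cross-validated score, and a large gap between the two is a typical symptom of overfitting.

    from sklearn.datasets import make_regression
    from sklearn.linear_model import LinearRegression
    from sklearn.model_selection import cross_val_score

    X, y = make_regression(n_samples=100, n_features=50, noise=10.0, random_state=0)

    model = LinearRegression()
    train_score = model.fit(X, y).score(X, y)          # R^2 on the data it was trained on
    cv_scores = cross_val_score(model, X, y, cv=5)     # R^2 on 5 held-out folds

    print("Training R^2:       ", round(train_score, 3))
    print("Cross-validated R^2:", round(cv_scores.mean(), 3))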



• When a dataset has a large number of features compared to the
number of observations, the regularization (shrinkage)
techniques commonly used to address over-fitting and perform
feature selection are:
» L2 – Ridge regression
» L1 – Lasso regression
• Ridge and Lasso regression are simple techniques to reduce
model complexity and prevent the over-fitting that may result
from ordinary (unregularized) linear regression.



Ridge Regression (L2)

The regularization parameter (λ) penalizes all the parameters
except the intercept, so that the model generalizes from the data
and does not overfit.
Ridge regression also tends to mitigate the multicollinearity
problem through the shrinkage parameter λ.
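A minimal sketch of L2 regularization using scikit-learn's Ridge estimator (the synthetic data and alpha=1.0 are illustrative assumptions, not values from the slides):

    from sklearn.datasets import make_regression
    from sklearn.linear_model import LinearRegression, Ridge

    X, y = make_regression(n_samples=50, n_features=20, noise=5.0, random_state=0)

    # Ridge minimizes ||y - Xw||^2 + alpha * ||w||^2; the intercept is not penalized.
    # alpha plays the role of the shrinkage parameter λ.
    ols = LinearRegression().fit(X, y)
    ridge = Ridge(alpha=1.0).fit(X, y)

    print("Largest OLS coefficient:  ", abs(ols.coef_).max())
    print("Largest Ridge coefficient:", abs(ridge.coef_).max())
    # The Ridge coefficients are shrunk toward zero relative to plain least squares.

Larger values of alpha shrink the coefficients more strongly; in practice alpha is usually tuned with cross-validation (for example via RidgeCV).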
Lasso Regression (L1)

• Lasso (Least Absolute Shrinkage and Selection Operator)
penalizes the absolute size of the regression coefficients.
• In addition, it can reduce the variability and improve the
accuracy of linear regression models.
• Helps in dimensionality reduction and feature selection, since
many coefficients are driven exactly to zero (see the sketch
below).
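A minimal sketch of L1 regularization with scikit-learn's Lasso estimator (the synthetic data and alpha=0.5 are illustrative assumptions):

    from sklearn.datasets import make_regression
    from sklearn.linear_model import Lasso

    # Only 5 of the 20 features are actually informative in this synthetic data.
    X, y = make_regression(n_samples=100, n_features=20, n_informative=5,
                           noise=5.0, random_state=0)

    lasso = Lasso(alpha=0.5).fit(X, y)

    # The L1 penalty drives many coefficients to exactly zero,
    # which is what makes Lasso useful for feature selection.
    print("Non-zero coefficients:", (lasso.coef_ != 0).sum(), "out of", len(lasso.coef_))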
• Traditional methods such as cross-validation and stepwise
regression handle overfitting and perform feature selection
well with a small set of features, but the Ridge and Lasso
regularization techniques are a great alternative when we are
dealing with a large set of features.



Feature scaling..
• A machine learning algorithm just sees numbers. If there is a
vast difference in range, say a few features in the thousands
and a few in the tens, it effectively assumes that the
higher-ranging numbers have a higher impact on the response.
These larger numbers then play a more decisive role while
training the model, leading to a biased fit.
• Thus feature scaling is needed to bring every feature onto the
same scale, without giving any feature undue upfront importance
(a small sketch follows this list).
• Feature scaling also helps training algorithms such as gradient
descent converge much faster.
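A minimal sketch of the problem, with purely illustrative feature names and ranges:

    import numpy as np

    rng = np.random.default_rng(0)
    income = rng.uniform(20_000, 150_000, size=100)   # values in the tens of thousands
    age = rng.uniform(18, 70, size=100)               # values in the tens
    X = np.column_stack([income, age])

    print("Feature minimums:", X.min(axis=0))
    print("Feature maximums:", X.max(axis=0))
    # A gradient- or distance-based learner sees only these raw magnitudes,
    # so the income column dominates unless both features are rescaled
    # (see the Standard Scaler and MinMax Scaler slides that follow).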



Standard Scaler..

• The Standard Scaler assumes the data is normally distributed
within each feature and scales it so that the distribution is
centered around 0, with a standard deviation of 1.
• Centering and scaling happen independently on each feature, by
computing the relevant statistics on the samples in the training
set (see the sketch below).
• If the data is not normally distributed, this is not the best
Scaler to use.
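A minimal sketch of StandardScaler usage with scikit-learn (the tiny train/test arrays are illustrative assumptions):

    import numpy as np
    from sklearn.preprocessing import StandardScaler

    X_train = np.array([[20_000., 25.], [50_000., 40.], [80_000., 60.]])
    X_test = np.array([[35_000., 30.]])

    scaler = StandardScaler().fit(X_train)   # statistics come from the training set only
    print(scaler.mean_, scaler.scale_)       # per-feature mean and standard deviation

    X_train_std = scaler.transform(X_train)  # each training column now has mean 0, std 1
    X_test_std = scaler.transform(X_test)    # test data reuses the training statistics
    print(X_train_std)
    print(X_test_std)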



MinMax Scaler..

• Transforms features by scaling each feature to a given range.
• This estimator scales and translates each feature individually
so that it lies in the given range on the training set, e.g.,
between zero and one. The target range can be set explicitly,
e.g., [0,1], [0,5] or [-1,1]; if the chosen range includes
negative values, such as [-1,1], the data is shrunk into that
range.
• This Scaler responds well if the standard deviation is small
and when a distribution is not Gaussian.
• This Scaler is sensitive to outliers (see the sketch below).
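A minimal sketch of MinMaxScaler usage with scikit-learn (the data and feature_range are illustrative assumptions):

    import numpy as np
    from sklearn.preprocessing import MinMaxScaler

    X_train = np.array([[-5., 100.], [0., 200.], [10., 400.]])

    scaler = MinMaxScaler(feature_range=(0, 1)).fit(X_train)  # could also be (-1, 1), (0, 5), ...
    print(scaler.transform(X_train))
    # Each column is mapped linearly so its training minimum becomes 0 and its
    # training maximum becomes 1: x' = (x - min) / (max - min).
    # A single extreme outlier in a column squeezes all other values together,
    # which is why this Scaler is sensitive to outliers.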
