
Data Science for Molecular Engineering
Week 7: Non-linear regression
ILOs
• Understand the principles of polynomial regression and the concept of regularization;
• Understand the principles of logistic regression and its use in binary classification;
• Understand the principles of multi-class classification
What if a linear trend is not present?
• You can always do a linear regression, but the model is not good enough (this is called underfitting)
• Recall linear regression: $\hat{y} = \theta_0 + \theta_1 x_1 + \dots + \theta_n x_n$
• Non-linear relationships can take many different functional forms
Non-linear regression
• Solution 1: feature transformation
• If the relationship between y and x is not linear, what about y and $x^2$, or $x^3$?
• This is the idea of polynomial regression: using a linear model to fit non-linear data
Polynomial regression
• First, the original feature is transformed into polynomial features with a user-defined degree
• A linear model is then fitted using these transformed features
• The same solution as for linear regression can be used to solve polynomial regression (see the sketch below)
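A minimal sketch of these two steps, assuming scikit-learn; the quadratic dataset is synthetic and purely illustrative:

```python
import numpy as np
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(100, 1))                      # one original feature
y = 0.5 * X[:, 0]**2 + X[:, 0] + 2 + rng.normal(scale=0.5, size=100)

# Step 1: transform the feature into polynomial features (degree 2: x, x^2)
poly = PolynomialFeatures(degree=2, include_bias=False)
X_poly = poly.fit_transform(X)

# Step 2: fit an ordinary linear model on the transformed features
model = LinearRegression().fit(X_poly, y)
print(model.coef_, model.intercept_)                       # roughly [1, 0.5] and 2
```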
Overfitting
• What if a very high degree is selected? (see the sketch below)

[Figure: blue points are real data. A linear model underfits the data; a quadratic model is about right; what about a 300-degree model?]

• What is bad about overfitting?
• Why can overfitting happen?
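To make this concrete, a small sketch (scikit-learn, synthetic data; degree 15 stands in for the slide's degree 300 to keep the numbers stable):

```python
import numpy as np
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(20, 1))
y = 0.5 * X[:, 0]**2 + rng.normal(scale=0.5, size=20)
X_new = rng.uniform(-3, 3, size=(100, 1))                  # fresh data from the same curve
y_new = 0.5 * X_new[:, 0]**2 + rng.normal(scale=0.5, size=100)

for degree in (1, 2, 15):
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression()).fit(X, y)
    print(degree,
          mean_squared_error(y, model.predict(X)),          # training error
          mean_squared_error(y_new, model.predict(X_new)))  # error on new data
```

The high-degree model drives the training error toward zero while the error on new data blows up; that is what is bad about overfitting.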
How to tell whether you are underfitting/overfitting?
• Validation (see the sketch below)
• Set aside part of the training data to evaluate model performance DURING training
• Validation data is not directly used to update model parameters (i.e., not used in gradient calculation)
• Validation data can be used to select the best training method
• Validation data can be used to tune the training process
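A minimal sketch of a train/validation split, assuming scikit-learn; data and split ratio are illustrative:

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = 0.5 * X[:, 0]**2 + rng.normal(scale=0.5, size=200)

# Hold out 20% of the training data for validation
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

model = LinearRegression().fit(X_train, y_train)   # parameters updated on training data only
print("train MSE:", mean_squared_error(y_train, model.predict(X_train)))
print("val MSE:  ", mean_squared_error(y_val, model.predict(X_val)))
```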
Learning curves
• Learning curves: plots that compare the performance of a model on the training set with its performance on the validation set, for the same parameters
• Learning curves come in different types, with the following being the most common:
• 1) Learning curve over the amount of data
• 2) Learning curve over the number of iterations
Learning curves over the amount of data
• Retrain the model many times with an increasing number of data points and plot a performance metric on the training and validation datasets (see the sketch below)

Q1. Why is the training error increasing and the validation error decreasing?
Q2. Is this underfitting or overfitting, or neither?
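A minimal sketch using scikit-learn's learning_curve helper; the estimator and synthetic data are illustrative:

```python
import numpy as np
from sklearn.model_selection import learning_curve
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(300, 1))
y = 0.5 * X[:, 0]**2 + rng.normal(scale=0.5, size=300)

sizes, train_scores, val_scores = learning_curve(
    LinearRegression(), X, y,
    train_sizes=np.linspace(0.1, 1.0, 10),   # refit on growing subsets of the data
    cv=5, scoring="neg_mean_squared_error",
)
# Negate the scores to get errors, averaged over the CV folds
train_err = -train_scores.mean(axis=1)
val_err = -val_scores.mean(axis=1)
print(np.c_[sizes, train_err, val_err])
```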
Which is underfitting/overfitting/just right?
Learning curves over the number of iterations
• Train the model once; record the intermediate training and validation losses (see the sketch below)

Q1. Why is the training loss decreasing?
Q2. Why is the validation loss decreasing and then increasing?
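A minimal sketch with an SGD learner (scikit-learn); the model and data are illustrative:

```python
import numpy as np
from sklearn.linear_model import SGDRegressor
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = 0.5 * X[:, 0]**2 + rng.normal(scale=0.5, size=200)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

sgd = SGDRegressor(learning_rate="constant", eta0=0.01, random_state=0)
history = []
for epoch in range(100):
    sgd.partial_fit(X_tr, y_tr)              # one further pass of SGD updates
    history.append((epoch,
                    mean_squared_error(y_tr, sgd.predict(X_tr)),    # training loss
                    mean_squared_error(y_val, sgd.predict(X_val)))) # validation loss
# plot `history` to see the two loss curves over iterations
```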
How to mitigate overfitting?
• Model
• Model selection
• Model regularization
• Training process – early stopping

• Data
• Collect more data
Model selection
• Train different models on the training set and compare results on the validation set
• Cross-validation (usually when the dataset size is small; see the sketch below)
• Use different partitions of the training data to ensure that the model is generalizable

[Figure: k-fold cross-validation, rotating which partition is held out as the validation data]
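A minimal sketch of 5-fold cross-validation comparing two candidate models, assuming scikit-learn; the data is synthetic:

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.linear_model import LinearRegression, Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(scale=0.3, size=100)

for model in (LinearRegression(), Ridge(alpha=1.0)):
    # Each call rotates through 5 train/validation partitions of the data
    scores = cross_val_score(model, X, y, cv=5, scoring="neg_mean_squared_error")
    print(type(model).__name__, "mean CV MSE:", -scores.mean())
```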
Regularization
• Regularization is a good way to reduce overfitting
• Penalize large parameter values ($\theta_i$) when training the model (the bias $\theta_0$ is not penalized)
• Ridge regression (L2 penalty): $J(\theta) = \mathrm{MSE}(\theta) + \alpha \sum_{i=1}^{n} \theta_i^2$
• Lasso regression (L1 penalty): $J(\theta) = \mathrm{MSE}(\theta) + \alpha \sum_{i=1}^{n} |\theta_i|$
• Elastic Net (a weighted mix of the two penalties, with mixing ratio $r$): $J(\theta) = \mathrm{MSE}(\theta) + r\alpha \sum_{i=1}^{n} |\theta_i| + \frac{1-r}{2}\alpha \sum_{i=1}^{n} \theta_i^2$
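A minimal sketch of the three regularized models in scikit-learn; alpha (penalty strength) and l1_ratio (the L1/L2 mix, $r$ above) are illustrative values:

```python
import numpy as np
from sklearn.linear_model import Ridge, Lasso, ElasticNet

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
y = X @ np.array([3.0, 0.0, 0.0, 1.5, 0.0]) + rng.normal(scale=0.1, size=100)

for model in (Ridge(alpha=1.0), Lasso(alpha=0.1), ElasticNet(alpha=0.1, l1_ratio=0.5)):
    model.fit(X, y)
    print(type(model).__name__, np.round(model.coef_, 2))

# The L1 penalty in Lasso/Elastic Net tends to drive irrelevant weights exactly to zero
```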
Early stopping (for iterative methods)
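Early stopping interrupts training once the validation loss stops improving. A minimal sketch, assuming an SGD learner and a hypothetical `patience` parameter (the number of non-improving epochs to tolerate):

```python
import copy
import numpy as np
from sklearn.linear_model import SGDRegressor
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = 0.5 * X[:, 0]**2 + rng.normal(scale=0.5, size=200)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

sgd = SGDRegressor(learning_rate="constant", eta0=0.01, random_state=0)
best_loss, best_model, patience, bad_epochs = np.inf, None, 10, 0
for epoch in range(1000):
    sgd.partial_fit(X_tr, y_tr)
    val_loss = mean_squared_error(y_val, sgd.predict(X_val))
    if val_loss < best_loss:                  # validation loss improved: keep a snapshot
        best_loss, best_model, bad_epochs = val_loss, copy.deepcopy(sgd), 0
    else:
        bad_epochs += 1
        if bad_epochs >= patience:            # stop once it stalls for `patience` epochs
            break
# best_model holds the parameters from the lowest-validation-loss epoch
```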
Logistic regression
• Instead of transforming features, logistic regression transforms the output into a "probability" (a number between zero and one) using the sigmoid/logistic function:
$\hat{p} = \sigma(\theta^{T}x)$, where $\sigma(t) = \dfrac{1}{1 + e^{-t}}$
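A minimal sketch, assuming scikit-learn and synthetic binary labels:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = (X[:, 0] + 2 * X[:, 1] > 0).astype(int)        # synthetic binary labels

clf = LogisticRegression().fit(X, y)
proba = clf.predict_proba(X[:3])[:, 1]             # P(y = 1 | x) for 3 samples
manual = sigmoid(X[:3] @ clf.coef_.ravel() + clf.intercept_)   # same number by hand
print(proba, manual)
```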
Logistic regression loss function
• Log loss / binary cross entropy:
$J(\theta) = -\dfrac{1}{m}\sum_{i=1}^{m}\left[\,y^{(i)} \log \hat{p}^{(i)} + \left(1 - y^{(i)}\right) \log\left(1 - \hat{p}^{(i)}\right)\right]$
• Unfortunately, there is no closed-form solution; a gradient descent method must be used (see the sketch below)
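A minimal sketch of batch gradient descent on the log loss, whose gradient is $\frac{1}{m} X^{T}\!\left(\sigma(X\theta) - y\right)$; data and learning rate are illustrative:

```python
import numpy as np

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = (X[:, 0] + 2 * X[:, 1] > 0).astype(float)

Xb = np.c_[np.ones(len(X)), X]            # prepend a bias column
theta = np.zeros(Xb.shape[1])
eta, m = 0.5, len(X)                      # learning rate, number of samples
for _ in range(1000):
    gradient = Xb.T @ (sigmoid(Xb @ theta) - y) / m   # gradient of the log loss
    theta -= eta * gradient
print(theta)   # bias near 0; the weights grow along [1, 2], matching the labels' rule
```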
Softmax regression
• Logistic regression can be generalized to multiple classes:
• Binary: the sigmoid output is a single probability between 0 and 1
• Multiple class: the softmax function outputs K probabilities that sum up to 1
$\hat{p}_k = \dfrac{\exp(s_k(x))}{\sum_{j=1}^{K} \exp(s_j(x))}$, with class score $s_k(x) = \theta_k^{T} x$
• Softmax regression loss function: categorical cross entropy
$J(\Theta) = -\dfrac{1}{m}\sum_{i=1}^{m}\sum_{k=1}^{K} y_k^{(i)} \log \hat{p}_k^{(i)}$
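A minimal sketch of the softmax function and a multi-class fit, assuming scikit-learn (its LogisticRegression uses a multinomial/softmax formulation with the default lbfgs solver); the three-class data is synthetic:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def softmax(s):
    e = np.exp(s - s.max(axis=-1, keepdims=True))   # shift for numerical stability
    return e / e.sum(axis=-1, keepdims=True)

print(softmax(np.array([2.0, 1.0, 0.1])))   # ~[0.66, 0.24, 0.10], sums to 1

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 2))
y = np.argmax(X @ rng.normal(size=(2, 3)), axis=1)   # 3 synthetic classes

clf = LogisticRegression().fit(X, y)
print(clf.predict_proba(X[:2]).sum(axis=1))          # each row of probabilities sums to 1
```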