
MH4510 - Statistical Learning and Data Mining - AY1819 S1 Lab 07

MH4510 - Regularization Method


Matthew Zakharia Hadimaja

28th September 2018 (Fri) - Regularization Method


Course instructor : PUN Chi Seng
Lab instructor : Matthew Zakharia Hadimaja

References
Chapter 6.6, [ISLR] An Introduction to Statistical Learning (with Applications in R). Free access to download
the book: http://www-bcf.usc.edu/~gareth/ISL/
To see the help file of a function funcname, type ?funcname.

1. Preparation

Load dataset
library(ISLR)
data(Hitters)
Hitters <- na.omit(Hitters)

glmnet does not use the formula interface: it expects a numeric predictor matrix and a response vector. Therefore, we have to create them first.
# x, the predictor, has to be a numerical matrix
# model.matrix converts factors to a set of dummy variables
x <- model.matrix(Salary ~ ., Hitters)[, -1]
head(x)
# y, the output, has to be a vector
y <- Hitters$Salary
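As a quick sanity check (a short sketch, not part of the original lab), we can confirm that the conversion produced what glmnet expects:

dim(x)          # number of observations and dummy-encoded predictor columns
length(y)       # should equal nrow(x)
is.numeric(x)   # TRUE: glmnet needs a numeric matrix
is.numeric(y)   # TRUE: and a numeric response vector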

2. Ridge Regression

The penalty is defined as $\frac{1-\alpha}{2}\lVert\beta\rVert_2^2 + \alpha\lVert\beta\rVert_1$. Therefore, for ridge regression, alpha is set to 0.
library(glmnet)
grid <- 10 ^ seq(10, -2, length = 100) # lambda from 10^10 to 10^-2, logarithmically scaled
ridge.mod <- glmnet(x, y, alpha = 0, lambda = grid)
names(ridge.mod) # read ?glmnet for details
dim(coef(ridge.mod))
par(mfrow = c(1,2))
plot(ridge.mod, xvar = 'norm')
plot(ridge.mod, xvar = 'lambda')


Large vs small lambda

Large lambda
ridge.mod$lambda[50]
coef(ridge.mod)[, 50]
sqrt(sum(coef(ridge.mod)[-1, 50] ^ 2)) # l2-norm of the coefficients

Small lambda
ridge.mod$lambda[60]
coef(ridge.mod)[, 60]
sqrt(sum(coef(ridge.mod)[-1, 60] ^ 2))

Predict the coefficients at a new lambda value, supplied through the argument s.


predict(ridge.mod, s = 50, type = "coefficients")[1:20, ]

For lambda = 0 or lambda = Inf, what model does the algorithm produce?
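One way to explore this question numerically (a hedged sketch, not part of the handout; in recent glmnet versions, predict with exact = TRUE refits at the requested lambda and needs x and y supplied again):

coef.ols <- coef(lm(y ~ x))                                        # unpenalised least squares, for reference
coef.l0  <- predict(ridge.mod, s = 0, type = "coefficients",
                    exact = TRUE, x = x, y = y)                    # lambda = 0: no shrinkage
coef.inf <- predict(ridge.mod, s = 1e10, type = "coefficients")    # very large lambda: maximal shrinkage
cbind(ols = coef.ols, lambda0 = as.vector(coef.l0), lambdaInf = as.vector(coef.inf))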

3. LASSO

Same as ridge, but alpha = 1 now. Notice that the coefficients can be exactly zero.
lasso.mod <- glmnet(x, y, alpha = 1, lambda = grid)
par(mfrow = c(1,2))
plot(lasso.mod, xvar = 'norm')
plot(lasso.mod, xvar = 'lambda')
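To see the sparsity directly, we can count the nonzero coefficients at each lambda (a small sketch; the df field of the fitted glmnet object stores this count):

lasso.mod$df                          # number of nonzero coefficients at each lambda
colSums(coef(lasso.mod)[-1, ] != 0)   # the same count by hand, excluding the intercept row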

Large vs small lambda

Large lambda
lasso.mod$lambda[50]
coef(lasso.mod)[, 50]
sqrt(sum(coef(lasso.mod)[-1, 50] ^ 2))

Small lambda
lasso.mod$lambda[80]
coef(lasso.mod)[, 80]
sqrt(sum(coef(lasso.mod)[-1, 80] ^ 2))

Predict the coefficients at a new lambda value, supplied through the argument s.


predict(lasso.mod, s = 50, type = "coefficients")[1:20, ]

Cross validation

Use cross-validation to choose the best lambda.
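The code below indexes the data with a vector train, which is not defined in this handout. A minimal sketch of the usual ISLR-style split (assuming a random 50/50 training/test split) is:

set.seed(1)                               # for reproducibility
train <- sample(1:nrow(x), nrow(x) / 2)   # row indices of the training set
# the test set is then x[-train, ] and y[-train]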


lasso.cv <- cv.glmnet(x[train, ], y[train], alpha = 1)
plot(lasso.cv)
(lasso.bestlam <- lasso.cv$lambda.min)
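As an aside, cv.glmnet also stores lambda.1se, the largest lambda whose CV error is within one standard error of the minimum; it is sometimes preferred as a more conservative (sparser) choice:

lasso.cv$lambda.1se   # a sparser alternative to lambda.min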

Refit using the whole training set


lasso.cvmod <- glmnet(x[train, ], y[train], alpha = 1, lambda = lasso.bestlam)


lasso.coef <- coef(lasso.cvmod)
lasso.coef
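To list only the variables LASSO keeps (a small sketch; lasso.coef is a sparse one-column matrix, so we convert it first):

beta <- as.matrix(lasso.coef)          # plain matrix with variable names as row names
beta[beta[, 1] != 0, , drop = FALSE]   # the selected (nonzero) coefficients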

Predict on the test set


lasso.pred <- predict(lasso.cvmod, newx = x[-train, ])
mean((lasso.pred - y[-train]) ^ 2)
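For context, this test MSE can be compared with that of a naive baseline which always predicts the training-set mean salary (a sketch using the same split):

mean((mean(y[train]) - y[-train]) ^ 2)   # test MSE of the constant (intercept-only) prediction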

4. Tutorial

Explain how K-fold cross-validation is implemented for ridge regression / LASSO with scaling. Please specify how to compute the cross-validation function and how the scaling is implemented.
Below is pseudo-code for CV without scaling; note that it does not represent the cv.glmnet function. Modify the pseudo-code to answer the question above. An R sketch of the unmodified pseudo-code is given after the list.
Suppose L is a vector containing the lambda values to try, and X is our data.

1. set the CV error for each lambda: CV[lambda] = 0 for every lambda in L
2. split X into a training set Tr and a test set Te
3. split Tr into K random parts of equal size, Tr[k], k = 1, 2, ..., K
4. for k in 1:K
   1. set Tr[-k] as the k-th pseudo training set, pTr[k]
   2. set Tr[k] as the k-th pseudo test set, pTe[k]
   3. for lambda in L
      1. perform ridge regression / LASSO on pTr[k] with lambda
      2. evaluate the test error on pTe[k]
      3. CV[lambda] = CV[lambda] + test error
5. choose the lambda that minimises CV[lambda], call it lambda*
6. refit the whole model on Tr with lambda*
7. check the performance on Te
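Below is a hedged R sketch of the pseudo-code above (without scaling), reusing x, y, grid and train from earlier and assuming K = 10 folds with ridge regression; it is for illustration only and does not reproduce cv.glmnet.

K <- 10
set.seed(2)                                                       # fold assignment is random
folds <- sample(rep(1:K, length.out = length(train)))             # step 3: split Tr into K parts
cv.err <- rep(0, length(grid))                                    # step 1: CV[lambda] = 0
for (k in 1:K) {
  pTr <- train[folds != k]                                        # step 4.1: k-th pseudo training set
  pTe <- train[folds == k]                                        # step 4.2: k-th pseudo test set
  fit <- glmnet(x[pTr, ], y[pTr], alpha = 0, lambda = grid)       # step 4.3.1 (alpha = 1 for LASSO)
  pred <- predict(fit, newx = x[pTe, ])                           # one column of predictions per lambda
  cv.err <- cv.err + colMeans((pred - y[pTe]) ^ 2)                # steps 4.3.2-4.3.3: accumulate test error
}
best.lam <- grid[which.min(cv.err)]                               # step 5: lambda*
final <- glmnet(x[train, ], y[train], alpha = 0, lambda = best.lam)   # step 6: refit on Tr
mean((predict(final, newx = x[-train, ]) - y[-train]) ^ 2)            # step 7: performance on Te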
