Tutorial 8

Uploaded by

Low Jia Hui

0% found this document useful (0 votes)

5 views2 pages

Original Title

Tutorial8

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

5 views2 pages

Tutorial 8

Uploaded by

Low Jia Hui

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 2

Search inside document

Tutorial 8

DSA1101
Introduction to Data Science
October 26, 2018

Exercise 1. Receiver Operating Characteristic (ROC) Curve in R

In last week’s lecture, we introduced the ROC curve which is a common
tool to evaluate classifiers in terms of trade-off between True Positive Rate
(TPR) and False Positive Rate (FPR) when the classification threshold varies.
We have also studied area under the curve (AUC), which is calculated by
measuring the area under the ROC curve. Higher AUC scores mean the
classifier performs better. In this week’s tutorial, we will illustrate how AUC
can be computed with the R package ‘ROCR’.
Consider the dataset ‘bank-sample.csv’ we discussed in the lectures. For
this exercise, we will predict the binary outcome subscribed using the naı̈ve
Bayes classifier, and generate the ROC curve of the naı̈ve Bayes classifier built
on a training set of 2,000 instances and tested on a testing set of 100 instances.
All the datasets for this exercise are posted under the folder ‘Tutorial 8’ in
IVLE.

(a) Load the training and testing bank sample datasets.

1 # training set
2 banktrain <- read . table ( " bank - sample . csv " , header = TRUE , sep = " ," )
3 # drop a few columns
4 drops <- c ( " balance " , " day " , " campaign " , " pdays " , " previous " , " month "
)
5 banktrain <- banktrain [ , ! ( names ( banktrain ) % in % drops ) ]
6 # testing set
7 banktest <- read . table ( " bank - sample - test . csv " , header = TRUE , sep = " ," )
8 banktest <- banktest [ , ! ( names ( banktest ) % in % drops ) ]

1
(b) Build the naı̈ve Bayes classifier based on the training dataset, and per-
form prediction for the test dataset.

1 library ( e1071 )
2
3 # build the naive Bayes classifier
4 nb _ model <- naiveBayes ( subscribed ~ . ,
5 data = banktrain )
6 # perform on the testing set
7 nb _ prediction <- predict ( nb _ model ,
8 # remove column " subscribed "
9 banktest [ , - ncol ( banktest ) ] ,
10 type = ’ raw ’)

1 library ( ROCR )
2
3 score <- nb _ prediction [ , c ( " yes " ) ]
4
5 actual _ class <- banktest $ subscribed == ’ yes ’
6 pred <- prediction ( score , actual _ class )
7
8 perf <- performance ( pred , " tpr " , " fpr " )
9 plot ( perf , lwd =2 , xlab = " False Positive Rate ( FPR ) " ,
10 ylab = " True Positive Rate ( TPR ) " )
11 abline ( a =0 , b =1 , col = " gray50 " , lty =3)

(d) Compute AUC for the naı̈ve Bayes classifier.

1 auc <- performance ( pred , " auc " )

2 auc <- unlist ( slot ( auc , " y . values " ) )
3 auc

Project 2A Report
Document18 pages
Project 2A Report
T0mahawk 45
No ratings yet
An Introduction To Bayesian VAR (BVAR) Models R-Econometrics
Document16 pages
An Introduction To Bayesian VAR (BVAR) Models R-Econometrics
eruygur
No ratings yet
Husky Training
Document5 pages
Husky Training
Selmir Omic
No ratings yet
Wi-Expd-01 Rev.00 Work Instruction - Expediting Report Contents Guidelines
Document19 pages
Wi-Expd-01 Rev.00 Work Instruction - Expediting Report Contents Guidelines
Rakesh Mishra
No ratings yet
sectionSVM PDF
Document10 pages
sectionSVM PDF
Arif Rahman Hakim
No ratings yet
Proc
Document85 pages
Proc
Alberto Becerra
No ratings yet
Data Analysis and Evaluation Methods Comparison
Document11 pages
Data Analysis and Evaluation Methods Comparison
Jelena Nađ
No ratings yet
Package Sier': R Topics Documented
Document7 pages
Package Sier': R Topics Documented
Chandrajeet Chavhan
No ratings yet
Question No1
Document6 pages
Question No1
aniketgupta8433244176
No ratings yet
Receiver Operating Characteristic (ROC) With Cross Validation
Document3 pages
Receiver Operating Characteristic (ROC) With Cross Validation
amir
No ratings yet
R Studio Quality Management
Document20 pages
R Studio Quality Management
Pankaj Barupal
No ratings yet
Supervised Learning in R Classification
Document7 pages
Supervised Learning in R Classification
Octavio Flores
No ratings yet
Precision-Recall-Gain Curves
Document9 pages
Precision-Recall-Gain Curves
Radim Slovak
No ratings yet
Tutorial 10
Document2 pages
Tutorial 10
Low Jia Hui
No ratings yet
What Is PCA: When Should You Use PCA?
Document21 pages
What Is PCA: When Should You Use PCA?
sailusha
No ratings yet
Classification 3
Document1 page
Classification 3
Elisée TEGUE
No ratings yet
Programming With R Test 2
Document5 pages
Programming With R Test 2
KamranKhan
50% (2)
Exp 3
Document4 pages
Exp 3
jay
No ratings yet
Basic Trend Line
Document12 pages
Basic Trend Line
krishna
No ratings yet
Exp 3 Bi
Document12 pages
Exp 3 Bi
Smaranika Patil
No ratings yet
Mis Notas de R PDF
Document396 pages
Mis Notas de R PDF
Maicel Monzon
100% (1)
Machine Learning Lab: Raheel Aslam (74-FET/BSEE/F16)
Document2 pages
Machine Learning Lab: Raheel Aslam (74-FET/BSEE/F16)
Raheel Aslam
No ratings yet
Software Testing File Lovish
Document30 pages
Software Testing File Lovish
Mohit Singla
No ratings yet
Automatic Activity Recognition of Weight Lifting Exercises Using Sensor Data
Document4 pages
Automatic Activity Recognition of Weight Lifting Exercises Using Sensor Data
svglolla
No ratings yet
Maxbox - Starter67 Machine Learning
Document7 pages
Maxbox - Starter67 Machine Learning
Max Kleiner
No ratings yet
Machinelearning - Alisya Athirah Binti Mohd Huzzainny (Updated)
Document26 pages
Machinelearning - Alisya Athirah Binti Mohd Huzzainny (Updated)
Alisya Athirah
No ratings yet
ML 5th
Document8 pages
ML 5th
sahugungun76
No ratings yet
TYCS Practical
Document26 pages
TYCS Practical
latestfullmovies74
No ratings yet
R T2 Econ3005
Document10 pages
R T2 Econ3005
bellance xavier
No ratings yet
R Bootstrap PDF
Document5 pages
R Bootstrap PDF
SAPPA NARESH
No ratings yet
Lab 1. Boston House
Document7 pages
Lab 1. Boston House
dimas bayu
No ratings yet
ML Manual final
Document35 pages
ML Manual final
Priya Mohana
No ratings yet
Descriptive and Inferential Statistics With R
Document6 pages
Descriptive and Inferential Statistics With R
Trần Thị Bích Thảo 3KT -19
No ratings yet
PCA Explained
Document9 pages
PCA Explained
Raj kumar
No ratings yet
RaSEn Demo
Document9 pages
RaSEn Demo
gluetilc
No ratings yet
Problem Set 1: Introduction To R - Solutions With R Output: 1 Install Packages
Document24 pages
Problem Set 1: Introduction To R - Solutions With R Output: 1 Install Packages
Darnell Larsen
No ratings yet
Machine Learning LAB: Practical-1
Document24 pages
Machine Learning LAB: Practical-1
Tsering Jhakree
100% (2)
Prais
Document8 pages
Prais
Worldbank Org
No ratings yet
ML Week10.1
Document5 pages
ML Week10.1
gowrishankar nayana
No ratings yet
Apiriori and K-Medoids
Document3 pages
Apiriori and K-Medoids
TAMIL SARAVAN
No ratings yet
Machine Learning With SQL
Document12 pages
Machine Learning With SQL
prince krish
100% (1)
Ancova in R Handout
Document8 pages
Ancova in R Handout
Eduardo
No ratings yet
Week14 N9
Document3 pages
Week14 N9
20131A05N9 SRUTHIK THOKALA
No ratings yet
Intro To Forecasting
Document15 pages
Intro To Forecasting
trickledowntheroad
No ratings yet
ML Lab 10 - Ensemble Learning
Document7 pages
ML Lab 10 - Ensemble Learning
PRIYANSH AGGARWAL
No ratings yet
Navie Bayes
Document3 pages
Navie Bayes
HARSHITHA D
No ratings yet
Expt8 21blc1084 EDA
Document5 pages
Expt8 21blc1084 EDA
vishaldev.hota2021
No ratings yet
Solution First Point ML-HW4
Document6 pages
Solution First Point ML-HW4
Juan Sebastian Otálora Montenegro
100% (1)
R Code For Linear Regression Analysis 1 Way ANOVA
Document8 pages
R Code For Linear Regression Analysis 1 Way ANOVA
Hagos Tsegay
No ratings yet
ML Lab Manual
Document37 pages
ML Lab Manual
apekshapandekar01
100% (1)
ISYE 6501 Georgia Tech hmwk3.1b
Document5 pages
ISYE 6501 Georgia Tech hmwk3.1b
aletia
No ratings yet
ML File
Document37 pages
ML File
25532aayushi.2020
No ratings yet
Code
Document25 pages
Code
Classy Man
No ratings yet
MATLAB Command Window: Fisheriris
Document9 pages
MATLAB Command Window: Fisheriris
Abyan Jadidan
No ratings yet
Fa Ca2 19BSR18029-2
Document6 pages
Fa Ca2 19BSR18029-2
yuktha
No ratings yet
Urmi ML Practical File
Document37 pages
Urmi ML Practical File
bekar ka
No ratings yet
Naive Bayes Classifiers - Parta
Document17 pages
Naive Bayes Classifiers - Parta
Akshay kashyap
No ratings yet
Week 6
Document36 pages
Week 6
divya.kapoor04
No ratings yet
DS&IP Journal (Munotes - In)
Document25 pages
DS&IP Journal (Munotes - In)
nishaya
No ratings yet
Kelbie Davidson (44817015) COMP4702 - Assignment 2
Document8 pages
Kelbie Davidson (44817015) COMP4702 - Assignment 2
Kelbie Davidson
No ratings yet
21bme145 Exp 4
Document3 pages
21bme145 Exp 4
21bme145
No ratings yet
Machine Learning
Document56 pages
Machine Learning
Mani Vrs
100% (3)
Quant Developers' Tools and Techniques: Quant Books, #1
From Everand
Quant Developers' Tools and Techniques: Quant Books, #1
Manfred Hindering
No ratings yet
Classement, Déclassement, Reclassement
Document25 pages
Classement, Déclassement, Reclassement
Ricardo Ricardo
No ratings yet
hjfb5m8w) 25 D HN
Document27 pages
hjfb5m8w) 25 D HN
Sav Oli
No ratings yet
Fuzzy Soft Set Theory and Its Applications
Document12 pages
Fuzzy Soft Set Theory and Its Applications
Bharathi Selvam
No ratings yet
Morphological Research in Planning Urban Design and Architecture 2021
Document261 pages
Morphological Research in Planning Urban Design and Architecture 2021
chicha
No ratings yet
KAHAN, D (2016) - Politically Motivated Reasoning
Document16 pages
KAHAN, D (2016) - Politically Motivated Reasoning
Rafaella Ribeiro
No ratings yet
Facts Versus Opinion
Document20 pages
Facts Versus Opinion
Irish Ga-a
No ratings yet
Cug Syllabus MPhil PHD PDF
Document11 pages
Cug Syllabus MPhil PHD PDF
Harshit Srivastava
No ratings yet
2023jan18 - Mapuauniversity - Ar116-1p - Portfolio Guidelines and Format
Document5 pages
2023jan18 - Mapuauniversity - Ar116-1p - Portfolio Guidelines and Format
Justin Dominic San Mateo
No ratings yet
Economics - Definition and Nature & Scope of Economics - Divisions of Economics
Document8 pages
Economics - Definition and Nature & Scope of Economics - Divisions of Economics
Nikita 07
No ratings yet
2018-3-Joh-Batu Pahat
Document2 pages
2018-3-Joh-Batu Pahat
Keertana Subramaniam
No ratings yet
Esas Module (Concepts) : Encoded By: Fausto, Dimazana, Matuto, Maniquis, Montanno, Malicdem
Document313 pages
Esas Module (Concepts) : Encoded By: Fausto, Dimazana, Matuto, Maniquis, Montanno, Malicdem
Alven Pulla
No ratings yet
Research Works Cited
Document2 pages
Research Works Cited
Samual Pauley
No ratings yet
Exercise 1: Group A Dictation
Document2 pages
Exercise 1: Group A Dictation
Костя Гарась
No ratings yet
Percentage: © Department of Analytical Skills
Document19 pages
Percentage: © Department of Analytical Skills
Crazy Devil
No ratings yet
Articulo 2 Bioseparations Basics-2014
Document7 pages
Articulo 2 Bioseparations Basics-2014
miguel terron mejia
No ratings yet
Advanced Planning Techniques
Document15 pages
Advanced Planning Techniques
fdarchitect
No ratings yet
Checklist Test Environment: Pass Description Remarks
Document4 pages
Checklist Test Environment: Pass Description Remarks
Guru Prasad
No ratings yet
Outline
Document3 pages
Outline
Sammy Dee
No ratings yet
Linguistic Theory and Empirical Evidence
Document307 pages
Linguistic Theory and Empirical Evidence
谷光生
No ratings yet
Change Your Mind 57 Ways To Unlock Your Creative Self
Document194 pages
Change Your Mind 57 Ways To Unlock Your Creative Self
hquecan9634
No ratings yet
10.4324 9781315009285 Previewpdf
Document21 pages
10.4324 9781315009285 Previewpdf
Annika
No ratings yet
Fiber Optic Training
Document32 pages
Fiber Optic Training
asifaliabid
No ratings yet
The Nose-Hoover Thermostat
Document7 pages
The Nose-Hoover Thermostat
ranesh
No ratings yet
CSS Essay Poverty Alleviation - Complete Solved Essay (CSS Exams 2005)
Document5 pages
CSS Essay Poverty Alleviation - Complete Solved Essay (CSS Exams 2005)
Muhammad Gulraiz Muhammad Gulraiz
No ratings yet
2020 Non Technical Skills Implementation Summary Guide
Document24 pages
2020 Non Technical Skills Implementation Summary Guide
Ismail Ibn Hashim
No ratings yet
Homework 2: Run Initial Concentration, Half Life
Document12 pages
Homework 2: Run Initial Concentration, Half Life
iqra khan
No ratings yet
Directions (11-15) : Answer Question 11 To 15 Based On The Following Sequence
Document41 pages
Directions (11-15) : Answer Question 11 To 15 Based On The Following Sequence
Pradeep B Appi
No ratings yet