Welcome to Scribd!

1 1regressionANDclassification

Uploaded by

0% found this document useful (0 votes)

5 views20 pages

The document discusses machine learning models for classification and regression. It explains that models are trained on labeled datasets that are divided into training and testing sets. Classification models assign inputs to discrete classes, which can be evaluated using a confusion matrix measuring true/false positives and negatives. Regression models predict continuous outputs and are evaluated using mean squared error. The document warns that models can overfit the training data and fail to generalize, or underfit by not learning the underlying patterns.

Original Description:

Original Title

1.1regressionANDclassification

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

5 views20 pages

1 1regressionANDclassification

Uploaded by

Nasser Al-Badareen

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 20

Search inside document

▪ The outputs are finite and the developed model must assign a single

class to new inputs.

▪ the dataset is a collection of labeled-data records in the form: {independent
variables as inputs, and the associated classes (i.e., labels) as outputs}. The
objective is to develop a machine learning model to relate the inputs to the
outputs, and to predict the class of new inputs.
▪ In practice, the dataset is divided into two sets, which are the training and the
testing sets.
▪ The training set is employed to develop the classification model, while the testing
set is utilized afterwards to evaluate the accuracy of the developed model.
▪ For binary classification models, the outputs of the model can be presented in a
confusion matrix form.
▪ The confusion matrix shows how many instances are correctly classified by the
developed model, as seen in the following figure for a two-class problem.
True Predicted
0 1
1 1
1 0
0 0
1 1
0 0
0 1
1 1
• Compute the confusion matrix
0 0
1 1
Predicted (Model output)
0 1

Actual 0 TN FP
(desired
output) 1 FN TP
• Precision
The number of true positives (i.e., the number of correctly labeled instances) to a specific
class, divided by the total number of positive predictions.
• Precision
The number of true positives (i.e., the number of correctly labeled instances) to a specific
class, divided by the total number of instances assigned to this class.

The precision is a measure of the

model’s exactness.
• Precision
The number of true positives (i.e., the number of correctly labeled instances) to a specific
class, divided by the total number of positive predictions.

• Recall
The number of true positives to a specific class, divided by the total number of instances
actually in this class.
• Precision
The number of true positives (i.e., the number of correctly labeled instances) to a specific
class, divided by the total number of positive predictions.

The recall is a measure of the

model’s completeness.
• Recall
The number of true positives to a specific class, divided by the total number of instances
actually in this class.
• F-score
measures the weighted average of precision and recall.
▪ One example is the historical dataset of real estate values. If the characteristics and
corresponding prices for many houses within a certain city are provided, can the
price of a different house in this area be predicted by its characteristics?
▪ One example is the historical dataset of real estate values. If the characteristics and
corresponding prices for many houses within a certain city are provided, can the
price of a different house in this area be predicted by its characteristics?
▪ The evaluation metric that is routinely calculated to judge the model’s performance
is the mean squared error (MSE), as given by the following equation:

Where y is the output of the developed regression

model, d is the desired output, and n is the total number
of the instances.
Where y is the output of the developed regression
model, d is the desired output, and n is the total number
of the instances.
▪ In general, when exposed to more observations, the model improves its predictive
performance.
▪ However, too much adaptability will force the model to learn the noise within the
training data rather than the underlying input/output relations.
▪ Therefore, the resultant model overfits the training set, and will not perform equally
well on the testing set.
▪ Overfitting happens when the model highly adapts to the training set but by doing
so fails on the testing set.
Croteau et al. (2017) explain, “overfitting will
impact negatively on the degree of
generalization to new data and thus must be
avoided in order for solutions to be useful for
practical application” (p. 306)
▪ An efficient machine learning model attempts to decrease generalization errors
and thus have good predictions on data that the model was not trained for.
▪ On the other hand, underfitting happens when the model does not capture enough
of the inherent structure in the training data, which results in poor performance
with both training and testing sets.

Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
Rating: 5 out of 5 stars
5/5 (1)
Poem Analysis
Document4 pages
Poem Analysis
Charles Ting
67% (3)
120 DS-With Answer
Document32 pages
120 DS-With Answer
Asim Mazin
100% (1)
Machine Learning Interview Questions.
Document43 pages
Machine Learning Interview Questions.
hari krishna reddy
100% (1)
Simoreg Spare Parts
Document34 pages
Simoreg Spare Parts
iyilmaz1
No ratings yet
COUPP - 60 Hydraulic Hose Failure Analysis
Document26 pages
COUPP - 60 Hydraulic Hose Failure Analysis
Romulus Situ Morank
No ratings yet
Pinto pm2 Ism ch10
Document40 pages
Pinto pm2 Ism ch10
Jesha Carl Jotojot
No ratings yet
Lecture 16
Document36 pages
Lecture 16
Abood Fazil
No ratings yet
Evaluation Metrics
Document11 pages
Evaluation Metrics
Sreetam Ganguly
No ratings yet
Evaluation Metrics
Document11 pages
Evaluation Metrics
Subha OP
No ratings yet
Introduction To Machine Learning
Document11 pages
Introduction To Machine Learning
Heidar Usmael Jundi
No ratings yet
Module 5 Advanced Classification Techniques
Document40 pages
Module 5 Advanced Classification Techniques
Saurabh Jagtap
No ratings yet
Coding Neural Networks-Classification & Regression
Document39 pages
Coding Neural Networks-Classification & Regression
Hasanur Rahman
No ratings yet
Lec - 4
Document43 pages
Lec - 4
Yonatan tamiru
No ratings yet
Lecture 2 Classifier Performance Metrics
Document60 pages
Lecture 2 Classifier Performance Metrics
iamboss086
No ratings yet
TR Rain Error
Document6 pages
TR Rain Error
VaggelarasB
No ratings yet
Ds Module 4
Document73 pages
Ds Module 4
Prathik Srinivas
No ratings yet
Performance Evaluation
Document29 pages
Performance Evaluation
Sherwin Lopez
No ratings yet
Model Selection: Dr.K.Murugan Assistant Professor Senior VIT, Vellore
Document35 pages
Model Selection: Dr.K.Murugan Assistant Professor Senior VIT, Vellore
Chidvi Reddy
No ratings yet
Week11-Lecture 11ML Algorithms Metrics - Updated
Document29 pages
Week11-Lecture 11ML Algorithms Metrics - Updated
fgfdgfdgfd
No ratings yet
Hands On Machine Learning 3 Edition
Document18 pages
Hands On Machine Learning 3 Edition
saharabdouma
No ratings yet
Unit III 1
Document21 pages
Unit III 1
mananrawat537
No ratings yet
Model Evaluation and Selection
Document41 pages
Model Evaluation and Selection
Dev kartik Agarwal
No ratings yet
Types of Machine Learning Algorithms
Document14 pages
Types of Machine Learning Algorithms
Vipin Rajput
No ratings yet
Unit III - I
Document15 pages
Unit III - I
Shiv Kumar Singh
No ratings yet
AI Capstone Project - Notes-Part2
Document8 pages
AI Capstone Project - Notes-Part2
minha.fathima737373
No ratings yet
Week 2: Machine Learning Intro: Instructor: Ting Sun
Document21 pages
Week 2: Machine Learning Intro: Instructor: Ting Sun
Wenbo Pan
No ratings yet
Machine Learning HC
Document4 pages
Machine Learning HC
lucaseveleens
No ratings yet
ML 19.03 Sidenotes
Document30 pages
ML 19.03 Sidenotes
asma
No ratings yet
All Notes
Document6 pages
All Notes
tiajung humtsoe
No ratings yet
Model Evaluation and Selection
Document22 pages
Model Evaluation and Selection
discodancerhasan
No ratings yet
Chapter 5 Evaluation Metrics
Document18 pages
Chapter 5 Evaluation Metrics
liyu agye
No ratings yet
Evaluating Model Performance Unit 6
Document46 pages
Evaluating Model Performance Unit 6
jahnabi122
No ratings yet
AI & ML Notes
Document22 pages
AI & ML Notes
karthik singarao
No ratings yet
Machine Learning Models: by Mayuri Bhandari
Document48 pages
Machine Learning Models: by Mayuri Bhandari
mayuri
No ratings yet
Basics of ML and Evaluation
Document42 pages
Basics of ML and Evaluation
animeshrajak649
No ratings yet
ML.1Lecture.2 (Old)
Document23 pages
ML.1Lecture.2 (Old)
Annayah Usman
No ratings yet
ML Lec 2
Document32 pages
ML Lec 2
Saqlain Arshad
No ratings yet
Unit 2
Document18 pages
Unit 2
rk73462002
No ratings yet
Kami Export - 13. Model Evaluation
Document25 pages
Kami Export - 13. Model Evaluation
YENI SRI MAHARANI -
100% (1)
Quiz-1 Notes
Document4 pages
Quiz-1 Notes
Shruthi Shetty
No ratings yet
Evaluating A Machine Learning Model
Document14 pages
Evaluating A Machine Learning Model
Jean
No ratings yet
Data Mining: Practical Machine Learning Tools and Techniques
Document73 pages
Data Mining: Practical Machine Learning Tools and Techniques
Arvind
No ratings yet
Advance Cluster - Classification: Ha Le Hoai Trung
Document50 pages
Advance Cluster - Classification: Ha Le Hoai Trung
Phuong Nguyen Thi Bich
No ratings yet
Introduction To Machine Learning
Document29 pages
Introduction To Machine Learning
Aayush Kansara
No ratings yet
Data Mining and Classification
Document50 pages
Data Mining and Classification
komal kashyap
No ratings yet
Ensemble Methods
Document15 pages
Ensemble Methods
brm1shubha
100% (1)
ML Sit1305
Document127 pages
ML Sit1305
Kannan Thangavelu
No ratings yet
Evaluation Metrics
Document10 pages
Evaluation Metrics
Aly Boy
No ratings yet
ML 1 2 3
Document54 pages
ML 1 2 3
Shoba Natesh
No ratings yet
Supervised Algorithms-Regression: Linear & Logistic: Muhammad Bello Aliyu
Document32 pages
Supervised Algorithms-Regression: Linear & Logistic: Muhammad Bello Aliyu
Mohammed Danlami Yusuf
100% (1)
CSL0777 L06
Document24 pages
CSL0777 L06
Konkobo Ulrich Arthur
No ratings yet
DM - Ch4 - Classification (Part1)
Document20 pages
DM - Ch4 - Classification (Part1)
C.RadhiyaDevi
No ratings yet
Confusion Matrix
Document4 pages
Confusion Matrix
Dev Goyal
No ratings yet
Ensemble Learning Methods
Document24 pages
Ensemble Learning Methods
khatri81
100% (1)
1-Descriptive Statistics
Document44 pages
1-Descriptive Statistics
Suchismita Sahu
No ratings yet
1-Descriptive Statistics
Document44 pages
1-Descriptive Statistics
Suchismita Sahu
No ratings yet
1-Descriptive Statistics
Document44 pages
1-Descriptive Statistics
Suchismita Sahu
No ratings yet
ML 5
Document14 pages
ML 5
dibloa
No ratings yet
Interview Questions For DS & DA (ML)
Document66 pages
Interview Questions For DS & DA (ML)
pratikmovie999
100% (1)
Classification Accuracy Logarithmic Loss Confusion Matrix Area Under Curve F1 Score Mean Absolute Error
Document9 pages
Classification Accuracy Logarithmic Loss Confusion Matrix Area Under Curve F1 Score Mean Absolute Error
Harpreet Singh Bagga
No ratings yet
Business Simulation - F2F 3
Document27 pages
Business Simulation - F2F 3
shiva kulshrestha
No ratings yet
Tycs Ai Unit 2
Document84 pages
Tycs Ai Unit 2
jeasdsdasda
No ratings yet
Fundamental of ML Week 2
Document12 pages
Fundamental of ML Week 2
Raj Physio
No ratings yet
Microsoft For Startups Deck 19
Document20 pages
Microsoft For Startups Deck 19
Rajni Kant Sinha
100% (1)
Synopsis For Breast Cancer
Document50 pages
Synopsis For Breast Cancer
maya verma
No ratings yet
Food Grade Anti-Corrosion Grease: Special Features
Document2 pages
Food Grade Anti-Corrosion Grease: Special Features
chem Khan
No ratings yet
Complete CV Julialee
Document8 pages
Complete CV Julialee
Yus Kai
No ratings yet
The Ice Creamery Cookbook
Document18 pages
The Ice Creamery Cookbook
Weldon Owen Publishing
83% (12)
Engineers Syndicate Company Profile
Document17 pages
Engineers Syndicate Company Profile
Sakshi Nanda
No ratings yet
How To Create and Play Kahoot!
Document15 pages
How To Create and Play Kahoot!
Sri Raman Nair
No ratings yet
Yama Zatdaw: Myanmar
Document3 pages
Yama Zatdaw: Myanmar
C.N. Krishna
No ratings yet
NHD Website Bibliography
Document11 pages
NHD Website Bibliography
api-120944114
No ratings yet
Berkeleyme - CIMA Executive Program
Document22 pages
Berkeleyme - CIMA Executive Program
Muhammad Naeem
No ratings yet
2300 SIGE 2016 Final
Document6 pages
2300 SIGE 2016 Final
Carlos Junior
No ratings yet
The 1 Catholic Mass in The Philippines
Document16 pages
The 1 Catholic Mass in The Philippines
Rosalie Alitao
No ratings yet
Power-Linker Training Centre: Group
Document1 page
Power-Linker Training Centre: Group
Sunil Singh
No ratings yet
DNA For The Defense Bar
Document192 pages
DNA For The Defense Bar
shaninrose
No ratings yet
Ready2Invest Off Plan Property Investor Book
Document51 pages
Ready2Invest Off Plan Property Investor Book
Ready2Invest
No ratings yet
MK17C01-Group 3-MKT328m-Final Report DIFFERENT
Document14 pages
MK17C01-Group 3-MKT328m-Final Report DIFFERENT
Tran Minh Quy (K17 QN)
No ratings yet
Chap 3 Parallel and Perpendicular Lines
Document17 pages
Chap 3 Parallel and Perpendicular Lines
Alrianne Batonghinog
100% (1)
Perbandingan Harga Bhineka
Document4 pages
Perbandingan Harga Bhineka
Julio Mariscal
No ratings yet
DLL - Science 6 - Q4 - W7
Document10 pages
DLL - Science 6 - Q4 - W7
Jefferson Beralde
50% (2)
NIV Excerpt PDF
Document57 pages
NIV Excerpt PDF
Anonymous tSYkkHToBP
No ratings yet
At MCQ Salogsacol Auditing Theory Multiple Choice
Document32 pages
At MCQ Salogsacol Auditing Theory Multiple Choice
Jannaviel Mirandilla
No ratings yet
Country Profile Malawi 2014 PDF
Document12 pages
Country Profile Malawi 2014 PDF
Jack Malambe
No ratings yet
Polymerization of Vegetable Oils and Their Uses in Printing Inks
Document4 pages
Polymerization of Vegetable Oils and Their Uses in Printing Inks
José Antônio Nascimento Neto
No ratings yet
Appendix G Elastic and Inelastic Response Spectra
Document11 pages
Appendix G Elastic and Inelastic Response Spectra
cedaserdna
No ratings yet
Case Nancy: Example
Document38 pages
Case Nancy: Example
shweta G
No ratings yet
A Presentation On: "Gswan"
Document19 pages
A Presentation On: "Gswan"
Sunil Pillai
No ratings yet