55.how To Perform ML

This document provides an overview of the 7 main steps for performing machine learning: 1) Specify the problem, 2) Prepare data, 3) Choose learning method, 4) Apply the learning method, 5) Assess the method and results, 6) Parameter tuning, and 7) Making predictions. It describes each step in detail, including preparing data by selecting features, labeling and sampling data, transforming data, choosing machine learning algorithms and programming languages, training and testing models, evaluating performance, and tuning hyperparameters to improve accuracy for making predictions.

Uploaded by TariqMalik

INTRODUCTION TO ARTIFICIAL INTELLIGENCE
FOR IT & NON-IT PROFESSIONALS
HOW TO PERFORM MACHINE LEARNING

Performing machine learning involves the following 7 steps:

1. Specify the problem
2. Prepare data
3. Choose learning method
4. Apply the learning method
5. Assess the method and results
6. Parameter tuning
7. Making predictions
SPECIFY THE PROBLEM

• This involves understanding the problem, how we can solve the
problem, and how it can be evaluated
• It is useful to understand why you want to solve the problem
PREPARING DATA

• Data is information
• Any time we have a table with information, we have data
• Each row in the table is a data point
• Consider a dataset of pets. Each row represents a pet. Each pet is
described by certain features.
WHAT ARE FEATURES?

• Features are the columns of the table

• Some features are special, and we call them labels. If we are trying to
predict a feature based on the others, then that feature is called the
label
• Labels depend on the context of the problem we are solving
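
The idea of rows, features, and labels can be sketched in a few lines of Python. This is only an illustration: the pet records, the column names, and the choice of `species` as the label are all assumptions made up for the example, not data from the slides.

```python
# Hypothetical pet dataset: each dict is one row (one data point),
# and each key is a feature (a column of the table).
pets = [
    {"name": "Rex", "weight_kg": 30.0, "age_years": 5, "species": "dog"},
    {"name": "Tom", "weight_kg": 4.5, "age_years": 3, "species": "cat"},
    {"name": "Bella", "weight_kg": 22.0, "age_years": 7, "species": "dog"},
]

LABEL = "species"  # assumption: the feature we want to predict


def split_features_and_label(rows, label):
    """Separate every row into its remaining features and its label."""
    features = [{k: v for k, v in row.items() if k != label} for row in rows]
    labels = [row[label] for row in rows]
    return features, labels


X, y = split_features_and_label(pets, LABEL)
print(X[0])  # features of the first pet, without the label
print(y)     # one label per row
```

Choosing a different column as `LABEL` (for example `weight_kg`) turns the same table into a different prediction problem, which is what "labels depend on the context" means in practice.
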
LABELED AND UNLABELED DATA.

Labeled data: Data that comes with a label.

Unlabeled data: Data that comes without a label.
PREPARING DATA

• ML algorithms learn from the data they are trained on

• It is mission critical to provide the model with valid, correct data to
learn from
• Data must be prepared in a usable format
• The data must then be processed to ensure correct formatting,
removal of erroneous data, and the fixing of any missing data
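
A minimal sketch of this kind of processing is shown below. It assumes, purely for illustration, that a non-positive weight counts as erroneous data and that a missing age can be filled with the mean of the known ages; real projects need rules that fit their own data.

```python
from statistics import mean

# Hypothetical raw rows. None marks a missing value.
raw = [
    {"weight_kg": 30.0, "age_years": 5},
    {"weight_kg": -1.0, "age_years": 3},    # erroneous: negative weight
    {"weight_kg": 4.5, "age_years": None},  # missing age
    {"weight_kg": 22.0, "age_years": 7},
]


def clean(rows):
    """Drop rows with invalid weights, then fill missing ages with the mean age."""
    # Copy each kept row so the raw data is left untouched.
    valid = [dict(r) for r in rows
             if r["weight_kg"] is not None and r["weight_kg"] > 0]
    known_ages = [r["age_years"] for r in valid if r["age_years"] is not None]
    fill_value = mean(known_ages)
    for r in valid:
        if r["age_years"] is None:
            r["age_years"] = fill_value
    return valid


cleaned = clean(raw)
print(cleaned)
```
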
PREPARING DATA

• Sampling: The dataset may be larger than required, so sampling it
down to a smaller subset may be necessary
• Data pre-processing is essential to have tidy, valid data
• Tidy, valid data is key to having robust, veracious (true) outcomes
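
Sampling can be sketched with Python's standard `random` module. The dataset of 1,000 rows, the sample size of 100, and the helper name `sample_rows` are assumptions chosen for the example.

```python
import random


def sample_rows(rows, n, seed=0):
    """Return n rows drawn uniformly at random, without replacement."""
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    return rng.sample(rows, n)


# Hypothetical dataset that is larger than we need.
dataset = [{"id": i} for i in range(1000)]
subset = sample_rows(dataset, n=100)
print(len(subset))
```

Simple random sampling like this is only one option; it may not preserve the proportions of rare labels, in which case a stratified approach could be considered.
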
ML DATASET

• The UC Irvine ML Repository is a collection of databases, domain theories,
and data generators that are used by the ML community for the
empirical analysis of ML algorithms
ATTRIBUTE, VARIABLE, FEATURE SELECTION

• It is essentially filtering: the selection of the subset of attributes in
the original example set that is most relevant to the predictive modeling
task at hand
• Feature selection includes and excludes attributes rather than
creating new ones.
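
As a rough illustration, the sketch below keeps or drops existing attributes without creating new ones. The selection rule used here (drop any attribute whose values never vary) is an assumption picked for simplicity; the slides do not prescribe a particular criterion.

```python
from statistics import pvariance


def select_features(rows, min_variance=0.0):
    """Keep only numeric features whose variance exceeds min_variance."""
    names = list(rows[0].keys())
    kept = [n for n in names
            if pvariance([r[n] for r in rows]) > min_variance]
    selected = [{n: r[n] for n in kept} for r in rows]
    return selected, kept


# Hypothetical rows: "legs" is the same for every pet, so it carries
# no information for telling the pets apart.
rows = [
    {"weight_kg": 30.0, "age_years": 5, "legs": 4},
    {"weight_kg": 4.5, "age_years": 3, "legs": 4},
    {"weight_kg": 22.0, "age_years": 7, "legs": 4},
]
selected, kept = select_features(rows)
print(kept)
```
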
TRANSFORMING DATA

• Check your datasets for errors, biases, and inconsistencies

• Data may also need to be transformed. This is typically guided by the
algorithm you are using and the data available
• Scaling: Attributes can be measured on very different scales, so they
may need to be rescaled to a comparable range
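
One common way to scale an attribute is min-max scaling, which maps its values linearly onto the range 0 to 1. The sketch below assumes this particular technique and uses made-up values; other schemes, such as standardization, are equally valid choices.

```python
def min_max_scale(values):
    """Rescale a list of numbers linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no spread to rescale.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical attributes measured on different scales.
weights_kg = [4.5, 22.0, 30.0]
ages_years = [3, 7, 5]

print(min_max_scale(weights_kg))
print(min_max_scale(ages_years))  # [0.0, 1.0, 0.5]
```
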
APPLY LEARNING METHODS

• ML tasks are typically conducted in a variety of programming
languages: predominantly R, Python, MATLAB, SQL, Java, and C
• R is typically used for statistical analysis
• Python is well suited to ML
• MATLAB is often used for fast prototyping
• SQL is used for managing data held in a traditional database
management system
TRAINING AND TEST DATA

• A training set and a test set are selected from the prepared data
• The algorithm is trained on the training dataset and evaluated against
the test dataset
• Signal: the true underlying pattern in a dataset
• Noise: random or irrelevant patterns in a dataset
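
Selecting a training set and a test set from the prepared data can be sketched as a shuffle followed by a cut. The 80/20 ratio, the seed, and the function name `train_test_split` are assumptions for this example.

```python
import random


def train_test_split(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of the rows and split it into (train, test)."""
    shuffled = rows[:]  # copy so the caller's list keeps its order
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical prepared data: ten data points.
data = list(range(10))
train, test = train_test_split(data)
print(len(train), len(test))  # 8 2
```

Keeping the test rows out of training is what lets the evaluation show whether the model picked up the signal rather than memorizing noise.
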
ASSESS METHOD AND RESULTS

• The performance of ML tasks depends on the representation of the data
they are given
• A complete feature set is not required for a model to produce highly
confident outputs
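
One simple way to assess results is accuracy: the fraction of test examples the model labels correctly. The sketch below assumes this metric and uses invented labels; the slides do not name a specific metric, and others may suit a given problem better.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


# Hypothetical true labels from the test set and a model's predictions.
y_true = ["dog", "cat", "dog", "cat"]
y_pred = ["dog", "cat", "cat", "cat"]

print(accuracy(y_true, y_pred))  # 3 of 4 correct: 0.75
```
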
PARAMETER TUNING

• After evaluation, the model can be improved further by tuning its
parameters, for example by showing the model the full dataset multiple
times, rather than just once, to increase accuracy
• The parameters tuned in this way are called “hyperparameters”
• It’s important to define what makes a model “good enough”; otherwise
you might find yourself tweaking parameters for a very long time
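
The tuning loop can be sketched as trying several hyperparameter values and stopping once the model is "good enough". This example swaps in a small k-nearest-neighbours classifier, where the hyperparameter is the number of neighbours `k`; the data, the candidate values, and the 0.9 threshold are all assumptions for illustration.

```python
from collections import Counter


def knn_predict(train, x, k):
    """Predict the label of x by majority vote among its k nearest points."""
    nearest = sorted(train, key=lambda pair: abs(pair[0] - x))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def accuracy(train, validation, k):
    """Fraction of validation points that k-NN labels correctly."""
    hits = sum(knn_predict(train, x, k) == label for x, label in validation)
    return hits / len(validation)


# Hypothetical data: (weight_kg, species). The 6.0 kg dog is a noisy point.
train = [(3.0, "cat"), (4.0, "cat"), (5.0, "cat"), (6.0, "dog"),
         (20.0, "dog"), (25.0, "dog"), (30.0, "dog")]
validation = [(4.5, "cat"), (22.0, "dog"), (5.8, "cat"), (28.0, "dog")]

GOOD_ENOUGH = 0.9  # assumption: our definition of a good enough model
best_k, best_acc = None, -1.0
for k in (1, 3, 5):  # candidate hyperparameter values
    acc = accuracy(train, validation, k)
    if acc > best_acc:
        best_k, best_acc = k, acc
    if best_acc >= GOOD_ENOUGH:
        break  # stop tweaking once the target is met

print(best_k, best_acc)
```

With `k=1` the noisy point misleads one prediction; with `k=3` the vote of its neighbours corrects it, so the loop stops there instead of trying further values.
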
PREDICTIONS

• Prediction, or inference, is the step where we get to answer some
questions
• This is the real objective of all this work, where the value of ML is
realized
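
To make the prediction step concrete, the sketch below fits a very simple model (one average weight per species, an assumed nearest-centroid approach) and then uses it to answer a question about pets it has not seen before. The training data and function names are hypothetical.

```python
from statistics import mean


def fit_centroids(samples):
    """Compute the mean feature value for each label."""
    by_label = {}
    for value, label in samples:
        by_label.setdefault(label, []).append(value)
    return {label: mean(values) for label, values in by_label.items()}


def predict(centroids, value):
    """Return the label whose centroid is closest to the given value."""
    return min(centroids, key=lambda label: abs(centroids[label] - value))


# Hypothetical training data: (weight_kg, species).
training = [(3.0, "cat"), (4.0, "cat"), (5.0, "cat"),
            (20.0, "dog"), (25.0, "dog"), (30.0, "dog")]
model = fit_centroids(training)

# Inference on new, unseen pets.
print(predict(model, 4.2))   # cat
print(predict(model, 27.0))  # dog
```
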
