Professional Documents
Culture Documents
Chapter 0 Intorduction To ML
Chapter 0 Intorduction To ML
Machine Learning
Machine Learning Team
UP GL-BD
What is Machine Learning (ML) ?
Fundamentals:
Artificial
intelligence
Fundamentals:
Learning is useful when:
Why now ?
7
Machine Learning
• Business Understanding
Project objectives and requirements understanding, Data mining problem definition
• Data Understanding
Initial data collection and familiarization, Data quality problems identification
• Data Preparation
Table, record and attribute selection, Data transformation and cleaning
• Modeling
Modeling techniques selection and application, Parameters calibration
• Evaluation
Business objectives & issues achievement evaluation
• Deployment
Result model deployment, Repeatable data mining process implementation
IBM Master Plan
IBM Master Plan Phases
1. [Business Understanding]: allows to determine which data will be used to answer the
core question. Two things must be set: The goal and the objectives.
1. [analytic approach]: helps limit the algorithm(s) that will be used during the modeling
(predictive model / descriptive model).
1. [Data Requirements]: Identify the necessary data content, formats and sources for
initial data collection.
1. [Data Understanding]: Represent the collected data according to the problem we want
to solve
1. [Data Preparation]: Many operations such as addressing missing or invalid values and
removing duplicates. This step generally takes almost 90% of the overall project time.
1. [Modeling]: Generation of the model based on the analytic approach that was taken.
1. [Evaluation] : It’s the step in which we check if the model we have already generated
answer the initial request or not.
Data Science Tools
• Data Science and its Relationship to Big Data and Data-Driven Decision
Making, Foster ProvostData Science and its Relationship to Big Data and
Data-Driven Decision Making, Foster Provost and Tom Fawcett,
VOLUME 1, ISSUE 1 / MARCH 2013