Professional Documents
Culture Documents
UCS551 Chapter 5 - Machine Learning (Intro)
UCS551 Chapter 5 - Machine Learning (Intro)
MACHINE
LEARNING
CONCEPT AND
TECHNIQUES
DR AZLIN AHMAD
CONTENT
Fraud detection
Retail
Websites recommending items you might like based on previous purchases are using machine learning
to analyze your buying history. Retailers rely on machine learning to capture data, analyze it and use it to
personalize a shopping experience, implement a marketing campaign, price optimization, merchandise
supply planning, and for customer insights.
Oil and gas
Finding new energy sources. Analyzing minerals in the ground. Predicting refinery sensor failure.
Streamlining oil distribution to make it more efficient and cost-effective. The number of machine learning
use cases for this industry is vast – and still expanding.
Transportation
Analyzing data to identify patterns and trends is key to the transportation industry, which relies on
making routes more efficient and predicting potential problems to increase profitability. The data analysis
and modeling aspects of machine learning are important tools to delivery companies, public
transportation and other transportation organizations.
LEARNING IN
MACHINE
LEARNING
WHAT IS LEARNING?
RESULTS/
OUTPUT
INPUT
PROCESS:
Machine learning
It’s an apple
It’s a banana
TYPES OF LEARNING
SPLIT RATIO:
SPLIT RATIO
(TRAINING, VALIDATION(OPTIONAL) AND TESTING )
Training Dataset: Validation Dataset: The sample of data used to provide Test Dataset: The sample of data used
The sample of data used an unbiased evaluation of a model fit on the training to provide an unbiased evaluation of a
to fit the model. dataset while tuning model hyperparameters. The final model fit on the training dataset.
The actual dataset that evaluation becomes more biased as skill on the The Test dataset provides the gold
we use to train the validation dataset is incorporated into the model standard used to evaluate the model.
model (weights and configuration. It is only used once a model is
biases in the case of The validation set is used to evaluate a given model, but completely trained(using the train and
Neural Network). The this is for frequent evaluation. We as machine learning validation sets). The test set is
model sees and learns engineers use this data to fine-tune the model generally what is used to evaluate
from this data. hyperparameters. competing models.
CROSS VALIDATION
1000 ROWS OF DATA 5 FOLDS (EACH FOLD CONSISTS OF 200 DATA)
Movie clips:
https://drive.google.com/drive/folders/14mGBMMcLtxvVv9KEZBO_BLhbT7QuoY1v?usp=sharing
REFERENCE
https://www.sas.com/en_my/insights/analytics/machine-learning.html
https://www.oreilly.com/ideas/machine-learning-a-quick-and-simple-definition
https://www.openml.org/a/estimation-procedures/7
https://machinelearningmastery.com/k-fold-cross-validation/
https://www.quora.com/What-is-training-learning-and-testing-in-machine-learning