Professional Documents
Culture Documents
Introduction To Machine Learning
Introduction To Machine Learning
LEARNING
WHAT IS MACHINE LEARNING?
• Only inputs are provided to the model. No explicit outputs are provided
• Algorithms work on the data to group together inputs with similar traits.
These groups are also called ‘clusters’
• Clusters give us an idea about the similarity of the points in the cluster
• Unsupervised learning can help provide inferences/outputs that are not
discernable normally
Example:
Inputs = Economic and demographic features of countries such as GDP, Happiness Index
ranking, etc
Model = Model that clusters together countries that have similar situations economically
Outputs/Inferences = Clusters of countries with similar feature. E.g. – countries with
broken economies and below par standards of living (Venezuela, Somalia, Liberia),
countries with booming economies and better quality of living (USA, Germany, Canada)
SEMI-SUPERVISED LEARNING
• Inputs are provided along with a proportion of labelled output data
• This serves the purpose of calculating specific outputs while also enabling
clustering to take place
SEMI-SUPERVISED LEARNING
• Initially, a classifier (e.g., naïve Bayesian classifier) is trained with the small labeled
set considering all features.
• The classifier is then applied to classify the unlabeled set.
• Those most confidently classified (or unlabeled) documents of each class, together
with their predicted class labels, are added to the labeled set.
• The classifier is then re-trained and the procedure is repeated.
• This process iterates until all the unlabelled documents are given class labels. The
basic idea of this method is that the classifier uses its own predictions to teach itself.
REINFORCEMENT LEARNING
• Mathematically:
=
OR
OR
OR
•
• If α is too small, gradient descent can be
slow.
• Precision : It is the number of correct positive results divided by the number
of positive results predicted by the classifier.
• Recall : It is the number of correct positive results divided by the number
of all relevant samples (all samples that should have been identified as
positive).
F1-SCORE
• F1 Score is the Harmonic Mean between precision and recall. The range for F1
Score is [0, 1]. It tells you how precise your classifier is (how many instances it
classifies correctly), as well as how robust it is (it does not miss a significant
number of instances).
• High precision but lower recall, gives you an extremely accurate, but it then
misses a large number of instances that are difficult to classify. The greater the F1
Score, the better is the performance of our model. Mathematically, it can be
expressed as :
THANK YOU!