Professional Documents
Culture Documents
ML Bu
ML Bu
Examples
Handwriting recognition learning problem
Training experience E : A sequence of images and steering commands recorded while observing a
human driver
Supervised learning
Unsupervised learning
Semi-supervised learning
Reinforcement learning
Classification
Regression
Clustering
Challenges of Machine Learning!
Training data.
Irrelevant features.
Let’s say for a child, to make him learn what an apple is, all it takes for you to point to an apple and say apple
Well, machine learning is still not up to that level yet; it takes a lot of data for most of the algorithms to function
properly. For a simple task, it needs thousands of examples to make something out of it, and for advanced tasks
Data plays a significant role in the machine learning process. One of the significant issues that machine
learning professionals face is the absence of good quality data. Unclean and noisy data can make the whole
The training data has lots of errors, outliers, and noise, it will make it impossible for your machine learning
model to detect a proper underlying pattern. Hence, it will not perform well.
Irrelevant Features
Training data must always contain more relevant and less to none irrelevant features.
The credit for a successful machine learning project goes to coming up with a good set of features on which it
has been trained (often referred to as feature engineering ), which includes feature selection, extraction, and
creating new features which are other interesting topics to be covered in upcoming blogs.
Imperfections in the Algorithm When Data Grows
The best model of the present may become inaccurate in the coming Future and require further
rearrangement. So you need regular monitoring and maintenance to keep the algorithm working.
This is one of the most exhausting issues faced by machine learning professionals.
Overfitting
In machine learning, we call this overfitting i.e model performs well on training data but fails to generalize well.
This process occurs when data is unable to establish an accurate relationship between input and output variables
It happens when our model is too simple to learn something from the data. For E.G., you use a linear model on a set with multi-
collinearity it will for sure underfit and the predictions are bound to be inaccurate on the training set too.
Machine learning is all set to bring a big bang transformation in technology. It is one of the most rapidly growing
technologies used in medical diagnosis, speech recognition, robotic training, product recommendations, video
surveillance, and this list goes on. This continuously evolving domain offers immense job satisfaction, excellent
opportunities, global exposure, and exorbitant salary. It is a high risk and a high return technology.
Session 3
opularMachineLearning
P
Algorithms?
•Linear regression
•Logistic regression
•Decision tree
•SVM algorithm
•KNN algorithm
•K-means
•C:\Users\91991\Desktop\BU ML alog.docx