Resource (ML & DL)

Machine Learning Recruitment Material

Machine Learning (ML) is the field of computer science that enables computer systems to make sense of
data in much the same way as human beings do. In simple words, ML is a type of artificial intelligence
that extracts patterns from raw data by using an algorithm or method. The main focus of ML is to allow
computer systems to learn from experience without being explicitly programmed and without human
intervention.

ML is a field of AI consisting of learning algorithms that:


❖ Improve their performance (P)
❖ At executing some task (T)
❖ Over time with experience (E)

Challenges in Machine Learning


Overfitting and underfitting: A model that overfits (memorizes the training data) or underfits (is too
simple to capture the pattern) does not represent the problem well.
Curse of dimensionality: Another challenge ML models face is data points with too many features, which
can be a real hindrance.
Time-consuming tasks: ML projects also consume a lot of time, especially for data acquisition, feature
extraction and retrieval.
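As a rough illustration of underfitting and overfitting, the sketch below fits polynomials of increasing degree to a few noisy samples of a sine curve. The data, the noise level and the chosen degrees are assumptions made for this example only. Training error cannot increase as the degree grows, while error on held-out points typically rises again once the model starts fitting noise.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: noisy samples of sin(pi * x) on [-1, 1].
x_train = np.linspace(-1.0, 1.0, 12)
y_train = np.sin(np.pi * x_train) + rng.normal(0.0, 0.2, x_train.size)
x_test = np.linspace(-0.95, 0.95, 50)
y_test = np.sin(np.pi * x_test)

def mse(degree):
    """Fit a polynomial of the given degree and return (train, test) error."""
    coeffs = np.polyfit(x_train, y_train, degree)
    train_err = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_err = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    return train_err, test_err

# Degree 1 is too simple (underfits); degree 9 is flexible enough to chase noise.
errors = {degree: mse(degree) for degree in (1, 3, 9)}
for degree, (train_err, test_err) in errors.items():
    print(f"degree={degree}: train MSE={train_err:.4f}, test MSE={test_err:.4f}")
```

Comparing the two error columns for each degree is the usual way to tell the two failure modes apart: high error on both suggests underfitting, low training error with high test error suggests overfitting.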

Applications of Machine Learning


● Emotion analysis
● Sentiment analysis
● Error detection and prevention
● Weather forecasting and prediction
● Stock market analysis and forecasting
● Speech synthesis
● Speech recognition

NumPy
It is a useful component that makes Python one of the favorite languages for data science. The name
stands for Numerical Python, and the library is built around multidimensional array objects.

By using NumPy, we can perform the following important operations:


- Mathematical and logical operations on arrays.
- Fourier transformation.
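The operations listed above can be sketched as follows. The arrays are made-up values chosen for illustration.

```python
import numpy as np

a = np.array([1, 2, 3, 4])
b = np.array([10, 20, 30, 40])

# Mathematical operations are applied element by element.
total = a + b
scaled = a * 2

# Logical operations produce boolean arrays that can be used as masks.
mask = (a > 1) & (a < 4)
selected = b[mask]

# Fourier transformation: the FFT of a unit impulse is flat (all ones).
impulse = np.array([1.0, 0.0, 0.0, 0.0])
spectrum = np.fft.fft(impulse)

print(total, scaled, selected, spectrum)
```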
Matplotlib.pyplot
It is another useful component, used to visualize data in Python. As the name pyplot suggests, this
library is used for drawing different kinds of plots.

By using this library, we can plot:


- Scatter plot
- A line graph
- A bar plot
- Etc.
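A minimal sketch of the three plot types named above, drawn side by side. The data values are hypothetical, and the non-interactive "Agg" backend is selected only so that the script also runs on machines without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; an assumption for headless runs
import matplotlib.pyplot as plt

# Hypothetical data points.
x = [1, 2, 3, 4]
y = [1, 4, 9, 16]

fig, (ax_scatter, ax_line, ax_bar) = plt.subplots(1, 3, figsize=(9, 3))
ax_scatter.scatter(x, y)   # scatter plot
ax_scatter.set_title("Scatter plot")
ax_line.plot(x, y)         # line graph
ax_line.set_title("Line graph")
ax_bar.bar(x, y)           # bar plot
ax_bar.set_title("Bar plot")
fig.tight_layout()
```

Calling fig.savefig with a file name, or plt.show() in an interactive session, would then display the figure.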
Pandas
The pandas library is used to load and edit data frames. It provides high-performance, easy-to-use
data structures and data analysis tools for the Python programming language.
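As a small sketch of editing a data frame, the example below builds one from made-up values, adds a derived column and filters rows. The column names are hypothetical. In practice the frame is often loaded from a file, for example with pd.read_csv.

```python
import pandas as pd

# Hypothetical data, created in memory instead of being loaded from a file.
df = pd.DataFrame({
    "name": ["a", "b", "c"],
    "height_cm": [150, 160, 170],
})

# Edit the frame: add a derived column and keep only some rows.
df["height_m"] = df["height_cm"] / 100
tall = df[df["height_cm"] >= 160]

print(df)
print(tall)
```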

Machine learning deals with a wide range of concepts.

The concepts are listed below:


- Supervised learning
- Unsupervised learning
- Reinforcement learning
- Linear regression
- Cost functions
- Overfitting
- Underfitting
- Hyperparameters
Classification

Classification is a prediction problem that predicts the categorical class labels, which are
discrete and unordered. It is a two-step process, consisting of a learning step and a
classification step.

K-Nearest Neighbors

The K-nearest neighbors (KNN) algorithm can be used to solve both classification and regression
problems. The symbol 'K' denotes the number of nearest neighbors of a new, unseen data point that are
consulted when its class or value is predicted.
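A minimal sketch of KNN classification, assuming Euclidean distance and a simple majority vote. The toy points and labels are made up for illustration.

```python
from collections import Counter
import math

def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    distances = [
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    ]
    distances.sort(key=lambda pair: pair[0])
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

# Hypothetical toy data: two groups of 2-D points.
points = [(1, 1), (1, 2), (2, 1), (8, 8), (8, 9), (9, 8)]
labels = ["small", "small", "small", "large", "large", "large"]

print(knn_predict(points, labels, (2, 2), k=3))
print(knn_predict(points, labels, (7, 9), k=3))
```

For regression, the same neighbor search applies, but the prediction would be the average of the neighbors' target values instead of a vote.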

Clustering:

A clustering problem is where you want to discover the inherent groupings in the data, such as
grouping customers by purchasing behavior.
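One common clustering method is k-means, which the text does not name. It is used here only as an example. The sketch below assumes hypothetical customer data (two numeric features per customer) and, to keep the result deterministic, uses the first k rows as initial centers.

```python
import numpy as np

def kmeans(data, k, n_iter=10):
    """Plain k-means; the first k rows are used as initial centers."""
    centers = data[:k].astype(float).copy()
    for _ in range(n_iter):
        # Assign each point to its nearest center.
        dists = np.linalg.norm(data[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Move each center to the mean of the points assigned to it.
        for j in range(k):
            if np.any(labels == j):
                centers[j] = data[labels == j].mean(axis=0)
    return labels, centers

# Hypothetical customers: (purchases per month, average basket size).
customers = np.array([[1, 1], [9, 9], [1, 2], [2, 1], [8, 9], [9, 8]])
labels, centers = kmeans(customers, k=2)
print(labels)
print(centers)
```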

Support Vector Machine Algorithm:


Support Vector Machine (SVM) is a supervised learning algorithm that is used for classification as well
as regression problems. However, it is primarily used for classification problems in machine learning.

The goal of the SVM algorithm is to create the best line or decision boundary that can segregate
n-dimensional space into classes so that we can easily put the new data point in the correct
category in the future. This best decision boundary is called a hyperplane.
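The sketch below shows only how a hyperplane, once found, assigns a new data point to a class. The weight vector w and offset b are made-up values standing in for the result of SVM training, which is not shown.

```python
import numpy as np

# Hypothetical hyperplane w . x + b = 0 in 2-D (assumed, not learned here).
w = np.array([1.0, 1.0])
b = -10.0

def classify(points):
    """Return +1 or -1 depending on which side of the hyperplane a point lies."""
    scores = points @ w + b
    return np.where(scores >= 0, 1, -1)

def distance_to_hyperplane(points):
    """Geometric distance of each point from the hyperplane."""
    return np.abs(points @ w + b) / np.linalg.norm(w)

new_points = np.array([[2.0, 3.0], [8.0, 7.0]])
print(classify(new_points))
print(distance_to_hyperplane(new_points))
```

Training an SVM amounts to choosing w and b so that the distance from the hyperplane to the closest training points (the margin) is as large as possible.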

Decision Trees
Decision trees are a type of supervised machine learning model in which the data is repeatedly split
according to a certain parameter. The tree can be described by two kinds of entities: decision nodes,
where the data is split, and leaves, which hold the final outcomes.
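To make the two entities concrete, here is a small hand-built tree and a function that walks it. The features ("age", "income"), thresholds and labels are hypothetical, and no tree-learning step is shown.

```python
# A hypothetical hand-built tree: decision nodes test one feature against a
# threshold, leaves hold the predicted label.
tree = {
    "feature": "age",
    "threshold": 30,
    "left": {"leaf": "no"},        # taken when age < 30
    "right": {                     # taken when age >= 30
        "feature": "income",
        "threshold": 50000,
        "left": {"leaf": "no"},
        "right": {"leaf": "yes"},
    },
}

def predict(node, sample):
    """Walk from the root to a leaf, following the split at each decision node."""
    while "leaf" not in node:
        branch = "left" if sample[node["feature"]] < node["threshold"] else "right"
        node = node[branch]
    return node["leaf"]

print(predict(tree, {"age": 25, "income": 80000}))
print(predict(tree, {"age": 40, "income": 60000}))
```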

Random Forest
The random forest algorithm can be used for both classification and regression tasks. By combining the
predictions of many decision trees it usually achieves higher accuracy than a single tree, which can be
checked through cross-validation. Depending on the implementation, random forest classifiers can also
handle missing values and maintain accuracy even when a large proportion of the data is missing.
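The idea of combining many trees can be sketched as below. This is a deliberately simplified version and not a full random forest: each "tree" is a one-level stump trained on a bootstrap sample with one randomly chosen feature, and the data, the number of trees and the seed are assumptions for illustration.

```python
import random
from collections import Counter

def train_stump(rows, labels, feature):
    """Best single-threshold split on one feature (a one-level decision tree)."""
    best = None
    for threshold in sorted({row[feature] for row in rows}):
        left = [lab for row, lab in zip(rows, labels) if row[feature] < threshold]
        right = [lab for row, lab in zip(rows, labels) if row[feature] >= threshold]
        if not left or not right:
            continue
        left_label = Counter(left).most_common(1)[0][0]
        right_label = Counter(right).most_common(1)[0][0]
        errors = sum(lab != left_label for lab in left)
        errors += sum(lab != right_label for lab in right)
        if best is None or errors < best[0]:
            best = (errors, threshold, left_label, right_label)
    if best is None:  # no usable split: always predict the majority label
        majority = Counter(labels).most_common(1)[0][0]
        return feature, float("-inf"), majority, majority
    return feature, best[1], best[2], best[3]

def train_forest(rows, labels, n_trees=15, seed=0):
    """Train stumps on bootstrap samples, each with one random feature."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]   # sample with replacement
        feature = rng.randrange(len(rows[0]))
        forest.append(
            train_stump([rows[i] for i in idx], [labels[i] for i in idx], feature)
        )
    return forest

def predict(forest, row):
    """Majority vote over all stumps."""
    votes = [left if row[f] < t else right for f, t, left, right in forest]
    return Counter(votes).most_common(1)[0][0]

# Hypothetical, clearly separated two-class data.
rows = [(1, 2), (2, 1), (2, 3), (3, 2), (8, 9), (9, 8), (8, 8), (9, 9)]
labels = [0, 0, 0, 0, 1, 1, 1, 1]
forest = train_forest(rows, labels)
print(predict(forest, (1, 1)), predict(forest, (10, 10)))
```

Real implementations grow full decision trees and pick a random subset of features at every split. The voting step is the same.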

Dimensionality reduction:

Dimensionality reduction is the statistical technique of reducing the number of random variables in
a problem by obtaining a set of principal variables.
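Principal component analysis (PCA) is one widely used way of obtaining such principal variables. The text does not name a specific method, so PCA is an assumption here. The sketch computes it with a singular value decomposition on made-up data whose three features almost lie on a single line.

```python
import numpy as np

def pca(data, n_components):
    """Project `data` onto its top principal components using the SVD."""
    centered = data - data.mean(axis=0)
    # Rows of vt are the principal directions, ordered by explained variance.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    explained = singular_values ** 2 / np.sum(singular_values ** 2)
    return centered @ vt[:n_components].T, explained[:n_components]

# Hypothetical data: three features, but the points lie almost on one line.
rng = np.random.default_rng(0)
t = rng.normal(size=100)
data = np.column_stack([t, 2 * t, -t]) + rng.normal(scale=0.01, size=(100, 3))

reduced, explained = pca(data, n_components=1)
print(reduced.shape, explained)
```

Here a single component captures nearly all of the variance, so three variables can be replaced by one with very little loss.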

Reinforcement learning:

It is a machine learning training method based on rewarding desired behaviors and/or punishing
undesired ones. In general, a reinforcement learning agent is able to perceive and interpret its
environment, take actions and learn through trial and error.
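A minimal trial-and-error sketch, assuming a made-up environment with just two actions: one is rewarded (+1) and one is punished (-1). The agent keeps a value estimate per action and mostly picks the action it currently rates highest (an epsilon-greedy rule, chosen here for simplicity).

```python
import random

rewards = {"good_action": 1.0, "bad_action": -1.0}  # hypothetical environment
actions = list(rewards)
q_values = {action: 0.0 for action in actions}      # the agent's value estimates

rng = random.Random(0)
epsilon, learning_rate = 0.1, 0.5

for _ in range(100):
    # Explore occasionally; otherwise exploit the best-known action.
    if rng.random() < epsilon:
        action = rng.choice(actions)
    else:
        action = max(actions, key=q_values.get)
    reward = rewards[action]  # reward (+1) or punishment (-1) from the environment
    # Nudge the estimate for the chosen action toward the observed reward.
    q_values[action] += learning_rate * (reward - q_values[action])

best_action = max(actions, key=q_values.get)
print(q_values, best_action)
```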

Linear Regression:
Linear regression performs a regression task: it models a target value as a linear function of one or
more independent variables.
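A short sketch of fitting a line by least squares. The data is generated from y = 2x + 1 purely for illustration, and mean squared error is used as the cost function.

```python
import numpy as np

# Hypothetical data generated from y = 2x + 1 (no noise).
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = 2.0 * x + 1.0

# Design matrix with a column of ones for the intercept.
X = np.column_stack([x, np.ones_like(x)])
(slope, intercept), *_ = np.linalg.lstsq(X, y, rcond=None)

# Cost function: mean squared error of the fitted line.
cost = np.mean((slope * x + intercept - y) ** 2)
print(slope, intercept, cost)
```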

Naive Bayes classifiers:

Naive Bayes classifiers are a collection of classification algorithms based on Bayes’ Theorem. Naive
Bayes is not a single algorithm but a family of algorithms that share a common principle: every pair of
features is assumed to be independent of each other, given the class.
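The independence assumption means the probability of a sample given a class is just the product of per-feature probabilities. Below is a minimal sketch for categorical features. The weather-style data is made up, and add-one smoothing with two possible values per feature is an assumption of this example.

```python
from collections import Counter, defaultdict
import math

def train(samples, labels):
    """Count class frequencies and per-class feature-value frequencies."""
    class_counts = Counter(labels)
    feature_counts = defaultdict(Counter)  # (class, feature index) -> value counts
    for sample, label in zip(samples, labels):
        for i, value in enumerate(sample):
            feature_counts[(label, i)][value] += 1
    return class_counts, feature_counts

def predict(model, sample):
    """Pick the class with the highest log-probability score."""
    class_counts, feature_counts = model
    total = sum(class_counts.values())
    scores = {}
    for label, count in class_counts.items():
        # log P(class) + sum of log P(feature_i | class)
        score = math.log(count / total)
        for i, value in enumerate(sample):
            # Add-one smoothing, assuming two possible values per feature.
            score += math.log((feature_counts[(label, i)][value] + 1) / (count + 2))
        scores[label] = score
    return max(scores, key=scores.get)

# Hypothetical data: (outlook, wind) -> play?
samples = [("sunny", "weak"), ("sunny", "strong"), ("rainy", "weak"),
           ("rainy", "strong"), ("sunny", "weak"), ("rainy", "strong")]
labels = ["yes", "yes", "yes", "no", "yes", "no"]
model = train(samples, labels)
print(predict(model, ("sunny", "weak")))
print(predict(model, ("rainy", "strong")))
```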
Deep Learning

Deep learning is a family of methods loosely inspired by the network of neurons in a brain.
It is a subset of machine learning based on artificial neural networks with
representation learning. It is called deep learning because it makes use of deep
neural networks, that is, networks with many layers. The learning can be supervised,
semi-supervised or unsupervised.

The learning occurs in two phases:


First phase: A nonlinear transformation is applied to the input and a statistical
model is created as output.
Second phase: The model is improved using derivatives, which indicate how each
parameter should change to reduce the error.
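The two phases can be sketched with a single artificial neuron, which is far smaller than a real deep network and is used here only to keep the example short. The data, the sigmoid nonlinearity, the cross-entropy loss and the learning rate are all assumptions of this sketch.

```python
import numpy as np

# Hypothetical 1-D data: the label is 1 when the input is large.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([0.0, 0.0, 1.0, 1.0])

w, b = 0.0, 0.0
learning_rate = 0.1

def forward(w, b):
    """Phase 1: nonlinear (sigmoid) transformation of the input."""
    return 1.0 / (1.0 + np.exp(-(w * x + b)))

def loss(pred):
    """Cross-entropy between predictions and labels."""
    return -np.mean(y * np.log(pred) + (1 - y) * np.log(1 - pred))

initial_loss = loss(forward(w, b))
for _ in range(500):
    pred = forward(w, b)
    # Phase 2: derivatives of the loss with respect to w and b,
    # followed by a small step against the gradient.
    grad_w = np.mean((pred - y) * x)
    grad_b = np.mean(pred - y)
    w -= learning_rate * grad_w
    b -= learning_rate * grad_b
final_loss = loss(forward(w, b))
print(initial_loss, final_loss)
```

In a deep network the same two phases are repeated layer by layer: a forward pass through every layer, then derivatives propagated backwards through them.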
Recurrent neural networks (RNNs)
Recurrent neural networks (RNNs) are a class of neural networks that are helpful in modeling sequence
data. Derived from feedforward networks, RNNs keep an internal state that carries information from
earlier elements of a sequence to later ones, which is loosely similar to memory in the human brain.
Simply put, recurrent neural networks can make predictions on sequential data that are difficult for
algorithms without such a state.
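A minimal sketch of the forward pass of a simple recurrent layer, assuming a tanh activation and random, untrained weights. The sizes and the input sequence are made up, and no training is shown.

```python
import numpy as np

rng = np.random.default_rng(0)
input_size, hidden_size = 3, 4

# Randomly initialised (untrained) weights, for illustration only.
W_x = rng.normal(scale=0.5, size=(hidden_size, input_size))
W_h = rng.normal(scale=0.5, size=(hidden_size, hidden_size))
b = np.zeros(hidden_size)

def rnn_forward(sequence):
    """Run a simple recurrent layer over a sequence and return all hidden states."""
    h = np.zeros(hidden_size)
    states = []
    for x_t in sequence:
        # The new state depends on the current input and the previous state.
        h = np.tanh(W_x @ x_t + W_h @ h + b)
        states.append(h)
    return np.stack(states)

sequence = rng.normal(size=(5, input_size))  # 5 time steps
states = rnn_forward(sequence)
print(states.shape)
```

The reuse of h from one step to the next is what distinguishes this from a feedforward layer, which would process each time step independently.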

Convolutional neural networks (CNN)

A convolutional neural network (CNN) is a type of artificial neural network that is widely used for
image and object recognition and classification. Deep learning systems commonly recognize objects in
an image by using a CNN.
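The core operation inside a CNN layer is sliding a small kernel over the image. The sketch below implements that operation directly. The 4x4 image and the edge-detecting kernel are hypothetical, and in a real CNN the kernel values would be learned.

```python
import numpy as np

def conv2d(image, kernel):
    """Valid 2-D cross-correlation, the core operation of a convolutional layer."""
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# Hypothetical 4x4 image with a vertical edge between columns 1 and 2.
image = np.array([[0, 0, 1, 1]] * 4, dtype=float)
# Kernel that responds to a left-to-right increase in brightness.
kernel = np.array([[-1.0, 1.0]])
feature_map = conv2d(image, kernel)
print(feature_map)
```

The output is nonzero only where the edge is, which is how convolutional layers pick out local patterns regardless of where they appear in the image.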
Deep learning applications

Fraud detection.
Customer relationship management systems.
Computer vision.
Vocal AI.
Natural language processing.
Data refining.
Autonomous vehicles.
Supercomputers.
