
Assignment-2

1) Explain classification with logistic regression and the sigmoid function.

Logistic regression is one of the most popular Machine Learning algorithms, which comes under
the Supervised Learning technique. It is used for predicting the categorical dependent variable
using a given set of independent variables.
Logistic regression predicts the output of a categorical dependent variable, so the
outcome must be a categorical or discrete value such as Yes or No, 0 or 1, or True or False.
Rather than returning the exact values 0 and 1, however, it returns probabilities that lie
between 0 and 1.
Logistic regression is similar to linear regression except in how the two are used:
linear regression is used for solving regression problems, whereas logistic regression is used
for solving classification problems.
In logistic regression, instead of fitting a straight regression line, we fit an "S"-shaped
logistic function, whose output is bounded between 0 and 1.
The curve of the logistic function indicates the likelihood of an event, such as whether
cells are cancerous or whether a mouse is obese based on its weight.
Logistic regression is a significant machine learning algorithm because it can provide
probabilities and classify new data using both continuous and discrete datasets.
It can be used to classify observations using different types of data and can
easily determine the variables most effective for the classification. The logistic
(sigmoid) function is described below.

Logistic Function (Sigmoid Function):


The sigmoid function is a mathematical function used to map predicted values to
probabilities.
It maps any real value to a value between 0 and 1.
Because the output of logistic regression must lie between 0 and 1 and cannot go beyond
this limit, its curve takes an "S" shape. This S-shaped curve is called the sigmoid
function or the logistic function.
In logistic regression, we use the concept of a threshold value, which decides between
the classes 0 and 1: probabilities above the threshold are mapped to 1, and probabilities
below it are mapped to 0.

In machine learning, we use the sigmoid function to map predictions to probabilities:

S(z) = 1 / (1 + e^(-z))

where
S(z) = output between 0 and 1 (the probability estimate)
z = input to the function (your algorithm's prediction, e.g. mx + b)
e = base of the natural logarithm
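To make the mapping concrete, here is a minimal Python sketch of the sigmoid and the threshold rule. NumPy and the 0.5 threshold are assumptions for illustration; the text names no library and fixes no threshold.

import numpy as np

def sigmoid(z):
    # Map any real value z to a probability between 0 and 1
    return 1.0 / (1.0 + np.exp(-z))

def predict_class(z, threshold=0.5):
    # Threshold rule: probabilities above the threshold map to class 1,
    # probabilities below it map to class 0 (0.5 is an assumed default)
    return (sigmoid(z) >= threshold).astype(int)

# z is the linear prediction, e.g. m*x + b
z = np.array([-4.0, -1.0, 0.0, 1.0, 4.0])
print(sigmoid(z))        # approx [0.018 0.269 0.5 0.731 0.982]
print(predict_class(z))  # [0 0 1 1 1]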

Assumptions for Logistic Regression:


The dependent variable must be categorical in nature.
The independent variables should not exhibit multicollinearity.
Logistic Regression Equation:
The logistic regression equation can be obtained from the linear regression equation. The
mathematical steps to obtain the logistic regression equation are given below.
We know the equation of a straight line can be written as:

y = b0 + b1x1 + b2x2 + ... + bnxn

In logistic regression y can be between 0 and 1 only, so we divide the above equation
by (1 - y):

y / (1 - y), which is 0 for y = 0 and infinity for y = 1

But we need a range from -infinity to +infinity, so taking the logarithm of the equation
gives:

log[y / (1 - y)] = b0 + b1x1 + b2x2 + ... + bnxn

The above equation is the final equation for logistic regression: the log-odds of the
outcome are a linear function of the inputs.
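As an illustrative sketch of this relationship, one can check numerically that a fitted model's log-odds reproduce the linear part b0 + b1x. Scikit-learn and the toy data below are assumptions; the text prescribes no library or dataset.

import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical toy data: class 1 becomes more likely as x grows
X = np.array([[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]])
y = np.array([0, 0, 0, 1, 1, 1])

model = LogisticRegression().fit(X, y)

p = model.predict_proba(X)[:, 1]          # predicted P(y = 1 | x)
log_odds = np.log(p / (1 - p))            # log[y / (1 - y)] from the equation
linear = (model.intercept_ + X @ model.coef_.T).ravel()  # b0 + b1*x

# The log-odds equal the linear combination, which is the point of the derivation
print(np.allclose(log_odds, linear))      # True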

2) Define Support Vector Machine and explain the process of finding the maximum margin
linear separator.

Support Vector Machine, or SVM, is one of the most popular supervised learning algorithms
and is used for both classification and regression problems. However, it is primarily used
for classification problems in machine learning.
The goal of the SVM algorithm is to create the best line or decision boundary that can segregate
n-dimensional space into classes so that we can easily put the new data point in the correct
category in the future. This best decision boundary is called a hyperplane.
SVM chooses the extreme points/vectors that help in creating the hyperplane. These extreme
cases are called support vectors, and hence the algorithm is termed a Support Vector Machine.
Consider two different categories of points that are classified using a decision boundary,
or hyperplane.

"The support vector machine (SVM) is a supervised learning method that generates input-output
mapping functions from a set of labeled training data." A Support Vector Machine (SVM)
performs classification by finding the hyperplane that maximizes the margin between the two
classes. The vectors (cases) that define the hyperplane are the support vectors.

Algorithm:
1. Define an optimal hyperplane: maximize the margin.
2. Extend the above definition to non-linearly separable problems: add a penalty term for
misclassifications.
3. Map the data to a high-dimensional space where it is easier to classify with linear
decision surfaces: reformulate the problem so that the data is mapped implicitly to this space.
To define an optimal hyperplane we need to maximize the width of the margin; for a
hyperplane w . x + b = 0 this width equals 2/||w||.
The beauty of SVM is that if the data is linearly separable, there is a unique global minimum
value. An ideal SVM analysis should produce a hyperplane that completely separates the vectors
(cases) into two non-overlapping classes. However, perfect separation may not be possible, or
it may result in a model so tailored to the training cases that it does not classify new data
correctly. In this situation SVM finds the hyperplane that maximizes the margin and minimizes
the misclassifications.
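A minimal sketch of these steps, assuming scikit-learn's SVC and invented 2-D data (the text names no implementation): kernel="linear" covers step 1 of the algorithm, and the C parameter is the misclassification penalty of step 2.

import numpy as np
from sklearn.svm import SVC

# Hypothetical 2-D data: two linearly separable groups
X = np.array([[1.0, 1.0], [1.5, 0.5], [2.0, 1.5],
              [5.0, 5.0], [5.5, 4.5], [6.0, 5.5]])
y = np.array([0, 0, 0, 1, 1, 1])

# C is the penalty term for misclassifications: a small C tolerates
# margin violations, a large C approaches a hard margin
clf = SVC(kernel="linear", C=1.0).fit(X, y)

# Only the extreme cases that define the margin are kept as support vectors
print(clf.support_vectors_)
print(clf.predict([[2.0, 2.0], [5.0, 5.0]]))  # expected: [0 1]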
1. Maximum Margin Linear Separators
For the maximum margin hyperplane, only the examples on the margin matter (only these affect
the distances); these are called support vectors. The objective of the support vector machine
algorithm is to find a hyperplane in an N-dimensional space (where N is the number of
features) that distinctly classifies the data points.

To separate the two classes of data points, there are many possible hyperplanes that could be
chosen. Our objective is to find the plane with the maximum margin, i.e., the maximum distance
between data points of the two classes. Maximizing the margin distance provides some
reinforcement so that future data points can be classified with more confidence.
Hyperplanes
Hyperplanes are decision boundaries that help classify the data points. Data points falling on
either side of the hyperplane can be attributed to different classes. Also, the dimension of the
hyperplane depends upon the number of features. If the number of input features is 2, then the
hyperplane is just a line. If the number of input features is 3, then the hyperplane becomes a two-
dimensional plane. It becomes difficult to imagine when the number of features exceeds 3.
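As a sketch of this geometry (again assuming scikit-learn and made-up data), with two input features the learned hyperplane w . x + b = 0 is just a line, and the margin width 2/||w|| can be read off the fitted coefficients.

import numpy as np
from sklearn.svm import SVC

# Hypothetical 2-feature data, so the hyperplane is a line in the plane
X = np.array([[1.0, 1.0], [2.0, 1.5], [5.0, 5.0], [6.0, 5.5]])
y = np.array([0, 0, 1, 1])

clf = SVC(kernel="linear", C=1e6).fit(X, y)  # large C approximates a hard margin

w = clf.coef_[0]       # normal vector of the separating hyperplane
b = clf.intercept_[0]  # offset: the hyperplane is w . x + b = 0
width = 2.0 / np.linalg.norm(w)  # margin width of the maximum margin separator

print(f"hyperplane: {w[0]:.3f}*x1 + {w[1]:.3f}*x2 + {b:.3f} = 0")
print(f"margin width: {width:.3f}")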

3)

