Logistic Regressions
Logistic Regression
• Logistic regression is one of the most common machine learning
algorithms. It can be used to predict the probability of an event
occurring, such as whether an incoming email is spam or not, or
whether a tumor is malignant or not, based on a given labeled data
set.
• The model has the word “logistic” in its name because it uses
the logistic function (sigmoid) to convert a linear combination of the
input features into a probability between 0 and 1.
• Recall that in supervised ML problems, we are given a training set
of n labeled samples: D = {(x₁, y₁), (x₂, y₂), … , (xₙ, yₙ)}, where xᵢ is an
m-dimensional vector that contains the features of sample i,
and yᵢ represents the label of that sample. Our goal is to build a
model whose predictions are as close as possible to the true labels.
• In classification problems, the label yᵢ can take one of k values,
representing the k classes to which the samples belong. More
specifically, in binary classification problems, the label yᵢ can assume
only two values:
• 0 (representing the negative class)
• and 1 (representing the positive class).
Logistic Regression Model
• Logistic regression is a probabilistic classifier that handles binary
classification problems. Given a sample with feature vector x, it outputs the
probability p that the sample belongs to the positive class (y = 1).
• In the gradient-based update used to train the model, α is a learning
rate that controls the step size (0 < α < 1).
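The update rule that this learning rate belongs to is not reproduced here. As a rough sketch of how α typically enters training, the code below assumes batch gradient descent on the average log loss. Both choices are assumptions, and the function names and toy data are hypothetical.

```python
import math


def sigmoid(z: float) -> float:
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def log_loss(X, y, w, b, eps=1e-12):
    """Average log loss over the training set (assumed loss function)."""
    total = 0.0
    for xi, yi in zip(X, y):
        p = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b)
        p = min(max(p, eps), 1.0 - eps)  # keep log() finite
        total -= yi * math.log(p) + (1 - yi) * math.log(1.0 - p)
    return total / len(X)


def gradient_step(X, y, w, b, alpha):
    """One batch gradient descent step; alpha scales the step size."""
    n = len(X)
    grad_w = [0.0] * len(w)
    grad_b = 0.0
    for xi, yi in zip(X, y):
        p = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b)
        err = p - yi  # derivative of the log loss w.r.t. the linear score
        for j, xj in enumerate(xi):
            grad_w[j] += err * xj / n
        grad_b += err / n
    # Move against the gradient, scaled by the learning rate.
    new_w = [wj - alpha * gj for wj, gj in zip(w, grad_w)]
    new_b = b - alpha * grad_b
    return new_w, new_b


# Hypothetical one-feature toy data: larger x tends to mean label 1.
X = [[0.0], [1.0], [2.0], [3.0]]
y = [0, 0, 1, 1]
w, b, alpha = [0.0], 0.0, 0.5

loss_before = log_loss(X, y, w, b)
new_w, new_b = gradient_step(X, y, w, b, alpha)
loss_after = log_loss(X, y, new_w, new_b)
print(loss_before, loss_after)
```

A smaller α takes more cautious steps and usually needs more iterations. A larger α moves faster but can overshoot the minimum.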