Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 15

MBA/ Data Mining/: Ch-4/Lecture 1 & 2

Residual = Observed –
predicted value
e=y–ŷ
Example No:00
Example No:00 Using online Regression calulator : https://www.socscistatistics.com/tests/regression/default.aspx
EXAMPLE 2: https://www.socscistatistics.com/tests/regression/default.aspx)
https://home.ubalt.edu/ntsbarsh/business-stat/otherapplets/MultRgression.htm
Multiple Regression
• Multiple regression analysis is a powerful technique used for predicting
the unknown value of a variable(Response/Dependent ) from the
known value of two or more variables- also called the
predictors(Independent variables).
• More precisely, multiple regression analysis helps us to predict the
value of Y for given values of X1, X2, …, Xk.

For example the yield of rice per acre depends upon quality of seed,
fertility of soil, fertilizer used, temperature, rainfall. If one is
interested to study the joint affect of all these variables on rice yield,
one can use this technique.
• An additional advantage of this technique is it also enables us to study
the individual influence of these variables on yield.
EXAMPLE 2 :
•Consider the data set "mtcars"
available in the R environment.
It gives a comparison between
different car models in terms
of mileage per gallon (mpg),
cylinder displacement("disp"),
horse power("hp"), weight of
the car("wt") and some more
parameters
.The goal of the model is to
establish the relationship
between "mpg" as a response
variable with "disp","hp" and
"wt" as predictor variables. We
create a subset of these
variables from the mtcars data
set for this purpose.
Researcher : Dr. P.Kundu
[Ph.D,M.Phil,MBA,MCA,]
Logistic Regression

• Logistic Regression is a regression model in which the response/dependent variable has


categorical values such as True/False or 0/1. It actually measures the probability of a binary
response as the value of response variable based on the equation relating it with the predictor
variables.
The general mathematical equation for logistic regression is −
y = 1/(1+e^-(a+b1x1+b2x2+b3x3+...)) 

⮚ input <- mtcars[,c("am","cyl","hp","wt")]

⮚ am.data = glm(formula = am ~ cyl + hp + wt, data = input, family = binomial) # generalized


linear function

⮚ print(summary(am.data)
⮚ plot(am.data)
more example : https://stats.idre.ucla.edu/r/dae/logit-regression/
Description of
mtcars dataset

ASSIGNMENT TO FIND Logistics


Regression Analysis Based on
am(Autometic/Manual)

You might also like