Professional Documents
Culture Documents
Presentation Business Applications
Presentation Business Applications
Regression methods allow you to effectively obtain information from data in order to make proper management
decisions
Even a simple regression analysis in MS Excel can give a competitive advantage because you can better understand your
clients, your employees, your competitors and your products
QUANTITATIVE AND QUALITATIVE VARIABLES
Quantitative data take the form of numeric variables – sales in units, revenue in dollars,
count of employees and so on
Qualitative data, on the other hand, takes the form of categorical variables – type of a
product (smartphone, watch, notebook etc), customer satisfaction on a scale or binary
response (did you buy our product?)
Linear regression can be used to analyze both quantitative and qualitative data (after
certain modifications)
Business data can be cross-sectional (measured
simultaneously for many units: employee salaries,
revenue by product etc) or time series (monthly
interest rate, stock price etc)
CROSS
Panel data are combination of those two types when
a group of units is observed over several periods
SECTIONAL
(sales of different brands per month) VERSUS TIME
SERIES DATA
What determines price of an apartment? There are potentially many factors such as size, years since last
renovation, number of bedrooms etc
A realtor may collect information on 100 apartments including apartment price (the key variable of interest) and
potential factors affecting the price
*Later we will also need to add t-statistics or p-values for statistical significance
Let’s continue with our simple model:
OTHER
COEFFICIENTS Here an additional bedroom is associated with a price
increase of 15,000 because it indicates a larger apartment
For example, an apartment with two bedrooms which was renovated 5 years
ago would cost 86,000:
EXAMPLE 1:
PREDICTION
Apartment Price = 65,000 + 15,000*2 – 1,800*5 = 86,000
Notice that we have not yet discussed the notion of statistical significance –
despite a substantial effect of additional bedroom (based on the coefficient of
15,000) the number of bedrooms may be not associated with price in
statistical sense
Regression is a great three-in-one tool
allowing you to:
PRICE OF AN APARTMENT IS AN EXAMPLE OF A MANY BUSINESS PROBLEMS INVOLVE QUALITATIVE BINARY VARIABLES REPRESENT AN IMPORTANT
QUANTITATIVE VARIABLE (SIMILARLY TO SALES, VARIABLES – A CLIENT’S DECISION TO BUY OUR EXAMPLE OF QUALITATIVE VARIABLES: WILL
WAGES ETC) PRODUCT (YES/NO), SATISFACTION WITH THE CLIENT BUY? WILL EMPLOYEE CHURN?
SERVICE (VERY SATISFIED, SATISFIED, NEUTRAL,
DISSATISFIED, VERY DISSATISFIED)
EXAMPLE 2: WILL CUSTOMER CHURN?
It is easy to apply linear regression to answer this question by modifying “Yes” to 1 and “No” to 0
Here a new client (Months Using Service = 0) with no complaints (Number of Complaints = 0) is likely to churn next
month with probability of 7% (intercept of 0.07)
*Logit and probit are two common alternatives to LPM but their interpretation is not as easy
EXAMPLE 2: OTHER COVARIATES
Each additional month will decrease the probability of churn by 1% point – client who has been with us for 1 year
is less likely to leave by 12% points ( = -0.01*12)
Finally, a client with two complaints is more likely to leave by 40% points ( = 0.20*2)
We can evaluate all our customers on the probability of churn and make special offers to those who are most
likely to leave
EXAMPLE 3: TIME SERIES
Many business variables represent a time series – stock price, interest rate, weekly sales etc
Imagine that Ice-cream Sales in a given week depend on outside Temperature and whether it is a week in Summer
Ice-cream Sales = 345 + 15*Temperature + 120*Summer
As usual, the intercept of 345 shows Sales when all other variables are equal to 0
We can continue entertaining our simple model for time series of Ice-cream Sales
Ice-cream Sales = 345 + 15*Temperature + 120*Summer
Variable Summer is called an indicator or dummy variable – it is equal to 1 if week is in Summer and 0 otherwise
Observe that our model predicts that Ice-cream Sales are higher by 120K dollars during summer weeks
As before, we can use the model for prediction – Sales in May when Temperature = 50 will be equal to 1095K
dollars ( = 345 + 15*50)
One of the common mistakes is to draw causal
conclusions based on the linear regression model
CAUSATION!
dependence of Temperature on Sales we will also
find a positive effect!