Professional Documents
Culture Documents
Week2 Mon IntroToRegression
Week2 Mon IntroToRegression
2
Topics:
● The main idea
3
Models
4
The main idea of a regression model
Example: Factory production*
Y = RunTime (the time of a production run in minutes)
X Y
When the model has only one predictor variable, it is Simple Linear Regression.
5
*Based on the example in A Modern Approach to Regression in R (2009) by Dr Simon Sheather.
The two types of regression models:
Population model vs. Fitted model
● By Assuming the entire population has a linear relationship we can write the
population (regression) model.
● We can estimate this true population model with a fitted (regression) model
(shown in Blue).
The two types of regression models:
Population model vs. Fitted model
Most of the time, we don’t have access to the entire population, but want to draw
conclusions about the population.
Intercept
Basic: When x =0, the value of y will be ________ on average.
In context: The average setup time for the production is 149.75 min.
The intercept doesn’t always have a meaningful interpretation in context.
Slope
Basic: When x increases by 1 unit, the value of y (increases/decreases) by 0.26
on average.
In context: When we produce one more product, the production time
increases by 0.26 min on average. 8
Error vs. Residual
Topics:
● The concept of error
10
Error is the difference between the
observed Y and the expected Y.
Point A
Point B x y E(Y|X=x) e
=5-0.5x
Point A 4.5 4.2 2.75 1.45
Point B 0.8 4 4.6 -0.6
Signal Noise
12
Error vs. Residual
Error Residual
Notation
Definition
Random?
Distribution