Professional Documents
Culture Documents
Statistics For Management: Omar Paccagnella
Statistics For Management: Omar Paccagnella
Statistics For Management: Omar Paccagnella
Omar Paccagnella
omar.paccagnella@unipd.it
http://www.stat.unipd.it/~paccagnella
Introduction
What happens if:
Introduction
(X − X) ( Y − Y)
ZX = ZY =
SX SY
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 10/ 23
University of Padua
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 11/ 23
University of Padua
The best fitting line is the one that minimises the sum of
the squared distances between the data points and the line itself,
as measured in the vertical (Y ) direction
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 12/ 23
University of Padua
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 13/ 23
University of Padua
Y = β 0 + β1 X
Y = b0 + b1 X
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 14/ 23
University of Padua
Least Squares
The idea behind this method is that the line will be appropriate to
describe the relationship under investigation if the observed values are
closed to the straight line.
The distance between observed and fitted values is the residual:
ei = Yi − Ŷi = Yi − b0 − b1 Xi
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 15/ 23
University of Padua
Least Squares
First order conditions (that is the derivates of f (b0 , b1 ) with respect to b0
and b1 ) are applied to minimise SSE. Using little calculi:
Pn
i =1 (X − X)(Y − Y)
b̂1 = Pn 2
i =1 (X − X)
b̂0 = Y − b̂1 X
Hence, b̂1 and r are proportional to one another and have the same sign.
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 16/ 23
University of Padua
formally
Y = Ŷ + (Y − Ŷ )
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 17/ 23
University of Padua
For all values of X , the observed values for Y are identically distributed
around a mean µ that depends linearly on X :
µy = β0 + β1 X
As X changes, the means of the distributions of the possible values of Y
lie along a straight line. This is the so-called
population regression line
Y = β0 + β1 X +
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 19/ 23
University of Padua
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 20/ 23
University of Padua
µY = β0 + β1 X
Y = b0 + b1 X + e
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 21/ 23
University of Padua
Some notes
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 22/ 23
University of Padua
Statistics for Management, a.y. 2018/19 - Simple linear regression (1) 23/ 23