SESI 11 - CH 13 - SIMPLE-REGRESSION - Levine - Smume6 - ppt13
Microsoft Excel
6th Edition
Chapter 13
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 13-6
Types of Relationships
DCOVA
(continued)
[Figure: scatter plots of Y against X illustrating strong relationships and weak relationships]
Types of Relationships
DCOVA
(continued)
[Figure: scatter plot of Y against X illustrating no relationship]
13.2 Simple Linear Regression Model
DCOVA
$$Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i$$
where
Yi = dependent variable
β0 = population Y intercept
β1 = population slope coefficient
Xi = independent variable
εi = random error term
β0 + β1Xi is the linear component and εi is the random error component.
[Figure: the line Y = β0 + β1X with intercept β0 and slope β1; for a given Xi, the random error εi is the gap between the observed value of Y and the predicted value of Y on the line]
Simple Linear Regression
Equation (Prediction Line) DCOVA
The simple linear regression equation provides an
estimate of the population regression line
$$\hat{Y}_i = b_0 + b_1 X_i$$
where
Ŷi = estimated (or predicted) Y value for observation i
b0 = estimate of the regression intercept
b1 = estimate of the regression slope
Xi = value of X for observation i
Dependent variable (Y) = annual sales (millions of dollars)
Independent variable (X) = square feet (thousands)
[Figure: scatter plot of annual sales against square feet]
ANOVA
             df   SS          MS          F          Significance F
Regression    1   105.7476    105.7476    113.2335   1.82269E-07
Residual     12    11.20668     0.93389
Total        13   116.9543
[Figure: scatter plot with fitted line Y = 0.9645 + 1.6699X, R² = 0.9042]
$$\hat{Y}_i = 0.9645 + 1.6699\,X_i$$
b0 is the estimated average value of Y when the
value of X is zero (if X = 0 is in the range of observed
X values)
Because the square footage of the store cannot be
0, this Y intercept has little or no practical
interpretation.
SSXY = 568
SSX = 142
b1 = SSXY / SSX = 568 / 142 = 4
b0 = 80
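The slope and intercept values above follow from the usual least-squares formulas b1 = SSXY/SSX and b0 = Ȳ - b1·X̄. Below is a minimal Python sketch under that assumption. The function name `fit_line` and the five-point data set are made up for illustration; they are not the data behind the values quoted in the slides.

```python
def fit_line(x, y):
    """Return (b0, b1, ssxy, ssx) for the least-squares line Y-hat = b0 + b1*X."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # SSXY: sum of cross-products of deviations; SSX: sum of squared X deviations
    ssxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    ssx = sum((xi - x_bar) ** 2 for xi in x)
    b1 = ssxy / ssx            # estimated slope
    b0 = y_bar - b1 * x_bar    # estimated intercept
    return b0, b1, ssxy, ssx

# Hypothetical data lying exactly on y = 1 + 2x
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [3.0, 5.0, 7.0, 9.0, 11.0]
b0, b1, ssxy, ssx = fit_line(x, y)
print(b0, b1)

# Slope from the sums of squares quoted in the slides
slide_b1 = 568 / 142
print(slide_b1)
```

Because the hypothetical points lie exactly on a line, the fitted slope and intercept reproduce that line, which makes the function easy to sanity-check before applying it to real data.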
DCOVA
SST = total sum of squares (Total Variation)
Measures the variation of the Yi values around their mean Ȳ
SSR = regression sum of squares (Explained Variation)
Variation attributable to the relationship between X and Y
SSE = error sum of squares (Unexplained Variation)
Variation in Y attributable to factors other than X
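For a least-squares line the three measures are linked by SST = SSR + SSE. The Python sketch below computes all three and checks that identity. The helper name `sums_of_squares` and the sample data are hypothetical; the last lines only re-add the SS column of the ANOVA output quoted in this deck.

```python
def sums_of_squares(x, y):
    """Return (sst, ssr, sse) for the least-squares line fitted to x and y."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    b1 = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
          / sum((xi - x_bar) ** 2 for xi in x))
    b0 = y_bar - b1 * x_bar
    y_hat = [b0 + b1 * xi for xi in x]
    sst = sum((yi - y_bar) ** 2 for yi in y)                 # total variation
    ssr = sum((yh - y_bar) ** 2 for yh in y_hat)             # explained variation
    sse = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))    # unexplained variation
    return sst, ssr, sse

# Hypothetical data, roughly linear with some noise
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
sst, ssr, sse = sums_of_squares(x, y)
print(sst, ssr + sse)

# The same identity on the ANOVA figures quoted in the slides
anova_total = 105.7476 + 11.20668
print(anova_total)  # close to the reported Total SS of 116.9543
```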
DCOVA
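[Figure: for a given Xi, the distances between the observed Yi, the predicted Ŷi on the regression line, and the mean Ȳ]
$$SST = \sum (Y_i - \bar{Y})^2$$
$$SSR = \sum (\hat{Y}_i - \bar{Y})^2$$
$$SSE = \sum (Y_i - \hat{Y}_i)^2$$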
Coefficient of Determination, r²
DCOVA
The coefficient of determination is the portion of
the total variation in the dependent variable that is
explained by variation in the independent variable
The coefficient of determination is also called r-squared and is denoted as r²
note: 0 ≤ r² ≤ 1
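Since r² is the explained share of the total variation, it can be computed as SSR/SST. The short Python sketch below applies that ratio to the sums of squares in the two ANOVA outputs quoted in this deck. The function name `r_squared` is an illustrative choice, not something from the slides.

```python
def r_squared(ssr, sst):
    """Coefficient of determination: explained variation over total variation."""
    if sst <= 0:
        raise ValueError("SST must be positive")
    return ssr / sst

# Store-size example: SSR = 105.7476, SST = 116.9543
r2_store = r_squared(105.7476, 116.9543)
print(round(r2_store, 4))   # 0.9042, matching the reported R Square

# Ten-observation example: SSR = 2272, SST = 2442
r2_second = r_squared(2272, 2442)
print(round(r2_second, 4))  # 0.9304, matching the reported R Square
```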
[Figure: scatter plots of Y against X with r² = 1]
Examples of Approximate r² Values
DCOVA
r² = 0
No linear relationship
between X and Y:
$$S_{YX} = \sqrt{\frac{SSE}{n-2}} = \sqrt{\frac{\sum_{i=1}^{n} (Y_i - \hat{Y}_i)^2}{n-2}}$$
Where
SSE = error sum of squares
n = sample size
ANOVA
             df   SS          MS          F          Significance F
Regression    1   105.7476    105.7476    113.2335   0.0000
Residual     12    11.2067      0.9339
Total        13   116.9543
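The standard error of the estimate can be recovered from the Residual row of this ANOVA output, since S_YX is the square root of SSE/(n - 2). Here is a minimal Python sketch, assuming n = 14 observations as implied by the 12 residual degrees of freedom; the function name `std_error_of_estimate` is hypothetical.

```python
import math

def std_error_of_estimate(sse, n):
    """S_YX = sqrt(SSE / (n - 2)) for simple linear regression."""
    if n <= 2:
        raise ValueError("need more than two observations")
    return math.sqrt(sse / (n - 2))

s_yx = std_error_of_estimate(sse=11.2067, n=14)
print(round(s_yx, 4))  # 0.9664
```

The same number is the square root of the Residual mean square (0.9339), so the MS column gives a quick cross-check.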
$$S_{b_1} = \frac{S_{YX}}{\sqrt{SSX}} = \frac{S_{YX}}{\sqrt{\sum (X_i - \bar{X})^2}}$$
where:
Sb1 = Estimate of the standard error of the slope
$S_{YX} = \sqrt{SSE/(n-2)}$ = Standard error of the estimate
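As a rough numeric check of the slope's standard error, the Python sketch below plugs in figures quoted elsewhere in this deck: SSX = 142, SSE = 170 and n = 10. It assumes those figures all belong to the same ten-observation example, which the slides do not state outright. The function name `std_error_of_slope` is illustrative.

```python
import math

def std_error_of_slope(sse, n, ssx):
    """S_b1 = S_YX / sqrt(SSX), with S_YX = sqrt(SSE / (n - 2))."""
    s_yx = math.sqrt(sse / (n - 2))
    return s_yx / math.sqrt(ssx)

# Assumed to come from one example: SSE = 170, n = 10, SSX = 142
s_b1 = std_error_of_slope(sse=170, n=10, ssx=142)
print(s_b1)  # close to the 0.3868 quoted in the deck's second t test
```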
Inferences About the Slope:
t Test
DCOVA
t test for a population slope
Is there a linear relationship between X and Y?
Null and alternative hypotheses
H0: β1 = 0 (no linear relationship)
H1: β1 ≠ 0 (linear relationship does exist)
Test statistic
$$t_{STAT} = \frac{b_1 - \beta_1}{S_{b_1}}, \qquad \text{d.f.} = n - 2$$
where:
b1 = regression slope coefficient
β1 = hypothesized slope
Sb1 = standard error of the slope
$$t_{STAT} = \frac{b_1 - \beta_1}{S_{b_1}} = \frac{1.6699 - 0}{0.1569} = 10.6411$$
d.f. = 14 - 2 = 12
α/2 = .025 in each tail
Decision: Reject H0
p-value
Decision: Reject H0, since p-value < α
There is sufficient evidence that the
size of the store affects annual sales.
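The decision above can be reproduced with a few lines of Python. This is a sketch under stated assumptions: the critical value 2.1788 is the standard two-tail t-table entry for α = .05 with 12 degrees of freedom and does not appear in the text above, and the names `t_stat` and `T_CRIT` are illustrative. Because the inputs are the rounded slide values, the result is about 10.643 rather than exactly 10.6411.

```python
def t_stat(b1, s_b1, beta1_h0=0.0):
    """t statistic for H0: beta1 = beta1_h0 in simple linear regression."""
    return (b1 - beta1_h0) / s_b1

t = t_stat(b1=1.6699, s_b1=0.1569)

# Assumption: two-tail critical value for alpha = .05, d.f. = 14 - 2 = 12,
# taken from a standard t table (not from the slides)
T_CRIT = 2.1788

reject_h0 = abs(t) > T_CRIT
print(t, reject_h0)
```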
F Test for Significance
DCOVA
F test statistic:
$$F_{STAT} = \frac{MSR}{MSE}$$
where
$$MSR = \frac{SSR}{k}, \qquad MSE = \frac{SSE}{n - k - 1}$$
and FSTAT follows an F distribution with k numerator and (n - k - 1) denominator degrees of freedom
Regression Statistics
Multiple R          0.9509
R Square            0.9042
Adjusted R Square   0.8962
Standard Error      0.9664
Observations        14

ANOVA
             df   SS          MS          F          Significance F
Regression    1   105.7476    105.7476    113.2335   0.0000
Residual     12    11.2067      0.9339
Total        13   116.9543

$$F_{STAT} = \frac{MSR}{MSE} = \frac{105.7476}{0.9339} = 113.2335$$
With 1 and 12 degrees of freedom; Significance F is the p-value for the F test.
DCOVA
H0: β1 = 0
H1: β1 ≠ 0
α = .05
Test statistic:
$$F_{STAT} = \frac{MSR}{MSE} = 113.2335$$
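The F statistic can be rebuilt from the sums of squares in the ANOVA output. The Python sketch below uses the unrounded residual sum of squares 11.20668 quoted earlier in the deck, with k = 1 independent variable and n = 14 observations. The function name `f_stat` is an illustrative choice. In simple linear regression the F statistic equals the square of the slope's t statistic, which the last lines check against the reported t value.

```python
def f_stat(ssr, sse, n, k=1):
    """F = MSR / MSE with MSR = SSR / k and MSE = SSE / (n - k - 1)."""
    msr = ssr / k
    mse = sse / (n - k - 1)
    return msr / mse

f = f_stat(ssr=105.7476, sse=11.20668, n=14, k=1)
print(f)  # close to the reported 113.2335

# With one independent variable, F equals t squared
t_reported = 10.6411
print(t_reported ** 2)
```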
$$t_{STAT} = \frac{b_1 - \beta_1}{S_{b_1}} = \frac{4 - 0}{0.3868} = 10.3401$$
d.f. = 10 - 2 = 8
α/2 = .025 in each tail
Decision: Reject H0
Regression Statistics
Multiple R          0.9646
R Square            0.9304
Adjusted R Square   0.9217
Standard Error      4.6098
Observations        10

ANOVA
             df   SS      MS       F          Significance F
Regression    1   2272    2272     106.9176   0.0000
Residual      8    170      21.25
Total         9   2442

$$F_{STAT} = \frac{MSR}{MSE} = \frac{2272}{21.25} = 106.9176$$
With 1 and 8 degrees of freedom; Significance F is the p-value for the F test.
DCOVA
H0: β1 = 0
H1: β1 ≠ 0
α = .05
Test statistic:
$$F_{STAT} = \frac{MSR}{MSE} = 106.9176$$