Professional Documents
Culture Documents
Business Statistics: Fourth Canadian Edition
Business Statistics: Fourth Canadian Edition
Chapter 20
Multiple Regression
ŷ b0 b1x
For multiple regression, we write the regression model with more
predictor variables:
yˆ b0 b1x1 b2 x2 bk xk
Table 20.2 Multiple regression output for the linear model predicting Price
from Bedrooms and Living Area.
Variable Coeff SE(Coeff) t-ratio P-value
Intercept 20986.09 6816.3 3.08 0.0021
Bedrooms −7483.10 2783.5 −2.69 0.0073
Living area 93.84 3.11 30.18 ≤0.0001
Multiple Regression:
• Residuals: e y yˆ (as with simple regression)
• Degrees of freedom: df n k1
n = number of observations
k = number of predictor variables
• Standard deviation of residuals:
y yˆ
2
se =
n k 1
Correct answer:
• “increase” if “Bedrooms” is the only predictor (“more
bedrooms” may mean “bigger house”, after all!)
• “decrease” if “Bedrooms” increases for fixed Living Area
(“more bedrooms” may mean “smaller, more-cramped
rooms”)
Figure 20.5 A histogram of the residuals shows a unimodal, symmetric distribution, but the tails seem a
bit longer than one would expect from a Normal model. The Normal probability plot confirms that.
Confidence interval:
Table 20.3 Typical ANOVA table in multivariate regression analysis. The table
indicates the formulas that are used in the software to produce numerical results.
Degrees of Sum of
Blank Freedom (df) squares Mean Square F-Ratio P-Value
M S R = start fraction S S R over F = M S R over
Regression K SSR k end fraction MSE P
(explained
variability)
N minus k minus 1 M S E = start fraction S S E over
Errors SSE n minus k minus end fraction Blank Blank
(unexplained n−k−1
variability)
Total (Sum of n−1 SSTotal Blank Blank Blank
Squares, Total)
2 k n 1 SSE / (n k 1)
R 2
adj R 1 .
n 1 n k 1 SST / ( n 1)