Professional Documents
Culture Documents
Goodness of Fit: Squares (ESS) To The Total Sum of Squares (TSS)
Goodness of Fit: Squares (ESS) To The Total Sum of Squares (TSS)
Goodness of Fit: Squares (ESS) To The Total Sum of Squares (TSS)
Any two variables (even with no connection in real life) will provide you with some regression
equation, and you can manipulate the coefficients by choice of units of measurement of X and Y.
How can you tell good from bad regressions?
R² (the coefficient of determination), the square of the correlation coefficient, is one way. It
measures the ‘goodness of fit’ of the equation. The closer the observations are to the line, on
average, the higher will be R².
0 R² 1
To calculate R2 in simple regression, just calculate r² = 0.824² = 0.678. This means that about
68% of variation of Y is explained by variation in X. This is a respectable result.
Alternatively, R2 can be calculated as the ratio of two sums of squares, the explained sum of
squares (ESS) to the total sum of squares (TSS).
ESS
R2
TSS
Each observation’s deviation from the overall mean (of Y) contributes to the TSS. These
deviations can be divided into a part due to the regression ( Yˆ Y ) and a random, or residual,
part ( Y Yˆ ). Hence we have:
Y Y Yˆ Y Y Yˆ
If we square each term and sum over all observations, we obtain
Y Y = Ŷ + Y Ŷ
2 2
Y
2
i i i i
Prediction
The calculated regression line can be used for prediction (making the assumption of a zero error),
for values of X not in the sample. For X = 3, the predicted value of Y is 40.711 - 2.7 3 = 32.6