Professional Documents
Culture Documents
Solutions Manual To Accompany Business Forecasting With Business Forecastx 6th Edition 9780073373645
Solutions Manual To Accompany Business Forecasting With Business Forecastx 6th Edition 9780073373645
Solutions Manual To Accompany Business Forecasting With Business Forecastx 6th Edition 9780073373645
CHAPTER 5
FORECASTING WITH MULTIPLE REGRESSION
CHAPTER OVERVIEW
This chapter extends our discussion on linear regression to multiple regression models. In
addition, qualitative factors such as seasonality are modeled using dummy variables.
LEARNING OBJECTIVES
NOTES TO TEACHERS
This chapter extends the classical linear regression model to multiple regression and its
accompanying modifications. Emphasis is placed on using dummy variables to account for data
seasonality.
2. Having more than one independent variable brings up the issue of overlapping causation
and the problem of multicollinearity. Accordingly, the correlation among independent variables
in the model is now an important concern in the model selection process. When two variables
are a linear combination of each other, OLS fails and no reliable estimates are obtained. Near
multicollinearity, on the other hand, arises when two independent variables are highly, but not
perfectly, correlated. This causes OLS estimates to be imprecise, i.e., having large standard
errors. Finally dummy variables are introduced as a way of measuring qualitative attributes in
regression.
5-1
It is also important to point out to the student how to correctly interpret the estimated coefficients
on a set of dummy variables. In the text example on data seasonality, dummy variables act to
shift the estimated intercept relative to the base period (in this case quarter one). Accordingly,
dummy variables are interpreted according to some based period, which may be arbitrarily
selected by the forecaster. The base period is the period not accounted for by a dummy variable
(e.g., if quarters two, three, and four are represented by dummy variables and there is no quarter
one dummy variable, then quarter one is the base period; if two quarters are not represented by
dummy variables then the average of the two “missing quarters” is the base period).
Accordingly, dummy variables allow forecasters to measure the effects of qualitative factors on
quantitative random variables.
5-2
Chapter 05 - Forecasting with Multiple Regression
1. In evaluating multiple regression, the adjusted R-squared (sometimes called the multiple
coefficient of determination) should always be considered. The reason for the adjustment is that
adding another independent variable will always increase the unadjusted R-squared, even if the
variable has no meaningful relation to the dependent variable. To get around this and show only
meaningful changes in R-squared; an adjustment is made to account for a loss in degrees of
freedom as additional independent variables are added to the model.
2. The estimated coefficients of the model for jewelry sales and their respective t-ratios are
reported in the table below.
Step #1: For the most part, the signs of the level variables are of the correct sign. Sales increase
with real personal disposable income (DPI). Sales decrease with increases in the unemployment
rate. The model also reveals a September 11th effect with the negative sign on the 911 variable
(although it is only significant at the 87% level)
The dummy variables are interpreted as follows: their magnitude is the difference between the
base period (the period left out; in this case, period 1 or January) and the measured period, i.e.,
they are always compared with a base period. Of the eleven seasonal dummy variables, all are
significant at least at the 90% level (and most at the 95% level). Since all the seasonal dummy
variables are positive in sign, sales in each of these months can be expected to be above January
sales levels.
5-3
Chapter 05 - Forecasting with Multiple Regression
Step #2: All of the estimated coefficients of the independent variables are significantly different
from zero, using a one-tailed t-test, at the 5 percent level with the exception of the September
11th variable and the March dummy variable. Note carefully that we do not check the t-statistic
for the constant term as part of this test.
Step #3: The adjusted R-squared is 95.51%, suggesting that the model explains 95.51 percent of
the variability in jewelry sales. This is substantially higher than previous model versions and
suggests that the model above is a candidate for forecasting JS. The Durbin-Watson statistic is
1.79 (close to two), suggesting that serial correlation is not a problem with this model. Finally,
the significance of the R-squared statistic is tested with the F-test (the calculated F-statistic is
218.36), in which we reject the null of no model fit at the 99% level of confidence.
In conclusion, the regression model with additional variables appears to explain jewelry sales
fairly well based on in-sample results. This is shown in Table 5-9 in which the RMSE for the in-
sample data (RMSE = 214.56) is lower than for the two independent variable regression (RMSE
= 1,008.87) also shown in Table 5-9 .
3. A dummy variable has a value of either zero or one. It is zero if the event does not exist
for that observation, and one, if the event does exist. A dummy variable is a special type of
variable that is used to effectively account for the impact of seasonality (or other qualitative
attributes). For example, you might use the coefficients on dummy variables to measure
seasonality of ski equipment sales. For the fourth and first quarters of the calendar year, you
would expect positive signs, however during the second and third quarters we expect the dummy
coefficients to have negative signs, depending of course on the base period. This would be
expected, since the demand for ski equipment increases during the fall and winter and declines
during the spring and summer.
4. The model for miles per gallon is summarized by the following results:
a) The t-ratios were calculated by dividing each coefficient by the corresponding standard error.
For example, the t-ratio for US is:
(4.64/2.48) = 1.87.
5-4
Solutions Manual to accompany Business Forecasting with Business ForecastX 6th edition 97800
b) The signs on all five independent variables should be evaluated according to one’s
expectations. In some cases such as for cubic inch displacement (CID) and US (for cars made in
the United States) differing arguments could be made. For D (diesel), M4 and M5 (manual 4 and
5 speed transmissions) positive signs would be expected. Given the large sample size of 120, the
critical value of the t-ratio would be 1.645 for one-tailed tests and 1.96 for two-tailed tests at a
95% confidence level. Since all of the calculated t-ratios shown are above these critical values,
we can conclude that all five independent variables are influential in determining MPG. Finally,
the adjusted R-square of .569 indicates that 56.9% of the variation in miles per gallon is
accounted for by variations in the independent variables included in the model.
5-5