Professional Documents
Culture Documents
Multiple Regression Materials
Multiple Regression Materials
Multiple Regression Materials
Using the “D70 for Chicken Data” file posted on Campus Web, run a regression of the per capita
consumption of chicken on the price of chicken, the price of beef, and the per capita disposable income,
and then answer the following questions:
a. Write down the estimated equation.
b. What does each estimated coefficient mean?
c. Compare the estimated coefficient of the price of chicken in this model with your
estimated coefficient in the SLM regression example 2.
d. Compare 𝑅 2 of this model to the 𝑅 2 you obtained in the SLM regression example 2 and
explain why they are different.
e. Compare 𝑅 2 and 𝑅̅ 2 for your model and explain why they are different.
f. Using the F-test, test the significance of the model: Are all your explanatory variables
significant?
g. Using the t-test, test the hypothesis that each of your individual estimated coefficients is
significant.
h. Using the F-test, test the joint hypothesis that the price of beef and the per capita
disposable income are jointly significant.
i. Run the following three regressions:
Present the results of these several multiple regressions the proper way, comment on the
significance of each individual estimated coefficient, and comment on the measures of
goodness-of-fit for all models.
Page 1 of 8
Page 2 of 8
Presenting Several Multiple Regression Results:
Regression of per capita consumption of chicken on the price of chicken, the price of beef, and the per
capita disposable income:
Summary Statistics
SER 15.18 8.70 1.82
R2 0.26 0.77 0.99
𝑅̅2 0.23 0.75 0.989
N 30 30 30
Page 3 of 8
Multiple Regression Example 2 (Subprime Mortgages):
Using the “Subprime Mortgage Data” file posted on Campus Web, run a regression of the APR on the FICO
score, the loan-to-value ratio (LTV), income, and home value, and then answer the following questions:
a. Write down the estimated equation.
b. What does each estimated coefficient mean?
c. Using the F-test, test the significance of the model: Are all your explanatory variables
significant?
d. Using the t-test, test the hypothesis that each of your individual estimated coefficients is
significant.
e. What is the 95% confidence interval for FICO score? What does it mean?
Page 4 of 8
Page 5 of 8
Econometrics and Forecasting for Business – ECO627
Before you look at the data or the graphs, ask yourself these two questions:
1. Start with the correlation matrix. Look at the correlation between the response (y variable) and
each explanatory variable. Also, look at the correlations among the explanatory variables.
2. Look at the scatterplots of the response versus each explanatory variable (or at least the ones
that are highly correlated with the response variable). If these scatterplots show linear
relationship between the response and the explanatory variables, then fit the multiple
regression model.
3. Look at the residual plots: The scatterplot of the residuals versus the estimated 𝑌̂ and the
scatterplot of Y versus 𝑌̂.
4. Look at the residual plots of the residuals vs. each explanatory variable.
5. Make sure the residual plots show no patterns nor increase in variance.
6. Make sure the residuals are normally distributed, look at the Histogram of the residuals.
7. Using the F-test, test the overall fit of the model.
8. If the F-test is statistically significant, then use the t-test to test the significance of each
explanatory variable.
Source: Stine and Foster, Univ of Penn, different sources.
Page 6 of 8
Regression Applications
Prepared By
Fatma Abdel-Raouf, Ph. D.
Source: Edward M. Saunders, Jr. (1993). “Stock Prices and Wall Street Weather.” American Economic Review. 76 (1). P. 1337 –
1346. Cited in Studenmud (2011) Using Econometrics: A practical Guide. 6th ed. Pearson Addison-Wesley.
Page 7 of 8
where PRICEi the price at which the ith iPod sold on eBay.
NEWi a dummy variable =1 if the ith iPod was new and 0 otherwise.
SCRATCHi a dummy variable =1 if the ith iPod had a minor cosmetic defect and 0
otherwise.
BIDRSi the # of bidders on the ith iPod.
PERCENTi the percentage of customers of the seller of the ith iPod who gave that
seller a positive rating for quality and reliability in previous transactions.
Source: Leonardo Rezende (2008). “Econometrics of Auctions by Least Squares.” Journal of Applied Econometrics.
November/December 2008. P. 925 – 948. Cited in Studenmud (2011) Using Econometrics: A practical Guide. 6th ed. Pearson
Addison-Wesley.
Alumni Donations:
Your friend at the University at Albany has an on-campus job of making phone calls to alumni asking for
donations to the university’s annual fund. To measure the impact of her calls on fund raising, she
collects data from 50 alums and estimates the following equation:
Where GIFTi the 2011 annual fund donation (in dollars) from the ith alum.
INCOMEi the 2011 income (in dollars) of the ith alum.
CALLi the # of calls to the ith alum asking for a donation in 2011.
Note: If your friend changed the units of INCOME from “dollars” to “thousands of dollars,” its coefficient
will be 1.
Where Gi the grade of the ith student in Romer’s class (A=4, B=3,…)
ATTi the percent of class lectures that the ith student attended
PSi the percent of the problem sets that the ith student completed.
Page 8 of 8