Professional Documents
Culture Documents
ECONOMETRICS
ECONOMETRICS
ECONOMETRICS
Multicollinearity
Def : high correlation exists among two or more independent variables can affect the
regression results, because of instable coeficients, coeff signs may not match prior
expectations.
Example: Pie sales = 100 + 25*Temperature (°C) + Temperature (F)
Sales: 350 units
Search for the higher VIF and eliminate it, than we run the regression again, and if there is
another too high VIF, we do it again until all the variables are below 5.
If the VIF is greater than 5, we should eliminate the variable
Analyse of R2: the selected variables explain X% of the model, so they are significant
R² (adj) should be close to R² otherwise not good
Global sig. : F-test P-value < 5%
FICHE FINAL
Linear regression
SST (total of Sum Square) = SSR (Sum of Square Regression) + Sum of Square Error
So SST = SSR + SSE
From that we can calculate R² = SSR/SST ou R² = SSR/ (SSR + SSE)
0 < r2 < 1 (perf linear relationship)
Adjusted R2 shows the proportion of variation in Y explained by all X. If we want to know
which model is the best, it is smaller than R²
LINE assumptions
Linearity and Independence
Linear : look at the residuals by predicted for…(variable) ; if no distinguishing shape, good,
Independence : look at the Scatter plot and we shouldn’t distinguish any shape either
around 0.
Searching for outliers Cook’s distance, if not many, we can keep them
Normality : graph from linear regression, showing the residuals as an histogram, the curve
should be similar to the reference curve Kolmogorov-Smirnov Test (table distribution of
analysis : residuals …) we have X% chance (multiply value by 100 to have a percentage and
must be under 10%) to reject H0, if small percentage, reject H1 and keep H0 distrib of
residuals is normal
(can also be detected from the 2 other test, CvM and AD)
+QQPlots for graph assumption, must follow the line
Equal variance or Homoscedasticity : no specific shape in the distribution
run a White test : compare the (R2 value * nmb of observations) and the Chi2 value
if White test stat is < chi2 value, we don’t reject H0 regression model is not subject to
heteroscedasticity
Consequences : sig test too high or too low, stand erro biaised