Econometrics
Midterm project
x1 = type
[Output for Price, Score, and Type]
[Histograms: Price, Score]
3. Develop an estimated regression equation
4. Did the estimated regression equation provide a good fit to the data? Explain
→ The R-squared (R²) of the new model is 0.5434, which is below 0.7 (70%). Hence, the
estimated regression equation does not provide a good fit to the actual data.
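As a reminder of what the R-squared figure measures, the sketch below computes it as 1 - SSE/SST from observed and fitted values. The function name `r_squared` and the sample numbers are hypothetical illustrations, not the project's data or software output.

```python
def r_squared(y, y_hat):
    """Coefficient of determination: 1 - SSE/SST."""
    if len(y) != len(y_hat):
        raise ValueError("y and y_hat must have the same length")
    y_bar = sum(y) / len(y)
    # SSE: variation left unexplained by the fitted values.
    sse = sum((yi - fi) ** 2 for yi, fi in zip(y, y_hat))
    # SST: total variation of y around its mean.
    sst = sum((yi - y_bar) ** 2 for yi in y)
    return 1.0 - sse / sst


# Hypothetical observed and fitted values (NOT the project's data).
y_obs = [10.0, 12.0, 15.0, 19.0, 24.0]
y_fit = [11.0, 12.0, 16.0, 18.0, 23.0]
r2 = r_squared(y_obs, y_fit)
print(f"R-squared = {r2:.4f}")
```

A value of 0.5434 would mean that roughly 54% of the variation in the dependent variable is explained by the fitted equation.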
- Test for overall significance: F-test (p-value = 0.0009 < 0.05, the significance level). → We reject
the null hypothesis, which means that there is a significant relationship between the dependent variable and the set of independent variables.
- Test for individual significance: t-test
● x1: p-value = 0.014 < 0.05
● x2: p-value = 0.001 < 0.05
→ All p-values of independent variables are lower than the significance level (0.05),
which means that all independent variables are individually significant.
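The significance tests above can be sketched as follows. The overall F statistic can be recovered from R-squared as F = (R²/p) / ((1 - R²)/(n - p - 1)), and each test is decided by comparing its p-value with the significance level. The sample size is not stated in this section, so `N_HYPOTHETICAL = 20` is an assumption for illustration only; the function names are likewise hypothetical.

```python
def f_statistic(r2, n, p):
    """Overall F statistic from R-squared, n observations, p predictors."""
    df_resid = n - p - 1
    if df_resid <= 0:
        raise ValueError("need n > p + 1")
    return (r2 / p) / ((1.0 - r2) / df_resid)


def is_significant(p_value, alpha=0.05):
    """Reject the null hypothesis when the p-value is below alpha."""
    return p_value < alpha


R2 = 0.5434          # reported in the text
P = 2                # two predictors, x1 and x2
N_HYPOTHETICAL = 20  # ASSUMPTION: the sample size is not given here

f_value = f_statistic(R2, N_HYPOTHETICAL, P)
print(f"F = {f_value:.2f} (under the assumed n = {N_HYPOTHETICAL})")

# p-values as reported in the text.
reported = {"F-test": 0.0009, "x1": 0.014, "x2": 0.001}
decisions = {name: is_significant(pv) for name, pv in reported.items()}
print(decisions)
```

With the reported p-values, every entry in `decisions` is `True`, matching the conclusions drawn in the text.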
The p-value is 0.9825, which is greater than the significance level (0.05). Therefore, we do not reject
the null hypothesis that the residuals have a mean of zero.
→ Assumption 1 satisfied.
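A minimal sketch of the kind of test behind this conclusion, assuming a one-sample t-test of the residual mean against zero (the text does not name the exact procedure). The residual values are hypothetical, and `t_stat_mean_zero` is an illustrative name.

```python
import math
import statistics


def t_stat_mean_zero(residuals):
    """t statistic for H0: the mean of the residuals equals zero."""
    n = len(residuals)
    mean = statistics.fmean(residuals)
    s = statistics.stdev(residuals)  # sample standard deviation (n - 1)
    return mean / (s / math.sqrt(n))


# Hypothetical residuals (NOT the project's data); they sum to zero.
resid = [1.5, -0.5, -2.0, 0.75, 0.25]
t_value = t_stat_mean_zero(resid)
print(f"t = {t_value:.4f}")
```

A t statistic close to zero corresponds to a large p-value, which is the situation reported above (p-value = 0.9825).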
- Assumption 2: The variance of ε (residuals) is the same for all values of the
independent variable.
The p-value equals 0.8692, which is greater than the significance level (0.05). Therefore, we
do not reject the null hypothesis, which means that the variance of the standardized residuals
is constant.
→ Assumption 2 satisfied.
- Assumption 3: The error ε is a normally distributed random variable.
The p-values in both tables are greater than the significance level (0.05). Therefore, we do not
reject the null hypothesis, which means the standardized residuals are normally distributed.
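The text does not name the two normality tests it used. Purely as an illustration of how such a test works, the sketch below implements the Jarque-Bera statistic, which combines sample skewness and kurtosis and is compared with a chi-square distribution with 2 degrees of freedom. This is a swapped-in example and may not be either of the tests behind the reported tables; the data are hypothetical, and the chi-square approximation is rough for small samples.

```python
import math


def jarque_bera(x):
    """Return (JB statistic, asymptotic p-value) for H0: normality."""
    n = len(x)
    mean = sum(x) / n
    # Central moments (population form, divided by n).
    m2 = sum((v - mean) ** 2 for v in x) / n
    m3 = sum((v - mean) ** 3 for v in x) / n
    m4 = sum((v - mean) ** 4 for v in x) / n
    skew = m3 / m2 ** 1.5
    kurt = m4 / m2 ** 2
    jb = n / 6.0 * (skew ** 2 + (kurt - 3.0) ** 2 / 4.0)
    # Survival function of chi-square with 2 df is exp(-x / 2).
    p_value = math.exp(-jb / 2.0)
    return jb, p_value


# Hypothetical standardized residuals (NOT the project's data).
std_resid = [-2.0, -1.0, 0.0, 1.0, 2.0]
jb, p = jarque_bera(std_resid)
print(f"JB = {jb:.4f}, p-value = {p:.4f}")
```

As in the text, a p-value above 0.05 means the null hypothesis of normality is not rejected.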
→ The scatter plot of the standardized residuals has no trend. We can conclude that
the values of residuals are independent.
7. Give examples of confidence intervals and prediction intervals
Example:
The 95% confidence interval and prediction interval for a restaurant with x1 = 1 and x2 = 5:
+ 95% confidence interval: $\hat{y} \pm t_{\alpha/2,\,n-p-1} \times \text{std. error}$
$605641.63 \pm t_{0.025,\,n-3} \times 23012.18$
+ 95% prediction interval:
$605641.63 \pm t_{0.025,\,n-3} \times 53916.36$
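The two intervals can be evaluated numerically once the critical t value is known. The sample size n is not given in this section, so the sketch below assumes n = 20 (17 degrees of freedom, critical value about 2.110) purely for illustration; the point estimate and both standard errors are the ones quoted above. The function name `interval` is hypothetical.

```python
def interval(y_hat, t_crit, std_error):
    """Return (lower, upper) for y_hat ± t_crit × std_error."""
    margin = t_crit * std_error
    return (y_hat - margin, y_hat + margin)


Y_HAT = 605641.63    # point estimate from the text
SE_MEAN = 23012.18   # std. error used for the confidence interval
SE_PRED = 53916.36   # std. error used for the prediction interval
T_CRIT = 2.110       # ASSUMPTION: t(0.025) with 17 df, i.e. n = 20

conf_int = interval(Y_HAT, T_CRIT, SE_MEAN)
pred_int = interval(Y_HAT, T_CRIT, SE_PRED)
print("95% confidence interval:", conf_int)
print("95% prediction interval:", pred_int)
```

Whatever the true n is, the prediction interval is wider than the confidence interval because its standard error also includes the variability of an individual observation around the mean response.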