Professional Documents
Culture Documents
Econometrics Project Tut3 H
Econometrics Project Tut3 H
Econometrics Project Tut3 H
HANOI UNIVERSITY
ECONOMETRICS PROJECT
Table of Contents
I. Introduction ................................................................................................................................. 2
III.Methodology .............................................................................................................................. 4
2. Testing ..................................................................................................................................... 6
1. Multicollinearity ...................................................................................................................... 9
2. Autocorrelation...................................................................................................................... 11
3. Heteroskedasticity ................................................................................................................. 12
References ..................................................................................................................................... 15
Appendices .................................................................................................................................... 16
I. Introduction
In recent years, the international economy integration of Vietnam has sharply developing steps,
with exercising national campaigns and participating global economic organizations. In the
context, human capital is considered as one of the most crucial factors for Vietnam’s long term
revolution, and university students play an indispensable part in the domestic labor pool (Jades,
2015). Regarded as one of the most privileged and global universities in Vietnam, Hanoi
University (HANU) is always proud of well-trained students, contributing the good quality labor
resource to the national economy. Beside specialization knowledge, students here can study
themselves myriad of practical skills, in which skills related to manage their own finance should
be mentioned first and foremost because their math exercise about balancing reasonably
expenditure is not straightforward. Thus, after taking everything into consideration, we decided
to choose and elaborate on the project “THE DETERMINANTS OF HANU STUDENTS’
MONTHLY EXPENDITURE”.
This study utilizes an online survey distributed to students at Hanoi University, for 110
observations, with the aim to provide with more in-depth understanding about some major
factors dominating daily spending of HANU’s students. We hope that arguments and statistics in
the project will be helpful in drawing a strategic plan of expenditure for the time being.
Our problem can be stated as: What are the factors affecting HANU students’ monthly
expenditure?
We come up the issue as we were a genuinely concerned with a variety of factors such as age,
personality traits, knowledge and so on. We narrow down to only 5 variables: family allowances,
income, gender, spending styles and accommodation. Family allowances and income are two
quantitative variables, the rest are qualitative variables. These are highly expected to influence
on HANU students consumption, in order to answer the following questions:
• How these factors affect spending habits?
• Is there any correlation between these factors?
• Are there any errors in the model; if there are, what are remedial measures?
It can be said that family allowances and income, and expenditure are critical elements of the
market economy, and as predicted, the results show that there also a close-knit relationship
between those factors.
In conclusion, based on theoretical foundations and empirical results, 5 factors: income, family
allowances, spending styles, gender and accommodation are chosen to test in our research on
“The determinants of HANU students’ expenditure”.
III. Methodology
The main purpose of this project is to examine the influence level of 5 selected determinants on
the Hanu students’ expenditure per month: their family allowances, their income, their spending
styles, the current accommodation and their gender. Based on mathematical expression, we
construct the following model:
EXPENDITURE = β1 + β2*ALL + β3*INC + β4*ACCOM + β5*STYLES + β6*GENDER
Whereas: β1: intercept
β2, β3, β4, β5, β6: slope coefficients of students’ expenditure: family allowances(ALL),
income(INC), current accommodation(ACCOM) (accom_d4 =1 if accom = “rent” or = 0 if
accom = “others”), spending styles(STYLES) (styles_d5 =1 if styles = “saving” or = 0 if styles =
“others”) , and gender(GENDER) (gender_d6= 1 if gender = “female” or =0 if gender =
“others”) respectively.
• Step 2: After running Eviews software, the most appropriate model was found and used for
some next tests. Testing for functional form was utilized, particularly, four models including lin-
lin, lin-log, log-lin, and log-log were considered. Since logarithm of gender variable was not
identified synchronized with the smallest calculated CV from these models, lin-lin was
considered as the most suitable form.
EXPENDITURE = β1 + β2*ALL + β3*INC + β4*ACCOM + β5*STYLES + β6*GENDER
• Step 3: Some tests were be suggested, such as testing individual partial coefficients (t-test),
testing, the overall significance of all coefficients (F-test), adding/dropping test, etc. With the
goal of taking into consideration of how each independent variable influences on the HANU
students’ expenditure per month, t test was used while the purpose of F-test was to indicate
whether at least one of five variables impacting on the expenditure and so on.
• Step 4: Errors were detected and some remedial measures were given. Because of not being
time-series data, autocorrelation error was eliminated. Multicollinearity means when there are
some functional relationships existing among explained variables whereas
Heteroskedasticity occurs when Var(ui) = σi2 # σ2 . After checking, some methods were
suggested to make them better to interpret.
Table 2: Covariance
The following graphs show the relationship between Expenditure and others factors which are
family support, income, accommodation, spending styles and gender respectively.
It can be seen from the table above that the distribution of 3 independents Accommodation,
Spending styles and Gender are negligible, so that is the reason why the standard deviation are
zero. However, the range of dependent variable and some independent variables like family
support and income are large, hence their standard deviation are much greater than the rest.
2. Testing
a. Testing the functional form of regression model
In order to determine the most suitable form of the model, we consider R-squared as the
conditions because R2 represents “goodness of fit” of the model means that how well the data fit
the model.
The decision rule is the higher R2 of the model, the more preferable model. After data was
processed, we collected the result as follows:
Note: R2 = 0.168115
Note: R2 = 0.156391
7
From Eview tables we can see that the Lin-lin Model has higher R2 than Log-lin Model
(0.168115 versus 0.156391). As noted before, we choose model having higher R2, so we
determine that Lin-lin Model is best one to fit the data and it can explain considerately the
relation between dependent variables and independent variables.
Hence , we estimated the regression line model:
Conclusion:
Hence, at the 5% level of significance, the Income and the Accommodation are significantly
different from 0, whereas the Allowance, Styles and Gender are not.
V. Checking errors
1. Multicollinearity
Multicollinearity arises due to the exact or approximate linear relationship among some or all
explanatory variables which can lead to some consequences:
• OLS estimators have large variances and covariance, making the estimation with less accuracy
• The estimation confidence intervals tend to be much wider, leading to type II error more readily
• The R2 can be very high although the t-statistic of one or more coefficients is statistically
insignificant.
• The OLS estimators and their standard errors can be sensitive to small change in the data.
To detect this problem, we will use the method 1: the auxiliary regression and compute the F-
statistics to make conclusion about multicollinearity or method 2: VIF for conclusion.
=> There is not enough evidence to infer at 5% level of significance that linear relationship not
exists between Allowance and INCOME variable.
Method 2: VIF
EXPENDITURE = β1’’ + β2’’*ALLOW + β3’’*INCOME + u’’
Table7: VIF
From table Eviews above, we have Central VIF= 1.015283 < 10
=> The linear relationship not exists between Allowance and INCOME variable.
Hence, our model has violated the assumption that there is no multicollinearity among
independent variables. As a result, we do not need to drop any of the independent variables
(ALLOW and INCOME)
2. Autocorrelation
*DW test
• Ho: There is no positive autocorrelation
Ha: There is a positive autocorrelation
• From Table 4, we obtain the Durbin-Watson stat being equal to 1.459303. Then, at 5%
level of significance, n=110, k’=5, the lower bound and upper bound are respectively:
dL =1.618, dU=1.755.
• Decision rule: We have: DW stat = 1.459303 < dL = 1.618=> Reject Ho.
• Conclusion: There is an existence of the first order positive autocorrelation
11
3. Heteroskedasticity
One of ten assumptions of simple regression is homoscedasticity or equal variance of u i and the
conditional variances of ui are identical. As it is not satisfied or Var ( ui |Xi)= σi2 # σ2 ,
heteroskedasticity occurs, which results in some bad effects such as the not-best estimators or the
non-reliable F_stat and t-stat, etc. Therefore, it is crucial for us to detect and give remedial
measurements in our paper.
We will use the following formal method: Park test
12
13
Eventually, we can conclude that family allowance and income have significant influence on the
HANU students’ expenditure while the effect of 3 other factors are negligible on the contrary to
our pre-test expectations.
However, during testing process, there also exists some limitations: when running two Lin-Lin
and Lin-Log models, we can see R-squares of both are relatively small, 0.168 and 0.156
respectively. However, low R- squares are not a problem for such studies in the social science
field like ours as human behavior cannot be accurately predicted.
Also, curing autocorrelation problem is the study’s drawback. Despite the effort in using First-
difference transformation and finding effective remedies, we cannot correct this error.
All in all, it is figured out that the undergraduate are most affected by family allowance and
income. Thus, if students want to change their expenditure, they should focus on adjusting two
factors mentioned above.
14
REFERENCE
1. Tang, T. L. P. (1995). The development of a short money ethic scale: Attitudes toward money
and pay satisfaction revisited. Personality and Individual Differences, 19, 809-816
2. Keynes, J. M. (1936). The general theory of employment, interest, and money. London:
Macmillan
3. Penman, S., & McNeil .S .L. (2008). Spending their way to adulthood: Consumption outside
the nest. Otago, New Zealand: University of Otago
15
APPENDICES
2.Table 2: Covariance
16
7. Table 7: VIF
17
18