Professional Documents
Culture Documents
HSTS423 - Unit 3 Autocorrelation and Generalised Least Squares Estimation (GLS)
HSTS423 - Unit 3 Autocorrelation and Generalised Least Squares Estimation (GLS)
HSTS423 - Unit 3 Autocorrelation and Generalised Least Squares Estimation (GLS)
Objectives
2
1.1. INTRODUCTION TO AUTOCORRELATION 3
Definition 1.1 Let {ut } be a stationary time series. Then the series is said to be
auto-correlated if γ(k) = cov (ut , ut+k ) 6= 0 for some k ≥ 1
Whenever there is serial correlation in the error terms all inference i.e. estimation,
hypothesis testing and forecasting must take into account the effects of auto-correlation
for the conclusions to be valid.
We examine below some common causes of auto-correlation and how they be avoided
or taken into account when making statistical inference.
Thus although the OLS estimator β̂ is still unbiased the for β, the conventional formula
for parameter covariance σ 2 (X0 X)−1 or its sample counterpart s2 (X0 X)−1 , no longer
measures the sampling variance of the OLS estimator β̂. Thus any application of it
is potentially misleading. It is also clear that the β̂ has now larger standard errors.
The minimum variance property of OLS no longer holds. We can now summarize the
undesirable consequences of auto-correlation as follows.
Although there are other consequenses of auto-correlation these are apparently, the
ones that in general have more serious impact on estimation, hypothesis testing and
forecatsing.
It is clear from the foregoing that in econometric modeling there should be routine
checks or tests for auto-correlation. Autocorrelation can of course, take various forms.
It is however, customary in empirical work and in the literature to concentrate on first
order autocorrelation as many empirical studies have shown that first auto-correlation
is apparently the most common form of auto-correlation found in many economic vari-
ables.
A series {ut } is said to have first order auto-correlation if it satisfies the first order
auto-regressive AR(1) model
ut = ρut−1 + t
The properties of an auto-regressive process were presented in Unit 1 and are discussed
in detail in the course on Time Series Analysis. Here it suffices to say that the auto-
correlation function(ACF) of an AR(1) decays either exponentially if 0 ≤ ρ < 1 or in
a dumped cosine wave, if −1 < ρ < 0. The Partial Auto-correlation fuction (PACF)
cuts off i.e. becomes zero after lag 1, that is, from lag 2 on.
It is important to note that in general serial correlation may not be of just simple
auto-regressive form. Consider therefore, some time series [TSM] models which are
frequently encountered in practice e.g. error terms which follows general ARMA(p,q)
model.
6UNIT 1. AUTO-CORRELATION AND GENERALIZED LEAST SQUARES ESTIMATION
1. Examining the correlogram i.e. a plot of the ACF, of the residuals {ût }, against
lag. If there is no auto-correlation then the ACF coeficients should lie within the
95% confidence band
1.96
±√ .
n
where n is the sample size. Otherwise there is auto-correlation of some sort.
The advantage of this graphical test is that it applies not only to first order
auto-correlation, but to all forms of auto-correlation.
2. A more formal test for first order auto-correlation is provided by the Durbin-
Watson test which we discuss in some detail below.
H0 : ρ = 0 versus H1 : ρ 6= 0
The above hypotheses are tested indirectly by testing the hypothesis that that
H0 : µd = 2 versus H1 : µd 6= 2
Pn
(ût − ût−1 )2
t=2P û0 Aû Y0 (I − H)A(I − H)Y
d = n = =
2
t=1 ût û0 û Y0 (I − H)Y
1 −1 0
0 0 ... 0
−1 2 −1 0 ..
Figure 1.1 Lower and Upper 0
distributions . of Durbin-Watson statistic
0 −1 2 −1 0
where A = . and H = X(X0 X)−1 X0 .
.. .
.. .
..
The decision
rule is, if the value of−1d ≤ 2,
2 then−1using an appropriate significance level,
0 ... 0 −1 1
1. reject H0 in favour of H1 i.e. in favour of positive correlation if d < dL .
It is2.easy to show
accept H0Pifthat
dU <ford large
<4− ≈ 2(1
n,d d as − ρ̂) or equivalently
illustrated ρ̂ ≈ 1 − d/2 = (2 − d)/2,
in the diagram.
U
where ρ̂ = 1 − nt=2 ût ût−1 )2 / nt=1 û2t . The diagram below shows the distribution of
P
d. 3. If dL < d < dU the test is considered inconclusive.
The critical values dL and dU are given in the Durbin-Watson tables, found in the
appendix of most text books.
If the value of d > 2, compute d0 = 4 − d and compare this value with the tabulated
values of dL and dU as if one were testing for positive auto-correlation.
1.4. TESTING FOR AUTO CORRELATION: DURBIN-WATSON TEST 7
Note that for the inconclusive case, it is generally incorrect to say that {ut } is nearly
correlated or that there is little auto-correlation. A practical conservative procedure
is to simply reject H0 , as the consequences of assuming no auto-correlation are more
serious than assuming that it is there.
2 2.0
3 2.5
4 2.6
5 2.9
6 3.0
(i) Using the fitted regression Y = 1.64 + 0.24X, compute the residual series {ût }.
(ii) Hence test for first order serial correlation. Use a 5% significance level.
Solution 1.1 ,
(i) The residuals series and its computation are displayed in the following table.
(ii) Suppose that {ut } satisfies ut = ρut−1 + t . Then the hypothesis to be tested is
H0 : ρ = 0
That is the error terms are serially uncorrelated.
The test statistic is
P5 2
t=2 (ût − ût−1 )
d = P5 2
t=1 ût
(0.14 − (−0.12))2 + . . . + (−0.08 − 0.06)2
=
0.0440
0.1104
=
0.0440
= 2.51.
Since d = 2.51 > 2.0 , we compute d0 = 4 − d = 1.49 From the Durbin-Watson
table with α = 5%, k=1 and n=5 we obtain dL = 0.6 and dU = 1.4. Thus there
is no sufficient evidence of negative auto-correlation. That is, we keep keep H0
and conclude that the error terms are serially uncorrelated.
8UNIT 1. AUTO-CORRELATION AND GENERALIZED LEAST SQUARES ESTIMATION
It is important to emphasize that the Durbin-Watson test deals with only one type
of auto-correlation, first order auto-regressive type of auto-correlation. Several auto-
correlation tests are often defined for specific specification cases as is the case in the
Durbin-Watson test. Such tests have some limitations. For example the Durbin-Watson
test assumes that the regressor is non-stochastic and thus is invalid if for instance lagged
values of the dependent appear among the regressors, which is quite common as we
will see later.
Y? = PY and X? = PX
Y ? = X ? β + vt .
which has the desirable property of being the best linear unbiased estimator of β where
β is as in the original model. In the case of first order auto-correlation, it can be shown
that p
1 − ρ2 0 . . . 0
..
−ρ 1 0 .
0 −ρ 1
P=
.. .. ..
. . .
−ρ 1 0
0 ... 0 −ρ 1
Frequently, however, ρ is not known. There are several ways of estimating ρ. The sim-
plest is to use the estmate of ρ̂ = (2 − d)/2 obtained from the Durbin-Watson statistic.
Other methods incude the maximum likelihood estimation, iterative procedures etc.
The resulting estimator β̂ is called a Generalised Least Squares (GLS) or Estimated
1.6. PREDICTION I.E. FORECASTING USING A MODEL WITH AUTO-CORRELATED ERRORS
Generelaised EGLS if ρ is replaced by its sample estimate ρ̂. If the aim of the model
is for example to forecast, then more accurate forecasts are obtained using the GLS
estimated model than with model estimated using Ordinary Least squares.
Suppose that we detected auto-correlation and that we have transformed the data and
obtained estimates of the parameters of the model using GLS. Let Y? , X?1 , X?2 , . . . , X?k , be
the transformed variables or data and let
Xn+1 = (X1(n+1) , X2(n+1) , . . . , Xk(n+1) )0 be the values of the explanatory variables at
time t = n + 1. Then it can be shown that the most efficient prediction of Yn+1 is given
by
In this Unit we have learnt that in practice there are several causes or sources of auto-
correlation. These include omission of relevant explanatory variables, inherent corre-
lation structure in the dependent practices, prolonged effects of external shocks, data
treatment etc. The serious consequences of this phenomenon are inefficient parameter
estimation and low forecasting power of the resulting model. Tests for auto-correlation
can be conducted by examining the correlogram or formally by conducting tests such as
the Durbin Watson test. Compensation for effects of auto- correlation can be achieved
by generalized least Squares, noting however that if auto-correlation is generated by
mis-specification then there can be no effective remedy other than appropriate re-
specification. Further many problem may arrive together, such as auto-correlation and
no-constant i.e heterogeneous variance. These may have to be treated together.
Activity 1.1 .
1. The following table shows data on imports and Gross Natinal Product (GNP).
10UNIT 1. AUTO-CORRELATION AND GENERALIZED LEAST SQUARES ESTIMATION
imports GNP
Zt Xt
3748 21777
4010 22418
3711 22308
4004 23319
4151 24180
4569 24893
4582 25310
4697 25799
4753 25886
5062 26868
5669 28134
5628 29091
5736 29450
5946 30705
6501 32372
6549 33152
6705 33764
7104 34411
7609 35429
8100 36200
Fit the imports model: imports = a + b*GNP + error, to the data and test for
first order auto-correlation.
2. Show that for large n d ≈ 2(1 − ρ̂) where ρ̂ = nt=2 ût ût−1 / nt=1 û2t−1
P P
3. Assuming that ρ = 0.5, compute the OLS estimate of the consumption function
of exercise 1 with the appropriately transformed data. Compare your results with
those obtained in exercise 1.
4. Prove the Gauss-Markov theorem, that is, if the assumptions of the GLM are
satisfied then the Ordinary Least Squares(OLS) estimator of β is efficient among
the class of linear unbiased estimators.
Pn
(û −û )2 y 0 (I−H)A(I−H)y
5. Let d = Pn t 2 t−1
t=2
= where
t=1 ût−1
y 0 (I−H)y
−1
1 0 0 0 ... 0
−1 ..
2 −1 0 0 .
0 −1 2 −1 0
A= . and H = X(X 0 X)−1 X 0
.. 0
−1 2 −1
0 ... 0 −1 1
be the Durbin-Watson test statistic. Show that the eigen-values of A are
λj = 2 [1 − cos (π(j − 1)/n) , j = 1, 2, . . . , n]
Hence prove that 0 ≤ d ≤ 4.
1.7. SUMMARY OF THE UNIT 11
6. The following table shows the annual consumption and the disposable income in
$million for a certain country.
Year C Yd
1957 11378 11617
1958 13012 13297
1959 15263 15790
1960 16873 18017
1961 17764 19314
1962 18857 20198
1963 20074 21512
1964 21439 23124
1965 22833 24724
1966 24205 26174
1967 25307 27219
1968 27020 28915
(b) Estimate the value of ρ and use your estimate of it to transform your original
data
(c) Apply OLS to the transformed data and compare your results with the OLS
estimates obtained from the original sample observations.
Year (ût )
1950 1.0
1951 -1.5
1952 -0.7
1953 -1.3
1954 -4.6
1955 -0.3
1956 -3.1
1957 -5.5
1958 -4.7
1959 -1.3
1960 -4.6
1961 -4.3
1962 1.9
1963 1.9
1964 2.9
1965 -2.6
1966 -2.3
1967 0.9
1968 1.4
1969 3.7
Calculate d and estimate ρ. Do the results support the use of first difference to
estimate β0 and β1 ?
1. Christ, C.F (1966), Econometric Models and Methods, John Wiley, New York.