Co Integration

Cointegration
Kevin Kotzé
Contents
1. Introduction
2. Defining Cointegration
3. Cointegration and Common Trends
4. Error Correction Models
5. Additional features of cointegration
6. Engle-Granger test
7. Johansen Procedure
2 / 42
Introduction
Most macroeconomic and many financial variables are nonstationary
They drift upwards over time and often exhibit characteristics of a stochastic trend
Engle & Granger proposed that the combined integration order of several variables
may be lower than their individual integration order
Allows for the analysis of nonstationary data, where we are able to retain
information relating to the level
Facilitates an equilibrium analysis, where variables exhibit departures from

nonstationary trend (steady-state)
3 / 42
Introduction
Granger received a Noble Prize for this idea in 2003 "for methods of analysing
economic time series with common trends"
He had long been puzzling over the problem of integrated variables and at a seminar
Hendry mentioned that the "difference between integrated series might be
stationary"
Granger did not believe him: My response was that it could be proved that he was
wrong, but in attempting to do so, I showed that he was correct, and generalised it to
cointegration...
4 / 42
Cointegration defined
In most cases, when the variables y and y are nonstationary I (1) variables, a
1,t 2,t
linear combination of these variables will also be nonstationary
However, in a few cases the linear combination of the variables can be stationary
This occurs when the variables share the same stochastic trend
The effect of the common stochastic trend may be contained when the variables are
combined
In these cases, we say that the variables are cointegrated
5 / 42
If both y 1,t
and y 2,t
are I (1), they have stochastic trends
We could rearrange the linear regression model, such that
ut = y1,t − β1 y2,t
If u is I (0), then by definition the combined y

t 1,t
− β1 y2,t is also stationary
While both y and y have stochastic trends
1,t 2,t
We say that the variables y and y are cointegrated

1,t 2,t
These stochastic trends are related through β , which contains this feature of the
1
data
If u is I (1), then y
t 1,t
and y 2,t
are not cointegrated and regressing y 2,t
on y 1,t
will yield
a spurious result
6 / 42
200 Different Stochastic Trend Common Stochastic Trend
200
150
150
100
100
y1,y2
y1,y2
50
50
0
0
0 100 200 300 400 500 0 100 200 300 400 500
20
20
10
10
resids
resids
0
0
-10
-10
-20
-20
0 100 200 300 400 500 0 100 200 300 400 500
Figure : Unit roots, spurious relations & common stochastic trends

y 1,t = 0.1 + μy,1t + υy1t
and y = 0.3 + μ + υ 2,t y,2t y2t
7 / 42
Using matrix algebra, if we let y t
= (y1,t , y2,t )
′
denote a (2 × 1) vector of I (1)
variables, and β = (1, −β ) 1
′
It follows that the relationship β ′

yt = y1,t − β1 y2,t
Then the system is cointegrated when β ′

yt ∼ I (0)
In this case the variables in the y vector share a common stochastic trend and will
t
drift together in long-run equilibrium
The vector β is termed a cointegrating vector
When components of y are integrated of order d and the reduction in the order of
t
the combined variables is b, then we note that y ∼ CI (d, b) t
8 / 42
Cointegration and Common Trends
Stock & Watson (1988) propose the following decomposition for the I (1) variables
y1,t = μy + υy , y2,t = μy + υy
1,t 1,t 2,t 2,t
where μ is the random walk component representing the trend in variable y , and
it t
υit is the stationary component
We are then able to multiply y 1,t

by β and y
1 2,t
by β to yield
2
β1 y1,t = β1 μy + β1 υ y , β2 y2,t = β2 μy + β2 υ y
1,t 1,t 2,t 2,t
9 / 42
If these variables are CI (1, 1) a linear combination of these variables yields;
β1 y1,t + β2 y2,t = β1 (μy + υy ) + β2 (μy + υy )

1,t 1,t 2,t 2,t
= (β1 μy + β2 μ y ) + (β1 υy + β2 υ y )
1,t 2,t 1,t 2,t
If the errors, (β 1
υy
1,t
+ β2 υ y
2,t
) are stationary
And the linear combination of the variables are stationary β 1 y1,t + β2 y2,t
Then the stochastic trends (β 1 μy1,t + β2 μ y

2,t
) must vanish
−β2 μy
Hence, for y and y to be CI (1, 1), μ

2,t
1,t 2,t y1,t =

β1
−β2
This implies that they must have the same stochastic trend up to the scalar β1
10 / 42
Stock & Watson's essential insight is that the parameters in the cointegrating vector
must purge the trend from the linear combination
Such a cointegrating vector is unique up to the scalar

−β2
β1
This can also be extended to the n variable case
y t = μ t + ut
where y is a vector {y , y , … , y } containing various I (1) variables and μ is a

t 1t 2t nt t
vector of stochastic trends (μ , μ , … , μ ), and u is an n × 1 vector of irregular

1t 2t nt t
components
11 / 42
If we can then express one trend as a linear combination of other trends it means
that there exists a vector β such that
β1 μ 1 + β2 μ 2 + … + βn μ n = 0
t t t
where we once again multiply though by β to get βy t

= βμt + βut
since the linear combination of all βμ t

= 0
we are left with βy t

= βut , where both sides are stationary
12 / 42
Cointegration and Equilibrium
A cointegration model may make use of the term equilibrium which refers to the
existence of a long-run relationship
This can only occur if there is a common stochastic trend amongst the variables
i.e. two variables share a common equilibrium path
These variables will periodically move away from the equilibrium path, but the effect
of this will not be permanent (i.e. the errors are stationary)
The variables return towards the equilibrium path over time
The residuals from the cointegrated model are then described as equilibrium errors
13 / 42
Error Correction Models
A cointegrating relationship defines an equilibrium relationship
Time paths of cointegrated variables are influenced by the extent of any deviation
from long run equilibrium
In a cointegrated model variables return to the equilibrium value

Cointegrated system has an error correction representation
Engle & Granger formalised the connection between this dynamic response to the
errors and co-integration in the Engle-Granger representation theorem
14 / 42
For example, if P and P are cointegrated share prices
1 2
Assume:
gap between the prices is relatively large when compared to the long run
equilibrium values (i.e. some dis-equilibria)
the low priced share P must rise relative to the high priced share P .
2 1
This may be accomplished by:

↑ in P or ↓ in P
2 1
↑ in P with larger ↑ in P
1 2
↓ in P with smaller ↓ in P
1 2
15 / 42
The OLS regression then takes the form,
P1,t = β1 P2,t + ut
When the errors are stationary they may be expressed as,
ut = ϕ1 ut−1 + εt with |ϕ1 | < 1
Hence combining the two where, u t

= P1,t − β1 P2,t ,
P1,t − β1 P2,t = ϕ1 (P1,t−1 − β1 P2,t−1 ) + εt
P1,t = β1 P2,t + ϕ1 (P1,t−1 − β1 P2,t−1 ) + εt
16 / 42
Adding and subtracting P 1,t−1
and P 2,t−1
on both sides
ΔP1,t = −(1 − ϕ1 )(P1,t−1 − β1 P2,t−1 ) + (β1 ΔP2,t + ε1,t )
= α(P1,t−1 − β1 P2,t−1 ) + ε1,t
where α = −(1 − ϕ1 ) , while ΔP 2,t

is stationary and ε 1,t
= (β1 ΔP2,t + ε1,t )
Note that large persistence in the autoregressive error would imply a slow speed of
adjustment
17 / 42
To describe the manner in which the variables return to equilibria use an ECM
Illustrates how the variables are influenced by deviations from equilibrium
If we assume that both share prices are I (1) and;
ΔP1 = α1 (P2,t−1 − β1 P1,t−1 ) + ε1,t
ΔP2 = α2 (P2,t−1 − β1 P1,t−1 ) + ε2,t
Long term equilibrium described by (P 2

− β1 P1 ) , which is stationary when variables
are CI (1, 1)
If P is I (1) then ΔP is stationary

1 1
When variables are CI (1, 1), the term ε 1,t

is stationary
18 / 42
Hence the two share prices must be cointegrated with the vector (1, −β 1
)
′
, when
they are of the order CI (1, 1)
The parameters α and α are speed of adjustment parameters that describe how
1 2
changes to share prices react to past deviations from the equilibrium path in the
respective share prices
Small values of α imply a relatively unresponsive relationship, where it would take a

i
long time to return to equilibrium
19 / 42
This can be generalised to include lagged changes of both equations,
K L
Δy1,t = γ0 + α1 [y1,t−1 − β1 y2,t−1 ] + ∑ ζ1,i Δy1,t−1 + ∑ ζ2,j Δy2,t−1 + εy ,t

1
i=1 j=1
K L
Δy2,t = η0 + α2 [y1,t−1 − β1 y2,t−1 ] + ∑ ξ1,i Δy2,t−1 + ∑ ξ2,j Δy1,t−1 + εy ,t

2
i=1 j=1
This is a representation for a VECM
If both α and α equal zero there is:

1 2
no equilibrium relationship
no error-correction
no cointegration
20 / 42
Autoregressive distributed lag model
The error correction model may also be represented by an autoregressive distributed
lag (ARDL) model
Consider the example,
y1,t = ϕ1 y1,t−1 + ϕ2 y2,t + εt
Subtract y 1,t−1
and ϕ 2
y2,t−1 from both sides,
Δy1,t = ϕ2 Δy2,t − (1 − ϕ1 )y1,t−1 + ϕ2 y2,t−1 + εt
ϕ2
= ϕ2 Δy2,t − (1 − ϕ1 )(y1,t−1 − y2,t−1 ) + εt
1 − ϕ1
21 / 42
Autoregressive distributed lag model
Note that the long-term steady-state, with y 1,t
= y1,t−1 = ȳ
1
, may then be described
as,
ȳ = ϕ1 ȳ + ϕ2 ȳ
1 1 2
Such that,
ϕ2
ȳ = ȳ
1 2
1 − ϕ1
Hence the relationship between y and y is described by

ϕ2
1 2
1−ϕ1
This is equivalent to the ECM that we derived earlier
22 / 42
Worthwile Noting ...
Cointegration refers to linear combinations of non-stationary variables
It's possible that there are nonlinear cointegrating relationships, but we don't
know how to test for this.
We can however model regime-switching cointegrating relationships (Balke &
Fomby, 1997)
Cointegrating vectors are unique up to a scalar, for every β 1
, β2 , … there exists
λβ , λβ , …, where λ is the scalar
1 2
All variables must be integrated of the same order

usually a set of I (d) variables are not cointegrated
when two variables are integrated of different orders they cannot be
cointegrated
it is possible to have multicointegration (i.e. b = 2)
23 / 42
Worthwile Noting ...
If y has n nonstationary components there may be n − 1 linear cointegrating
t
vectors
If y has two variables then there can only be one cointegrating vector
t
The number of cointegrating vectors is called the cointegrating rank
If you have three variables and two are I (2) and one is I (1):
The two I (2) variables may be CI (2, 1)
The remaining I (1) variable may share a common stochastic trend with other
two variables, whereupon the system will be stationary
The chance of this occurring are very small
Most of the literature focuses on CI (1, 1) cases but many other possibilities exist
24 / 42
The Engle-Granger Procedure - Step 1
Suppose y and y are possibly I (1) and you want to know whether they are
1,t 2,t
cointegrated
Pretest the variables for their order of integration
Use Augmented Dickey Fuller or similar test
If both are stationary there is no cointegrating vector
If variables of different orders there is no cointegrating vector
For three or more variables they can be of different orders

A group could be C(2, 1) and this group may be cointegrated with a further set
of I (1) variables.
25 / 42
If both are I (1) estimate LR relationship;
y1,t = β0 + β1 y2,t + ut
If the variables are cointegrated OLS yields a super consistent estimate of β and β
0 1
The variables are cointegrated when u

^ is stationary
t
To test for stationarity construct an Augmented Dickey Fuller test
Δu
^ t = π1 u
^t−1 + ∑ γj Δu
^t−j + εt
j=1
where π 1
= (1 − ϕ)
If we cannot reject π 1
= 0 then they are not cointegrated
If we can reject π 1
= 0 then they are cointegrated
26 / 42
Use tables from Engle & Granger (1987) or Engle & Yoo (1987)
If the OLS regression includes a constant then do not include a constant in the ADF
regression, and vice versa
27 / 42
Estimate the error-correction model - if the variables are cointegrated use the
residuals to estimate the error correction model
Δy1,t = γ0 + α1 [y1,t−1 − β1 y2,t−1 ] + …
K L
∑ ζ1,i Δy1,t−1 + ∑ ζ2,j Δy2,t−1 + εy ,t

1
i=1 j=1
Δy2,t = η0 + α2 [y1,t−1 − β1 y2,t−1 ] + …
K L
∑ ξ1,i Δy2,t−1 + ∑ ξ2,j Δy1,t−1 + εy ,t

2
i=1 j=1
where β is the cointegrating vector, ε

1
and ε y1 ,t y2 ,t
are white noise, while α and α
1 2
are the speed of adjustment parameters
We can substitute (y 1,t−1

− β1 y2,t−1 ) with u
^ t−1
OLS provides efficient estimates, t and F test statistics are appropriate
28 / 42
To assess model adequacy
Check residuals ε 1,t

and ε 2,t
to make sure they are white noise
If there is a problem then consider changing the lag length
Speed of adjustment α and α describe dynamics

1 2
Large value of α is associated with a large Δy

2 2,t
If α 2
= 0 and ξ 2,j
= 0 then Δy 1,t
can not Granger-cause Δy 2,t
If both α 1
= 0 and α 2
= 0 then there is no cointegration or error correction
α1 and α should also not be too big since they should converge to the LR values
2
over time (i.e. not immediately, or over-correct too drastically)
29 / 42
Problems with Engle Granger procedure
Has several important limitations
Need to specify left-hand side variables up front
y1,t = β1 y2,t + υy1,t
y2,t = β2 y1,t + υy2,t
υy1,t may be stationary but υ 1,t

not?
reversing the order may give different results

Can't identify multiple cointegrating vectors (or their form)
Reliance on a two-step procedure
first step generates residuals
second step uses these in a regression to obtain, α with ECM1
an error in step 1 is carried over to step 2 (i.e. we may incorrectly including a constant
in step 1)
30 / 42
Cointegration in a multivariate setting
Multivariate cointegration makes use of a VAR structure
Consider a first order V AR(1) for the n × 1 vector y t

= [y1,t , y2,t , … , yn,t ]
′
,
yt = μ + Π1 yt−1 + ut
where μ = [μ1 , μ2 , … , μn ]
′
is a vector of constants
ut = [u1,t , u2,t , … , un,t ]

′
is a vector of error terms
Π1 is a (n × n) matrix of coefficients
The stability of the VAR model is determined by the eigenvalues of Π that are 1
obtained by solving the characteristic equation
| Π1 − I | = 0
If all eigenvalues are modulus less than 1, the VAR is stable

31 / 42
Vector error correction model (VECM)
Now by writing the V AR(1) as a VECM,
Δyt = μ + (Π1 − I )yt−1 + ut
= μ + Πyt−1 + ut where Π = (Π1 − I )
With n variables, the number of of linear combinations of the variables in y that are
t
stationary will provide us with information on the number of cointegration vectors
32 / 42
VECM model
Three possibilities exist:
Π has full rank n = r
The VAR must be stable as there is no instability in the
system of equations. This model of stationary variables should be estimated in

levels.
Π has rank 1 ≤ r ≤ n − 1
The number of linear combinations is smaller than the
number of variables. Hence, some of the variables must be unstable and at least
one combination of the variables is stable. The number of cointegrating vectors is
given by r
Π has rank r = 0 (i.e. Π = 0)
There is evidence of instability and no combination
of the variables is stable. The unstable VAR cannot be cointegrated and should
be estimated in first differences.
33 / 42
VECM model
As before, the VECM includes the coefficients α and β
When cointegration exists, we can decompose Π as,

′
Π = αβ
where α and β are both dimensions n × r
β is the cointegration matrix where the linear combination of β ′

yt is stationary
each of the r rows of β ′

yt is a cointegrated (long-run) relation that induces stability
α measures the speed of adjustment back to equilibrium
34 / 42
VECM model
Consider the bivariate case that was examined previously,
Δy1,t μ1 α1 y1,t−1 u1,t

[ ] = [ ] + [ ] [β1 β2 ] [ ] + [ ]
Δy2,t μ2 α2 y2,t−1 u2,t
which can be written as,
Δy1,t = μ1 + α1 (β1 y1,t−1 + β2 y2,t−1 ) + u1,t
Δy2,t = μ2 + α2 (β1 y1,t−1 + β2 y2,t−1 ) + u2,t
and the cointegration relationship β ′

yt is given by,
′
β yt = β1 y1,t + β2 y2,t ∼ I (0)
which is what we had previously
35 / 42
VECM model
Now in the three variable case, we have n = 3 and r = 2
Δy1,t μ1 α11 α12

⎡ ⎤ ⎡ ⎤ ⎡ ⎤
⎢ Δy2,t ⎥ = ⎢ μ2 ⎥ + ⎢ α21 α22 ⎥ …
⎣ ⎦ ⎣ ⎦ ⎣ ⎦
Δy3,t μ3 α31 α32
y1,t−1 u1,t
⎡ ⎤ ⎡ ⎤
β11 β21 β31
[ ] ⎢ y2,t−1 ⎥ + ⎢ u2,t ⎥
β12 β22 β32
⎣ ⎦ ⎣ ⎦
y3,t−1 u3,t
with r = 2 , there are two cointegration relationships, which we denote β ′

1
yt−1 and
′
β yt−1
2
,
′
β yt = β11 y1,t + β21 y2,t + β31 y3,t ∼ I (0)
1
′
β yt = β12 y1,t + β22 y2,t + β32 y3,t ∼ I (0)
2
Hence, we have two linear relationships between the variables, y 1, y2 and y , which
3
ensures that this combined system is stationary
36 / 42
VECM model
The V AR(1) model can be generalized to a V AR(p),
Δyt = μ + αβyt−1 + Γ1 Δyt−1 + Γ2 Δyt−2 + …
+Γp−1 Δyt−p−1 + ut
where we have added p lags of the vector of variables
37 / 42
Johansen Approach
Testing for cointegration in the multivariate case amounts to determining the rank of
Π
Effectively, we need to determine the number of non-zero eigenvalues in Π
Johansen's maximum likelihood approach suggests that one should order the
eigenvalues,
λ
^
1
^ ^
> λ2 > … > λn
Then test the null hypothesis that there are at most r cointegrating vectors, where
^
H0 : λ i = 0 for i = r + 1, … , n
For example, if n = 2 and r = 1 as in the first example:

First eigenvalue, λ
^
1 ≠ 0 (able to reject null)
Second eigenvalue, λ
^
2 = 0 (unable to reject null)
38 / 42
Johansen Approach
The trace statistic specifies the null of hypothesis H of r cointegration relations as,
0
^
λtrace = −T ∑ log(1 − λi ) r = 0, 1, 2, … , n − 1
i=r+1
where the alternative hypothesis is that there are more than r cointegration
relationships
The maximum eigenvalue statistic for the null hypothesis of at most r cointegration
relations can be computed as,
^
λmax = −T log(1 − λr+1 ) r = 0, 1, 2, … , n − 1
where the alternative hypothesis is that there are r + 1 cointegration relations
39 / 42
Johansen Approach
For both tests, the asymptotic distribution is nonstandard and depends upon the
deterministic components included (constant and trend)
Tabulated critical values can be found in Johansen (1988) or Osterwald-Lenum

(1992)
Calculated values must be greater than tables to reject null
40 / 42
Summary
Single-equation modelling of nonstationary variables gives spurious results: unless
the variables cointegrated
Cointegration implies that the variables share a common trend
Cointegration describes the long-run relationship between variables
Often such relationships are given by economic theory
To determine if variables share one or more cointegrating relationships we can do the

following:
Determine whether the time series are stationary or nonstationary (graphical
analysis, unit root testing)
If the series are stationary, proceed to estimate dynamic model in levels of the
data
If the series are nonstationary, proceed to test for cointegration
41 / 42
Summary
To test for cointegration, two approaches can be used:
Single equation approach: Engel-Granger two-step procedure
Multivariate approach: Johansen test
Since cointegration is inherently a system property, the multivariate approach is
usually preferred
If we are able to reject the null hypothesis of no-cointegration, proceed to estimate an

equilibrium correction model either by:
Single equation model: ECM or ARDL
Multivariate VECM
If the null of no-cointegration cannot be rejected, proceed to estimate the model in

first differences
42 / 42

Co Integration

Uploaded by

Copyright:

Available Formats

You might also like

Co Integration

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Co Integration

Uploaded by

Copyright:

Available Formats

Cointegration

Facilitates an equilibrium analysis, where variables exhibit departures from

linear combination of these variables will also be nonstationary

In these cases, we say that the variables are cointegrated

We could rearrange the linear regression model, such that

If u is I (0), then by definition the combined y

We say that the variables y and y are cointegrated

Figure : Unit roots, spurious relations & common stochastic trends

and y = 0.3 + μ + υ 2,t y,2t y2t

It follows that the relationship β ′

Then the system is cointegrated when β ′

drift together in long-run equilibrium

The vector β is termed a cointegrating vector

the combined variables is b, then we note that y ∼ CI (d, b) t

υit is the stationary component

We are then able to multiply y 1,t

β1 y1,t + β2 y2,t = β1 (μy + υy ) + β2 (μy + υy )

Then the stochastic trends (β 1 μy1,t + β2 μ y

Hence, for y and y to be CI (1, 1), μ

1,t 2,t y1,t =

Such a cointegrating vector is unique up to the scalar

This can also be extended to the n variable case

where y is a vector {y , y , … , y } containing various I (1) variables and μ is a

vector of stochastic trends (μ , μ , … , μ ), and u is an n × 1 vector of irregular

where we once again multiply though by β to get βy t

since the linear combination of all βμ t

we are left with βy t

The variables return towards the equilibrium path over time

In a cointegrated model variables return to the equilibrium value

This may be accomplished by:

When the errors are stationary they may be expressed as,

ut = ϕ1 ut−1 + εt with |ϕ1 | < 1

Hence combining the two where, u t

P1,t − β1 P2,t = ϕ1 (P1,t−1 − β1 P2,t−1 ) + εt

P1,t = β1 P2,t + ϕ1 (P1,t−1 − β1 P2,t−1 ) + εt

ΔP1,t = −(1 − ϕ1 )(P1,t−1 − β1 P2,t−1 ) + (β1 ΔP2,t + ε1,t )

= α(P1,t−1 − β1 P2,t−1 ) + ε1,t

where α = −(1 − ϕ1 ) , while ΔP 2,t

Illustrates how the variables are influenced by deviations from equilibrium

If we assume that both share prices are I (1) and;

ΔP1 = α1 (P2,t−1 − β1 P1,t−1 ) + ε1,t

ΔP2 = α2 (P2,t−1 − β1 P1,t−1 ) + ε2,t

Long term equilibrium described by (P 2

If P is I (1) then ΔP is stationary

When variables are CI (1, 1), the term ε 1,t

Small values of α imply a relatively unresponsive relationship, where it would take a

long time to return to equilibrium

Δy1,t = γ0 + α1 [y1,t−1 − β1 y2,t−1 ] + ∑ ζ1,i Δy1,t−1 + ∑ ζ2,j Δy2,t−1 + εy ,t

Δy2,t = η0 + α2 [y1,t−1 − β1 y2,t−1 ] + ∑ ξ1,i Δy2,t−1 + ∑ ξ2,j Δy1,t−1 + εy ,t

This is a representation for a VECM

If both α and α equal zero there is:

Consider the example,

y1,t = ϕ1 y1,t−1 + ϕ2 y2,t + εt

Δy1,t = ϕ2 Δy2,t − (1 − ϕ1 )y1,t−1 + ϕ2 y2,t−1 + εt

Hence the relationship between y and y is described by

This is equivalent to the ECM that we derived earlier

All variables must be integrated of the same order

The number of cointegrating vectors is called the cointegrating rank