Professional Documents
Culture Documents
Integrating Modelling Technique
Integrating Modelling Technique
This manuscript has been reproduced from the microfilm master. UMI films the
text directly from the original or copy submitted. Thus, some thesis and
dissertation copies are in typewriter face, while others may be from any type of
computer printer.
In the unlikely event that the author did not send UMI a complete manuscript and
there are missing pages, these will be noted. Also, if unauthorized copyright
material had to be removed, a note will indicate the deletion.
UMI 800-521-0600
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
R eproduced w ith perm ission o f the copyright owner. Further reproduction prohibited w itho ut perm ission.
INTEG RATING MODELLING TECHNIQUES FO R FINANCIAL TIM E SERIES
A Thesis
Presented to
The Faculty of G raduate Studies
of
by
VICTOR KHOON-LEE TAN
for th e degree of
Doctor of Philosophy
August, 1999
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
National Library Bibliotheque nationale
1+1 of Canada du Canada
Acquisitions and Acquisitions et
Bibliographic Services sen/ices bibliographiques
395 Wellington Street 395, rue Wellington
Ottawa ON K1A0N4 Ottawa ON K1A0N4
Canada Canada
Your 1Ue Votre reference
The author has granted a non L’auteur a accorde une licence non
exclusive licence allowing the exclusive permettant a la
National Library o f Canada to Bibliotheque nationale du Canada de
reproduce, loan, distribute or sell reproduire, preter, distribuer ou
copies o f this thesis in microform, vendre des copies de cette these sous
paper or electronic formats. la forme de microfiche/film, de
reproduction sur papier ou sur format
electronique.
0-612-43275-0
Canada
R eproduced w ith perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
TAN
UNIVERSITY
W iUELPH Victor Khoon Lee 80302 (928 009 570)
The Examination. Committee has concluded that the thesis presented by the above-named candidate in partial
fulfilment o f the requirements for the degree
Doctor o f Philosophy
is worthy of acceptance and may now be formally submitted to the Dean o f Graduate Studies.
1.
Chair, Dqcjoral Examination Committee J . Mokariski
n.
External Examiner R* S^QC
in.
Graduate Faculty Member G. Hines
IV .
v. Date:
Advisory Committee T. Stengos
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
ABSTRACT
In standard statistical treatm ent of tim e series, tim e domain and frequency domain
are the classical techniques used as a basis for characterising an observed system and
forecasting its future behaviour. However, due to complicated d a ta structures of
tim e series, recent research developments-along with the availability of high-speed
com puters-have facilitated the development of m odem nonparam etric methods such
as the moving blocks bootstrap and the neural networks.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
classical techniques and m odem nonparam etric m ethods may offer a m ore effective
solution to financial forecasting problems.
This thesis will im plem ent a financial tim e series m odel which utilizes b o th clas
sical techniques and nonparam etric m ethods for estim ation and inference. It will
go on to propose a modified bivariate transfer function model. B oth tim e dom ain
and frequency dom ain techniques will be used in order to determ ine a relationship
between bivariate tim e series. After generating the less model de-pendent sam ples
using the m oving blocks bootstrap, a neural network will be applied to achieve b e tte r
point estim ation. Integrating these techniques provides th e best alternative solution
to financial forecasting problems. T he bivariate tim e series in question is interest
rate spreads and the spot Canadian dollar. Simulation results showed undercoverage
overall, but th e integrated modelling technique appears robust to th e choice of noise
distribution as well as th e sample size.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
To: My Family
and
The Memory of My G randparents and Parents
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
ACKNOW LEDGEM ENTS
I would like to thank th e D epartm ent of M athem atics and Statistics for all the
physical and financial supports. My thanks are also extended to Linda Allen, John
McCormick, departm ental stuff and m y fellow graduate students for th eir help and
friendship.
A special thank to m y lovely girlfriend, Sandy Pui-Yee Li, and h er family for
encouragem ent, support and love. Finally, I would like to thank my fam ily for their
endless support in everything!
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Contents
1 In tr o d u c tio n 1
ii
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
2.2.3 Neural N e t w o r k s ................................................................................. 20
3 In te g r a te d M o d e llin g 28
5 S im u la tio n 50
iii
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
5.5 Sim ulated Series V and Simulation R e s u lts ............................................... 60
5.7 Sim ulated Series VTI and Sim ulation R e s u lts ............................................ 64
6 C on clu sion s 69
R eferen ces 73
A p p e n d ix 80
A. T he Back-Propagation A l g o r i th m ........................................................ 80
iv
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
List of Figures
2.1 A schem atic diagram of the moving blocks bootstrap for tim e series
4.1 T im e Series Plots for Spot Canadian Dollar and Interest R ate Spread. 40
4.2 Changes in Spot Canadian Dollar (Yt) and Interest R ate Spread (A t). 41
4.3 Histograms of the Changes in Spot Canadian Dollar and Interest R ate
Spread D ata....................................................................................................... 43
4.7 T he plot of standard error estim ates for all chosen block sizes. T he
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
List of Tables
5.8 Results for 95% Forecast Intervals on Sim ulated Series ........................ 67
vi
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Chapter 1
Introduction
1.1 Introduction
The use of a related series of historical values (or past d ata collected) as evidence
or a guide line to predict and to draw conclusions about future events, is a practice
researchers are m otivated to search for rules that can explain relationships between
the past and future. However, any forecast is affected by the unpredictable nature of
future events as well as by th e lim itations of past data. Moreover, a forecasting m odel
used by one field may not necessarily be used by another, because th ere are problems
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
In financial tim e series, the following two problems always exist. They axe:
reflect the available information set. However, in the actual m arket, deviations
exist between true prices and market prices because not all traders have all
the existing information, and may not always act rationally. Moreover, “the
[27, p.12].
In standard statistical treatm ent of tim e series, the first step is to ‘look’ a t the
data, and then select suitable m athem atical models for the data. Before we can do
this, we must first study and understand the techniques which have been developed
for forecasting and drawing inferences from the series. Classical tim e series techniques
are used as a basis for characterising an observed system and forecasting its future
behaviour and, therefore, is the starting point for conventional statistical tim e series
m odel building.
In the time series literature, classical tim e series analysis is based on the theory of
stationary processes, and it has been dem onstrated that m ost stationary processes can
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
be approxim ated by a model drawn from the class of autoregressive moving average
moving average (ARIMA) class of models, for which a model selection strateg y has
been developed by Box and Jenkins [8]. The significant contribution of Box and
Jenkins to tim e series analysis is based on the invertible ARMA class of processes and
th e four phases of: identification, estim ation, diagnostic checking, and forecasting.
and the sam ple partial autocorrelation function can yield im portant inform ation in
m odel. Because of th e developments of Box and Jenkins [8], the problem of estim at
ing the ARMA (p, q) m odel has attracted considerable attention in the tim e series
literature.
The frequency dom ain analysis of tim e series has its own share in the tim e series
literature. Frequency domain analysis is m athem atically equivalent to tim e dom ain
density function describes how the variation in a tim e series m ay be accounted for by
dom ain analysis provides many im portant insights which would not be apparent in
tivariate stationary processes and in th e analysis of very large d a ta sets, for which
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
num erical calculations can be perform ed rapidly using the fast Fourier transform” ,
Brockwell and Davis [9, p .112]. By combining the frequency dom ain technique and
the transfer function m odel-as proposed by Box and Jenkins [8] in the tim e d o m ain -
we are able to exploit th e relationship between any two financial tim e series.
Due to the com plicated d ata structures of financial markets such as stock returns,
exchange rates, and interest rates, there is a need to improve th e perform ance of
param eter estim ates in tim e series. In addition, th e behaviour of these financial
tim e series could perhaps be described more accurately by non-linear m athem atical
models. Such m ajor developments such as bootstrap techniques have been applied to
problems from time series analysis. In the tim e series literature th e use of bootstrap
techniques can be found in Efron and Tibshirani [19], Freedman [23], Kunsch [43] and
Recently, artificial neural networks (A N N s) have formed a basis for an entirely dif
ferent non-linear approach to the analysis of tim e series. Together w ith the widespread
availability of powerful com puter hardw are and efficient training algorithms, neural
field and there is increasing focus on th e potential benefits that neural networks can
offer. N eural networks have been applied to a wide variety of problems, such as p a tte rn
statistical perspective include: Cheng and T itterington [11], Ripley [51] and W hite
[55] [56]. Weigend an d Gershenfeld [54] provide a com puter scientist’s perspective of
neural networks in tim e series. In addition, Cybenko [13], Funanhashi [24], G allant
and W hite [25] [26], Hecht-Nielsen [31], and Hom ik, Stinchcombe and W hite [32]
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
[33] provide some im portant references on the existence properties of the feedforward
networks.
T he m ain objective of this thesis is to model financial tim e series. After a review
of classical tim e series approaches to tim e domain and frequency dom ain approaches,
the thesis will examine the moving blocks bootstrap and neural networks before going
C hapter 2 will discuss classical and m odem nonparam etric methodologies to tim e
series. In Section 2.1, tim e and frequency domain techniques are reviewed as these
techniques are used (in a later section of the thesis) to determ ine the existence
function model. In Section 2.2, nonparam etric m ethods-the moving blocks bootstrap
and neural netw orks-are discussed in detail. These methods, together with classical
techniques, provide the theoretical basis for the derivation of an integrated technique.
forecasting problems.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Chapter 4 presents an application of the integrated modelling technique to fi
nancial time series, interest ra te spreads and spot Canadian dollar d a ta sets. The
empirical results support the use of the integrated modelling technique to financial
tim e series.
T he integrated modelling technique will be applied to sim ulated tim e series data
in Chapter 5, which will also summarise and present the results of the sim ulation.
size. Conclusions and suggestions for future work will be presented in C h ap ter 6.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Chapter 2
fields. Over th e years, a num ber of quantitative techniques and nonparam etric
m ethods have been developed for analysing and predicting tim e series values (essential
To simplify the review of classical m ethods, we will not review tim e dom ain
average (ARIM A) models, known as the Box and Jenkins approach, can be found
in studies by Brockwell and Davis [9], Chatfield [10], and Priestley [48], which pro
vide readable introductions to tim e series analysis and cover both tim e domain an d
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
frequency domain techniques. In this section, a brief review of frequency dom ain
techniques is presented.
it suitable for stochastic rather th an determ inistic functions of tim e” Chatfield [10,
k (n ~~ 1)
Vk .......... [?]■
are called th e Fourier frequencies of Z t and [•] denotes the greatest integer function.
sinusoidal function
length of tim e required to complete one cycle is called the period, the frequency i/*
measures the num ber of cycles per tim e point t. Note that, by knowing th e Fourier
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
frequencies, we can capture the tim e p ath of Zt as a weighted sum of the sinusoids in
(2.2).
In order to estim ate the contribution of each sinusoid (2.2), we estim ate th e
autocovariances; y z (k ) = C<rv(Zt+k, Z t ), by
1
= - E (z ‘+w - ^ - £) <2-3 )
n «=i
n —1
In.{vk) = cz(0) + 2 5 2 cz(&) cos(2iri/kk) ■ (2.4)
fc=i
T he periodogram appears to be a natu ral way of estim ating the spectral density
where
lz (k ) = I 2 f z ( v ) cos{2rkv)du .
Moreover, the to ta l area under the periodogram is an estim ate of the variance of Z t ]
th a t is,
t~]
£ (Z t - Z f = £ / n( ^ ) . (2.6)
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Although, th e periodogram is asym ptotically unbiased; th a t is
the variance of I n(vk) is not zero as n —►oo. Hence I n(i/k) is not a consistent estim ator
procedure of estim ating a power spectrum -that is, the m ethod of sm oothing the
where the lag window, iu(-), is a weight function on [0,1] w ith 0 < w (x) < io(0) = 1,
the inconsistency of the periodogram w ithout destroying its asym ptotic unbiasedness.
given tim e series. However, the choice of m is im portant because “the larger th e value
of m the smaller will be the variance of the resulting estim ate but the larger will be
the bias, and if m is too large th en interesting features of f z { y ) , such as peaks, may
be sm oothed out” Chatfield [10, p .117]. Since there is no effective m ethod for m aking
the choice of m , we can try several values in the region of ^ according to Chatfield
[10]. The most basic lag window estim ator is th e uniformly weighted average. In this
thesis, the modified Daniell sm oothing function, which is defined below, will be used
10
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
for d a ta analysis
X [I- + l i E ^ l ) _ 1I + | . x [ i + f c l i l , } , (2.9)
where span ( = 2 m -+-1) is an integer giving the length of th e smoothing window for
the given series. Note th a t u;(x,-) in (2.9) is the com putational expression of w ( ^ ) in
values an d x,- in sm oothing x,-; the sm oothing window actually includes span points
(2.10)
fo fz { v )d v
By defining the sam ple version of equation (2.10) in the following way
E
^ ) =& r ‘'it
"€N ' (2 -u )
we can use Un(is) to test whether or not a tim e series is w hite noise. T h at is, we
plot th e Un{y) function and check its com patibility with the diagonal line (uniform
11
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
distribution, function; F (x) = x , 0 < x < 1) using the Kolmogorov Smirnov test.
Moreover, for n > 62 (see Brockwell and Davis [9]), a good approxim ation to a level
a Kolmogorov Sm irnov test is to reject th e null hypothesis if the statistic falls outside
of the bounds.
Thus fax, we have been concerned with analysing a univariate tim e series. We will
now exam ine th e relationship between two tim e series. For two given non-white noise
tim e series, Yt and X t , we use the cross spectrum, to establish whether a stru ctu ral
fv x M = £ lY x W e -* ™ * , „ € f - i |] , (2.12)
fc = —oo
where
7 r x { k ) = Cov(Yt+k, X t ) , k = 0 , ± 1, ± 2 , . . . (2.13)
12
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Note th a t f r x ( v ) is a complex valued function. The reason is th at y y x (k ) (2.13) is
and the Cauchy - Schwarz inequality ensures that squared coherence, K 2(v), is botinded
as follows:
Here K 2(u) can be interpreted as the spectral equivalent of the ordinary linear
series axe correlated at each frequency. Two situations of particular interest arise:
where
00 r 1 11
A (u) = £ (2.18)
OO W
13
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
C a s e I I : W hen no correlation between Yt and X t series exists, we have
fx v M = 0 v „ 6 [ _ i, i] . (2.19)
and
These axe the two extrem es th a t can happen to th e Yt and X t series. However, in
exists.
as well as
14
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
tu( 0 ) = 1 , and m is chosen to be a sequence such th a t - — > oo as n - t o o w ith
n —k
and
K 2 ( 2 . 22)
- [ 4 -3 -
S - * ! * * .™ . (2.23)
l-I^ J I2 L 2 ’ 2j
S ~ F2Am under the hypothesis that K 2{u) = 0, we can therefore reject the hypothesis
of m can be im portant; it can also affect the F test as dem onstrated here.
15
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
2.1.3 Bivariate Transfer Function Model
1712
Yt = £ a sX t. s + N t , (2.24)
5=771 X
X t . Here, m i and m 2 have to be estim ated. Note th a t if m i > 0, we assume the past
X t will influence future Yt , but not vice versa. Thus, in the former case the model
Having considered the model, we follow the procedures in Brockwell and Davis
and neural networks are computer based (nonparam etric) techniques. Although
the two techniques use different com puter algorithms, they provide an alternative
complicated d a ta structures.
16
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
2.2.1 Moving Blocks Bootstrap Sample
The bootstrap is a com puter based resampling technique (as initiated by Efron
[15]) used to gain information on the distribution of an estim ator. A m ore general
resam pling technique, called th e moving blocks bootstrap (see Kunsch [43], Liu and
Singh [46], and Leger, Politis and Romano [44], and Efron and Tibshirani [20]), is used
for m odel dependent data. T he procedure for obtaining th e moving blocks bootstrap
Simple bootstrap resam pling involves fitting a model and then sampling from th e
tim e series: For any given tim e series sample, we can generate a bootstrap tim e series.
In order to do so we m ust (1) choose a block length; (2) form the tim e points to all
possible contiguous blocks of chosen length; (3) draw th e samples with replacem ent
from these blocks; and (4) join all th e samples together to form a bootstrap tim e
series. These steps can be applied to generate a bivariate bootstrap tim e series-that
is, we pair-up the original bivariate tim e series and then apply the four steps to th e
series. For exam ple, if we have samples of size n, and the block length is chosen to be
/, then we can have k blocks to form the bootstrap tim e series. The num ber of blocks,
17
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
tim e series with, block length I = 3. ‘‘T he idea for the moving blocks bootstrap is to
choose a block size I large enough that observations m ore th an I time u n its apart will
be nearly independent” Efron and Tibshirani [20, p.102]. However, Kunsch [43] and
Liu and Singh [46] show th a t B{ = (X{, X i+ i, . . . ,X,-+r_i), i = 1,2,— , k, are i.i.d .
for fixed I under some conditions. Note th a t the moving blocks bootstrap coincides
w ith the classical i.i.d. bootstrap of Efron [15] when 1 = 1. The advantage of using
try ing to capture since we axe resampling from the blocks of observations. Next, we
and Tibshirani [20] suggest th a t we have to m ultiply the bootstrap stan d ard errors
step to the moving blocks bootstrap procedure; th at is, the addition of a random
18
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Original Tim e Series Sam ple
Figure 2.1: A schematic diagram of the moving blocks bootstrap for tim e
(Modified from Efron and Tibshirani [21]).
19
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
2.2.3 Neural Networks
Artificial neural networks (ANNs) axe m athem atical m odels for hum an brain
Neural networks are considered a nonparam etric m ethod for drawing statistical
nonlinear relationship between th e given input and output d ata. W hen used together
w ith powerful com puter hardware and training algorithms, ANNs allow statistical
tim e series are more suitable for analysis by neural networks. Because tim e series are
Although there are a numerous of networks of which m any are recurrent neural
networks (e.g. Kuan and Liu [42]), the multilayer perceptrons is the sim plest neural
network representation, and it can well approxim ate a large class of functions. Hence,
in this thesis, the feed-forward networks are used as the only nonparam etric model.
Authors such as Cybenko [13], Funanhashi [24], Gallant and W hite [25] [26], Hecht-
Nielsen [31], and Homik, Stinchcombe and W hite [32] [33] provide some im portant
from a statistical perspective include Cheng and T itterington [11], Ripley [51], and
W hite [55] [56]. Weigend and Gershenfeld [54] present a com puter scientist’s perspec
20
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
2.2.4 Feed-forward Networks : Multilayer Perceptrons
network consists of a set of source nodes that constitute the input layer of one or
m ore hidden layers of com putation nodes, and an output layer of com putation nodes.
by-layer basis. These feed-forward networks are com m only referred to as mulitlayer
perceptrons (M LP) when there is a t least one hidden layer. From th e statistical point
th a t are estim ated by optimising some measure of fit to the training data. Figure 2.2
shows a typical m ultilayer perceptron with only one hidden layer, flowing from the
bottom up.
training, each input d a ta vector is paired with a corresponding desired taxget for
th eir desired targets, the difference being the o u tp u t errors. T he errors axe th en
fed back to th e network to improve the performance. This process is known as the
backpropagation algorithm. This algorithm is based on the mean squared error (MSE)
of two passes: th e forward pass a n d backward pass. In the forwaxd pass, the input
d a ta vectors axe applied to the source nodes and propagate, layer by layer, through
21
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
O utput layer
H idden layer
Input layer
th e network. Once one forward pass is completed, a set of outputs is obtained as the
actual o u tp u t of the network. Note that the synaptic weights (which are th e inter
node connection weights of the network) are fixed during the forward pass. During
th e backward pass, the synaptic weights of th e network are all adjusted according
actual output of the network from the desired target. This updated error signal is
synaptic connections. The synaptic weights are adjusted to improve the perform ance
until a zero, or m inim um, error signal is obtained. T he learning process perform ed
or iteration, comprises the forward pass and backward pass cycles. One com plete
22
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
m aintained on an epoch by epoch basis, until the synaptic weights and the threshold
levels of the network stabilise and the E m se over th e entire training set converges to
where N s is the num ber of training vectors in one complete presentation of the entire
training set during th e learning process (an epoch), and N0 is th e num ber of output
neurons.
epoch to th e next in order to make the search in weight space stochastic over the
learning cycles, and avoid the possibility of limit cycles in the evolution of the synaptic
weight vectors. Moreover, the randomisation of th e training sets allows for a testing
problems.
T he normalised mean squared error (NMSE) is used to evaluate the model of the
Y K (Y — Y ) 2
N M SE = * vT - •1 , (2.26)
where K is the num ber of d a ta in the validation set, Ys is the desired target, and Ys
is the actual output. Note th a t the quantity ( 1— NMSE) measures the percentage
variance in the d a ta which is explained by the model. Moreover, if we plot the learning
23
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
curve (the NMSE versus the epoch) for th e function estim ated from the train in g set,
m ultilayer perceptron will be considered well train ed if it learns enough abo u t th e past
to be able to generalize about the future. However, if we allow the learning process
to proceed indefinitely, we will overfit the d a ta and begin modelling only noise-that
consider the linear regression case: if we constantly add param eters to its full rank,
the R-square becomes 1. Hence, we need a stopping criteria in order to end the
Kim, M artin and Staley [39] identify the technique most commonly used to end
th e learning process: First, the available d a ta set (or the moving blocks bootstrap
tim e series samples) is randomly partitioned into a training set and a validation set
(there are no fixed rules for selecting the am ount of data for either the train in g set or
th e validation set). We propose to select 75% o f the data to be used for th e training
set, and 25% of the d a ta to be used for the validation set. T he m otivation here is to
validate the model on a data set which is different from the one used for param eter
estim ation. In this way, we can com pute th e NMSE (2.26) based on a validation
set using the resulting network used on the training set. Consequently, we plot the
learning curve of th e validation set, and look for th e epoch a t th e point on th e curve
set will not necessarily monotonicaily decrease. This number of epochs will then be
used to determ ine th e stopping point for the learning process. It should be n oted th at
24
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
this situation does not always occur, b u t if it does happen, th e network should be
perceptron network on th e given training set and validation set is used for prediction
In neural network architecture, Figure 2.3 shows the commonly used threshold
functions. The threshold function is used in the forward pass com putation. Using
= + (2-27)
h=1
where / ( - ) is the threshold function, i/hi is the weight between the input layer an d hid
den layer, and is the hidden layer threshold values. Note th at th e backpropagation
25
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
(c)topta
26
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
function defined by the logistic function :
Y = , ■, (2.28)
1 + e x p ( —x)
f = (l - Y ) Y , (2.29)
where f is the derivative of the logistic function. Note th a t the logistic function
is a commonly used function which transform s the input value in the range x 6
[—oo, + oo] into the output range O € [0, 1]. “The presence of nonlinearities is
im portant because, otherwise, the input - output relation of the netw ork could
2. For the choice of the maximum number of layers in th e network, Lippm ann [45]
recommends th a t three layer networks, w ith just one hidden layer, are sufficient
3. The hidden neurons which are not part of the input or output of th e network
27
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Chapter 3
Integrated Modelling
occur in the financial world. The technique utilizes both classical techniques and
nonparam etric m ethods for estim ation and inferences. This chapter will begin w ith
basis of im plem enting techniques in modelling tim e series. In this section, with th e
28
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
support of these techniques and m ethods, we present the theoretical foundations in
An initial approach is to follow th e m ethod of Kim and Maxtin [40], using spectral
analysis to uncover certain statistical details. For a given bivariate tim e series, the
sta tistic Un(v) (see C hapter 2 ) is used to determ ine whether the two tim e series
axe w hite noise. T hat is, the Kolmogorov Sm irnov test is used to reject th e null
hypothesis w ith level a. if the function Un(u) falls outside of th e bounds for each tim e
series. Once we reject the null hypothesis and conclude that the two given tim e series
are not w hite noise, the cross spectrum is applied to determ ine the existence of the
A fter establishing the existence of a structural relationship between the two series,
applied to furth er analyse the tim e series. Before implementing th e proposed model,
it is helpful to provide the rationale for m odifying the bivariate transfer function
7Jl2
y t = £ a *x > - + N ‘ • (3-i)
Recall th a t (3.1) is valid only if it is causal, th a t is, the past X t will influence future
Yt , b u t not vice versa. In particular, the m odel requires th at m x < m 2 and m i > 0 .
N ext, we follow the procedures listed in Brockwell and Davis [9, pp.455-457] in order
29
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
S te p 1. We calculate the cross correlation function (C C F ); f>wtY‘ {h) of the two
filtered series; nam ely W t and Y ' . The filtered series Wt and Y ' are obtained
i(B )0 ~ l (B )X , = Wt , (3.2)
obtain
S te p 3. From the previous step, we estim ate rhi and m 2 to be the m inim um and
30
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
To continue the process of estim ating ocs, s = m l7 — , m 2 for th e bivariate transfer
function model, we apply the filter <j>(B)0~l (B ) to th e model (3.1), which yields
7712
V,- = £ a ,W t- , + n ; . (3.4)
s=mi
T his implies th a t the transfer function between X t and Yt series isthe sam e as
expectation, we obtain
Since
E (W t- kY~) = iw trc<k)
E (W t- kWt-,-) = 0 for j / 1
IW tY'iE ) = a k&wt ( 3 -6 )
31
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
and
lw tYt-
CCk = ------ -
C\V?
_ gyt‘ l w tY?{k)
<*Wt G Y ’ GWt
= - ^ - p W tY 'ik ) , fc = 0,1, 2, . . .
crwt
The estim ates of a for lag h at which pwtYfi}1) is found to be significantly different
p
E
Yt = £ w s( X t- s +
s=0
Pt-s) + Nt , (3.8)
K if X t- 1 > X t-2
0 if X t—i = X t —2
X ifX t-i< X t-2
where P ’s, k, and A axe unknown param eters. Note th a t we fix th e term p to b e 4
in ( 3 .8 ) for daily closing d ata and, for simplicity we let X t — X t + Pt- In addition,
the coefficient w3 will be estim ated by using the maximum likelihood estimate (M LE).
32
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
Furtherm ore, we introduce two constant term s, namely k and A, in (3.8). The reason
will result in an increase (decrease) in present tim e point X t. The estim ates of k and
and
A = average of \ ^ 2 ( X t — X t- i \ X t < X t- i)
U=2
So far, we have only used the classical techniques w ith modifications. The next
erate bootstrap tim e series samples using th e moving blocks bootstrap m ethod for
bootstrap m ethod to the residuals. The reason for this is th at the bootstrapping of
residuals m ethod is “model dependent”, th a t is, the model, for exam ple an A R ( 1)
Next, we apply the multilayer perceptron network to the original bivariate tim e
series and th e bootstrap generated bivariate tim e series samples for estim ation and in
ference. These applications complete the derivation of the theoretical foundations for
integrated m odelling where the procedures of the nonparam etric methods introduced
33
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
3.2 Modelling Procedures
T he correct approach for analysing bivariate financial tim e series is to use spectral
determ ine the existence of the relationship between the two given tim e series. Lf it is
technique to fu rth er the analysis of the bivariate tim e series. Having defined the
m odel structure, the moving blocks bootstrap is applied to th e given tim e series.
However, we need to know the block size I in order to generate the bootstrap tim e
series samples. Since there is no effective way of choosing the block size a t this
coefficient and its associated standard error as tru e values. For exam ple, we
zt = f i z t - i + e .
34
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
S te p 2 . Here we fix the block size to be I = 1,5,10, - -. , 100, and select B = 200
from zt.
S te p 3. For each I, we fit the ARMA(p,^) m odel to each bootstrap tim e series
S te p 4. For each I, we estim ate th e standard error <tf(P) by the sample standard
deviation of th e B replications
1_
where
6=1
S te p 5. Here we plot the standard errors obtained from Step (4) versus th e block
35
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
3.2.2 Generating Bootstrap Bivariate Time Series.
Once the block size is chosen, th e bootstrap tim e series samples axe generated by-
S te p 2 . Select B independent bootstrap samples (X*1, l^*1), (X*2, Y ' 2), - - * , ( X “B, YtmB),
each consisting of n data values drawn w ith replacement from ( X t, Yt)- Here we
pairs. Note th a t we need to have B > 1000 for constructing the intervals (See
Tibshirani [53]).
A fter obtaining th e moving blocks bootstrap tim e series for each set of bootstrap
sam ple pairs (X "b, Ytmb), b = 1 , 2 ,... , B , we construct the m ultilayer perceptron net
work w ith 4 input layers, n hidden units for th e hidden layer (where n is determ ined
by trial and error in th e region of | of input vectors), and 1 o utput layer. The three-
X - 3 , X - 2 , X - i and X 0 axe set to zero. Since the underlying (2.28) activation function
36
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
th a t does not “saturate” the network neurons. Thus, we have th e range of th e inputs
Finally, we can construct the bootstrap forecast intervals for the financial tim e
series. For example, if we want to have a forecast interval for one step ahead future
D‘ = Y n+ i - 6 = 1, 2, . . . ,2?,
where Yn+i is the forecasted future value obtained from th e original tim e series,
[^ n + l Wl —a> ^n+1 U a •
[20].
37
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Chapter 4
grated m odel presented in C hapter 3. This chapter will apply th e integrated model
to a given set of data, nam ely the interest rate spread and th e spot C anadian dollar
series, in order to exam ine how the integrated m odel will perform .
T h e interest rate spread d a ta (I S t) and the spot Canadian dollar d ata (C t) are
taken from Kim and M artin [40]. A brief description of the d a ta set is given below.
1996, axe used in this study. T he interest rate spread IS t (in percent) is defined as
the difference between 90 day Canadian and 90 day US treasury bill rates, and the
38
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
spot Canadian, dollar Ct is defined as th e closing price of th e exchange rate measured
X£ = IS t ~ I S t-i , (4.1)
Note th a t th e tim e unit for Yt is one day. T he time plots of Yt and X t series axe
presented in Figure 4.2A and 4.2C. It is apparent that th e two series appear to be
In Figure 4.2C however, the X t series exhibits extrem e movements, while the Yt
series (Figure 4.2A) is bounded within a sm aller range over th e tim e period. Moreover,
since we axe trying to identify the “stable” model for the long term link between th e
X t and Yt series, we will remove these extrem e points and replace them as follows:
In particular, we will bound the m ovem ent in X t between ±38 ( = ± 4 • &xt) basis
39
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Figure 4.1: Tim e Series Plots for Spot Canadian Dollar and Interest R ate Spread.
St
St
5
tN t tWZ 1M tm * (MS IIM
TkM
T im *
40
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Figure 4.2: Changes in Spot Canadian Dollar (Y t) and Interest R ate Spread (X t).
C ) T h e C h a n g e s in In terest R a te S p r e a d . D ) T h e C h a n g e s in In te re s t R a te S p r e a d will
A djustm ent.
41
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
points, and bound the movement in Yt between ± 1.1 (= ± 4 * <5yt) basis points. This
In Figure 4.2B and 4.2D, wereplot the tim e plots of Yt and X twith these changes.
T he histogram of Yt and X t series axe presented in Figure 4.3. Notice th a t all the
to determ ine the existence of the relationship between the Yt and X t series. From
apply to the Yt and X t series using the modified Daniell sm oothing function in Splus
series.
From the figures, we can see th a t there is some power in th e high frequencies in
Yt series and low frequencies in X t series. Although we do not know where th e power
is com ing from, we can relate some of the power to the short noisy appearance in the
N ext, Figure 4.5 shows the cumulative periodogram and th e 95% confidence limits
42
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Figure 4.3: Histograms of the Changes in Spot Canadian Dollar and Interest R ate
Spread D ata.
43
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Figure 4.4: Spectral Density Plots of Yt and X t Series.
e
£
H U M
F req u en cy
••
FftqtMncy
44
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
(Kolmogorov Smirnov boundaries) for both, the X t and Yt series. It is apparent th at
th e Yt series falls w ithin the confidence lim its, whereas the X t series fall outside of the
w hite noise, although it is interesting to note th a t the spectral density of the Yt series
in Figure 4.4A is not “flat” enough; that is, this “visual” criterion disagrees w ith the
Kolmogorov Smirnov test. Thus, we reject th a t the two tim e series are white noise,
Since the bivariate tim e series is not w hite noise, we carry out the hypothesis
testing defined in C hapter 2 in order to test w hether these series are correlated at
each frequency. To continue our example, using the modified Daniell sm oothing
In Figure 4.6, a plot of the statistic S is presented with the accompanying F statistic
( i ' 0.90, 2,140)• It is apparent th a t the hypothesis K 2(v) = 0 is rejected for all u € [o, |]
and th a t the Yt and the X t series have correlation over the entire frequency range.
Therefore, there is strong statistical evidence th a t the two series are correlated and
45
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
Figure 4.5: Cum ulative Spectrum Plots of Yt and X t Series.
F re q u e n c y
A ) C u m u la tiv e S p e c t r u m o f Y _ t S e r i e s .
F re q u e n c y
B ) C u m u la tiv e S p e c t r u m o f X _ t S e r i e s .
46
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
To proceed to our d a ta analysis, th a t is, to generate bootstrap tim e series samples,
we first apply the algorithm for choosing appropriate block size. Note th at here we
apply the algorithm to the Yt series (spot Canadian dollar d ata). We plot the standard
errors versus the block size in Figure 4.7. Note th a t the horizontal line is th e tru e
value of crp obtained from the Yt series. It is evident th at the plot suggests th a t 1 = 60
is preferred here. After fixing the block size I = 60, we follow the steps described in
can use the integrated model to obtain a point estim ate a n d /o r inference for spot
In this section, the bootstrap forecast interval for the spot Canadian dollar is
presented. T he intervals are constructed with B = 1500, moving block size I = 60,
and the m ultilayer perceptron network with 4 input layers, 3 hidden units for the
hidden layer, and one output layer. Note th a t th e three-layer m ultilayer perceptron
Before we provide the point estim ate and th e forecast interval for spot Canadian
47
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Figure 4.6: P lot of Squared Coherence Statistic.
§ -
st -
I
05
Bloctc SlM 0)
Figure 4.7: The plot of standard error estim ates for all chosen block sizes. T he
standard error is 0.02605
48
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Recall that Yt in (4.2) is defined as
c n+1 =
As an example, the forecasted one step ahead future value forspot C anadian
C n+1 = 73.0678 ,
Note th a t, the a ctu al value of Cn+i is 73.0033. These results justify the use of the
49
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Chapter 5
Simulation
A sim ulation study is carried out to evaluate the performance of the model
th e classical techniques and nonparam etric m ethods for this d ata analysis. For this
study, we generate n + 1 data with sample sizes n = 500, and n = 1000, for th e X t and
Yt series and sim ulate X t as Interest rate spreads d a ta and Yt as spot Canadian dollars
d ata. For each sam ple size, the sim ulated tim e series are generated w ith Gaussian
and non-Gaussian noises. In addition, the X t series are generated as follows: For
sim ulation studies I - V, we have M A = 0 order and the orders for autoregressive
(A R ) axe assigned to be 2, 3, 5,7, and 10; For sim ulation study VI, we have A R = 2
steps axe applied to each sim ulated series: (1) The extrem e points axe adjusted
50
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
used to determ ine the existence of the relationship between th e two series; (3) if
the two series axe not white noise and are correlated over th e frequency range, we
proceed to the next step, otherwise, we have to staxt on a new sim ulated series; (4) the
algorithm for choosing the block size is applied; (5) the bootstrap tim e series samples
are generated with B = 1000; ( 6 ) a three-layer m ultilayer perceptron network w ith the
value for i ^ +i, and the forecast interval for are forecasted; a n d ( 8 ) after obtaining
the 95% forecast intervals for the Yn+i, we then com pare them to th e actual Yn+i- By
repeating this procedure 200 times, with only the complete procedure counted, we can
then calculate the percentage of the intervals th at cover the a c tu a l or targeted Yn+i
for sam ple sizes n = 500 and n = 1000, respectively. Note th a t we use a bootstrap
51
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
where th e coefficients <f>j, j = 1,2 axe random ly chosen to be 0.65 an d —0.2.
Zt = —0J2Xt -f v /F —ffi2^ - H t
where the function m m (-) returns a num ber th a t is the minimum of th e inputs,
trunc(-) is the function th a t creates integers from floating point num bers by
(1 - 4>B)(l - B )H t = Vt , K ~ W ( 0 ,1 ) (5.3)
i= i
Ut ~ (80% of N ( 0,1) -I- 20% of C auchy{0 , 1)) (5.4)
where the coefficients <py, j = 1,2 axe random ly chosen to be 0.58 and —0.12.
Zt = -0 .2 X t + V l - 0.22 - H t
52
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
where Ht is defined by
(1 - 4>B){ 1 - B )H t = Vt , Vt ~ N ( 0 , 1) (5.6)
S im u la tio n R e su lt I
T he sim ulation results are sum m arised in Table 5.1. Note th a t th e coverage
probability for sim ulation results axe lower than the nom inal level of 95%.
Moreover, th e results show indifferently for both sam ple sizes as well as the
noise distributions. However, for both samples sizes, th e average error is higher
A = 500 N = 1000
A R = 2 and M A = 0 A R = 2 and M A = 0
is defined by
Average Error = ^ ■
t^ 1 ^ — — . (5.7)
53
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
5.2 Sim ulated Series H and Sim ulation R esu lts
(1 - £ - B ) X t = Ut , Ut ~ N ( 0 , 1) . (5.8)
i=i
Note th at th e series is generated using the Splus function. We sim ulate tim e
series Yt as follows:
Zt = -Q .2 X t + V l - 0 .22 - H t
where H t is defined by
(1 - 4>B){1 - B )H t = Vt , K ~ i V ( 0 ,l ) - (5-10)
54
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
S im u la ted S eries 11(b): W ith N on -G au ssian N o ise
- B ) X t = Ut ,
i= i
Ut ~ (80% of 1V(0 , 1 ) + 20% of C a u ch y{0 , 1)) . (5-11)
Zt = —0 .2X t + V l - 0 .22 - H t
where Ht is defined by
(1 - <f>B){1 - B )H t = Vt , Vt ~ N ( 0 , 1 ) . (5.13)
S im u la tio n R e su lt H
The sim ulation results are su m m arised in Table 5.2 . Note th at the autoregres
com pared w ith the sim ulation result I, the coverage probability is still under the
nominal level of 95%. The average error for non-Gaussian noise series rem ained
high at about 1 basis point (which is about 4-oyt in actual spot C anadian dollar
da ta in C hapter 4).
55
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Table 5.2: R esults for Forecast Intervals on Sim ulated Series II
N = 500 N = 1000
A R = 3 and M A = 0 A R = 3 and M A = 0
(1 - j r ^ ) ( l - B ) X t = Ut , Ut ~ iV(0 , 1 ) . (5.14)
i= i
Zt = -Q .2 X t + V T - O J P • H t
where H t is defined by
( l - 4 > B ) ( l - B ) H t = Vt , Vt ~ N ( 0 , 1 ) . (5.16)
56
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
S im u la te d S eries 111(b): W ith N o n -G a u ssia n N o ise
i=i
Ut ~ (80% of 1V(0,1) + 20% of Cauchy( 0 , 1)) . (5-17)
Zt = - 0 .2X* + V l - 0 . 22 - H t
where H t is defined by
S im u la tio n R e su lt II I
T he sim ulation results are su m m arised in Table 5.3. In this study, the autore
gressive order isincrease from 3 to 5. The coverage probability for sim ulation
results show decreases as compared with sim ulation results I and II. However,
the average error for non-Gaussian noises axeappears closer to Gaussian noise
series for both sample sizes. T he coverage probability for these sim ulation
57
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Table 5.3: Results for Forecast Intervals on Sim ulated Series III
N = 500 AT = 1000
A R = 5 and M A = 0 A R = 5 and M A = 0
(1 - £ ^ ) ( 1 - B ) X t = Ut , Ut ~ W(0 , 1 ) . (5.20)
i=i
Z t = — 0. 2X t + V l - 0 . 22 • Ht
where H t is defined by
(1 - 1 - B )H t = Vt , Vt ~ W (0, 1) . (5.22)
58
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
S im u la te d S eries IV (b ): W it h N on -G au ssian N o ise
{ l - Y f 4>i B i ) { l - B ) X t = Ut ,
i=i
Ut ~ (80% of i\T(0,1) + 20% of C auchy(0,1)) . (5.23)
Zt = - 0 .2 X * + Vl - 0 .2 2 - Ht
where H t is defined by
S im u la tio n R e su lt IV
T he sim ulation results axe sum m arised in Table 5.4. Note th a t th e autore
gressive order is now 7. T h e coverage probability for the sim ulation results
sim ulation results. As usual, th e coverage probability for the sim ulation results
are lowerth an th e nominal level. But the average errors are alm ost th e sam e
59
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Table 5.4: Results for Forecast Intervals on Sim ulated Series IV
lV = 500 N = 1000
Aft = 7 and M A = 0 AR = 7 and M A = 0
10
where H t is defined by
60
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
S im u la te d S eries V (b ): W ith N o u -G a u ssia n N o ise
10
(1 - £ f c # ) ( l - B ) X t = U t,
i= 1
Zt = - 0 .2 X t + Vl - 0 .2 2 - Ht
where Ht is defined by
S im u la tio n R e su lt V
T he sim ulation results axe sum m arised in Table 5.5. Notice th a t the coverage
probability for sim ulation results are decreasing as the autoregressive order
increases to 10. In fact, the average errors are higher than usual. In this
sim ulation study, we still have th e coverage probability lower than nom inal
level.
61
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Table 5.5: Results for Forecast Intervals on Sim ulated Series V
N = 500 JV = 1000
A R = 10 and M A = 0 A R = 10 and M A = 0
Zt = -Q .2X t + ■Ht
where Ht is defined by
62
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
S im u la te d Series VT(b): W ith N o n -G a u ssia n N oise
(1 - £ t i & X 1 - B )X , = (1 + £ » iB s)Ut ,
j =1 j —1
Zt = -0 .2 X £ + V l - 0.22 - Ht
Yt = \trunc { m in (Z t )} \ + 1 + Z t , (5.36)
where Ht is defined by
S im u la tio n R e su lt VT
T he sim ulation results are sum m arised in Table 5.6. Note that we have the
for sim ulation results axe very sim ilar to sim ulation results I and II. However,
for respective sam ple size, the average errors are almost indifferent for both
63
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Table 5.6: Results for Forecast Intervals on Simulated Series VT
IV = 500 IV = 1000
A R = 2 and M A = 2 AJ2 = 2 and MA = 2
Zt = - 0 .2 X t + Vl - 0 .2 2 - Ht
where is defined by
64
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
S im u la ted S eries VTI(b): W ith N on -G au ssian N o ise
u-E -
b )x , = (i+e
h& W t ,
i= i i= i
Zt = - 0 .2 X t + V l - 0.22 - Ht
where Ht is defined, by
S im u la tio n R e su lt V H
The sim ulation results axe sum m arised in Table 5.7. As we increase b o th the
order for M A and A R to 5, the coverage probability for sim ulation results axe
65
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Table 5.7: Results for Forecast Intervals on Sim ulated Series V II
IV = 500 N = 1000
A R = 5 and M A = 5 A R = 5 and M A = 5
5.8 Discussion
In Table 5.8, it is apparent th a t the coverage probability for all sim ulation results
are lower than th e nom inal level of 95%. T here are five possible reasons for this
low coverage probability: (1) The estim ates taken to be the true values used for
intervals based on bootstrap percentiles m ethod m ay not have performed well due to
th e consequence of nonparam etric inferences. It may have underestim ated the tails
procedure in this study was too low (in this study the sim ulation procedure was
only repeated 200 tim es). (4) T he adjustm ent rule for extrem e points m ight affect
th e results for coverage probability. However, because we are trying to model the
long term relationship for bivariate financial series, it seems reasonable to apply
th e adjustm ent rule to those extrem e points for better m odel analysis. (5) As the
sim ulation results show, the coverage probability decreases as we increase the order
66
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
for M A a n d /o r A R . This m ay be due to com putation errors occurring as th e model
N = 500 N = 1000
T he results obtained when n = 500 do not differ greatly as compared w ith those
obtained when n = 1000. In addition, the sim ulation results show that the integrated
modelling technique is insensitive to the choice of distribution for the noise. Since we
m ust clean up th e extrem e points for each of th e sim ulated series, this m ay explain
there is a slight decrease in coverage probability when we increase the order for A R
67
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
and M A . This also suggests that we m ay want to keep the model simple in order
to save com putation tim e and obtain b e tter results. Again, the integrated modelling
A lthough th e results seem reasonable when using the integrated modelling tech
in future studies.
68
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Chapter 6
Conclusions
This thesis has demonstrated, the usefulness of both classical m ethods and non-
C hapter 2 give the basic ideas for im plem enting techniques in modelling financial
tim e series.
els) provide us with estim ation and forecast interval. However, in financial m arkets,
tim e series are statistically more volatile as compared to other applications. Thus,
simple models alone are unable to solve difficult problems such as exchange rate
the forecast estim ation as compared with th e univariate case. However, th e transfer
function m odel requires the value of X n+ i, which is not available and m ust itself be
forecast in order to forecast Yn+1. The accuracy of the forecasting interval for 3^+i can
be improved by not requiring the estim ation of X n+i, thus reducing th e probability
69
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
T he frequency dom ain methods provide us (w ith the aid of high speed com puter
hardw are), a more efficient way to uncover certain statistical details. Particularly for
the bivariate tim e series, we need to determ ine th e existence of a relationship between
two tim e series in order to further th e data analysis. Together with the classical
techniques, m odem nonparam etric m ethods such as th e moving blocks bootstrap and
neural netw ork provide us with a theoretical basis for the derivation of an integrated
m odelling technique.
the tools to further analyse the bivariate tim e series. The technique utilizes b o th
classical techniques and nonparam etric methods for b etter estim ation and inference.
technique.
In C hapter 5, although the simulation study was only carried out for n=500 and
n=1000, the sim ulated tim e series were generated w ith Gaussian and non-Gaussian
noises. T he sim ulation results showed undercoverage overall, but the integrated
70
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
6.1 Future Research
chapter 3. For the spot Canadian dollar example, we can add oth er variables such
as the dem and an d /o r supply of Canadian dollars, dem and a n d /o r supply of U.S.
dollars, and th e stock indexes to the m odel. The dem and and supply of currency will
help us predict w hether or not future values will soon rise or fall. As for the stock
indices, increase or decrease, this would help us to determ ine w hether the m arket is
perform ing well. O ther possible variables to add axe those pertaining to available
By adding more variables to the model, two approaches to analysing the tim e
series are possible. The first approach would be to utilize m ultiple linear regression
model. Although m ultiple linear regression models are widely used due to th eir
com putational simplicity, there are problems inherent to this approach which need to
m atrix of the explanatory variables before carrying out a m ultiple linear regression
assumed th a t th e errors are i.i.d.N (0, a 2), but this assum ption m ay not be correct.
If the residuals are correlated, one can try fitting a m ultiple linear regression m odel
71
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
with, autocorrelated errors (See Kendall et al. [37]).
pair up the spot Canadian dollar with other variables. However, how do we combine
the information from different pairings of Yt (spot Canadian dollar series) and other
the confidence intervals w ith m ore advanced bootstrap intervals methods such as bias-
network, we m ay achieve a b etter point estim ation and satisfactory probability cov
erage.
72
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Bibliography
[2] Anderson, T .W . [1971]. The statistical analysis o f tim e series. John Wiley &
[3] Beran, R. [1982]. Estim ating sam pling distributions : The bootstrap and
[4] Beran, R. [1988a]. Balanced sim ultaneous confidence sets. Journal o f the
[5] Beran, R. and Ducharme, G .R. [1991]. A sym ptotic theory fo r bootstrap methods
[6] Bickel, P .J. and Freedman, D.A. [1981]. Some asym ptotic theory for th e
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
[8] Box, G.E.P. an d Jenkins, G.M . [1976]. Tim e series analysis : Forecasting and
[9] Brockwell, P.J. and Davis R .A . [1987]. Tim e series : Theory and methods.
[11] Cheng, B. and T itterington, D.M . [1994]. Neural networks : A review from a
[12] Christensen, R. [1991]. Linear models fo r multivariate, time series, and spatial
[15] Efron, B. [1979a]. Bootstrap m ethods: A nother look at the jackknife. The Annals
o f Statistics 7 , 1-26.
[16] Efron, B. [1979b]. Computers and the theory of statistics : Thinking the
[17] Efron, B. [1981]. Nonparam etric estim ates of standard error : T he jackknife, th e
74
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
[18] Efron, B. [1982]. The jackknife, the bootstrap and other resampling plans. Society
[19] Efron, B. and Tibshirani, R .J. [1986]. Bootstrap m ethods for standard errors,
Science 1, 54-77.
[20] Efron, B. and Tibshirani R .J. [1993]. A n introduction to the bootstrap. C hapm an
[21] Franke, J. and Hardle, W . [1992]. O n bootstrapping kem al spectral estim ation.
[22] Freedman, D.A. [1981]. Bootstrapping regression models. The Annals o f Statis
tics 9, 1218-1228.
[24] Funanhashi, K.I. [1989]. On the approxim ate realization of continuous mappings
[25] Gallant, A.R. and W hite, H. [1988]. There exists a neural network th a t does not
Neural Networks, San Diego, CA, Vol 1 (IEEE Press, New York), 657-664.
75
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
[27] Granger, C -W .J. [1992]. Forecasting stock market prices : Lessons for forecasters.
[28] Hall, P. [1986]. O n the num ber of bootstrap simulations required to construct a
New York.
[33] Homik, K., Stinchcombe, M. and W hite, H. [1990]. Universal approxim ation of
an unknown m apping and its derivati ves using m ultilayer feedforward networks.
[34] Homik, K ., Stinchcombe, M., W hite, H. and Auer, P. [1994]. Degree o f approx
[35] Hwang, J .T . and Ding, A.A. [1997]. Prediction intervals for artificial neural
76
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
[36] Jenkins, G.M. and W atts, D.G. [1969]. Spectral analysis and its applications.
Holden-Day, California.
[37] Kendall, M.G., S tuart, A. and Ord, J.K . [1983]. The advanced theory o f statistics.
[38] K im , P.T ., M artin, P., and Kelly, D. [1992]. Unanticipated changes in interest
rate spreads and th eir effect on the Canadian spot exchange rate : A n update.
[39] Kim P.T ., M artin P. and Staley M. [1996]. A neural network approach to
forecasting the Canadian spot exchange rate. Fixed Income Research : Wood
G undy Inc.
[40] Kim P.T . and M artin P. [1996]. On the relationship between the interest rate
spread and the spot Canadian dollar. University of Guelph Lecture Notes.
[41] Kuan, C.M. and W hite, H. [1994]. Artificial neural networks: An econometric
[42] Kuan, C.M . and Liu, T. [1995]. Forecasting exchange rates using feedforward
[43] Kunsch, H.R. [1989]. The jackknife and th e bootstrap for general stationary
[44] Leger, C., Politis, D.N. and Romano, J.P . [1992]. Bootstrap technology and
77
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
[45] Lippm ann, R.P. [1987]. An introduction to computing with neural nets. IE E E
[46] Liu, R.Y. and Singh, K. [1992]. Moving blocks jackknife and bootstrap capture
[47] Ljung, G.M. and Box, G.E.P. [1978]. On a measure of lack of fit in tim e series
[48] Priestly, M.B. [1981]: Spectral analysis and tim e Series. Academic Press, New
York.
[49] Quenouille, M.H. [1956]. Notes on bias in estim ation. Biometrika 43, 353-360.
[50] Rao, V.B. and Rao, H.V. [1993]. C++ neural networks and fuzzy logic. MIS:Press,
New York.
[51] Ripley, B.D. [1993]. Statistical aspects of neural networks. In Networks and
[52] Stacey, D. [1995] Lecture notes : COMP 27-6420 (Spring Semester). Depaxtment
[53] T ibshirani, R .J. [1985]. How many bootstraps ? (Tech. rep. no. 362). Stanford
[54] Weigend, A.S. and Gershenfeld, N.A. (eds) [1994], Time series prediction,
Reading, MA : Addison-Wesley.
78
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
[55] W hite, H. [1989]. Learning in artificial neural networks : A statistical perspective.
[56] W hite, H. [1992]. Artificial neural networks: approximation and learning theory.
79
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
A p p e n d ix A : T h e B ack -P rop agation A lg o rith m
perceptron network model used for financial d a ta analysis. For the given input
defined as
G e (X ) = r„ + £ m, f ( v ' , X + * .)
3=1
where th e activation function /(*) is the logistic function defined in (2.28), and 0 =
the weights and threshold values, and the superscript t is denoted as transpose.
Stacey [52].
1. N o ta tio n s :
80
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
2. F o rw a rd P a s s C o m p u ta tio n : For each input-output pair ( X a, 0 S):
3. B a c k w a rd P a s s C o m p u ta tio n :
ey = Oj{l - Oj)dj
A w ij = a6,-ey
4. I t e r a t i o n : Repeat Step 2 and Step 3 until ey’s are all either zero or at th e
m inim um value.
81
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
5. R e c a ll : T he recall mechanism consists of th e two feed-forward operations.
1. C reate Lh P E values
k
bi = f (5 2 x hVhi + $ f )
h=l
82
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.