Professional Documents
Culture Documents
JOURNAL OF FINANCIAL ECONOMETRICS-2005-Audrino-422-41
JOURNAL OF FINANCIAL ECONOMETRICS-2005-Audrino-422-41
3, 422–441
abstract
The daily term structure of interest rates is filtered to reduce the influence of
cross-correlations and autocorrelations on its factors. A three-factor model is
fitted to the filtered data. We perform statistical tests, finding that factor loadings
are unstable through time for daily data. This finding is not due to the presence
of outliers nor to the selected number of factors. Such an instability problem can
be solved when applying the factor analysis on multivariate scaled residuals,
filtered using a nonparametric technique based on functional gradient descent.
This article analyzes the stability of the daily factor structure of interest rates.
Although daily interest rates exhibit an unstable structure, suitably filtered inno-
vations appear to be well described by three factors with stable loadings. Since
our factor structure is constructed on filtered innovations (independent over time)
and not on raw interest rate data, stable factors cannot be interpreted as level,
slope, and curvature as in most studies. Our use of daily data departs from the
more common use of monthly data. The motivation for our study is that the
stability of factor loadings is necessary for effective management of interest rate
risk. In fact, daily interest rate changes can then be described as functions of
underlying factor scores, which need to be unpredictable to be interpreted as
sources of risk.
Financial support from the National Centre of Competence in Research ‘‘Financial Valuation and Risk
Management’’ (NCCR FINRISK) and from F.A.R. 2004 of the Universtiy of Insubria is gratefully acknowl-
edged. Address correspondence to Autonietta Mira, Department of Economics, University of Insubria, Via
Ravasi 2, I-21100 Varese, Italy, or e-mail: antonietta.mira@uninsubriait.
doi:10.1093/jjfinec/nbi019
ª The Author 2005. Published by Oxford University Press. All rights reserved. For permissions,
please e-mail: journals.permissions@oupjournals.org.
Audrino et al. | The Stability of Factor Models 423
Factor models of the term structure proposed to date assume the stability of
underlying factors and their loadings without testing them. Our study is the first
study to apply a high-dimensional filter, based on functional gradient descent
[see Friedman, Hastie, and Tibshirani (2000) and Friedman (2001)], to remove
autocorrelations and cross-correlations from interest rate innovations. We per-
form formal tests of stability on filtered innovations, finding that three factors are
sufficient to describe them. Our work complements the large number of studies
that in the last decade have provided some insight into the number and the nature
of common factors needed to describe the dynamic evolution of the term structure
1
See for example Stambaugh (1988), Steeley (1990), Carverhill and Strickland (1992), and Litterman and
Scheinkman (1991) for a factor analysis of the U.S. term structure. A critical reexamination of classical
results of factor analysis of interest rates has been proposed by Leikkos (2000).
2
Phoa (2000) and Chapman and Pearson (2001) also found, in their empirical investigations, that the factor
decomposition of the U.S. term structure seems to be robust through time.
3
Analogous results were found by Diebold and Li (2002) in a different setting, that is, the Nelson-Siegel
framework, for the monthly forecasts of the U.S. term structure of government zero-coupon bond yield
levels and changes.
424 Journal of Financial Econometrics
loadings and some dummy variables. We perform tests based on the hypothesis
that the regression coefficients (loadings) in the different subperiods are equal (or,
in other words, that coefficients of the dummy variables are equal to zero). Using
this strategy, we are able to detect changes in factor loadings.
In contrast with the common belief and the empirical results found by Bliss
(1997), Perignon and Villa (2002), and Diebold and Li (2002) for monthly data, our
statistical investigation, for daily U.S. zero-coupon bond yields at 30 different
maturities from 1 year to 30 (from the J.P. Morgan database), leads to the conclu-
sion that factor loadings are not stable through time. The null hypothesis of equal
leptokurtic. On the other hand, interest rate changes are stationary, their distribu-
tion is more leptokurtic, with drastically reduced autocorrelations. However,
autocorrelations are not completely removed by differencing. Excess kurtosis
does not affect factor analysis, but time dependence clearly affects estimation
procedures, since one of the assumptions inherent in standard factor analysis is
that the data under consideration represent random, independent samples from a
multivariate distribution. For all these reasons, we filter, in a first step, the
autocorrelation left in the series of interest rate changes using a nonparametric
VAR-type model. In addition, we present a computationally feasible estimation
where rt,Ti denotes the spot interest rate on day t for maturity Ti. We assume
stationarity of xt,Ti.
Our goal is to filter the autocorrelation of the process {xt} of interest rate
changes by considering the whole term structure dynamics (i.e., looking simulta-
neously at all available maturities), so that a standard factor analysis can be
performed on the filtered data. To this purpose, we consider a nonparametric
VAR-type model in connection with a functional gradient descent (FGD) estima-
tion [see Friedman, Hastie and Tibshirani (2000) and Friedman (2001)] of the
multivariate conditional mean mt ¼ E½xt j F t1 of xt ¼ ðxt,T1 ,:::, xt,Td ÞT , where F t-1
denotes the information available up to time t–1, that is, the s-algebra generated
by fxs ; s t 1g.
In our model, the dynamics of the conditional mean are specified by
T
xt ¼ mt þ xt , mt ¼ mt, 1 ,:::, mt ,d ,
the multivariate conditional mean, mt , and volatility matrix, Vt, overcoming the well
known curse of dimensionality problem. All details about the use of FGD for
estimating mt in the general model of Equation (2) are presented in the next section.
with xt ¼ xt mt ¼ xt Gðxt1
tp Þ. For this reason, a natural loss function is
1 1 d
lV ðy,gÞ ¼ ðy gÞT V 1 ðy gÞ þ logðdetðVÞÞ þ logð2pÞ,
2 2 2
y g ¼ ðy1 g1 ,:::, yd gd ÞT , ð4Þ
4
We assume that
Vt ¼ Dt RDt ,
d ðN1Þ
Dt ¼ diagðst,1, :::,st,d Þ, R ¼ ½rij i,j¼1:
In this model, the parameter rij ¼ corrðxt;Ti , xt,Tj jF t1 Þ equals the constant conditional correlation and
hence 1 rij 1, rii ¼ 1: Moreover, the individual conditional variances s2t,i are approximated by the
general nonparametric function, Fi,
that are also estimated using the FGD methodology; for all details, see Audrino and Trojani (2004). It
follows that the empirical criterion of Equation (3) and the loss function of Equation (4) can be further
simplified as
Audrino et al. | The Stability of Factor Models 427
n
X 1 T 1
logðdetðDt ÞÞ þ ðD1 1 0 0
t xt Þ R ðDt xt Þ þ n d logð2pÞ=2 þ n logðdetðRÞÞ=2 and
t¼pþ1 2
1 1 T 1 1 1 d
lR ðy, gÞ ¼ logðdetðDÞ þ ðD ðy gÞÞ R ðD ðy gÞÞ þ logðdetðRÞÞ þ logð2pÞ,
2 2 2
respectively, where n0 ¼ n p: Estimation of the correlation matrix R can be easily done via empirical
moments of residuals having (previous) estimates Ĝ ¼ ðĜ1 ,:::, Ĝd Þ and D̂t ¼ diagðŝt,1 ,:::,ŝt,d Þ:
428 Journal of Financial Econometrics
X
K
x̂ t ,Ti ¼ LTi, j Ft,j þ Zt, Ti ,
j¼1
where L are the factor loadings, F are the common factor scores, and Zt are the
X
K XX
K
x̂t,Ti ¼ ~t,j þ
aj F bl,j Dlðt,jÞ þ Zt, Ti ,
j¼1 l˛D j¼1
where D is one of the sets (2, 3, 4), (1, 3, 4), (1, 2, 4), or (1, 2, 3).
To check the stability over time of factor loadings, we perform a test to verify
the null hypothesis that the coefficients of the dummy variables are equal to zero,
that is, that the regression coefficients (loadings) in the different subperiods are
equal.
To construct the test statistic, we estimate the robust variance-covariance
P
matrix, , following White (1980), by
X
X
n
ˆ ¼ Z2t, Ti BBT ,
t¼pþ1
Audrino et al. | The Stability of Factor Models 429
~ D1 D2 D3 :
B ¼ ½F
5
Stock and Watson (2002) present a methodology to forecast a macroeconomic time-series variable where a
large number of predictors is summarized by a small number of indexes constructed by principal
components analysis. An approximate dynamic factor model serves as the statistical framework for the
estimation of the indexes and construction of the forecasts. In their model, factor loadings are allowed to
vary over time and are modeled as evolving according to a random walk. However, they prove uniform
consistency of the factors from the principal components analysis only in the case where the number of
predictors is considerably larger than the number of observations. This is not the case in our study.
430 Journal of Financial Econometrics
Figure 2 Three-dimensional plot of positive (left) and negative (right) filtered innovations: the
sample consists of 2330 daily yield data for the time period between January 2, 1986, and May 10,
1995, at all yearly maturities from 1 year to 30 years.
at the 5% level. However, the same result occurs when filtering the data with a
standard diagonal VAR model for the conditional mean in Equation (2).
We perform a robust three-factor analysis of the filtered innovations and
investigate the time robustness of factor loadings. Note that applying a robust
Audrino et al. | The Stability of Factor Models 431
Table 1 Summary statistics for filtered innovations computed with the model
presented in Section 1, based on a multivariate FGD estimation of conditional
mean functions, at some significant maturities.
Maturity Mean St. Dev. Min. Max. MAE RMSE r̂ð1Þ r̂ð20Þ r̂ð50Þ
The sample consists of 2329 daily yield data for the time period between January 2, 1986, and May 10,
1995, where an outlier corresponding to the crash of the 20th of October 1987 is recognized and eliminated.
St. Dev. and r̂ðxÞ mean sample standard deviation and sample autocorrelation at lag x, respectively. MAE
and RMSE are the mean absolute error and the root mean-squared error of the filtered innovations,
respectively.
FA, we are able to take into account the nongaussianity detected in our filtered
innovations.
We report results for a robust three-factor analysis, because this choice is
popular in the term structure literature. Moreover, testing the optimal number
of factors for our data sample using conventional information criteria and
standard rules, such as the number of eigenvalues greater than one or the
elbow rule for the scree plot, we find that three factors are empirically a good
choice. More in detail, when performing a robust principal component analysis
of the filtered innovations, we find two eigenvalues greater than one. The scree
plot for a robust factor analysis also supports the choice of three factors. In
addition, when setting a 99% threshold for the explained cumulative variance
we choose exactly three factors.
In our empirical analysis6 we set n1 ¼ n2 ¼ n3 ¼ 582 and n4 ¼ 583. Results for
a classical investigation of factor loading stability are summarized in Appendix B.
6
Results do not depend on a particular choice of the subperiods. We obtain similar results also when
dividing the data sample in subperiods of different (not necessarily equal) lengths or considering partial
overlapping subperiods, as in Perignon and Villa (2002).
432 Journal of Financial Econometrics
Table 2 Value of test statistics described in Section 2 for all different D’s and a
representative short maturity of 5 years.
Factor 1 Factor 2 Factor 3
= {2, 3, 4}
2 – 1 0.1395 0.8252 2.3964
3 – 1 0 0 0
4 – 1 0 0 0
= {1, 2, 4}
1 – 3 0 0 0
2 – 3 0.0896 0.6339 2.3649
4 – 3 0 0 0
= {1, 2, 3}
1 – 4 0 0 0
2 – 4 0.0872 0.6532 2.6405
3 – 4 0 0 0
Results are reported for a robust three-factor analysis of filtered innovations x̂t using the nonparametric
FGD methodology. One and two stars indicate a rejection of the null hypothesis of equal factor loadings in
the different subperiods at significance levels of 5% and 1%, respectively.
Table 3 Value of test statistics described in Section 2 for all different D’s and a
representative intermediate maturity of 15 years.
Factor 1 Factor 2 Factor 3
= {2, 3, 4}
2 – 1 2.7587 0.5829 0.0148
3 – 1 1.0300 3.1976 0.5701
4 – 1 0.3191 3.7888 0.4456
= {1, 2, 4}
1 – 3 1.0300 3.1976 0.5701
2 – 3 2.0640 4.2859 1.1294
4 – 3 1.8936 0.8013 0.4173
= {1, 2,3}
1 – 4 0.3191 3.7888 0.4456
2 – 4 4.8858 0.9228 0.7338
3 – 4 1.8936 0.8013 0.4173
Results are reported for a robust three-factor analysis of filtered innovations x̂t using the nonparametric
FGD methodology. One and two stars indicate a rejection of the null hypothesis of equal factor loadings in
the different subperiods at significance levels of 5% and 1%, respectively.
How can we interpret these statistical results, also in comparison to the gra-
phical analysis of Appendix B? When performing specific FA in each subperiod, as
in Appendix B, the vector of factor scores spans the space in a different way in each
subperiod. Consequently, factor loadings show similar patterns across different
time periods, even though they are not constant through time. In contrast, in the
tests of Section 3.2, the common factors are estimated from a FA over the whole
period and then fixed. Regressing the filtered innovations on the identified common
factors, we find that factor loadings in the different subperiods are not robust
through time. Therefore a model allowing for time-varying factor loadings should
be used to better understand the behavior of the term structure.
As an alternative, researchers may apply FA on full multivariate scaled
residuals. Clearly, residuals must be obtained by multivariate techniques that are
still computationally feasible in large dimensions. Do multivariate standardized
residuals display a stable factor structure after filtering? In Section 4 we report
results for a parametric VAR-CCC-GARCH model compared with our nonpara-
metric FGD methodology.
To be sure that the factor loadings instability is not due to the selection of an
insufficient number of factors, we also perform the FA for more than three factors.
434 Journal of Financial Econometrics
Table 4 Value of test statistics described in Section 2 for all different D’s and a
representative long maturity of 30 years.
Factor 1 Factor 2 Factor 3
= {2, 3, 4}
2 – 1 0.9446 1.5402 0.1357
3 – 1 0.6449 0.7930 0.1379
4 – 1 2.0811 0.6954 0.7668
= {1, 2,4}
1 – 3 0.6449 0.7930 0.1379
2 – 3 0.3689 0.8659 0.0219
4 – 3 4.3956 1.1992 0.6361
= {1, 2,3}
1 – 4 2.0811 0.6954 0.7668
2 – 4 3.9889 4.8587 1.1748
3 – 4 4.3956 1.1992 0.6361
Results are reported for a robust three-factor analysis of filtered innovations x̂t using the nonparametric
FGD methodology. One and two stars indicate a rejection of the null hypothesis of equal factor loadings in
the different subperiods at significance levels of 5% and 1% respectively.
In particular, a four-factor analysis shows that, as for the case of three factors, the
instability issue remains. In fact, the first, second, and fourth factor loadings are
not constant over time.7
7
Explicit results for a four-factor analysis are available on request.
Audrino et al. | The Stability of Factor Models 435
Table 5 Value of test statistics described in Section 2 for all different D’s and a
representative short maturity of 5 years.
Factor 1 Factor 2 Factor 3
D = {2, 3, 4}
2 – 1 0 0.2958 0 0.7915 0 1.8174
3 – 1 0 0.3471 0 0.6410 0 1.6644
4 – 1 0 0.9461 0 2.2191 0 0.6721
D = {1, 3, 4}
1 – 2 0 0.2958 0 0.7915 0 1.8174
3 – 2 0 0.1178 0 1.6551 0 1.1788
4 – 2 0 1.2142 0 1.7965 0 1.2411
D = {1, 2, 4}
1 – 3 0 0.3471 0 0.6410 0 1.6644
2 – 3 0 0.1178 0 1.6551 0 0.1788
4 – 3 0 1.1244 0 3.0025 0 0.9792
D = {1, 2, 3}
1 – 4 0 0.9461 0 2.2191 0 0.6721
2 – 4 0 1.2142 0 1.7965 0 1.2411
3 – 4 0 1.1244 0 3.0025 0 0.9792
Results are reported for a robust three-factor analysis of multivariate scaled residuals x̂t using the
nonparametric FGD methodology (FGD) and a standard parametric VAR-CCC-GARCH (1,1) model
(VAR). One and two asterisks indicate a rejection of the null hypothesis of equal factor loadings in the
different subperiods at significance levels of 5% and 1%, respectively.
436 Journal of Financial Econometrics
Table 6 Value of test statistics described in Section 2 for all different D’s and a
representative intermediate maturity of 15 years.
Factor 1 Factor 2 Factor 3
D = {2, 3, 4}
2 – 1 1.0359 2.1252 1.5813 0.3351 0.7784 1.7290
3 – 1 0.1015 1.5042 0.7657 0.1866 0.3809 2.3779
D = {1, 3, 4}
1 – 2 1.0359 2.1252 1.5813 0.3351 0.7784 1.7290
3 – 2 2.1852 1.1151 1.2393 0.3903 1.3451 0.8059
4 – 2 1.1948 3.1582 1.5786 0.9014 0.5183 0.4138
D = {1, 2, 4}
1 – 3 0.1015 1.5042 0.7657 0.1866 0.3809 2.3779
2 – 3 2.1852 1.1151 1.2393 0.3903 1.3451 0.8059
4 – 3 1.1307 2.4221 0.8848 0.5795 1.2613 1.2891
D = {1, 2, 3}
1 – 4 0.4527 0.0517 0.0189 1.4746 0.4404 0.6721
2 – 4 1.1948 3.1582 1.5786 0.9014 0.5183 0.4138
3 – 4 1.1307 2.4221 0.8848 0.5795 1.2613 1.2891
ˆ t using the
Results are reported for a robust three-factor analysis of multivariate scaled residuals ˛
nonparametric FGD methodology (FGD) and a standard parametric VAR-CCC-GARCH (1,1) model
(VAR). One and two asterisks indicate a rejection of the null hypothesis of equal factor loadings in the
different subperiods at significance levels of 5% and 1%, respectively.
Tables 5–7 clearly show the strong potential of the nonparametric FGD
technique, used as a filter for the raw interest rate data. In fact, the multivariate
scaled residuals from the FGD fit display a stable factor structure. On no occasion we
reject the null hypothesis that the coefficients of the dummy variables are equal to
zero at the 1% confidence level and only on one occasion at the 5% confidence level.
In contrast, filtering the data using a full parametric VAR-CCC-GARCH model yields
several rejections of the null hypothesis at both the 5% and 1% confidence levels for
short and intermediate maturities. When considering long maturities, the factor
decomposition of the multivariate scaled residuals is stable for both approaches.
This result highlights even more the fact that a stable factor structure of the
interest rate data connot be generally assumed, but it can be obtained if an
appropriate multivariate filtering technique is implemented. It follows the need
of filtering the raw data using nonparametric methodologies that are able to
eliminate all important linear and nonlinear cross-dependencies among the dif-
ferent series. A simple parametric model is not able to perform as well as our
nonparametric model based on FGD. Furthermore, results of our tests indicate
Audrino et al. | The Stability of Factor Models 437
Table 7 Value of test statistics described in Section 2 for all different D’s and a
representative long maturity of 30 years.
Factor 1 Factor 2 Factor 3
D = {2, 3, 4}
2 – 1 0.7421 0.9155 0.1130 0.1912 0.2512 0.3432
3 – 1 0.1411 0 0.6158 0 0.7694 0
D = {1, 3, 4}
1 – 2 0.7421 0.9155 0.1130 0.1912 0.2512 0.3432
3 – 2 0.6542 0.6509 0.4649 0.1881 0.4516 0.2989
4 – 2 0.3167 0.8101 0.6936 0.1996 0.4522 0.3476
D = {1, 2, 4}
1 – 3 0.1411 0 0.6158 0 0.7694 0
2 – 3 0.6542 0.6509 0.4649 0.1881 0.4516 0.2989
4 – 3 0.8915 0 0.1587 0 0.0175 0
D = {1, 2, 3}
1 – 4 1.0993 0 0.9074 0 0.7894 0
2 – 4 0.3167 0.8101 0.6936 0.1996 0.4522 0.3476
b3 – b4 0.8915 0 0.1587 0 0.0175 0
ˆ t using the
Results are reported for a robust three-factor analysis of multivariate scaled residuals ˛
nonparametric FGD methodology (FGD) and a standard parametric VAR-CCC-GARCH (1,1) model
(VAR). One and two asterisks indicate a rejection of the null hypothesis of equal factor loadings in the
different subperiods at significance levels of 5% and 1%, respectively
how the detected instability problem of the factor structure of interest rate data
can be overcome. It is necessary to consider full nonparametric techniques, like
the FGD one, to filer raw interest rate data. Multivariate scaled residuals ensure a
better consistency than simple innovations, filtered only to eliminate cross-corre-
lations and autocorrelations in their conditional means.
5 CONCLUSION
Filtering interest rates so that innovations conform to standard assumptions in
multivariate statistics has become possible. Our use of FGD produces innovations
that are well described by three factors with stable loadings. We develop and
perform tests of the stability of factor loadings. Results support our parsimonious
description of term structure innovations, which appear to be driven by three
variables. On the contrary, our results also show that a standard parametric VAR-
CCC-GARCH model is not sufficiently precise to filter raw interest rate data and
does not yield multivariate scaled residuals displaying a stable factor structure.
438 Journal of Financial Econometrics
where SX ðUi ÞðxÞ denotes the predicted value at x from the base learner S using the
response vector Ui predictor variables X.9
Step 3i (line search). Perform a one-dimensional optimization for the step length,
X
n
ŵm,i ¼ argminw l ðm1Þ ðxt, Ĝm1 ðtÞ þ wĝm,i ðxt1
tp ÞÞ,
V̂t
t¼pþ1
where Ĝm1 ðtÞ þ wĝm,i ðÞ is defined as the function which is constructed by
adding in the ith component only. (Note that the line search guarantees that the
negative log-likelihood decreases in every iteration.)
Step 4 (update). Select the best component as
X
n
im ¼ argmini l ðm1Þ ðxt, Ĝm1 ðtÞ þ ŵm,i ĝm,i xt1
tp :
V̂t
t¼pþ1
Update10
8
Initialization is important to achieve good estimates. As a starting function we propose to use the fit from
a parametric diagonal VAR (1)-CCC-GARCH(1,1) model.
9
The base learner clearly determines the FGD estimates ĜM ðÞ: This should be ‘‘weak’’ (not involving too
many parameters to be estimated) enough not to immediately produce an over fitted estimate at the first
iteration. The complexity of the FGD estimates ĜM ðÞ is increased by adding further terms at every
iteration. We choose decision trees as base learners, since, particularly in high dimensions, they have the
ability to do variable selection by choosing few of the explanatory variables for prediction.
10
It is often desirable to make a base learner sufficiently ‘‘weak.’’ A simple, but effective way to reduce the
complexity of the base learner is via shrinkage toward zero. The update is then replaced by
Obviously this reduces the variance of the base learner by the factor n.
Audrino et al. | The Stability of Factor Models 439
11
Similar to Bliss (1997), we rotate the original solution until factor loadings with a meaningful economic
interpretation are obtained. For instance, for this article the loadings were first rotated so that filtered
innovations at all maturities had approximately the same loading on the first factor. Since any
perturbation to the first factor then affects all maturities equally, this factor is interpreted as a level
shift.
440 Journal of Financial Econometrics
values of the curves in the four subperiods are approximately the same, even
if in the first subperiod the peak is reached after those of the other
subperiods.
Summarizing, from a graphical point of view, we find that factor loadings
seem to present a consistent pattern across different subperiods despite some
minor variation. This is similar to what is reported by Bliss (1997).
Received March 3, 2004; revised November 29, 2004; accepted March 8, 2005.
REFERENCES
Audrino, F., and P. Bühlmann. (2003). ‘‘Volatility Estimation with Functional Gradient
Descent for Very High-Dimensional Financial Time Series.’’ Journal of Computational
Finance 6, 1–26.
Audrino, F., and F. Trojani. (2004). ‘‘Accurate Yield Curve Scenario Generation Using
Functional Gradient Descent.’’ Unpublished manuscript, USI, Lugano, Switzerland.
Bliss, R. R. (1997). ‘‘Movements in the Term Structure of Interest Rates.’’Federal Reserve
Bank of Atlanta Economic Review 82, 16–33.
Audrino et al. | The Stability of Factor Models 441