Professional Documents
Culture Documents
Skewing Up Correlation: Understanding Correlation Skew in Equity Derivatives
Skewing Up Correlation: Understanding Correlation Skew in Equity Derivatives
Valer Zetocha
Abstract
The goal of this paper is to ll in the gap between the multi-asset equity derivatives market and the mathematical
models developed to treat the correlation skew problem. We analyze the structure of the equity correlation market and
explain which parts are aected by correlation skew. Using a Jacobi instantaneous correlation model as a vehicle to
gain intuition, we approach the issue from both the perspective of historical realized correlation as well as from the
market-implied one. We study the eect of correlation skew on the typical structures traded in the correlation market.
The main result is the calibration procedure for such a model. Once calibrated, the model provides a link between
various correlation derivatives, most importantly between the basket dispersion trades and Worst-Of structures, the two
main vehicles of correlation business.
Keywords: correlation skew, stochastic correlation, Jacobi process, index skew, implied correlation
implied correlation
3M correlation
0.6 0.65
0.6
0.4 0.55
0.5
0.2
0.45
0 0.4
-0.4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 0.4 70 % 80 % 90 % 100 % 110 % 120 % 130 %
3M return strike
Figure 1: 3M realized correlation vs 3M historical log-returns for Figure 2: SX5E implied correlation skew as of 13 Jan 2014.
SX5E from Jan 2009 to Jan 2014
Miscellaneous trades:
rainbow options, individu- the stochasticity of correlation is the correlation skew: the
ally capped baskets, out-performance options, etc, are tendency of stocks to correlate more when the market falls.
all o-shoots of the above trades and can be either Let us have a look at the problem from two dierent per-
included in the above categories or they represent a spectives: historical distribution of realized correlation on
small part of the business. one hand and correlation implied by the market on the
other.
Taking a naive approach of using models with constant
correlation one arrives at observing three dierent lev- 2.1. Realized correlation
els of implied correlation traded in the market: correla-
tion swaps (CS) trade at discount to index dispersion (ID) It has been pointed out repeatedly in the literature that
which trade at discount to worst-of trades (WO). The dif- the correlation rises when the market falls5 . Instead of dis-
ference between ID and CS levels can in large part be playing the co-movement of price and correlation, Figure
explained by component weights2 , dividends3 and ow4 . 1 shows 3M realized correlation for SX5E vs the index 3M
However, that does not apply for WO implied correlation performance. Although loaded with noise, the tendency
levels. In this case, as well as in various other miscel- of realizing a higher correlation when the market falls ma-
laneous trades, one needs a stochastic correlation model terializes in the negative slope of about -0.4 of the linear
that explains the correlation skew. regression line.
The rest of the paper is organized as follows. After a
2.2. Implied correlation
brief description of the correlation skew problem in Sec-
tion 2, we present a model that explains it in Section 3. Correlation skew is best observed in index vanilla op-
Section 4 focuses on calibration while Section 5 describes tions. The market charges higher implied correlation for
an extension of the model to pair-wise non-constant cor- lower strikes, as seen in Figure 2.
relation matrix. We nish the paper with an example of a Like the volatility skew itself, the correlation skew is
WO put. most pronounced at shorter maturities, having a decaying
structure as shown in Figure 3.
2. Correlation skew
3. Modeling stochastic correlation
Correlation between stocks is far from constant. How-
ever, what can (and did) hurt correlation books more than Various dierent approaches have been developed in re-
2 CS is equally-weighted while indices are usually capitalization-
cent years along the following lines:
weighted. The heavier members of the index tend to be more cor- Local correlation: an umbrella for various approaches
related than the less-important ones. Indeed, looking at the last 5
years of data in yearly intervals, the top 20 stocks of SX5E have con- where the correlation depends on stock prices in a
sistently realized a higher correlation than that of full SX5E index. similar fashion to local volatility. See eg, Langnau
The dierence stayed between 1% and 6%. The non-equal weights of (2010), Reghai (2010).
SX5E index then eectively increase the index implied correlation.
This eect can be of the order of 2-3%. Non-Gaussian copula: a quick model for products de-
3 CS do not adjust the closing prices for the amount of dividend
on ex-dividend dates. This, together with a random distribution of pending on one maturity only. See eg, Lucic (2013).
ex-dividend dates of members of an index, eectively decreases the
correlation realized by a swap. The eect is of the order of 1%.
4 The client ow in CSs is always in the same direction: hedge 5 See for example the co-movements of S&P500 index price and
funds sell correlation, so there is a downwards pressure on CS levels. monthly realized correlation exhibited in Boortz (2008)
2
0.12
0.06
long-term solution, ρ̄, and it has a volatility coecient of
0.04
β . The process starts at ρ0 .
0.02
The Wiener process Ŵ t which drives the instantaneous
0
0 1 2 3 4 5 6 correlation is itself correlated with the rest of Brownian
maturity (years) drivers:
Figure 3: Term structure of SX5E implied correlation skew dened < dŴt , dWt‘j >= ρCS δ(t − t‘)
as the dierence in implied correlation for strikes 90% / 100%. As
of 13 Jan 13. The constant correlation-stock correlation, ρCS , brings
to ve the number of parameters of the model. The cor-
: usually
Stochastic instantaneous correlation models relation skew eect appears when ρCS is set to a negative
based on Jacobi process as in van Emmerich (2006), number.
Ahdida and Alfonsi (2012). The correlation matrix for N+1 processes then looks as
follows:
Variance-covariance models : based on Wishart
1 ρt ρt · · · ρt ρCS
stochastic process, see for example Da Fonseca (2006) 1 ρt · · · ρt ρCS
: exten-
1 ··· ··· ···
Common jump and/or market factor models
(3)
sions of Merton model to multi-asset settings with
1 ρt ρCS
common jumps whose sizes are correlated negatively
1 ρCS
with the brownian motions driving the stocks.6 1
For the purpose of the current paper we need a model The minimal correlation that can be input in the above
that has the ability to match the market prices and that matrix while still preserving the positive semi-deniteness
is consistent with historical distributions. We would also is ρmin and denes the left boundary for the Jacobi process
welcome intuitiveness and calibrability, and it would be a (2). A simple computation shows that ρmin is dependent
great advantage if the model were robust and quick. The on the number of stocks N and ρCS :
next section explains a model we believe satises our re-
quirements. −1 + N · ρ2CS
ρmin = (4)
N −1
3.1. The model The fact that the upper left N×N sub-matrix of (3) is
The model we shall use is based on a Jacobi process for pair-wise constant makes the Choleski decomposition a lin-
instantaneous correlation7 . ear operation. In fact the step of correlating the Brownian
We set the model within a N-asset local volatility frame- increments in the Monte Carlo simulation can be made
work. Each stock follows a process of type8 so ecient that it requires just a minimal extra compu-
tational time. This proves crucial in implementing the
dSti model, as the running time is indeed a make-or-break is-
= rdt + σ i (St , t)dWti (1) sue.
Sti
where σti (St , t) is the local volatility and Wti are the stan- 3.2. Properties
dard Wiener processes correlated as follows: The model has a number of properties that will prove
very useful when we calibrate it. The following results are
< dWti , dWt‘j >= ρt δ(t − t‘) based on van Emmerich (2006), Delbaen and Shirakawa
(2002).
6 The author is not aware of any publication that would address 1. Moments are analytically tractable. In particular, the
directly the correlation skew using common jumps. mean and variance are easily computable as follows:
7 For more details on Jacobi processes see eg, van Emmerich
(2006), Delbaen and Shirakawa (2002). E[ρT ] = ρ̄ + (ρ0 − ρ̄)e−αT (5)
8 For simplicity we suppress dividends. They can be added in a
straight-forward manner. V ar[ρT ] = U1 + U2 + U3 − E[ρT ]2 (6)
3
where the U -terms are dened as follows: Term structure of expected value of average realized correlation
β2 0.8
α = 0.5
U1 = (ρ̄ − ρmin )(1. − ρ̄) + ρ̄2 α = 1.0
2α + β 2 0.75 α = 5.0
ρ0 − ρ̄ 2
β (ρmin + 1.) + 2αρ̄ e−αT
U2 =
correlation mean
0.7
α+β 2
0.65
1 n 2α(ρ − ρ̄)
0
U3 = 2
α(ρ0 − ρ̄) 0.6
2α + β α + β2
1 0.55
−β 2 ( (ρmin + 1.) − ρ0 )
2 o 2 0.5
−β 2 (ρ0 − ρmin )(1. − ρ0 )) e−(2α+β )T 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
maturity
0.1
0.04
12 78.0% 9.2%
0.02
24 79.8% 6.0%
0
0 0.5 1 1.5 2 2.5 3
36 79.3% 5.3%
maturity (years)
Table 2: Correlation mean and St.Dev. of various pairs of stocks for
running time window of 3M to 3Y. Computed from 10Y and 5Y of
Figure 6: Term structure of St.Dev. of correlation computed from data as indicated.
historical SX5E correlation vs model results for various values of
mean-reversion speed α. β has been chosen to match the magnitude
of St.Dev. of SX5E historical correlation. For completeness, we used lower volatility of correlation.
ρ0 = ρ̄ = 0.5, ρCS = −0.5 and N = 50. The historical correlation is
based on last 10 years of data.
4.3. Correlation skew
on the number of stocks and maturity11 . For example, the As we have already seen in section 2.2, index options do
St.Dev. of Pearson coecient for 50 stocks and a corre- have correlation skew priced in. Let us have a look how
lation of 50% is displayed in the third column of Table the model compares with this market.
1. Figures 7 and 8 show the volatility skew of SX5E as of
As a result, to compare the model with the historical 15 Jan 2013 for 3M and 1Y maturity. We also plotted the
distribution, one has to adjust the variance obtained from skew one would obtain with constant ATM correlation as
the model by the variance of Pearson for the same maturity well as the one predicted by the model with ρCS = −0.5%,
and number of stocks.12 α = 5 and maximum volatility of correlation allowed by
Figure 6 shows the St.Dev. of SX5E historical correla- the Feller condition. We can see that constant correlation
tion along with the results of the model for various values accounts for just a fraction of the full volatility skew. The
of α, adjusted for the above-mentioned eect. Unlike in model matches quite well the upside volatilities; however,
Figure 5, we chose β such that the magnitude of St.Dev. it comes short of bridging the full gap at low strikes, es-
given by the model matches the historical one. pecially at very short maturities. It is our belief that the
One important characteristic worth pointing out from short-term skew can not be matched completely without
the start is the downward-sloping shape of the function: the involvement of jumps15 .
the St.Dev. decreases with the increase in measurement One interesting point to make is that the market im-
window 13 . It is obvious from the graph that one needs plies a much higher volatility of correlation than we see
α of the order of 5 to obtain a shape comparable to the in the historical distribution. In fact one needs to take
historical term structure. That means that the correlation the maximum β allowed by Feller's condition in order to
has high volatility14 and it is pulled towards the mean with come close to the market-implied skew. This is a very im-
a time scale of the order of 3 months. portant point, which we will use in setting the calibration
As we see from Table 2, decreasing St.Dev. with time guidelines of the model.
is generally true for pairs of individual stocks. As ex- Although falling short of completely explaining the
pected, stocks with correlation close to the limits have short-term skew, we consider the model to satisfy our
needs as the typical trades range from 1Y to 5Y.
11 In other words, measuring a correlation swap in a Monte Carlo
Calibration guidelines
simulation would have some non-zero distribution width even for
constant correlation. The analysis of the previous subsections brought us to
12 We do this by computing the variance of a correlation swap given
the following guidelines for calibrating the model:
by the Jacobi model via a full Monte Carlo simulation. However, the
simple addition of variances works surprisingly well too. the level of correlation, ρ0 and ρ̄, should be obtained
13 This is a common feature to all of the mean-reverting processes
with a stable limit distribution. The same would hold for St.Dev. of from IDB CvC quotations
variance swaps for example, but not for the average of a stock price.
That would mean that the straddles on correlation and on variance α should be set to 5
would have decreasing price with increasing maturity. As a contrast
this is clearly not the case for asian options on a stock.
14 but not as high as to reach the limit imposed by the Feller con- 15 The problem of short-term skew is similar in nature to the one
dition. experienced in the Heston model.
6
SX5E 3M skew: market vs constant correlation vs model matrix. The key to the extension is the linear perturbation
0.26
market
of a correlation matrix16 dened as follows.
0.24
constant correlation
model
Let us consider a N × N correlation matrix M and let us
0.22 denote by Ωmax , Ωmin the pair-wise constant correlation
index implied volatility
0.2
matrices with maximum and minimum average correla-
tion:
0.18
0.16 1 1 1 ··· 1
0.14
1 1 ··· 1
Ωmax = 1 ··· 1
0.12
··· 1
0.1 1
70 % 80 % 90 % 100 % 110 % 120 % 130 %
strike
1 ρmin ρmin ··· ρmin
Figure 7: 3M SX5E volatility skew compared to the model with 1 ρmin ··· ρmin
constant and stochastic correlation. The model parameters were Ωmin
= 1 ··· ···
ρ0 = ρ̄ = 0.45, ρCS = −0.5, α = 5, β = 1.66, with a St.Dev. of
correlation of 10.2%
··· ρmin
1
SX5E 1Y skew: market vs constant correlation vs model
0.24 where ρmin is dened in (4). For any 0 ≤ ≤ 1 the
market
constant correlation following perturbations of the matrix M are also positive
0.22 model
semi-denite matrices whose average correlation has been
increased/decreased:
index implied volatility
0.2
0.18
M+ = M + (Ωmax − M ) (11)
M− = M + (Ωmin − M ) (12)
0.16