Download as pdf or txt
Download as pdf or txt
You are on page 1of 33

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/264718376

Arbitrage-free SABR

Article in Wilmott · January 2014


DOI: 10.1002/wilm.10290

CITATIONS READS

46 13,317

4 authors, including:

Patrick S Hagan Andrew Lesniewski


XBTO Inc. City University of New York - Bernard M. Baruch College
106 PUBLICATIONS 2,587 CITATIONS 98 PUBLICATIONS 2,315 CITATIONS

SEE PROFILE SEE PROFILE

Diana Woodward
Saint Thomas University
28 PUBLICATIONS 867 CITATIONS

SEE PROFILE

All content following this page was uploaded by Patrick S Hagan on 12 October 2017.

The user has requested enhancement of the downloaded file.


ARBITRAGE FREE SABR
PATRICK S. HAGAN AND DEEP KUMAR
PATHAGAN1954@YAHOO.COM
MATHEMATICS INSTITUTE, 24-29 ST. GILES, OXFORD, UK OX1 3LB, UK
AVM LTD, BOCA RATON, FL

Abstract. Smile risk is often managed using the explicit implied vol formulas developed for the SABR model. These
asymptotic formulas are not exact, and this can lead to arbitrage for low strike options. Here we provide an alternate method
for pricing options under the SABR model: We use asympotic techniques to reduce the SABR model from two dimensions to
one dimension. This leads to an effective one dimensional forward equation for the probability density which has the same
asymptotic order of accuracy as the explicit implied vol formulas. We obtain arbitrage-free option prices by numerically solving
this PDE. The implied volatilities obtained from the numerical solutions closely match the explicit implied volatility curves,
apart from ultra-low strikes. For very low strikes, the implied absolute (normal) vol dips downwards, closely matching market
observations. We also show how negative rates can be accomodated by replacing the   factor with ( + ) .

1. The SABR model. European option prices are often quoted by using the normal model. In this
model the forward asset price ̃ () follows the process

(1.1a) ̃ =   ̃ 

and the (forward) price of European calls and puts works out to be
µ ¶ µ ¶
  − √  −
(1.1b)  = [ − ]  √ +      √ 
      
µ ¶ µ ¶
  − √  −
(1.1c)  = [ −  ]  √ +     √ 
      

 
Here  (·) is the cumulative normal distribution and (·) is the Gaussian density. Both  and  are
increasing functions of the volatility   . Consequently, European option prices can be quoted by stating the
implied normal vol (aka absolute vol or bps per year vol), the unique value of   that yields the option’s
dollar price when substituted into these formulas.
Alternatively, in Black’s model the forward asset price is modeled by

(1.2) ̃ =   ̃ ̃ 

Here, too, there is a one-to-one relation between the European option price and the Black (log normal) vol
  , so option prices can be quoted in terms of   Implied Black vols   and implied normal vols  are
equivalent, and the mapping between  and   is well understood, so in this article we focus on the normal
vols   .
If the normal model correctly described the behavior of the asset price, then the same implied volatility
  would correctly price options with different strikes  and times-to-expiry   . In practice, matching
market prices requires substantially different implied vols for options with different strikes and expiries,
  =  (   ). For a given expiry   , the implied vol   as a function of  is the options’s smile, or
skew. Handling the smile and skew judiciously is critical to an options desk, since the risks of options at
all different strikes  have to be consolidated before the risks can be hedged efficiently. Although offsetting
risks of options with different expiries is less common, handling the volatility surface (the dependence of  
on both  and   ) is also be important for correctly pricing path dependent options, such as mid-curve
options.
1
The need for handling smile and skew risk effectively led to the development of the SABR models[1-
SABR]. These are models of the form

(1.3a) ̃ = ̃(̃ )̃1 


(1.3b) ̃ =  ̃̃2 

with

(1.3c) ̃1 ̃2 = 

where most commonly ( ) is taken to be   . In [1-SABR], singular perturbation techniques were used
to analyze this class of models in the small volatility regime. To carry out the expansion systematically, a
small parameter  was introduced,

(1.4a) ̃ = ̃(̃ )̃1 


(1.4b) ̃ =  ̃̃2 
(1.4c) ̃1 ̃2 = 

The model was analyzed in the   1 limit, and then  was set to 1 in the final result1 . This analysis was
used to obtain explict formulas for the implied vols of European options under the SABR model. There are
now several variants of these formulas [1-5, SABR variants], all correct through (2 ), but our favorite is
µ ¶ ½ ∙ ¸ ¾
 ( − )  2 1 ( ) − () 2 − 32 2 2
(1.5a)   () = · · 1 +  + 4  +     + · · · 
R   0 ()  − 24
 ( 0 )

Here  = ̃ (0)  = ̃(0) are today’s value of the foward price and the volatility,   is the time to exercise,
Z Ãp !
   0 1 − 2 +  2 −  + 
(1.5b) =  () = log 
  ( 0 ) 1−

and the geometric factor is


à Z p !, ÃZ !2
 
1 ( )() 0 1 0
(1.5c)  = log   
 −  ( 0 ) 0
 ( )

Although there are simpler formulas, this one seems to be the most robust.
The classic SABR model is the special case ( ) =   . For this case the implied vol formula reduces
to
µ ¶ ½ ∙ ¸ ¾
 (1 − ) ( − )  2 1   −  2 − 32 2 2
(1.6a)   () = · · 1 +  + 4  +     + · · · 
 1− −  1− ()  − 24
where
Ãp !
  1− −  1− 1 − 2 +  2 −  + 
(1.6b) =  () = log 
 1− 1−

1 Although this appears inconsistent, it is equivalent to non-dimensionalizing the problem, expanding in the low volatility

regime, and then re-writing the answers in terms of the original dimensioned variables.
2
and
2 µ 1−

(1 − ) 2  −  1−
(1.6c) = 2 log ( ) 
( 1− −  1− ) (1 − ) ( − )

Two problems have developed using these explicit implied vol formulas. First, for some low strike, long
dated options, the explict implied vol formulas can lead to arbitrageable prices; this is discussed in the next
section. Second, the SABR model has a barrier wherever ( ) = 0; this is at  = 0 in the classical SABR
model. There is an () thick region next to this boundary, a region in which the original asymptotic
analysis does not pertain. In the current, ultra-low rate environment, this region has a substantial influence
on pricing, especially with the advent of zero strike options.
In this article we address both problems. We use singular pertubation methods to show that the reduced
density,
n ¯ o
¯
(1.7) (  ) = Prob   ̃ ( )   +  ¯ ̃ () =  ̃() =  

satisfies the effective forward equation


h¡ ¢ 2 i
(1.8a)  = 12 2 2 1 + 2 + 2  2  2  Γ( )( −)  2 ( ) for   


where ( ) and Γ( ) are given by


Z 
1  0 ( ) − ( )
(1.8b) ( ) ≡  Γ( ) ≡ 
  ( 0 )  −

This reduction is not exact, but is accurate through (2 ), the same accuracy as the explicit implied vol
formulas. Our approach is to solve the PDE numerically to obtain the probability density (    ) at the
exercise date   ; the call and put prices are then obtained by integrating to find the expected value of the
payoffs.
To simplify notation, let
p 1 2
(1.9) ( ) = 1 + 2( ) + 2  2  2 ( ) 2  Γ( )( −) ( )

Then the effective forward equation is


£ ¤
(1.10a)  = 12 2 2 2 ( )   for   

Note that ( ) = 0 where, and only where, ( ) = 0


Solving the problem numerically requires a finite domain, min    max . The appropriate lower
boundary min is usually the barrier, where (min ) = 0, and thus (min ) = 0. In some situations it can
make sense to use other boundaries. For example, one may wish to assume that the forward ̃ ( ) cannot
go below zero, in which case the boundary has to be set at min = 0, regardless of whether (0) = 0. Or
the barrier may be more than 5 or 6 standard deviations below the forward, so there is no reason to extend
the grid all the way to the barrier. The upper barrier max simply needs to be large enough so that there
is a negligible probability of ̃ (  ) reaching or exceeding max ; usually this is 4 to 6 standard deviations
above the forward  . See Appendix D, which gives  in terms of the number of standard deviations above
(or below) 
The appropriate boundary conditions are investigated in Appendix B. There it is found that we must
use absorbing boundary conditions,
+
(1.10b) 2 ( ) → 0 as  → min  2 ( ) → 0 −
as  → max 
3
in order for ̃ ( ) to be a Martingale. Clearly the appropriate initial condition is

(1.10c)  (  ) → ( −  ) as  → 

Equations 1.10a - 1.10c form a well posed problem for the density  (  ). Since the boundaries are
absorbing, the probability density will develop -functions at the boundaries min , max in addition to the
continuous density  (  ). Crudely speaking,
⎧ 
⎨  ( )( − min ) at  = min
(1.11) (  ) =  (  ) for min    max 
⎩ 
 ( )( − max ) at  = max

The rate at which probability accumulates at min and max is determined by the flux reaching the barriers
from the interior:
 £ ¤
(1.12a) = lim+ 12 2 2 2 ( )  
  →min

 £ ¤
(1.12b) = − lim+ 12 2 2 2 ( )  
  →max

We shall see that this ensures that the combined probability in  ( ), (  ), and  ( ) totals 1. Clealy
the initial condition is

(1.12c)  ( ) → 0  ( ) → 0 as  → 

Of course if the probability  ( ) at the upper boundary is significant, max should be increased.
At first glance, the delta function at min looks unusual. However, consider a situation in which the
( ) has been modified to prevent the forward  from becoming negative, so that the effective volatility
½
( ) for   
(1.13) ̃( ) =
(0 ) for 0    

is used in place of ( ). This situation is analyzed in Appendix B. We find that for  ¿ 1, there is a thin
boundary layer in 0     which has very high densities (  ). In the limit  → 0 the total probability
in 0     is  ( ), and yet  2 ( )(  ) goes to zero at we approach  from above:
Z 
(1.14) lim+  2 ( )(  ) → 0 (  ) →  ( ) as  → 0
 → 0

We believe that this is representative of the general situation: when the forward ̃ ( ) is near enough to
the boundary, other mechanisms come into play; after all, there must be some reason that ̃ ( ) doesn’t cross
the boundary. For example, interest rates in zero, near zero, and even slightly negative rate environments,
can be expected to behave differently than rates in more moderate regimes. If we put in a detailed model for
these boundary mechanisms, and looked on a fine enough scale, then the delta functons should be resolved
into structured boundary layers. Yet the total probability in the layer should match  ( ), as this is required
by conservation, and as we move away from the boundary, (  ) should transition into the solution of
1.10a with the absorbing boundary condition 1.10b, as this is required for ̃ ( ) to be a Martigale.
We use a moment-preserving Crank-Nicholson scheme to solve the PDE 1.10a - 1.10c numerically
for       , while simultaneously integrating the ODE’s 1.12a - 1.12c. Once we have obtained
4
 (  )  (    ) and  (  ) we can obtain the option prices for all strikes  by integration:
Z max
(1.15a)  (   ) = ( − )  (    ) + (max − )  (  )

Z 

(1.15b)  (   ) = ( − min ) (  ) + ( −  )  (    )
min

If we wish, we can then find the implied normal vol that matches these prices at each . That is, by solving
the PDE once, we obtain the smile for all  at   .
We shall find that the implied volatilities obtained from these numerical solutions closely match the
explicit implied volatility formulas, apart from ultra-low strikes. For very low strikes, the implied absolute
(normal) vol dips downwards, closely matching market behavior.

We also show show how negative rates can be accomodated by replacing   with ( + ) to move the
barrier below zero.
2. Arbitrage using the explicit formulas for the SABR model. The explicit implied vol formulas
make the SABR model easy to implement, calibrate, and use. These implied volatility formulas are usually
treated as if they are exactly correct, even though they are derived from an expansion which requires that
√ √ √
   ,    and | − |    be not too large. The unstated argument is that instead of treating
these formulas as an (accurate) approximation to the SABR model, they should be treated as the exact
solution to some other model which is well approximated by the SABR model.
This is a valid viewpoint as long as the option prices obtained using the explicit formulas 1.5a-1.5c for
  () are arbitrage free. There are two key requirements for these prices to be arbitrage free [6, Dupire].
The first is call-put parity, which holds automatically since we are using the same implied vol   () for
both calls and puts. The second is that the probability density implied by the call and put prices needs to
be positive. To explore this, note that the call and put values can be written quite generally as
Z ∞ Z 
(2.1)  (   ) = ( − ) (    )  () = ( −  ) (    )
 −∞

where (    ) is the probability density at the exercise date (including any delta functions). Clearly,

2 2
(2.2) 2
 () =  () = (    ) ≥ 0
  2
For the explicit implied vol formulas 1.5a-1.5c to represent an arbitrage free model, then, we need

2  2 
(2.3)  (    ()   ) =  (    ()   ) ≥ 0 for all 
 2  2 
That is, there cannot be a “butterfly arbitrage.”
It is not terribly uncommon for this requirement to be violated for very low strike options for sufficiently
large   . The problem does not appear to be the quality of the call and put prices obtained from the explicit
implied vol formulas, because these usually remain quite accurate. Rather, the problem seems to be that
implied volatility curves are not a stable representation of option prices for low strike options. It is very
easy to find nearly identical, reasonable looking, volatility curves   (), for which some of the curves are
arbitrage free and others violate the arbitrage-free constraint of eq. 2.3.
This is illustrated in Figure 2.1. There the smile   () obtained from the explicit formula 1.5a-1.5c is
graphed (explicit) along with a very similar smile (arb free) obtained from the arbitrage free procedure. These
two smiles look very similar, and lead to nearly identical option prices, as shown in Figure 2.2. Differentiating
5
Implied normal vol
100%

80%

60%

40% Explicit
Arb Free
20%

0%
0 0.5 1 1.5 2 2.5 3
Strike
Fig. 2.1. The implied normal vol for the SABR model for  = 35%  = 025  = −10% and  = 100% Shown are
 () from the explicit formula and from the arb-free approach for   = 1yr.

Time Value (% of notional)


16%
14%
Explicit
12%
Arb Free
10%
8%
6%
4%
2%
0%
0 0.5 1 1.5 2 2.5 3
Strike

Fig. 2.2. Call and put values from the SABR model for  = 35%  = 025  = −10%  = 100%. Shown are the put
prices (for    ) and call prices (for    ) obtained from the explicit formulas and the arb-free approach for   = 1yr.

these option prices with respect to the strike  yields the probability densities shown in Figure 2.3. The
“explicit” smile   () leads to negative probabilities for ultra low strikes, and so is not arbitrage free, whilst
the second curve has only positive probabilities, and is arbitrage free.
Using implied Black (log normal) vols   () instead of implied normal vols does not help. Figure 2.4
compares the implied Black vols from the explict formulas for   () with those from the arb free approach.
There is no obvious way to discern that one curve leads to arbitrage free prices, while the other does not.
6
Probability density
1.5
Explicit
1.0
Arb Free
0.5

0.0
0 0.5 1 1.5 2 2.5
‐0.5

‐1.0
Strike

Fig. 2.3. Probabiity density for the SABR model for  = 35%,  = 025  = −10% and  = 100% Shown are the
densities obtained from the explicit formulas for  () and from the arb free approach for   = 1yr.

Implied Black vol


200%
175%
Explicit
150%
125% Arb Free
100%
75%
50%
25%
0%
0 0.5 1 1.5 2 2.5 3
Strike
Fig. 2.4. The implied log normal volatility from the SABR model at   = 1yr for  = 35%  = 25%  = −10%
 = 100% Shown are the implied volatilities from the option prices obtained from the explicit normal curve   () and the
arb free curve.

3. Arbitrage free pricing.

3.1. The effective forward equation. Here we present an alternative pricing approach which is
arbitrage free and retains the (2 ) accuracy of the original SABR analysis. We believe that variations of
this approach have been used by other firms [7, Anderson, 8, the French team].
Consider the probability density that ̃ ( ) =  and ̃( ) =  at time  , given that we start at ̃ () =
7
 and ̃() =  at time :
(3.1a) n ¯ o
¯
(  ;   )  = Prob   ̃ ( )   +    ̃( )   +  ¯ ̃ () =  ̃() =  

This density  satisfies the Fokker-Planck equation (the forwards Kolmogorov equation),
£ ¤ £ ¤ £ ¤
(3.1b)  = 12 2 2  2 ( )   + 2  2 ( )   + 12 2  2 2   for   
In Appendix A we define the reduced (marginal) probability density,
Z ∞
(3.2) (  ) = (  ;   )
0

which is the probability density that ̃ ( ) =  at time  regardless of the value of ̃( ). Although  is also
a function of the backwards variables   , for clarity we have omitted explicitly showing this dependence.
In appendix A we use singular perturbation techniques to analyze the Fokker-Planck equation 3.1b. This
analysis shows that the marginal density satisfies the PDE
£ ¤
(3.3a)  = 12 2 2 2 ( )   for   

through (2 ), where


£ ¤ 2
(3.3b) 2 ( ) = 1 + 2( ) + 2  2  2 ( )  Γ( )( −)  2 ( )
and where( ) and Γ( ) are defined by
Z 
1  0 ( ) − ( )
(3.3c) ( ) ≡  Γ( ) ≡ 
  ( 0 )  −
This effective forward equation reduces the dimensionality from two space dimensions ( and ) to one
space dimension ( only), while retaining the same order of accuracy as the original SABR analysis [1].
Note that we have added a  superscript to denote that this is the continuous part of the density.
For the special case of ( ) =   , we have
 1− −  1−   − 
(3.4) ( ) ≡  Γ( ) ≡ 
 (1 − )  −
3.2. Boundary conditions. The SABR model has an innate barrier where ( ) = 0, or, equivalently,
where ( ) = 0. Traditionally this barrier is at  = 0, as it is for ( ) =   , but currently there is an
active debate over whether rates can be negative, and if so, how negative they can become. Appendix E

explores alternative models for ( ), such as ( + ) , which puts the barrier at −.
We solve the PDE numerically, which requires a finite domain min    max . It is natural to place
the lower boundary at the barrier, but not essential, and there may well be situations in which a different
boundary makes more sense. So we do not assume that min is necessarily at the barrier. max should be
chosen to be large enough so that the boundary doesn’t affect the pricing appreciably.
Boundary conditions are examined in Appendix B. Since there may be a net probablity current at the
boundaries, and we are not considering models in which the forward can leave the domain, we need to allow
for probability accumulating at the boundaries. I.e., we need to allow -functions in the probabiliy density
at min and max :
⎧ 
⎨  ( )( − min ) at  = min
(3.5) (  ) =  (  ) for min    max 
⎩ 
 ( )( − max ) at  = max
8
Here the superscript  is being used to denote the continuous part of the density.
The total probability has to be 1,
Z max
(3.6)  ( ) +  (  ) +  ( ) = 1
min

for all  , so
( Z )
max
   
(3.7)  ( ) +  (  ) +  ( ) = 0
 min

Substituting 3.3a for  and integrating leads to

 £ ¤ ¯max  ( )


(3.8) + 12 2 2 2 ( )  ¯ + = 0
 min 
Thus, conservation of probability requires that

 £ ¤
(3.9a) = lim+ 12 2 2 2 ( )  
  →min

 £ ¤
(3.9b) = lim− − 12 2 2 2 ( )  
  →max

Similarly, for ̃ ( ) to be a Martingale, we need its expected value to be fixed:


n ¯ o Z ∞
¯
(3.10)  ̃ ( )¯ ̃ () =  ̃() =  = min  ( ) +   (  ) + max  ( ) = 
min

Therefore,
Z max
 ( )  ( )
(3.11) min +   (  ) + max = 0
 min 

Substituting 3.3a for  and integrating by parts twice and using equations 3.9a, 3.9b leads to
¯max
(3.12) 2 ( ) ¯min = 0

So we must require absorbing boundary conditions,


+
(3.13a) 2 ( ) → 0 as  → min 
(3.13b) 2 ( ) → 0 −
as  → max 

to ensure that the ̃ ( ) to be a Martingale.


In summary, we solve
£ ¤
(3.14a)  = 12 2 2 2 ( )   for min    max 

with the boundary condition


+
(3.14b) 2 ( ) → 0 as  → min  2 ( ) → 0 −
as  → max 
9
for       . The probabilities at the boundary are given by

 £ ¤
(3.14c) = lim+ 12 2 2 2 ( ) 
  →min

 £ ¤
(3.14d) = lim− − 12 2 2 2 ( ) 
  →max

and the initial conditions are

(3.14e)  (0) = 0  (  ) → ( −  )  (0) = 0 as  → + 

3.3. Option pricing. In Appendix C we sketch out a Crank-Nicholson scheme [NumRecInC] for solving
3.14a - 3.14e for  (  ),  ( ) and  ( ). This scheme conserves probability and the expected value of
̃ ( ), so equations 3.6 and 3.10 remain exactly true for the numerical solution. Once we have solved these
equations numerically, the call and put prices are obtained by integrating:

(3.15a)  (   ) =  −  for   min


Z max
(3.15b)  (   ) = ( − ) (    ) + (max − )  (  ) for min    max

(3.15c)  (   ) = 0 for   max

(3.15d)  (   ) = 0 for   min


Z 
(3.15e)  (   ) = ( −  ) (    ) + ( − min )  (  ) for min    max
min
(3.15f)  (   ) =  −  for   max

The conservation of probablity 3.6 and Martingale proprerty 3.10 show that call-put parity holds exactly for
the numerical solution:

(3.16)  (   ) −  (   ) =  − 

Since the effective forward equation has only one space dimension, solving the PDE is essentially instan-
taneous. Moreover, solving these equations for       yields the option prices for all strikes . Thus
the implied normal vols   () for all strikes  can be obtained by solving the PDE once.
The maximum principle for parabolic equations [Protter & Weinberg] guarantees that  (  ) ≥ 0 for
min    max , and that  ( ) and  ( ) are increasing, and hence positive. For fine enough grids,
the numerical solutions will also be non-negative. Since call-put parity is also satisfied, the numerical option
prices are arbitrage free [6, Dupire].
4. Discussion.
4.1. Results. Singular perturbation techniques can be used to analyze the effective forward equation,
£ ¤
(4.1a)  = 12 2 2 2 ( )   for   

where
¡ ¢ 2
(4.1b) ( ) = 1 + 2 + 2  2  2  Γ( )( −)  2 ( )
10
with
Z 
1  0 ( ) − ( )
(4.1c) ( ) ≡  Γ( ) ≡ 
  ( 0 )  −

From this analysis we can obtain explicit option prices, and these prices can be used to find explicit formulas
for the implied volatility   (). Away from the boundaries min and max , this analysis would lead to
the same explicit implied vol formulas 1.5a-1.5c as before, at least if we continue to work through (2 ).
Thus, away from the boundaries, the “arbitrage free” implied volatility (obtained by numerically solving the
effective forward equation) should match the explicit implied volatility formulas to within (2 ). This is
indeed our experience. Even in relatively extreme cases, such as Figure 2.1, the “arbitrage free” and explicit
implied volatility smiles match closely, except when the strike  or the forward  gets too close to the
boundary at min .
For values of  within () of the lower boundary min , a non-negligible percentage of paths that would
have reached  hit the boundary. This creates an () thick boundary layer at min , in which the explicit
implied volatity formulas do not pertain. Instead, as the strike approaches min , the implied normal volatility
  () bends towards zero. See Figure 2.1
Figure 4.1 shows the effect of the boundary layer on the normal volatility of at-the-money options. As
today’s forward  decreases to within () of min , an increasing percentage of the paths reach the boundary
before the expiry date, which reduces the ATM volatility. Seeing the “knee” in this graph, one might naively
believe that the market switches from a normal regime to a log normal regime when the forward is sufficiently
small. This is an illusion; this graph comes from the SABR model with  = 0 and  = 0, which is a stochastic
normal model. The reduction is caused solely by the boundary.

ATM normal vol vs. forward


50%

40%

30%

20%

10%

0%
0 0.25 0.5 0.75 1 1.25 1.5
Forward

Fig. 4.1. The at-the-money implied normal vol   () at   = 1yr for the arbitrage free SABR model with  = 35%
 = 0  = 0and  = 100%.

Below are the smiles   () obtained for different values of the forward  , using the same SABR
parameters in each case.
Market data for at-the-money swaption volatilities exhibit this “knee”: Historical studies comparing
ATM normal vols with forward rates show that when the forward rate is above a critical value, the ATM
11
Smiles at different forwards
1.2
1.0
0.8 0.1
0.6 0.5
0.4 1
0.2 1.5
0.0 2
0 0.5 1 1.5 2 2.5 3 3.5
Strikes

Fig. 4.2. The smiles  () at   = 1yr for different values of the forward  for the SABR model with  = 35%  = 0
 = 0% and  = 100%.

normal vols are reasonably constant; for forward rates below the critical value, the ATM vols decrease linearly
with the rate. See figure ??.

Figure caption: The normal vols for 1Y into 1Y at the money swaptions vs the forward swap rate.
Shown is the historic data for 2002 through 2012 for USD, GBP, EUR, and JPY swaptions. Also shown is
the implied volatility obtained from the SABR model for  = 65%  = 025  = 0%, and  = 75%.
Typically this knee is explained as the market switching from normal to log normal behaviour in ultra-
low rate environments. This belief is often reinforced by using the explicit implied vol formulas to calibrate
the SABR model. Calibrating the explicit implied vol formulas to observed smiles can lead to relatively high
values of  and/or  for low forward rates. Since high values of  and  push up the high strike vols, this
can create significant mis-pricing for instruments which are sensitive to high strikes, like constant maturity
caps, floors, and swaps. To counter this, some firms have chosen to use
½
 for   0
(4.2) ( ) =
0 for   0
in place of   . We believe that our approach provides a more natural explanation, since the knee occurs
automatically, without requiring gross changes between the SABR parameters for low and moderate rate
environments.
4.2. Hedging. The coefficients in the effective forward equation 4.1a - 4.1c depend on the current
forward  as well as  . This means that
· there is no obvious “effective backwards equation” equivalent to the effective forward equation;
· the effective forward equation is not the Fokker-Planck equation (forward Kolmogorov equation) for some
one dimensional Ito process.
I.e., there is not an effective one dimensional local volatility model corresponding to the effective forward
equation. This should have been anticipated as the SABR model was created because of perceived short-
comings of local volatility models[1, SABR].
12
When the SABR model is calibrated to market data, it is very difficult to distinguish between  and
; both control the skew. If we fix , say, and calibrate the rest of the parameters, the quality of the fit is
usually pretty much independent of the particular value of  chosen. This is illustrated in Figure 4.3. There
we have chosen  = 0,  = 12  and  = 1, and calibrated the SABR model to the same market data for all
three cases, which yields the following set of SABR parameters.
 318% 329% 351%
 0 05 1
 −183% −455% −644%
 0777 0867 0985
Although the tail of the smiles are somewhat different, all three sets of parameters seem to fit the actual
market data.

Calibrated smiles
80%
60% 0

40% 0.5
1
20%
data
0%
0 0.5 1 1.5 2 2.5 3
Strikes

1
Fig. 4.3. The SABR model calibrated to the same market data for  = 0  = 2
, and  = 1. Because  can largely
compensate for , all three fits are well within market noise

The conventional delta risk is calculated by shifting the current forward  and keeping the current
volatility  fixed:
(4.3)  →  + ∆  → 
If the delta risk is calculated in the conventional way, then the delta depends on the particular value of 
used. This is shown in Figure 4.4. There we have calculated the conventional delta of a call option as a
function of the strike  for the same three sets of SABR parameters. Even though all three sets lead to
essential the same smile (especially for strikes which are not too extreme), we see that the different choices
of  have led to different values of delta. This means that if we hedge our positions using the conventional
delta, choosing a poor  may lead to a poor hedge, even though it may lead to a superb fit of the market
data.
This issue led to an alternative approach for calculating delta hedges. Since ̃ and ̃ are correlated,
when ̃ changes then ̃ changes as well, at least on average. It is argued that accounting for this shift should
result in a better hedge [Bartlett]:
(4.4)  →  + ∆  →  + ∆
13
Conventional delta
1.00
0.80
0
0.60
0.5
0.40
1
0.20
0.00
0 0.5 1 1.5 2
Strikes

Fig. 4.4. Conventional delta,   as a function of the strike  for  = 0,  = 12  and  = 1. For each , the other
three parameters   and  are chosen to match the arb free SABR smile to the market data.

To compute the amount of the  shift, we re-write the SABR model as

(4.5a) ̃ = ̃(̃ )̃1 


n p o
(4.5b) ̃ =  ̃ ̃1 + 1 − 2 ̃ 

where ̃1 and ̃ are independent. This implies that


 p
(4.6) ̃ = ̃ + 1 − 2  ̃̃
(̃ )

Therefore, changes in ̃( ) can be split into two independent components, one caused by the changes in
̃ ( ), and one due to the idiosyncratic changes in the volatility ̃( ). Accordingly, the delta hedge should
be calculated with respect to the scenario [Bartlett]

(4.7)  →  + ∆ →+ ∆
( )

Figure 4.5 shows this alternative delta as a function of the strike  for the same three sets of SABR
parameters used above. We calculated these deltas by simply bumping the  and  values input into the
pricing code according to 4.7. We see that this alternative delta is nearly independent of the particular value
of  chosen. This has proven true for all cases we have investigated: as long as the SABR model fits market
data decently, the alternative delta is nearly independent of the particular values of  or  used in the fitting.
Apparently this new delta depends mostly on the actual market smile, and not how the smiles and skews
are represented in the model.
It is generally accepted that hedges based on the alternative delta are superior to the conventional delta
hedge when a desk is only using delta to hedge. When a desk is hedging both delta and vega, which delta
hedge is used is irrelevent provided one doesn’t double count when putting on the vega hedge.
Appendix A. Derivation of the effective forward equation.
14
Alternative delta
1.00
0.80
0
0.60
0.5
0.40
1
0.20
0.00
0 0.5 1 1.5 2
Strikes

Fig. 4.5. Alternative delta,   +[( )]   for  = 0,  = 12 , and  = 1. For each , the other three parameters
  and  are chosen to match the arb free SABR smile to the market data.

Here we analyze the SABR model


(A.1a) ̃ = ̃(̃ )̃1 
(A.1b) ̃ =  ̃̃2 
(A.1c) ̃1 ̃2 = 
in the limit  ¿ 1, using singular perturbation methods to derive the effective forward equation.
Recall that (  ;   ) is the probability density that ̃ ( ) =  and ̃( ) =  at time  , given
that ̃ () =  and ̃() =  at time . Define the moments
Z ∞
(A.2a) () (  ;   ) =  (  ;   )
0
n ¯ o
¯
=   ( ̃ ( ) −  ) ¯ ̃ () =  ̃() =  

Clearly the zeroeth moment (0) is the probability density of being at  at time  ,
(A.2b) (  ) = (0) (  ;   )
regardless of the value of ̃( ).
The density  satisfies the Fokker-Planck equation
£ ¤ £ ¤ £ ¤
(A.3a)  = 12 2  2 ( )2    + 2  ( )2    + 12 2  2 2   for all   
with the initial condition
(A.3b)  = ( −  )( − ) for all  → + 
Now
Z ∞£ ¤ £ ¤ ¯=+∞
(A.4a) ( )2     = ( )2   ¯=0 = 0
0
Z ∞
£ 2 ¤ £ 2 ¤ ¯=+∞
(A.4b)     =    ¯=0 = 0
0
15
for all  . This just states that there is no probability flux across the boundaries at  = 0 and  = ∞; i.e.,
that probability is conserved. Integrating the Fokker-Planck equation across all  now yields

h i
(0)
(A.5)  = 12 2  2 ( )(2) for   


I.e., the evolution of the reduced density (0) depends on the second moment (2) .
Under the SABR model these moments satisfy the backward Kolmogorov equation
() () ()
(A.6a)  + 12 2 2  2 ( )  + 2 2 ( )  + 12 2  2 2 
()
=0 for   

subject to the condition

(A.6b) () (  ;   ) →  ( −  ) as  →  − 

We will successively transform this equation order-by-order until all the  derivatives are negligibly small.
This effectively reduces the problem from two dimensions ( and ) to one dimension ( only). Instead
of constructing explicit asymptotic solutions to the resulting one dimensional problem, as was done in the
original SABR paper [1, SABR], here we seek to write (2) in terms of (0) . This provides the “constituitive
law” needed to close the “conservation law” A.5, which is then the effective forward equation. Throughout
we work through (2 ), neglecting higher order terms.
Since the backwards equation is autonomous, the moments () only depend on

(A.7)  =  − 

and not on  or  seperately. We first change independent variables from  to


Z 
1  0
(A.8a) = 
  ( 0 )

For clarity, we also introduce

(A.8b) () = ( )

Then
 −1  −1     
(A.9a) −→ =  −→ −
 ( )  ()     

½ ¾
2 1 2  0 () 
(A.9b) 2
−→ 2 2 2 −  
    ()  2 () 

½ ¾
2 1 2  2 1 
(A.9c) −→ − + + 
  ()    2  

2 2 2  2 2 2 2 
(A.9d) 2
−→ 2
− + 2 2+ 2 
       
16
and
1
(A.9e) ( −  ) = (( )) = ()
(0)

The backwards equation now becomes


¡ ¢  0 () ()
(A.10a) ()
 =
1
1 + 2 + 2  2  2 () 1
 − 2  
2 () 
¡ ¢³ ´
+  + 2  2  −()  + 
()
+ 12 2  2 2 ()
 for   0

with the initial condition


−1
(A.10b) () (   ) → () as  → 0+ 
(0)

Accordingly, we define ̂() (   ) by

−1 ()
(A.11) () (   ) = ̂ (   )
(0)

Then ̂() (   ) satisfies

() 1
¡ ¢ ()  0 () ()
̂ = 2 1 + 2 + 2  2  2 ̂ − 12  ̂
¡ ¢ ¡ () ¢
(A.12a) () ()
−  + 2  2  ( − 2) ̂ −  + 2  2  ̂
n o
() ()
+ 12 2  2 2 ̂ + 2( − 1)̂ + ( − 1)( − 2)̂() for   0

with

(A.12b) ̂() (   ) → () as  → 0+ 

To leading order in ()the equation and initial condition for ̂() (   ) are

(A.13a) ̂() = 1 ()


2 ̂
(A.13b) ̂() (   ) → () as  → 0+ 

which are independent of  to leading order. Therefore, if we were to expand


() () ()
(A.14) ̂() (   ) = ̂0 (  ) + ̂1 (   ) + 2 ̂2 (   ) + · · · 
() ()
then the leading term ̂0 (  ) would not depend on  as indicated. Consequently, the terms 2  2 ̂ 
() ()
2  2 2 ̂ and 2  2 ̂ are actually no larger than (3 ) Since we are only working through (2 ), we
drop these terms, obtaining

() 1
¡ ¢ ()  0 () ()
̂ = 2 1 + 2 + 2  2  2 ̂ − 12  ̂
¡ ¢ ()
(A.15) ()
−  + 2  2  ( − 2) ̂ − ̂
()

+ 12 2  2 ( − 1)( − 2)̂() for   0


17
We now define  () (   ) by
p
(A.16a) ̂() = ()(0) ()

Then
½ ¾
p  0 ()
(A.16b) ̂() () 1
= ()(0)  + 2   

½ µ ¶ ¾
p  0 () 00 0 0
() () 2 2 1 1  ()
(A.16c) ̂ = ()(0)  +   +   2 −4 2  
  
½ ¾
()
p ()  0 () 1  0 () 1  0 () 2
1
(A.16d) ̂ = ()(0)  + 2   + 2   + 2   + ( ) 
  
So
¡ ¢ () ¡ ¢ 0
(A.17a) () = 1
21 + 2 + 2  2  2  − ( − 2)  + 2  2  () + 12 2  ()
½ µ ¶¾
1 2 2 1 2 0 2 2 1
00 0 0
3 
+ 2   ( − 1) ( − 2) − 2   ( − 1) +  4 −8 2  ()
  
() 0
− − 12 2 2 () 

with

(A.17b)  () (   ) → () as  → 0+ 

Inspection of eqs. A.17a, A.17b shows that if we were to expand


() () ()
(A.18)  () (   ) = 0 (  ) + 1 (  ) + 2 2 (   ) + · · · 

then  () would be independent of  until (2 ), as indicated. Thus, the last two terms in eq. A.17a are
both asymptotically smaller then (2 ), and can be neglected. Hence, we can write
¡ ¢ () ¡ ¢ 0
(A.19a) () = 1
2 1 + 2 + 2  2  2  − ( − 2)  + 2  2  () + 12 2  ()
½ µ ¶¾
0 00 0 0
1 2 2 1 2  2 2   
+ 2   ( − 1) ( − 2) − 2   ( − 1) +  4 1 3
−8 2  () 
  
with

(A.19b)  () (   ) → () as  → 0+ 

through (2 )
There are no longer any derivatives with respect to  in eq. A.19a. Therefore  can be treated as a
parameter instead of as a variable. I.e., the problem has been reduced to one spatial dimension, at least
through (2 ). In [1, SABR] we constructed explicit asymptotic solutions to eqs. A.19a, A.19b. Here we
take a different approach: We note that for  = 2,
¡ ¢ (2) 1 2 0
(A.20a) (2) = 1
2 1 + 2 + 2  2  2  + 2   (2)
µ  ¶
1 2  0 (2) 2 2 1
00
00
− 2    +   4 − 38 2  (2) 
  
18
whilst the equation for the  = 0 can be written as
h¡ ¢ i 0
(A.20b) (0) = 1
2 1 + 2 + 2  2  2  (0)
+ 12 2  (0)

µ  ¶
0 00 0 0
1 2  (0) 2 2  3 
+ 2    +   4 1
−8 2  (0) 
  
Both satisfy the same initial condition

(A.20c)  (0) → ()  (2) → () as  → 0+ 


¡ ¢
If the term + 12 2  ( 0 )  (0) in eq. A.20b had the opposite sign, then 1 + 2 + 2  2  2  (0)
would satisfy the equation for  (2) , namely ¡ eq. A.20a,2 through (2 ). Since it also satisfies the initial
¢ (0)
condition for  , we could conclude that 1 + 2 +     and  (2) are identical, at least through
(2) 2 2

(2 ). Pursuing this idea, we consider


¡ ¢ 2
(A.21a)  (   ) = 1 + 2 + 2  2  2  Γ  (0) (   )

where Γ matches − 0 ()() to leading order,

 0 ()
(A.21b) Γ=− {1 + ()} 
()

A precise choice of Γ will be made later. Since 2 Γ = (3 ) and 2 Γ = (3 ), eq. A.20b shows that 
satisfies
¡ ¢ 0
(A.22a)  = 1
2 1 + 2 + 2  2  2  + 12 2  
µ  ¶
0 00 0 0
1 2  2 2 1  3 
− 2    +   4 −8 2 
  

through (2 ), with

(A.22b)  → () as  → 0+ 

This is identical to the PDE and initial condition for  (2) , so uniqueness allows us to conclude that  and
 (2) are the same through (2 ):
2 ¡ ¢
(A.23)  (2) (   ) =  (0) (   ) Γ 1 + 2 + 2  2  2 

We now chase back through the transformations, and noting that  0 ()() is − 0 ( ) we obtain

© ª 2
(A.24a) (2) (  ;   ) = 2 (0) (  ;   ) 1 + 2 + 2  2  2 + Γ( −)

through (2 ), where

(A.24b) Γ =  0 ( ) {1 + ()} 

Substituting equation A.24a into the conservation equation, A.5, we obtain


h¡ ¢ 2
i
(0)
(A.25)  = 12 2 2 1 + 2 + 2  2  2 + Γ( −)  2 ( )(0) for   

19
This is the effective forward equation for (  ) ≡ (0) (  ;   )
This is the effective forward equation. In the derivation we could have used Γ =  0 ( ) or  0 ( ) or even
0
 ([ +  ] 2) or any other reasonable choice without losing the (2 ) accuracy of the final result. However,
higher order analysis suggests that using an average value is more accurate than  0 ( ) or  0 ( ). This is
why we have chosen
( ) − ( )
(A.26) Γ=
 −
in the text. The examples seem to indicate that this is a good choice.
Appendix B. Boundary conditions.
To simplify notation, let
¡ ¢ 2
(B.1) 2 ( ) = 1 + 2 + 2  2  2  Γ( )( −)  2 ( )

so the effective forward equation is


£ ¤
(B.2)  = 12 2 2 2 ( )   for min    max 

Note that ( ) can be zero where, and only where, ( ) = 0 The change in the total probability in any
interval 1    2 is
Z 2
 £ ¤ ¯2 £ ¤ ¯1
(B.3) (  ) = 12 2 2 2 ( )  ¯ − 12 2 2 2 ( )  ¯ 
 1
so clearly the probability flux at any point  is
£ ¤
(B.4) ( ) = − 12 2 2 2 ( )  

It is natural to place the lower boundary at the barrier, where ( ), and hence ( ), is zero. So let us
first consider the cases where (min ) = 0 with

(B.5) ( ) ∼ const · ( − min ) as  → min 

Barriers have been studied extensively in stochastic processes and PDEs [Feller, others, Hagan  − ].
If 0    12 , it is known that min is a regular boundary. Paths can both enter and leave the barrier,
and it is theoretically possible for probability to diffuse through the barrier, reaching the “forbidden region”
  min . We do not consider any models in which paths reach the region /  min below the boundary.
Any flux of probability from the interior   min to the boundary min must accumulate as a delta function
at the boundary:

(B.6a) (  ) =  ( )( − min ) at  = min 

Conservation requires that the accumulation of probability balances the flux,

 £ ¤
(B.6b) = lim+ 12 2 2 2 ( )  
  →min

For regular boundaries we also need to prescribe a boundary condition at min . Typically this boundary
condition would be absorbing,
+
(B.7a) 2 ( )(  ) → 0 as  → min 
20
no flux,
£ 2 ¤ +
(B.7b)  ( )  → 0 as  → min 

or even mixed
£ 2 ¤ +
(B.7c)  ( )  − 2 ( )(  ) → 0 as  → min 

To determine the correct boundary condition, note that the expected value of ̃ ( ) has to be constant,
n ¯ o Z ∞
¯
(B.8)  ̃ ( )¯ ̃ () =  ̃() =  = min  ( ) +  (  ) = 
min

for ̃ ( ) to be a Martingal. Therefore,


½ Z ∞ ¾
 
(B.9) min  ( ) +  (  ) = 0
 min

Substituting B.2 for  and B.6b for  ( ), and integrating by parts shows that
½ Z ∞ ¾ Z ∞
 
£ 2 ¤
(B.10) min  ( ) +  (  ) = 12 2 2  ( )  
 min min
= lim+
− 12 2 2 2 ( )(  )
 →min

Therefore the requirement that ̃ ( ) be a Martingale means that (  ) must satisfy absorbing boundary
conditions at min ,

(B.11) lim
+
2 ( )(  ) = 0
 →min

If 12    1, then min is an exit boundary. In this case some paths reach the barrier in finite time,
but no paths leave the barrier. Therefore there is a finite amount of probability at the boundary min ,

(B.12a) (  ) =  ( )( − min ) at  = min 

and it accumulates according to the flux,

 £ ¤
(B.12b) = lim 12 2 2 2 ( )  
 +
 →min

For exit boundaries, the probability density automatically satisfies the absorbing boundary condition
+
(B.12c) 2 ( )(  ) → 0 as  → min 

as is well known [Feller]. Theoretically no boundary condition is needed at min , since the absorbing boundary
condition occurs automatically. In practice, however, the absorbing boundary condition should be applied
explicitly when solving the effective forward equation numerically, since most numerical finite difference
schemes engender a slight amount of numerical dispersion, meaning that (min ) is effectively slightly
positive. Even if we could develop and employ a dispersion-free finite difference scheme, applying this
boundary condition would be redundant, and not lead to any contradictions.
21
If  ≥ 1, the barrier at min is an inaccessible, or natural, boundary. No paths can reach the boundary,
and the probability and flux both go to zero near the boundary,
£ 2 ¤ +
(B.13) 2 ( )(  ) → 0 1
2  ( )  → 0 as  → min 
In theory, no delta function is needed at min , since no paths reach the boundary. In practice, due to
numerical dispersion, small amounts of probability reach the boundary. By incorporating a delta function
at the boundary,
(B.14a) (  ) =  ( )( − min ) at  = min 
one can keep track of this probability,
 £ ¤
(B.14b) = lim+ 12 2 2 2 ( )  
  →min

This ensures that probability is exactly conserved. Even if we had a dispersion-free numerical scheme, it would
just result in  ( ) being exactly zero, so the  function would be redundant, but not erroneous. Similarly,
we keep the absorbing boundary conditions at min , even though it should be satisfied automatically.
In summary, whenever there is a barrier at min , we use absorbing boundary bounary conditions
+
(B.15a) 2 ( )(  ) → 0 as  → min 
and use a delta function
(B.15b) (  ) =  ( )( − min ) at  = min 
with
 £ ¤
(B.15c) = lim+ 12 2 2 2 ( )  
  →min

in all cases.
There may be situations in which it makes sense to place the boundary min at a point where (min ) 6=
0. For example, one may wish to put the boundary at min = 0, regardless of whether (min ) is zero or
not. In this case we must still use B.15a - B.15c at the boundary: Since we are not allowing any paths to go
below min , we have to allow a delta function at min with the probability at min increasing according to
B.15c, and to preserve the Martingale property of ̃ ( ) we need to use the absorbing boundary condition
B.15a
The upper boundary max should be set high enough so it has no appreciable effect on option prices;
typically setting max to be roughly 4 to 6 standard deviations above the forward suffices. This requires
Z max
1  0 2
(B.16a)  (max ) = = sinh  (cosh  +  sinh )
  ( 0 ) 
with

(B.16b)  = 12  · (4 to 6)   
|as shown in Appendix D. Although the boundary condition is irrelevent if max is chosen large enough, we
find it cleanest to treat the boundary at max the same as the boundary at min : We allow a delta function
at max 
(B.17a) (  ) =  ( )( − max ) at  = max 
22
where
 £ ¤
(B.17b) = − lim− 12 2 2 2 ( )  
  →max

This ensures that probability is conserved exactly, and by examining the size of  ( ), we can determine
whether the boundary max needs to be increased. We also use absorbing boundary conditions,

(B.17c) 2 ( )(  ) → 0 −
as  → max 

which ensures that ̃ ( ) is exactly a Martingale.


To summarize, let us write the density as
⎧ 
⎨  ( )( − min ) at  = min
(B.18) (  ) =  (  ) for min    max 
⎩ 
 ( )( − max ) at  = max

where the superscript  is being used to denote the continuous part of the density. Then  (  ) satisfies
the boundary value problem
£ ¤
(B.19a)  = 12 2 2 2 ( )   for min    max 

with the boundary conditions


+
(B.19b) 2 ( ) → 0 as  → min 
(B.19c) 2 ( ) → 0 −
as  → max 

for       , and the initial condition

(B.19d)  (  ) → ( −  ) as  → +

The probability at the boundaries is

 £ ¤
(B.19e) = lim+ 12 2 2 2 ( ) 
  →min

 £ ¤
(B.19f) = − lim− 12 2 2 2 ( ) 
  →max

and the initial conditions are

(B.19g)  (0) = 0  (0) → 0 as  → + 

B.1. Boundary layer analysis. We believe that the delta function  ( ) arises because when the
forward ̃ ( ) is near enough to the boundary, other mechanisms come into play. After all, there must
be some reason that ̃ ( ) doesn’t cross the boundary. To show how this could come about, consider the
effective forward equation
h i
(B.20a)  = 12 2 2 ̃2 ( ) for 0    ∞

(B.20b)  = ( −  ) at  → 0
23
where ( ) has been modified near the boundary:
½
() for 0    
(B.20c) ̃( ) = 
( ) for   

In the limit  → 0, we will recover the absorbing boundary condition and the delta function  ( ).
In the region 0    , the new effective forward equation is
µ ¶2
1  £ 2 ¤
(B.21a)  = 2 ()    for 0    

(B.21b) =0 at  → 0

The boundary at  = 0 is an inaccessible (natural) boundary, and requires no boundary condition. At


 = ,

(B.22a) (  − ) = (  + )

µ ¶2
 £ ¤ £ ¤
(B.22b) () lim−  2 (  )  = lim+ 2 ( )(  ) 
  →  →

since the probability  and the flux must be continuous.


Define  ( ) as the total amount of probablity in 0    ,
Z 

(B.23)  ( ) = (  )
0

We note that
Z  µ ¶2 Z  µ ¶2
 ( )  £ 2 ¤  £ ¤
(B.24) =  (  ) = ()      = () lim  2   
 0  0   →−

so
 ( ) £ ¤
= lim+ 2 ( )(  )  
  →

In the limit  → 0, we have a finite probability  ( ) in an infinitely thin region; i.e., a delta function:

(B.25a) (  ) =  ( )( )

with
 ( ) £ ¤
(B.25b) = lim+ 2 ( )(  )  
  →0

To investigage the effective boundary condition, define the Fourier transform of ,


Z ∞
(B.26) ̃(  ) = (  )− 
0

Then
h 2 i
(B.27a)  ̃ − ̃ = 0 for 0    
2 
24
where the constant  is

(B.27b)  = 12 2 2 2 ()  0

The general solution to ?? is


1 2
(B.28a) ̃(  ) = 1 () () + 2 () () 

where
p
−3 ± 1 + 4 2 
(B.28b)  12 = 
2
The integral
Z 
(B.29) () 2 
0

is infinite when Re {} ≥ 0. (It suffices to consider the region Re {} ≥  for any constant ; and then
using analytic continuation. See [CKP].) Therefore 2 () = 0 as ̃(  ) must be integrable, and thus
1
(B.30) ̃(  ) = 1 () () for 0    

Since

(B.31) ( + ) = ̃( − ) = 1 ()

and
£ ¤ h i
(B.32) lim+ 2 ( )(  )  = (())2 1 () ()2+ 1
 → 
2
= (2 +  1 ) (()) 1 ()

we have
2 £ ¤
(B.33) 1
2 (())2 (  + ) = lim+ 2 ( )(  )  
2 +  1  →
In the limit that  → 0, the right hand side goes to zero, and we obtain the absorbing boundary condition:

(B.34) lim 1 (())2 ( ) = 0


→0 2

Appendix C. Moment preserving finite difference schemes.


We require our numerical scheme to conserve probability and the first moment exactly, so that
Z max
(C.1a)  ( ) +  (  ) +  ( ) = 1
min
Z max
(C.1b) min  ( ) +   (  ) + max  ( ) = 
min

This ensures call-put parity, and provided that  (  ) ≥ 0 for all  , it also ensures that the numerical
solution itself represents an exactly arbitrage free model.
25
To simplify notation, define
¡ ¢ 2
(C.2a)  (  ) = 12 2 2 2 ( ) = 12 2 2 1 + 2 + 2  2  2  Γ  2 ( )

where

Z 
1  0 ( ) − ( )
(C.2b) ( ) =  Γ= 
  ( 0 )  −

We also set  = 0 without loss of generality.


C.1. Grid generation. The integration domain is

(C.3) min    max 

We discretize  so that

(C.4a) max = min + 

and define
¡ ¢
(C.4ba)  ≡ min +  − 12  for  = 0 1   + 1

to be the midpoints of the intervals from ( − 1)  to . In this process we adjust  slightly so that  occurs
exactly at the midpoint of it’s interval:
¡ ¢
(C.4c)  ≡ 0 = min + 0 − 12 

This allows us to implement the initial conditions in an exactly bias-free manner.


Let  =  (  ) be the probability density at  =   =  . Specifically, define
Z min +
1
(C.5)  =  (  0 ) 0 for  = 1  
 min +(−1)

so that  is the total probability in the   grid cell at time step . We usually use around 200 to 500
points for our grid, and divide 0      into 30 to 100 timesteps.
C.2. Finite difference scheme. We integrate the effective forward equation using a Crank-Nicholson
scheme, which averages explicit and implicit centerred difference equations[8, NumRecInC]. Not only is this
scheme unconditionally stable, it is second order accurate in time.
To advance from timestep  to  + 1 we need to solve
 © +1 +1 ª
(C.6) +1
 −  = +1 +1 − 2+1 +1

+1 +1
+ −1 −1
22
 ©  ª
+1 +1 − 2  + −1

−1 for  = 1 2  
22
which we re-write as
 © +1 +1 ª
(C.7a) +1
 − 2
+1 +1 − 2+1 +1

+1 +1
+ −1 −1 =
2
 ©  ª
 + 2 +1 +1 − 2  + −1

−1 for  = 1 2  
2
26
The absorbing boundary conditions yield

(C.7b) 0+1 +1


0 = −1+1 +1
1 at  = 0
+1 +1
(C.7c) +1 +1 = −+1 +1
 at  =  + 1

Note that the points  = 0 and  =  + 1 fall outside the domain; these shadow points simply enable us to
obtain the correct boundary condition at the “true” boundaries min = 12 and max = +12 . Note that
this scheme only requires the solution of a tridiagonal system, so the computational work scales linearly with
 and the number of timesteps 
We also need to solve for the probabilities  () and  () at the left and right boundaries. Let

(C.8)  =  ()  =  ()

be the boundary probabilities at timestep . At each time step, after solving for +1
0  +1
1      +1
  +1
+1 ,
+1 +1
we update the values  and 

 © +1 +1 ª
(C.9a) +1
 −  = 1 1 − 0+1 +1
0 + 1 1 − 0 0 
2

 © +1 +1 ª
(C.9b) +1
 −  = − +1 +1 − +1 +1


+ +1 +1 −   
2
Note that this is also second order accurte in time.
To set the initial condition, recall that we adjusted  so that
¡ ¢
(C.10)  = 0 ≡ min + 0 − 12 

for some integer 0 when we set up the grid. The initial condition is simply
½
0 0 0 for  6= 0
(C.11)  = 0  =  0 = 0
1 for  = 0

C.3. Moments. It is easily seen that this scheme conserves probability. The total probabilty is
Z max 
X
(C.12)  +  (   0 ) 0 +  ≡  +  +  
min =1

Summing equation C.7a over and using equations C.9a, C.9b, yields

X
¡ ¢  © +1 +1 ª
(C.13)  +1
 −  = +1 +1 − +1 +1
 − 1+1 +1
1 + 0+1 +1
0
=1
2
 ©  ª
+ +1 +1 −   − 1 1 + 0 0
2
¡ ¢ ¡ ¢
= − +1
 −  − +1 − 

for each timestep . Thus



X 
X
(C.14a) +1
 + +1
 + +1
 =  +  +  
=1 =1
27
so the total probability is the same for all timesteps . Since the total probability starts at 1 at  = 0,


X
(C.15)  +  +  = 1 for all 
=1

Similarliy, the first moment is

Z max 
X
(C.16) min  +  0  (   0 ) 0 + max  ≡ min  +   + max  
min =1

We multiply eq. C.7a by  and sum over . Since +1 − 2 + −1 = 0 this yields


X ¡ ¢
(C.17)  +1
 − 
=1
 © +1 +1
ª
(C.18) =  +1 +1 − 0 1+1 +1
1 − +1 +1 +1
 + 1 0+1 +1
0
2
 © 
ª
 +1 +1 − 0 1 1 − +1   + 1 0 0 
2

Using C.9a, C.9b, this becomes


¡ ¢ X ¡ ¢ ¡ ¢
(C.19) min +1
 − 
 +  +1
 −  + max +1
 − 
=1
 ¡ +1 +1 ¢  ¡ +1 +1 ¢
= 1 1 + 0+1 +1
0 − +1 +1 + +1 +1 
4 4
      ¡  ¢
+ (1 1 + 0 0 ) − +1 +1 +   
4 4

The absorbing boundary conditions C.7b, C.7c ensure that the right hand side is zero, so the first moment
is conserved for each time step :


X 
X
(C.20) min +1
 +  +1
 + max +1
 = min  +   + max  
=1 =1

Hence,


X
(C.21) min  +   + max  = 0 00 = 
=1

Thus the expected value of ̃ ( ) remains exactly  for the numerical solution. This is a key reason we
decided to use a uniform mesh; if we used a non-uniform mesh, a moment preserving finite difference scheme
would yield a linear problem with a matrix of at least five non-zero diagonals instead of three.
28
C.4. Option pricing. We integrate the option prices assuming that the probability 
 is spread
uniformly in each cell . This yields
(C.22a)  (   ) =  −  for   min

X £ ¡ ¢ ¤
(C.22b)  (   ) = 1
2 (min +  − )2 
 + min +  − 12  −  

=+1

+ (max − ) 
 for min + ( − 1)    min + 
(C.22c)  (   ) = 0 for   max
for the call prices, and
(C.23a)  (   ) = 0 for   min
−1
X £ ¡ ¢ ¤
(C.23b)  (   ) = ( − min ) 
 +  − min −  − 12  

=1

[ − min − ( − 1) ]2 
+ 12  for min + ( − 1)    min + 
(C.23c)  (   ) =  −  for   max
for the puts.
Appendix D. Dispersion.
We would like to be able to measure  , at least crudely, in terms of standard deviations from today’s
forward  . The effective forward equation can be written as
£ ¤
(D.1a)  = 12 2 2 2 ( )   for   
(D.1b)  → ( −  ) as  → 
with
¡ ¢ 2
(D.1c) 2 ( ) = 1 + 2 + 2  2  2  Γ( )( −)  2 ( )
To leading order, the solution is a Gaussian density
1 2 
(D.2a)  (  ) ≈ p − 2( −)
2 ( − ) 

with
Z 
1  0
(D.2b) ( ) = 
  ( 0 )
2
Since we are working only to leading order, we can neglect the exponential factor  Γ( )( −)
in ( )
Integrating then yields
Ãp !
1 1 + 2( ) + 2  2  2 ( ) +  + ( )
(D.3a) ( ) = log 
 1+

where
Z 
1  0
(D.3b) ( ) = 
  ( 0 )
29

For  to be roughly  standard deviations above  on the exercise date    we need  to be +   − .
This occurs at the  where
Z 
1  0 2
(D.4a)  ( ) = = sinh  (cosh  +  sinh )
  ( 0 ) 

with
 √
(D.4b) =    − 
2
Similarly,
√ for  to be roughly  standard deviations below  on the exercise date, we need  to be
−   − . This occurs where
Z 
1  0 2
(D.5a) − ( ) = = sinh  (cosh  −  sinh )
  ( 0 ) 
with
 √
(D.5b) =    − 
2

Appendix E. Shifted SABR model.


It is now commonly accepted that interest rates need not be strictly positive. Surely, though, if interest
rates were to become too negative, then increasing amounts of money would be withdrawn from the banking
system, putting a squeeze on deposits. So there should be some barrier to how negative interest rates can
become, but this barrier is probably below zero. Thus, it may make more sense to use a shifted SABR model,

(E.1) ( ) = ( + ) 

for some   0.
For this model, the explicit implied vol formula is
µ ¶
 (1 − ) ( − ) 
(E.2a)   () = 1− 1−
· ·
( + ) − ( + ) ()
⎧ ⎡ ⎤ ⎫
⎪  + ⎪
⎨  (2 − ) (1 − )2 2 log2 
( + ) − ( + )

2 − 3 2 ⎥ 2 ⎬
2
⎢ 1  +  1
1 + ⎣− 24 h i2 + 4  +  ⎦    

⎩  − 24 ⎪

( + )1− − ( + )1−

where
Ãp !
 ( + )1− − ( + )1− 1 − 2 +  2 −  + 
(E.2b) =  () = log 
 1− 1−

Our effective forward equation is


h¡ ¢ 2 i
(E.3a)  = 12 2 2 1 + 2 + 2  2  2  Γ( −)  2 ( ) for   


where
( + )1− − ( + )1− ( + ) − ( + )
(E.3b) ( ) =  Γ= 
 (1 − )  −
30
E.1. Stochastic normal model. As  → 0, the shifted SABR model simplifies to the stochastic
normal model,

(E.4) ( ) = 1

For this model, the explicit implied vol formula is


µ ¶
 © £ ¡ ¢ ¤ ª
(E.5a)   () =  · · 1 + 14  + 1
24 2 − 32  2 2   + · · · 
()
where
Ãp !
 1 − 2 +  2 −  + 
(E.5b)  = ( − )  () = log 
 1−

The effective forward equation is


h³ ´ i
(E.6a)  = 12 2 2 + 2 [ −  ] +  2 [ −  ]2  for   


(E.6b) (  ) → ( −  ) as  → + 

Here there is no barrier, and the placement of min is an independent modeling decision.

REFERENCES

[1] D.T. Breeden and R. H. Litzenberger, Prices of state-contingent claims implicit in option prices, J. Business, 51
(1994), pp. 621-651.
[2] B. Dupire, Pricing with a smile, Risk, Jan. 1994, pp. 18—20.
[3] B. Dupire, Pricing and hedging with smiles, in Mathematics of Derivative Securities, M.A. H. Dempster and S. R. Pliska,
eds., Cambridge University Press, Cambridge, 1997, pp. 103—111
[4] E. DermaN and I. Kani, Riding on a smile, Risk, Feb. 1994, pp. 32—39.
[5] E. Derman and I. Kani, Stochastic implied trees: Arbitrage pricing with stochastic term and strike structure of volatility,
Int J. Theor Appl Finance, 1 (1998), pp. 61—110.
[6] J.M. Harrison and S. Pliska, Martingales and stochastic integrals in the theory of continuous trading, Stoch. Proc.
Appl, 11 (1981), pp. 215-260.
[7] J.M. Harrison and D. Krebs, Martingales and arbitrage in multiperiod securities markets, J. Econ. Theory, 20 (1979),
pp. 381-408.
[8] I. Karatzas, J.P. Lehoczky, S.E. Shreve, and G.L. Xus, Martingale and duality methods for utility maximization in
an incomplete market, SIAM J. Control Optim, 29 (1991), pp. 702—730.
[9] J. Michael Steele, Stochastic Calculus and Financial Applications, Springer, 2001
[10] F. Jamshidean, Libor and swap market models and measures, Fin. Stoch. 1 (1997), pp. 293-330
[11] F. Black, The pricing of commodity contracts, Jour. Pol. Ec., 81 (1976), pp. 167-179.
[12] John C. Hull, Options, Futures, and Other Derivative Securities, Prentice Hall, 1997.
[13] P. Wilmott, Paul Wilmott on Quantitative Finance, John Wiley & Sons, 2000.
[14] Patrick S. Hagan and Diana E. Woodward, Equivalent Black volatilities, App. Math. Finance, 6 (1999), pp. 147—157.
[15] P. S. Hagan, A. Lesniewski and D. E. Woodward, Geometric optics in finance, in preparation.
[16] F. Wan, Mathematical Models and Their Analysis, Harper-Row, 1989.
[17] J. Hull and A. White, The pricing of options on assets with stochastic volatilities, J. of Finance, 42 (1987), pp. 281-300.
[18] S.L. Heston, A closed-form solution for options with stochastic volatility with applications to bond and currency options,
Rev of Fin Studies, 6 (1993), pp. 327-343.
[19] A. Lewis, Option Valuation Under Stochastic Volatility, Financial Press, 2000.
[20] J.P. Fouque, G. Papanicolaou, K.R. Sirclair, Derivatives in Financial Markets with Stochastic Volatility, Cambridge
Univ Press, 2000.
[21] N. A. Berner, Hedging vanna & volga, DKW, private communicatons
[22] J.D. Cole, Perturbation Methods in Applied Mathematics, Ginn-Blaisdell, 1968.
31
[23] J. Kevorkian and J.D. Cole, Perturbation Methods in Applied Mathematics, Springer-Verlag, 1985.
[24] J.F. Clouets, Diffusion Approximation of a Transport Process in Random Media, SIAM J Appl Math, 58 (1998),
pp. 1604—1621.
[25] I. Karatzas and S. Shreve, Brownian Motion and Stochastic Calculus, Springer, 1988.
[26] B. Okdendal, Stochastic Differential Equations, Springer, 1998.
[27] M. Musiela and M. Rutkowski, Martingale Methods in Financial Modelling, Springer, 1998.
[28] G. B. Whitham, Linear and Nonlinear Waves, Wiley, 1974.
[29] J.C. Neu, Thesis, California Institute of Technology, 1978

32

View publication stats

You might also like