Professional Documents
Culture Documents
9、Cumulative Prospect Theory, Option Returns, And the Variance Premium(2019RFS)
9、Cumulative Prospect Theory, Option Returns, And the Variance Premium(2019RFS)
9、Cumulative Prospect Theory, Option Returns, And the Variance Premium(2019RFS)
Joost Driessen
Sebastian Ebert
Frankfurt School of Finance and Management and Tilburg University
Juan M. Londono
International Finance at the Federal Reserve Board of Governors
Oliver G. Spalt
Tilburg University
We develop a tractable equilibrium asset pricing model with cumulative prospect theory
(CPT) preferences. Using GMM on a sample of U.S. equity index option returns, we show
that by introducing a single common probability weighting parameter for both tails of the
return distribution, the CPT model can simultaneously generate the otherwise puzzlingly
low returns on both out-of-the-money put and out-of-the-money call options as well as the
high observed variance premium. In a dynamic setting, probability weighting and time-
varying equity return volatility combine to match the observed time-series pattern of the
variance premium. (JEL C15, G11, G13)
Received May 30, 2017; editorial decision August 10, 2018 by Editor Andrew Karolyi.
We thank editors Robin Greenwood and Andrew Karolyi; two anonymous referees; Nicholas Barberis, Ing-Haw
Cheng (discussant), Shimon Kogan (discussant), Lei Sun (discussant), and David Veredas (discussant); and
seminar participants at Tilburg University, the 2012 EFA meeting in Copenhagen, the 2015 FIRS conference in
Reykjavik, the 2016 Belgian Financial Research Forum in Brussels, the 2016 Research in Behavioral Finance
Conference in Amsterdam, and the 2017 AEA meeting in Chicago for useful comments and suggestions.
Send correspondence to Joost Driessen, Tilburg University, PO Box 90153, 5000 LE, Tilburg, the Netherlands;
telephone: +31-13-4662324. E-mail: j.j.a.g.driessen@uvt.nl.
© The Author(s) 2018. Published by Oxford University Press on behalf of The Society for Financial Studies.
All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
doi:10.1093/rfs/hhy127 Advance Access publication December 7, 2018
1 For example, Coval and Shumway (2001), Driessen and Maenhout (2007), and Eraker (2013) find that the Sharpe
ratio for volatility-selling strategies, such as shorting straddles, which effectively bet on the variance premium, is
at least twice the Sharpe ratio of the underlying equity index (see also Bakshi and Kapadia 2003; Jiang and Tian
2005; Bakshi and Madan 2006; Carr and Wu 2009; Bollerslev, Tauchen, and Zhou 2009; Bollerslev, Gibson, and
Zhou 2011, among others). If the equity premium is a puzzle, then, by this metric, the variance premium is an
even bigger puzzle.
3668
which says that the expected return in excess of the risk-free rate of an asset is
a function of the covariance between the asset’s return and the CRRA pricing
kernel mCRRA . The CRRA pricing kernel mCRRA increases in the marginal utility
of the investor. Equation (1) thus implies that assets that pay off in states where
the investor’s marginal utility is low have high expected returns, because the
covariance term is negative.
Even though the CRRA pricing equation is widely used, it does not explain
observed option returns. Figure 1 illustrates the disconnect between the CRRA
model and the data by plotting the observed average option returns for different
which says that the expected return in excess of the risk-free rate of an asset is
equal to the return implied by the CRRA model, minus an additional covariance
term. This additional covariance term depends on mCPT , which we label the
“CPT pricing kernel.” Equation (2) highlights an advantage of our CPT model
in terms of theory building: it nests the standard CRRA model as a special case,
and thus allows us to directly compare the standard model with our proposed
extension.
Figure 1 shows the fit of our CPT model. The CPT model-implied option
returns (red crosses) line up remarkably well with the actual data for both calls
3669
-20
%
-40
-20
%
-40
-60
Figure 1
Average option returns and model fit
This figure compares the average observed 30-day return computed from all options in our sample with the
returns predicted by our benchmark CPT representative agent model (specification (2) in Table 2) as well as by a
CRRA model—a restricted CPT model with c = 1 and b0 = 0 (specification (1) in Table 2). The sample comprises
daily S&P 500 index option returns from 1996 to March 2016 for various strike prices. Of the 26 options in the
sample, 13 are calls, with deltas ranging from 0.2 to 0.8 (from OTM to ITM). The other 13 options are puts, with
deltas ranging from −0.8 to −0.2 (from ITM to OTM). The option returns are plotted by moneyness (K/S0 ) for
a given delta, separately for calls (panel A) and puts (panel B). We use a discrete approximation of the lognormal
distribution for the S&P 500 index, taking a grid of 200 potential outcomes for the 1-month-ahead equity return
distribution, ranging from −50% to 150%.
(panel A) and puts (panel B), which sharply contrasts with the poor fit of the
CRRA model. Because fitting returns of puts and calls is a sufficient condition
for explaining the variance premium, Figure 1 also implies that the CPT model
does a good job fitting the variance premium.
3670
3671
A Pricing Kernel
2.5
CPT
2 CRRA
1.5
m
0
0.92 0.94 0.96 0.98 1 1.02 1.04 1.06 1.08
Market return (R E )
0
*
-10
-20
0.92 0.94 0.96 0.98 1 1.02 1.04 1.06 1.08
Market return (R E )
Figure 2
The pricing kernel and implied risk aversion
Panel A shows the CRRA and CPT components of the pricing kernel implied by our CPT setting (see Equation
(10)) as a function of the market return, R E . Panel B shows the state-dependent risk aversion, (RiE ), implied by
∂ln(m(RiE )) E
the pricing kernel, m = mCPT +mCRRA . The implied risk aversion is calculated as = − Ri , which is
∂RiE
consistent with the definition of local risk aversion in Ait-Sahalia and Lo (2000). In both panels, the reference
level, WRef , is calculated using a risk-free rate of 0.19%; the level of risk aversion, γ , is fixed at 1; the parameter
driving probability weighting, c1 = c2 = c, is fixed at 0.65; and the scale, b0 , is fixed at 0.95, which is the scale
that yields a 50% contribution of CPT to total utility for this set of parameters. Note that these parameter values
match those in the benchmark GMM estimation (see Table 2).
Barberis, Huang, and Santos (2001), we take into account the probability
weighting feature of prospect theory and analyze its ability to fit option returns
and the variance premium puzzle. Our model is not designed to address
3672
the equity premium puzzle.2 We show that the resultant model yields the
equilibrium pricing Equation (2) above, which allows us to derive the expected
return on the stock market index and on a range of options on that index. We
can then derive the model-implied variance premium from these equilibrium
returns.
A key advantage of the model is that it is parsimonious and very tractable. It
allows us to prove that the model-implied portfolio weights are unique, finite,
and wealth independent. This is relevant because exploding portfolio weights
are a well-known challenge for existing CPT portfolio choice models (see,
2 We leave fully integrating the intertemporal consumption-based model of Barberis, Huang, and Santos (2001)
with our model that features probability weighting to future research. Because dynamic probability weighting
models imply, in general, that investors behave in a time-inconsistent fashion, such models pose significant
modeling challenges that are beyond the scope of this paper.
3673
approach of allowing for probability distortions for both gains and losses
is consistent with findings in the psychology and experimental economics
literatures which suggest probability weighting for both gains and losses is
exhibited by a majority of decision makers (e.g., Tversky and Kahneman 1992;
Abdellaoui 2000; Bruhin, Fehr-Duda, and Epper 2010).
In Section 3, we show that the model can also generate meaningful time
variation in the variance premium. We focus on three potential drivers: time-
varying equity return volatility, time-varying probability weighting, and time-
varying loss aversion. The main finding is that, with time-varying volatility,
3674
3675
1. The Model
1.1 Preferences
Following Barberis, Huang, and Santos (2001) (BHS), the representative
agent’s total utility is given by a weighted sum of expected utility over terminal
wealth WT and CPT utility over the gain or loss XT (Tversky and Kahneman
1992). The final gain or loss is defined as XT = WT −WRef , where WRef denotes
the reference point, which determines whether terminal wealth levels are
perceived as gains (WT ≥ WRef ) or losses (WT < WRef ). Formally, the investor
seeks to maximize
EU [WT ]+b0 CP T [XT ], (3)
where b0 ≥ 0 is a scaling term that governs the relative importance of the
expected utility part, EU [WT ], and the CPT part, CP T [XT ]. Taking a
combination of “traditional” expected utility preferences over terminal wealth
and a “behavioral” prospect theory term over gains and losses—a hybrid model
of traditional and behavioral preferences—is crucial to the results in this paper.
We provide details about why we believe these preferences are economically
and psychologically more attractive than a pure CPT model at the end of this
subsection.
For the expected utility part, we assume that EU [WT ] = E[U (WT )] satisfies
CRRA, that is, the utility function U is given by
1−γ
WT
γ = 1
U (WT ) = 1−γ for , (4)
lnWT γ =1
where γ is the risk aversion coefficient. To define the CPT part, first, let v denote
the value function, which is defined over gains and losses XT . We assume that
v takes the piecewise linear form
XT XT ≥ 0
v(XT ) = for , (5)
λXT XT < 0
3676
where π1 = w − (p1 ) and πN = w + (pN ). The CPT value of the gain or loss XT is
then computed as
N
CP T (XT ) = πi v(XT ,i ). (7)
i=1
3 The lower bound is a technical condition that ensures that decision weights are positive (Rieger and Wang (2006);
Ingersoll (2008)).
3677
assume that WRef = W0 R f ; that is, the investor experiences positive gain-loss
utility if and only if her investment yields more than the risk-free return.
Our baseline preference specification in Equation (3) is a combination of
rational (consumption or terminal wealth) and behavioral (reference-dependent
or gain-loss) elements. The use of such a hybrid model is motivated by BHS,
as well as by Kőszegi and Rabin (2006, 2007), who argue strongly in favor of
a hybrid setup. Two reasons make such a setup attractive. First, in contrast to
a pure CPT model, a hybrid model can capture the fact that investors may also
care about absolute wealth levels. As Kőszegi and Rabin (2006, p. 1138) put it,
4 Barberis and Huang (2008) show that an equilibrium can exist in a CPT setting with multiple investors. Allowing
for multiple investors, instead of a representative investor, would substantially complicate our analysis, which is
why we leave it for future research.
3678
Finally, note that probability distortion is effective only in the CPT part of
preferences, but not in the EUT part. This modeling choice allows us again to
interpret the CPT part as a deviation from the rational EUT benchmark, and
it allows us to quantify the economic importance of that deviation from EUT
rationality for explaining option returns and the variance premium. Applying
probability weighting only to the CPT part of the utility function is in line with
earlier work of De Giorgi and Legg (2012). While we believe setting up our
model in this way is the most natural baseline in light of the existing literature,
alternative ways to setting up the model are conceivable, and we will explore
where the indicator function 1RE <Rf is equal to one if the investment in state
i yields a loss and equal to zero otherwise. The FOC shows that the hybrid
model inherits two attractive properties from the standard CRRA model. First,
the second-order condition (the derivative of the right-hand side of the FOC with
respect to the portfolio weight, set equal to zero) is identical to the one obtained
in a pure CRRA model. Therefore, the second-order condition is satisfied, which
implies a unique and finite optimal portfolio weight. Second, like in the model
of BHS, our choice of b0 ensures that the optimal portfolio weight is wealth
independent. We summarize our results in the following proposition.
3679
Our baseline model above is static. On the one hand, this simplicity is a feature
of our modeling strategy, because it emphasizes that we do not need to resort
to a dynamic argument for generating a variance premium. Instead, our model
shows that probability weighting can generate a variance premium even in a
static setting, which is different from most explanations in the existing literature.
5 Note that log utility of terminal wealth can be rewritten as the sum of log utility of the per-period wealth returns.
The investor thus evaluates per-period returns in both components of the total utility function.
3680
⇐⇒ 0 = E (R E −R f ) mCRRA +mCPT , (10)
where mCRRA and mCPT refer to the random variables that, in state i, yield
−γ
outcome RiE and b̂ pπii 1+(λ−1)1RE <Rf , respectively. Given a risk-free
i
rate, probabilities pi , and preference parameters of the investor, we can use
Equation (10) to determine the equilibrium expected return on the equity index,
E[R E ].
6 Our results are related to, but different from those of Merton (1969) and Samuelson (1969). One way in which
their results are different is that they model consumption, whereas we do not.
3681
If the pricing kernel also prices the risk-free asset, we have Rf = 1/E[mCRRA +
mCPT ] and Equation (11) can be rewritten into Equation (2), which we have
already discussed in the Introduction.8
To gain some intuition on the workings of the model, panel A of Figure 2
illustrates the two components of the pricing kernel. If b̂ = 0, then mCPT = 0,
−γ
and the second covariance term vanishes. Because mCRRA = R E describes
the investor’s marginal utility from equity returns, mCRRA is large when equity
payout is low. Hence, by the return Equation (11), the expected returns of
derivatives whose payoffs correlate with the market (cov(mCRRA ,R d ) > 0) are
lower. As such, calls should have high expected returns while puts should have
low returns.
When b̂ > 0, derivative returns are influenced by a second covariance term
cov(mCPT ,R d ). The prospect theory part of the pricing kernel, mCPT , is larger
when pπii is large and/or when the equity payoff is experienced as a loss (i.e.,
when the equity return is smaller than R f ). To see when pπii is large, note that
πi
pi
is approximately equal to the derivative of the weighting function used in
the computation of πi . Because the probability weighting functions have an
inverse S shape, the derivatives of these functions are large for both small and
large probabilities. The pricing kernel is thus elevated in states of extreme
7 This is a technical assumption to ensure the existence of a representative agent equilibrium. Without this
assumption, the ranking of states i may be different for different portfolios, such that the investor might
endogenously influence the ranking via her portfolio choice. Such “endogenous rankings” would complicate
the analysis substantially, as shown by Ingersoll (2016). Using the terminology from Ingersoll (2016), the above
assumption implies that we are analyzing an “order-constrained” optimization problem, where the ordering is
derived from the equity index. Note that, in equilibrium, where the investor holds only equity and no derivatives,
the rankings of equity states and wealth states coincide.
8 In the empirical analysis, we do not price the risk-free asset. Instead, we take the (observed) risk-free rate as
given and directly focus on excess returns like in Equation (11). Note that, to match any given risk-free rate level,
our model can be generalized by scaling the total utility function (and hence the pricing kernel mCRRA +mCPT )
with an additional free parameter. This would not affect Equation (11) or change any of our results.
3682
equity payouts, both large and small. With respect to losses, the factor (1+(λ−
1)1RE <Rf ) increases the CPT contribution to marginal utility by the constant
i
loss aversion parameter λ if and only if state i is a loss state, thus leading to a
discontinuity in the pricing kernel.
To get an idea of the quantitative effect of adding CPT preferences to a CRRA
framework, panel B of Figure 2 shows the state-dependent risk aversion implied
by the pricing kernel given our benchmark parameter estimates (described in
Section 2). To calculate state-dependent relative-risk aversion, we follow Ait-
Sahalia and Lo (2000) and Chabi-Yo, Garcia, and Renault (2008), who show that
V P ≡ V Q −V P
2 E 2
= qi ln RiE −E Q ln R E − pi ln Ri −E P ln R E ,
i i
(12)
3683
Proposition 4. Let R c (K) and R p (K) denote the return to maturity on a call
and put option with strike K, respectively. If there are no arbitrage opportunities,
we have
E
s0 ∞
VP ≈ v p (K)E P R p (K)−R f dK + v c (K)E P R c (K)−R f dK, (14)
0 s0E
where s0E is the time-0 price of equity and v c (K) and v p (K) are scalars whose
expressions are given in Appendix B.
3684
positive outcomes determine the value of OTM puts and calls, respectively.
Equation (14) thus shows that the pricing of both tails of the return distribution
affects the variance premium.
Note that Equation (14) is “model-free” in the sense that it neither requires
assumptions on preferences nor on the equity return distribution. The only
assumption is the absence of arbitrage opportunities. Hence, the equation holds
both in the data, as well as in the model. We can thus write the difference between
the model-implied variance premium, V PCP T , and the observed variance
premium, V P , as follows
∞
c c
+ v c (K) ECP T R (K) − Ê R (K) dK,
P
s0E
P
where ECP T [R] denotes the expected return implied by the model and Ê[R]
the sample average return for any return R.9 Thus, if the model-implied option
returns match the option returns in the data, then the variance premium is the
same for both the model and the actual data. Pricing the cross-section of OTM
options is therefore a sufficient condition for fitting the variance premium.10
The equation also shows that a model can match the variance premium while, at
the same time, generating completely counterfactual option prices. Thus, fitting
the variance premium by fitting options, as we do in this paper, is a stricter test
than fitting only the variance premium.
Finally, as an important point of reference for our subsequent analysis,
note that the variance premium is zero for the CRRA case if the market
payoff, x E (and hence the terminal wealth of the investor, WT ) have a
lognormal distribution, and if the representative agent has pure CRRA
preferences (b0 = 0) (see, e.g., the results in Samuelson and Merton 1969;
Rubinstein 1976).11
In Appendix C we further decompose the variance premium into upside and
downside variance premiums by conditioning on positive and negative market
9 In small samples, Equation (15) is subject to the sampling error in the estimates of the average returns and the
variance premium.
10 This reasoning comes with two minor caveats. First, Equation (14) is only an approximation. We show in Appendix
B that the approximation error is equal to the difference between central and noncentral second moments. For our
application, this term is numerically small, as we focus on 1-month returns—it is equal to 4.4 in our benchmark
model, while empirically the variance premium is equal to 146.7 (both numbers are in percentage-squared).
Second, empirically, we only have options for a finite number of strike prices.
11 In a setting with discrete trading and lognormal returns, the results in Brennan (1979) imply that CRRA
preferences are the only preferences that generate a zero variance risk premium. Drechsler and Yaron (2011)
show that CRRA preferences generate a zero variance premium also in a multiperiod setting with long-run risk.
3685
returns, respectively. We show that both the upside and downside variance
premium contribute positively to the total variance premium, and that our model
generates a decent fit of these components.
2. Empirical Results
This section explores the ability of the CPT model described in Section 1 to
fit the data. We first show that the model yields a good fit for observed option
returns for a set of plausible input parameter values, which implies that the
2.1 Data
Our main dataset consists of daily closing midquotes of S&P 500 index options
for various deltas and maturities obtained from the OptionMetrics database from
January 1996 to March 2016. OptionMetrics creates a “surface” of interpolated
option prices for fixed levels of the Black-Scholes delta and for fixed maturities
(30 calendar days, 60 days, etc.) for both puts and calls. We focus on 30-day
options because they are most liquid and the ones most frequently used in the
literature. For each day and option delta, and for both put and call options,
we construct the return on buying an option and holding it to maturity, that is,
for 30 calendar days. We thus create overlapping returns. This yields a panel
containing the returns on 26 options across deltas. Of these 26 options, 13 are
calls with deltas ranging from 0.2 to 0.8 (from OTM to in-the-money (ITM)),
and 13 are puts with deltas ranging from −0.8 to −0.2 (from ITM to OTM).
Table 1 presents summary statistics for option returns. The observed values
reported in Figure 1 (black squares) are identical to the numbers in Table
1 except that deltas are replaced by the average moneyness of the options
(K/S0 ) for a given delta. The observed option returns are in line with the
documented stylized facts in the literature (e.g., Coval and Shumway 2001;
Jones 2006; Driessen and Maenhout 2007; Bakshi, Madan, and Panayotov
2010; Constantinides, Jackwerth, and Savov 2013).12
We focus on option returns, instead of option prices, for two reasons. First,
Proposition 4 shows that the variance premium is directly related to expected
option returns, and that matching option returns is a sufficient condition for
matching the variance premium. Second, option returns are directly informative
12 Bakshi, Madan, and Panayotov (2010), Constantinides, Jackwerth, and Savov (2013), and Chaudhuri and
Schroder (2015) also find call returns to be low compared to the underlying index return, although in our
sample call returns seem to be somewhat lower than in previous work. This is due to (1) differences in sample
periods and (2) the fact that, to make more efficient use of data, we use overlapping option returns. Given that
the return distribution of OTM options is skewed, using overlapping returns helps to reduce estimation error
(Broadie, Chernov, and Johannes 2009).
3686
3687
3667–3723
The Review of Financial Studies / v 32 n 9 2019
about the preference parameters. This can be seen easily from Equation (11),
where the expected excess option return depends on the covariance of the
option payoff and the pricing kernel (which, in turn, depends on the preference
parameters). In contrast, option prices depend on this covariance term as well as
on the expected option payoff. Given that the expected option payoff depends
strongly on the assumed equity distribution, if we fitted to option prices rather
than returns, the results would be very sensitive to the assumption on this
underlying return distribution. As shown in Section 2.4.3, where we fit to
option returns, our estimates hardly depend on the assumed distribution for
13 Formally, this risk-neutral variance is given by the integrated variance of continuous-time returns over a given
time period. We work in discrete time. If the continuous-time returns are uncorrelated over time, this integrated
variance equals the variance of the return over a discrete time period.
3688
14 Our main results for the ability of our model with CPT to match the observed variance premium remains robust
for grids ranging from 50 to 500 outcomes and for equity returns for narrower (-25% to 125%) and wider (−75%
to 175%) intervals.
15 The relative contribution of CPT to total is calculated as b0 CP T [XT ]
EU [WT ]+b0 CP T [XT ] (see Equation (3)).
16 Equivalently, one could derive the VP by computing the risk-neutral probabilities and then using Equation (12).
3689
variance premiums for values of b0 ranging from zero, in which case the CPT
contribution to the investor’s utility is also zero, to infinity, in which case
the CPT contribution approaches 100%. The solid line in panel A shows that
varying b0 induces economically significant variation in the variance premium.
As b0 approaches zero, the variance premium approaches zero. The variance
premium increases monotonically with b0 and reaches a level above 300 for
b0 → ∞, which is about twice the level of the variance premium observed in
the data.
The dashed line in panel A of Figure 3 offers first insights into why CPT can
3690
%
200
-50
0
-100
0 20 40 60 80 100 0 20 40 60 80 100
400
0
%2
%
200
-50
0
-100
0.5 0.6 0.7 0.8 0.9 1 0.5 0.6 0.7 0.8 0.9 1
c c
E O (loss aversion) F O (loss aversion)
400
0
%2
200
-50
0
-100
1 2 3 4 1 2 3 4
O O
Figure 3
CPT-implied variance premium and OTM option returns for alternative preference parameter values
This figure shows the annualized CPT-implied variance premium (left) and 30-day returns for out-of-the-money
(OTM) puts and calls (right) as a function of the scale b0 (panels A and B), which determines the contribution
of CPT utility to total utility, probability weighting, c (panels C and D), and loss aversion, λ (panels E and F).
The representative agent’s CPT preferences are defined in Section 1.1. The variance premium is defined as the
difference between the risk-neutral and the expected realized variance of equity returns, as explained in Section
1.4. We report the CPT-implied returns for puts and calls with 0.95 and 1.05 moneyness (K/S ), respectively. In
all panels, we set c = 0.65, b0 = 0.95, and λ = 2.25, apart from the parameter that is varied.
effect dominates. Our results in panel E show that, qualitatively, the put effect
dominates (because the variance premium depends negatively on expected put
and call returns). Quantitatively, however, loss aversion has only a minor effect
on the variance premium.
3691
1 t
μc (θ )−T −1 T1 Rtc
where μE (.), μp (.), and μc (.) are the model-implied equilibrium expected
returns for equity, calls, and puts, respectively, and V P (·) is the model-implied
variance premium, all matched to their empirical counterparts in our sample.
We include 13 calls and 13 puts with different strike prices, so that μp (.), and
μc (.) are both 13×1 vectors. We thus have 28 moment conditions in total. We
find the
θ that minimizes the GMM goal function V F = g(θ) Wg(θ ), where W is
the weighting matrix for the moment conditions. We use a diagonal weighting
matrix which assigns an equal weight of one-fourth to the equity, call, put,
and variance premium moments. We calculate Newey-West corrected standard
errors for the estimated parameters to account for potential autocorrelation
and heteroscedasticity in the residuals due to overlapping returns as we
construct holding-to-maturity returns each day. Thus, we can do inference on
the estimated parameters by exploiting that
θ ∼ N (θ,V /T ),
where V = [G W G]−1 G W SW G[G W G]−1 ,S is the standard Newey-West-
corrected covariance matrix, and G = δg/δθ . The matrix G captures the
sensitivity of the moment conditions to the parameters, as analyzed in
Section 2.3.
2.4.2 GMM baseline results. Table 2 presents our baseline GMM estimates.
As a point of reference, specification (1) sets b0 = 0 and γ = 1, which means that
our model collapses to the pure CRRA case with log utility. Not surprisingly,
the statistics shown in Table 2 reflect the inability of the CRRA model to fit the
option data. The average put option return estimated in this case is −5.86%,
while the first column shows that the return in the data is −30.14%. The CRRA
model also fits calls poorly. While the model predicts a positive 6.50% average
call return, the return in the data is −5.77. The blue x’s in Figure 1, which
we have already discussed in the introduction, show the model-implied returns
from specification (1) for each of our 26 options.
The variance premium is zero for the CRRA specification (1), as it should
be. Finally, the CRRA model slightly undershoots the equity premium, but note
3692
Table 2
GMM estimates in the unconditional setting
(1) (2)
CRRA Benchmark
Observed log-utility CPT
γ (risk aversion) 1 1
b0 (scale) 0 0.95
λ (loss aversion) 1.39
[0.30]
c (probability weighting) 0.65
[0.00]
3693
below. The CPT model matches the variance premium almost perfectly with
a value of 146.33%2 , relative to 146.72%2 in the data. At the same time, our
model also fits the underlying option returns well. For example, the average
call return is −9.23% in the model, compared to −5.77% in the data, and the
average put return in the model is −31.71%, compared to −30.14% in the data.
The red +’s in Figure 1 show the model-implied returns from specification (2)
for each of our 26 options, and confirm that the model is successful at pricing
options across the whole spectrum of strike prices. Finally, the value of the
GMM objective function, VF, which can be thought of as a weighted average
3694
Table 3
GMM estimates in the unconditional setting: Alternative scales
(1) (2) (3) (4) (5)
Observed Benchmark Low b0 High b0 Estimated b0
γ (risk aversion) 1 1 1 1 1
b0 (scale) 0.95 0.30 5 100 0.27
– – – – [0.23]
λ (loss aversion) 1.39 1.60 1.31 1.28 1.63
[0.30] [0.37] [0.18] [0.14] [0.37]
c (probability weighting) 0.65 0.49 0.75 0.78 0.48
[0.00] [0.00] [0.00] [0.00] [0.03]
3695
the estimates in Table 3 imply that, for arguably plausible levels of the
CPT contribution (21% to 99%), the CPT model yields probability weighting
estimates that are in the same region as estimates found in lab studies.
Third, and importantly, across all specifications, we can statistically reject the
hypothesis of no probability weighting. Hence, while the value we pick for b0
has an influence on the probability weighting parameter, it does not change the
central conclusion that probability weighting is the central ingredient to fitting
the data. Finally, specifications (2) to (4) show that the CPT model yields an
excellent fit of the variance premium and the underlying option returns in all
3696
OTM option returns, and that specific assumptions about the curvature of the
utility or gain-loss functions are relatively less important.
3697
Table 4
GMM estimates in the unconditional setting: Asymmetric probability weighting
The results in Table 4 provide further insight into the workings of the model.
What is notable is the fact that both specifications (3) and (4) feature substantial
degrees of probability weighing. Because the VP can be thought of as a portfolio
of strangles, and because strangles generate a positive return when there is
demand for both lottery and insurance, the absence of a preference for lottery-
like payoffs (induced by weighting on the upside) requires a strong desire
for insurance to still induce high enough strangle returns, which implies a
substantial degree of probability weighting in the model. In the absence of a
desire for insurance, an analogous pattern emerges.
Combined, the results in this section show that distorting probabilities in
both tails of the equity return distribution is important. Distorting only one
tail yields substantial pricing errors for individual options. The ability to price
both puts and calls using only one underlying driver, probability weighting, is
a particular advantage of the CPT model.
3698
0
Percent
-20
-40 Observed
0
Percent
-20
-40
-60
Figure 4
Average option returns and model fit for CPT models with asymmetric probability weighting
This figure compares the average observed 30-day return for all options in our sample with the ones predicted by a
CPT representative agent model with asymmetric probability distortion (c1 = c2 ), a model in which only the left-
tail probabilities are subject to probability distortion (c2 = 1), and a model in which only the right-tail probabilities
are subject to probability weighting (c1 = 1); see specifications (2), (3), and (4) in Table 4, respectively. The sample
comprises daily S&P 500 index option returns from 1996 to March 2016 for various strike prices. Of the 26
options in the sample, 13 are calls, with deltas ranging from 0.2 to 0.8 (from OTM to ITM). The other 13 options
are puts, with deltas from −0.8 to −0.2 (from ITM to OTM). Results are presented by moneyness (K/S0 ) for a
given delta, separately for calls (panel A) and puts (panel B).
3699
1500
Benchmark
Low vol (V=0.02)
High vol (V=0.1)
1000
%2
500
Figure 5
CPT-implied variance premium for alternative volatility levels
This figure shows the annualized CPT-implied variance premium as a function of the probability weighting
parameter, c, for three different values of monthly equity volatility: 2% (low), 4.84% (benchmark), and 10%
(high). We set b0 = 0.95 and λ = 2.25.
is important for two reasons: first, the VP fluctuates widely, from about 40%2
in November 2015 to more than 600%2 in January 2009. Second, fitting the
unconditional level of the VP well does not imply that the model does also a
good job fitting each period’s VP.
Our analysis focuses on time variation in equity market volatility, loss
aversion, and the degree of probability weighting. To guide the intuition for
our subsequent results, we start by exploring the impact of the level of equity
volatility on the variance premium in a comparative statics exercise.
17 Over our sample, realized monthly volatility ranges from a low of 1.18% to a high of 23.33%. Its interquartile
range is 3.13%–5.57%.
3700
to more extreme outcomes, which are then overweighted by CPT investors. The
figure also shows that high volatility alone is not sufficient to generate a VP:
only when coupled with probability weighting does higher volatility lead to a
higher VP. In sum, Figure 5 suggests that volatility and probability weighting
interact in generating a VP.
where rt represents daily S&P 500 index returns. All estimated parameters are
statistically significant at the 1% confidence level. Consistent with previous
evidence, we find the daily conditional variance process to be highly persistent
and stationary. To calculate the conditional variance for the next month, we
take the sum of the model-implied predictions of the variance for all days over
the next month (“integrated variance”).18
The existing literature on probability weighting offers little to no guidance
on how to properly specify time variation in probability weighting. We consider
two specifications. The first one makes the end-of-month distortion parameter
ct a function of a dummy Ic,t that equals 1 when the return on the S&P 500 over
month t belongs to the lowest or highest quartile of returns over our sample
period, and zero otherwise:
18 Although we believe that the integrated variance from a GARCH process is a good approximation of the actual
variance, in unreported results (available on request), we show that the results in this section are robust to
alternative conditional variance methods, such as an exponentially weighted moving average or, even more
simply, last month’s realized variance.
3701
up
dn
the S&P 500 belongs either to the top (Ic,t ) or bottom (Ic,t ) quartile of returns
over our sample period:
up
c1,t = cˆ1 +κcup I +κcdn
1 c,t
I dn ,
1 c,t
up
c2,t = cˆ2 +κcup I +κcdn
2 c,t
I dn .
2 c,t
(18)
Here, cˆ1 and cˆ2 are the unconditional estimates of 0.62 and 0.69, respectively,
from Table 4. Finally, closely following BHS, we also allow for time variation
in loss aversion through a slope parameter κλ on a state variable zt capturing
prior gains and losses (see Appendix G for details on the construction).19
19 We have also estimated an alternative specification which models time variation in probability weighting
parameters as a function of the same variable zt that we use to model time-varying loss aversion. We find
results which are in line with the ones presented above. Also, we find similar results when we use dummies that
equal 1 when S&P 500 returns are in the top and/or bottom decile, rather than quartile.
20 Because we compare the conditional moments implied by the model with conditional moments obtained from
regressing equity and option returns on predicting variables, and we do not take unconditional expectations of
these conditional moments, our estimation method no longer fits into the GMM framework. The approach in this
section is thus one of calibration. The goal is to see whether our CPT model can capture the time variation in
risk premiums of equity and options, as implied by a data-driven regression model.
21 The work of Bollerslev, Tauchen, and Zhou (2009) provide evidence that the VP has predictive power for equity
returns.
3702
Table 5
Parameter calibration in the time-varying setting
A. Parameter estimates for time-varying specifications
Parameter (1) (2) (3) (4)
κλ 2.42
κc −0.03
up
κc1 −0.02
dn
κc1 0.06
up
κc2 0.06
dn
κc2 −0.05
where IGF C is a dummy variable that takes the value of 1 for each month between October 2008 and March
2009. For brevity, we only report the two components of the coefficient on V Pt . t-statistics for these coefficients
are reported in brackets.
3703
3704
1000
0
Mar-96 Sep-98 Mar-01 Sep-03 Mar-06 Sep-08 Mar-11 Sep-13 Mar-16
B Equity premium
6
4
%
-1
Mar-96 Sep-98 Mar-01 Sep-03 Mar-06 Sep-08 Mar-11 Sep-13 Mar-16
Figure 6
Observed versus CPT-implied equity and variance premiums
This figure compares the observed equity and variance risk premiums with those implied by a CPT setting in
which the model’s CPT parameters are constant and the stock return volatility is time varying (specification (1)
in Table 5). The variance premium, in panel A, is calculated as the difference between the square of the VIX and
the expected realized variance (see Section 2.4). The expected realized variance is calculated as an in-sample
forecast of the stock return variance using the lagged stock return variance and the square of the VIX. In panel
B, the observed equity premium is calculated as an in-sample forecast of equity returns from two predictors, the
price-dividend ratio and the variance risk premium.
between 0.65 and 0.75. Given that the models in specifications (3) and (4)
are computationally intensive, and given that specifying the functional form
of time-varying probability weights provides additional degrees of freedom to
the model, it is not clear that the small improvements in the value functions
warrant making the probability weighting parameter time varying. In our view,
3705
22 We calculate the observed VP as the difference between the square of the VIX and expected return variance on
the S&P 500 for the next month, following the existing literature. During crisis periods, though, the VIX estimate
may be affected by the poor liquidity of the options it is calculated from. Similarly, expected return variance,
which is based on a predictive regression relating next month’s variance to the current month’s variance and VIX,
is likely estimated with very large confidence bounds during the GFC. These same measurement issues in turn
also affect our estimate of the model-implied VP.
3706
returns. Quantitatively, future returns in our model are not as sensitive to the
VP as they are in the data, especially during the crisis months.
4. Conclusion
This paper investigates the potential of prospect theory to explain the puzzlingly
low average returns on both out-of-the-money put and out-of-the-money call
options on the S&P 500 index, and the level and the dynamics of the variance
premium. We build on the representative investor model of Barberis, Huang,
Appendices
A Proof of Proposition 2
Without loss of generality, we focus on one risky asset (equity).23 The investor’s problem is now
given by
−1
T
max V= max E[U (WT )]+ bt CPT (Xt+1 ),
α0E ,...,αTE−1 α0E ,...,αTE−1 t=0
where, as before, we follow Barberis, Huang, and Santos (2001) and let the weight on the prospect
theory utilities be given by a multiple of current period’s marginal utility of wealth, that is, by
−γ
bt = b̂Wt . The investor’s budget constraint is
23 Not introducing d = 1,...,D derivatives (with portfolio shares α d for derivative d in period t ) simplifies the
t
notation in the following derivation significantly and does not cause any loss of economic insight.
3707
Let us first consider the log-utility case (γ = 1), whose proof is simple and intuitive. In that case,
the investor seeks to maximize
−1
T
−1 E
V = E[log((W0 Tt=0 (αt rt+1 +R f ))]+ bt CPT (Wt αtE rt+1 )
t=0
−1
T −1
T
= E[logW0 + log(αtE rt+1 +R f )]+ b̂Wt−1 CPT (Wt αtE rt+1 ).
t=0 t=0
Because the CPT functional is homogeneous of degree 1, it follows that CPT (Wt αtE r t+1 ) =
−1
T
= logW0 + E[log(αtE rt+1 +R f )]+αtE b̂CPT (rt+1 ) .
t=0
The objective functional is thus additively separable in the decision variables so that the FOC for
each αtE is independent of the αsE , s = t and structurally identical. In particular, the weights αtE
do not depend on the period t itself or on the number of periods remaining—the myopia result.
When returns are iid, we further have that α0E = ... = αTE−1, so that the investor does not rebalance
her portfolio.
Now consider the case of general CRRA utility, which we prove by dynamic programming. Let
τ denote the number of periods that remain from current time t so that T = t +τ . The value of the
agent’s problem at time t is given by
1−γ t+τ −1
Wt+τ
V (τ,Wt ) = max Et + bs CPT (rs+1 αsE Ws )
{α E }t+τ −1
s s=t
1−γ s=t
−1
1−γ t+τ
Wt+τ
= max Et + b̂αsE Ws1−γ CPT (rs+1 ) ,
{αsE }t+τ −1 1−γ
s=t s=t
where, similarly to the log case, we exploit the fact that the CPT preference functional is
homogeneous of degree 1. The problem has become one of maximizing terminal wealth expected
utility plus a sum of flow utilities, which are linear in the decision variable. In particular, letting
1−γ
ct+1 := b̂CPT (rt+1 ), the time-t value of the next period’s flow utility is given by πt+1 := ct+1 αtE Wt .
By the dynamic programming principle, the investor’s problem at time t can be written as
⎡ ⎤
⎢ ⎥
⎢ 1−γ t+τ −1 ⎥
⎢ ⎥
⎢ Wt+τ 1−γ ⎥
V (τ,Wt ) = max Et ⎢ max Et+1 + cs+1 αsE Ws1−γ + αtE ct+1 Wt ⎥,
αtE ⎢{α E }t+τ −1 1−γ ⎥
⎢ s s=t+1 s=t+1 ⎥
⎣ πt+1 (αtE ,Wt ,ct+1 )⎦
V (τ −1,Wt (αtE rt+1 +R f ))
1−γ
t+τ +c W 1−γ
subject to the terminal condition V (0,Wt+τ ) = 1−γ t+τ αt+τ −1 Wt+τ −1 . This means that τ periods
before the model ends, the value of the problem is determined by the problem’s continuation
value next period, V (τ −1,Wt (αtE rt+1 +R f )), and next period’s flow utility, πt+1 (αtE ,Wt ,ct+1 ).
The notation indicates that the continuation value is that of τ −1 periods before the end, where
3708
wealth from the previous period has changed to Wt (αtE rt+1 +R f ). This continuation value includes
the τ −2 expected flow utilities on from two periods in the future. Note that the value of the next
period’s flow utility (i.e., πt+1 ) is deterministic, because it is a function of the current time t portfolio
weight and current period wealth. Continuing,
⎡ t+τ −1
(Wt (αsE rs+1 +R f ))1−γ
V (τ,Wt ) = max Et ⎣ max Et+1 s=t
αtE {αsE }t+τ −1 1−γ
s=t+1
⎡
(Wt (αtE rr+1 +R f ))1−γ
= max Et ⎣αtE ct+1 Wt
1−γ
+ ·
αtE 1−γ
⎡ "1−γ ⎤⎤
!
t+τ −1
t+τ −1 1−γ
cs+1 αsE Ws (1−γ ) ⎦⎦
max Et+1 ⎣ (αsE rs+1 +R f ) + 1−γ
t+τ −1
{αsE }s=t+1 Wt+1
s=t+1 s=t+1
⎡
(αtE rt+1 +R f )1−γ
= max Wt Et ⎣αtE ct+1 +
1−γ
·
αtE 1−γ
⎡ "1−γ ⎤⎤
!
t+τ −1
t+τ −1 1−γ
cs+1 αsE Ws (1−γ ) ⎦⎦
max Et+1 ⎣ (αsE rs+1 +R f ) + 1−γ
.
t+τ −1
{αsE }s=t+1 Wt+1
s=t+1 s=t+1
t+τ −1 1−γ
t+τ −1 !
s
cs+1 αsE Ws (1−γ )
1−γ
= (1−γ ) cs+1 αsE (αs̃E rs̃+1 +R f )1−γ ,
s=t+1 Wt+1 s=t+1 s̃=t+2
t+1
where we use the convention that E f 1−γ
s̃=t+2 (αs̃ r s̃+1 +R ) ≡ 1. Therefore, we obtain
1−γ (αtE rt+1 +R f )1−γ
V (τ,Wt ) = max Wt Et αtE ct+1 + ·
αtE 1−γ
t+τ −1 −1
⎤
!
t+τ !
s
max Et+1 (αsE rs+1 +R f )1−γ +(1−γ ) cs+1 αsE (αs̃E rs̃+1 +R f )1−γ ⎦,
{αsE }t+τ −1
s=t+1 s=t+1 s=t+1 s̃=t+2
3709
and see that the problem is homogeneous in current wealth so that, without loss of generality, we
let Wt ≡ 1. Thus,
⎡
⎢
⎢
⎢ E (α E rt+1 +R f )1−γ
V (τ,Wt ) = max Et ⎢
⎢ αt ct+1 + t · (A1)
αtE ⎢ 1−γ
⎣ πt+1
=:ut+1
t+τ −1 ⎥
⎥
1−γ
−1
=1 =1
t+τ !
s
1−γ
+(1−γ ) cs+1 αsE (αs̃E rs̃+1 +R f )1−γ · Wt
s=t s̃=t+1
−1
ψt (τ ) Wt+τ
1−γ t+τ
⇐⇒ = max Et+1 + cs αsE Ws1−γ
1−γ {α E }t+τ −1 1−γ s=t
s s=t
1−γ t+τ −1
ψt (τ ) Wt+τ
⇐⇒ = max Et+1 + cs αsE Ws1−γ ≡ V (τ,Wt ).
1−γ {α E }t+τ −1 1−γ s=t
s s=t
ψt (τ )
Therefore, we can substitute V (τ,Wt ) in Equation (A1) with to obtain the recursion
1−γ
ψt (τ ) = max Et αtE ct+1 (1−γ )+(αtE rt+1 +R f )1−γ ·ψt+1 (τ −1) .
αtE
We see that when γ = 1, the optimal portfolio weight αtE depends on the investor’s horizon τ, that is,
is nonmyopic. We now show that even when returns are independent (and/or identically distributed),
investment is nonmyopic. When the per-period returns are independent, because ψt+1 (τ −1) is free
of rt+1 , the maximizing αtE satisfies
ct+1 (1−γ )+Et ψt+1 (τ −1) ·Et (αtE rt+1 +R f )−γ rt+1 = 0.
We see now that if, and only if, γ = 1, we can divide by Et [ψt+1 (τ −1)] so that each αtE satisfies
Et (αtE rt+1 +R f )−γ rt+1 = 0,
which says that αtE is horizon independent.24 For all other CRRA utilities, we have shown that the
αtE are time and horizon dependent.
24 Note, however, that the proof for general γ does not nest the proof for γ = 1, because it involves expressions with
1−γ in the denominator.
3710
B Proof of Proposition 4
Without loss of generality, we work in a setting with an infinite number of outcomes for the equity
payoff x E > 0 at time T . As shown by, for example, Bakshi, Kapadia, and Madan (2003) and Carr
and Madan (2001), any continuous and twice-differentiable payoff function G(x E ) can be written
as
∞
G(x E ) = G(x̄)+G (x̄)(x E − x̄)+ G (K)max(x E −K,0)dK
x̄
x̄
for any value of x̄. This equation shows that any payoff function can be replicated by an equity
position and a position in an infinite number of call and put options with different strike prices.
If there are no arbitrage opportunities, a risk-neutral measure exists and the risk-neutral expected
value of the payoff function G can be written as a function of call and put prices
∞ x̄
E Q G(x E ) = G(x̄)+G (x̄)(R f s0E − x̄)+ G (K)R f C(K)dK + G (K)R f P (K)dK, (B.2)
x̄ 0
where we use the risk-neutral pricing equations for equity, s0E = 1f E Q x E , call options
R
with strike K, C(K) = 1f E Q max(x E −K,0) , and put options with strike K, P (K) =
1
R
Q max(K −x E ,0) . Similar to Equation (B.2) for the risk-neutral expectation, we can
Rf E
compute the expected value of the payoff function G under the physical measure P as
E P G(x E ) = G(x̄)+G (x̄)(E[x E ]− x̄) (B.3)
∞ x̄
+ G (K)E max(x E −K,0) dK + G (K)E max(K −x E ,0) dK.
x̄ 0
# # $$2
E
Now choose G(x E ) = ln xE , and by subtracting (B.3) from (B.2) it directly follows that
s0
⎡ ""2 ⎤ ⎡ ""2 ⎤ "
xiE xiE E
E Q⎣
ln ⎦ −E ⎣ ln
P ⎦ = −G (x̄)s0E E[x ] −R f (B.4)
s0E s0E E
s0
∞ # $
E[max(x E −K,0)]
− G (K)C(K) −R f dK
C(K)
x̄
x̄ # $
E[max(K −x E ,0)]
− G (K)P (K) −R f dK,
P (K)
0
# $ # # $$
x 1 2 x
with G (x) = 2ln and G (x) = 1−ln . We consider the approximation at x̄ = s0E ,
s0E x x2 s0E
so that G (x̄) = 0. Defining
R p (K) = E max(K −x E ,0) /P (K)
3711
and
R c (K) = E max(x E −K,0) /C(K),
and defining weights
v c (K) = −G (K)C(K)
and
v p (K) = −G (K)P (K),
then yields the right-hand side of Equation (14).
To see that the right-hand side of Equation (14) is indeed an approximation to the variance
Proposition 5. Let R c (K) and R p (K) denote the return on a call and put option with strike K,
respectively. Define equity index returns as R E = x E /s0E . If there are no arbitrage opportunities,
we can define and calculate downside and upside variance premiums as follows:
E
s0
2 2
V Pd ≡ E Q
ln R E 1R E <1 −E P ln R E 1R E <1 = v p (K)E P R p (K)−R f dK, (C.1)
0
∞
2 2
V Pu ≡ E Q ln R E 1R E >1 −E P ln R E 1R E >1 = v c (K)E P R c (K)−R f dK, (C.2)
s0E
where 1R E >1 is the indicator function which equals 1 if the equity return is above 1, and v c (K)
and v p (K) are scalars whose expressions are given in Appendix B. The total variance premium in
Proposition 4 can then be decomposed as
V P ≈ V Pu +V Pd . (C.3)
3712
Proof. The proof of Proposition 5 follows the same steps used in Appendix B, where we add
indicator functions for the final stock price being below or above the current price. For the sake of
completeness, we go through these steps here. Multiplying Equation (B.1) by the indicator function
1x<x̄ ,
∞ x̄
1x E <x̄ G (K)max(x −K,0)dK +1x E <x̄ G (K)max(K −x E ,0)dK,
E
x̄ 0
(C.5)
x̄
G(x E )1x E <x̄ = G(x̄)1x E <x̄ +G (x̄)(x E − x̄)1x E <x̄ + G (K)max(K −x E ,0)dK. (C.6)
0
If there are no arbitrage opportunities, a risk-neutral measure exists and the risk-neutral expected
value of the payoff function G can be written as a function of call and put prices, as follows:
x̄
E Q G(x E )1x E <x̄ = G(x̄)E Q 1x E <x̄ +G (x̄)E Q (x E − x̄)1x E <x̄ + G (K)R f P (K)dK,
0
(C.7)
where we use the risk-neutral pricing equations for put options with strike K, P (K) =
# # $$2
1
Q max(K −x E ,0) . Now choose G(x E ) = ln x E
f E E and x̄ = s0E , so that G(x̄) = 0 and
R s0
G (x̄) = 0. We then obtain that
⎡ ""2 ⎤ E
s0
xE
E Q ⎣ ln E 1x E <s E ⎦ = G (K)R f P (K)dK, (C.8)
s0 0
0
which implies that the risk-neutral downside second moment is linked to OTM put option prices
only.
In a similar way, we can compute the expected value of the payoff function G under the physical
measure P as (filling in the choices for G and x̄)
⎡ ""2 ⎤ E
s0
xE
E ⎣ ln E
P
1x E <s E ⎦ = G (K)E P max(K −x E ,0) dK. (C.9)
s0 0
0
Because R p (K) = max(K −x E ,0)/P (K), for v p (K) = −G (K)P (K), we obtain
⎡ ""2 ⎤ ⎡ ""2 ⎤
x E x E
V Pd = E Q ⎣ ln E 1x E <s E ⎦ −E P ⎣ ln E 1x E <s E ⎦
s0 0 s0 0
E
s0
= v p (K)E P R p (K)−R f dK. (C.10)
0
3713
∞
= v c (K)E P R c (K)−R f dK. (C.11)
s0E
25 For strike levels beyond the lowest and highest strike in our sample, we assume option returns are equal to the
return on the option with the closest strike, and use Black-Scholes option prices to calculate the weights.
26 These add up to 146.0, which is very close to the estimated variance premium of 146.7 that we use in the paper.
As explained in Section 2.1, this latter estimate of the variance premium is calculated as the difference between
the squared VIX and the expected realized variance. Our Proposition 4 shows how the variance premium can
be calculated using either the difference in variances or using option returns, and we thus find numerically very
similar variance premiums using both methods.
3714
in Section 1.4, in that case the total variance premium is equal to zero. However, it is easy to see
that, in this CRRA case, V Pu < 0 and V Pd > 0. This is because, as shown in Figure 1, in such a
CRRA model, OTM calls have positive expected returns and OTM puts have negative expected
returns, and from Equations (C.10) and (C.11), it then directly follows that V Pu < 0 and V Pd > 0
(because the weights are negative). For example, for the pure CRRA model used in Table 2, we find
V Pu = −17.7 and V Pd = 16.4.27 Hence, the positive upside variance premium that we observe in
the data contradicts substantially with the negative V Pu implied by the standard CRRA-lognormal
model.
Our empirical results for the upside and downside variance premiums for the S&P 500 index are
similar to those of Andersen and Bondarenko (2009) and Held, Kapraun, Omachel, and Thimme
2 2
27 These add up to −1.3, and adding E P ln R E −E Q ln R E gives a total variance premium equal to
zero.
3715
B). In panel A, we see that the sensitivities of b0 and λ are very similar. As discussed in Section
2.4.2, this problem arises because “too little” probability weighting can be compensated for by
giving a higher weight to the CPT part in the utility function. Conversely, even extreme levels of
probability weighting do not matter much if the CPT component enters the investor’s total utility
function with only little weight. Once we fix b0 , panel B shows that we can separately identify λ
and c as their sensitivities differ substantially across moments. In particular, the variance premium
affects the estimator for c much more than the estimator for λ, in line with the comparative statics
analysis in Section 2.3.
rt = μ+εt (E.1)
εt = σ ςt , (E.2)
where the random variable ςt is SKST (0,1,ξ,v) distributed; that is, it follows a standardized
skewed-t distribution with parameters v > 2 (the number of degrees of freedom) and ξ > 0
(a parameter related to skewness). The density of this function is given by
⎧
⎪
⎪ 2
sg ξ (sςt +m)|v
⎨ ξ + ξ1 if ςt < −m/s
f (ςt |ξ,v) = (E.3)
⎪
⎪ 2
sg ξ (sς +m)/ξ |v if ς t −m/s,
⎩ ξ+ 1 t
ξ
where g(·|v) is a symmetric (zero mean and unit variance) Student-t density with v degrees of
freedom defined as
−(v+1)/2
v−1 x2
g(x|v) = √ 2 v 1+ , (E.4)
π (v −2) 2 v −2
# $
1
s 2 (ξ,v) = ξ 2 + 2 −1 −m2 . (E.6)
ξ
The parameters ξ and v are related to the distribution’s skewness and kurtosis, respectively. Because
ξ 2 can be shown to be equal to the ratio of the probability masses above and below the mode, the
distribution has zero skewness when ξ = 1, negative skewness when ξ < 1, and positive skewness
when ξ > 1. The fatness of tails (kurtosis) decreases with the degrees of freedom v, and converges
to a skewed normal distribution as v → ∞. When v → ∞ and ξ = 1, this distribution collapses to a
standard normal distribution.
3716
A Scale b0 estimated
0.3
0.2
0.1
-0.3
EP VP put(0.95) put(1.04) call(0.95) call(1.05)
-0.2
-0.4
-0.6
-0.8 O
c
-1
EP VP put(0.95) put(1.04) call(0.95) call(1.05)
Figure D.1
Sensitivity of CPT parameter estimates to sample moments
This figure shows the standardized sensitivities of the CPT parameter estimates to the following sample moments:
the equity premium, the variance premium, and the put and call option returns. To facilitate the interpretation
of sensitivities, we orthogonalized option returns with respect to the equity and the variance premium. Panel A
shows the sensitivities for the setting in which the scale is also estimated (specification (5) in Table 3), and panel
B shows the sensitivities for the benchmark CPT setting (specification (1) in Table 3). Standardized sensitivities
are calculated following Gentzkow and Shapiro (2015) as the sensitivity of the expected value of each parameter
to a 1-standard-deviation change in the realization of each moment condition.
Table E.1 reports estimates for θ = (μ,σ,ξ,v), using our daily sample of overlapping 30-day log
returns on the S&P 500 over the period running from 1996 to 2010. ξ is well below 1, implying
negative skewness. The ratio of the probability masses above and below the mode is equal to
ξ 2 = 0.59, implying that the degree of negative skewness is economically important. The estimated
degrees of freedom parameter, v, equals 4.58, indicating that the return distribution is not only
left-skewed but also fat-tailed.
Table E.2 reports results for our GMM estimation using these parameter values in the skewed-t
distribution, and using the normal distribution in specifications (2) and (1), respectively.
3717
Table E.1
Parameter estimates of the skewed-t distribution
μ σ ξ v
Estimate 0.0046 0.0533 0.7692 4.5776
This table reports the estimated parameters for the Skewed-t distribution described in Equations (E.1) to (E.6).
Table E.2
GMM estimates in the unconditional setting: Alternative equity return distributions
F Results for a Model with Distortion in Both CRRA and CPT Components
Table F.1 reports GMM estimates for the degree of loss aversion, λ, and probability weighting, c,
when the CRRA and CPT components of total utility are distorted. For comparison, we also show
the results for the benchmark specification (specification (1)), in which only the CPT component of
the utility function is distorted using the probability weighting parameter cCPT . In all specifications,
the reference level, WRef , is assumed equal to the risk-free rate (0.19%), the level of risk aversion,
γ , is fixed at 1. In specifications (1) and (2), the scale, b0 , is fixed at 0.95, which is the scale that
yields a 50% contribution of CPT to total utility in the benchmark specification (specification (1)).
3718
Table F.1
GMM estimates in the unconditional setting: Distortion of the CRRA and CPT components
(1) (2) (3)
Distorted CPT
Benchmark and CRRA Quiggin (82)
b0 (scale) 0.95 0.95 0
γ (risk aversion) 1.00 1.00 1.00
λ (loss aversion) 1.39 1.40
[0.30] [0.23]
cCRRA 0.78 0.78
[0.00] [0.00]
cCPT 0.65 0.78
In specification (3), the scale, b0 , is fixed at 0, a setting which is effectively the rank-dependent
utility model from Quiggin (1982). p-values of the hypotheses that λ = 1 and c = 1, respectively,
are reported in brackets. The p-values are calculated using Newey-West standard errors. For each
specification, we also report the CPT contribution to total utility, the model-implied variance and
equity risk premiums, the average model-implied call and put returns, the average model-implied
OTM call and put returns, the value function (VF) evaluated at the optimum, and the level of
risk aversion (RA) that would be needed in the no-CPT case to obtain the model-implied equity
premium. These parameters are estimated by applying GMM to returns on the S&P 500 index and
on S&P 500 call and put options with different strikes and 30 days to maturity, and to the equity
and variance premium. The GMM weighting matrix gives an equal weight of one-fourth to the
equity, call, put, and variance premium moments. The sample period runs from January 1996 to
March 2016.
where λ̂ is the baseline loss aversion parameter, z¯t is the average of zt , and κλ is the sensitivity of
loss aversion to recently realized gains and losses as measured by zt . Following BHS, we assume
that the representative agent compares the current price of the risky asset with a benchmark level,
where the benchmark level of the price responds sluggishly to changes in the value of equity. This
sluggishness is motivated in BHS (p. 14) as follows: “when the stock price moves up by a lot,
the benchmark level also moves up, but by less. Conversely, if the stock price falls sharply, the
3719
RE
zt+1 = η(zt E
)+(1−η),
Rt+1
where R E is a fixed parameter calibrated to guarantee that half of the time the agent has prior gains
(RtE > R E ) and the rest of the time she has prior losses (RtE < R E ). The parameter η measures
the degree of sluggishness, which can be interpreted as the agent’s memory horizon. We assume
η = 0.5, which implies that the half life of the representative agent’s memory is 1 month.
Abdellaoui, M. 2000. Parameter-free elicitation of utility and probability weighting functions. Management
Science 46:1497–512.
Ait-Sahalia, Y., and A. Lo. 2000. Nonparametric risk management and implied risk aversion. Journal of
Econometrics 94:9–51.
Andersen, T. G., and O. Bondarenko. 2009. Dissecting the market pricing of return volatility. Working Paper.
Ang, A., G. Bekaert, and J. Liu. 2005. Why stocks may disappoint. Journal of Financial Economics 76:471–508.
Bakshi, G., and N. Kapadia. 2003. Delta-hedged gains and the negative market volatility risk premium. Review
of Financial Studies 16:527–66.
Bakshi, G., N. Kapadia, and D. Madan. 2003. Stock return characteristics, skew laws, and the differential pricing
of individual equity options. Review of Financial Studies 16:101–43.
Bakshi, G., and D. Madan. 2006. A theory of volatility spreads. Management Science 52:1945–56.
Bakshi, G., D. Madan, and G. Panayotov. 2010. Returns of claims on the upside and the viability of U-shaped
pricing kernels. Journal of Financial Economics 97:130–54.
Barberis. 2013. Thirty years of prospect theory in economics: A review and assessment. Journal of Economic
Perspectives 27:173–96.
Barberis, N., and M. Huang. 2008. Stocks as lotteries: The implications of probability weighting for security
prices. American Economic Review 98:2066–100.
Barberis, N., M. Huang, and T. Santos. 2001. Prospect theory and asset prices. Quarterly Journal of Economics
116:1–53.
Barberis, N., A. Mukherjee, and B. Wang. 2016. Prospect theory and stock returns: An empirical test. The Review
of Financial Studies 29:3068–107.
Barberis, N., and W. Xiong. 2009. What drives the disposition effect? An analysis of a long-standing preference-
based explanation. Journal of Finance 64:751–84.
Bauwens, L., and S. Laurent. 2005. A new class of multivariate skew densities, with applications to GARCH
models. Journal of Business and Economic Statistics 3:346–54.
Bekaert, G., and E. Engstrom. 2016. Asset return dynamics under bad environment-good environment
fundamentals. Journal of Political Economy 125:713–60.
Benartzi, S., and R. Thaler. 1995. Myopic loss aversion and the equity premium puzzle. Quarterly Journal of
Economics 110:75–92.
Bernard, C., and M. Ghossoub. 2010. Static portfolio choice under cumulative prospect theory. Mathematics and
Financial Economics 2:277–306.
3720
Bollerslev, T., M. Gibson, and H. Zhou. 2011. Dynamic estimation of volatility risk premiums and investor risk
aversion from option-implied and realized volatilities. Journal of Econometrics 160:235–45.
Bollerslev, T., G. Tauchen, and H. Zhou. 2009. Expected stock returns and variance risk premiums. Review of
Financial Studies 22:4463–92.
Bondarenko, O. 2003. Statistical arbitrage and securities prices. Review of Financial Studies 16:875–919.
———. 2014. Why are puts so expensive? Quarterly Journal of Finance 4:1–50.
Bordalo, P., N. Gennaioli, and A. Shleifer. 2012. Salience theory of choice under risk. Quarterly Journal of
Economics 127:1243–85.
Boyer, B. H., and K. Vorkink. 2014. Stock options as lotteries. Journal of Finance 69:1485–527.
Brennan, M. J. 1979. The pricing of contingent claims in discrete time models. Journal of Finance 34:53–68.
Broadie, M., M. Chernov, and M. Johannes. 2009. Understanding index option returns. Review of Financial
Studies 22:4493–529.
Bruhin, A., H. Fehr-Duda, and T. Epper. 2010. Risk and rationality: Uncovering heterogeneity in probability
distortion. Econometrica 78:1375–412.
Camerer, C. F., and T. Ho. 1994. Violations of the betweenness axiom and nonlinearity in probability. Journal
of Risk and Uncertainty 8:167–96.
Carr, P., and D. Madan. 2001. Optimal positioning in derivative securities. Quantitative Finance 1:19–37.
Carr, P., and L. Wu. 2009. Variance risk premiums. Review of Financial Studies 22:1311–41.
Chabi-Yo, F. 2012. Pricing kernels with stochastic skewness and volatility risk. Management Science 58:624–40.
Chabi-Yo, F., R. Garcia, and E. Renault. 2008. State dependence can explain the riskaversion puzzle. Review of
Financial Studies 21:973–1011.
Chaudhuri, R., and M. Schroder. 2015. Monotonicity of the stochastic discount factor and expected option returns.
Review of Financial Studies 28:1462–505.
Christoffersen, P., S. Heston, and K. Jacobs. 2013. Capturing Option Anomalies with a Variance-Dependent
Pricing Kernel, Review of Financial Studies 26:1962–2006.
Conrad, J., R. F. Dittmar, and E. Ghysels. 2012. Ex ante skewness and expected stock returns. Journal of Finance
68:85–124.
Constantinides, G. M., J. C. Jackwerth, and A. Savov. 2013. The puzzle of index option returns. Review of Asset
Pricing Studies 3:229–57.
Coval, J., and T. Shumway. 2001. Expected option returns. Journal of Finance 56:983–1009.
Cuesdeanu, H., and J. C. Jackwerth. 2018. The pricing kernel puzzle: Survey and outlook. Annals of Finance
14:289–329.
De Giorgi, E., and S. Legg. 2012. Dynamic portfolio choice and asset pricing with narrow framing and probability
weighting. Journal of Economic Dynamics and Control 36:951–72.
Drechsler, I. 2013. Time-varying fear, and asset prices. Journal of Finance 68:1843–89.
Drechsler, I., and A. Yaron. 2011. What’s vol got to do with it? Review of Financial Studies 24:1–45.
Driessen, J., and P. Maenhout. 2007. An empirical portfolio perspective on option pricing anomalies. Review of
Finance 11:561–603.
3721
Du, D. 2011. General equilibrium pricing of options with habit formation and event risks. Journal of Financial
Econometrics 99:400–26.
Ebert, S., and P. Strack. 2015. Until the bitter end: On prospect theory in a dynamic context. American Economic
Review 105:1618–33.
Epper, T., and H. Fehr-Duda. 2012. Probability and risk: Foundations and economic implications of probability-
dependent risk preferences. Annual Review of Economics 4:567–93.
Eraker, B. 2013. The performance of model based option trading strategies. Review of Derivatives Research
16:1–23.
Eraker, B., and M. Ready. 2015. Do investors overpay for stocks with lottery-like payoffs? An examination of
Feunou, B., M. R. Jahan-Parvar, and C. Okou. 2017. Downside variance risk premium. Journal of Financial
Econometrics 16:341–83.
Gabaix, X. 2012. Variable rare disasters: An exactly solved framework for ten puzzles in macro-finance. Quarterly
Journal of Economics 127:645–700.
Gentzkow, M., and J. M. Shapiro. 2015. Measuring the sensitivity of parameter estimates to sample statistics.
Working Paper.
Gollier, C. 2011. Portfolio choices and asset prices: The comparative statics of ambiguity aversion. Review of
Economic Studies 78:1329–44.
Green, C., and B.-H. Hwang. 2012. Initial public offerings as lotteries: Skewness preference and first-day returns.
Management Science 58:432–44.
Held, M., J. Kapraun, M. Omachel, and J. Thimme. 2018. Up and downside variance risk premia in global equity
markets. Working Paper.
Ilmanen, A. 2012. Do financial markets reward buying or selling insurance and lottery tickets? Financial Analysts
Journal 68:26–36.
——–. 2016. Cumulative prospect theory, aggregation, and pricing. Critical Finance Review 5:305–50.
Ingersoll, J. E., and L. J. Jin. 2013. Realization utility with reference-dependent preferences. Review of Financial
Studies 26:723–67.
Jackwerth, J. 2000. Recovering risk aversion from option prices and realized returns. Review of Financial Studies
13:433–67.
Jiang, G., and Y. Tian. 2005. Model-free implied volatility and its information content. Review of Financial
Studies 18:1305–42.
Jin, L. J. 2015. A speculative asset pricing model of financial instability. Working Paper.
Jones, C. 2006. A nonlinear factor analysis of S&P 500 index option returns. Journal of Finance 61:2325–63.
Kahneman, D. 2011. Thinking, fast and slow. New York: Farrar, Straus and Giroux.
Kahneman, D., and A. Tversky. 1979. Prospect theory: An analysis of decision under risk. Econometrica 47:
263–91.
Kilic, M., and I. Shaliastovich. 2017. Good and Bad Variance Premia and Expected Returns. Management Science.
Advance Access published January 16, 2018.
Kliger, D., and O. Levy. 2009. Theories of Choice under Risk: Insights from Financial Markets. Journal of
Economic Behavior and Organization 71:330–46.
3722
Kőszegi, B., and M. Rabin. 2006. A Model of Reference-Dependent Preferences. Quarterly Journal of Economics
121:1133–66.
Kumar, A. 2009. Who Gambles in the Stock Market? Journal of Finance 64:1889–933.
Lambert, P., S. Laurent, and D. Veredas. 2012. Testing conditional asymmetry. A residual-based approach.
Journal of Economic Dynamics and Control 36:1229–47.
Londono, J. 2014. The Variance Risk Premium around the World. Working Paper, Federal Reserve Board.
Machina, M. J. 1989. Dynamic Consistency and Non-Expected Utility Models of Choice under Uncertainty.
Merton, R. C. 1969. Lifetime Portfolio Selection under Uncertainty: The Continuous-Time Case. Review of
Economics and Statistics 51:247–57.
Mitton, T., and K. Vorkink. 2007. Equilibrium Underdiversification and the Preference for Skewness. Review of
Financial Studies 20:1255–88.
Polkovnichenko, V. 2005. Household Portfolio Diversification: A Case for Rank-Dependent Preferences. Review
of Financial Studies 18:1467–502.
Polkovnichenko, V., and F. Zhao. 2013. Probability Weighting Functions Implied in Option Prices. Journal of
Financial Economics 107:580–609.
Quiggin, J. 1982. A theory of anticipated utility. Journal of Economic Behavior and Organization 3:323–43.
Rieger, M. O., and M. Wang. 2006. Cumulative prospect theory and the St. Petersburg paradox. Economic Theory
28:665–79.
Rosenberg, J., and R. F. Engle. 2002. Empirical pricing kernels. Journal of Finanical Economics 64:341–72.
Rubinstein, M. 1976. The valuation of uncertain income streams and the pricing of options. Bell Journal of
Economics 7:407–25.
Samuelson, P. A. 1969. Lifetime Portfolio Selection By Dynamic Stochastic Programming. Review of Economics
and Statistics 51:239–46.
Samuelson, P. A., and R. C. Merton. 1969. A Complete Model of Warrant Pricing that Maximizes Utility.
Industrial Management Review 10:17–46.
Santa-Clara, P., and A. Saretto. 2009. Option strategies: Good deals and margin calls. Journal of Financial
Markets 12:391–417.
Schneider, C., and O. Spalt. 2016. Conglomerate Investment, Skewness, and the CEO Long-Shot Bias. Journal
of Finance 71:635–71.
Schreindorfer, D. 2014. Tails, fears, and equilibrium option prices. Working Paper.
Shaliastovich, I. 2015. Learning, confidence, and option prices. Journal of Econometrics 187:18–42.
Snowberg, E., and J. Wolfers. 2010. Explaining the favorite-long shot bias: Is it risk-love or misperceptions?
Journal of Political Economy 118:723–46.
Song, Z., and D. Xiu. 2016. A tale of two option markets: Pricing kernels and volatility risk. Journal of
Econometrics 190:176–96.
Tversky, A., and D. Kahneman. 1992. Advances in prospect theory, cumulative representation of uncertainty.
Journal of Risk and Uncertainty 5:297–323.
Wu, G., and R. Gonzalez. 1996. Curvature of the probability weighting function. Management Science 42:1676–
90.
Ziegler, A. 2007. Why does implied risk aversion smile? Review of Financial Studies 20:859–904.
3723