Professional Documents
Culture Documents
SSRN Id4668578
SSRN Id4668578
Lecture Notes on
International Finance
March 7, 2024
This is a preliminary draft of my lecture notes on international finance, with an emphasis on the theoretical
foundation. Its preliminary nature is reflected in the fact that some sections are still missing or incomplete.
I will continue to update the notes and welcome comments and suggestions.
I am grateful to my students for their help and feedback. I am also grateful to my colleagues, coauthors, and
friends for their comments and suggestions. All errors are mine.
0 Preface 6
I Introduction 13
4 Convenience Yields 88
4.A An Illustrative Model 90
4.B Exchange Rate Accounting 94
4.C Measuring the Convenience Yields 100
4.D Connecting the Short Term with the Long Term 104
4.E Discussions (TODO) 107
Bibliography 327
link between fiscal deficits and exchange rates. We will discuss this
link in detail in Chapter 6. This period also witnessed the rise of the
U.S. dollar as the dominant international currency and the transfer
of international financial center from the U.K. to the U.S. We will dis-
cuss the financial hegemon and the architecture of the international
monetary system in Chapter 7.
20
100
0
-20
-40 10-1
-60
-80 10-2
-100
-120 10-3
-140
-160 10-4
1910 1915 1920 1925 1930 1935 1940 1910 1915 1920 1925 1930 1935 1940
10
-10
-20
-30
-40
1975 1980 1985 1990 1995 2000 2005 2010 2015 2020
0 Germany
investors, the Fed, and the rest of the world (ROW). We observe three
different regimes. First, from the mid 1970s until the mid 1990s, the
U.S. domestic agents, including the financial sector such as banks,
insurance companies, pensions and the household sector, absorbed
a significant fraction of the new debt issuance. They were the major
players in the Treasury market, whereas the flows and the positions
of the Fed and the ROW were relatively small.
There was a distinct shift in the mid-to-late 1990s, when the ROW
became the main Treasury buyers while the domestic investors be-
came net sellers. In this second regime through 2015, the ROW was
by far the most important buyer of U.S. Treasurys. Their inflows
were particularly pronounced during the global financial crisis,
which is consistent with the flight to safety observed in safe dollar
bond prices. The Fed also started playing a more active role since
the global financial crisis, as it undertook quantitative easing and
expanded its balance sheet.
Since 2015, we have entered a third regime characterized by
weaker demand from the ROW. In fact, contrary to its countercyclical
purchases of the U.S. Treasurys in previous decades, it became a net
seller in the Treasury market during the Covid-19 crisis. In compari-
son, the Fed became a much more active buyer during the Covid-19
crisis. Bond purchases from the ROW re-emerged after the Covid-19
crisis, albeit smaller in magnitude than prior to 2015. In Chapters
7 and 9, we will discuss different approaches to understanding the
cross-border asset allocations and the implications for the capital
flows and exchange rates.
The goal of this note is to develop a coherent theoretical frame-
work for understanding all these phenomena in international macroe-
0.05
0.00
conomics and finance. That said, I do not mean to give the impres-
sion that all is known. Instead, much remains to be understood, and
a lot is left out by this note. I hope this note offers curious readers a
starting point for further exploration.
This note is organized by three parts: (1) introduction, (2) under-
standing exchange rates, and (3) understanding quantities and flows.
In introduction, I start by building a benchmark two-country model
of international business cycles that is a natural generalization of the
standard closed-economy real business cycle model. This benchmark
model offers a clear starting point, although it falls short in terms
of explaining many stylized facts in the exchange rate and quantity
data. Then, the part on exchange rates develops a series of extensions
and modifications of the benchmark model that can help us under-
stand the exchange rate dynamics. Building on this part, the part on
quantities and flows further develops frameworks that can help us
speak to the quantity dynamics.
In the course of working on this note, I have benefited from many
discussions with my colleagues, coauthors, and friends. I am thank-
ful to Xuning Ding, Jialu Sun, and Yudan Ying for their research
assistance for related research projects. I am grateful for Ravi Jagan-
nathan, Robert Korajczyk, and the William and Mary Breen Fund for
funding support. All errors in this note are mine.
I assume that the readers have taken an entry-level graduate
course in macroeconomics or asset pricing. I also assume that the
readers are familiar with the basic tools in dynamic programming
and (for a few sections) stochastic calculus. I will try to keep the
math simple, and give priority to the development of intuitions.
So, welcome to the field of international macroeconomics and fi-
nance! I very much hope you enjoy the journey. As many of you have
seen closed-economy macroeconomics and finance, let me end with
some quotes from More Is Different: Broken symmetry and the nature
of the hierarchical structure of science [Anderson, 1972]. Despite ad-
dressing a totally different research field in science, this article may
provide us with some useful insights for our uneven path to transi-
tion from one-country, closed-economy economics to international,
open-economy economics.
The reductionist hypothesis may still be a topic for controversy among philoso-
phers, but among the great majority of active scientists I think it is accepted
without question. The workings of our minds and bodies, and of all the an-
imate or inanimate matter of which we have any detailed knowledge, are
assumed to be controlled by the same set of fundamental laws, which except
under certain extreme conditions we feel we know pretty well...
The main fallacy in this kind of thinking is that the reductionist hypothesis
does not by any means imply a "constructionist" one: The ability to reduce
everything to simple fundamental laws does not imply the ability to start
from those laws and reconstruct the universe. In fact, the more the elementary
particle physicists tell us about the nature of the fundamental laws, the less
relevance they seem to have to the very real problems of the rest of science,
much less to those of society. The constructionist hypothesis breaks down
when confronted with the twin difficulties of scale and complexity. The behav-
ior of large and complex aggregates of elementary particles, it turns out, is not
to be understood in terms of a simple extrapolation of the properties of a few
particles. Instead, at each level of complexity entirely new properties appear,
and the understanding of the new behaviors requires research which I think is
as fundamental in its nature as any other. That is, it seems to me that one may
array the sciences roughly linearly in a hierarchy, according to the idea: The
elementary entities of science X obey the laws of science Y.
But this hierarchy does not imply that science X is "just applied Y." At each
stage entirely new laws, concepts, and generalizations are necessary, requiring
inspiration and creativity to just as great a degree as in the previous one.
Psychology is not applied biology, nor is biology applied chemistry...
The arrogance of the particle physicist and his intensive research may be
behind us (the discoverer of the positron said "the rest is chemistry"), but
we have yet to recover from that of some molecular biologists, who seem de-
termined to try to reduce everything about the human organism to "only"
chemistry, from the common cold and all mental disease to the religious in-
stinct. Surely there are more levels of organization between human ethology
and DNA than there are between DNA and quantum electrodynamics, and
each level can require a whole new conceptual structure.
In closing, I offer two examples from economics of what I hope to have said.
Marx said that quantitative differences become qualitative ones, but a dialogue
in Paris in the 1920’s sums it up even more clearly:
FITZGERALD: The rich are different from us.
HEMINGWAY: Yes, they have more money.
Introduction
Summary
• Trading in the risk-free bond market implies that the currency’s expected return is determined
by its risk premium:
1
Et [rxt+1 ] = −covt (m∗t+1 , ∆et+1 ) − vart (∆et+1 ),
2
and that the exchange rate level is determined by the expectation of future interest rates and
currency risk premia:
∞ ∞
et = ∑ Et [rt+ j − rt∗+ j ] − ∑ Et [rpt+ j ] + ē.
j =0 j =0
• When markets are complete, we can further simplify the exchange rate movement and cur-
rency expected return as
niques.
1.A.1 Households
We consider an endowment economy with two countries, home and
foreign. Each country has a continuum of identical households.
Equivalently, we can think of a representative household in each
country. Each country has a unique type of goods, labeled as home
goods and foreign goods. The home households receive an endow-
ment of yt units of the home goods, and the foreign households
receive an endowment of y∗t units of the foreign goods.
Home and foreign households derive utility from consuming both
countries’ goods. Let c H denote the home households’ consumption
of the home goods, and c F denote the home households’ consump-
tion of the foreign goods. The home households’ aggregate consump-
tion is a Cobb-Douglas aggregation of the home and foreign goods:
which means that c H,t units of the home goods and c F,t units of the
foreign goods can be combined to form ct units of the home consump-
tion bundle, which is the unit of account for the home households’
aggregate consumption. In our analysis, we carefully distinguish
between the home consumption bundle and the home goods. Simi-
larly, foreign households also combine the home and foreign goods
into the foreign consumption bundle, which we will specify later. Some
papers also refer to the home and foreign goods as the intermediate
goods, and the home and foreign consumption bundles as the final
goods. These terms will carry more meaning when we discuss the
production side of the economy such as in Section 3.C.
The expected lifetime utility for home households is
∞
" #
E0 ∑ δt u(ct ) ,
t =0
where u(ct ) is a generic utility function that takes the home house-
holds’ aggregate consumption ct as the argument, and δ is the sub-
jective discount factor. We usually impose some regularity conditions
on the utility function, so that it is twice continuously differentiable,
increasing in ct , and concave.
The log real exchange rate et is defined as the log exchange ratio
between the home consumption bundle and the foreign consumption
bundle. That is, to afford 1 unit of the home consumption, we need
to spend exp(et ) units of the foreign consumption. Our convention
δt u′ (ct ) − ζ t = 0,
Et [−ζ t + ζ t+1 exp(rt )] = 0,
Et [−ζ t exp(−et ) + ζ t+1 exp(rt∗ − et+1 )] = 0.
which describe how the households trade off consumption and sav-
ing intertemporally. For example, we can express the first Euler equa-
tion as
def u ′ ( c t +1 )
exp(mt+1 ) = δ ,
u′ (ct )
∞
" #
pm
t = Et ∑ exp(mt,t+k ) pt+k yt+k .
k =1
∞
"
E0 ∑ δt u((c H,t )α (cF,t )1−α )
t =1
∞
#
+ ∑ ζ t ( pt yt + bH,t−1 exp(rt−1 ) + bF,t−1 exp(rt∗−1 − et ) − pt c H,t − exp(−et ) p∗t cF,t − bH,t − bF,t exp(−et )) .
t =1
which implies that the consumption ratio c F,t /c H,t between foreign
and home goods is a function of the relative prices:
pt α c F,t
∗ = . (1.3)
pt exp(−et ) 1 − α c H,t
1−α 1
ct = pt c H,t + pt c H,t = pt c H,t ,
α α
α 1
ct = p∗t c F,t exp(−et ) + p∗t c F,t exp(−et ) = p∗ c F,t exp(−et ),
1−α 1−α t
which implies that the expenditure shares for home and foreign
goods are constant under the Cobb-Douglas aggregator and a general
utility function:
goods. As such, the terms of trade are simply the ratio of prices
between home and foreign goods:
def pt
exp(tott ) = ∗ ,
pt exp(−et )
Likewise, we can derive the same problem from the foreign house-
holds’ perspective, and obtain
1− α −α
1−α 1−α
p∗t = α exp(−tott ) , pt exp(et ) = (1 − α) exp(−tott ) ,
α α
which implies that the real exchange rate and the terms of trade are
tightly connected:
et = (2α − 1)tott .
Given α > 1/2, the foreign consumption bundle leans towards the
foreign goods, whereas the home consumption bundle leans towards
the home goods.
The foreign households’ budget constraint is
p∗t y∗t + b∗H,t−1 exp(rt−1 + et ) + b∗F,t−1 exp(rt∗−1 ) = c∗t + b∗H,t exp(et ) + b∗F,t
The Lagrangian is
∞ ∞
" #
E0 ∑δ t
u(c∗t ) + ∑ ζ t∗ ( p∗t y∗t + exp(rt−1 + et )b∗H,t−1 + b∗F,t−1 exp(rt∗−1 ) − c∗t − b∗H,t exp(et ) − b∗F,t ) .
t =1 t =1
plus two auxiliary variables exp(mt+1 ) and exp(m∗t+1 ) that denote the
home and foreign SDFs:
u ′ ( c t +1 )
def
exp(mt+1 ) = δ ,
u′ (ct )
′ ∗ )
def u ( c
exp(m∗t+1 ) = δ ′ t+∗ 1 .
u (ct )
yt = c H,t + c∗H,t ,
y∗t = c F,t + c∗F,t ,
0 = b H,t + b∗H,t ,
0 = bF,t + b∗F,t ,
1 = Et [exp(mt+1 ) exp(rt )] ,
1 = Et [exp(mt+1 ) exp(−∆et+1 + rt∗ )] ,
1 = Et exp(m∗t+1 ) exp(rt∗ ) ,
1 = Et exp(m∗t+1 ) exp(∆et+1 + rt ) .
1 = Et [exp(mt+1 + rt )] , (1.6)
1= Et [exp(mt+1 − ∆et+1 + rt∗ )] , (1.7)
Et exp(m∗t+1 + rt∗ ) ,
1= (1.8)
Et exp(m∗t+1 + ∆et+1 + rt ) .
1= (1.9)
This excess return captures the return of the strategy that takes a
long position on the home bond and a short position on the foreign
bond, which is exposed to the bilateral exchange rate movement.
Then, the Euler equations imply the following result:
Proposition 1.1. The home currency’s expected log excess return is deter-
mined by the covariance between the log foreign SDF and log exchange rate
movement minus a Jensen’s term:
1
Et [rxt+1 ] = −covt (m∗t+1 , ∆et+1 ) − vart (∆et+1 ). (1.10)
2
We interpret the right-hand side of Eq. (1.10) as the currency risk
premium in log, since the covariance term describes how the exchange
rate comoves with the foreign investors’ SDF. If the covariance is
positive, the home currency tends to appreciate when the foreign
households’ marginal utility is high. Then, the home currency is
a good hedge from the perspective of the foreign households and
should earn a low risk premium.
The right-hand side of Eq. (1.10) also has a variance term − 12 vart (∆et+1 )
that we refer to as the Jensen’s term. To understand this term, it is
useful to consider the risk premium expression for the level of the
currency return:
which implies that the foreign currency has to offer a higher risk
premium if it tends to depreciate when the home SDF is high.
If we combine the expected return expressions from the home and
foreign households’ perspectives, we obtain the following expression:
Recall that all these restrictions are derived from the Euler equations
for holding the risk-free bonds. So, allowing the households to freely
trade in the risk-free bond markets imposes restrictions not only
between the currency expected return and the covariance between
the SDF and the exchange rate movement from each country’s per-
spective, but also between the home and the foreign perspectives.
Specifically, the home and foreign investors need to agree on the
equilibrium currency risk premium after the second-order adjustment
by the Jensen’s term.
Using this as a more natural reference point for the exchange rate
level, we can decompose today’s exchange rate level in the following
way [Campbell and Clarida, 1987, Froot and Ramadorai, 2005]:
Proposition 1.2. The exchange rate level is equal to the sum of expected
future interest rate differentials, the sum of expected future currency risk
premia, and the long-run exchange rate level:
∞ ∞
et = ∑ Et [rt+ j − rt∗+ j ] − ∑ Et [rpt+ j ] + ē. (1.14)
j =0 j =0
∞ ∞
(Et − Et−1 )[et ] = ∑ (Et − Et−1 )[rt+ j − rt∗+ j ] − ∑ (Et − Et−1 )[rpt+ j ],
j =0 j =0
∞
" #
E0 ∑δ t
(πu(ct ) + (1 − π )u(c∗t )) ,
t =0
∞ ∞ ∞
" #
E0 ∑δ t
(πu(ct ) + (1 − π )u(c∗t )) + ∑ ζ H,t (yt − c H,t − c∗H,t ) + ∑ ζ F,t (y∗t − c F,t − c∗F,t ) ,
t =1 t =1 t =1
α α (1 − α )1− α
ct = α
k yα (y∗t )1−α ,
1− α t t
((1 − α) + αk t ) (α + (1 − α)k t )
α α (1 − α )1− α
c∗t = 1− α
y1−α (y∗t )α ,
α t
((1 − α) + αk t ) ( α + (1 − α ) k t )
where k t can be solved via the following implicit equation:
!α
∗
π u′ (ct ) α c F,t 1−α c H,t
= 1.
1 − π u′ (c∗t ) 1 − α c H,t c∗F,t
1.6
1.4
1.2
0.8
0.6
0.4
0.2
0
0 0.5 1 1.5 2 2.5 3 3.5 4
0.3
0.2
0.1
-0.1
-0.2
-0.3
0 0.5 1 1.5 2 2.5 3 3.5 4
1 1 1
Et [rxt+1 ] = −covt (m∗t+1 , ∆et+1 ) − vart (∆et+1 ) = vart (m∗t+1 ) − vart (mt+1 ),
2 2 2
(1.17)
απ (1 − α)(1 − π )
c H,t = yt , c∗H,t = yt ,
απ + (1 − α)(1 − π ) απ + (1 − α)(1 − π )
(1 − α ) π α (1 − π )
c F,t = y∗ , c∗F,t = y∗ .
(1 − α ) π + α (1 − π ) t (1 − α ) π + α (1 − π ) t
2.5
1.5
0.5
-0.5
0 0.5 1 1.5 2 2.5 3 3.5 4
mt+1 = log δ − ∆ log ct+1 = log δ − α∆ log yt+1 − (1 − α)∆ log y∗t+1 ,
m∗t+1 = log δ − ∆ log c∗t+1 = log δ − (1 − α)∆ log yt+1 − α∆ log y∗t+1 ,
1 = Et [exp(mt+1 + rt )] ,
1 = Et [exp(mt+1 − ∆et+1 + rt∗ )] ,
1 = Et exp(m∗t+1 + rt∗ ) ,
1 = Et exp(m∗t+1 + ∆et+1 + rt ) ,
More generally, if we use u(c H,t , c F,t ) and u∗ (c∗F,t , c∗H,t ) to denote the
home and foreign households’ utilities as functions of their goods-
specific consumption, we can express the within-period solutions
as
∂u(c H,t , c F,t )/∂c H,t 2α−1
exp(et ) = ,
∂u(c H,t , c F,t )/∂c F,t
!2α−1
∂u∗ (c∗F,t , c H,t )/∂c∗F,t
exp(−et ) = .
∂u∗ (c∗F,t , c H,t )/∂c∗H,t
This set of equations relates the exchange rate level to how the
households trade off consumption expenditures between different
goods within the same period, which gives rise to the goods market view
of exchange rates. Absent frictions in international trade and con-
sumption, a stronger home currency in real terms must correspond to
a lower consumption share in the home goods. Conversely, if house-
holds experience shocks to how they value home vs. foreign goods,
this preference shock also impacts the equilibrium exchange rate.
Moreover, if the economy is stationary and we log-linearize around
a symmetric steady state with ȳ = ȳ∗ , c̄ = c̄∗ , and ē = 0, we can com-
bine the within-period solutions with the market clearing conditions
in the goods market to obtain the following result:
which depends on the relative endowments (log yt − log y∗t ), plus the
asset market wedge τtAM and the goods market wedge τtGM .
Any model we consider in this note can be characterized by its
stance on the supply of consumption goods and the two wedges.
While the goods market issues are equally important for understand-
ing the international macro and financial outcomes, we focus more
on the asset market view in this note.
Summary
• The stochastic properties of the exchange rate movement—cyclicality, volatility, expected re-
turn, and comovement—exhibit puzzling patterns that are difficult to explain by the complete-
market benchmark model.
• Combining bond and currency forward positions generates non-zero risk-free returns. These
near-arbitrage spreads also require significant modifications on the benchmark model.
• International portfolio quantities and capital flows exhibit strong asymmetry and cyclicality,
which also impose important restrictions on how we specify the model.
var (∆et+1 ) = var (mt+1 − m∗t+1 ) = var (mt+1 ) + var (m∗t+1 ) − 2cov(mt+1 , m∗t+1 ),
which states that the exchange rate variance should be equal to the
variance of the SDF differential, which can be decomposed to the
sum of SDF variances minus two times the SDF covariance.
By the Hansen and Jagannathan [1991] bound, we can derive a
lower bound on the SDF volatility based on the Sharpe ratio of any
risky asset with return r̃:
E[r̃ − r f ]
std(mt+1 ) ≥ exp(−r f ) .
std(r̃ )
For example, if the stock market has a Sharpe ratio of 0.6 per annum,
then, the SDF’s volatility must be at least roughly 60% per annum.
There may be other trading strategies that produce higher Sharpe
ratios, implying an even higher SDF volatility.
Brandt, Cochrane, and Santa-Clara [2006] compares the implied
SDF volatility with the exchange rate movement’s volatility. Between
developed economies, the exchange rate movement’s volatility is
roughly 10% per annum, which is much lower than the SDF volatility.
Then, if markets are complete, either the exchange rate movement’s
volatility is anomalously low, or the correlation between home and
foreign SDFs is incredibly high, which implies a high degree of inter-
national risk-sharing.
This puzzle triggers two responses in the subsequent literature.
First, many papers stay within the complete-market benchmark but
posit mechanisms that generate a high correlation between home
and foreign SDFs. Chapter 3 develops this idea in detail. Second,
other papers deviate from the complete-market benchmark and study
alternative settings in which Eq. (1.16) do not hold. Chapters 4 and 5
develop some of these ideas in detail.
against the U.S. dollar, whereas some currencies like Japanese Yen
and Swiss Francs have persistently low excess returns against the
U.S. dollar.
Moreover, currencies’ average returns are also correlated with their
interest rates. Low-return currencies like Japanese Yen and Swiss
Francs tend to have low interest rates, while high-return currencies
like Australian dollar and New Zealand dollar tend to have high
interest rates. Let rxti +1 denote the log excess return of currency i
against the dollar. Then, if we regress the realized currency return on
ex-ante interest rates in the cross-section of currencies:
The simplest model that the data do not reject features a cross-sectional asym-
metry that makes some currencies pay permanently higher expected returns
than others, and larger time series variation in expected returns on the U.S.
dollar than on other currencies.
rxti = αi + βi f t + εit .
E[rxti +1 ] ∝ βi .
This alignment of risk exposures and risk premia, central to all asset
pricing, is also confirmed in the currency market.
The exchange rate puzzles above are all related to the stochastic
properties of exchange rates. A more recent literature examines re-
turns from currency market strategies that have no risks, at least in
theory. The fact that these risk-free strategies earn non-zero returns
suggests violations of the no-arbitrage condition, and could therefore
shed light on the frictions and preferences faced by the investors in
the international financial markets.
The covered interest rate parity (CIP) describes the relation be-
tween the forward premium and the interest rate differential, which
is given by Eq. (2.1), reproduced below,
This parity holds in frictionless markets because the investors can get
access to the home risk-free rate by either investing in the home bond
with a return of rt$ , or investing in the foreign bond and hedging the
exchange rate risk using the forward contract, which has a net return
of rti − f ti + eit . In the absence of arbitrage, these two risk-free rates in
home currency units must be equal.
Traditionally, this is regarded as an identity since its deviation,
known as the CIP deviation, will be quickly exploited by the arbi-
trageurs who actively trade in both the currency forward market and
the interest rate market. Indeed, if we use the home and foreign Li-
bor rates as proxies for the home and foreign interest rates rt and rt∗ ,
this condition holds quite tightly before the Global Financial Crisis.
However, Du, Tepper, and Verdelhan [2018b] shows that, after the cri-
sis, this condition breaks down and there is non-trivial variations in
both the time series and the cross-section of the Libor CIP deviation.
We use the Libor CIP basis to measure the severity of the Libor CIP
deviation, defined as
def
xtLibor = libort$ − liborti + ( f ti − eit ).
Figure 2.1 plots the Libor CIP basis between the U.S. and the average
developed countries, which is very close to 0 before the Global Finan-
cial Crisis, but has been persistently negative since then. A negative
Libor CIP basis means that the dollar Libor rate is below the foreign
Libor rate after adjusting for the exchange rate risk, which makes the
dollar cheaper to borrow.
Similarly, the Treasury CIP basis is defined by using the Treasury
yields as proxies for the interest rates rt and rt∗ :
def
xtTreas = rt$ − rti + ( f ti − eit ).
Figure 2.1 also plots the Treasury CIP basis. Unlike the Libor CIP
basis, the Treasury CIP basis has been negative both before and after
the Global Financial Crisis [Du, Im, and Schreger, 2018a, Jiang, Kr-
ishnamurthy, and Lustig, 2018]. In other words, the U.S. government
has been able to borrow at a lower interest rate than the foreign gov-
ernments, after adjusting for the exchange rate risk. The Treasury CIP
deviation will play a central role when we discuss bond convenience
yields in Chapter 4.
20
-20
-40
-60
-80
-100
-120
-140
1990 1995 2000 2005 2010 2015
We start with the U.S. net external imbalances vis-à-vis the rest of the
world. We consider the equity-like, riskier asset classes and the debt-
like, safer asset classes separately. Let us define the net risky position
as the sum of portfolio equity assets and foreign direct investment
(FDI) assets minus portfolio equity liabilities and FDI liabilities, and
define the net safe position as the reserve assets plus debt assets
minus debt liabilities [Gourinchas, Rey, and Sauzet, 2019].
Figure 2.2 shows the U.S. net risky and safe positions normalized
by GDP, from 1970 to 2021. We make three observations. First, on
average, the U.S. net risky position is positive and the U.S. net safe
position is negative. In other words, the U.S. holds more foreign risky
assets than it issues to the rest of the world, which implies that the
U.S. is a net lender of risky assets. On the other hand, the U.S. holds
less foreign safe assets than it issues, which implies that the U.S. is a
net borrower of safe assets.
Second, these net asset positions have notable cyclical properties.
Risky assets tend to depreciate during recessions, whereas safe assets
tend to appreciate. As a result, during the 2008 Global Financial Cri-
sis and the early-2000 stock market crash, the U.S. net risky position
contracted while its net safe position expanded.
Third, the U.S. net safe position has been widening steadily since
mid-1980s. Currently (end of 2021), it is about 50% of the U.S. GDP,
with a large fraction attributable to the U.S. government debt. In
10
-10
-20
-30
-40
-50
-60
1980 1990 2000 2010 2020
comparison, the U.S. net risky position widened before 2008, but it
has been contracting since 2010. This contraction is driven by a rising
foreign demand for the U.S. equity assets.
We also sum up the U.S. net risky and safe positions to obtain
the U.S. net foreign assets (NFA). This is equal to the difference be-
tween the foreign assets held by the U.S. investors (i.e., the U.S. ex-
ternal assets) and the U.S. assets held by the foreign investors (i.e.,
the U.S. external liabilities). Figure 2.3 shows that the U.S. NFA has
been widening steadily since the 1980s, and the widening tends to
accelerate during recessions. The widening in the past decade is also
notable: the U.S. NFA expanded from −30% in early 2010s to −80%
right now, at a speed never seen before.
As we will see in Chapter 7, these imbalances reflect a fundamen-
tal asymmetry between the U.S. and the rest of the world. They play
an important role in our understanding of the international monetary
system, as they are closely tied to the global risk-sharing arrange-
ments and the unique position of the U.S. and the dollar.
-10
-20
-30
-40
-50
-60
-70
-80
1980 1990 2000 2010 2020
the capital gains or losses. The U.S. has been earning consistently
positive payoffs from the 1960s to the mid 1990s, in early 2000s, and
in some years after 2008. Moreover, these magnitudes have been in-
creasing since the 1960s, possibly reflecting increases in international
portfolio positions and developments in international financial assets.
-10
-20
-30
1970 1980 1990 2000 2010 2020
U.S. equity assets they held. As this difference in return is not driven
by the U.S. paying off foreign investors in recessions, this potentially
represents a loss of exorbitant privilege as opposed to the exorbitant
duty [Jiang, Richmond, and Zhang, 2022c, Atkeson, Heathcote, and
Perri, 2022].
There is also a trade side that parallels the fluctuations on the
portfolio side. As we will see in Section 9.A, the trade balance (TB)
and the portfolio returns from the income balance (IB) and capital
gains (CG) jointly determine the evolution of the national accounts
captured by the net foreign assets (NFA):
strategy even when the underlying asset offers the same return.
Panel A: Description of the Example Table 2.1: Example of Cash Flows and
the IRR Computation
Foreign Investors Home Investors
Year Holdings Yield Holdings Yield
1 1 5% 10 5%
2 10 1% 1 1%
Time-Weighted Return 2.98% 2.98%
Panel B: IRRs of Cash Flows
Foreign Investors Home Investors
Year Holdings Cash Holdings Cash
Flows Flows
1 1 −1 10 −10
2 10 −8.95 1 9.5
3 10.10 10.10 1.01 1.01
Dollar-Weighted Return 1.37% 4.65%
total government debt held by public/GDP ratio and the U.S. exter-
nal liability/GDP. The first series represents the amount of debt owed
by the U.S. government to the U.S. private sector and the rest of the
world, whereas the second series represents the amount of debt owed
by the U.S. to the rest of the world. Since the 2008 global financial
crisis, both the U.S. government and the U.S. as a whole have been
borrowing a lot more.
This comovement shows that international finance and govern-
ment finance are closely intertwined. On the one hand, the foreign
investors, private and official, have been financing a large portion of
the U.S. external liabilities. To understand why this is the case, it is
important to understand why they find the U.S. government debt
particularly desirable. On the other hand, a large portion of the U.S.
government debt has been financed by foreign investors. To evalu-
ate the U.S. fiscal sustainability, it is also important to take a global
perspective. We will consider these issues in detail in Chapters 7 and
8.
There are many more empirical patterns that this note does not have
space to cover. For example, home bias is a salient pattern in inter-
national asset allocation. Lewis [1999] provides a summary of the
literature. More recently, micro-data shed new light on cross-border
portfolio positions and global capital allocations. Florez-Orrego,
Maggiori, Schreger, Sun, and Tinda [2023] provide a review of this
literature.
40
20
-20
1980 1990 2000 2010 2020
Understanding the
Exchange Rates
Summary
• We first investigate the role of currency risk premium rpt , which drives both the currency
expected return:
def 1
Et [rxt+1 ] = rpt = −covt (m∗t+1 , ∆et+1 ) − vart (∆et+1 ),
2
and the exchange rate level:
∞ ∞
et = ∑ Et [rt+ j − rt∗+ j ] − ∑ Et [rpt+ j ] + ē.
j =0 j =0
• The currency risk premium is related to the currency’s loadings on the risk factors, which give
rise to a factor structure in currency returns:
We first adopt a no-arbitrage approach to organize the currency risk premium and the corre-
sponding factor structure.
• A natural question is what drives the time-series and cross-sectional variations in the currency
risk premium, which is to be answered by general equilibrium models.
• Finally, we consider the long-term currency risk premia. When markets are complete and the
exchange rate is stationary, the long-run UIP condition holds:
To make sense of the exchange rate puzzles and other salient pat-
terns in international financial markets, we first enrich the risk pre-
mium in the baseline model. To be clear, as long as the households’
utility function u(c) is non-linear, the currency risk premium is al-
Et [∆et+1 ] = rt∗ − rt ,
Et [rxt+1 ] = 0.
Our first approach to modeling the currency risk premia starts di-
rectly from the SDFs. Because this approach only relies on investors’
ability to correctly price the assets based on the covariances between
asset returns and their SDFs, it is known as the no-arbitrage ap-
proach. This approach asks the following question: what properties
where εw i
t+1 is a global shock and ε t+1 is a country-specific shock.
Both shocks are i.i.d. standard normal. When the number of coun-
tries N is large, the country-specific shocks average to zero:
N q
∑ γzit εit+1 = 0.
i =1
∆ei/$ i $
t +1 = m t +1 − m t +1
√ √ q q q
i $ i $ w w i i $ $
= − µt − µt − δ − δ z t ε t +1 − γzt ε t+1 − γzt ε t+1 .
Since the currency excess return is equal to the exchange rate move-
ment plus the interest rate differential, which is known ex-ante, the
innovation in the currency excess return is also equal to the innova-
tion in the exchange rate movement:
1 √ i √ $ 2 w 1
vart (∆ei/$
t +1 ) = δ − δ zt + γ(zit + z$t ), (3.3)
2 2
which shows that the variance zw t of the global shock and the vari-
i $
ances zt and zt of the country-specific shocks both contribute to the
exchange rate variance. Moreover, for two countries whose SDFs have
very different loadings on the global shock, their bilateral exchange
rate movement loads heavily on the global shock and is therefore
more volatile.
Next, we consider the currency risk premium. Plugging the SDF
dynamics into Eq. (1.17), we obtain
def
h i 1 1
rpi/$
t = E t rx i/$ $ i
t+1 = 2 vart ( mt+1 ) − 2 vart ( mt+1 )
(3.4)
1 $ 1 $
= δ − δi z w t + γ z t − z i
t .
2 2
The first term is easy to interpret: if two countries have different
loadings on the global shock, i.e., |δ$ − δi | > 0, then, their bilateral
exchange rate movement is exposed to the global shock and requires
a higher magnitude of risk premium.
The second term is increasing in the difference between the volatil-
ities of the two countries’ idiosyncratic shocks, i.e., z$t − zit , which
is more subtle. The readers might wonder why idiosyncratic risks
appear in the risk premium. For example, from the U.S. households’
perspective, should they not care about the foreign country’s idiosyn-
cratic risk zit since it is uncorrelated with the U.S. SDF?
exp(ei/$ $
" #
i
i/$ def t+1 + rt ) − exp(rt )
Et [exp(rxt+1 )] = Et 1 + ,
exp(rt$ )
Et [exp(rxti/$ $ i/$
+1 )] = exp(− covt ( mt+1 , ∆et+1 ))
√ √ √
$
= exp δ$ δ $ − δi z w t + γzt ,
√ √ √
which does not contain the zit term. The term δ$ ( δ$ − δi )zw t
comes from the U.S. SDF’s and the exchange rate movement’s load-
$
ings on the global shock εw t+1 , and the term γzt comes from their
$
loadings on the U.S.-specific shock ε t+1 . As the global shock and
the U.S.-specific shock both affect the U.S. SDF, they are priced risk
factors from the U.S. perspective.1 1
That said, the bilateral exchange rates
Now, let us revisit the currency risk premium in log, which is between different foreign currencies
and the dollar only have heterogeneous
related to the levels by loadings on the global shock, but not
on the U.S.-specific shock. Therefore,
1 the U.S.-specific shock leads to iden-
rpi/$
t = log Et [exp(rxti/$ i/$
+1 )] − 2 vart ( ∆et+1 ). tical risk premia across all foreign
currencies, which is γz$t .
Similarly, if we take the foreign perspective,
1 i/$
rp$/i
t = log Et [exp(rxt$/i
+1 )] − 2 vart ( ∆et+1 ).
Combine these two expressions and use the definition that implies
the log risk premia are symmetric, i.e., rpi/$
t = −rp$/i
t . We have
1
rpi/$
t = (log Et [exp(rxti/$ $/i
+1 )] − log Et [exp(rxt+1 )]).
2
Therefore, the log risk premium can be thought of as an average
between the level risk premia from the U.S. and the foreign perspec-
tives, which care about the U.S.-specific and the foreign-specific risks,
respectively.
It is also worth noting that the currency risk premia in level are
not symmetric:
Because of this property, we use the currency risk premium in log for
most parts of this paper.
These results connect currency risk premia and realized currency
excess returns to the factor structure of the SDFs. We make three
more observations. First, for a country whose loading δi on the global
shock is high, its currency tends to appreciate during global reces-
sions (i.e., rxti/$ w
+1 increases when ε t+1 declines), and have a lower
currency risk premium rpi/$ t . Therefore, currencies with higher load-
ing δi are considered as safe currencies, and currencies with lower
loading δi are considered as risky currencies.
Second, as for the country-specific shocks, even though they are
idiosyncratic from the aggregate perspective and there is full risk-
sharing between countries, they still affect currency returns and risk
premia. This feature can arise from general equilibrium models if the
local households have a consumption bias towards home goods, so
that their consumption allocation and marginal utility tilt towards
the local shock despite full risk-sharing. Specifically, when zit is high,
country i’s SDF is volatile, and its local households require a higher
risk premium to hold foreign currencies.
Third, if two countries have high and identical idiosyncratic
volatilities, i.e., zit = z$t ≫ 0, then, their bilateral exchange rate move-
ment is volatile, while the currency risk premium can remain low.
Conversely, we can generate a high currency risk premium without
a volatile exchange rate movement. To do so, notice that the global
risk loadings enter the exchange rate variance expression (3.3) as
√ √ 2
δi − δ$ , whereas they enter the currency risk premium ex-
√ √ √ √
pression (3.4) as δ$ − δi = δi − δ $ δi + δ$ . Then, high
loadings on the global shock, i.e., δ$ ≈ δi ≫ 0, can generate a high
currency risk premium without a volatile exchange rate movement.
√ √ 2
For example, if δ$ = 10 and δi = 9, δi − δ$ = 0.03 implies a
low exchange rate volatility, whereas δ$ − δi = 1 implies a high cur-
rency risk premium. As such, this no-arbitrage framework provides
a flexible way to match different patterns in exchange rate volatilities
and currency risk premia.
duced below,
h i
1 = Et exp(mit+1 + rti ) ,
and under joint normality, the interest rate in country i can be ex-
pressed as
1 1 1
rti = −Et [mit+1 ] − vart (mit+1 ) = µit − δi zw − γzit .
2 2 t 2
1 $ 1
rti − rt$ = ( δ − δi ) z w i $
t − ( γ − χ )( zt − zt ), (3.5)
2 2
which implies that the interest rate differential also comoves with
the global and country-specific risk premia. A country with a higher
loading δi on the global shock has a lower interest rate, in addition
to having a lower risk premium. Moreover, if 21 γ − χ > 0, which
means that the interest rate is not set high enough to offset the pre-
cautionary saving motive, then, a country with a higher idiosyncratic
volatility zit also has a lower interest rate, in addition to having a
lower risk premium.
This positive relationship between the interest rate differential and
the currency risk premium holds both across countries and over time.
We next explore these two dimensions in detail.
i∈ H i∈ L
In other words, the currency trading strategy that buys high interest
rate currencies and sells low interest rate currencies earns a positive
risk premium on average.
Third, each currency’s loading on the carry trade factor informs us
about its systematic risk exposure. Consider the conditional regres-
sion
rxti/$ i i i
+1 = αt + β t · hmlt+1 + ε t+1 ,
and that the risk exposure determines the asset’s risk premium, i.e.,
assuming that the factors are tradable. In the context of the currency
market, the carry trade captures a risk factor: f t+1 = hmlt+1 . The
carry trade beta βit captures the risk exposure of currency i, and it re-
lates to the currency’s risk premium via Eq. (3.4), which is increasing
in βit .
eign currency i:
1 $ 1 $
rpi/$
t = δ − δi z w t + γ z t − z i
t ,
2 2
1 1
rti − rt$ = (δ$ − δi )zw i $
t − ( γ − χ )( zt − zt ).
2 2
In this subsection, we turn to the time-series variation in the vari-
ances of the SDF shocks zw i
t and zt , which drive variations in both the
currency risk premium and the interest rate differential over time.
As we will show in the following proposition, this common variation
gives rise to predictive power of the interest rate differential for the
currency excess return. Specifically, we compute the slope coefficient
in the forecasting regression of the currency excess return on the
interest rate differential:
rxti/$ i i i $ i
+1 = α + φ ( r t − r t ) + ε t +1 ,
def cov(rxti/$ i $
+1 , r t − r t )
φi = .
var (rti − rt$ )
Proposition 3.2. The slope coefficient from the regression of the currency
excess return on the interest rate differential is given by
1 1 i $ 1 $ i 2 w
2 γ ( 2 γ − χ )( var ( zt ) + var ( zt )) + 4 ( δ − δ ) var ( zt )
φi = $
,
(χ − 12 γ)2 (var (zit ) + var (zt )) + 14 (δ$ − δi )2 var (zwt )
∆ei/$ i i i $ i
t +1 = α + ψ ( r t − r t ) + ε t +1 .
def cov(∆ei/$ i $
t +1 , r t − r t ) χ( 12 γ − χ)(var (zit ) + var (z$t ))
ψi = = ,
var (rti − rt$ ) (χ − 12 γ)2 (var (zit ) + var (z$t )) + 14 (δ$ − δi )2 var (zw
t )
def
h i 1 $ 1 $
rpi/$
t = Et rxti/$ i
+1 = 2 δ − δ z t +
w
γ zt − zit ,
2
1 i − δ$ times the
the interest rate differential only loads on ζ − 2 δ
global shock’s volatility zwt :
1 1
rti − rt$ = ζ− i
δ −δ $
zw
t + χ− γ zit − z$t ,
2 2
which does not fully reflect this global component of the currency
risk premium.
As a result, the expected exchange rate movement also loads on
the global SDF volatility zw
t . This might be a more realistic speci-
fication since the global SDF volatility also induces exchange rate
predictability by the interest rate differential:
2
cov(∆ei/$ i $ χ 21 γ − χ (var (zit ) + var (z$t )) + ζ 12 − ζ δi − δ$ var (zw t )
t +1 , r t − r t )
ψi = = 2 2 > 0.
var (rti − rt$ )
2
χ − 12 γ (var (zit ) + var (z$t )) + ζ − 12 δi − δ$ var (zw
t )
used in practice to reflect the overall strength of the U.S. dollar. The
dollar’s base factor ∆e$t is the unweighted version of the dollar index.
By the law of large numbers, the SDF structure in Section 3.A
implies
√ √ q w q
∆eit+1 = mit+1 − mt+1 = −χ zit − zt − δi − δ zw
t ε t + 1 − γzit εit+1 ,
which measures currency i’s exposures to the global shock εw t+1 and
i
to its own idiosyncratic shock ε t+1 , in a way that is invariant to the
choice of the base currency j. As such, if we want to study the varia-
tion in the dollar, the dollar’s base factor might be more informative
than the dollar’s bilateral exchange rate against an individual foreign
currency.
For example, the conditional and the unconditional variances of
the currency base factor can be expressed as
√ √ 2 w
vart (∆eit+1 ) = δ zt + γzit ,
δi −
√ √ 2
var (∆eit+1 ) = χ2 var (zit ) + δi − δ θ w + γθ,
which uncovers this currency’s idiosyncratic variance zit and its expo-
sure to the global shock δi .
Moreover, Lustig and Richmond [2020] also consider the following
regression:
i/j
∆et+1 = α + φi/j ∆eit+1 + ε t+1 .
where the common shock εw t+1 is tied to the currency carry trade re-
g
turn and the other common shock ε t+1 is tied to the U.S. dollar. The
loadings are time-varying, with the loadings on the dollar shock and
on the idiosyncratic shock both tied to a local state variable zit and the
loading on the carry trade shock tied to a global state variable zw t .
We can also bring the SDF dynamics closer to the data. If the
global and the country-specific shocks load on a vector of observable
variables, we can model the SDF process using a VAR representation.
For example, suppose the state vector xti follows a VAR(1) process:
and the SDF innovations depend on the same VAR shocks εit+1 :
1
mit+1 = −rti − (λit )′ λit − (λit )′ εit+1 .
2
q
The vector of loadings λit generalizes the scalar loadings δi zw t
q
and γzit in the previous case, and, for tractability, they are linear in
the state vector xti :
∆ei/$ i $
t +1 = m t +1 − m t +1 .
In this way, we can jointly model exchange rates and asset prices
using the same set of observable variables [Chernov and Creal, 2023].
We will discuss this class of affine models in detail in Section 8.E.
3.B.1 Households
There are two countries, home and foreign. The households have the
Epstein and Zin [1989] recursive preferences. For the home house-
holds,
1
( 1
1− ψ
) 1
1− ψ
1− ψ1
vt = (1 − δ)(ct ) + δEt [(vt+1 )1−γ ] 1−γ , (3.6)
∆ct+1 = µ g + xt + σt ε g,t+1 ,
xt+1 = ρxt + φe σt ε x,t+1 ,
σt2+1 = σ2 + ϕ(σt2 − σ2 ) + ωε w,t+1 .
claim as
w t +1 c wct+1
exp(rtc+1 ) = = t +1 .
wt − ct ct wct − 1
Using the Campbell-Shiller log-linearization of the log total wealth
return around the long-run average log wealth consumption ratio
µwc = E[log(wct )]:
θ
mt+1 = θ log δ − ∆c + (θ − 1)rtc+1 ,
ψ t +1
which depends on not only the log consumption growth ∆ct+1 , but
also the log return on the wealth portfolio rtc+1 . The following propo-
sition shows that the log wealth-consumption ratio and the log SDF
are linear functions of the state variables after the log-linearization.
where
1
1− ψ
Wx = ,
κ1c − ρ
(1 − γ) 1 − ψ1 φ2e 2νφe
Wσ = + +1 ,
2(κ1c − ϕ) (κ1c − ρ)2 κ1c − ρ
Implied Values:
Wealth-cons. loading on long-run growth Wx 31.07
Wealth-cons. loading on variance Wσ −5031
Backus-Smith coefficient β BS −1.3975
Mean wealth-cons. ratio exp(µwc ) 579.1
Campbell-Shiller constant κ0c 0.0127
Campbell-Shiller constant κ1c 1.0017
SDF loading on short-run growth shock −0.0256
SDF loading on long-run growth shock −0.1203
SDF loading on volatility shock 0.1273
where ait is the productivity level, ℓit is the labor input, and xijt is the
quantity of the intermediate goods produced by country j that are
used as production inputs in country i. The parameter θ i measures
the contribution of country i’s labor, and the parameters wij measure
the contribution of each country j’s input. We assume that
N
θi + ∑ wij = 1 and θ i , wij > 0,
j =1
N
cit = ∏(cijt )vij , (3.10)
j =1
N
∑ vij = 1 and vij > 0.
j =1
We collect the production parameters for final goods vij into a matrix
V. Along with the intermediate goods’ production weights W, the
matrix V will be a key source of international comovements in the
model.
The households have log preferences over their aggregate con-
sumption, and discount future utility at rate β. The utility function
is
∞
∑ δt log cit .
t =1
N
∑
j j
xit = cit + xit + dit .
j =1
On the left-hand side, we have the total output of country i’s inter-
mediate good. On the right-hand side, we have the demand for this
good from each country’s consumption and production sectors. We
have an additional term dit as a reduced-form proxy for demand
shocks. This term can represent government taxation and spending
as in Acemoglu, Akcigit, and Kerr [2016], or within-country trans-
fer to inactive/hand-to-mouth investors as in Jiang and Richmond
[2023b].
Labor supply is fixed. The market clearing condition for country
i’s labor is
i
ℓit = ℓ .
given by
! !
N N N
i
∑ πi log cit + φit θi
∏(xijt )wij −∑
j j
ait (ℓit ) cit + xit − dit + χi (ℓ − ℓit ),
i =1 j =1 j =1
The exogenous variables are the productivity shocks and the de-
mand shocks:
( ait , dit )∞
t =0 , i = 1, 2, . . . , N.
N production equations
!
N
θi
xit = ait (ℓit ) ∏(xijt )wij ,
j =1
2N 2 + N first-order conditions
j j
w.r.t. cit : π j v ji (cit )−1 = φit ,
j j j j
w.r.t. xit : φt x̄t w ji ( xit )−1 = φit ,
w.r.t. ℓit : φit x̄ti θ i (ℓit )−1 = χit ,
where π i is the Pareto weight that the social planner assigns to coun-
try i. This Pareto weight is determined by the initial level of wealth
held by each country’s households.
A key variable that emerges from the model is a function of trade
network parameters for intermediate usages, W, and consumption,
V. We define the network profile matrix as:
H ≡ V ( I − W ) −1 .
(b) Closer countries have more correlated currency base factors and less
volatile bilateral real exchange rate movements:
j
cov ∆eit , ∆et = C(i, j) − C(i ) − C( j) + κ e ,
i/j
var (∆et ) = −2C(i, j) + C(i, i ) + C( j, j),
where κ e is a constant:
N N
1
κe =
N2 ∑ ∑ C(k, ℓ),
k =1 ℓ=1
and C(i ) is the average closeness between country i and all countries:
N
1
C(i ) =
N ∑ C(i, j).
j =1
Fixing its closeness to itself C(i, i ), the country’s currency base factor is
less volatile if it has a higher average closeness.
The proof is presented in Appendix A.9. This proposition shows
that the consumption and exchange rate covariance is directly related
to the closeness matrix C , which depends on both the covariance
structures of supply and demand shocks as well as the structure of
the trade network that propagates these shocks.
with θ = 0.5.
In this trade network, countries 2, 3, and 4 rely on country 1’s
export as intermediate input. The dependency is increasing from
country 2 to 4. The implied centrality measure is
C(1, 1) 4.00
C(2, 2) 2.72
= ,
C(3, 3) 2.22
C(4, 4) 2.04
(1)
and the central country’s shock ε t+1 becomes the common factor in
the cross-section of exchange rates. In fact, even when we consider
the bilateral pair between countries 2 and 3 that does not involve the
(1)
central country 1, ε t+1 still shows up:
φ t x t = H ′ π + ( I − W ′ ) −1 d t ,
where φt x t is a vector whose i-th element is φit xit . That is, supply
shocks at do not affect the value of a country’s aggregate output;
only demand shocks do. This is because, while a higher supply shock
raises the quantity of a country’s output, it proportionally lowers
its price, so that the value of the output is unchanged [Cole and
Obstfeld, 1991]. In contrast, a higher demand shock raises the price
of the local goods while leaving the quantity unchanged, so that the
value of the output increases.
While this is a strong assumption, it is technically convenient.
From the production equations (3.9) and the social planner’s first-
order conditions, the country-level production quantities satisfy
!
N φit x̄ti wij
log x t = log at + θ log ℓt + ∑ wij log
i i i i
j j j
,
j =1 φt x̄t / x̄t
pt yt + ∑ b H,t−1 (h) exp(rt−1 (h)) + ∑ bF,t−1 (h) exp(rt∗−1 (h) − et ) = ct + ∑ b H,t (h) + ∑ bF,t (h) exp(−et ).
h h h h
def
where ∆et,t+h = et+h − et denotes the h-period log exchange rate
movement.
If we define the multi-horizon SDFs as
def u′ (ct+h )
exp(mt,t+h ) = δh ,
u′ (ct )
def u′ (c∗ )
exp(m∗t,t+h ) = δh ′ t+∗ h ,
u (ct )
then, the Euler equations can be written as
While both strategies are exposed to the same exchange rate risk
captured by ∆e0,h , the cumulative interest rate differential between
one-period bonds, i.e., ∑th=1 (rt−1 (1) − rt∗−1 (1)), can be different from
the interest rate differential between h-period bonds, i.e., h(r0 (h) −
r0∗ (h)). So, these two strategies generate different returns.
where
mc T
ϕ log β log β
mctP = − + + log δ − t − γ log ytP ,
1−ϕ 1−ϕ1−ϕ 1−ϕ
mc T ϕ log β log β
mctT = − + t − γ log ytT ,
1−ϕ 1−ϕ1−ϕ 1−ϕ
log β 1
= (γνP )2 + log δ − µγ,
1−ϕ 2
1 ϕ
mc T = − γ2 νT2 − γ2 νP νT ρ TP + log β.
2 + 2ϕ 1−ϕ
where
σP = −γνP ,
σT = −γνT .
δ̃t+h
exp(mctT ) = lim , (3.16)
h→∞ pt ( h)
1 = Et [exp(mt,t+h ) pt (h)].
def
(b) Let δ̃ = β1/(1−ϕ) . Then, the long-term bond yield satisfies
and
∞
1 1
lim (rt (h) + log δ̃)h =
h→∞
∑ ϕi σT εTt−i − 2 − 2ϕ2 σT2 − 1 − ϕ σTP . (3.19)
i =0
Eq. (3.19) shows that the long-term bond price, after taking out
the deterministic trend (log δ̃)h, converges to a function of the past
transitory shocks ε Tt−i . This quantity is closely related to the exchange
rate level, as we will see next. Moreover, using the definition of the
def
bond price pt (h) = exp(−rt (h)h), we can show that the bond price
scaled by δ̃ is finite:
1
rt (h) = − σP2 − log δ + µγ.
2
The second special case is one in which the permanent endow-
ment shock ε Pt is always zero. In this case, according to Eq. (3.17),
reproduced below,
p t +1 ( h − 1 )
lim = exp(mctT − mctT+1 ),
h→∞ pt ( h)
When markets are complete, the exchange rate is equal to the ratio
of the two countries’ SDFs:
Finally, assuming log δ̃ = log δ̃∗ and iterating this equation for-
ward, we obtain the following result [Lustig, Stathopoulos, and
Verdelhan, 2019].
Proposition 3.10. When markets are complete and the two countries’ SDFs
share the same permanent component, the deviation of the current exchange
rate level et from its long-run mean ē = lim j→∞ Et [et+ j ] is purely driven
by the infinite-horizon bond yield differential:
et − ē = lim h(rt (h) − rt∗ (h)). (3.21)
h→∞
mt,t+1 = −µ − σε t+1 ,
m∗t,t+1 = −µ + ϕet .
Conjecture that
∗
Et [ Mt,t +h ] = exp( h ( a h + bh et ))
∗ ∗
Et [ Mt,t +h ] = Et [exp(− µ + ϕet )Et+1 [ Mt+1,t+h ]]
= Et [exp(−µ + ϕet ) exp((h − 1)( ah−1 + bh−1 et+1 ))]
exp(h( ah + bh et )) = Et [exp(−µ + ϕet + (h − 1) ah−1 + (h − 1)bh−1 (1 − ϕ)et − (h − 1)bh−1 σε t+1 )]
1
= exp(−µ + (h − 1) ah−1 + (h − 1)2 bh2−1 σ2 + ((h − 1)(1 − ϕ)bh−1 + ϕ)et )
2
which implies
!
σ2 1 − (1 − ϕ)2h 2 − 2(1 − ϕ ) h
ah = −µ + 2
− +h
2h 2ϕ − ϕ ϕ
1 − (1 − ϕ ) h
bh = .
h
1
lim rt∗ (h) = lim (− ah + bh et ) = µ − σ2 = lim rt (h).
h→∞ h→∞ 2 h→∞
That is, the foreign long-term yield rt∗ (h) converges to the home long-
term yield rt (h) for large h, even when the next-period marginal
utility is stochastic in the home country (i.e., vart (mct+1 ) = σ2 ) while
it is fully predictable in the foreign country (i.e., vart (mc∗t+1 ) = 0).
This result confirms our earlier result in Eq. (3.20) that a stationary
exchange rate implies that the two countries’ long-term bond yields
converge to the same level.
1
mctP = mctP−1 − σP2 + σP ε Pt ,
2
log yt = log ytP−1 + µ + νP ε Pt + log ytT ;
Summary
• We deviate from the complete-market benchmark and introduce the bond convenience yield
λ∗t as a wedge in the households’ Euler equations for holding the home bond:
• Under additional assumptions about the liquidity benefits of the currency forward, the conve-
nient yield can be measured from the Treasury CIP deviation xtTreas :
1
λ∗t = − x Treas .
1 − β∗ t
1 = Et [exp(mt+1 + ρt )] .
rt < ρt ,
which implies that the Euler equation fails for the Treasury yield:
1 = Et [exp(mt+1 + r̃t+1 )] .
pt yt + b H,t−1 exp(rt−1 ) + bF,t−1 exp(rt∗−1 − et ) + b̄t = ct + b H,t + bF,t exp(−et ) + b̄t−1 exp(rt−1 ),
where b̄t on the left-hand side is the proceeds from the home gov-
ernment’s debt issuance in period t, which is transferred to the home
households, and b̄t−1 exp(rt−1 ) on the right-hand side is the home
government’s debt repayment for its borrowing in period t − 1, which
is financed by a tax on the home households.
In this way, we set up a simple structure that generates a positive
supply of risk-free debt that home and foreign households can hold.
This setting implicitly assumes the government is a natural manufac-
turer of risk-free debt, a theme that we will return to in this chapter
and in Chapters 7 and 8. In our setting, consider any bond utility
function v that satisfies the Inada condition, such that the marginal
utility of bond holding approaches infinity when the bond quantity
approaches zero, e.g., v(b) = log(b). Then, we can show that home
and foreign households have to hold positive quantities of home risk-
free bonds in equilibrium. In this case, the home government is the
only provider of safe assets.
δt u′ (ct ) − ζ t = 0,
Et δt v′ (b H,t ; θt ) − ζ t + ζ t+1 exp(rt ) = 0,
def v′ (b H,t ; θt )
exp(−λt ) = 1 − , (4.2)
u′ (ct )
exp(−λt ) = Et [exp(mt+1 + rt )] ,
1 = Et [exp(mt+1 − ∆et+1 + rt∗ )] .
∞
" #
E0 ∑ δt (u∗ (ct ) + v(b∗H,t ; θt∗ )) ,
t =0
1 = Et exp(m∗t+1 + rt∗ ) .
yt = c H,t + c∗H,t ,
y∗t = c F,t + c∗F,t .
In bonds market, the home and foreign bonds are in positive sup-
ply that is provided by their governments:
bt = b H,t + b∗H,t ,
bt∗ = bF,t + b∗F,t .
(c H,t , c F,t , b H,t , bF,t , pt , c∗H,t , c∗F,t , b∗H,t , b∗F,t , p∗t , rt , rt∗ , et )∞
t =0 .
u′ (c ) u′ (c∗t+1 )
Let exp(mt+1 ) = δ u′ (tc+)1 and exp(m∗t+1 ) = δ u′ (c∗t )
denote the
t
home and foreign SDFs. Let
v′ (b H,t ; θt )
exp(−λt ) = 1 − ,
u′ (ct )
v′ (b∗H,t ; θt∗ ) exp(−et )
exp(−λ∗t ) = 1 − ,
u′ (c∗t )
pt yt + b̄t + b H,t−1 exp(rt ) + bF,t−1 exp(rt∗−1 − et ) = (c H,t )α (c F,t )1−α + b̄t−1 exp(rt−1 ) + b H,t + bF,t exp(−et ),
(c H,t )α (c F,t )1−α = pt c H,t + p∗t c F,t exp(−et ),
yt = c H,t + c∗H,t ,
b̄t = b H,t + b∗H,t ,
exp(−λt ) = Et [exp(mt+1 ) exp(rt )] ,
1 = Et [exp(mt+1 ) exp(−∆et+1 + rt∗ )] ,
p∗t y∗t + b̄t∗ + exp(rt−1 + et )b∗H,t−1 + b∗F,t−1 exp(rt∗−1 ) = (c∗F,t )α (c∗H,t )1−α + b̄t∗−1 exp(rt∗−1 ) + exp(et )b∗H,t + b∗F,t ,
(c∗F,t )α (c∗H,t )1−α = pt c∗H,t exp(et ) + p∗t c∗F,t ,
y∗t = c F,t + c∗F,t ,
b̄t∗ = bF,t + b∗F,t ,
1 = Et exp(m∗t+1 ) exp(rt∗ ) ,
which look like the original Euler equations (1.6)–(1.9) that we de-
rived in Section 1.B for risk-free bonds without convenience yields.
However, a key difference makes these two cases not observation-
ally equivalent: the home and foreign investors may have different
convenience yields λt and λ∗t . In this case, the home investors effec-
tively perceive a home risk-free rate without convenience yield equal
to ρt = rt + λt , whereas the foreign investors effectively perceive
a different home risk-free rate without convenience yield equal to
ρ̃t = rt + λ∗t . Therefore, we cannot map the setting with bond conve-
nience yields back to the baseline model without bond convenience
yields by modifying the bond yields.
exp(−λt ) = Et [exp(mt+1 + rt )] .
Since the home households’ SDF and convenience yield are assumed
to be unaffected, this Euler equation implies that the dollar risk-free
rate rt does not change.
Second, from the foreign households’ Euler equation for holding
home bonds, Eq. (4.7), reproduced below,
we learn that the dollar exchange rate movement also raises the ex-
pected return on purchasing foreign currency bonds from the home
perspective. Since the home households do not derive a convenience
yield on foreign bonds that can adjust, all adjustment must happen in
the dollar’s currency risk premium. In our equilibrium, this happens
via endogenous changes in the cyclicality and volatility of the dollar.
Thus, these four Euler equations require endogenous responses in
both first moments (i.e., exchange rate level and expected return) as
well as second moments (i.e., currency cyclicality and volatility) in
response to the shock to the convenience yield. As such, although the
convenience yield and the risk premium are conceptually different,
they could be correlated in practice.
Proposition 4.1. The home currency’s expected log excess return is de-
termined jointly by the home currency’s risk premium and the home bond’s
convenience yield:
def
where rpt = −covt (m∗t+1 , ∆et+1 ) − 21 vart (∆et+1 ) is the currency risk
premium.
currency risk premium and the convenience yield from their different
perspectives:
1 1
−covt (m∗t+1 , ∆et+1 ) − vart (∆et+1 ) − λ∗t = covt (mt+1 , −∆et+1 ) + vart (∆et+1 ) − λt ,
2 2
(4.9)
Proposition 4.2. The exchange rate level is equal to the sum of expected
future interest rate differentials, the sum of expected future convenience
yields, the sum of expected future currency risk premia, and the long-run
exchange rate level:
∞ ∞ ∞
et = ∑ Et [rt+ j − rt∗+ j ] + ∑ Et [λ∗t+ j ] − ∑ Et [rpt+ j ] + ē.
j =0 j =0 j =0
1 = Et [exp(mt+1 + ρt )] ,
1 = Et exp(m∗t+1 + ρ∗t ) .
We can relate these two risk-free rates to the bond yields rt and rt∗ by
ρt = rt + λt ,
ρ∗t = rt∗ .
which implies that the variations in the exchange rate level are driven
by the variations in the risk-free rate differentials (which carry no
convenience yields), in the currency risk premia, and in the difference
between the convenience yields that home and foreign households
derive from holding the home bond.
This expression shows why it is natural to focus on the case in
which home and foreign households derive different convenience
yields, i.e., λt ̸= λ∗t . If the home and foreign households have the
identical convenience yield, i.e., λt = λ∗t , then, the exchange rate level
is determined only by the risk-free rate differentials and the currency
risk premia, both of which are driven by the SDFs via
1
rpt = −covt (m∗t+1 , ∆et+1 ) − vart (∆et+1 ).
2
In other words, only the home and foreign households’ convenience
yield differential, rather than their convenience yield levels, matters
for the exchange rate level.
Proposition 4.3. When the markets are complete, the home and foreign
households derive the same convenience yields from holding the dollar bond:
λt = λ∗t ,
and the exchange rate is determined only by the marginal utility differential:
∆etcm = mt − m∗t .
1
rt = −Et [mt+1 ] − vart (mt+1 ) − λt .
2
def
covt (et , rt |εm m m
t ) = covt ( proj [ et | ε t ], proj [rt | ε t ]) > 0.
def
covt (et , rt |εdt ) = covt ( proj[et |εdt ], proj[rt |εdt ]) < 0.
The overall relationship between the interest rate and the exchange
rate depends on the relative strength of these two effects.
This intuition also carries over to the case of long-term debt,
whose yield also depends on the expected interest rates and the
convenience yields. In particular, an increase in the expected path of
interest rates raises the long-term bond yield and appreciates the cur-
rency, whereas a flight-to-safety shock can lower the long-term bond
yield if it also carries some convenience benefits, while appreciating
the currency. However, as we see in Section 3.D, when markets are
complete and exchange rates are stationary, the presence of currency
risk premium does not break the long-run UIP condition (3.21). As a
result, like interest rate shocks, risk premium shocks affect the long-
run bond yield and the exchange rate in the same direction, which
also shows a stark difference between currency risk premium and
convenience yield.
discuss this issue in the context of the Treasury basis, which follows
Jiang, Krishnamurthy, and Lustig [2021a].
Recall that a dollar forward allows investors to lock in the dollar
exchange rate at a fixed rate f t in the next period. Now, consider the
following trading strategy. In period t, an investor converts 1 unit of
dollar to exp(et ) units of the foreign currency, and purchase foreign
risk-free bonds. At the same time, the investor enters a currency
forward contract to lock in the dollar exchange rate at f t in period
t + 1 for exp(et + rt∗ ) units of foreign currency.
Then, in period t + 1, the investor has exp(et + rt∗ ) units of foreign
currency, which are then converted back to exp(− f t + et + rt∗ ) units
of dollar using the forward contract. This strategy generates a return
of exp(− f t + et + rt∗ ) per dollar invested. Since the investor does not
hold the actual dollar risk-free bond but still earns a risk-free return
in dollar terms, we regard this combination of foreign risk-free bond
and currency forward contract as a synthetic dollar bond.
If this synthetic dollar bond does not carry a convenience yield,
then, its Euler equation can be expressed as
1 = Et [exp(mt+1 − f t + et + rt∗ )]
−λ∗t = rt − (rt∗ − f t + et ),
−λt = rt − (rt∗ − f t + et ),
which implies that the convenience yields that foreign and home
households impute to the dollar risk-free bonds are equal to each
other, and equal to the negative of the dollar Treasury basis, defined
as
def
xtTreas = rt − (rt∗ − f t + et ).
When the dollar risk-free bond has a convenience yield, its risk-free
rate rt is lower than the synthetic dollar yield, i.e., (rt∗ − f t + et ),
leading to a negative dollar Treasury basis:
def
xtTreas = rt − (rt∗ − f t + et ) = −λ∗t < 0.
which states that the Treasury basis captures the fraction of the con-
venience yield that the U.S. Treasury earns on top of the synthetic
dollar bond.
Similarly, from the home households’ perspective, we assume that
the convenience yield derived from the synthetic dollar bond is a
fraction β of the convenience yield derived from the actual dollar
bond:
which implies
1
λ∗t = − x Treas ,
1 − β∗ t
1
λt = − x Treas .
1−β t
def
xtLibor = libort − (libort∗ − f t + et ).
Unlike the Treasury basis, the Libor basis was very close to zero
before the Global Financial Crisis, and widened afterwards [Du,
Tepper, and Verdelhan, 2018b]. This pattern suggests that the dollar
Libor and the synthetic position based on the foreign Libor and the
currency forward have a similar level of convenience yields before the
crisis, i.e.,
This is not a coincidence if there are investment banks who are indif-
ferent between the two positions and actively arbitrage any spread
between the Libor market and the currency forward market. As such,
both the dollar Libor and the synthetic dollar bond earn the same,
non-zero convenience yield β∗ λ∗t , so that their spread, i.e., the Libor
basis xtLibor , is zero.
After the crisis, likely due to regulatory constraints, the dollar
Libor now contains a higher level of convenience yield, and banks
face higher costs to arbitrage this spread.2 As a result, the dollar 2
There is a debate about whether
the post-crisis regulations distort the
Libor has a higher level of convenience yield than the synthetic dollar
interest rate market or the currency
bond, leading to a negative Libor basis: forward market [Augustin, Chernov,
Schmid, and Song, 2020].
exp(−γ∗ λ∗t ) = Et exp(m∗t+1 + ∆et+1 − libort ) ,
with γ∗ > β∗ ,
def
xtLibor = libort − (libort∗ − f t + et ) = −(γ∗ − β∗ )λ∗t < 0.
The convenience yields accrue to bonds that are safe and liquid,
which is why we begin our analysis with one-period Treasury bill.
In this section, we broaden our perspective by considering how the
convenience yield on the short-term bond also affects the pricing of
the long-term bond. To do so, we first focus on the domestic house-
holds’ perspective and understand how the convenience yields affect
the long-term bond prices. Then, we study how the home and for-
eign households trade the long-term bonds with each other and
understand how the convenience yields and bond risk premia jointly
determine the exchange rate level.
∞
" #
E0 ∑ δ (u(ct ) + v(qt (1) pt (1); θt ))
t
.
t =0
v ′ ( q t (1); θ t )
def 1
exp(−λt ) = 1 − = E t exp ( m t,t+1 ) ,
u′ (ct ) p t (1)
which does not contain a convenience yield term since the 2-period
bond does not offer any immediate convenience benefits. However, if
we substitute the 1-period bond price into the equation, we obtain
which implies that the bond price today reflects the expectation of the
bond’s convenience yield when it becomes 1-period bond in the next
period. In this way, the convenience yield that only accrues to the
short-term bond can affect the entire term structure of bond prices.
Moreover, the next period’s convenience yield λt+1 is priced by
the SDF mt,t+1 in the sense that their covariance also matters. We can
rewrite the 2-period bond price as
On the right-hand side, the first term shows that a higher expected
convenience yield increases the price of the 2-period bond today, and
the second term shows that a more counter-cyclical convenience yield
can additionally raise the bond price by lowering its risk premium.
We can iteratively derive the valuation of the longer-term bonds
from that of the nearer-term bonds. Jiang and Richmond [2022] pro-
vide a general affine expression for the bond price in this setting
under the additional assumption that the SDF volatility and the con-
venience yield jointly follow an affine process. As we expected, the
1-period bond’s convenience yield and its cyclical property affects not
only the long-term bonds’ expected service flows and prices, but also
their cyclical properties and hence risk premia.
More realistically, we can additionally assume that the investors
derive utility from holding long-term bonds as well. In this case, we
where the new term ηt+1 introduces a wedge in the exchange rate
expression. We will discuss this term in greater detail in the next sec-
tion. For now, it is sufficient to note that this term could be correlated
with the convenience yield λ∗t .
We can express this equation iteratively from period 0:
t t
e t − e0 = ∑ (mk − m∗k ) + ∑ ηk .
k =1 k =1
Jiang, Krishnamurthy, Lustig, and Sun [2021b] show that, if the cu-
mulative wedge ∑tk=1 ηk is a random walk, the cumulative SDF dif-
ferential ∑tk=1 (mk − m∗k ) needs to have a permanent component that
In fact, the left-hand side is finite whereas the right-hand side can be
infinite in this case. The convenience yield also introduces a wedge
in this long-run UIP condition. A full characterization of this wedge
remains an open question.
Summary
• Incomplete markets also lead to a wedge between the exchange rate movement and the SDF
differential:
• This wedge affects the exchange rate volatility, cyclicality, and currency risk premium.
• Market incompleteness is a precondition for convenience yields to impact the exchange rate
dynamics, and convenience yields enrich the effects of market incompleteness on the exchange
rate dynamics.
risk-free bonds are tradable, the original Euler equation (1.6) holds,
i.e.,
1 = Et [exp(mt+1 + rt )] ,
exp(−λt ) = Et [exp(mt+1 + rt )] .
given by
y
dyt = µy yt dt + σy yt dZt ,
y∗
dy∗t = µy∗ y∗t dt + σy∗ y∗t dZt .
where u∗ (c∗t ) = γ∗ log(c∗t ) and c∗t = (c∗F,t )α (c∗H,t )(1−α) . We assume that
there are no time-varying demand shocks for foreign household, i.e.
γ∗ is constant.
More formally, we fix a probability space (Ω, F , P) and a given
filtration F = {F0 : t ≥ 0} satisfying the usual conditions. We assume
that all stochastic processes are adapted to this filtration. Specifically,
uncertainty is represented by a standard three-dimensional Brownian
y y∗
motion Zt = [ Zt , Zt , Zt ]′ .
γ
The only tradable assets are the two countries’ risk-free bonds,
which are denominated in the local consumption bundles and have
interest rates rt and rt∗ , respectively. The real exchange rate exp(et )
is the conversion ratio between the home and foreign consumption
bundles. 1 unit of the home bundle is worth exp(et ) units of the
foreign bundle. Furthermore, assume that the log exchange rate
follows
det = κt dt + σt dZt ,
(ct , c H,t , c F,t , wt , xt , pt , c∗t , c∗H,t , c∗F,t , wt∗ , xt∗ , p∗t , rt , rt∗ , et )tT=0 ,
The model implies the following 15 equations in each period t,
which are very similar to the equilibrium conditions in the baseline
model listed in Section 1.A.6. They include 2 consumption aggrega-
tion equations,
yt = c H,t + c∗H,t ,
y∗t = c F,t + c∗F,t ,
−σt′
∗ 1 ′
λt = r − κ t + σt σ − r t ,
||σt ||2 t 2 t
It is worth noting that this family of SDFs correctly price the home
and foreign bonds:
Z t
A exp mν,t + rt dt = 0,
0
Z t
A exp mν,t − et + rt∗ dt = 0,
0
where
Proposition 5.1. (1) The fraction of wealth xt invested in the foreign bonds
by the home households is given by
!
wt + st ωσ3,t (rt∗ − κt + 12 σt σt′ − rt ) st σts σt′
xt = − 2
− 2
+ , (5.4)
wt ∥σt ∥ ∥σt ∥ wt ∥σt ∥2
and the fraction of wealth xt∗ invested in the home bonds by the foreign
households is given by
r ∗ − κ + 1 σ σ′ − r ∗
∗
w̃ + s̃ ∗ t t 2 t t t s̃∗t σts̃ σt′
xt∗ = 1 − t ∗ t − . (5.5)
w̃t ∥σt ∥2 w̃t∗ ∥σt ∥2
(2) The processes νH and ν̃F , which enter the specification of the home
state price densities personalized for the home and foreign households respec-
tively, are given by:
σt′ σt
′ st s ′
νH,t = I3 − −ωi3 + (σ ) ,
∥σt ∥2 wt + st t
s̃∗ σ′ σt
∗
ν̃F,t = ∗ t ∗ I3 − t 2 (σts̃ )′ ,
w̃t + s̃t ∥σt ∥
( pt , p∗t , et , rt , rt∗ )
Lemma 5.3. The Pareto weight is determined by the relative wealth of the
two countries’ households, i.e., wt + st and w̃t∗ + s̃∗t , scaled by their discount
rates γt and γ∗ :
w̃t∗ + s̃∗t γt
πt = . (5.7)
wt + st γ ∗
(2) When such equilibrium exists, the log exchange rate dynamics follow
det = κt dt + σt dZt ,
where
2
1 1 1 1 ∥αγt ωi3 + γ∗ (1 − α)πt (ν̃F,t − νH,t )′ ∥2 1 ∥(1 − α)γt ωi3 + γ∗ απt (ν̃F,t − νH,t )′ ∥
κt = µy∗ − µy + σy2 − σy2∗ − +
2α − 1 2 2 2 (γt α + πt γ∗ (1 − α))2 2 ( γt ( 1 − α ) + π t γ ∗ α ) 2
1 αγt ωi3 + γ∗ (1 − α)πt (ν̃F,t − νH,t )′ (1 − α)γt ωi3 + γ∗ απt (ν̃F,t − νH,t )′
σt = −σy i1 + σy∗ i2 + ∗
− .
2α − 1 γt α + π t γ ( 1 − α ) γt ( 1 − α ) + π t γ ∗ α
(3) The log home state price density based on the home household’s con-
sumption, mνH ,t , follows
where
and the log foreign state price density based on the foreign household’s
consumption, m∗ν∗ ,t , follows
F
where
where
1
dηt = (∥νH,t ∥2 − ∥ν̃F,t ∥2 )dt + (νH,t − ν̃F,t )′ dZt .
2
(4) We can then express the market price of risk λt , and the interest rates
rt and rt∗ as functions of exogenous state variables.
′
λt = −σm,t − νH,t ,
1
rt = −µm,t − ∥σm,t ∥2 ,
2
1
rt∗ = −σt λt + κt − σt σt′ + rt .
2
The proof is presented in Appendix A.17. This proposition solves
the equilibrium exchange rate, SDF, and risk premium dynamics im-
plicitly as functions of νH,t and ν̃F,t , which are themselves functions
of the exchange rate dynamics and wealth distribution. The central
equation is (5.8), which shows that the exchange rate movement is
no longer equal to the SDF differential as we saw in the complete-
market model in Section 1.C. Instead, an additional wedge dηt arises
and gives to additional variations in the exchange rate. As such, we
refer to the dηt term as the incomplete-market wedge. However, this
wedge is driven by the same fundamental shocks dZt that drive the
SDF processes. In this sense, the additional exchange rate variations
in incomplete-market models are not necessarily non-fundamental.
T
Z
∗
max E exp(−δt) (u(ct ) + πu (c∗t )) dt
c H,t ,c F,t ,c∗H,t ,c∗F,t 0
which implies
γt α γt ( 1 − α )
c H,t = yt , c F,t = y∗ ,
γt α + πγ∗ (1 − α) γt (1 − α) + πγ∗ α t
πγ∗ (1 − α) πγ∗ α
c∗H,t = yt , c∗F,t = y∗ .
γt α + πγ∗ (1 − α) γt (1 − α) + πγ∗ α t
γt ( 1 − α )
∂u(c H,t , c F,t ) ∂u(c H,t , c F,t ) γt α
exp(totcm
t )= / = / ,
∂c H,t ∂c F,t c H,t c F,t
∂u∗ (c∗H,t , c∗F,t ) ∂u∗ (c∗H,t , c∗F,t )
! ! ! !
γ ∗ (1 − α ) γ∗ α
exp(totcm
t )= / = / .
∂c∗H,t ∂c∗F,t c∗H,t c∗F,t
γt α + πγ∗ (1 − α) y∗t
exp(totcm
t )= .
γt (1 − α) + πγ∗ α yt
Proposition 5.3. In the complete-market case, the log exchange rate follows
where
∗,cm
detcm = dmcm
t − dmt .
w̃t∗ γt
π= . (5.9)
wt γ∗
is always a valid foreign SDF that is consistent with the asset market
equilibrium. While this SDF correctly prices the tradable assets, it
does not necessarily coincide with the foreign households’ marginal
utility growth. To the extent that we are interested in understanding
the relationship between the exchange rate dynamics and the macroe-
conomy, we would like to focus on the home and foreign SDFs that
reflect the marginal utility growth, i.e.,
′
u′ (c ) u∗ (c∗t+1 )
m t +1 = δ ′ t +1 , m∗t+1 =δ ′ .
u (ct ) u∗ (c∗t )
1 = Et exp(m∗t+1 + ∆et+1 + rt ) ,
to obtain
1 = Et [exp(mt+1 + ηt+1 + rt )] .
1 = Et [exp(mt+1 + rt )] ,
1 = Et exp(m∗t+1 + rt∗ ) ,
1
Et [ηt+1 ] − vart (ηt+1 ) + covt (m∗t+1 , ηt+1 ) = 0. (5.11)
2
vart (∆et+1 ) = vart (mt+1 − m∗t+1 + ηt+1 ) = vart (mt+1 − m∗t+1 ) + vart (ηt+1 ) + 2covt (mt+1 − m∗t+1 , ηt+1 )
= vart (mt+1 − m∗t+1 ) − vart (ηt+1 ).
Proposition 5.4. (a) The conditional exchange rate volatility can be ex-
pressed as
1
Et [rxt+1 ] = (vart (m∗t+1 ) − vart (mt+1 )) + Et [ηt+1 ].
2
When a volatile ηt+1 term lowers the exchange rate volatility and the
covariance between the SDF and the exchange rate movement, it also
covt (∆et+1 , mt+1 )vart (m∗t+1 ) − covt (∆et+1 , m∗t+1 )covt (mt+1 , m∗t+1 )
yt = ,
vart (mt+1 )vart (m∗t+1 ) − covt (mt+1 , m∗t+1 )2
covt (∆et+1 , m∗t+1 )vart (mt+1 ) − covt (∆et+1 , mt+1 )covt (mt+1 , m∗t+1 )
zt = ,
vart (mt+1 )vart (m∗t+1 ) − covt (mt+1 , m∗t+1 )2
xt = Et [∆et+1 ] − yt Et [mt+1 ] − zt Et [m∗t+1 ],
wt ε t+1 = ∆et+1 − xt − yt mt+1 − zt m∗t+1 .
The last term wt ε t+1 captures the residual in the bilateral exchange
rate movement that is not explained by the SDFs. Compared to the
ηt+1 term in Eq. (5.1), the residual wt ε t+1 is orthogonal to the SDFs,
whereas ηt+1 can have arbitrary correlations with both countries’
SDFs.
Under joint normality, the exchange rate dynamics can be repre-
sented by ( xt , yt , zt , wt ). Alternatively, they can also be represented
by the expected exchange rate movement Et [∆et+1 ], the bilateral ex-
change rate movement’s variance vart (∆et+1 ), and its covariances
with the two countries’ SDFs covt (∆et+1 , mt+1 ) and covt (∆et+1 , m∗t+1 ).
Using this exchange rate decomposition, we can provide a differ-
ent characterization of the exchange rate volatility, cyclicality, and risk
premium [Jiang, 2023b].
Proposition 5.5. (a) The conditional exchange rate volatility can be ex-
pressed as
covt (mt+1 − m∗t+1 , ∆et+1 ) = covt (yt mt+1 + zt m∗t+1 , mt+1 − m∗t+1 ).
(c) Using the Euler equations (1.6)–(1.9), the conditional currency risk
premium can be expressed as
def 1
rpt = Et [rxt+1 ] = − covt (yt mt+1 + zt m∗t+1 , mt+1 + m∗t+1 ).
2
The proof is in Appendix A.20. As a special case, when markets
are complete, yt = 1, zt = −1 and ε t+1 = 0. These exchange rate
moments simplify to
∗
vart (∆etcm
+1 ) = vart ( mt+1 − mt+1 ),
covt (mt+1 − m∗t+1 , ∆etcm ∗
+1 ) = vart ( mt+1 − mt+1 ),
1 ∗ ∗ 1 ∗ 1
rpcm
t = − covt ( mt+1 − mt+1 , mt+1 + mt+1 ) = vart ( mt+1 ) − vart ( mt+1 ),
2 2 2
which recovers the complete-market solution in Section 1.C.
When markets are incomplete, if the magnitudes of the SDF-FX
pass-through coefficients yt and zt are smaller than 1, then, we obtain
a less volatile SDF term yt mt+1 + zt m∗t+1 , which reduces the exchange
rate’s variance and covariance with the SDF differential towards 0.
At the same time, a smaller SDF term also reduces the currency risk
premium. In this way, we recover the dilemma in Lustig and Verdel-
han [2019] that, as market incompleteness reduces the exchange rate
volatility, it also shrinks the currency risk premium towards zero. By
teasing out the SDF term yt mt+1 + zt m∗t+1 from the residual wt ε t+1 in
Then, using the Euler equations (1.6)–(1.9), we can derive the fol-
lowing bound on the exchange rate variance.
0.5
-0.5
-1
-1.5
-1.5 -1 -0.5 0 0.5 1 1.5
As a result, the three bilateral exchange rates are not mutually inde-
pendent. Given the simplicity of Eq. (5.17), it suffices to understand
the dynamics of the bilateral exchange rates between the home coun-
try and each foreign country. The dynamics between the two foreign
countries are easily implied.
Proposition 5.7. The Euler equations for risk-free bonds between countries
1 and 2 imply
(0) (1) (0) (2)
covt (mt+1 − mt+1 − ∆e0/1 0/2 0/2 0/1
t+1 , ∆et+1 ) + covt ( mt+1 − mt+1 − ∆et+1 , ∆et+1 ) = 0.
(5.20)
between the home country and each foreign country directly imply
the bilateral exchange rate movement ∆e2/1
t+1 between the two foreign
countries, the Euler equations (5.18) and (5.19) between the home
country and each foreign country do not imply the Euler equations
(5.20) between the two foreign countries in the general case. As a
result, when we extend our analysis from two to three countries, the
number of independent exchange rate movements increases from 1
to 2, while the number of unique sets of Euler equations for cross-
country bond holdings increases from 1 to 3. The additional Euler
equations impose restrictions on the exchange rate dynamics that are
absent in the two-country setting.
(0) (i )
The complete-market case is an exception: mt+1 − mt+1 − ∆e0/i t +1 =
0 satisfies all the conditions above trivially. Perhaps this is why we
are used to studying only the bilateral dynamics between the home
country and each foreign country in the complete-market setting,
which is no longer sufficient when markets are incomplete, as the
Euler equations between the two foreign countries impose additional
restrictions on the exchange rate dynamics.
What do these additional restrictions imply for the exchange rate
dynamics? To answer this question, we first study a specific three-
country model, and then we generalize the results using the no-
arbitrage approach.
1−α
log ct = α log ci,t + ∑
(i ) (i ) (i )
log c j,t .
j ̸ =i
2
not span the shocks. The risk-free bonds are freely tradable, but the
existence of other assets such as equities and long-term bond is not
central to our result. For simplicity, we assume that the agents can
also trade equities which are claims to the country-level endowment
claims.
(i )
Let w̃t denote the wealth of country i’s agent, expressed in a
common numéraire. We have the following result characterizing the
equilibrium exchange rate.
Proposition 5.8. The equilibrium log real exchange rate between the foreign
countries 1 and 2 can be expressed as
(1) (2)
1− α w̃t w̃t 1−α
2 + (0) α + (0) 2 ( 2 )
3α − 1 w̃t w̃t yt
e1/2
t = log
( 1 ) ( 2 )
+ log (1)
,
2 1− α w̃t 1−α w̃t y
2 + (0) 2 + (0) α
w̃t w̃t
t
(1) (2)
which depends not only on the endowments yt and yt , but also on
(1) (0)
the endogenous wealth distribution between agents, i.e., w̃t /w̃t and
(2) (0)
w̃t /w̃t .
The proof is presented in Appendix A.23. This expression relates
the bilateral exchange rate between the foreign countries 1 and 2
to the wealth of country 0’s agent. In particular, a shock to country
(0)
0’s endowment yt could directly impact the foreign countries’ bi-
lateral exchange rate by affecting the wealth distribution. This can
happen when the agents in countries 1 and 2 hold different portfolios
according to their heterogeneous wealth and preferences, which ex-
pose them differently to country 0’s endowment shock. In this way,
a shock originating from country 0 affects the bilateral exchange rate
between countries 1 and 2, which gives us a concrete example of the
spill-over effect.
In stark contrast, when the markets are complete, full risk-sharing
implies the following wealth distribution:
(i ) (i )
π (i ) w /γ
= (t0) t(0) ,
π (0) wt /γt
(1)
which is only a function of the country-specific endowments yt and
(2) (0) (1) (2)
yt and demand shocks γt , γt , and γt .
Summary
• We develop a stylized model in the New Keynesian tradition. In this model, when prices are
flexible, monetary and fiscal policies have no real effects: they do not affect consumption and
real exchange rate. They only affect inflation, which affects the nominal interest rate and the
nominal exchange rate.
• When prices are sticky, monetary and fiscal policies have real effects: they influence the real
interest rate, which affects consumption and the real exchange rate.
• Monetary and fiscal policies are not necessarily independent. Taking a strong stance on one
policy can affect the other policy.
• While it is beyond the scope of this chapter, monetary and fiscal policies can affect the ex-
change rate dynamics through other channels such as the risk premium and the convenience
yield as well.
In this chapter, we will consider a model with the nominal layer for
the first time. Let us begin with some basic definitions and character-
izations, which apply to not only this model but most models with
a nominal layer. In particular, we may introduce home and foreign
currencies to the baseline model in Section 1.A and consider it as a
special case.
Let Pt denote the price index in the home country, which measures
the price of the home consumption bundle in the unit of the local
currency. For example, if one unit of the U.S. home consumption
bundle costs 100 dollars, then the U.S. price level Pt = 100. Inflation
is defined as the change in the price level:
def
πt = ∆ log Pt .
Recall that ct is the real consumption. We use Ct = Pt ct to denote
the nominal consumption. In the same example, if the U.S. house-
holds consume 10 units of the home consumption bundle, then the
real consumption is ct = 10 units of the consumption bundle and
the nominal consumption is Ct = 10 × 100 = 1000 dollars. More
generally, we use uppercase letters to denote nominal prices and
quantities, and we use lowercase letters to denote real prices and
quantities. The nominal interest rate it and the inflation rate πt are
the exceptions to this notation, as it is common to express them in
lowercase.1 1
In our derivation, we will also use the
Recall that the log real exchange rate et measures the conversion lower case pt (h) to denote the price of a
specific variety of goods, which helps to
ratio between the home and foreign countries’ consumption baskets, distinguish it from the aggregate price
and a higher value means that the home consumption basket is more index Pt .
If an asset with real return r̃t+1 , its nominal return is r̃t+1 + πt+1 . We
can price this asset either by
or equivalently by
1 = Et [exp( Mt+1 + it )] ,
which implies
where
def 1
irpt = covt (mt+1 , πt+1 ) − vart (πt+1 )
2
denotes the inflation risk premium. This expression shows that the
nominal interest rate it can be decomposed into the real interest
rate rt , the expected inflation rate Et [πt+1 ], and the inflation risk
premium irpt .
Define the nominal currency excess return as
Proposition 6.1. The home currency’s expected log excess return is deter-
mined by the covariance between the log foreign SDF and log exchange rate
movement minus a Jensen’s term:
def 1
RPt = Et [ RXt+1 ] = −covt ( Mt∗+1 , ∆Et+1 ) − vart (∆Et+1 ).
2
6.B.1 Households
There are two countries, home and foreign. Each country contains
a unit mass of households, a unit mass of firms, and a government.
Home households are indexed by j ∈ [0, 1], and home firms are
indexed by h ∈ [0, 1]. Foreign households are indexed by j∗ ∈ [0, 1],
and foreign firms are indexed by f ∈ [0, 1]. Each firm produces a
unique variety of good, which is an imperfect substitute for other
varieties.
The lifetime expected utility of home household j is
∞ ∞
ℓ t ( j )1+ ν
def
E0 ∑ δ ut ( j) = E0 ∑ δ log ct ( j) − κ
t t
,
t =0 t =0
1+ν
where ℓt ( j) is the labor effort, and ν > 0 is the labor curvature coef-
ficient. ct ( j) is the consumption composed of a Cobb-Douglas basket
of home and foreign bundles:
1 α 1− α
Pt = PH,t PF,t .
α α (1 − α )1− α
and
Pt Pt
c H,t = α ct , c F,t = (1 − α) ct .
PH,t PF,t
def Pt ct ( j)
exp( Mt+1 ) = δ ,
Pt+1 ct+1 ( j)
which is related to its real SDF via
def ct
exp(mt+1 ) = δ = exp( Mt+1 + πt+1 ).
c t +1
Since the markets are complete, we can show that the nominal
exchange rate movement equals the ratio between the two countries’
nominal SDFs:
6.B.2 Firms
Each home firm produces a variety h using labor supplied by home
households. The production technology is linear in labor input:
y t ( h ) = z t ℓ t ( h ),
where yt (h) is the output of firm h, ℓt (h) is the labor input, and zt
is a productivity process common to all home firms. To produce
yt (h) units of goods, firm h faces a wage cost of Wt ℓt (h). Similarly,
the foreign firm has a production function linear in labor input with
productivity z∗t .
Aggregating across home and foreign households, we obtain the
total demand for variety h:
Z 1 Z 1
yt ( h) = ct (h, j)dj + c∗t (h, j)dj.
0 0
This equation states that the real value of government debt is equal
to the real present value of government surpluses. In particular, if
Qt /Pt is greater than or smaller than the present value of government
surpluses, an arbitrage opportunity will exist.
Qt + Pt gt = Pt τt + Qt+1 exp(−it ).
When the monetary authority raises the nominal interest rate it , if the
fiscal policy (τt , gt ) and the current price level (Pt ) remain the same,
then, the government has to issue a higher nominal amount of debt
Qt+1 . This higher nominal amount of debt then raises the price level
Pt+1 in the next period via
∞
" #
Q t +1
= Et+1 ∑ exp(mt+1,t+1+k )st+1+k .
Pt+1 k =0
def def
Equivalently, let qt = Qt /Pt and recall πt = ∆ log Pt . Then, we can
express the government budget condition Eq. (6.5) in real terms as
qt + gt = τt + qt+1 exp(πt+1 − it ).
Pt P∗
yt = α (ct + gt ) + (1 − α) ∗t (c∗t + gt∗ ). (6.7)
PH,t PH,t
Note that Qt+1 denotes the quantity of the home government debt
that is due at time t + 1. The market clearing condition (6.8) requires
that the total amount of home currency that both countries’ house-
holds receive from their holdings of Arrow-Debreu securities in state
σt+1 is equal to the amount of nominal debt the home government
pays back at time t + 1.
κPt ct ℓνt = Wt ,
1 α 1− α
Pt = PH,t PF,t ,
α α (1 − α )1− α
PH,t c H,t = αPt ct ,
PF,t c F,t = (1 − α) Pt ct ,
Pt P∗
zt ℓt = α (ct + gt ) + (1 − α) ∗t (c∗t + gt∗ ),
PH,t PH,t
max Dt ( h ) ,
pt (h),p∗t (h)
where the firm dividend Dt (h) is given by Eq. (6.4). The solution is
ρ
pt (h) = exp(−Et ) p∗t (h) = MCt ,
ρ−1
which implies that the firm sets the same price in the home and
foreign markets, and the price is equal to a constant ρ/(ρ − 1) times
the marginal cost MCt . If we define the mark-up as the ratio between
the price and the marginal cost, the mark-up is a constant in this
case. This is a standard result in the oligopoly problem whenever the
substitution pattern is modeled by the CES aggregator.
Then, the model is closed by the following equations for prices:
∗ ρ
PH,t = exp(−Et ) PH,t = MCt , (6.11)
ρ−1
∗ ρ
PF,t = exp(Et ) PF,t = MCt∗ . (6.12)
ρ−1
which implies that the aggregate price level is always set at the mark-
up multiplier times the nominal marginal cost.
g∗ g∗
gt gt
ℓt = ℓ̄ 1 + α + (1 − α) ∗t , ℓ∗t = ℓ̄ 1 + α ∗t + (1 − α) ,
ct ct ct ct
log ct − log c̄ = α log zt + (1 − α) log z∗t , log c∗t − log c̄ = α log z∗t + (1 − α) log zt ,
gt∗
ν 2 gt
log ct − log c̄ = α log zt + (1 − α) log z∗t
− 2
( α + (1 − α ) ) + 2(1 − α ) α ∗ ,
1+ν ct ct
∗
∗ ∗ ν 2 2 gt gt
log ct − log c̄ = α log zt + (1 − α) log zt − ( α + (1 − α ) ) ∗ + 2(1 − α ) α ,
1+ν ct ct
g∗
ν gt
et = − log ct + log c∗t = −(2α − 1)(log zt − log z∗t ) + (2α − 1)2 − ∗t ,
1+ν ct ct
pt ( h ) −ρ
" #
max Et−1 exp( Mt−1,t )( pt (h) − MCt ) c H,t .
pt (h) PH,t
which implies
∗ ρ
exp(−Et ) PH,t = E [ MCt ] .
ρ − 1 t −1
So, the entire price block can be described as
∗ ρ
PH,t = exp(−Et ) PH,t = E [ MCt ] , (6.13)
ρ − 1 t −1
∗ ρ
PF,t = exp(Et ) PF,t = E [ MCt∗ ] . (6.14)
ρ − 1 t −1
In other words, under Producer Currency Pricing, the local-currency
price of the local consumption bundle (e.g., PH,t ) is fixed and the
local-currency price of the foreign consumption bundle (e.g., PF,t )
comoves with the exchange rate.
Alternatively, some models assume Local Currency Pricing, under
which case exports are priced and invoiced in the foreign (importers’)
currency. Then, the price of the home firms’ production sold in the
foreign country is set according to
!−ρ
p ∗ (h)
max Et−1 exp( Mt−1,t )(exp(−Et ) p∗t (h) − MCt ) t
∗ c∗H,t .
p∗t (h) PH,t
∗ ρ
PH,t = E [ MCt exp(Et )] .
ρ − 1 t −1
Compared to the case of Producer Currency Pricing, in this case the
exchange rate Et goes into the expectation operator, which is consis-
tent with the assumption that the price is fixed at the consumers’ (i.e.
which implies
it = rt + Et [πt+1 ] + irpt ,
the nominal interest rate it is equal to the real interest rate plus the
expected inflation plus the inflation risk premium. When the prices
are sticky, expected inflation and inflation risk premium do not re-
spond fully to an increase in the nominal interest rate. As a result,
the real interest rate rt responds as well.
Specifically in our setting, the Euler equation
1 = Et [exp( Mt+1 + it )]
Proposition 6.5. When the monetary authorities set the nominal interest
rates in home and foreign countries, the equilibrium consumption is
rt = r̄ + αit + (1 − α)it∗ ,
∞ ∞
" # " #
s
Et ∑ exp(mt,t+k )st+k = s t + c t · Et ∑ δk ctt++kk .
k =0 k =1
Qt ct
Pt ct = . (6.16)
st + ct A
∗ i
k st+k
h
Likewise, we define A∗ = Et ∑∞
def
k =1 δ c∗t+k , and express the for-
eign country’s nominal aggregate demand as
Q∗t c∗t
Pt∗ c∗t = . (6.17)
s∗t + c∗t A∗
Plug the expressions (6.16) and (6.17) for the aggregate demand
into Proposition 6.4, we obtain
c∗
ct
log ct = κtc−1 + α log + (1 − α) log ∗ t ∗ ∗ ,
st + ct A s +c A
t ∗t
∗ ct c
log c∗t = κtc−1 + (1 − α) log + α log ∗ t ∗ ∗ .
st + ct A st + ct A
(2α − 1)c̄
et = −κc + κc∗ + (st − s∗t ),
Ac̄ + (1 − α)2s̄
∞
" #
Qt
exp(et )
Pt
= exp(et )st + Et ∑ exp(m∗t,t+k ) exp(et+k )st+k .
k =1
∞
" #
Qt
exp(et )
Pt
− st = Et ∑ exp(m∗t,t+k ) exp(et+k )st+k .
k =1
For this thought experiment, let us assume that the home country
is very small so that its fiscal shock does not affect the foreign SDF
m∗t,t+k , and that the shock is transitory so that the future surpluses
st+k and the future real exchange rate et+k are unaffected. Then, to
equilibrate this equation in response to an increase in the current
surplus st , the current real exchange rate et has to appreciate. In this
sense, the real exchange rate behaves like the asset price for the claim
to government surpluses, which has to adjust when the fiscal cash
flows change. We can also use this logic to show that the U.S. real
exchange rate has to appreciate when the expected future surpluses
Et [st+k ] increase [Jiang, 2022].
Notably, this result is opposite to what we obtained under flexible
prices. Specifically, Proposition 6.3 shows that, with a positive cur-
vature parameter ν for labor, a higher government spending in the
home country gt , which corresponds to a lower government surplus
st , crowds out the local households’ private consumption, increases
their marginal utility, and appreciates the home currency in real
terms. Sticky prices overturn this result and generate home currency
depreciation through the valuation channel.
Along with
c∗t
ct
log ct = κtc−1
+ α log + (1 − α) log ∗ ,
st + ct A s + c∗ A∗
t ∗t
∗ ct c
log c∗t = κtc−1 + (1 − α) log + α log ∗ t ∗ ∗ ,
st + ct A st + ct A
6.E.3 Discussions
Understanding the
Quantities and Flows
Summary
• The architecture of the international monetary system has important implications for exchange
rate and capital flow dynamics, and reserve assets are at its cornerstone.
• We examine two complementary views of the architecture: the insurance provision view em-
phasizes the U.S.’ role as the world’s insurance provider, and the safe asset view emphasizes
the foreign demand for dollar safe assets. Both views emphasize the centrality of the U.S. in
the global financial markets, but they have different implications for the U.S. external imbal-
ances and the dollar exchange rate.
• We will also briefly survey topics related to the stability of the international monetary system.
Starting from this chapter, we will shift our focus from the ex-
change rates to international portfolio positions and capital flows.
For this purpose, studying the Euler equations alone is no longer suf-
ficient, and general equilibrium models become necessary to under-
stand the financial quantities. We will begin with the most important
asymmetry in the international financial system: the U.S. vs. the rest
of the world.
A prototype financial system involves households who save and
consume, firms or entrepreneurs who produce, and financiers who
intermediate the funds between the households and the firms. These
financiers can be the banks or many types of shadow banks. They
provide funds to the firms by investing in their risky projects, and
they provide saving vehicles to the households by taking safe de-
posits. In doing so, these financiers engage in the safety, liquidity,
and maturity transformations.
This summary of financial system also applies at the global level,
with the U.S. playing the central role as the financier who intermedi-
ates capital flows to the rest of the world and earns a premium from
the intermediation process. Other countries, playing the roles of the
households and the firms, invest at and get funding from the inter-
1
u(ct ) = ( c t )1− γ ,
1−γ
1 ∗
u∗ (c∗t ) = ( c ∗ )1− γ ,
1 − γ∗ t
∞
" #
E0 ∑ δt (πu(ct ) + (1 − π )u(c∗t )) .
t =0
The social planner tells the home and foreign households how much
to consume, subject to the resource constraints
yt = c H,t + c∗H,t ,
y∗t = c F,t + c∗F,t .
∞ ∞ ∞
" !#
1 1 ∗ 1− γ ∗
E0 ∑ δ π t
(ct ) 1− γ
+ (1 − π ) (c ) + ∑ ζ H,t (yt − c H,t − c H,t ) + ∑ ζ F,t (yt − c F,t − c F,t )
∗ ∗ ∗
,
t =1
1−γ 1 − γ∗ t t =1 t =1
allocations ct and c∗t for home and foreign households as we vary the
endowments. We can see that, as the endowments become higher
in both countries, the home country’s consumption increases faster
while the foreign country’s consumption increases slower. In other
words, the home country takes away a greater share of the endow-
ments in high-endowment states, while the foreign country takes
away a greater share of the endowments in low-endowment states.
In this way, the home country insures the foreign country in the bad
states in exchange for higher pay-off in good states.
2.5
1.5
0.5
0
0 0.5 1 1.5 2 2.5 3 3.5 4
average.
0.5
-0.5
0 0.5 1 1.5 2 2.5 3 3.5 4
Figure 7.3 plots the equilibrium real exchange rate et , which mea-
sures the strength of the home currency (i.e., the dollar). The dollar
is stronger in high-endowment states and weaker in low-endowment
states. This is because, due to home bias in consumption, when the
home households receive a greater share of the endowments, they
prefer to consume more home goods. Their demand bids up the
price of the home goods, and generates real dollar appreciation. Con-
versely, when the aggregate endowment is low, the foreign house-
holds receive a wealth transfer from the U.S. and consume more
domestic goods, leading to real dollar depreciation.
0.1
-0.1
-0.2
-0.3
-0.4
0 0.5 1 1.5 2 2.5 3 3.5 4
which implies
σ2
Z t
−κt −κt −κt −κt −κt −2κt
log yt = e log y0 + (1 − e ) log ȳ + σe e dWs ∼ N
κs
e log y0 + (1 − e ) log ȳ, (1 − e ) .
0 2κ
The social planner’s problem in continuous time is
Z T
t 1 1− γ 1 ∗ 1− γ ∗
E0 δ π (ct ) + (1 − π ) (c ) dt
0 1−γ 1 − γ∗ t
subject to c H,t + c∗H,t = yt and c F,t + c∗F,t = y∗t . As we discussed in
Proposition 1.3 in Section 1.C, we can solve for the consumption ct , c∗t
and the exchange rate et as functions of k t , which in turn depends on
yt = y∗t .
We define the home and foreign SDFs as
∗
exp(mt ) = δt π (ct )−γ , exp(m∗t ) = δt (1 − π )(c∗t )−γ ,
Similarly, we can evaluate the world equity claim in the home numéraire
as the present value of the endowment streams
p∗k
Z T
exp(mk ) ∗
s t = Et pk yk + y dk .
t exp( mt ) exp(ek ) k
1 ′′
dat = [ f ′ (log yt )κ (log ȳ − log yt ) + f (log yt )σ2 ]dt + f ′ (log yt )σdWt
2
def
= µ a,t dt + σa,t dWt .
(7.2)
with the wealth dynamics implied from consumption and the repli-
cating portfolio, i.e.,
0.52
0.5
0.48
0.46
0.44
0.42
0.4
1 1.5 2 2.5 3 3.5 4
Since the U.S. households have lower risk aversion, as their wealth
share declines in low endowment states, the wealth-weighted aver-
age investor’s risk aversion is also countercyclical. As a result, the
risk premium on the world equity is higher and the equity return
becomes more volatile in bad times. Figure 7.5 shows the overall neg-
ative relationship between the endowment level and the equity return
volatility.
Perhaps paradoxically, as the equity becomes riskier, it magnifies
the difference in risk aversion between the U.S. and foreign house-
holds, which leads to further divergence in their portfolio choices.
Figure 7.6 reports the equilibrium portfolio allocation towards the
world equity by the U.S. and foreign households, xt and xt∗ . The U.S.
households’ equity share is always above 1, and the foreign house-
holds’ is always below 1, which is consistent with our intuition that
the U.S. takes a more levered position on risky assets while the for-
eign households seek safety in the U.S. risk-free bond. When the
endowment level is lower, the U.S. households take a more levered
position on the world equity despite suffering greater wealth losses,
0.15
0.145
0.14
0.135
0.13
0.125
0.12
0.115
0.11
1 1.5 2 2.5 3 3.5 4
while the foreign households further reduce their equity shares. That
said, in the region of very low endowments, the U.S. wealth share can
become so small that they offer only tiny amount of risk-free bond
despite their highly levered positions. In this case, the foreign house-
holds’ equity share increases. In the limit, as the U.S. wealth shrinks
much faster than the foreign wealth, the foreign households have to
hold the entire world equity and no risk-free bond.
1.15
1.1
1.05
0.95
0.9
0.85
1 1.5 2 2.5 3 3.5 4
Section 3.A.
3.5
2.5
1.5
0.5
1 1.5 2 2.5 3 3.5 4
investors’ risk aversion goes up in bad times, which increases the risk
price and generates greater demand for safe assets.
An alternative approach is to introduce frictions in the model. In
Section 7.C, we will consider a reduced-form way of capturing these
frictions by introducing a countercyclical convenience yield for the
U.S. bond. In this case, non-pecuniary benefits of safe assets lead
to greater demand for the U.S. bond after the negative shocks have
already happened.
sloping demand for the U.S. goods, the U.S. goods have to become
cheaper and generate real dollar depreciation.
Other types of demand shocks could also reverse the dollar ex-
change rate’s cyclicality. Consider, for example, the bond convenience
yields we studied in Chapter 4, which capture demand shocks for
assets instead of goods. In particular, the foreigners’ demand for U.S.
safe assets increases in bad times, which leads to dollar appreciation.
Similarly, by introducing time-varying risk aversion as another type
of demand shock, Gourinchas and Rey [2022] also generates dollar
appreciation in bad times in a slight extension of the baseline model.
-1
-2
-3
1 1.5 2 2.5 3 3.5 4
are related by
at = s H,t + n f at , (7.3)
which states that the U.S. wealth is equal to the value of U.S.-issued
assets adjusted by the U.S. NFA. If the U.S. households hold a lot
of foreign assets, the NFA will be positive and the U.S. wealth will
be higher than the value of the domestic assets. Conversely, if the
foreign households hold a lot of U.S. assets, the NFA will be negative
and the U.S. wealth will be lower than the value of the domestic
assets. In our model, both the value of the domestic wealth s H,t and
the NFA n f at are procyclical, leading to a procyclical U.S. wealth
share that we saw in Figure 7.4. In Section 9.A, we will have a more
detailed discussion of the net foreign assets accounting.
One essential feature of this model that leads to the reserve cur-
rency paradox is the procyclical U.S. wealth share: as the foreign
households are relatively wealthier in bad times, they tend to con-
sume more and tilt their consumption towards the foreign goods,
which requires the U.S. goods to become cheaper. However, does the
U.S. wealth share have to be procyclical? In the next section, we will
examine a complementary view under which the U.S.-issued assets
are better hedged against bad states of the world. As a result, while
the U.S. NFA n f at still declines in bad times, the U.S. domestic assets
s H,t depreciate less and offset the procyclical nature of the U.S. NFA,
leading to a countercyclical U.S. wealth share.
pt yt + exp(rt−1 )b H,t−1 + exp(rt∗−1 − et )bF,t−1 = ct + b H,t + exp(−et )bF,t + (exp(rt−1 )b̄t−1 − b̄t ).
p∗t y∗t + exp(rt∗−1 )b∗F,t−1 + exp(rt−1 + et )b∗H,t−1 = c∗t + b∗F,t + exp(et )b∗H,t + (exp(rt∗−1 )b̄t∗−1 − b̄t∗ ).
∗ ∞
(yt , y∗t , θ H,t ) t =0 .
(ct , c H,t , c F,t , b H,t , bF,t , pt , c∗t , c∗H,t , c∗F,t , b∗H,t , b∗F,t , p∗t , rt , rt∗ , et )∞
t =0 .
pt yt + exp(rt−1 )b H,t−1 + exp(rt∗−1 − et )bF,t−1 = ct + b H,t + exp(−et )bF,t + (exp(rt−1 )b̄t−1 − b̄t ),
ct = pt c H,t + p∗t c F,t exp(−et ),
p∗t y∗t + exp(rt∗−1 )b∗F,t−1 + exp(rt−1 + et )b∗H,t−1 = c∗t + b∗F,t + exp(et )b∗H,t + (exp(rt∗−1 )b̄t∗−1 − b̄t∗ ),
c∗t = p∗t c∗F,t + pt c∗H,t exp(et ),
c H,t + c∗H,t = yt ,
c F,t + c∗F,t = y∗t ,
∗
pt α c F,t 1 − α c F,t
= = ,
p∗t exp(−et ) 1 − α c H,t α c∗H,t
ω H b−
"
c t +1 − γ
σ
#
H,t
1 = Et δ exp(rt ) + −γ ,
ct ct
ω F b−
"
c t +1 − γ
σ
#
∗ F,t
1 = Et δ exp(−∆et+1 + rt ) + −γ exp(et ) ,
ct ct
c∗t+1 −γ ω F∗ (b∗F,t )−σ
" #
∗
1 = Et δ exp ( r t ) + ,
c∗t (c∗t )−γ
c∗t+1 −γ ω ∗H (b∗H,t )−σ + (c̄∗ )−γ θ H,t
∗
" #
1 = Et δ exp(∆et+1 + rt ) + exp(−et ) .
c∗t (c∗t )−γ
−(λ∗H,t − λ H,t ) = Et [m∗t+1 − mt+1 ] + Et [∆et+1 ] = Et [γ(∆ log ct+1 − ∆ log c∗t+1 )] + Et [∆et+1 ].
def
Let ē = limt→∞ et denote the long-term exchange rate level, which is
well defined in a stationary economy. Iterating this equation forward,
we obtain the following result.
Proposition 7.2. The exchange rate level is equal to the expected consump-
tion growth differential and the expected convenience yield differential:
∞ ∞
et − ē = ∑ Et [γ(∆ log ct+ j − ∆ log c∗t+ j )] + ∑ Et [λ∗H,t+ j − λ H,t+ j ]
j =1 j =0
∗ ∞
ct ct
= −γ log
c̄
− log
c̄∗
+ ∑ Et [λ∗H,t+ j − λ H,t+ j ], (7.4)
j =0
ω ∗H (b∗H,t )−σ ω H b− σ
H,t
λ∗H,t − λ H,t ≈ exp(−et ) − ∗
− θ H,t exp(−et ). (7.5)
(c∗t )−γ −γ
ct
per se that affects the exchange rate, but the differential between the
foreign investors’ perspective and the home investors’ perspective.
To obtain a stronger dollar (i.e., a higher et ), we need the foreign
investors’ convenience yield on the dollar bond to exceed the home
investors’ convenience yield.
Eq. (7.5) further connects the convenience yield differential to the
quantities of the U.S. bond held by foreign and U.S. households,
b∗H,t and b H,t , and the exogenous demand shifter θ H,t
∗ . If the foreign
households hold more U.S. bonds, their marginal utility from holding
the U.S. bond is lower, which reduces their convenience yield λ∗H,t
and depreciates the dollar, unless the increase in the foreign house-
holds’ holding is accompanied by a change in the demand shifter
∗ .
θ H,t
In this way, the exchange rate dynamics are driven not only by
the marginal utility differential, but also by the convenience yield
differential. Heuristically, in complete markets, the exchange rate
movement is determined by the differential in marginal utilities over
consumption:
∆e = (∆uc − ∆u∗c ).
where ∆u∗b − ∆ub captures the marginal utility differentials over bond
holding for current and future periods as we specified in Eq. (7.5).
Take the foreign households as an example. Recessions lower their
wealth and reduce their holdings of the dollar safe assets. Given
their downward-sloping demand curve for dollar safe assets, they
impute a higher convenience yield and accept a lower expected re-
turn to hold the U.S. bond. To equilibrate their demand with the U.S.
households, the dollar has to appreciate to generate an expected de-
preciation, which leads to a lower expected return to hold the U.S.
bond from the foreign perspective. In this way, the demand for safe
assets connects wealth decline to local currency depreciation. If the
wealth declines more than consumption in recessions, this channel
c∗t+1 −γ
" #
∗
exp(−λ H,t ) = Et δ exp(∆et+1 + rt ) .
c∗t
1 1
2
1.5
0.5 0.5 1
0.5
0 0 0
10 20 30 40 10 20 30 40 10 20 30 40
0
2 2
1.5 1.5 -0.2
1 1
-0.4
0.5 0.5
-0.6
0 0
10 20 30 40 10 20 30 40 10 20 30 40
0 6
6
-0.5
4
4
-1
2 2
-1.5
-2 0 0
10 20 30 40 10 20 30 40 10 20 30 40
This wealth effect is the key distinguishing feature of the safe asset
view, as it disentangles the response in the U.S. wealth at from the re-
sponse in the U.S. net foreign assets n f at . On the one hand, the U.S.
buys some foreign bonds from the foreign country and sells some
dollar bonds to the foreign country. In a flight-to-dollar episode, the
dollar appreciates and the U.S. suffers a loss on its external portfo-
lio, leading to a decline in the U.S. NFA. On the other hand, the U.S.
receives a higher seigniorage revenue from issuing the dollar bonds,
which increases the U.S. wealth despite its loss on the external port-
folio.
In this model, a higher U.S. wealth relative to the foreign wealth
leads to a higher U.S. consumption relative to the foreign consump-
tion. In the special case with log preferences, the U.S. households’
consumption is proportional to their wealth:
c t = (1 − δ ) a t .
Due to the home bias in the U.S. households’ consumption, the U.S.
spends more on the U.S. goods, which appreciates the dollar in real
terms. In this way, the goods market clearing also implies a stronger
dollar that is consistent with the convenience yield channel in the
asset market. As such, a decline in the U.S. NFA does not necessar-
ily imply a relative wealth gain for the foreign households and a
stronger demand for the foreign goods. The reserve currency para-
dox is resolved by engineering a countercyclical U.S. wealth share
from the seigniorage revenue.
This discussion highlights the countercyclicality of the U.S. wealth
share as the key to understand the cyclical properties of international
transfers. Jiang, Krishnamurthy, and Lustig [2020a] presents a simple
calculation around 2008, and shows that the decline in the market
value of the U.S. equities, bonds, and deposits was indeed less than
the decline in the market value of the equities, bonds, and deposits
in major developed foreign countries after converting to the dollar
units. This relative gain in market value was also higher than the loss
in the U.S. NFA. Dahlquist, Heyerdahl-Larsen, Pavlova, and Pénasse
[2022], Kim [2022] present additional empirical evidence in favor of
a countercyclical U.S. wealth share, whereas Sauzet [2022] presents a
calculation that suggests the opposite. It is possible that the wealth
supply dollar assets will grow. In particular, if the growth in the world de-
mand for dollar safe assets exceeds the growth in U.S. supply, the result
will be growth in currency-mismatched balance sheets around the world.
The conclusion is that financial spillovers and the global financial cycle may
grow in importance.”
This currency mismatch can be incurred by foreign governments,
firms, and banks as they take advantage of the convenience yield
on dollar funding. A large literature studies this issue from both
theoretical and empirical angles. See Caballero and Krishnamurthy
[2003], Schneider and Tornell [2004], Bocola and Lorenzoni [2020],
Du, Pflueger, and Schreger [2020], Du and Schreger [2022b], Salomao
and Varela [2022], Gutierrez, Ivashina, and Salomao [2023].
yield and the risk premium terms, we can expect the variations in the
exchange rate to absorb the shocks to the autonomous interest rate
policies. In comparison, the impossible duo view emphasizes the risk
premium terms rpt+ j and the convenience yield terms λ∗t+ j , which af-
fect not only exchange rates but also domestic financial and economic
conditions. Even with floating exchange rates, monetary policy re-
sponses are needed to address the influences of these external factors.
As a result, monetary policies are not truly independent if the capital
accounts are open and allow the risk premium and convenience yield
shocks to spill over.
Summary
• If we look backward, the market value of government debt is determined by past debt, pri-
mary surpluses, and debt returns:
t −1 S
Dt t− j D0
=−∑ exp( RtD− j→t − Xt− j→t ) + exp( R0D→t − X0→t ).
Yt Y
j =0 t − j
Y0
• If we look forward, the market value of government debt is backed by the present value of
future primary surpluses:
∞
" #
Dt = E t ∑ exp( Mt,t+k )St+k + lim Et [exp( Mt,t+k ) Dt+k ] .
k→∞
k =1
• This forward-looking valuation equation also implies a trade-off in the risk dimension. The
government has to choose between insuring the debtholders and insuring the taxpayers.
• In the cases of the U.S. and past reserve asset issuers, the market value of government debt
may exceed its fiscal backing, leading to a public debt valuation puzzle.
Qt = St + Qt+1 exp(−it ),
Year Balance Payment Interest payment Principal payment Table 8.1: Personal Loan Example
1 $100,000 $12,950 $5,000 $7,950
2 $92,050 $12,950 $4,602 $8,348
3 $83,702 $12,950 $4,185 $8,765
4 $74,936 $12,950 $3,747 $9,204
5 $65,733 $12,950 $3,287 $9,664
6 $56,069 $12,950 $2,803 $10,147
7 $45,922 $12,950 $2,296 $10,654
8 $35,267 $12,950 $1,763 $11,187
9 $24,080 $12,950 $1,204 $11,746
10 $12,334 $12,950 $617 $12,334
Proposition 8.1. (a) The market value of government debt can be expressed
as
t −1
Dt = − ∑ St− j exp( RtD− j→t ) + D0 exp( R0D→t ), (8.2)
j =0
where RtD− j→t = RtD− j+1 + . . . RtD−1 + RtD is the cumulative debt return.
(b) We can restate this expression to obtain a backward-looking expres-
sion for the debt/GDP ratio:
t −1 S
Dt t− j D0
=−∑ exp( RtD− j→t − Xt− j→t ) + exp( R0D→t − X0→t ), (8.3)
Yt Y
j =0 t − j
Y0
where exp( Xt− j→t ) = Yt /Yt− j is the cumulative nominal GDP growth.
this ratio instead of the debt level. Proposition 8.1(b) shows that the
current debt/GDP ratio depends on past primary surpluses and
deficits: a greater borrowing in the past to finance deficits −St− j
leads to a higher debt/GDP ratio today. Moreover, the borrowing is
compounded by the growth-adjusted return RtD− j→t − Xt− j→t , so that
high real debt returns, high inflation rates, and low output growth all
magnify the past borrowing and contribute to a high debt/GDP ratio
today.
In this way, the backward-looking approach provides a useful way
to attribute the increase in the government debt/GDP ratio from time
0 to t to four components: government deficits, real debt returns,
inflation, and output growth. Hall and Sargent [2011, 2022], Ander-
son and Leeper [2023] conduct such decomposition exercises in the
history of the U.S. For example, the debt/GDP ratio increased from
86% to 101% in the Covid period from 2019 to 2023. Government
deficits were the main contributor to the increase, while high infla-
tion, positive real GDP growth, and negative debt returns all helped
to partially offset the deficits.
We can also examine this result in a steady state in which debt
returns, growth rates, and surplus/GDP ratios are all constants.
Then, the steady-state debt/GDP ratio can be expressed as
D S 1 S 1
=− D
≈ D
,
Y Y 1 − exp( R − X ) YR −X
S D
= ( R D − X ).
Y Y
In real terms, we can express this relation as
s d
= (r D − x ), (8.4)
y y
more debt to pay the interests due, then, the government debt quan-
tity rises at the interest rate r D , while the GDP grows at a higher rate
of x. If we wait long enough, the government debt/GDP ratio will
decline to zero, making it easy for the government to eventually raise
a one-time tax as a small fraction of the GDP and pay down the debt.
It is also straightforward to consider bond convenience yields in
this calculation. In the absence of convenience yields, the debt return
is equal to the risk-free rate, r D = r. In the presence of convenience
yields, the government is able to finance its debt at a rate lower than
the risk-free rate ρ:
r D = ρ − λ.
s d
= ( ρ − λ − x ),
y y
H
Dt = ∑ Qt (h) Pt (h),
h =1
H
0 = St + ∑ Qt (h) Pt (h).
h =1
This time, we iterate forward these equations, and derive the fol-
lowing intertemporal government budget condition. We use Mt,t+k to
denote the multi-horizon nominal SDF from period t to period t + k.
Proposition 8.2. Even when the government debt is defaultable, the market
value of government debt is equal to the present value of future government
surpluses plus a transversality term:
∞
" #
Dt = E t ∑ exp( Mt,t+k )St+k + lim Et [exp( Mt,t+k ) Dt+k ] .
k→∞
(8.6)
k =1
Using the one-period budget constraint (8.1) again, we can also express
The proof is given in Appendix A.31. Eq. (8.6) and (8.7) are equiv-
alent ways to express the same intertemporal government budget
condition. Eq. (8.6) equates the end-of-period market value of the
government debt after issuances and repayments, Dt , to the ex-
dividend present value of future surpluses and the transversality term,
whereas Eq. (8.7) equates the beginning-of-period market value of the
government debt, Dt−1 exp( RtD ), to the cum-dividend present value
of future surpluses and the transversality term. Our results can be
expressed in either convention we use. To avoid confusion, we stick
to the first expression in this section.
If the transversality condition holds, i.e.,
where Dt on the left-hand side is the market value of debt at the end
of period t, and PtT and PtG on the right-hand side denote the present
value of current and future tax revenues and government spending:
∞
" #
def
PtT = Et ∑ exp( Mt,t+k )Tt+k ,
k =1
∞
" #
def
PtG = Et ∑ exp( Mt,t+k )Gt+k .
k =1
If this equality does not hold, these two investors should trade with
each other using the government debt portfolio.
Second, this result implies that, in order to back up Dt dollars’
worth of debt, the government needs to generate a positive present
value from its future primary surpluses. Conversely, when the
present value of primary surpluses increases in period t, the gov-
ernment debt appreciates in value and generates a higher return RtD .
That is, Eq. (8.9) implies
h i h i
Dt−1 (Et − Et−1 ) exp( RtD ) = (Et − Et−1 ) ( PtT + Tt ) − ( PtG + Gt ) ,
where exp( RtD ) denotes the holding return of the aggregate govern-
ment bond portfolio:
which implies that the fiscal backing can be created either by running
a higher government surpluses on average (i.e., higher Et [St+k ]), or
by making government surpluses more countercyclical (i.e., higher
covt (exp( Mt,t+k ), St+k )). The latter channel creates fiscal backing by
conditionally generating higher cash flows in high marginal utility
∞
" #
d t = Et ∑ exp(mt,t+k )st+k + lim Et [exp(mt,t+k )dt+k ] ,
k→∞
k =1
where dt and st are the real market value of government debt and the
real primary surplus, and mt,t+k is the real SDF which is related to
the nominal SDF via Eq. (6.1).
Pt+1 (h − 1)
exp(−λt (h)) = Et exp( Mt+1 ) .
Pt (h)
When the convenience yields are identical for bonds of all maturi-
ties, i.e., λt (h) = λt , then, we can simplify this seigniorage revenue
as the product between the market value of debt and the convenience
yield level:
Kt = Dt (1 − exp(−λt )) ≈ Dt λt ,
∞ ∞
" # " #
Dt = E t ∑ exp( Mt,t+k )St+k + Et ∑ exp( Mt,t+k )Kt+k + lim Et [exp( Mt,t+k ) Dt+k ] .
k→∞
k =1 k =0
(8.11)
∑∞
h=1 Pt+1 ( h − 1) Qt ( h )
exp( RtD+1 ) = .
∑∞h=1 Pt ( h ) Qt ( h )
returns as
PtT+1 + Tt+1
exp( RtT+1 ) = ,
PtT
PtG+1 + Gt+1
exp( RtG+1 ) = .
PtG
Dt + PtG h i h i PG h i
Et exp( RtT+1 ) = Et exp( RtD+1 ) + t Et exp( RtG+1 ) .
Dt Dt
(b) The betas of the aggregate government debt portfolio, the claim to tax
revenues, and the claim to government spending satisfy
Dt + PtG T PtG G
βt = βD
t + β .
Dt Dt t
The proof is presented in Appendix A.33. This proposition resem-
bles the Modigliani-Miller Theorem in corporate finance, which states
that, within a firm, its equity is a levered claim on the underlying
asset. As a result, the return beta and the risk premium on the firm
asset are equal to the weighted average of those on the firm’s equity
and debt claims:
D+E A E E
βt = βD
t + β ,
D D t
where D is the market value of debt and E is the ex-dividend market
value of equity. Similarly, the government holds the claim to tax
revenues and splits up these cash flows to debtholders and recipients
of government spending. As a result, the return beta and the risk
premium on the tax claim are equal to the weighted average of those
on the government debt and the spending claim.
This result implies a very tight constraint on how the government
insures taxpayers and debtholders. To simplify the argument, we
assume that the taxpayers both pay tax and receive the government
spending as a transfer.
If the government decides to make its debt risk-free, which implies
a zero debt beta β D
t = 0, then, Proposition 8.4 implies
PtG
β Tt = βG
t .
Dt + PtG
PtT PK PG Kt
+ t − t = 1− < 1.
Dt Dt Dt Dt
This is a feature, not a bug. Suppose the tax, spending, and seignior-
age claims all have the same discount rate. Then, as long as the
weights sum up to 1, a linear combination of these claims also has
the same discount rate. The presence of the bond convenience yield
introduces a “missing weight,” which allows the debt to have a lower
discount rate RtD+1 . More precisely, we can write Eq. (8.12) as
!
D Kt PtT T PtK K PtG
exp( Rt+1 ) 1 + T = exp ( R t + 1 ) + exp ( R t + 1 ) − exp( RtG+1 ).
Pt + PtK − PtG PtT + PtK − PtG PtT + PtK − PtG PtT + PtK − PtG
defaultable, their yields are not necessarily the same as the nomi-
nal risk-free rates, which we denote by it (h). Let us use itTreas (h) to
denote the yield of the government debt with maturity h, which satis-
fies
def 1
itTreas (h) = − log Pt (h).
h
This government debt yield can be decomposed into a risk-free
component, a default spread component, and a convenience yield
component:
def 1
ρt (h) = − log Et [exp( Mt,t+h )],
h
the default spread component captures the risk-neutral expectation of
sovereign default,
" # !
h
def 1
δt (h) = − log Et exp( Mt,t+h ) ∏ (1 − χt+ j ) − log Et [exp( Mt,t+h )] ,
i
h j =1
1
ρt = ρ∗t − Et [∆Et+1 ] − vart (∆Et+1 ) − covt ( Mt∗+1 , ∆Et+1 ),
2
which implies that the nominal home risk-free rate ρt relative to a
foreign benchmark ρ∗t is decreasing in the home currency’s expected
nominal appreciation Et [∆Et+1 ], and increasing in the home cur-
rency’s risk premium RPt = −covt ( Mt∗+1 , ∆Et+1 ) − 12 vart (∆Et+1 ).2 As 2
The longer-term nominal rates have
such, government bonds compensate investors for not only credit risk similar expressions that contain the
marginal utility growth and the ex-
but also exchange rate risk. change rate movement from period t to
How do we connect this determination of individual bond prices t + h.
∞
" #
E0 ∑ δt u(ct ) .
t =0
∞
" #
ψ(wt , t) = max E0 ∑ δs u(cs ) ,
s=t
Proposition 8.6. Assume that the value function satisfies limt→∞ ψ(wt , t) =
0. Then, the optimal solution {ct }∞ ∞
t=0 , { dt }t=0 to the households’ problem
implies the Euler equation (8.13) and the transversality condition
lim E0 δt u′ (ct ) at = 0.
t→∞
This implies that the households’ holdings in other assets are nega-
tive in the infinite horizon, i.e.,
∞
" #
E0 ∑ δt (u(ct ) + vt (dt )) .
t =0
v′t (dt )
exp(−λt ) = 1 − .
u′ (ct )
Proposition 8.7. In the presence of the bond convenience yield, the optimal
solution {ct }∞ ∞
t=0 , { dt }t=0 to the households’ problem implies the Euler
equation (8.13) and the transversality condition
where ε t+1 denotes the innovation to GDP growth that is i.i.d. nor-
mally distributed with mean zero and standard deviation one.
The log nominal SDF is driven by the same GDP shock
1
Mt,t+1 = −i − γ2 − γε t+1 ,
2
which implies that a lower GDP growth leads to a higher marginal
utility. By the Euler equation for risk-free debt, the one-period log
risk-free rate is a constant: R f = i.
Given these specifications, the present value of a GDP strip k peri-
ods from now is
1
Et [exp( Mt,t+k )Yt+k ] = exp k( X − i + σ2 − γσ) Yt ,
2
1
i + γσ > X + σ2 .
2
We note that the discount rate is the benchmark risk-free rate i im-
plied from the SDF plus the risk premium term γσ. The growth rate
is the log growth rate X plus the Jensen’s term 21 σ2 .
We assume that the government commits to issuing one-period
risk-free debt. The government also commits to a simple fiscal policy,
with a constant spending/GDP ratio γ̄ = Gt /Yt and a constant
debt/GDP ratio δ̄ = Dt /Yt .
Then, the government budget constraint (8.1) implies a counter-
cyclical process for the surplus/GDP ratio:
St Tt − Gt Dt D Y
= =− + exp( R f ) t−1 t−1 = −δ̄ (1 − exp (−( X − i + σε t ))) .
Yt Yt Yt Yt−1 Yt
This result implies that ∂(St /Yt )/∂ε t < 0. Therefore, to insure the
debtholders by keeping its debt risk-free, the government must gen-
erate counter-cyclical primary surpluses. In particular, when the GDP
declines (ε t < 0) in recessions, the tax revenue needs to increase as
a fraction of GDP or the government spending needs to decrease.
The magnitude of the required counter-cyclical response in primary
surpluses is increasing in the debt/GDP ratio δ̄.
∞
" #
(Et +1 − Et ) ∑ exp( Mt+1,t+ j )St+ j = (Et+1 − Et ) [ Dt+1 + St+1 ] = 0.
j =1
1
i + γσ > X + σ2 .
2
In this case, discounting growing surpluses and future debt at the
risk-free rate i < X would not produce the right answer for the
valuation of the current debt.
However, this is not a free lunch: the beta constraint characterized
by Proposition 8.4 still binds: in order to keep debt risk-free, the
government needs to raise taxes or lower spending in high marginal
utility states.
In other words, the average deficits reflect an insurance premium
that the government earns by extracting countercyclical cash flows
from the taxpayers. This risk-based view is very different from the
∞
" #
Et ∑ exp( Mt,t+k )(Tt+k − Gt+k ) ,
k =1
we need to model the joint dynamics of the SDF Mt,t+k and the fiscal
cash flows Tt+k and Gt+k . Jiang, Lustig, Van Nieuwerburgh, and
Xiaolan [2019] adopts the affine term structure framework, which
Λ t = Λ0 + Λ1 z t ,
where Λ0 collects the average prices of risk and Λ1 governs the time
variation in risk premia. Asset pricing in this model amounts to
taking a stance on the market prices of risk in Λ0 and Λ1 .
If the state vector zt contains the short rate, inflation, GDP growth,
the stock dividend/GDP ratio, and the stock price/dividend ratio,
this SDF generates affine solutions for the value of the aggregate
stock market and the term structure of nominal and real interest
rates. We also include fiscal variables in the state vector in order to
price the fiscal cash flows.
def
Let it (h) = (1/h) log Pt (h) denote the nominal risk-free rate
with maturity h. For notational convenience, let ei denote the vector
(0, . . . , 1, . . . , 0)′ that selects the variable corresponding to the short
rate, and let i0 (1) denote the mean value of the short rate. Then,
Similarly, let e∆d , ex , and eπ denote vectors that select the variables
corresponding to the growth in the dividend/GDP ratio, the real
GDP growth, and the inflation, respectively.
Let Divt denote the stock dividend strip. The value of the divi-
dend strip of horizon h is defined as
pdm m
t ( h ) = log( Pt ( h ) /Divt ).
g
Similarly, let Ptτ (h) and Pt (h) denote the price of the tax and
g
spending strips of horizon h, and let pdτt (h) and pdt (h) denote the
price-dividend ratios of the tax and spending strips.
Proposition 8.8. (a) The nominal yield curve is affine in the state vector:
A(h) B( h)′
it ( h) = − − zt ,
h h
where the coefficients A(h) and B(h) satisfy the following recursions:
1 1
A(h + 1) = −i0 (1) + A(h) + B(h)′ ΣB(h) − B(h)′ Σ 2 Λ0 ,
2
1
B(h + 1)′ = −ei′ + B(h)′ (Ψ − Σ 2 Λ1 ),
′
pdm m m
t ( h) = A ( h) + B ( h) zt ,
where
1
Am (h + 1) = −i0 (1) + Am (h) + x0 + π0 + ( Bm (h) + e∆d + ex + eπ )′ Σ( Bm (h) + e∆d + ex + eπ )
2
1
− ( Bm (h) + e∆d + ex + eπ )′ Σ 2 Λ0 ,
1
Bm (h + 1)′ = −ei′ + ( Bm (h) + e∆d + ex + eπ )′ (Ψ − Σ 2 Λ1 ).
(c) The tax and spending strips’ log price-dividend ratios are also affine in
the state vector:
where
1
Aτ (h + 1) = −i0 (1) + Aτ (h) + x0 + π0 + ( Bτ (h) + e∆τ + ex + eπ )′ Σ( Bτ (h) + e∆τ + ex + eπ )
2
1
− ( Bτ (h) + e∆τ + ex + eπ )′ Σ 2 Λ0 ,
1
Bτ (h + 1)′ = −ei′ + ( Bτ (h) + e∆τ + ex + eπ )′ (Ψ − Σ 2 Λ1 ),
1
A g (h + 1) = −i0 (1) + A g (h) + x0 + π0 + ( B g (h) + e∆g + ex + eπ )′ Σ( B g (h) + e∆g + ex + eπ )
2
1
− ( B g (h) + e∆g + ex + eπ )′ Σ 2 Λ0 ,
1
B g (h + 1)′ = −ei′ + ( B g (h) + e∆g + ex + eπ )′ (Ψ − Σ 2 Λ1 ).
which implies
1
−1
B(∞)′ = −ei′ I − (Ψ − Σ 2 Λ1 ) .
1
It is useful to interpret (Ψ − Σ 2 Λ1 ) as the risk-neutral transition ma-
trix, and then the risk loadings of the infinite-horizon bond are given
by the loadings of the one-period bond, ei , multiplied with the Leon-
1
tief inverse of (Ψ − Σ 2 Λ1 ). The same Leontief inverse operator also
appeared in the trade network model in Section 3.C, which captures
the effects of higher-order connections in the trade network.
Following this derivation, we can now express the risk premium
of the long-term bond and, in a similar way, the risk premia of stock
dividend and fiscal cash flows in the infinite horizon.
Proposition 8.9. (a) In the infinite horizon, the long-run nominal yield and
(b) In the infinite horizon, the expected returns on the dividend, tax, and
spending strips are
1 1
lim Et [log Ptm+1 (h − 1) − log Ptm (h)] = i0 (1) − B′perm ΣB perm + B′perm Σ 2 Λ0 + B′perm (Ψ − I )zt + (e∆d + ex + eπ )′ zt ,
h→∞ 2
where
def
1
−1
B′perm = ( Bm (∞) + e∆d + ex + eπ )′ = (−ei + e∆d + ex + eπ )′ I − (Ψ − Σ 2 Λ1 ) .
′ 1 1 ′
lim E[log Pt+1 (h − 1) − log Pt (h)] − i0 (1) = Btrans Σ 2 Λ0 − Btrans ΣBtrans ,
h→∞ 2
(8.16)
1
′
which is also the product of the risk loadings Btrans Σ 2 and the mar-
ket prices of risk Λ0 , plus a second-order Jensen’s term. Note that Eq.
(8.16) is a special case of Eq. (8.15), obtained when the cash flow does
not grow, i.e., by setting e∆d + ex + eπ = 0.
Figure 8.1 plots the term structures of risk premia for the spend-
ing, tax, equity dividend, and GDP claims, and the long-term nom-
inal bonds. The parameterization is taken from the homoscedastic
-2
-4
0 20 40 60 80 100
but it allows us to deal with the possibility that the primary surplus
st can be negative.
By iterating this equation forward T times and taking expectations,
we obtain
" #
T
v t = Et ∑ κ j −1 s t + j − e
r t + j + Et [ κ T v t + T ],
j =1
which states that the debt/GDP ratio varies only because it predicts
future surpluses, future returns, or the future debt/GDP ratio.
If we assume that the transversality condition holds, which in this
context means limT →∞ Et [κ T vt+T ] = 0, then, we obtain
∞
" #
v t = Et ∑ κ j −1 s t + j − e
rt+ j .
j =1
s t = εE
t ,
v t = v t −1 − ε E
t .
st = bvt−1 + εFt ,
T T
v t = Ft ∑ r t + j + Ft [ v t + T ] = Ft ∑ bvt+ j−1 + Ft [vt+T ]
st+ j − e (8.17)
j =1 j =1
We conjecture and verify that the law of motion for the debt/output
ratio under the subjective measure F is
ψ = 1−b
ηtF+1 = −εFt .
In other words, due to their biased belief, the investors think that the
debt/output process is stationary with a persistence that is consistent
with their expected response of surplus to the debt/output ratio.
Therefore, if we take Eq. (8.17) to the limit,
T
vt = lim Ft
T →∞
∑ bvt+ j−1 + Tlim
→∞
Ft [ v t + T ],
j =1
T
v t = Et v t + T = 0 + (Ft − Et ) ∑ ( s t + j − e
r t + j ),
j =1
Summary
• International portfolio choices underlie each country’s external imbalance and capital flow
dynamics. A country’s net foreign assets are directly a consequence of portfolio choices:
def
n f at = at ∑ x F,t (ι) − a∗t ∑ x ∗H,t (ι),
ι ι
and the profits from the chosen portfolios determine how a country’s external wealth evolves:
• The standard asset pricing literature models portfolio choices as the mean-variance trade-off
plus additional dynamic hedging terms:
x t = γ −1 Σ − 1
t Et [rt+1 ] + dynamic hedging terms.
• Alternatively, the demand system approach models portfolio choices based on asset prices,
characteristics, and latent demand terms:
which countries play a central role, and the benefits and costs of
being at the central location.
In this chapter, we relate the households’ equilibrium portfolio
choices on the finance side to important quantities on the macro side
such as net foreign assets and balance of payments. Then, we discuss
two complementary approaches to modeling portfolio choices. One
is the traditional mean-variance approach, which is also central in the
standard asset pricing literature. The other is the demand system ap-
proach, which results from more recent developments. We will also
discuss specifications and applications of these approaches adapted
for studying international portfolio dynamics.
Let pt (ι) and p∗t (ι) denote the asset prices, and let divt (ι) and
div∗t (ι) denote the dividend or interest payouts, all in home currency
terms. We use rt (ι) to denote the cum-dividend returns in log:
Finally, let qt (ι) denote the notional quantity of the asset, and let
mt (ι) denote the total market value (i.e., market capitalization). The
market clearing condition for each asset can be expressed as
def
mt (ι) = pt (ι)qt (ι) = at x H,t (ι) + a∗t x ∗H,t (ι). (9.3)
We can relate the market value of all home assets to the home house-
holds’ wealth by the NFA:
at = ∑ mt (ι) + n f at ,
ι
where tb denotes the trade balance, ib the income balance, and cg the
capital gains.
The trade balance is defined in the standard way as the value of
exports minus the value of imports, expressed in the home currency.
The income balance is defined as the earnings on foreign invest-
ments minus the payments made to foreign investors:
The capital gains are defined as the changes in the value of assets
that the home households hold abroad minus changes in the value of
domestic assets held by foreign investors:
The sum of income balance and capital gains (i.e., ibt + cgt ) cap-
tures the overall wealth transfer between home and foreign house-
holds due to their holdings of each other’s assets. The balance of
payments identity (9.4) states that the home country accumulates a
higher NFA position when (i) it runs a trade surplus, which results
in claims that foreigners owe to the home country, and (ii) it earns a
higher financial return than it pays to foreigners, which includes both
capital gains and dividend payouts. Conversely, if the NFA remains
constant, any trade deficit (i.e., negative tb) has to be offset by a posi-
tive financial return (i.e., positive ib + cg), as we have seen in the case
of the U.S. in Chapter 7.
In international macroeconomics, it is also common to bundle
together the trade balance and the income balance into the current
def
account: cat = tbt + ibt , which describes the net earnings of the home
country from its international trade and investments. As capital gains
cgt capture important variations in cross-border wealth transfers
from financial markets in the data, we find it more convenient to
conceptually bundle the income balance and the capital gains as total
financial transfers.
Finally, like the government budget condition in Chapter 8, we
can also think of the NFA dynamics (9.4) as a backward-looking
Therefore, any deficit in the NFA relative to its long-run mean has to
be paid off by either future trade surpluses tbt+s or future financial
incomes ibt+s + cgt+s . We will see an example of this decomposition
in the context of the segmented market and safe asset models in
Section 10.B. Moreover, since the world economy grows over time,
we can also normalize the NFA by the GDP or its low-frequency
component. See Gourinchas and Rey [2007a] for details.
Similarly, the market value of net position changes in the home assets
held by the foreign households is
pt (ι)
∑ t H,t
a ∗ ∗
x ( ι ) − a ∗
x ∗
t−1 H,t−1 ( ι )
p t −1 ( ι )
.
ι
The first term on the right-hand side captures new asset issuances,
which channel funds from the households to the corporate sector.
The second term captures dividend payouts, which are transfers in
the opposite direction from the corporate sector to the households.
The households’ financial savings are equal to the transfers from
households to the corporate sector in the form of asset issuances
minus the transfers from the corporate sector to the households in
the form of dividend payouts.
tbt = st ,
which states that the trade balance is equal to the home households’
net financial savings. In this case, absent transfers to and from the
financial markets, savings can only result from exporting more goods
to foreigners and earning their IOUs.
Next, consider the case in which the trade balance is tbt = 0, and
the dividend payouts are dt (ι) = 0. Then, we obtain
!
st = ∑ mt (ι) − ∑ mt−1 (ι)(1 + ρt (ι)) ,
ι ι
which implies that the asset issuances in the home country are en-
tirely financed by the home households’ financial savings. If the
foreign households participate in the financing of the new issuances,
we should expect to see capital flows which, by the previous sec-
tion, should result in non-zero trade balance. Thus, the trade balance
becomes the sufficient statistics for the foreign households’ participa-
tion in the home asset issuances and financial savings process.
Finally, we can understand the general case by again understand-
ing that (∑ι mt (ι) − ∑ι mt−1 (ι)(1 + ρt (ι))) − ∑ι mt−1 (ι)dt (ι) captures
the net transfers between (home and foreign) investors and the home
corporate sector. When the home investors contribute funds to the
The left-hand side is the national income for the home households,
which consists of financial asset payoff and labor income. The right-
hand side contains two types of expenditure: consumption and finan-
cial investments.
Using Eq. (9.2), we obtain
ct = wtℓ − st ,
or
!
ct = wtℓ + at−1 ∑ x H,t−1 (ι)dt (ι) + ∑ xF,t−1 (ι)d∗t (ι) − ft.
ι ι
Let zt denote the vector of state variables that determine the distri-
bution of returns. We define the value function as
1− γ
" #
T
def s cs
ψt (wt , zt ) = max Et ∑ δ ,
{cs ,xs }sT=t s=t 1−γ
Proposition 9.3. When the asset returns are i.i.d. and normally distributed,
the optimal portfolio choice is
1 2 f
x t = γ −1 Σ − 1
E [ r t +1 ] + σ − r . (9.10)
t
2 t t
∞
!
1 1 2 1
r t +1 , − (Et +1 − Et ) ∑
f f
xt = Σ− 1
E t r t +1 − r + σ + 1 − Σ− 1
t covt ρ j rt+ j .
γ t t 2 t γ j =1
(9.11)
This solution contains two terms. The first term is similar to the
solution under i.i.d. returns derived above, which depends on the
assets’ excess returns and variance-covariance matrix. The second
term is commonly known as the hedging term, as the investors buy
certain assets to hedge shocks to future investment opportunities.
In this case, since risk premia are constant, variations in future in-
vestment opportunities are only driven by variations in the risk-free
rate. Specifically, a surprise reduction in expected future risk-free
rates represents a negative shock to future investment opportunities.
When the risk aversion coefficient γ > 1, investors would like to load
up more on assets that offer higher returns when expected future
risk-free rates go down.
Second, if the investors have log utility, then, Eq. (9.9) can be sim-
plified to
h
p
i
f 1 1
xt = arg max Et rt+1 ≈ arg max xt′ (Et [rt+1 ] − rt + σt2 ) − xt′ Σt xt ,
xt xt 2 2
which does not depend on the state variable. In this sense, investors
with log utility are myopic, as they choose portfolios like one-period
investors. They do not pay attention to how different assets’ re-
turns respond to shocks to future investment opportunities. These
investors also have a predetermined wealth-consumption ratio, which
is a constant when the horizon T is infinite.
Moreover, with log utility
(i.e., γ = 1), Eq. (9.11) in the first special
f
case becomes xt = Σ− t
1
Et [rt+1 ] − rt + 12 σt2 . The optimal portfolio
allocation no longer loads on the term that hedges the variations
in future interest rates, and the myopic investors only focus on the
mean-variance trade-off.
Eq. (9.10) offers an intuitive way to think about the portfolio alloca-
tion problem as a mean-variance trade-off. While this approach has
been powerful and universal, several challenges stand out. First, es-
timating the expectation and the variance-covariance matrix of asset
returns is empirically challenging. As the assets’ risk characteristics
and investors’ risk appetite are time-varying, the moments estimated
from the past return data do not necessarily represent the distribu-
tion of future returns.
Second, investor demand for different assets and asset classes
appears to be less elastic than what standard models imply. For ex-
ample, according to Koijen and Yogo [2021], “a calibration of the
capital asset pricing model (CAPM) implies a demand elasticity for
individual stocks that exceeds 5,000,” whereas empirical estimates
of the demand elasticity are closer to 1 or lower [Gabaix and Koijen,
2021], which is three orders of magnitude smaller.
In response to these challenges, Koijen and Yogo [2019] turn to
a complementary approach based on demand systems. Instead of
deriving the optimal portfolio choice from the mean-variance trade-
off, their starting point is that the households allocate their wealth
based on the assets’ characteristics. In this section, we study the most
basic demand system model based on logit demand functions.
where log pt (ι) denotes the log asset price, Ξt (ι) denotes the vector
of additional observable asset characteristics, and κt (ι) denotes an
unobservable latent demand term. We expect α < 0, in which case
the households find an asset more attractive when its price is lower.
The asset also becomes more attractive when it has some desirable
attributes captured by Ξt (ι), which for example could include a lower
price volatility, a higher credit rating, or simply a larger asset supply.
With heterogeneous investors indexed by n, the characteristics could
also be bilateral, i.e., Ξn,t (ι), which could for example capture the
investors’ home bias towards domestic assets, or their preferences for
foreign assets according to proximity in geographic distance or trade
network as we saw in Section 3.C. Finally, the latent demand term
allows us to capture the unobservable characteristics of the asset that
simultaneously generate a higher market share and a higher asset
price by making the asset more attractive [Berry, 1994].
We assume that the households’ allocation on this asset follows
exp(δt (ι))
xt (ι) = . (9.12)
1 + ∑k exp(δt (k))
We make several observations. First, if α < 0, the households have a
downward-sloping demand curve for this asset. This can be thought
of as a generalization of the downward-sloping demand curve for re-
serve assets that we considered in Section 4.A. In our current setting,
all assets face downward-sloping demand curves.
Second, the 1 in the denominator models the existence of an out-
side asset, which is a benchmark asset whose desirability δt (0) is nor-
malized to be 0. The choice of this outside asset is usually model-
specific. For example, it could be bank deposits when studying the
demand system of money market funds, or it could be the assets is-
sued by small countries with missing characteristics when studying
the demand system of international portfolio allocation.
Third, the households’ allocation to asset ι also depends on the
desirability of other assets. When some other assets become relatively
more desirable, then, the allocation to asset ι goes down. Moreover,
when all inside assets experience a uniform increase in their de-
sirabilities, the allocations to all inside assets go up relative to the
allocation to the outside asset.
Fourth, this set-up could be micro-founded in several ways. The
most classical micro-foundation is borrowed from the field of indus-
trial organization [Berry, 1994, Berry, Levinsohn, and Pakes, 1995,
p t ( ι ) q t ( ι ) = a t x t ( ι ). (9.13)
where the issuance cost c(q) is a function of the quantity q, and the
quantity q is determined by the demand curve at a given price p.
We drop the period-t subscripts for simplicity. Then, the first-order
condition implies that the marginal revenue equals the marginal cost,
which implies
log qdemand
t (ι) = log( at xt (ι)/pt (ι)),
∂ log xt (ι)
which implies that the demand elasticity is3 3
The derivation is simple: ∂ log pt (ι)
=
∂δt (ι) ∂ log(1+∑k exp(δt (k)))
− = α −
∂ log qdemand
t (ι) ∂ log xt (ι) ∂ log pt (ι) ∂ log pt (ι)
− = 1− = 1 − (1 − xt (ι))α. exp(δ (ι))
α 1+∑ expt (δ (k)) .
∂ log pt (ι) ∂ log pt (ι) k t
We assume the wealth at−1 and the asset price pt−1 in the last period
are both 1. In this case, if the saving st = 0, the market clearing
condition and the investor’s wealth dynamics are consistent with
any asset price pt . If the saving st ̸= 0, then, there will be no asset
price that is consistent with the market clearing condition and the
investor’s wealth dynamics. Both cases are pathological.
In comparison, if we add the outside option, the portfolio choice
becomes
exp(α log pt )
xt = ,
1 + exp(α log pt )
pt = pt qt = at xt ,
and, assuming that the outside option pays no return, the investor’s
wealth dynamics follows
pt
a t = a t −1 (1 − x t −1 ) + x t −1 + st = ((1 − xt−1 ) + xt−1 pt ) + st .
p t −1
√
We assume α = −1 and xt−1 = 1/ 2. Then, this system of
equations has only one root with a positive asset price, with,
√ √ √
pt = 2 − 1, at = 2 − 2, xt = 1/ 2,
because the investor can substitute towards the outside asset if the
inside asset is too expensive. As we plot in Figure 9.1, this demand
supply
function crosses a flat supply curve qt = 1 and determines the
unique equilibrium price. In comparison, without the outside asset,
the demand function for the book value of the inside asset is flat, and
the equilibrium price is not determined.
0
0 0.2 0.4 0.6 0.8 1
where xn,t (ℓ) denotes the portfolio share of the entire asset class ℓ,
and xn,t (ι|ℓ) denotes the portfolio share of the issuer country ι within
asset class ℓ. This two-tier structure offers a richer structure to model
the cross-sectional heterogeneity in the demand for different asset
classes and countries separately.
The allocation within the asset class is similar to the one we con-
sidered in the baseline case:
exp(δn,t (ι, ℓ))
xn,t (ι|ℓ) = ,
1 + ∑k exp(δn,t (k, ℓ))
where
captures the desirability of this asset (ι, ℓ). We refer to the currency
of the home country as the dollar. We use pt (ι, ℓ) to denote the asset
price in the local currency units, and et (ι) to denote the log exchange
rate of the local currency per dollar. Then, pt (ι, ℓ) exp(et (n) − et (ι))
captures the price of the asset from the issuer country ι in the units of
the local currency in the investor country n. For simplicity, we ignore
the asset-specific characteristics Ξt (ι). We allow the latent demand
κn,t (ι, ℓ) to be specific to the investor n.
The allocation across asset classes is modeled as
(1 + ∑k exp(δn,t (k, ℓ)))λ exp(ψℓ + ξ n,t (ℓ))
xn,t (ℓ) = ,
∑m (1 + ∑k exp(δn,t (k, m)))λ exp(ψm + ξ n,t (m))
where ψℓ + ξ n,t (ℓ) captures the investor’s preference for asset class
ℓ, which is similar to the latent demand term κn,t (ι, ℓ) for a specific
for all issuer country ι representing the set of insider assets and asset
class ℓ. We do not impose a market clearing condition on the outside
asset, which is assumed to have an infinitely elastic supply at a fixed
local-currency price and desirability.
Finally, as emphasized by Jiang, Richmond, and Zhang [2022c,d],
the investor country n’s wealth responds endogenously to the asset
returns. Let at denote the dollar value of its wealth, which follows the
following law of motion:
!
an,t = an,t−1 ∑ ∑ xn,t−1 (ι, ℓ) exp(rt (ι, ℓ)) + sn,t ,
ℓ ι ≥0
pt (ι, ℓ)
exp(rt (ι, ℓ)) = exp(−∆et (ι)).
pt−1 (ι, ℓ)
We take the saving sn,t , the asset quantity qt (ι, ℓ), and the latent de-
mand κn,t (ι, ℓ) as exogenously given. The portfolio choices xn,t (ι, ℓ),
the market clearing conditions, and the wealth law of motion jointly
determine the equilibrium asset prices, exchange rates, and capital
flows.
The exogenous variables are the savings, the latent demand terms,
and the asset quantities:
δn,t (ι, ℓ) = α log( pt (ι, ℓ) exp(et (n) − et (ι))) + κn,t (ι, ℓ), for n, ι, ℓ = 1, 2,
Summary
• First, asset markets can be segmented between countries. Local households may not be able to
directly hold foreign assets, and they rely on financiers to intermediate the cross-border capital
flows. The financiers face financial constraints, which connect the portfolio imbalances they
have to absorb to the risk premium they charge on foreign assets and currencies for providing
this intermediation service.
• Second, asset markets can be segmented within a country. Some households may not have
access to the financial markets, which disconnects the aggregate consumption from the pricing
of financial assets and exchange rates.
10.A.1 Households
The households’ preferences are similar to those we considered in
Section 6.B. Home and foreign households have preferences over
10.A.2 Firms
We consider very simple firms in this model. In the home country,
the firms use capital k t , labor ℓt , and intermediate input which is xt
units of the home consumption bundle to produce yt units of the
home goods:
(1− ϑ ) 1− ϕ ϕ
yt = exp( at )kϑt ℓt xt .
The firms are competitive, unlike the firms with market power
that we considered in Section 6. Their revenue pt yt is split among the
three factors of production according to their marginal products:
ℓt wt = (1 − ϑ)(1 − ϕ) pt yt ,
k t q t = ϑ (1 − ϕ ) p t y t ,
xt = ϕpt yt .
where
The financiers are myopic and maximize the CARA utility of their
return:
1
E0 − exp(−ω (1 − exp(rxt+1 ))d∗t ) ,
ω
def
where rxt+1 = ∆et+1 + rt − rt∗ is the home currency’s expected excess
return in log. Itskhoki and Mukhin [2021] consider the continuous-
time limit of this problem, in which the optimal solution is given
by
and the home and foreign goods’ market clearing conditions can be
expressed as
yt = y H,t + y∗H,t ,
y∗t = y F,t + y∗F,t .
As for the bond market, the home and foreign bonds are in zero
net supply. Their market clearing conditions can be expressed as
0 = b H,t exp(rt ) + dt + nt ,
0 = b∗F,t exp(rt∗ ) + d∗t + n∗t .
( at , a∗t , ψt )∞
t =0 .
(yt , y H,t , y F,t , ct , k t , zt , xt , ℓt , b H,t , qt , wt , rt , pt , y∗t , y∗H,t , y∗F,t , c∗t , k∗t , z∗t , xt∗ , ℓ∗t , b∗F,t , q∗t , wt∗ , rt∗ , p∗t , et )∞
t =0 ,
plus two auxiliary variables exp(mt+1 ) and exp(m∗t+1 ) that denote the
home and foreign SDFs/marginal utility growth:
u ′ ( c t +1 )
exp(mt+1 ) = δ ,
u′ (ct )
u′ (c∗ )
exp(m∗t+1 ) = δ ′ t+∗ 1 .
u (ct )
η 1/η
h i
ct + zt + xt = α1−η y H,t + (1 − α)1−η y F,t
η
,
h i1/η
c∗t + z∗t + xt∗ = α1−η (y∗F,t )η + (1 − α)1−η (y∗H,t )η ,
yt = y H,t + y∗H,t ,
y∗t = y F,t + y∗F,t ,
η 1/η −1 1−η η −1
h i
α1−η y H,t + (1 − α)1−η y F,t
η
α y H,t = pt ,
η 1/η −1
h i
η −1
α1−η y H,t + (1 − α)1−η y F,t (1 − α)1−η y F,t = p∗t exp(−et ),
η
ℓ1/ν
t = c− γ
t wt ,
h i1/η −1
α1−η (y∗F,t )η + (1 − α)1−η (y∗H,t )η α1−η (y∗F,t )η −1 = p∗t ,
h i1/η −1
α1−η (y∗F,t )η + (1 − α)1−η (y∗H,t )η (1 − α)1−η (y∗H,t )η −1 = pt exp(et ),
(ℓ∗t )1/ν = (c∗t )−γ wt∗ ,
(1− ϑ ) 1− ϕ ϕ
yt = (exp( at )kϑt ℓt ) xt ,
ℓt wt = pt (1 − ϑ)(1 − ϕ)yt ,
k t q t = p t ϑ (1 − ϕ ) y t ,
xt = pt ϕyt ,
y∗t = (exp( a∗t )(k∗t )ϑ (ℓ∗t )(1−ϑ) )1−ϕ ( xt∗ )ϕ ,
ℓ∗t wt∗ = p∗t (1 − ϑ)(1 − ϕ)y∗t ,
k∗t q∗t = p∗t ϑ (1 − ϕ)y∗t ,
xt∗ = p∗t ϕy∗t ,
κ (∆k t+1 )2
z t = k t +1 − (1 − d ) k t + ,
2 kt
"
c t +1 − γ
!#
∆k t+1 ∆k t+2 κ ∆k t+2 2
1+κ = Et δ q t +1 + 1 − d + κ + ,
kt ct k t +1 2 k t +1
κ (∆k∗t+1 )2
z∗t = k∗t+1 − (1 − d)∗ k t + ,
2 k∗t
!2
∗ −γ
∆k∗ c t +1 ∆k∗ ∆k∗t+2
1 + κ t∗+1 = Et δ q∗t+1 + 1 − d + κ t+2 + κ ,
kt c∗t k∗t+1 2 k∗t+1
b H,t exp(rt )
χ1 ψt − χ2 = Et [∆et+1 − (mt+1 − m∗t+1 )], (10.9)
Ȳ
which deviates from the complete-market case 0 = ∆et+1 − (mt+1 −
m∗t+1 ) in Section 1.C by introducing a wedge on the left-hand side
and by introducing the expectation operator so that the condition
only holds on average.
If we replace these equations with the following Euler equations
from the model with convenience yields:
Both Eq. (10.9) in the segmented market model and Eq. (10.11) in
the convenience yield model introduce a wedge to the risk-sharing
condition, which breaks the tight link between the exchange rate
and the households’ marginal utilities. The difference is that, in the
model with market segmentation, the wedge is driven by the port-
folio imbalances born by the financiers, whereas in the model with
convenience yields, the wedge is driven by the households’ demand
for safe assets.
ω H b−
"
c t +1 − γ
σ
#
H,t
1 = Et δ exp(rt ) + −γ ,
ct ct
ω F b−
"
c t +1 − γ
σ
#
∗ F,t
1 = Et δ exp(−∆et+1 + rt ) + −γ exp(et ) ,
ct ct
− ∗ ∗ −
" #
c∗t+1 γ
ω F (bF,t ) σ
1 = Et δ ∗ exp(rt∗ ) + ,
ct (c∗t )−γ
c∗t+1 −γ ω ∗H (b∗H,t )−σ + (c̄∗ )−γ θ H,t
∗
" #
1 = Et δ exp(∆et+1 + rt ) + exp(−et ) .
c∗t (c∗t )−γ
SM,∗
wedges λSM
F,t and λ H,t as
"
c t +1 − γ
#
∗
exp(−λSM = Et δ
F,t ) exp(−∆et+1 + rt ) ,
ct
∗ −γ
" #
∗ c
exp(−λSM,
H,t ) = Et δ
t +1
exp(∆et+1 + rt ) ,
c∗t
and the other two wedges for domestic bond holdings, λSM
H,t and
∗
λSM,
F,t , are always zero because the agents can freely trade their local
3 15 0 0
-0.2
2 10
-0.5
-0.4
1 5
-0.6
-1
0 0
-0.8
-1 -5 -1.5 -1
20 40 60 80 100 20 40 60 80 100 20 40 60 80 100 20 40 60 80 100
8 2 2 2
6 1
0
0
4 0
-2
2 -1
-4 -2
0 -2
-6
-2 -3 -4
20 40 60 80 100 20 40 60 80 100 20 40 60 80 100 20 40 60 80 100
0.15 0.4
-0.2 0.1
0.1 0.2
-0.4 0.05
0.05 0
0 -0.2 -0.6 0
20 40 60 80 100 20 40 60 80 100 20 40 60 80 100 20 40 60 80 100
2 0.6 0
0.2
-0.2
1.5
0.15 0.4
-0.4
1
0.1 -0.6
0.2
0.5 0.05 -0.8
0 0 0 -1
20 40 60 80 100 20 40 60 80 100 20 40 60 80 100 20 40 60 80 100
3 10 0 0
8 -0.5 -20
2
6 -1 -40
4 -1.5 -60
1
2 -2 -80
0 0 -2.5 -100
20 40 60 80 100 20 40 60 80 100 20 40 60 80 100 20 40 60 80 100
In both models, the NFA shifts upon the shock’s arrival, but it needs
to revert to the steady-state level in the long run, which requires
adjustments in either the trade balance tb or the financial flow ib + cg
in subsequent periods. Formally, since limk→∞ Et [n f at+k ] = n f a, we
can iterate the NFA dynamics (9.4) and obtain
∞
n f at = n f a − ∑ [tbt+k + (ibt+k + cgt+k )],
k =1
which means that if the U.S. runs a negative NFA today, it has to
be offset by positive trade balances or by positive financial gains
relative to the foreign country [Gourinchas and Rey, 2007a]. In the
segmented market model, this is achieved only by the trade channel:
the negative U.S. trade balance in the near term is offset by a positive
trade balance in the long term.1 1
In Itskhoki and Mukhin [2021], the
In the convenience yield model, the valuation effects matter: the model is linearized around a steady
state with zero bond positions, which
negative U.S. NFA is offset by higher financial gains that the U.S. means that the valuation effects ib + cg
earn from holding the foreign bond relative to what the foreign are equal to zero in the first order. In
extensions with non-zero NFA or non-
country earns from holding the U.S. bond, which is precisely the zero gross positions in the steady state,
seigniorage revenue that we characterize in Section 7.C. In this case, the valuation effects can matter in the
the U.S. does not have to run positive trade balances in the long term first order.
to pay back their trade deficits in the short term. Another place to
see this difference is in the NFA dynamics that we plot in the last
panel of Figure 10.1. While the U.S. households run trade deficits
in both models, they accumulate negative NFA in the segmented
market model, which has to be paid back by trade surpluses in the
long term. In comparison, the U.S. households do not accumulate a
large negative NFA in the convenience yield model, because the trade
Given the similarity between the segmented market model and the
convenience yield model, how do we think about them? We make
three points. First, they both suggest that the wedge in the Euler
equations for cross-country bond holdings is an important ingredient
to understand exchange rate dynamics. This wedge can have differ-
ent microfoundations, but they have similar effects that disentangle
the exchange rate movement from the households’ marginal utilities.
Second, the convenience yield mechanism provides an economic
interpretation for the cyclicality of the wedge, because it tends to
move in the direction that appreciates the dollar in flight-to-safety
episodes. As a result, the wedge does not just produce exchange
rate disconnect, it also makes the dollar more counter-cyclical to the
global business cycles than other currencies. Moreover, the conve-
nience yield interpretation also offers empirical measurement of the
wedge, as we discussed in Section 4.C.
Third, is the bond convenience yield on the dollar bond all there
is to the wedge-based view of the exchange rate dynamics? Let us
return to the Euler equations (10.10) with slightly different labeling:
exp(−λt ) = Et [exp(mt+1 + rt )] ,
exp(−ξ t ) = Et [exp(mt+1 − ∆et+1 + rt∗ )] ,
exp(−ξ t∗ ) = Et exp(m∗t+1 + rt∗ ) ,
which emphasizes that λt and λ∗t are the non-pecuniary utilities that
home and foreign households derive from the U.S. bond, whereas ξ t
and ξ t∗ are the non-pecuniary utilities that home and foreign house-
holds derive from the foreign bond. Assuming joint normality, Jiang,
Krishnamurthy, and Lustig [2023a] show the following result.
λ∗t − λt > 0,
will not suffice. In fact, holding the exchange rate variance constant,
a higher convenience yield differential would increase the covariance
between the SDF differential and the exchange rate movement.
We need the foreign households to also derive higher convenience
yield on the foreign bond than the U.S. households, i.e.,
ξ t∗ − ξ t > 0,
which could lower the exchange rate cyclicality and even make it
negative, so that the foreign currency depreciates when the foreign
marginal utility is high. This condition can be interpreted as the U.S.
households finding it very undesirable to hold the foreign bond,
which can result from a strong home bias. In fact, we can rewrite Eq.
(10.12) as
Bruno and Shin [2015a,b] emphasize the role of global banks, who
tap dollar funding from financial centers and lend to local banks in
foreign countries. When the dollar appreciates, currency mismatch
weakens the local banks’ balance sheets and contracts their risk-
taking capacity. As a result, the dollar’s strength impacts global fi-
nancial conditions and generates dollar shortage in foreign countries.
Greenwood, Hanson, Stein, and Sunderam [2020], Gourinchas, Ray,
and Vayanos [2022] develop models with financiers who trade in both
the currency market and the home and foreign long-term bond mar-
kets. The variation in their risk-taking capacity leads to comovements
in exchange rates and bond term premia.
The deviation from covered interest rate parity as we discussed in
Section 2.A.6 is also a natural place to consider market segmentation
and financial intermediation. The key mechanism usually involves
constrained financiers and other sectors’ imbalanced hedging de-
mand [Borio, Iqbal, McCauley, McGuire, and Sushko, 2018, Andersen,
Duffie, and Song, 2019, Avdjiev, Du, Koch, and Shin, 2019, Liao and
Zhang, 2020, Amador, Bianchi, Bocola, and Perri, 2020, Cenedese,
Della Corte, and Wang, 2021, Rime, Schrimpf, and Syrstad, 2022]
The presence of financial frictions also makes it possible for the
policy interventions to improve outcomes and mitigate frictions. See
Fanelli and Straub [2020], Bocola and Lorenzoni [2020], Itskhoki and
Mukhin [2022]. In the context of the model in Section 10.A, if the pol-
icymaker can lower the exchange rate volatility, or at least convince
the financiers that the policymaker is committed to interventions that
lower the exchange rate volatility, then, the financiers will require a
lower currency risk premium to absorb a given amount of portfolio
imbalances, which, by Eq. (10.4), will indeed make the exchange rate
less volatile.
Empirically, there is a large literature that shows intermediary
balance sheet quantities do matter to asset prices [Adrian, Etula,
and Muir, 2014, He, Kelly, and Manela, 2017], which is either loosely
or precisely related to the equilibrium conditions in models with
financial intermediaries.
10.C.1 Households
We again consider two countries: home and foreign. There is a unit
mass of households in each country. We deviate from the baseline
model in Section 1.A by assuming that some households do not have
access to the financial markets. Specifically, we assume that each
country has two types of households: active and inactive.
Active households can trade a complete set of state-contingent
claims in international markets. They behave like the households in
the complete-market version of the baseline model in Section 1.C. Let
ϕ ∈ (0, 1) denote the fraction of these households in each country. We
use ( a) to denote the active households. The active households in the
home country maximize their expected lifetime utility
∞
" #
E0 ∑ δt u(ct (a)) ,
t =0
They cannot trade any financial claims. Instead, they cede their en-
dowments to the active households and receive a stochastic transfer,
which is denoted as τt for each inactive household in the home coun-
try and τt∗ for each inactive household in the foreign country. Inactive
households can still participate in the goods market, where they use
their transfers to purchase home and foreign goods.
Because the active and inactive households in the home country
have identical preferences, they face the same within-period problem
and will choose the same consumption bundle, which is again iden-
tical to the solution in Section 1.A. Therefore, we can use the home
consumption bundle to refer to the numéraire shared by both types
of households.
Let pt denote the price of home goods in the home consumption
bundle, let at−1 ( a) denote the financial wealth held by the active
households, and let exp(rta ) denote the gross return on this wealth
portfolio. The home active households’ budget constraint can be
expressed as
(1 − ϕ)τt = (1 − ϕ)ct (i ).
If we only consider the active agents from home and foreign coun-
tries, the transfer to the inactive households takes away resources
available to them. Effectively, the resources that the social planner al-
locates between the active households in home and foreign countries
are equal to the total endowment minus the transfer. Thus, we can
c F,t ( a) 1−α
t ′
w.r.t. c H,t ( a): δ πu (ct ( a))α = ϕζ H,t ,
c H,t ( a)
!α
c∗F,t ( a)
w.r.t. c∗H,t ( a): δt (1 − π )u′ (c∗t ( a))(1 − α) = ϕζ H,t ,
c∗H,t ( a)
c H,t ( a) α
w.r.t. c F,t ( a): δt πu′ (ct ( a))(1 − α) = ϕζ F,t ,
c F,t ( a)
!1− α
∗ t ′ ∗
c∗H,t ( a)
w.r.t. c F,t ( a): δ (1 − π )u (ct ( a))α = ϕζ F,t .
c∗F,t ( a)
(ct ( a), c H,t ( a), c F,t ( a), ct (i ), c H,t (i ), c F,t (i ), ζ H,t , c∗t ( a), c∗H,t ( a), c∗F,t ( a), c∗t (i ), c∗H,t (i ), c∗F,t (i ), ζ F,t )∞
t =0 .
τt = ct (i ),
τt∗ = c∗t (i ),
c H,t ( a) c (i )
= H,t ,
c F,t ( a) c F,t (i )
c∗H,t ( a) c∗H,t (i )
= ∗ .
c∗F,t ( a) c F,t (i )
ȳŷt = ϕ(c̄ H ( a)ĉ H,t ( a) + c̄∗H ( a)ĉ∗H,t ( a)) + (1 − ϕ)(c̄ H (i )ĉ H,t (i ) + c̄∗H (i )ĉ∗H,t (i )),
ȳ∗ ŷ∗t = ϕ(c̄ F ( a)ĉ F,t ( a) + c̄∗F ( a)ĉ∗F,t ( a)) + (1 − ϕ)(c̄ F (i )ĉ F,t (i ) + c̄∗F (i )ĉ∗F,t (i )),
τ̂t = ĉt (i ),
τ̂t∗ = ĉ∗t (i ),
where A is given by
A=
−(1 − ϕ)(1 − α)c̄ H (i ) ϕc̄∗H ( a) + (1 − ϕ)αc̄∗H (i ) −(1 − ϕ)αc̄∗H (i )
ϕc̄ H ( a) + (1 − ϕ)(1 − α)c̄ H (i )
−(1 − ϕ)αc̄ F (i ) ϕc̄ F ( a) + (1 − ϕ)αc̄ F (i ) −(1 − ϕ)(1 − α)c̄∗F (i ) ϕc̄∗F ( a) + (1 − ϕ)(1 − α)c̄∗F (i )
,
α − 1 − γα (1 − γ)(1 − α) α + γ − γα ( γ − 1) α
( γ − 1) α α + γ − γα (1 − γ)(1 − α) α − 1 − γα
and the total consumption is given by
c̄ H ( a) = c̄ H (i ) = c̄∗F ( a) = c̄∗F (i ) = α,
c̄ F ( a) = c̄ F (i ) = c̄∗H ( a) = c̄∗H (i ) = 1 − α.
which implies
1 − (1 − ϕ)ρ σστy
corrt (êt+1 , ĉ∗t+1 − ĉt+1 ) = r .
2
1 + (1 − ϕ)2 σστ2 − 2(1 − ϕ)ρ σστy
y
1 ∗
ĉ∗t+1 ( a) − ĉt+1 ( a) = [(ŷ − ŷt+1 ) − (1 − ϕ)(τ̂t∗+1 − τ̂t+1 )],
ϕ t +1
multiply the first and the last first-order conditions and divide it by
the product of the second and third first-order conditions to get
∗
c F,t c H,t α2
=1
c∗F,t c H,t (1 − α)2
α c F,t 1 − α c H,t
⇒ ∗ =
1 − α c F,t α c∗H,t
Let
def α c F,t 1 − α c H,t
kt = =
1 − α c∗F,t α c∗H,t
α 1−α ∗
⇒ c H,t = k t c∗ , c F,t = k t c F,t
1 − α H,t α
Plugging the equations above into the market clearing conditions
yields
αk t 1−α
c H,t = yt , c∗H,t = yt ,
(1 − α) + αk t (1 − α) + αk t
(1 − α ) k t α
c F,t = y∗ , c∗F,t = y∗ ,
α + (1 − α ) k t t α + (1 − α ) k t t
.
Divide the first first-order condition by the second to get
!α
c∗H,t
1− α
π u′ (ct ) α
c F,t
= 1.
1 − π u′ (c∗t ) 1 − α c H,t c∗F,t
where c H,t , c F,t , c∗H,t and c∗F,t are functions of k t . Hence, we can solve
k t via the implicit equation above.
p̄ȳ(log pt − log p̄ + log yt − log ȳ) = αc̄(log ct − log c̄) + (1 − α)c̄∗ exp(−ē)(log c∗t − log c̄∗ − et + ē)
p̄∗ ȳ∗ (log p∗t − log p̄∗ + log y∗t − log ȳ) = αc̄∗ (log c∗t − log c̄∗ ) + (1 − α)c̄ exp(ē)(log ct − log c̄ + et − ē)
subtracting the second equation from the first and plugging in steady
states yields
p̄ȳ(log yt − log y∗t ) + p̄ȳ(log pt − log p∗t ) = (2α − 1)c̄(log ct − log c∗t ) − 2(1 − α)c̄et
recall that et = (2α − 1)tott , where tott = log pt − log p∗t + et , hence
log pt − log p∗t = (2 − 2α)/(2α − 1)et , and
2 − 2α
p̄ȳ(log yt − log y∗t ) + p̄ȳ et = (2α − 1)c̄(log ct − log c∗t ) − 2(1 − α)c̄et
2α − 1
2 − 2α
(2α − 1)(log ct − log c∗t ) − 2(1 − α)et − et = log yt − log y∗t
2α − 1
2α
(2α − 1)(log ct − log c∗t ) − 2(1 − α) et = log yt − log y∗t
2α − 1
Proof. Plug in
h i √ √ q q q
rxti/$
+1 − E t rx i/$
t +1 = − δ i − δ$ z w εw −
t t +1 γz i εi
t t +1 − γz $ $
ε
t t +1
1 √ √ q 1
q q
hmlt+1 − Et [hmlt+1 ] = −
NH ∑ δi
− δ $ w w
z t ε t +1 −
NH i∑
i i
γzt ε t+1 − γzt ε t+1$ $
i∈ H ∈H
1 √ √ q q
w εw + 1
q
NL i∑ NL i∑
i − δ$ i εi $ $
+ δ z t t +1 γz t t +1 − γz ε
t t +1
∈L ∈L
q q q
1 1
q q
NH i∑ ∑ γzit εit+1
w i εi
= δtL − δtH zw ε
t t +1 − γz t t +1 +
∈H
N L i∈ L
q q q
w
= δtL − δtH zw
t ε t +1
q q
The country-specific shock terms, N1H ∑i∈ H γzit εit+1 and N1L ∑i∈ L γzit εit+1
averages out according to the law of large numbers. Hence,
√ √
δ $ − δi
q q
i/$ i/$ i $ $
rxt+1 = rpt + q q (hmlt+1 − Et [hmlt+1 ]) − i
γzt ε t+1 − γzt ε t+1
δtL − δtH
and
1 1
rti − rt$ = (χ − γ)(zit − z$t ) − (δi − δ$ )zw
t .
2 2
$
Note that zw i
t is uncorrelated zt and zt . Hence,
2 2
1 1 i
var (rti − rt$ )
= χ − γ var (zit − z$t ) + δ − δ$ var (zw
t )
2 4
1
cov(∆ei/$ i $ i $
t+1 , rt − rt ) = − χ χ − 2 γ var ( zt − zt )
cov(rxti/$ i $ i/$ i $ i $
+1 , rt − rt ) = cov ( ∆et+1 + rt − rt , rt − rt )
= cov(∆ei/$ i $ i $
t+1 , rt − rt ) + var (rt − rt )
j j
Note that zit , zt , zw i w
t , ε t+1 , ε t+1 , ε t+1 are independent for i ̸ = j. By the
law of large numbers,
j 1 1 j
cov(zit − zt , zit − zt ) = var (zit ) − var (zit ) + var (zt ) = var (zit )
N N
2 1 N
var (zit − zt ) = var (zit ) − var (zit ) + 2 ∑ var (zt ) = var (zit )
j
N N j =1
q q h i q
cov( zit εit+1 , zit ) = E zit zit εit+1 − E zit E zit εit+1 = 0
q h i hq i2
w w w 2 w (εw )
var ( zw ε
t t +1 ) = E z t ( ε t +1 ) − E z t t +1
h i
= E [zw w
t ] E ( ε t +1 )
2
= θw
q
var ( zit εit+1 ) = θ
Hence,
√
p √ √
i/j
cov(∆et+1 , ∆eit+1 ) = χ2 var (zit ) + δj δi − δ θ w + γθ
δi −
√ √ 2
var (∆eit+1 ) = χ2 var (zit ) + δi − δ θ w + γθ
√ p 2
i/j j
var (∆et+1 ) = χ2 (var (zit ) + var (zt )) + δi − δ j θ w + 2γθ
i/j
cov(∆et+1 , ∆eit+1 )
φ=
var (∆eit+1 )
√ √ √ √
δi − δ δj − δ θw
= 1− √ √ 2 ,
2 i
χ var (zt ) + i w
δ − δ θ + γθ
φ2 var (∆eit+1 )
R2 = i/j
var (∆et+1 )
h √ √ √ √ i2
χ2 var (zit ) + δi − δ j δi − δ θ w + γθ
= √ √ 2
√ √ 2 .
j
χ2 var (zit ) + δi − δ θ w + γθ χ2 (var (zit ) + var (zt )) + δi − δ j θ w + 2γθ
which implies
θ
mt+1 = θ log δ − ∆c + (θ − 1)rtc+1
ψ t +1
θ
= µs − ( xt + σt ε g,t+1 )
ψ
n o
+ (θ − 1) [1 + Wx (ρ − κ1c )] xt + Wσ (σt2 − σ2 )(ϕ − κ1c ) + σt ε g,t+1 + Wx σt φe ε x,t+1 + Wσ ωε w,t+1
θ
= µs + − + (θ − 1)[1 + Wx (ρ − κ1c )] xt − γσt ε g,t+1
ψ
+ {Wσ (ϕ − κ1c )(θ − 1)} (σt2 − σ2 ) + (θ − 1) {Wx σt φe ε x,t+1 + Wσ ωε w,t+1 }
which implies
0 = r0c + µs
θ2 n 2 2 2
o 1 θn o
+ Wx φe σ + Wσ2 ω 2 + θ − + [1 + Wx (ρ − κ1c )] xt + 2Wσ (ϕ − κ1c ) + θWx2 φ2e (σt2 − σ2 )
2 ψ 2
1
+ (1 − γ)2 σt2 + θν(1 − γ)Wx φe σt2
2
θ2 n 2 2 2 o
r0c + µs + Wx φe σ + Wσ2 ω 2 − Wσ θ (ϕ − κ1c )σ2 = 0 (A.1)
2
1 c
θ − + [1 + Wx (ρ − κ1 )] = 0 (A.2)
ψ
1 θ n o
(1 − γ )2 + 2Wσ (ϕ − κ1c ) + θWx2 φ2e + θν(1 − γ)Wx φe = 0 (A.3)
2 2
which implies
1
1− ψ
Wx =
κ1c − ρ
(1 − γ)(1 − ψ1 ) φ2e 2νφe
Wσ = + c +1
2(κ1c − ϕ) c
(κ1 − ρ ) 2 κ1 − ρ
Furthermore, we can solve additionally for r0c , κ0c , κ1c and µwc , using
Eq. (A.1) and
eµwc
κ0c = − log(eµwc − 1) + µwc
eµwc−1
eµwc
κ1c =
eµwc − 1
r0 = κ0c + µ g + µwc (1 − κ1c )
c
1
Et [rxt+1 ] = (vart (m∗t+1 ) − vart (mt+1 ))
2
1 h 2 ∗2 i
= γ σt + (θ − 1)2 Wx2 φ2e σt∗2 + (θ − 1)2 Wσ2 ω 2 − 2νγ(θ − 1)Wx φe σt∗2
2
1h 2 2 i
− γ σt + (θ − 1)2 Wx2 φ2e σt2 + (θ − 1)2 Wσ2 ω 2 − 2νγ(θ − 1)Wx φe σt2
2
1h 2 i
= γ + (θ − 1)2 Wx2 φ2e − 2νγ(θ − 1)Wx φe (σt∗2 − σt2 )
2
1
rt = −Et [mt+1 ] − vart (mt+1 )
2
θ c c 2 2
= − µs + − + (θ − 1)[1 + Wx (ρ − κ1 )] xt + {Wσ (ϕ − κ1 )(θ − 1)} (σt − σ )
ψ
1 n o
− γ2 σt2 + (θ − 1)2 Wx2 φ2e σt2 + (θ − 1)2 Wσ2 ω 2 + νγ(θ − 1)Wx φe σt2
2
1
rt∗ = −Et [m∗t+1 ] − vart (m∗t+1 )
2
θ c ∗ c ∗2 2
= − µs + − + (θ − 1)[1 + Wx (ρ − κ1 )] xt + {Wσ (ϕ − κ1 )(θ − 1)}(σt − σ )
ψ
1 n 2 ∗2 o
− γ σt + (θ − 1)2 Wx2 φ2e σt∗2 + (θ − 1)2 Wσ2 ω 2 + νγ(θ − 1)Wx φe σt∗2
2
where vart (σt∗ ε∗g,t+1 − σt ε g,t+1 ) = σt2 + σt∗2 − 2σt σt∗ covt (ε g,t+1 , ε∗g,t+1 ).
γ − d = V′π + W′γ
γ = ( I − W ′ )−1 (V ′ π + d) = H ′ π + ( I − W ′ )−1 d.
where κ x = θ log ℓi + ∑ N
j=1 wij log wij is a vector of constants.
Define H = V ( I − W )−1 . The log consumption of active house-
holds is
!
N π i vij j
log c = ∑ vij log
i
x
j =1 γj
N
log c = ∑ vij log π i vij + V log x − V log γ
j =1
N
= ∑ vij log π i vij + H (κ x + log a − θ log γ)
j =1
j
cov(∆ log cit , ∆ log ct ) = C(i, j).
!
N N
1 1
∑ ∑
j j
cov(∆eit+1 , ∆et+1 ) = cov ∆ckt+1 − ∆cit+1 , ∆ckt+1 − ∆ct+1
N k =1
N k =1
= C(i, j) − C(i ) − C( j) + κ e .
c̄t
mit+1 = log δ
c̄t+1
= log δ − ∆ log c̄t+1
As shown in Section 1.B, the log interest rate and currency risk pre-
mium are given by:
1
rti = −Et [mit+1 ] − vart (mit+1 )
2
θ 1
= − log δ − HEt [∆ log at+1 ] + H ( I − W ′ )−1 Et [∆dt+1 ] − vart (∆ log c̄it+1 )
H′ π 2
1
= − log δ − C(i, i )
2
h i
def
rpi/$
t = Et rxti/$
+1
= Et [−ei/$ i $
t +1 ] + r t − r t
1 1
= − vart (mit+1 ) + vart (m$t+1 )
2 2
1
= (C($, $) − C(i, i ))
2
Proof. The long term bond return and price are given by
P T
exp(−rt (h)h) = pt (h) = Et [exp(mt,t +h + mt,t+h )]
= Et [exp(mctP+h − mctP + mctT+h − mctT )]
"
h
1
= Et exp ∑ (− σP2 + σP ε Pt+k )
k =1
2
!#
h h h
+ ∑ϕ h−k T
mc + ∑ϕ h−k
(t + k) log β + (ϕ h
− 1)mctT + ∑ ϕ h−k
σT ε Tt+k
k =1 k =1 k =1
!
1 − ϕh T t + h − ϕ h ( t + 1) ϕ − ϕh
= exp mc + − log β + (ϕh − 1)mctT
1−ϕ 1−ϕ (1 − ϕ )2
!
1 − ϕ2h 2 1 − ϕh
+ σ + σTP .
2 − 2ϕ2 T 1−ϕ
Hence,
!
1 − ϕh T t + h − ϕ h ( t + 1) ϕ − ϕh h T 1 − ϕ2h 2 1 − ϕh
−r t ( h ) h = mc + − log β + ( ϕ − 1 ) mc t + σ + σTP ,
1−ϕ 1−ϕ (1 − ϕ )2 2 − 2ϕ2 T 1−ϕ
!
1 − ϕh T t − ϕ h ( t + 1) ϕ − ϕh 1 − ϕ2h 2 1 − ϕh
log β
rt ( h) + h=− mc − − log β − (ϕh − 1)mctT − σ − σTP ,
1−ϕ 1−ϕ 1−ϕ (1 − ϕ ) 2 2 − 2ϕ2 T 1−ϕ
which implies that the constant δ̃ = β1/(1−ϕ) satisfies 0 < limh→∞ pt (h)/δ̃h <
∞, and the long-term bond yield satisfies
and
t
lim (rt (h) + log δ̃)h = − log β + mctT .
h→∞ 1−ϕ
Recall that
mc T ϕ log β log β
mctT = − + t − γ log ytT
1−ϕ 1−ϕ1−ϕ 1−ϕ
∞
mc T ϕ log β log β
= − + t + ∑ ϕi σT ε Tt−i
1−ϕ 1−ϕ1−ϕ 1−ϕ i =0
plugging mctT into the bond return immediately yields Eq. (3.19).
exp(−rt ) = Et [exp(mt,t+1 )]
= Et [exp(mctP+1 − mctP + mctT+1 − mctT )]
1 2 P T T T
= Et exp − σP + σP ε t+1 + mc + (ϕ − 1)mct + (t + 1) log β + σT ε t+1
2
T T 1 2
= exp mc + (ϕ − 1)mct + (t + 1) log β + σT + σTP ,
2
1
−rt = mc T + (ϕ − 1)mctT + (t + 1) log β + σT2 + σTP .
2
Taking h → ∞ yields
1 2
lim Et [log pt−1 (h − 1) − log pt (h) − rt ] = σ + σTP .
h→∞ 2 T
y
pt (h) = Et [exp(mt,t+h + log yt+h )]
= Et [exp(mctP+h − mctP + mctT+h − mctT + log yt+h )]
"
h
1
= Et exp ∑ (− σP2 + σP ε Pt+k )
k =1
2
h h h
+ ∑ ϕh−k mcT + ∑ ϕh−k (t + k) log β + (ϕh − 1)mctT + ∑ ϕh−k σT ε Tt+k
k =1 k =1 k =1
!#
h h
+ log ytP + hµ + ∑ νP εPt+k + ϕh log ytT + ∑ ϕh−k νT εTt+k
k =1 k =1
!
1 − ϕh T t + h − ( t + 1) ϕ h
ϕ − ϕh
= exp mc + − log β + (ϕh − 1)mctT
1−ϕ 1−ϕ (1 − ϕ )2
!
1 − 2γ 2 1 − ϕ2h γ − 1 2 2 1 − ϕh γ − 1 2
P h T
+ log yt + hµ + ϕ log yt + hσP + σT + σTP ,
2γ2 2(1 − ϕ2 ) γ 1−ϕ γ
y y
Et [log pt+1 (h − 1) − log pt (h) − rt ]
!
h −1 ( t + 1 ) ϕ h − ( t + 2 ) ϕ h −1 ϕ h −1
= − ϕ mc + T
+ log β + (ϕh−1 − 1)(mc T + ϕmctT + (t + 1) β) − (ϕh − 1)mctT
1−ϕ 1−ϕ
1 − 2γ 2 ϕ2(h−1) γ − 1 2 2
2
h −1 γ − 1
− σ − σT − ϕ σTP − rt
2γ2 P 2 γ γ
" # " 2 #
2γ − 1 2 1 ϕ 2( h −1) γ − 1 2 2 h −1 γ − 1
= σ + − σT + 1 − ϕ σTP .
2γ2 P 2 2 γ γ
Taking h → ∞ yields
y y 2γ − 1 2 1 2
lim Et [log pt+1 (h − 1) − log pt (h) − rt ] = σ + σ + σTP .
h→∞ 2γ2 P 2 T
ct ct
pt = α , p∗t exp(−et ) = (1 − α)
c H,t c F,t
and
c∗t c∗t
p∗t = α , pt exp(et ) = (1 − α) ,
c∗F,t c∗H,t
∞ ∞
" # " #
∑δ t
(u(ct ) + v(b H,t ; θt )) + (1 − π )E0 ∑δ t
u(c∗t ) + v(b∗H,t ; θt∗ )
L = πE0
t =0 t =0
∞ ∞
" # " #
+ E0 ∑ ζ H,t yt − c H,t − c∗H,t + ζ F,t y∗t − c F,t − c∗F,t + E0 ∑ ξ H,t b̄t − b H,t − b∗H,t
.
t =0 t =0
Proof. (1) The first-order condition for the home household’s portfo-
lio choice problem (5.2) is:
γt
exp(−δt) = ζ H exp(mνH ,t ), (A.7)
ct
where exp(m∗ν∗ ,t ) denotes the foreign country state price density that
F
bounds all the budget constraints, and the Lagrangian multiplier ζ F
is such that the budget constraint evaluated at the optimal consump-
tion expenditure c∗t is satisfied with equality
Z T
E exp(m∗ν∗ ,t )(c∗t − p∗t y∗t )dt = w0∗ .
0 F
Define
T
Z
exp(mνH ,u )
s t = Et pu yu du .
t exp(mνH ,t )
Assume the law of motion of st is given by
For the foreign households, the first-order condition for the foreign
household’s static variational portfolio choice problem in the home
numéraire is:
γ∗
exp(−δt) = ζ̃ F exp(mν̃F ,t ), (A.12)
c̃∗t
where exp(mν̃F ,t ) denotes an appropriate home country state price
density that bounds all the budget constraints of the foreign house-
hold, and ζ̃ F is the Lagrange multiplier such that the budget con-
straint evaluated at the optimal consumption expenditure, c̃∗t , is satis-
fied with equality:
Z T
E exp(mν̃F ,t )(c̃t − pt exp(−et )yt )dt = w̃0∗ .
∗ ∗ ∗
0
Similarly, denote
T
Z
exp(mν̃F ,u ) ∗
s̃∗t = Et pt exp(−et )y∗t du .
t exp(mν̃F ,t )
Assume the law of motion of s̃∗t is given by
∗ ∗
ds̃∗t = µs̃t s̃∗t dt + σts̃ s̃∗t dZt
In incomplete markets, the matrix σt′ σt has a solution if and only if its
right hand side lies in Span(σt ). This entails a restriction:
s̃∗ σ′ σt
∗
ν̃F,t = ∗ t ∗ I3 − t 2 (σts̃ )′ , (A.15)
w̃t + s̃t ∥σt ∥
σ′ σ
where we apply operator I3 − ∥σt ∥t2 to both sides of (A.14).
t
To get the optimal portfolio choice for the foreign household, we
σt′
apply the operator ∥σt ∥2
from right to both sides of (A.14).
∗
w̃t∗ + s̃∗t σt′ s̃∗t σts̃ σt′
xt∗ − 1 = [( λ t + ν̃F,t ) ′
] −
w̃t∗ ∥σt ∥2 w̃t∗ ∥σt ∥2
w̃ ∗ + s̃∗ rt∗ − κ t + 21 σt σt′ − rt ∗
s̃∗t σts̃ σt′
xt∗ = 1 − t ∗ t − .
w̃t ∥σt ∥2 w̃t∗ ∥σt ∥2
Then, the marginal utilities of the representative agent and the indi-
vidual agents evaluated at the optimum are related as
α (1 − α ) γt
c H,t = ct , c F,t = ct , exp(−δt) = ζ H exp(mνH ,t ),
pt p∗t exp(−et ) ct
we have
We substitute the solutions of c H,t , c F,t , c∗H,t , c∗F,t into the above gradi-
ents of ∇u c H,t , c∗H,t to get the state price density that bound all the
budget constraints of the home consumers.
exp(mνH ,t ) γt c p c γt
exp(mνH ,t ) = = exp(−δt) 0 = exp(−δt) 0 H,0
exp(mνH ,0 ) γ0 ct pt c H,t γ0
p 0 y 0 γt α + π t γ ∗ ( 1 − α )
= exp(−δt) (A.16)
pt yt γ0 α + π0 γ∗ (1 − α)
Therefore, by the expression of the state-price-density in A.16, the
price of the home stock in the home numéraire is
Z T
1
st = Et exp(mνH ,u ) pu yu du (A.17)
exp(mνH ,t ) t
Z T
exp(δt) pt yt ∗
= E t exp (− δu )( γ u α + π u γ ( 1 − α )) du
γt α + π t γ ∗ ( 1 − α ) t
Z T
exp(δt) pt yt 1 ∗
= ( exp (− δt ) − exp (− δT )) γ t α + E t [ exp (− δu ) π u γ ( 1 − α ) du ]
γt α + π t γ ∗ ( 1 − α ) δ t
1 − exp(−δ( T − t)) exp(δt) pt yt γ∗ (1 − α) T
Z
= pt yt + E t exp (− δu )( π u − π t ) du ,
δ γt α + π t γ ∗ ( 1 − α ) t
Therefore, we have
1 − exp(−δ( T − t)) 1 − exp(−δ( T − t))
wt + st + w̃t∗ + s̃∗t = (ct + c̃∗t ) = ( pt yt + p∗t y∗t exp(−et )) ,
δ δ
where we use the fact that the home and foreign households’ total
consumption expenditure at time t equals the value of their endow-
ments. On the other hand, from the resource constraint,
1 − exp(−δ( T − t))
st + s̃∗t = ( pt yt + p∗t y∗t exp(−et )),
δ
with (A.17) and (A.19), we have
Then, we can conclude that the home stock price in the home numéraire
is:
1 − exp(−δ( T − t))
st = pt yt ,
δ
and the price of the foreign stock in the foreign numéraire is:
1 − exp(−δ( T − t)) ∗ ∗
s∗t = pt yt .
δ
Therefore, the price of the foreign stock prices in the home numéraire
is:
1 − exp(−δ( T − t)) ∗ ∗
s̃∗t = exp(−et ) pt yt .
δ
By the previous results for good prices and the exchange rate,
1− α α −1
1−α
α
pt = α exp(tott ) , p∗t =α exp(tott ) , exp(et ) = exp(tott )2α−1 ,
α 1−α
we have
1− α
1 − exp(−δ( T − t)) 1−α
st = α exp(tott ) yt , (A.22)
δ α
α −1
1 − exp(−δ( T − t))
α
s∗t = α exp(tott ) y∗t ,
δ 1−α
α −1
1 − exp(δ( T − t))
∗ α
s̃t = α exp(tott )−α y∗t , (A.23)
δ 1−α
and hence
y∗t
s∗t = st exp(−tott ) .
yt
which lead to
γt α γt ( 1 − α ) y∗
wt + st = ∗
st + ∗
exp(tott )−1 t st ,
γt α + π t γ ( 1 − α ) γt ( 1 − α ) + π t γ α yt
∗
π t γ (1 − α ) yt ∗
πt γ α
wt∗ + s̃∗t = exp(tott ) ∗ s∗t + s∗ .
γt α + π t γ ∗ ( 1 − α ) yt γt ( 1 − α ) + π t γ ∗ α t
γt α + πt γ∗ (1 − α) y∗t
exp(tott ) = ,
γt ( 1 − α ) + π t γ ∗ α y t
(2) Define
def def def
i1 = (1, 0, 0), i2 = (0, 1, 0), i3 = (0, 0, 1).
γt α + πt γ∗ (1 − α) y∗t
d exp(tott ) = d
γt ( 1 − α ) + π t γ ∗ α y t
1 1 1 1
= exp(tott ) ∗ dy∗t − dyt − [dyt , dy∗t ] + 2 [dyt , dyt ]
yt yt yt y∗t yt
1 1
+ (αdγt + γ∗ (1 − α)dπt ) − ((1 − α)dγt + γ∗ αdπt )
γt α + π t γ ∗ ( 1 − α ) γt ( 1 − α ) + π t γ ∗ α
1 [dy∗t , αdγt + γ∗ (1 − α)dπt ] 1 [dyt , αdγt + γ∗ (1 − α)dπt ]
+ ∗ −
yt γt α + π t γ ∗ ( 1 − α ) yt γt α + π t γ ∗ ( 1 − α )
∗
1 [dyt , (1 − α)dγt + γ αdπt ]∗ 1 [dyt , (1 − α)dγt + γ∗ αdπt ]
− ∗ +
yt γt ( 1 − α ) + π t γ ∗ α yt γt ( 1 − α ) + π t γ ∗ α
∗ ∗
[(1 − α)dγt + γ αdπt , (1 − α)dγt + γ αdπt ] [αdγt + γ∗ (1 − α)dπt , (1 − α)dγt + γ∗ αdπt ]
+ −
( γt ( 1 − α ) + π t γ ∗ α ) 2 [γt α + πt γ∗ (1 − α)][γt (1 − α) + πt γ∗ α]
σt αγt ωi3 + γ∗ (1 − α)πt (ν̃F,t − νH,t )′ (1 − α)γt ωi3 + γ∗ απt (ν̃F,t − νH,t )′
= −σy i1 + σy∗ i2 + ∗
−
2α − 1 γt α + π t γ ( 1 − α ) γt ( 1 − α ) + π t γ ∗ α
2
κt 1 1 1 ∥αγt ωi3 + γ∗ (1 − α)πt (ν̃F,t − νH,t )′ ∥2 1 ∥(1 − α)γt ωi3 + γ∗ απt (ν̃F,t − νH,t )′ ∥
= µy∗ − µy + σy2 − σy2∗ − + .
2α − 1 2 2 2 (γt α + πt γ∗ (1 − α))2 2 ( γt ( 1 − α ) + π t γ ∗ α ) 2
Plug in Eq. (A.22) and (A.23) to obtain the drift and volatility of stock
price dynamics
κt ( α − 1)2 σt σt ′
µst = (1 − α) + ∥ ∥2 + µy + (1 − α)σy i ,
2α − 1 2 2α − 1 2α − 1 1
σt
σts = (1 − α) + σy i1 ,
2α − 1
∗ κt α2 σt σt ′
µs̃t = −α + ∥ ∥2 + µy∗ − ασy∗ i ,
2α − 1 2 2α − 1 2α − 1 2
∗ σt
σts̃ = −α + σy∗ i2 ,
2α − 1
˜
Combined with Eq. (A.11) and (A.15), we can solve κt , σt , νH,t , νF,t
simultaneously as function of exogenous state variables.
1
exp(mνH ,t ) = pαt [ p∗t exp(−et )]1−α exp(mνH ,t )
α α (1 − α )1− α
1 α ∗ 1− α p 0 γt α + π t γ ∗ ( 1 − α ) y 0
= p [ p exp (− e t )] exp (− δt )
α α (1 − α )1− α t t pt γ0 α + π0 γ∗ (1 − α) yt
∗ 1− α
γt α + π t γ ∗ ( 1 − α )
1 pt exp(−et ) y0
= α exp(−δt) p0
α (1 − α )1− α pt γ0 α + π0 γ∗ (1 − α) yt
γt α + π t γ ∗ ( 1 − α )
1 y0
= α 1 −
exp (( α − 1 ) tot t − δt ) ∗
p0 .
α (1 − α ) α γ0 α + π 0 γ ( 1 − α ) yt
It follows that
1 2
d exp(mνH ,t ) = exp(mνH ,t ) µm,t + ∥σm,t ∥ dt + exp(mνH ,t )σm,t dZt ,
2
where
κt 1 1 ||αγt ωi3 + (1 − α)γ∗ πt (ν̃F,t − νH,t )′ ||2
µm,t = −δ + (α − 1) − µy + σy2 − ,
2α − 1 2 2 (γt α + πt γ∗ (1 − α))2
σt αγt ωi3 + (1 − α)γ∗ πt (ν̃F,t − νH,t )′
σm,t = ( α − 1) − σy i1 + .
2α − 1 γt α + π t γ ∗ ( 1 − α )
1
exp(m∗ν∗ ,t ) = p∗α [ pt exp(et )]1−α exp(m∗ν∗ ,t )
F α α (1 − α )1− α t F
1 exp(e0 )
= p∗α [ pt exp(et )]1−α exp(mν̃F ,t )
α α (1 − α )1− α t exp(et )
1 ∗α 1−α exp( e0 ) p0∗ exp(−e0 )y0∗ π0 γt (1 − α) + πt γ∗ α
= p t [ p t exp ( e t )] exp (− δt )
α α (1 − α )1− α exp(et ) p∗t exp(−et )y∗t πt γ0 (1 − α) + π0 γ∗ α
pt exp(et ) 1−α p ∗ y ∗ π 0 γt ( 1 − α ) + π t γ ∗ α
1
= ∗ exp(−δt) 0 ∗ 0
α (1 − α )
α 1 − α pt yt πt γ0 (1 − α) + π0 γ∗ α
∗
γt ( 1 − α ) + π t γ ∗ α
1 y0 π0 ∗
= exp (( 1 − α ) tot t − δt ) p0 .
α α (1 − α )1− α γ0 (1 − α) + π0 γ∗ α y∗t πt
It follows that
1
d exp(m∗ν∗ ,t ) = exp(m∗ν∗ ,t ) µm∗ ,t + ∥σm ,t ∥ dt + exp(m∗ν∗ ,t )σm∗ ,t dZt
∗
2
F F 2 F
where
κt 1 1 1 ∥(1 − α)γt ωi3 + αγ∗ πt (ν̃F,t − νH,t )′ ∥2
µm∗ ,t = −δ + (1 − α) − µy∗ + σy2∗ + (∥νH,t ∥2 − ∥ν̃F,t ∥2 ) − ,
2α − 1 2 2 2 ( γt ( 1 − α ) + π t γ ∗ α ) 2
σt (1 − α)γt ωi3 + αγ∗ πt (ν̃F,t − νH,t )′
σm∗ ,t = (1 − α ) − σy∗ i2 − (ν̃F,t − νH,t )′ + .
2α − 1 γt ( 1 − α ) + π t γ ∗ α
We introduce a stochastic wedge ηt that reconciles the log change in
exchange rates with the domestic and foreign consumption-based
SDFs:
d(totcm cm cm
t ) = At dt + Bt dZt .
Matching the drift and volatility terms of the log exchange rate, we
have
Therefore,
It follows that
1 cm 2
d exp(mcm
t ) = exp(mcm
t ) µcm
m,t + ∥σm,t ∥ dt + exp(mcm cm
t ) σm,t dZt
2
where
1 2 1 α2 γt2 ω 2
µcm cm
m,t = − δ + ( α − 1) At − µy + σy −
2 2 (γt α + πt γ∗ (1 − α))2
cm αγt ωi3
σm,t = (α − 1) Btcm − σy i1 +
γt α + π t γ ∗ ( 1 − α )
1
exp(m∗t ,cm ) = p∗α [ pt exp(etcm )]1−α exp(m∗t ,cm )
α α (1 − α )1− α t
1 exp(e0cm ) p∗ exp(−e0cm )y0∗ γt (1 − α) + πγ∗ α
= α p∗t α [ pt exp(etcm )]1−α exp(−δt) 0∗
α (1 − α ) 1 − α exp(et )cm pt exp(−etcm )y∗t γ0 (1 − α) + πγ∗ α
pt exp(etcm ) 1−α p0∗ y0∗ γt (1 − α) + πγ∗ α
1
= α exp (− δt )
α (1 − α )1− α p∗t y∗t γ0 (1 − α) + πγ∗ α
∗
γt (1 − α) + πγ∗ α
1 y0
= α exp (( 1 − α ) tot cm
t − δt ) p0∗
α (1 − α )1− α γ0 (1 − α) + πγ∗ α y∗t
It follows that
1 cm 2
d exp(m∗ν∗,cm ) = exp ( m ∗,cm
∗ ) µ cm
∗
m ,t + ∥ σ ∗ ∥ dt + exp(m∗ν∗,cm )σmcm∗ ,t dZt
F ,t νF ,t 2 m ,t F ,t
where
1 (1 − α)2 γt2 ω 2
µcm cm
m∗ ,t = − δ + (1 − α ) At − µy∗ −
2 (γt (1 − α) + πγ∗ α)2
cm (1 − α)γt ωi3
σm ∗ ,t = (1 − α) Btcm − σy∗ i2 +
γt ( 1 − α ) + π t γ ∗ α
∗,cm
detcm = dmcm
t − dmt
1 = Et [exp(mt+1 + rt )] ,
1 = Et exp(m∗t+1 − ηt+1 + rt∗ ) ,
1 = Et exp(m∗t+1 + rt∗ ) ,
1 = Et [exp(mt+1 + ηt+1 + rt )] .
Recall
vart (∆et+1 ) = vart (mt+1 − m∗t+1 ) + vart (ηt+1 ) + 2covt (mt+1 − m∗t+1 , ηt+1 )
= vart (mt+1 − m∗t+1 ) − vart (ηt+1 ),
covt (mt+1 − m∗t+1 , ∆et+1 ) = vart (mt+1 − m∗t+1 ) + covt (mt+1 − m∗t+1 , ηt+1 )
= vart (mt+1 − m∗t+1 ) − vart (ηt+1 )
= vart (∆et+1 ).
covt (∆et+1 , mt+1 − m∗t+1 ) = covt ( xt + yt mt+1 + zt m∗t+1 + wt ε t+1 , mt+1 − m∗t+1 )
= covt (yt mt+1 + zt m∗t+1 , mt+1 − m∗t+1 ),
1
0 = Et [mt+1 ] + vart (mt+1 ) + rt ,
2
1 1
0 = Et [mt+1 ] + vart (mt+1 ) − Et [∆et+1 ] + vart (∆et+1 ) − covt (mt+1 , ∆et+1 ) + rt∗ ,
2 2
which implies
1
rpt = −covt (mt+1 , ∆et+1 ) + vart (∆et+1 ).
2
Similarly, Eq. (1.8) and (1.9) imply
1
rpt = −covt (m∗t+1 , ∆et+1 ) − vart (∆et+1 ).
2
1 1
rpt = − covt (mt+1 , ∆et+1 ) − covt (m∗t+1 , ∆et+1 )
2 2
1
= − covt (mt+1 + m∗t+1 , ∆et+1 )
2
1
= − covt (yt mt+1 + zt m∗t+1 , mt+1 + m∗t+1 ).
2
where we use the fact that by construction, covt (wt ε t+1 , mt+1 ) =
covt (wt ε t+1 , m∗t+1 ) = 0. w2t ≥ 0 leads to constraint Eq. (5.15).
The Euler equation Eq. (5.18) also implies
where the first two terms are zero from the Euler equations (5.18) and
(5.19). Hence,
(0) (1) (0) (2)
0 = covt [∆e0/2 0/1 0/1 0/2
t+1 , ∆et+1 − ( mt+1 − mt+1 )] + covt [ ∆et+1 , ∆et+1 − ( mt+1 − mt+1 )].
Proof. Assume the equity claims (i.e., endowment claims) and risk-
free bonds are tradable. Assume the endowment and demand shock
processes are exogenous and adapted to the filtration that is gener-
ated by some d-dimensional Brownian motion Zt . The set of tradable
assets does not span all the Brownian shocks. In each country i, the
households’ problem is given by
"Z ! #
T 1−α
α log ci,t + ∑
(i ) (i ) (i )
max E exp(−ρt)γt log c j,t dt
(i ) (i ) (i )
c ,c ,c 0 j ̸ =i
2
0,t 1,t 2,t
∑
(i ) (i ) (i ) (i ) (i ) (i ) (i ) (i ) (i ) ( j) i/j (i )
dwt = wt (rt + (xt )′ (µt − rt 1))dt + wt ((xt )′ σt )dZt − pt exp(−et )c j,t dt,
j∈{0,1,2}
( j)
where pt is the price of good j in the numéraire of country j’s con-
( j) (i ) 1− α (i ) i/j
sumption bundle ct = exp(α log ci,t + ∑ j̸=i 2 log c j,t ), and et
denotes the bilateral exchange rate, which takes a higher value if cur-
(i )
rency in country i is stronger. xt is the column vector of portfolio
(i )
weights for investors in country i, µt is the column vector of ex-
(i )
pected return of risky assets in country i’s numéraire, and σt is the
corresponding volatility matrix.
Following He and Pearson [1991], Cuoco and He [1994] and
Pavlova and Rigobon [2012], we can solve the system by transforming
the dynamic budget constraints to a static one, which is given by
Z T
(i ) (i ) (i )
E exp(mt )ct ≤ w0 , (A.28)
0
(i )
where exp(mt ) is the state price density in country i. The house-
holds’ Lagrangian is given by
Z T Z T
(i ) (i ) (i ) (i ) (i ) (i )
L=E exp(−ρt)γt log ct dt + ζ w0 − exp(mt )ct dt ,
0 0
∑
(i ) ( j)
s.t. c j,t = yt , j ∈ {0, 1, 2}.
i ∈{0,1,2}
When markets are complete, the Pareto weights are constant, i.e.
(i )
πt = π (i) . Plugging in Eq. (A.32) yields
(1) (2)
1− α π (1 ) γt π (2 ) γt 1− α
2 + (0) α+ (0) 2 (2)
π (0 ) γt π (0 ) γt yt
q1/2,cm
t = log ( 1) (2)
+ log (1)
.
1− α π (1 ) γt 1− α π (2 ) γt yt
2 + (0) 2 + (0) α
π (0 ) γt π (0 ) γt
1− α
(2) 1− α 2
Similarly, pt =α 2α exp(q2/1 2/3
t ) exp( qt ) . Recall that exp(q1/2
t )=
(1) (2)
pt exp(e1/2
t ) /pt , which implies
(1)
! 1− α
exp(q1/2 1/3 2
3(1 − α) 1/2
pt t ) exp( qt )
exp(q1/2 1/2
t ) exp(− et ) = = = exp qt .
pt
(2) exp(q2/1 2/3
t ) exp( qt )
2
Hence, the log bilateral exchange rate is linear in the log terms of
trade:
3α − 1 1/2
e1/2
t = qt ,
2
gt∗
Pt ct gt
zt ℓt = 1 + α + (1 − α ) ∗
PH,t ct ct
∗
Pt ct∗
gt∗ gt
∗ ∗
zt ℓt = ∗ 1 + α ∗ + (1 − α ) .
PF,t ct ct
By Wt = κPt ct ℓνt ,
Pt ct ρ − 1 −ν
= zt ℓt
PH,t κρ
ρ −1
Let ℓ̄ = ρκ . The market clearing equations become
g∗
gt
ℓ1t +ν = ℓ̄ 1 + α + (1 − α) ∗t
ct ct
By the market clearing equations again,
Pt ct
zt ℓ̄ℓ− ν
t =
PH,t
PH,t PH,t
ct = zt ℓ̄ℓ−
t
ν
= αα (1 − α)1−α ℓ̄ℓ− ν
t zt α 1− α
Pt PH,t PF,t
By price setting,
ρ Wt Pt ct ℓν
PH,t ρ −1 z t zt
t
z∗t (ℓ∗t )−ν
= ∗ = ∗ ∗ ∗ =
PF,t ρ W
exp(−Et ) ρ−1 z∗t
P c (ℓ )
exp(−Et ) t tz∗ t
ν
zt ℓ−t
ν
t t
so
gt∗
ν gt
log c∗t − log c̄ ≈ (1 − α) log zt + α log z∗t − ( α2 + (1 − α )2 ) ∗ + 2(1 − α ) α ,
1+ν ct ct
def Pt∗
−et = log exp(−Et ) = log ct − log c∗t
Pt
gt∗
∗ ν 2 gt
≈ (2α − 1)(log zt − log zt ) − (2α − 1) − ∗ ,
1+ν ct ct
Proof. When prices are sticky, the model is closed by the following
equations for prices:
ρ Wt ∗ ρ Wt
PH,t = Et −1 exp(−Et ) PH,t = Et −1 (A.33)
ρ−1 zt ρ−1 zt
∗ ∗
ρ Wt ∗ ρ W
exp(Et ) PF,t = E PF,t = E t
(A.34)
ρ − 1 t−1 z∗t ρ − 1 t−1 z∗t
g∗
Pt ct gt
zt ℓt = 1 + α + (1 − α) ∗t (A.35)
PH,t ct ct
gt∗ gt∗
Pt ct gt Pt ct /zt gt
ℓt = 1 + α + (1 − α ) ∗ = ℓ̄ 1 + α + (1 − α) ∗
PH,t zt ct ct Et−1 [ Pt ct ℓνt /zt ] ct ct
Then
which implies
where
ℓ̄αα (1 − α)1−α
κtc−1 = log
(Et−1 [ Pt ct ℓνt /zt ])α (Et−1 [ Pt∗ c∗t (ℓ∗t )ν /z∗t ])1−α
ℓ̄αα (1 − α)1−α
κtc−1 = log
(Et−1 [ Pt ct ℓt /zt ])1−α (Et−1 [ Pt∗ c∗t (ℓ∗t )ν /z∗t ])α
ν
∗
κte−1 = κtc−1 − κtc−1
similarly,
" #!
∗ 1 1
log c∗t = κtc−1 −α it∗ + log Et δ ∗ ∗ − (1 − α) it + log Et δ
Pt+1 ct+1 Pt+1 ct+1
∗
= κ̃tc−1 − αit∗ − (1 − α)it
where
" #
1 1
κ̃tc−1 = κtc−1
− α log Et δ − (1 − α) log Et δ ∗ ∗
Pt+1 ct+1 Pt+1 ct+1
" #
∗ c∗ 1 1
κ̃tc−1 = κt−1 − α log Et δ ∗ ∗ − (1 − α) log Et δ
Pt+1 ct+1 Pt+1 ct+1
rt = r̄ + αit + (1 − α)it∗ .
where
Proof. From Proposition 6.4, plug the last two equations into the two
above:
! 1− α
α
h 1 i ct 1 c∗
s∗ t
s t + c t Et ∑ ∞ k−t sk
k = t +1 β ck s∗t +c∗t Et ∑∞
k = t +1 β k−t k
c ∗
ct = ℓ̄αα (1 − α)1−α !α
k
1− α
Et −1 [ h 1
k−t sk
i ct ℓν /zt ]
t
Et −1 [ 1
s∗
c∗ (ℓ∗ )ν /z∗ ]
t t t
s t + c t Et ∑ ∞
k = t +1 β c s∗t +c∗t Et ∑∞ β k−t k
k k = t +1 c∗
k
!1− α α
h 1 i ct 1 c∗
s∗ t
k−t sk
s t + c t Et ∑ ∞
k = t +1 β ck s∗t +c∗t Et ∑∞ βk−t ck∗
k = t +1
c∗t = ℓ̄αα (1 − α)1−α !1− α
k
α .
Et −1 [ h 1
k−t sk
i ct ℓν /zt ]
t
Et −1 [ 1
s∗
c∗ (ℓ∗ )ν /z∗ ]
t t t
s t + c t Et ∑ ∞
k = t +1 β ck s∗t +c∗t Et ∑∞ βk−t ck∗
k = t +1
k
We conjecture that ct and c∗t are functions of st and s∗t only. Then,
by the i.i.d. assumption of the
i government surpluses,i the expecta-
s∗
h h
tion terms Et ∑∞
k = t +1 β
k −t sk and E
ck
∞
t ∑ k = t +1 β
k −t k are constants,
c∗ k
ℓ̄αα (1 − α)1−α ct c∗
log ct = log ℓν c∗ (ℓ∗ )ν
+ α log( ) + (1 − α) log( ∗ t ∗ ∗ )
(Et−1 [ st +cct t A ztt ])α (Et−1 [ s∗ +ct∗ A∗ zt∗ ])1−α st + ct A st + ct A
t t t
ℓ̄α (1 − α)
α 1 − α ct c∗
log c∗t = log ℓν c∗ (ℓ∗ )ν
+ (1 − α) log( ) + α log( ∗ t ∗ ∗ )
(Et−1 [ st +cct t A ztt ])1−α (Et−1 [ s∗ +ct∗ A∗ zt∗ ])α st + ct A st + ct A
t t t
where
The log real exchange rate −et = log ct − log c∗t . The log price level
is
where
1 c̄A
κ P = − log(s̄ + c̄A) − (−s̄) − (κ − log c̄) .
s̄ + c̄A s̄ + c̄A
Proof. Divide the first condition by the third, and the second by the
fourth to get:
∗
α c F,t 1 − α c F,t µ def
= = H,t = exp(tott )
1 − α c H,t α c∗H,t µ F.t
then, plug in the equations above into the market clearing conditions
to get the consumption rule
α −1 −α
1−α
α
exp(tott ) ct + exp(tott ) c∗t = yt
α 1−α
−α α −1
1−α
α
exp(−tott ) ct + exp(−tott ) c∗t = y∗t
1−α α
which implies that exp(tott ) is actually the relative price between two
goods. Dividing the first condition by the second one to pin down
the expression of exp(tott ):
Similarly,
which implies
−(λ∗H,t − λ H,t ) = Et [m∗t+1 − mt+1 ] + Et [∆et+1 ] = Et [γ(∆ log ct+1 − ∆ log c∗t+1 )] + Et [∆et+1 ]
⇒ et = (λ∗H,t − λ H,t ) + Et [γ(∆ log ct+1 − ∆ log c∗t+1 )] + et+1 .
Let lim j→∞ et+ j+1 = ē. Plugging ∆ct+1 = log ct+1 − log ct yields
∞ ∞
et − ē = ∑ (λ∗H,t+ j − λ H,t+ j ) + ∑ Et [γ(log ct+ j+1 − log ct+ j − log c∗t+ j+1 + log c∗t+ j )]
j =0 j =0
∞
= ∑ (λ∗H,t+ j − λ H,t+ j ) − γ(log ct − log c∗t ) + γ(log c̄ − log c̄∗ )
j =0
where we let c̄ = lim j→∞ ct+ j+1 , c̄∗ = lim j→∞ c∗t+ j+1 . This yields Eq.
(7.4). Finally, recall that
similarly,
k
Dt = − ∑ St− j exp( RtD− j→t ) + Dt−k−1 exp( RtD−k−1→t )
j =0
where RtD− j→t = RtD + RtD−1 + · · · + RtD− j+1 and RtD→t = 0. Let k = t − 1
to get Eq. (8.2):
t −1
Dt = − ∑ St− j exp( RtD− j→t ) + D0 exp( R0D→t )
j =0
t −1 S
Dt t− j Yt− j D0 Y0
⇒ =−∑ exp( RtD− j→t ) + exp( R0D→t )
Yt j =0
Y t − j Yt Y0 Yt
t −1St − j D0
=−∑ exp( RtD− j→t − Xt− j→t ) + exp( R0D→t − X0→t )
Y
j =0 t − j
Y0
Proof. First, let us assume that the government never defaults. Con-
sider the time t + 1 constraints,
H H
∑ Qt (h) Pt+1 (h − 1) = St+1 + ∑ Qt+1 (h) Pt+1 (h)
h =1 h =1
multiply both sides by nominal SDF exp( Mt+1 ) and take expectations
conditional on time t:
" #
H H
∑ Qt (h) Pt (h) = Et [exp( Mt+1 )St+1 ] + Et exp( Mt+1 ) ∑ Qt+1 (h) Pt+1 (h)
h =1 h =1
(A.36)
where we use the asset pricing equations Et [exp( Mt+1 )] = Pt (1), and
Et [exp( Mt+1 ) Pt+1 (h − 1)] = Pt (h).
Similarly, the time t + 2 constraints imply:
" #
H H
∑ Qt+1 (h) Pt+1 (h) = Et+1 [exp( Mt+1,t+2 )St+2 ] + Et+1 exp( Mt+1,t+2 ) ∑ Qt+2 (h) Pt+2 (h)
h =1 h =1
multiply both sides by nominal SDF exp( Mt+1 ) and take expectations
conditional on time t:
" #
H H
∑ Qt (h) Pt (h) = Et [exp( Mt+1 )St+1 ] + Et exp( Mt+1 ) ∑ Qt+1 (h) Pt+1 (h)
h =1 h =1
where we use the asset pricing equations Et [exp( Mt+1 )(1 − χt+1 )] =
Pt (1), and Et [exp( Mt+1 ) Pt+1 (h − 1)(1 − χt+1 )] = Pt (h).
Again, we reach the condition in Eq. (A.36). Iterate forward the
same way in no default case to time n → ∞ and we will get Eq. (8.6)
as long as the government does not default in time t. Since bond
payment is not certain any more, the money that government can
raise from 1% face value of bond drops (Pt decrease).
Proof. We present the proof for general case with default. Consider
the time t + 1 constraint,
H H
(1 − χ t +1 ) ∑ Qt (h) Pt+1 (h − 1) = St+1 + ∑ Qt+1 (h) Pt+1 (h)
h =1 h =1
multiply both sides by nominal SDF exp( Mt+1 ) and take expectations
conditional on time t:
" #
H H
∑ Qt (h) Pt (h)e −λt (h)
= Et [exp( Mt+1 )St+1 ] + Et exp( Mt+1 ) ∑ Qt+1 (h) Pt+1 (h)
h =1 h =1
H H
∑ Qt (h) Pt (h) = Et [exp( Mt+1 )St+1 ] + ∑ Qt (h) Pt (h) 1 − e−λt (h)
h =1 h =1
" #
H
+ Et exp( Mt+1 ) ∑ Qt+1 (h) Pt+1 (h) (A.38)
h =1
where we use the asset pricing equations Et [exp( Mt+1 )(1 − χt+1 )] =
Pt (1)e−λt (1) , and Et [exp( Mt+1 ) Pt+1 (h − 1)(1 − χt+1 )] = Pt (h)e−λt (h) .
Similarly, consider the time t + 2 budget constraint. Multiply
both sides by nominal SDF exp( Mt+1,t+2 ) and take expectations
conditional on time t + 1:
H H
∑ Qt+1 (h) Pt+1 (h) = Et+1 [exp( Mt+1,t+2 )St+2 ] + ∑ Qt+1 (h) Pt+1 (h) 1 − e−λt+1 (h)
h =1 h =1
" #
H
+ Et+1 exp( Mt+1,t+2 ) ∑ Qt+2 (h) Pt+2 (h)
h =1
H H
∑ Qt+ℓ (h) Pt+ℓ (h) = Et+ℓ [exp( Mt+ℓ,t+ℓ+1 )St+ℓ+1 ] + ∑ Qt+ℓ (h) Pt+ℓ (h) 1 − e−λt+ℓ (h)
h =1 h =1
" #
H
+ Et+ℓ exp( Mt+ℓ,t+ℓ+1 ) ∑ Qt+ℓ+1 (h) Pt+ℓ+1 (h)
h =1
Proof. By the definition of exp( RtD+1 ), exp( RtT+1 ) and exp( RtG+1 ):
PtT exp( RtT+1 ) − PtG exp( RtG+1 ) = PtT+1 + Tt+1 − PtG+1 − Gt+1 = Dt exp( RtD+1 )
Divide both sides by Dt and take expectations to get part (a). Simi-
larly,
Proof. Multiply both sides of Eq. (8.12) by Dt /( PtT + PtK − PtG ) and
plug in Dt = PtT + PtK + Kt − PtG to obtain
!
D Kt
exp( Rt+1 ) 1 + T
Pt + PtK − PtG
PtT PtK PtG
= exp( RtT+1 ) + exp ( R K
t +1 ) − exp ( R G
t +1 ) .
PtT + PtK − PtG PtT + PtK − PtG PtT + PtK − PtG
Taking conditional expectation Et for both sides yields part (a).
To get part (b), take conditional covariance with market return
exp( RtM+1 ) and divide by its variance for both sides
!
covt (exp( RtD+1 , exp( RtM+1 ))) Kt
1+ T
vart (exp( RtM+1 )) Pt + PtK − PtG
covt (exp( RtT+1 , exp( RtM+1 ))) PtT
=
vart (exp( RtM+1 )) PtT + PtK − PtG
covt (exp( RtK+1 , exp( RtM+1 ))) PtK
+
vart (exp( RtM+1 )) PtT + PtK − PtG
covt (exp( RtG+1 , exp( RtM+1 ))) PtG
− ,
vart (exp( RtM+1 )) PtT + PtK − PtG
which implies
!
Kt PtT PtK PtG
βD
t 1+ T = β Tt + β K
t − β G
t ,
Pt + PtK − PtG PtT + PtK − PtG PtT + PtK − PtG PtT + PtK − PtG
where we plug in the expression of beta defined above.
Hence,
ψ(wt + ∆w, t) − ψ(wt , t)
lim ≥ δt u′ (ct )
∆w→0 ∆w
This implies that ψ′ (wt , t) ≥ δt u′ (ct ). Now, take ∆w sufficiently small
such that 0 < ∆w < ct . Define a feasible process ({c∗s }∞ ∗ ∞
s=t , { ds }s=t )
∗ ∗ ∗
from wt − ∆w, such that ct = ct − ∆w, dt = dt , cs = cs , ds = ds ,∗
s = t + 1, t + 2, . . .. Similarly,
∞
" #
ψ(wt − ∆w, t) ≥ Et ∑ δs u(c∗s )
s=t
∞ ∞
" # " #
⇒ ψ(wt − ∆w, t) − ψ(wt , t) ≥ Et ∑ δs u(c∗s ) − Et ∑ δs u(cs )
s=t s=t
Hence,
ψ(wt − ∆w, t) − ψ(wt , t)
lim ≤ δt u′ (ct )
∆w→0 −∆w
This implies that ψ′ (wt , t) ≤ δt u′ (ct ), hence ψ′ (wt , t) = δt u′ (ct ).
then
h i h i
0 = lim E0 δt u′ (ct )dt−1 exp(rtd ) = lim E0 δt−1 u′ (ct−1 )dt−1 ,
t→∞ t→∞
Proof. First, We derive the Euler equation with utility from holding
the bonds. The Lagrangian is:
∞ ∞
" #
E0 ∑ δt (u(ct ) + vt (dt )) + ∑ ζ t (yt + dt−1 exp(rtD ) − ct − dt )
t =0 t =0
δt u′ (ct ) − ζ t = 0
δt v′t (dt ) − ζ t + Et [ζ t+1 exp(rtD+1 )] = 0
hence,
ψ′ ( wt , t ) = δt u′ ( ct )
hence,
exp( pdm m
t−1 ( h + 1)) = Et [exp( Mt + pdt ( h ) + ∆ log Divt )],
1
Am (h + 1) + Bm (h + 1)′ zt−1 = −it−1 (1) − Λ′t Λt + Am (h) + Bm (h)′ Ψzt−1 + x0 + π0 + (e∆d + ex + eπ )′ Ψzt−1
2
1 1
+ Λ′t Λt + ( Bm (h) + e∆d + ex + eπ )′ Σ( Bm (h) + e∆d + ex + eπ )
2 2
1
− ( Bm (h) + e∆d + ex + eπ )′ Σ 2 Λt
1
= −i0 (1) + Am (h) + x0 + π0 + ( Bm (h) + e∆d + ex + eπ )′ Σ( Bm (h) + e∆d + ex + eπ )
2
1
− ( Bm (h) + e∆d + ex + eπ )′ Σ 2 Λ0
1
+ (−ei′ + ( Bm (h) + e∆d + ex + eπ )′ Ψ − ( Bm (h) + e∆d + ex + eπ )′ Σ 2 Λ1 )zt−1 .
1
Am (h + 1) = −i0 (1) + Am (h) + x0 + π0 + ( Bm (h) + e∆d + ex + eπ )′ Σ( Bm (h) + e∆d + ex + eπ )
2
1
− ( Bm (h) + e∆d + ex + eπ )′ Σ 2 Λ0 ,
1
Bm (h + 1)′ = −ei′ + ( Bm (h) + e∆d + ex + eπ )′ Ψ − ( Bm (h) + e∆d + ex + eπ )′ Σ 2 Λ1 .
and
1 1
lim A(h)/h = −i0 (1) + B(∞)′ ΣB(∞) − B(∞)′ Σ 2 Λ0 ,
h→∞ 2
1
−1
′ ′ ′
B(∞) = lim B(h) = −ei I − (Ψ − Σ 2 Λ1 )
h→∞
Et [log Pt+1 (h − 1) − log Pt (h)] = A(h − 1) + B(h − 1)′ Ψzt − A(h) − B(h)′ zt
1 1
= i0 (1) − B(h − 1)′ ΣB(h − 1) + B(h − 1)′ Σ 2 Λ0
2
+ B(h − 1)′ Ψzt − B(h)′ zt .
Letting h → ∞ yields
1 1
lim Et [log Pt+1 (h − 1) − log Pt (h)] = i0 (1) − B(∞)′ ΣB(∞) + B(∞)′ Σ 2 Λ0 + B(∞)′ (Ψ − I )zt ,
h→∞ 2
1 1
lim it (h) = i0 (1) − B(∞)′ ΣB(∞) + B(∞)′ Σ 2 Λ0 .
h→∞ 2
(b) Recall that
1
Am (h + 1) = −i0 (1) + Am (h) + x0 + π0 + ( Bm (h) + e∆d + ex + eπ )′ Σ( Bm (h) + e∆d + ex + eπ )
2
1
− ( Bm (h) + e∆d + ex + eπ )′ Σ 2 Λ0 ,
1
Bm (h + 1)′ = −ei′ + ( Bm (h) + e∆d + ex + eπ )′ Ψ − ( Bm (h) + e∆d + ex + eπ )′ Σ 2 Λ1 .
Let h → ∞ to obtain
Proof. Substitute the n f at , ibt , cgt expressions into Eq. (9.4) to get:
at ∑ x F,t (ι) − a∗t ∑ x ∗H,t (ι) − at−1 ∑ x F,t−1 (ι) + a∗t−1 ∑ x ∗H,t−1 (ι)
ι ι ι ι
= tbt + at−1 ∑ x F,t−1 (ι)d∗t (ι) − a∗t−1 ∑ w∗H,t−1 (ι)dt (ι) + at−1 ∑ x F,t−1 (ι)ρ∗t (ι) − a∗t−1 ∑ x ∗H,t−1 (ι)ρt (ι)
ι ι ι ι
tbt = at − ∑ mt (ι) − at−1 ∑ x F,t−1 (ι)(1 + d∗t (ι) + ρ∗t (ι)) + a∗t−1 ∑ x ∗H,t−1 (1 + dt (ι) + ρt (ι)).
ι ι ι
note that households consume all their wealth in the last period, i.e.
1− γ
wt
ψt (wt , zt ) = δt
1−γ
f T (z T ) = 1.
Plug the conjectured expression of ψt+1 (wt+1 , zt+1 ) into the recur-
sive form of ψt (wt , zt ) to get
1− γ 1− γ
( " #)
c w
ψt (wt , zt ) = max δt t + Et δ t +1 f t +1 ( z t +1 ) t +1
ct ,xt 1−γ 1−γ
1− γ
((wt − ct ) xt′ exp(rt+1 ))1−γ
( " #)
t ct t +1
= max δ + Et δ f t +1 ( z t +1 )
ct ,xt 1−γ 1−γ
1− γ 1− γ
( " #)
c ( x ′ exp(r ))
t + 1
= max δt t + δ t +1 ( w t − c t )1− γ Et f t +1 ( z t +1 ) t
ct ,xt 1−γ 1−γ
( γ −1
( xt′ exp(rt+1 ))1−γ
" #)
t 1− γ wct 1 1− γ
= δ wt max + δ (1 − ) Et f t +1 ( z t +1 )
wct ,xt 1−γ wct 1−γ
since δ(1 − wc1 t )1−γ is always positive and xt only enters into the
expectation term in the objective function, the optimal portfolio can
be solved for independently of the optimal consumption, i.e.
( xt′ exp(rt+1 ))1−γ
xt = arg max Et f t+1 (zt+1 )
xt 1−γ
which gives Eq. (9.9). For now, denote
( x ′ exp(rt+1 ))1−γ
gt (zt ) = max Et f t+1 (zt+1 ) t
xt 1−γ
such that 1 = xt′ 1. Plug gt (zt ) back into ψt (wt , zt ) to get
( γ −1 )
1 − wc 1
ψt (wt , zt ) = δt wt max
γ t
+ δ (1 − )1− γ gt ( z t )
wct 1−γ wct
Proof. When the asset returns are i.i.d., the optimal portfolio choice is
given by
h i
p
xt = arg max E exp((1 − γ)rt+1 )
xt
h i 1
p p
= arg max exp(E (1 − γ)rt+1 + var ((1 − γ)rt+1 ))
xt 2
h i 1
p p
= arg max E (1 − γ)rt+1 + var ((1 − γ)rt+1 )
xt 2
p
plug rt+1 in to get
h
p
i 1 p f 1 2 f 1 ′ 1
E (1 − γ ) r t +1 ′
+ var ((1 − γ)rt+1 ) = (1 − γ) xt (Et [rt+1 − rt ] + σt ) + rt − xt Σt xt + (1 − γ)2 xt′ Σt xt
2 2 2 2
f 1
Et [rt+1 − rt ] + σt2 − γΣxt = 0
2
Hence,
f 1 2
x t = γ −1 Σ − 1
E [ r t +1 − r ] + σ .
t t 2 t
Et [∆et+1 + rt − rt∗ ]
= exp(r̄ )nψt − exp(ē + r̄ ∗ − r̄ )b H,t exp(rt )
ωvart (∆et+1 )/m
b exp(rt )
⇒ Et [∆et+1 + rt − rt∗ ] = χ1 ψt − χ2 H,t ,
Ȳ
Proof. First, subtract the first first-order condition by the second and
plug in the consumption aggregation equations to get:
straints to get:
αĉ H,t (i ) + (1 − α)ĉ F,t (i ) = τ̂t
αĉ∗F,t (i ) + (1 − α)ĉ∗H,t (i ) = τ̂t∗
ĉ H,t (i ) − ĉ F,t (i ) = ĉ H,t ( a) − ĉ F,t ( a)
ĉ∗H,t (i ) − ĉ∗F,t (i ) = ĉ∗H,t ( a) − ĉ∗F,t ( a)
taken active households’ consumption as given, the 4 simultaneous
equations solve for inactive households’ consumption:
ĉ H,t (i ) = τ̂t + (1 − α)ĉ H,t ( a) − (1 − α)ĉ F,t ( a)
ĉ F,t (i ) = τ̂t − αĉ H,t ( a) + αĉ F,t ( a)
ĉ∗H,t (i ) = τ̂t∗ + αĉ∗H,t ( a) − αĉ∗F,t ( a)
ĉ∗F,t (i ) = τ̂t∗ − (1 − α)ĉ∗H,t ( a) + (1 − α)ĉ∗F,t ( a)
plug into the social planner’s resource constraints to get
ȳŷt = ϕ(c̄ H ( a)ĉ H,t ( a) + c̄∗H ( a)ĉ∗H,t ( a))
+ (1 − ϕ)(c̄ H (i )(τ̂t + (1 − α)ĉ H,t ( a) − (1 − α)ĉ F,t ( a)) + c̄∗H (i )(τ̂t∗ + αĉ∗H,t ( a) − αĉ∗F,t ( a)))
(A.42)
ȳ∗ ŷ∗t = ϕ(c̄ F ( a)ĉ F,t ( a) + c̄∗F ( a)ĉ∗F,t ( a))
+ (1 − ϕ)(c̄ F (i )(τ̂t − αĉ H,t ( a) + αĉ F,t ( a)) + c̄∗F (i )(τ̂t∗ − (1 − α)ĉ∗H,t ( a) + (1 − α)ĉ∗F,t ( a)))
(A.43)
Recast Eq. (A.40), Eq. (A.41), Eq. (A.42) and Eq. (A.43) to solve for
ĉ H,t ( a), ĉ F,t ( a), ĉ∗H,t ( a) and ĉ∗F,t ( a) simultaneously:
Proof. First, we show the steady state under in special case where
π = 1/2, ȳ = ȳ∗ = 1, τ̄ = τ̄ ∗ = αα (1 − α)1−α . Note that the
two countries are perfectly symmetric in the steady state. Hence,
c̄ H,t (.) = c̄∗F,t (.), c̄ F,t (.) = c̄∗H,t (.). Plug this equality into the active
households’ first-order conditions and divide the first first-order
condition by the second one to get:
c̄ F,t ( a) 1−α
= .
c̄ H,t ( a) α
c̄ F,t (i ) 1−α
= .
c̄ H,t (i ) α
which implies
1 1 ∗
ĉt ( a) = (ŷt − (1 − ϕ)τ̂t ), ĉ∗t ( a) = (ŷ − (1 − ϕ)τ̂t∗ ).
ϕ ϕ t
which implies
γ γ
êt = − (ŷt − ŷ∗t ) + (1 − ϕ)(τ̂t − τ̂t∗ ),
ϕ ϕ
ĉ∗t − ĉt = −(ŷt − ŷ∗t ).
where
Hence,
Leif Andersen, Darrell Duffie, and Yang Song. Funding value adjust-
ments. The Journal of Finance, 74(1):145–192, 2019.
Stefan Avdjiev, Wenxin Du, Catherine Koch, and Hyun Song Shin.
The dollar, bank leverage, and deviations from covered interest
parity. American Economic Review: Insights, 1(2):193–208, 2019.
Kerry Back. Asset pricing and portfolio choice theory. Oxford University
Press, 2010.
Ravi Bansal and Ivan Shaliastovich. Risk and return in bond, cur-
rency and equity markets. Manuscript, Duke University, 2007.
Valentina Bruno and Hyun Song Shin. Capital flows and the risk-
taking channel of monetary policy. Journal of monetary economics, 71:
119–132, 2015a.
Ricardo J Caballero and Emmanuel Farhi. The safety trap. The Review
of Economic Studies, 85(1):223–274, 2018.
John Y Campbell and Richard H Clarida. The dollar and real inter-
est rates. In Carnegie-Rochester Conference Series on Public Policy,
volume 27, pages 103–139. Elsevier, 1987.
YiLi Chien and Kanda Naknoi. The risk premium and long-run
global imbalances. Journal of Monetary Economics, 76:299–315, 2015.
YiLi Chien, Hanno Lustig, and Kanda Naknoi. Why are exchange
rates so smooth? a household finance explanation. Journal of Mone-
tary Economics, 112:129–144, 2020.
John H Cochrane. The fiscal theory of the price level. Princeton Univer-
sity Press, 2023.
Ric Colacito, Max Croce, Steven Ho, and Philip Howard. Bkk the
ez way: International long-run growth news and capital flows.
American Economic Review, 108(11):3416–49, 2018b.
Riccardo Colacito and Mariano M Croce. Risks for the long run and
the real exchange rate. Journal of Political economy, 119(1):153–181,
2011.
Michael B Devereux, Charles Engel, and Steve Pak Yeung Wu. Col-
lateral advantage: Exchange rates, capital flows and global cycles.
Technical report, National Bureau of Economic Research, 2023.
Wenxin Du and Jesse Schreger. Cip deviations, the dollar, and fric-
tions in international capital markets. In Handbook of International
Economics, volume 6, pages 147–197. Elsevier, 2022a.
Wenxin Du, Joanne Im, and Jesse Schreger. The us treasury premium.
Journal of International Economics, 112:167–181, 2018a.
Darrell Duffie. Still the world’s safe haven? redesigning the us trea-
sury market after the covid-19 crisis. 2020.
Charles Engel. Exchange rates, interest rates, and the risk premium.
American Economic Review, 106(2):436–474, 2016.
Larry G. Epstein and Stanley E. Zin. Risk aversion, and the temporal
behavior of consumption and asset returns: a theoretical frame-
work. Econometrica, 57(4):937, 1989.
Xiang Fang, Bryan Hardy, and Karen K Lewis. Who holds sovereign
debt and why it matters. Technical report, National Bureau of
Economic Research, 2022.
Paul Fontanier. Dollar debt and the inefficient global financial cycle.
2023.
Jordi Galí. Monetary policy, inflation, and the business cycle: an intro-
duction to the new Keynesian framework and its applications. Princeton
University Press, 2015.
Gita Gopinath and Jeremy C Stein. Banking, trade, and the making
of a dominant currency. The Quarterly Journal of Economics, 136(2):
783–830, 2021.
George J Hall and Thomas J Sargent. Interest rate risk and other
determinants of post-wwii us government debt/gdp dynamics.
American Economic Journal: Macroeconomics, 3(3):192–214, 2011.
Tarek A Hassan and Rui C Mano. Forward and spot exchange rates
in a multi-currency world. The Quarterly Journal of Economics, 134
(1):397–450, 2019.
Zhiguo He, Bryan Kelly, and Asaf Manela. Intermediary asset pric-
ing: New evidence from many asset classes. Journal of Financial
Economics, 126(1):1–35, 2017.
Olivier Jeanne and Andrew K Rose. Noise trading and exchange rate
regimes. The Quarterly Journal of Economics, 117(2):537–569, 2002.
Scott Joslin, Wenhao Li, and Yang Song. The term structure of liq-
uidity premium. USC Marshall School of Business Research Paper,
2021.
Rohan Kekre and Moritz Lenel. The flight to safety and interna-
tional risk sharing. Technical report, National Bureau of Economic
Research, 2021.
Sun Yong Kim. The dollar, fiscal policy and the us safety puzzle.
Available at SSRN 4204972, 2022.
Ralph SJ Koijen and Motohiro Yogo. Exchange rates and asset prices
in a global demand system. Technical report, National Bureau of
Economic Research, 2020.
Gordon Liao and Tony Zhang. The hedging channel of exchange rate
determination. International finance discussion paper, (1283), 2020.
Yang Liu, Lukas Schmid, and Amir Yaron. The risks of safe assets.
Available at SSRN 3699618, 2020.
Jun Pan and Kenneth J Singleton. Default and recovery implicit in the
term structure of sovereign cds spreads. The Journal of Finance, 63
(5):2345–2384, 2008.
Anna Pavlova and Roberto Rigobon. Asset prices and exchange rates.
The Review of Financial Studies, 20(4):1139–1180, 2007.
Robert Ready, Nikolai Roussanov, and Colin Ward. After the tide:
Commodity currencies and global trade. Journal of Monetary Eco-
nomics, 85:69–86, 2017a.
Hélène Rey. Dilemma not trilemma: the global financial cycle and
monetary policy independence. Technical report, National Bureau
of Economic Research, 2015.
Juliana Salomao and Liliana Varela. Exchange rate exposure and firm
dynamics. The Review of Economic Studies, 89(1):481–514, 2022.
Robert Triffin. Gold and the dollar crisis: the future of convertibility,
volume 39. New Haven: Yale University Press, 1960.
Haonan Zhou. The fickle and the stable: Global financial cycle trans-
mission via heterogeneous investors. Available at SSRN 4616182,
2023.