Professional Documents
Culture Documents
Mathematical Finance: Hiroshi Toyoizumi September 26, 2012
Mathematical Finance: Hiroshi Toyoizumi September 26, 2012
Hiroshi Toyoizumi 1
1 E-mail: toyoizumi@waseda.jp
Contents
1 Introduction 6
5 Easy Money 23
5.1 Virtual Market . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
5.1.1 Risk Neutralization . . . . . . . . . . . . . . . . . . . . . 24
5.1.2 No Easy Money? . . . . . . . . . . . . . . . . . . . . . . 25
5.2 Ideal Dynamics . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
5.3 Duplication and Law of One Price . . . . . . . . . . . . . . . . . 28
5.4 Arbitrage Theorem . . . . . . . . . . . . . . . . . . . . . . . . . 30
2
CONTENTS 3
5.5 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
5.6 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
9 Stochastic Integral 65
9.1 Diffusion Process . . . . . . . . . . . . . . . . . . . . . . . . . . 65
9.2 Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
9.3 Definition of Stochastic Integral . . . . . . . . . . . . . . . . . . 68
9.4 Martingale . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
9.5 Calculus of Stochastic Integral . . . . . . . . . . . . . . . . . . . 70
11 Differential Equations 77
11.1 Ordinary Differential Equation . . . . . . . . . . . . . . . . . . . 77
11.2 Geometric Brownian Motion . . . . . . . . . . . . . . . . . . . . 78
11.3 Stochastic Process and Partial Differential Equation . . . . . . . . 79
12 Portfolio Dynamics 81
12.1 Portfolio Model . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
12.2 Rate of Return of Stock and Risk-free Asset . . . . . . . . . . . . 83
12.3 Arbitrage and Portfolio . . . . . . . . . . . . . . . . . . . . . . . 83
T. Bjork. Arbitrage Theory in Continuous Time. Oxford Finance. Oxford Univ Pr,
2nd edition, 2004.
5
Chapter 1
Introduction
we will learn the advanced probability models and its evaluation related to Math-
ematical Finance.
Especially, we pick up the risk management, portfolio, and evaluation of deriva-
tives. Also, we will even learn the basics of the advanced mathematics such as
geometric Brownian motion, Black-Scholes formula and Ito-calculus.
6
Chapter 2
E[X] = µ, (2.2)
Var[X] = σ 2 . (2.3)
7
8 CHAPTER 2. NORMAL RANDOM VARIABLES
Proof. omit.
Lemma 2.1. Let X be a normal random variable with its mean µ and standard
deviation σ . Set Z = (X − µ)/σ . Then, Z is the standard normal random variable.
Theorem 2.2. Let (Xi )i=1,2,...,n are independent normal random variables with
its mean µi and standard deviation σi . Then the sum of these random variables
X = ∑ni=1 Xi is again a normal random variable with
n
µ = ∑ µi , (2.4)
i=1
n
σ 2 = ∑ σi2 . (2.5)
i=1
Also, we have
" #
n n n
σ 2 = E[X] = Var ∑ Xi = ∑ Var [Xi] = ∑ σi2.
i=1 i=1 i=1
Y = eX , (2.6)
Remark 2.1. It is important to see that although the mean of the lognormal random
variable is subjected to not only the mean of the original normal random variable
but also the standard deviation.
Problem 2.2. Explain why the mean E[Y ] is larger than the “expected” mean
eE[X] .
Proof. Let us assume X is the standard normal random variable, for a while. Let
m(t) be the moment generation function of X, i.e.,
where the integrand of the right hand side is nothing but the density of the normal
random variable N(t, 1). Thus,
2 /2
m(t) = et .
More generally, when X is a normal random variable with N(µ, σ ), we can obtain
2 t 2 /2
m(t) = E[etX ] = eµt+σ . (2.11)
where if X(n) = logY (n) is regarded as the error term and normally distributed,
the above assumption is validated.
Example 2.1 (Stock price rises in two weeks in a row). Suppose Y (n) is the
growth rate of a stock at the n-th week, which is independent and lognormally
distributed with the parameters µ and σ . We will find the probability that the
stock price rises in two weeks in a row.
First, we will estimate P{the stock rises}. Since it is equivalent that y > 1 and
log y > 0, we have
X −µ
Z= , (2.17)
σ
as the standard normal distribution. Hence, we have
X −µ 0−µ
P{S(1) > S(0)} = P >
σ σ
= P{Z > −µ/σ }
= P{Z < µ/σ },
where we used the symmetry of the normal distribution (see exercise 2.1).
Now we consider the probability of two consecutive stock price rise. Since
Y (n) is assumed to be independent, we have
12 CHAPTER 2. NORMAL RANDOM VARIABLES
P{the stock rises two week in a row} = P{Y (1) > 1,Y (2) > 1}
= P{Y (1) > 0}P{Y (2) > 0}
= P{Z < µ/σ }2
Problem 2.4. 1. Derive the probability that stock price down two days in a
row.
2. Derive the probability that stock price up at the first and down at the second
day.
2.4 References
2.5 Exercises
Exercise 2.1. Prove the symmetry of the normal distribution. That is, let X be the
normal distribution with the mean µ and the standard deviation σ , then for any x,
we have
Exercise 2.3. Let X be a normal random variable with the parameters µ = 0 and
σ = 1.
Exercise 2.4. Use Microsoft Excel to draw the graph of probability distribution
f (x) = dP{X ≤ x}/dx of normal random variable X with
1. µ = 5 and σ = 1,
2. µ = 5 and σ = 2,
3. µ = 4 and σ = 3.
What can you say about these graphs, especially large x? Click help in your Excel
to find the appropriate function. Of course, it won’t help you to find the answer
though...
Exercise 2.5. Let X be a normal random variable with its mean µ and standard
deviation σ . Set Z = (X − µ)/σ . Prove Z is the standard normal random variable.
1 n
E[X] ≈ ∑ Xi. (3.1)
n i=1
Theorem 3.1 (Weak law of large numbers). Let X1 , X2 , .... be an i.i.d. sequence
of random variables with
µ = E[Xn ]. (3.2)
Or, if we take sufficiently large number of samples, the average will be close to µ
with high probability.
14
3.1. THE LAW OF LARGE NUMBERS 15
130
120
110
100
90
80
20 40 60 80 100
You can say, “OK. I understand, we can expect the average will be close to the
mean most of the times. So, it might be possible to have some exceptions...”
Problem 3.2. Take the case of coin flipping. Give an example of this kind of
exception.
Actually, the exceptions should follow the rule, they will vanish...
Theorem 3.2 (Strong law of large numbers). Let X1 , X2 , .... be an i.i.d. sequence
of random variables with
µ = E[Xn ]. (3.4)
With probability 1, we have
Sn
→ µ as n → ∞. (3.5)
n
So, now we cay say that almost all trials, the average will be converges to µ.
Problem 3.3. What is the difference between “most of the time” and “almost all”?
for a large n. However, there is an “error” even though n is large. Even this error
should follow some rule!
Let X1 , X2 , .... be an i.i.d. sequence of random variables with
µ = E[Xn ], (3.9)
2
σ = Var[Xn ]. (3.10)
Now we would like to estimate the error of the sum of this random variables to
nµ, i.e.
n
Sn − nµ = ∑ Xi − nµ. (3.11)
i=1
Sn ∼ N(nµ, nσ 2 ). (3.13)
The central limit theorem indicates that no matter what the random variable Xi
is like, the sum Sn can be regarded as a normal random variable.
Instead of stating lengthy and technically advanced proof of laws of large num-
bers and central limit theorem, we give some examples of how Sn converges to a
Normal random variable.
1 n
A= ∑ Xi. (3.14)
n i=1
Of course, we expect that A is close to the mean E[Xi ] = 1/2 but how close for a
given n?
Figure 3.3 shows the histogram of 1000 different runs of A with n = 10. Since
n = 10 is relatively small samples, we have large deviation from the expected
mean 1/2. On the other hand, when we have more samples, the sample average A
tends to be 1/2 as we see in Figure 3.4 and 3.5, which can be a demonstration of
Weak Law of Large Numbers (Theorem 3.1).
As we see in in Figure 3.3 to 3.5, we may still have occasional high and low
averages. How about individual A for large sample n. Figure 3.6 shows the sam-
ple path level convergence of the sample average A to E[X] = 1/2, as it can be
expected from Strong Law of Large Numbers (Theorem 3.2).
Now we know that A converges to 1/2. Further, due to Central Limit Theorem
(Theorem 3.5), magnifying the histogram, the distribution can be approximate
by Normal distribution. Figure 3.7 show the detail of the histogram of A. The
histogram is quite similar to the corresponding Normal distribution.
Remark 3.1. In this example, Sn /n → 1/2 but not saying your wealth will be even-
tually balancing if you bet on coin-tossings. Sn will diverge instead of converge,
as n getting large, which means that you have great chance to be bankrupting.
3.5 References
The proofs omitted in this chapter can be found in those books such as [Ito, 1991]
and [Durrett, 1991].
3.6 Exercises
Exercise 3.1. Something here!
3.6. EXERCISES 19
Figure 3.3: Histogram of the sample average A = n1 ∑ni=1 Xi when n = 10, where
Xi is a Bernouilli random variable with E[Xi ] = 1/2.
0.8
0.6
0.4
0.2
Figure 3.6: The sample path of the sample average A = n1 ∑ni=1 Xi up to n = 100,
where Xi is a Bernouilli random variable with E[Xi ] = 1/2.
30
25
20
15
10
5
Figure 3.7: The detailed histgram of the sample average A = 1n ∑ni=1 Xi when n =
10, where Xi is a Bernouilli random variable with E[Xi ] = 1/2. The solid line is
the corresponding Normal distribution.
Chapter 4
aI{X≥a} ≤ X. (4.2)
Theorem 4.2 (Chernov Bounds). Let X be the random variable with moment gen-
erating function M(t) = E[etX ]. Then, we have
21
22 CHAPTER 4. USEFUL PROBABILITY THEOREMS
4.2 Exercises
Exercise 4.1. Something here!
Chapter 5
Easy Money
Problem 5.1. Are there any easy money? If so, what will happen?
23
24 CHAPTER 5. EASY MONEY
holds regardless of the stock price X1 , the value of the investment V1 can be fixed.
By solving (5.3), we have
y = −3x, (5.4)
So, we will borrow the money form a bank. At time 1 we need to pay its interest,
but at the same time we earn the money V1 from our investment. Thus, overall
gain G1 is
Now from the argument in Section 5.1.1, if we adopt the investment strategy such
as in (5.4), we know that we can avoid the risk and V1 = 50x. Thus,
Now you are ready to learn how to get easy money. Consider the following
situations:
1. Suppose 3c > 100 − 50(1 + r)−1 . Set some positive x, then G1 > 0. That
means our investment guaranteed to be profitable. Actually, in this case, the
option price c satisfies
2. Suppose 3c < 100 − 50(1 + r)−1 . In this case by setting negative x, we can
also get positive gain G1 at time 1. In this case
3. Suppose 3c = 100 − 50(1 + r)−1 . The option price satisfies (5.1). We’re
sorry but you cannot obtain easy money.
Of course, in some temporary cases, we may experience case 1 and 2. We call
these sure-win situation arbitrage. However, for example, if the option price is
expensive compared to (5.1), as soon as many investors and the dealer of options
realized that they can exploit it by selling the option, everybody starts to sell the
options. Thus, in the end the price of the options will drop. If the option price
is lower than (5.1), everybody wants to buy them, and eventually the option price
will soar. Hence the price of option will converge to (5.1).
Example 5.1. Consider the following thought experiment. Option price is larger
than c = {100 − 50(1 + r)−1 }/3.
Problem 5.5. What are you going to do, when you find the option price is larger
than c = {100 − 50(1 + r)−1 }/3?
Of course, you sell options in short and obtain easy money. However, if other
players read this text, they also start selling options. (They may realize the situa-
tion independent of this text...) Eventually, the price of option falls. But how far
you can continue selling? It should be just before the option price hit c = {100 −
50(1 + r)−1 }/3. Thus, the option price should be c = {100 − 50(1 + r)−1 }/3.
taking into account of the initial payment s. Thus, the expected return of this
investment is
To vanish the opportunity of easy money from a financial product, there should
be a particular situation where its expected profit should be equal to zero (See
Section 5.4). Thus, by setting E[R1 ] = 0, we have
s(1 + r) − b
p= . (5.12)
a−b
Thus, given that the expected return of this investment R1 equals to zero, p should
should have the value like in (5.12).
Next, suppose we buy one option. The present value of the return of this
investment Q1 is
(
(a − K)(1 + r)−1 −C if X1 = a,
Q1 = (5.13)
−C if X1 = b.
Thus, we have
Hence, by Theorem 5.3, we don’t have sure-win if the option price is (5.16).
Remark 5.1. The option price (5.16) coincides with (5.1).
28 CHAPTER 5. EASY MONEY
p = P{X1 = 200}
100(1 + r) − 50
=
150
2r + 1
= ,
3
from (5.12). Thus,
This means that the only “rational” probability structure for the future stock price
should has the same expected present value. Of course, given that this probability
structure, we have the option price,
Theorem 5.1 (law of one price). Consider two investments and their costs c1 and
c2 respectively. If the present values of their payoff are same, then either
1. c1 = c2 , or
Proof. Suppose c1 < c2 , then we can buy c1 and selling c2 , and obtain money.
Corollary 5.1. If there is no arbitrage, the price of the investments with the same
return should be identical.
5.3. DUPLICATION AND LAW OF ONE PRICE 29
Now using Theorem 5.1, we will derive the option cost as in the same situation
as in Section 5.1 by considering two different investments:
2. Borrow x from bank, and buy y shares. The initial investment is 100y − x.
At time 1 you need to pay back (1 + r)x, while selling the y share. Thus, the
payoff Q1 is
(
200y − x(1 + r) if the price of stock is $200,
Q1 = (5.18)
50y − x(1 + r) if the price of stock is $50.
By choosing appropriate x and y, we have the same pay off P1 = Q1 for these two
investment, i.e.,
or
1
y= , (5.21)
3
50
x= . (5.22)
3(1 + r)
In this case, two investments have the identical payoff. In other word, we could
duplicate the option by buying stocks at spot and borrowing money from bank.
Thus, using Theorem 5.1, the initial costs of two investments are identical. Thus
the option price c is
Consider two investment options whose gain is determined by the future price
of a stock X. Assume that the stock price X can be either a or b. Let R1 and
R2 be the corresponding gain to the investments. Since the gain of investment is
determined by the stock price, we have
(
ri (a) X = a,
Ri = (5.24)
ri (b) X = b.
We can split our money x = x1 + x2 into the two investment, thus the total gain
R is obtained by
R = x1 R1 + x2 R2 . (5.25)
Theorem 5.2 (arbitrage theorem for two investments). Either one of the follow-
ings holds.
R = x1 R1 + x2 R2 > 0. (5.28)
Suppose, we somehow find the investment strategy to get some positive gain when
the stock price is b, i.e.
Now assume unfortunately our stock price turn out to be a. We have the gain as
R = x1 r1 (a) + x2 r2 (a)
p−1
= {x1 r1 (b) + x2 r2 (b)}
p
However, because of (5.31) and the fact (p − 1)/p ≤ 0, the gain when X = a is
always negative (non-positive). The same argument can be applied to the case
when there is a strategy to have positive gain at X = b. Thus there is no sure-win
when there is risk-neutral probability.
Next we show that whenever there is no sure-win, there exists a risk-neutral
probability. Define two vectors by
Then, the gain is nothing but the inner product of these vectors and the vector
x = (x1 , x2 ), i.e.,
(
x1 r1 (a) + x2 r2 (a),
R = (x, r) = . (5.34)
x1 r1 (b) + x2 r2 (b).
If there is no sure-win, the vectors r(a) and r(b) should be in line with the opposite
direction, i.e.,
for some constant α ≥ 0. Indeed, if not, we can always find some vector x having
positive inner product with both r(a) and r(b). It is easy to see that
|r1 (b)| |r2 (b)| 1
= = . (5.36)
|r1 (a)| + |r1 (b)| |r2 (a)| + |r2 (b)| α + 1
Set
1
p= . (5.37)
α +1
32 CHAPTER 5. EASY MONEY
By (5.35),
1
pr1 (a) + (1 − p)r1 (b) = {|r1 (b)|r1 (a) + |r1 (a)|r1 (b)}
|r1 (a)| + |r1 (b)|
1
= {−|r1 (b)|αr1 (b) + |αr1 (b)|r1 (b)}
|r1 (a)| + |r1 (b)|
= 0.
Similarly, we have
Let J be an experiment (of a gamble?). The outcome of J can be {1, ..., m}.
Let ri ( j) be the return fuction, that is, when we bet a unit money on wager i, if the
experiment result is J = j, we can get ri ( j). Note that the amount of bet can be
negative.
We can bet xi on wager i. We call the vector x = (x1 , ..., xn ) the betting strategy.
If we bet on the strategy x, we can get the amount of money
n
∑ xiri(J). (5.39)
i=1
Theorem 5.3 (arbitrage theorem). Either there exists risk neutral probability, or
there is the sure-win strategy (arbitrage). More precisely, either one of the follow-
ings true.
1. There exists a probability vector p = (p1 , ..., pm ) for which
m
E[ri (J)] = ∑ p j ri( j) = 0 for all wager i, (5.40)
j=1
5.5 References
reference here!
5.6 Exercises
Exercise 5.1. Something here!
Chapter 6
Brownian Motions and Poisson Processes are the most useful and practical tools
to understand stochastic processes appeared in the real world.
34
6.1. GEOMETRIC BROWNIAN MOTIONS 35
Proof. Set
S(t) S(t)
Y= = .
S(0) s0
E[S(t)]
E[Y ] = .
s0
Remark 6.1. Here the mean of S(t) increase exponentially with the rate µ + σ 2 /2,
not the rate of µ. The parameter σ represents the fluctuation, but since the log-
normal distribution has some bias, σ affects the average exponential growth rate.
Theorem 6.1. Let S(t) be a geometric Brownian motion with its drift µ and
volatility σ , then for a fixed t ≥ 0, S(t) can be represented as
Proof. We need to check that (6.2) satisfies the second part of Definition 6.1,
which is easy by taking log on both sides in (6.2) and by seeing
S(t)
log = W. (6.3)
S(0)
36 CHAPTER 6. TOOLS FOR MODELING DYNAMICS OF STOCK PRICES
where B(t) is the standard Brownian motion and dB(t) is define by so-called Ito
calculus.
Note that the standard Brownian motion is continuous function but nowhere
differentiable, so the terms like dB(t)/dt should be treated appropriately. Thus,
the following parts is not mathematically rigorous. The solution to this equation
is
2 /2)t+σ B(t)
S(t) = S(0)e(µ−σ . (6.5)
To see this is the solution, we need “unordinary” calculus. We will see the detail
in Theorem 11.2.
dS(t)
, (6.6)
dt
formally. Check if it satisfies (6.4), or not.
Definition 6.2 (Tree binomial process). The process Sn is said to be a Tree Bino-
mial process, if the dynamics is express as
(
uSn−1 with probability p,
Sn = (6.7)
dSn−1 with probability 1 − p,
Problem 6.2. Figure 6.1 depicts a sample path of Tree Binomial Process. Do you
think it is reasonably similar to Figure 6.2, which is a exchange rate of yen?
6.2. DISCRETE TIME TREE BINOMIAL PROCESS 37
130
120
110
100
90
80
20 40 60 80 100
for all n. The multiplication factors d and u as well as the probability p have
some specific value, as described in the following, to imitate the dynamics of the
Brownian motion. Let Yi be the indicator of “up”s (Bernouilli random variable),
i.e.,
(
1 up at time i∆,
Yi = (6.12)
0 down at time i∆.
Thus, the number of “up”s up to time n∆ is ∑ni=1 Yi , and the number of “down”s is
n − ∑ni=1 Yi . Hence, the stock price at time n∆ given that the initial price S(0) is
and show that the limit S(t) is geometric Brownian motion with the drift µ and
the volatility σ as ∆ → 0. Taking log on the both side, we have
S(t)
t u t/∆
log = log d + log ∑ Yi. (6.14)
S(0) ∆ d i=1
6.2. DISCRETE TIME TREE BINOMIAL PROCESS 39
Consider taking the limit ∆ → 0, then the number of terms in the sum increases.
Using Central Limit Theorem (Theorem 3.5), we have the approximation:
t/∆
∑ Yi ∼ Normal distribution. (6.16)
i=1
Thus, log(S(t)/S(0)) has also the normal distribution with its mean and variance:
−tσ √ t/∆
E[log(S(t)/S(0)] = √ + 2σ ∆ ∑ E[Yi ]
∆ i=1
−tσ √ t
= √ + 2σ ∆ p,
∆ ∆
−tσ √ t µ√
= √ +σ ∆ 1+ ∆
∆ ∆ σ
= µt,
and
t/∆
Var[log(S(t)/S(0)] = 4σ 2 ∆ ∑ Var[Yi ]
i=1
2
= 4σ t p(1 − p)
→ σ 2t, as ∆ → 0,
since p → 1/2. Thus,
log(S(t)/S(0)) ∼ N[µt, σ 2t], (6.17)
40 CHAPTER 6. TOOLS FOR MODELING DYNAMICS OF STOCK PRICES
and moreover S(t + y)/S(t) is independent with the past because of the construc-
tion of S(t). Hence the limiting distribution of S(t) with ∆ → 0 is a geometric
Brownian motion, which means all the geometric Brownian motion S(t) with the
drift µ and the volatility σ can be approximated by an appropriate tree Binomial
process.
Remark 6.2. In the following, to show the properties of geometric Brownian mo-
tions, we sometimes check the properties holds for the corresponding tree Bino-
mial Process and then taking appropriate limit to show the validity of the proper-
ties of geometric Brownian motions.
130
120
110
100
90
80
20 40 60 80 100
Figure 6.3: Another example of geometric brownian motion with the drift µ = 0.1,
the volatility σ = 0.3 and the interval ∆ = 0.01 and starting from S0 = 100.
It is important to see that the path from geometric brownian motion will differ
time by time, even we have the same parameters. It can rise and down. Figure 6.4
6.3. BROWNIAN MOTIONS 41
gathers those 10 different trajectories. Since we set the drift µ = 0.1 > 0, we can
see up-ward trend while each trajectories are radically different.
On the other hand, Figure 6.5 shows the geometric brownian motion with the
negative drift µ = −0.1, and the probability of the upward change p = 0.483333.
Can you see the difference between Figure 6.5 and Figure 6.4?
200
180
160
140
120
100
80
20 40 60 80 100
Figure 6.4: Many trajectories of geometric brownian motion with the upward
drift µ = 0.1, the volatility σ = 0.3 and the interval ∆ = 0.01 and starting from
S0 = 100.
140
120
100
20 40 60 80 100
Figure 6.5: Many trajectories of geometric brownian motion with the downward
drift µ = −0.1.
are independent for all choice of the time instants t0 < t1 , < .... < tn .
6.4. POISSON PROCESSES 43
-20 20 40 60
-20
-40
-60
-80
Thus, intuitively the future direction of the process which has independent
increment are independent with its past history.
1. N(0) = 0,
Theorem 6.3. Let N(t) be a Poisson process with its rate λ , then we have
E[N(t)]
λ= , (6.23)
t
6.5 References
6.6 Exercises
Exercise 6.1. Find E[Sn ] and Var[Sn ] for tree binomial process.
Exercise 6.2. Let N(t) be a Poisson process with its rate λ , and prove
Theorem 7.1 (Risk-neutral tree binomial process). Consider the stock price on
the discrete time. Let r be the interest rate, and let Sn be the stock price at time n
and satisfies the following special dynamics:
(
uSn−1 ,
Sn = (7.1)
dSn−1 ,
45
46 CHAPTER 7. ON A WAY TO BLACK-SCHOLES FORMULA
First, let us consider an investment scheme. The stock price data up to time
n − 1 are used only for the observation. According to some rule, we decide to buy
stock or not. More precisely, if
(X1 , X2 , . . . , Xn−1 ) = (x1 , x2 , . . . , xn−1 ), (7.3)
we buy the stock at time n − 1 by borrowing money from a bank, and sell it at time
n.
Problem 7.1. Describe why this is fairly common rule.
Let
α = P{(X1 , X2 , . . . , Xn−1 ) = (x1 , x2 , . . . , xn−1 )}, (7.4)
p = P{Xn = 1|(X1 , X2 , . . . , Xn−1 ) = (x1 , x2 , . . . , xn−1 )}. (7.5)
According to the stock price change, the gain of this investment differs. By condi-
tioning the event (X1 , X2 , . . . , Xn−1 ), we can get the expected gain at time n of this
investment E[Gn ]:
E[Gn ] = αE[profit from buying the stock] + (1 − α)0, (7.6)
since we avoid buying stock with probability 1 − α. The probability of stock rise
is p, so
E[profit from buying the stock] = puSn−1 + (1 − p)dSn−1 − (1 + r)Sn−1 , (7.7)
since we need to pay the interest for the money borrowed from the bank. Thus,
E[Gn ] = α{puSn−1 + (1 − p)dSn−1 − (1 + r)Sn−1 }. (7.8)
Thus, setting the expected gain to be zero, we have
0 = E[Gn ] = α{puSn−1 + (1 − p)dSn−1 − (1 + r)Sn−1 }, (7.9)
or,
1+r−d
p = P{Xn = 1|(X1 , X2 , . . . , Xn−1 ) = (x1 , x2 , . . . , xn−1 )} = , (7.10)
u−d
which is independent of our specific rule (x1 , x2 , . . . , xn−1 ). This means that Xn
is independent of (X1 , X2 , . . . , Xn−1 ). Accordingly, using inductive arguments, we
conclude that (X1 , X2 , . . . , Xn ) are independent. Thus, the process Sn is a tree bi-
nomial process.
7.2. OPTION PRICE ON THE DISCRETE TIME 47
where x+ = max(x, 0). Thus, the present value of the expected return of this in-
vestment R is
Theorem 7.3 (Risk neutral geometric Brownian motion). Let r be the nominal
interest rate. If there is no arbitrage, the geometric Brownian motion representing
stock price dynamics should have the drift µ = r − σ 2 /2 and the volatility σ . This
process is called the risk neutral geometric Brownian motion.
Problem 7.3. Give intuitive explanation why the volatility σ affect the drift µ =
r − σ 2 /2.
Further, we can prove the famous Black-Scholes formula in the simple form.
Theorem 7.4 (Black-Scholes formula). Consider a call option of the stock with
strike price K and the expiration time t. Let C be the price of this option. If we
assume there is no arbitrage opportunity, then we have
where S(t) is the geometric Brownian motion with the drift µ = r − σ 2 /2 and the
volatility σ .
Proof. Since we assume no arbitrage, the expected gain of all bets, including
purchase of the option, should be zero. Thus, supposing to buy a option with cost
C, we have
Remark 7.1. Since the term e−rt represents mapping back to present value, (7.24)
is nothing but the present value of expectation of option payoff.
Although, (7.24) is simple for our eyes, it is hard to evaluate.
Problem 7.4. Use Black-Scholes formula (7.24) to analyze the effect of changing
parameters. What will happen to the option price if
Corollary 7.1 (Original Black-Scholes formula). Given S(0) = s, the option price
C is obtained by
where W is a normal random variable with N((r − σ 2 /2)t, σ 2t), Φ(x) is the dis-
tribution function of the standard normal random variable and
rt + σ 2t/2 − log(K/s)
ω= √ . (7.28)
σ t
To prove this Corollary, we need the following Lemmas. Note that S(t) can be
expressed in
Then,
√
E[I] = P{S(t) > K} = Φ(σ t − ω). (7.35)
Lemma 7.3.
1 ∞ Z √ 2
= √ sert e−(x−σ t) /2 dx
2π α
1
Z ∞
2
= √ sert e−y /2 dy,
2π −ω
52 CHAPTER 7. ON A WAY TO BLACK-SCHOLES FORMULA
√
where we put y = x − σ t. Since the integrant is indeed the density of standard
normal distribution, we have
= sert P{Z > −ω}
= sert Φ(ω),
by the symmetry of Φ.
Proof of Corollary 7.1. Since (S(t) − K)+ = I(S(t) − K), by Theorem 7.4 and ,
we have
C = e−rt E[(S(t) − K)+ ]
= e−rt E[I(S(t) − K)]
= e−rt E[I(S(t)] − Ke−rt E[I]
√
= sΦ(ω) − Ke−rt Φ(σ t − ω),
where we used Lemma 7.2 and 7.3.
Thus, the price of this option is 0.07. Table 7.1 shows the return of stock and
option purchase. You can get more profit when the stock price rises.
With high probability you will lose your money with this option. Thus, you can
understand that option is high risk and high return.
In Figure 7.1, we show the simulation of the stock price dynamics. Your
opinion?
40
35
30
25
20 40 60 80 100
Figure 7.1: Example of risk-neutral geometric brownian motion with the volatility
σ = 0.2 starting from S0 = 30, when the interest rate r = 0.08. Here we used the
tree-binomial approximation with the interval ∆ = 0.01.
7.5 References
reference here!
54 CHAPTER 7. ON A WAY TO BLACK-SCHOLES FORMULA
7.6 Exercises
Exercise 7.1. Estimate the value of following call options, given the current price
of a stock s = S(0) = 30, and compare the result with Example 7.1. What can you
say about it.
Exercise 7.2. Give two examples where buying options is more appropriate than
buying its stocks.
Chapter 8
We have already known that we can duplicate option for one term in Section 5.3.
This idea can be extended to multiple terms.
Here’s our new assignment. Find the amount of money x at time 0 required to
meet a payment P1 below varying according to stock price.
(
a up,
P1 = (8.2)
b down.
55
56 CHAPTER 8. DELTA HEDGING STRATEGY
Let xi,2 be the money required to cover the payoff at time 2, given that
S2 = ui d 2−i s, (8.11)
1
x1,1 = {px2,2 + (1 − p)x1,2 }. (8.12)
1+r
Note that p = (1 + r − d)/(u − d) depends on the stock price dynamics d, u but
doesn’t depend on the payoff. In order to achieve the payoff, we need to buy stock
as much as
x2,2 − x1,2
y1,1 = , (8.13)
us(u − d)
where s, which is the initial stock price in the previous discussion, is replaced with
us, and the rest will be put in the bank.
Second, assume S1 = ds. With similar argument, we need the money x0,1 at
time 1 as
1
x0,1 = {px1,2 + (1 − p)x0,2 }, (8.14)
1+r
with
x1,2 − x0,2
y0,1 = . (8.15)
ds(u − d)
xi,1 = E[ the time-1 present value of payoff at time 2|S(1) = ui s], (8.16)
58 CHAPTER 8. DELTA HEDGING STRATEGY
where i = 0, 1.
Now we can re-evaluate our investment strategy at time 0. As shown above,
we need the payoff x0,1 and x0,1 with respect to the value of S1 . Thus, to cover this
payoff, we need the money as
1
x0,0 = {px1,1 + (1 − p)x0,1 }. (8.17)
1+r
Using (8.12) and (8.14), we have
1
x0,0 = [p{px2,2 + (1 − p)x1,2 } + (1 − p){px1,2 + (1 − p)x0,2 }]
(1 + r)2
1 2 2
= p x2,2 + 2p(1 − p)x1,2 + (1 − p) x0,2 . (8.18)
(1 + r)2
As it can be predicted, x0,0 , the money required to cover the payoff at time 2, is
the present value of the expected payoff at time 2, i.e.,
Here’s an example how to meet the requirement by adjusting our stock posi-
tion.
1. Prepare the money equal to x0,0 , and buy the stock as much as y0,0 at time
0. The rest is kept at the bank.
3. At the time 2, we have S2 = uds (up and down), then the value of our in-
vestment have x1,2 . which is our requirement.
Now it’s time to generalize the argument. Let xi,n be the payoff at time n
given that Sn = ui d n−i s. Also let xi,k be the money required at time k given that
8.2. BINOMIAL HEDGING MODEL 59
Sk = ui d k−i s. Then
xi,k = E[time-k present value of payoff at time n|Sk = ui d k−i s]
1
= {pxi+1,k+1 + (1 − p)xi,k+1 } (8.21)
1+r
n−k
−(n−k) n−k j
= (1 + r) ∑ j p (1 − p)n−k− j xi+ j,n, (8.22)
j=0
we can cover the money required to buy the option at time h, given that we have
C(s,t) at the time 0. Now take h → 0 in (8.34), then
√ √
C(seσ h ,t − h) −C(se−σ h ,t − h)
D(0) = lim √ √
h→0 seσ h − se−σ h
C(seσ a ,t − a2 ) −C(se−σ a ,t − a2 )
= lim . (8.35)
a→0 seσ a − se−σ a
√
where a = h. Since both the denominator and numerator are converges to 0 as
h → 0, we need to apply l’Hospital’s Rule (Lemma 8.1). Then,
d
C(seσ a ,t − a2 ) −C(se−σ a ,t − a2 )
da
D(0) = lim d
(8.36)
σa −σ a ]
da [se − se
a→0
Now
∂C(y,t − a2 ) ∂C(seσ a , u)
d σa 2
C(se ,t − a ) = +
da ∂a
y=seσ a ∂a
u=t−a2
= lim Ct (seσ a ,t − a2 )(−2a) +Cs (seσ a ,t − a2 )sσ eσ a
a→0
= sσCs (s.t),
d
C(se−σ a ,t − a2 ) = lim −Ct (se−σ a ,t − a2 )(−2a) −Cs (se−σ a ,t − a2 )(−sσ e−σ a )
−
da a→0
= sσCs (s.t).
2sσCy (s.t)
D(0) = lim D(h) =
h→0 2sσ
∂C(s,t)
= . (8.37)
∂s
The quantity D(0) = ∂C(s,t)/∂ s is the sensitivity of the option price with the
initial stock price and is called ∆.
Thus, we have the following theorem:
62 CHAPTER 8. DELTA HEDGING STRATEGY
Theorem 8.2 (Delta hedge in geometric brownian process). Let S(t) be the risk-
neutral geometric brownian process. Set the initial investment money as
which is equal to the call option price of strike price K and expiration time n. We
can hedge the option payoff by continuously adjusting the position of stock by ∆
∂C(s,t)
∆= , (8.39)
∂s
and the rest can be put (or borrowed) at a bank with the interest rate r.
∂C(s,t)
∆= = Φ(ω), (8.40)
∂s
where Φ(ω) is defined as in Lemma 7.3.
Thus,
∂C(s,t)
∆=
∂s
∂
= E[e−rt (S(t) − K)I]
∂ s
∂ −rt
=E {e (S(t) − K)I} .
∂s
8.5. REFERENCES 63
Now, we have
∂ −rt ∂ {e−rt (S(t) − K)} ∂I
{e (S(t) − K)I} = I + {e−rt (S(t) − K)}
∂s ∂s ∂s
−rt
∂ {e (S(t) − K)}
=I , (8.43)
∂s
on {S(t) 6= K}. Since P{S(t) = K} = 0, we have
∂ −rt
∆=E {e (S(t) − K)I} S(t) 6= K P{S(t) 6= K}
∂s
∂ −rt
+E {e (S(t) − K)I} S(t) = K P{S(t) = K}
∂s
∂ {e−rt (S(t) − K)}
=E I (8.44)
∂s
Since S(t) is the geometric brownian motion, S can be represented as
2 )t+σ
√
S(t) = se(r−σ tZ
, (8.45)
8.5 References
reference here!
64 CHAPTER 8. DELTA HEDGING STRATEGY
8.6 Exercises
Exercise 8.1. Use
f (x) − f (c)
f 0 (x) = lim (8.49)
x−c x−c
to prove l’Hospital’s Rule (Lemma 8.1).
Chapter 9
Stochastic Integral
Consider the asset price on continuous time. Naturally, we need to extend the
notion of integral to stochastic integral, which is quite different with normal (Rie-
mann) integral.
1. W (0) = 0.
are independent.
65
66 CHAPTER 9. STOCHASTIC INTEGRAL
where ∆W (t) = W (t + ∆t) −W (t). Dividing both sides with ∆t and letting ∆t → 0,
we have
dX(t)
= µ + σ v(t). (9.4)
dt
Here,
dW (t)
v(t) = . (9.5)
dt
Remark 9.2. The process v(t) cannot be exist, while the Wiener process W (t)
exists mathematically. Indeed W (t) cannot be differentiable almost everywhere.
Instead of dividing both sides with ∆t, just take ∆t → 0 in (9.1), then we have
(
dX(t) = µdt + σ dW (t),
(9.6)
X(0) = a.
The termR 0t µds is normal integral, so it is OK. However, we have a problem for
R
9.2 Information
Definition 9.3 (Information of X(t)). Define FtX by the information generated by
X(s) on s ∈ [0,t]. Roughly speaking, FtX is “what has happened to X(s) up to
time t.”
When we can decide an event A happened or not by the the trajectory of X(t),
we say that A is FtX -measurable, and
A ∈ FtX . (9.9)
Z ∈ FtX . (9.10)
for all t.
1.
Then, A ∈ F9X .
2.
Then, A ∈ F10
X , but A 6∈ F X .
9
3.
Z 5
Z= X(s)ds. (9.14)
0
Then, Z ∈ F5X .
68 CHAPTER 9. STOCHASTIC INTEGRAL
when
Rb
1. a E[g(s)2 ]ds < ∞,
2. g is adopted FtW .
Rb
Our objective is to define the stochastic integral a g(s)dW (s) on g ∈ L 2 [a, b].
Assume g is simple, i.e.,
a
g(s)dW (s) = ∑ g(tk )[W (tk+1 −W (tk )]. (9.19)
k=0
2. Take n → ∞ in
Z b Z b
gn (s)dW (s) → g(s)dW (s). (9.20)
a a
Theorem 9.1. Here are some important properties for our stochastic integral.
When g ∈ L 2 , we have
1.
b
Z
E g(s)dW (s) = 0, (9.21)
a
2.
"Z 2 # Z b
b
E g(s)dW (s) = E[g2 (s)]ds, (9.22)
a a
3.
Z b
g(s)dW (s):FbX -measurable. (9.23)
a
Proof. We prove the case only when g(t) is simple. Set g(t) = A1[a,b] with A ∈
FaX . Since A is independent with the future W (b) −W (a), we have
Z b
E g(s)dW (s) = E[A(W (b) −W (a))]
a
= E[A]E[W (b) −W (a)] = 0.
Similarly, we have
"Z 2 #
b
E g(s)dW (s) = E[A2 (W (b) −W (a))2 ]
a
We can prove the general case using the approximation with simple functions.
70 CHAPTER 9. STOCHASTIC INTEGRAL
9.4 Martingale
Definition 9.6. Given a filtration Ft (the history observed up time t), a stochastic
process Xt is said to be martingale, when
1. Xt is adopted to Ft ,
3. for all s ≤ t,
is an FtW -martingale.
Theorem 9.3. A stochastic process X(t) is martingale if and only if
Equivalently, we write
(
dX(t) = µ(t)dt + σ (t)dW (t),
(9.28)
X(0) = a.
Remark 9.5. Wiener process W (t) is continuous but perfectly rugged. Thus, we
need unusual Ito-calculus.
Since W (t) −W (s) is N[0,t − s],
E[∆W ] = 0, (9.29)
2
E[(∆W ) ] = ∆t, (9.30)
Var[∆W ] = ∆t, (9.31)
Var[(∆W )2 ] = 2(∆t)2 . (9.32)
or
Z t
(dW )2 = t. (9.35)
0
Remark 9.6. Intuitive proof can be found in text Bjork [2004] p26.
Theorem 9.4 (Ito’s formula). Given a stochastic process X(t) with
set
∂f ∂f 1 ∂2 f 2 1 ∂2 f 2 ∂2 f
df = dt + dX + (dX) + (dt) + dtdX. (9.41)
∂t ∂x 2 ∂ x2 2 ∂t 2 ∂ x∂t
Note that the the term (dX)2 should be ignored in normal differential, but in
stochastic differential this terms survive as we see in the following.
By (9.36), formally,
Now we can igonre those terms with (dt)2 and dtdW which converges much faster
than dt. Also, by (9.34), (dW )2 = dt. Substituting these, we have the result.
Lemma 9.1. You can use the following conventions:
(dt)2 = 0, (9.43)
dtdW = 0, (9.44)
(dW )2 = 0. (9.45)
set
∂f ∂f 1 ∂2 f
df = dt + ∑ dXi (t) + ∑ dXi (t)dX j (t). (9.49)
∂t i ∂ xi 2 i, j ∂ xi x j
9.5. CALCULUS OF STOCHASTIC INTEGRAL 73
For pricing via arbitrage theory, we need to evaluate the expectation of Wiener
process. We can use Ito’s formula for the evaluation.
74
10.1. EVALUATION OF E[W (T )4 ] 75
Check
∂f
= 0, (10.7)
∂t
∂f
= 4x3 , (10.8)
∂x
∂2 f
= 12x2 , (10.9)
∂ x2
∂2 f
= 0, (10.10)
∂t 2
and we have
dZ(t) = 6W (t)2 dW (t)2 + 4W (t)3 dW (t). (10.11)
Since dW (t)2 = dt by Lemma 9.1, we can rewrite the dynamics for Z(t) = W (t)4
as
(
d(W (t)4 ) = 6W (t)2 dt + 4W (t)3 dW (t),
(10.12)
Z(0) = 0.
Integrating the both side of (10.12), we have
Z t Z t
4 2
W (t) = 6 W (s) ds + 4 W (s)3 dW (s). (10.13)
0 0
Taking the expectation, we have
Z t Z t
4 2 3
E[W (t) ] = 6E W (s) ds + 4E W (s) dW (s) (10.14)
0 0
Z t Z t
2 3
= 6 E W (s) ds + 4E W (s) dW (s) . (10.15)
0 0
√
Since W (s) is N[0, s], we have
E W (s)2 = s.
(10.16)
Also, by Theorem 9.1,
t
Z
3
E W (s) dW (s) = 0. (10.17)
0
Note that we used the fact W (s)3 ∈ L 2 . Now we can evaluate (10.14).
Z t
4
E[W (t) ] = 6 sds = 3t 2 . (10.18)
0
76 CHAPTER 10. EXAMPLES OF STOCHASTIC INTEGRAL
1
dZ(t) = α 2 eαW (t) dt + αeαW (t) dW (t). (10.20)
2
Problem 10.1. Show the above equation by using Ito’s formula.
Rewriting this, we have the stochastic differential equation,
1
dZ(t) = α 2 Z(t)dt + αZ(t)dW (t), (10.21)
2
Z(0) = 1. (10.22)
Z t
1
m(t) = E[Z(t)] = 1 + α 2 m(s)ds. (10.24)
2 0
Or equivalently,
1
m0 (t) = α 2 m(t), (10.25)
2
m(0) = 1. (10.26)
Differential Equations
Thus,
log B(t) = rt +C, (11.6)
or
B(t) = Cert . (11.7)
77
78 CHAPTER 11. DIFFERENTIAL EQUATIONS
Theorem 11.2 (SDE of Geometric Brownian Motion). The solution of the equa-
tion
is given by
2 /2)t+σW (t)
X(t) = x0 e(α−σ . (11.13)
Proof. Here the tricks we used in Theorem 11.1 may not work here because dW (t)
terms should be considered as the stochastic difference.
To simplify the proof, we assume X(t) > 0. Set
∂f ∂f 1 ∂2 f
dZ(t) = dt + dX(t) + 2
(dX(t))2
∂t ∂x 2 ∂x
dX(t) 1 (dX(t))2
= − . (11.15)
X(t) 2 X(t)2
11.3. STOCHASTIC PROCESS AND PARTIAL DIFFERENTIAL EQUATION79
By (11.11),
1
dZ(t) = αdt + σ dW (t) − σ 2 dt. (11.17)
2
Luckily, the right hand side can be integrated directly here!
Z t t
1
Z
Z(t) − Z(0) = (α − σ 2 )ds + σ dW (s) (11.18)
0 2 0
1
= (α − σ 2 )t + σ dW (t). (11.19)
2
Using the initial condition (11.12), we have Z(0) = log x0 . Thus,
1
Z(t) = (α − σ 2 )t + σ dW (t) + log x0 , (11.20)
2
or
2 /2)t+σW (t)
X(t) = eZ(t) = x0 e(α−σ . (11.21)
Remark 11.1. This proof has not prove the existence of the solution yet. More
rigorous treatment of the proof, see the textbook like Oksendal [2003].
∂F ∂ F 1 2 ∂ 2F
+µ + σ − rF = 0, (11.22)
∂t ∂ x 2 ∂ x2
80 CHAPTER 11. DIFFERENTIAL EQUATIONS
Portfolio Dynamics
81
82 CHAPTER 12. PORTFOLIO DYNAMICS
where c(t) is the consumption rate at time t. Thus, we have the budget equation
as,
We will rewrite (12.5) by subtracting the term S(t − ∆t)(h(t) − h(t − ∆t)), as
(S(t) − S(t − ∆t))(h(t) − h(t − ∆t)) + S(t − ∆t)(h(t) − h(t − ∆t)) + c(t)∆t = 0.
This has the appropriate forms of (present value) × (future difference). Thus, we
can take the limit to have the proper budget equation as,
On the other hand, we can evaluate the current value of our portfolio as
By (12.8),
which can be superficially understood as the change of value of our portfolio are
due to the change of the stock price and the consumption we made. Particularly,
when c(t) = 0,
Generally, we have dW (t) terms in the dynamics of underlying stock price dy-
namics dS(t). So, overall, portfolio dynamics dV (t) contain the dW (t) terms. But
in some special case, we can make our portfolio so cleverly that we can avoid the
dW (t) terms, which means we have risk-free portfolio. In that case, the following
happens.
for some function k(t). Apparently, the value of our portfolio V (t) does not have
any risk. Assume we have no arbitrage, then the rate of return of our risk-free
portfolio is r(t), i.e.,
Proof. Assume r(t) and k(t) to be constant r and k for simplicity. First, let us
suppose k > r. Then, we can sell the risk-free asset B(t) and use the money in our
portfolio V (t). The rate of return r for selling the risk-free asset can be covered
by our portfolio, which has the rate of return k. This contradicts the no-arbitrage
assumption.
In the case of k < r, we can reverse the argument.
Remark 12.3. Thus, there is a unique rate of return for risk-free asset, which turns
out to be the short-term interest rate of bank account.
Chapter 13
given that the option is expired at time T and have the value
Typically,
2. Show we can build a risk-free portfolio h(t) using the Black-Scholes model,
which has no dW (t)-terms.
3. The rate of return of our portfolio should be the short-term interest rate r
because of Theorem 12.2.
85
86 CHAPTER 13. PRICING VIA ARBITRAGE
4. Based on the assumption above, evaluate the dynamics of the option price
Π(t), (13.6)
T. Bjork. Arbitrage Theory in Continuous Time. Oxford Finance. Oxford Univ Pr,
2nd edition, 2004.
87