Professional Documents
Culture Documents
A Hybrid Combinatorial Approach To A Two-Stage Stochastic Portfolio Optimization Model With Uncertain Asset Prices
A Hybrid Combinatorial Approach To A Two-Stage Stochastic Portfolio Optimization Model With Uncertain Asset Prices
A Hybrid Combinatorial Approach To A Two-Stage Stochastic Portfolio Optimization Model With Uncertain Asset Prices
Tianxiang.cui@nottingham.edu.cn
Abstract—Portfolio optimization is one of the most important problem, therefore it can be solved in an exact manner with a
problems in the finance field. The traditional mean-variance reasonable computational time.
model has its drawbacks since it fails to take the market However, the basic Markowitz mean-variance model has
uncertainty into account. In this work, we investigate a two-stage
stochastic portfolio optimization model with a comprehensive less practical utilities since it omits many constraints exist in
set of real world trading constraints in order to capture the real world trading. By imposing more real world constraints,
market uncertainties in terms of future asset prices. Scenarios for example cardinality (which specifies the total number of
are generated to capture uncertain prices of assets. Stability tests the held assets in a portfolio in order to reduce the tax and
are performed and the results confirm the effectiveness of the the transaction costs) and bounding (which specifies the lower
scenario generation method used for this work. We propose
a hybrid combinatorial approach, which integrates a hybrid and upper bound of the proportion of each held asset in a
algorithm and a linear programming (LP) solver for the problem portfolio in order to avoid unrealistic holdings), the model
with a large number of scenarios, where the hybrid algorithm is can be transformed into an NP-complete problem [3], [4].
used to search for the assets selection heuristically and the LP Our previous work [5] has proposed a combinatorial algorithm
solver solves the corresponding sub-problems of weight allocation for the cardinality constrained portfolio optimization problem
optimally. The hybrid algorithm is based on Population Based In-
cremental Learning (PBIL) while local search, hash search, elitist using the extended mean-variance model.
selection, partially guided mutation and learning inheritance are Although the real world constraints have later been intro-
also adopted. Comparison results against other 3 algorithms are duced into the classic mean-variance model, there still remains
given. The results show that our hybrid combinatorial approach another important market factor, the uncertainty, that compli-
can solve the two-stage stochastic model effectively and efficiently. cates the investors making investment decisions. In the current
The effects of different parameter settings are also examined.
Index Terms—Hybrid Algorithm; Combinatorial Approach;
work of mean-variance portfolio optimization problem [5]–[9],
Stochastic Programming; Population-based Incremental Learn- the mean expected return and the covariance between assets
ing; Local Search; Learning Inheritance; Portfolio Optimization are assumed to be static, which is often unrealistic due to the
Problem. economic turmoil and the market uncertainties in practice. It
has been pointed out in [10], [11] that the investment decisions
I. I NTRODUCTION should be made based on the consideration of the market
uncertainties. Usually, the random uncertainty factors are taken
With the advances in computing and the rise of big data, into account (i.e. the asset price, the currency exchange rate,
nowadays the investment decisions are made not only by the the prepayments, the external cashflows, the inflation, the
financial experts, but also based on sophisticated mathematical liabilities, etc.). There are also some other non-probabilistic
models and number crunching by mathematicians or computer uncertainty factors (i.e. the vagueness and the ambiguity, etc.)
scientists. The high yield of stock has made it a major which are mainly modeled using fuzzy techniques [12]–[15].
investment over the past decades. One typical problem in stock In this work, we will focus on the random uncertainty of the
market, portfolio optimization, can be described as allocating market, more specifically, we consider the future asset prices
the limited capital over a number of potential assets in order to to be uncertain.
achieve investors risk appetites and the return objectives. The Stochastic programming has been well studied for modeling
first portfolio optimization model was proposed by Markowitz optimization problems with uncertain factors since late 1950s
in the 1950s [1], [2], where, the risk of the portfolio is [16]–[19]. It provides a stochastic view to replace the deter-
measured as the variance of the asset return and therefore ministic one in the sense that the uncertain factors are repre-
the problem can be viewed as a mean-variance optimization sented by the assumed probability distributions. It can model
problem. The original problem is a quadratic programming uncertainty and impose real world constraints in a flexible way
[20]. As it has been shown in [21], stochastic programming has and distributes the weights of capital to each asset in the
been applied to many different areas successfully (finance, s- portfolio and GNP decides the trading strategies. Stochastic
ports, scheduling [22], telecommunications, energy, production programming models have also been widely used in the
control and capacity planning, etc.). For this work, we propose portfolio optimization literature. For example Topaloglou et
to use stochastic programming to model uncertain future asset al. [36] proposed a multi-stage stochastic programming model
prices. for international portfolio management in a dynamic setting.
Another drawback of the mean-variance model is how it The uncertainties are modeled in terms of the asset prices and
characterizes the risk. In the classical Markowitz portfolio exchange rates. Yu et al. [47] proposed a dynamic stochastic
optimization model, the risk is measured as the variance of programming model for bond portfolio management. They
the asset returns. Because such characterization of the risk model the uncertainty in terms of the interest rates. Stoyan
is a measure of the dispersion of the values of the variable and Kwon [37] considered a stochastic-goal mixed-integer pro-
around its expected value, therefore it cannot define the gramming model for the integrated stock and bond portfolio
direction of volatility in the sense that it penalizes the portfolio problem. The uncertainties are modeled in terms of the asset
profits and the portfolio losses at the same time. Practically, prices and the real world trading constraints are imposed. The
people may only want to minimize the possibility of the model was solved by a decomposition based algorithm. He
portfolio losses. Alternatively, another risk measure, namely and Qu [48] proposed a two stage portfolio selection problem
Value at Risk (VaR) [23], [24], is proposed to calculate the with a comprehensive set of real world trading constraints.
downside risk. VaR calculates the maximum possible loss with The uncertainties are modeled in terms of the asset prices. A
a specified confidence level and it is written into the industry hybrid algorithm integrating local search and a default Branch-
regulation [25], [26]. However, VaR is inadequate for market and-Bound method was proposed to solve the problem.
risk evaluation since it does not satisfy the sub-additivity and One common method used in the literature to deal with
the convexity and generally it is not a coherent risk measure stochastic portfolio optimization model is decomposition. Ben-
[27]. Also, VaR does not take the distribution of the loss ders decomposition [44], scenario decomposition [49], time
exceeding the threshold into account and it would become decomposition [42] and other novel decomposition methods
unstable if there is a sharp and heavy tail loss distribution. [37] are proposed. The problem is simplified when it is
Furthermore, VaR is difficult to optimize using scenarios [28]. decomposed into different parts.
In order to eliminate the drawbacks of VaR, Rockafella
and Uryasev [29] proposed Conditional Value at Risk (CVaR) In our previous work [50], we improved the stochastic
which calculates the expected loss for the worst case scenarios. portfolio optimization model in the literature [36], [48] and
As it has been showed in [27], CVaR is a sub-additive and proposed a hybrid algorithm for the two-stage stochastic port-
convex risk measure, therefore it can be optimized using folio optimization problem with a comprehensive set of real
stochastic programming. world trading constraints. A genetic algorithm (GA) together
Stochastic programming has been widely applied in finan- with a commercial LP solver was used where GA is used
cial optimization problems. Models for the management of to search for the assets selection heuristically and the LP
fixed income securities [30]–[32] and models for asset/liability solver can solve the corresponding sub-problems optimally.
management [33]–[37] have been well studied in recent The proposed hybrid genetic algorithm can solve the problem
decades. A more comprehensive review can be found in [38]. to a good degree of accuracy, however, the full mechanisms of
A wide range of approaches based on stochastic programming a standard GA is too heavy for the two-stage stochastic model,
for portfolio management have been developed [36], [37], especially when a large number of scenarios is used. In order
[39]–[44]. to solve the two-stage stochastic model more efficiently, in this
In the portfolio optimization problem domain, Gaivoron- work, we propose a light weight approach based on Population
ski et al. [43] investigated different approaches to portfo- Based Incremental Learning (PBIL). It intends to solve the
lio selection based on different risk characterizations. They model with a larger number of scenarios. Local search, hash
proposed an algorithm to determine whether to rebalance a search, elitist selection and partially guided mutation are also
given portfolio based on transaction costs and new market adopted in order to enhance the evolution.
condition information. Greco and Matarazzo [45] proposed an The outline of the rest part is as follows: section II in-
approach for portfolio selection in a non-Markowitz way. The troduces the background information. Section III gives the
uncertainties are modeled in terms of a series of meaningful statement of the problem as well as the corresponding no-
quantiles of probabilistic distributions. They proposed an In- tations. The detail description of our hybrid combinatorial
teractive Multiobjective Optimization (IMO) method based on approach is given in section IV. The datasets are described in
dominance-based Rough Set Approach (DRSA) to solve the section V. In Section VI, we introduce our scenario generation
model in two phases. Chen and Wang [46] introduced a hybrid method as well as the stability test results. Section VII gives
stock trading system based on Genetic Network Programming the parameter settings and examines the effects of different
and mean-CVaR model (GNP-CVaR). The proposed model learning rates. Experimental results are presented in section
combines the advantages of statistical models and artificial VIII. The final conclusion and possible future direction are
intelligence in the sense that CVaR measures the market risk given in section IX.
II. P RELIMINARIES where u is the vector of the second-stage decision variables
A. Stochastic programming which depends on the realization of the first-stage uncertain
variables. q(u, ξ) represents the second-stage cost function.
In real-world situation, many variables are not deterministic
W (ξ), h(ξ) and T (ξ) are model parameters with reasonable
due to the uncertain parameters involved (future events, human
dimensions. As these parameters are the functions of the
errors, etc). Generally, there are two approaches to deal with
uncertain data ξ, therefore they are also random. W is the
the uncertainties:
recourse matrix and h is the second-stage resource vector.
• Robust Optimization: when the uncertain variables are
T is the technology matrix which contains the technology
given within some certain boundaries, robust optimization coefficients, therefore it can convert the first-stage decision
is applied for such problems. The idea is to find a solution variable vector x into resources for the second-stage problem.
which is feasible for all the data and optimal for the worst
Therefore the general two-stage stochastic programming
case scenario.
problem with recourse can be rewritten as follows:
• Stochastic Programming: when the probability distribu-
tion of the uncertain variables are known or can be
estimated, stochastic programming is applied for such
min f (x) + E[min{q(u, ξ)|W (ξ)u + T (ξ)x = h(ξ)}]
problems. The idea is to find a policy which is feasible
for all (or at least almost all) possible data instances s.t.
and maximizes/minimizes the expectation of the objective Ax = b
function with the decision and random variables involved. x ∈ Rn
The decision-maker can gather some useful information
u ∈ Rm
by solving such models either analytically or numerically.
For this work, we use stochastic programming to deal with In this formulation, a “here and now” decision x is made
the uncertain future asset prices. The comprehensive concepts before the uncertain data ξ is realized. At the second stage,
of stochastic programming can be found in [16], [17]. after the value of the uncertain data ξ is revealed, we can
B. Two-Stage Stochastic Programming Problem With Re- modify our behavior by solving the corresponding optimiza-
course tion problem.
For this work, we consider a widely applied class of stochas- The recourse problem is not restricted to the two-stage
tic programming problem, namely the recourse problem. It formulation and it is possible to extend the problem into a
seeks a policy that can take the actions after some realisation of multistage model.
the uncertain variables as well as make the recourse decisions
based on the temporarily available information. C. Scenario Tree
The simplest case of the recourse problem have two stages: There are two common methods which can be used to deal
• first stage: A decision needs to be made. with the multistage stochastic programming problems, namely
• second stage: The values of the uncertain variables are decision rule approximation and scenario tree approximation.
revealed and further decisions are allowed to make in For this work, we will focus on the scenario tree approximation
order to avoid the constraints of the problem becoming tree method.
infeasible. Usually a decision in the second stage will de- A scenario is defined as the possible realisation of the
pend on a particular realisation of the uncertain variables. uncertain data ξ in each stage t ∈ T . An example of a scenario
Formally, the two-stage stochastic programming problem tree is showed in Figure 1. The nodes in the scenario tree
with recourse can be described as the follows [38]: represent a possible realisation of the uncertain data ξ T . Each
min f (x) + E[Q(x, ξ)] node is denoted by n = (s, t) where s is a scenario and t
is the level of the node in the tree and the decisions will be
s.t.
made at each node. The parent of the node n is represented by
Ax = b at−1 (n). The branching probability of the node n is denoted
x ∈ Rn by pn which is a conditional probability on its parent node
at−1 (n). The path toQthe node n is a partial scenario with the
where ξ represents the uncertain data, x is the first-stage
probability P rn = pn along the path and the sum of P rn
decision variable vector which should be decided before the
is up to 1 across each level of the scenario tree.
uncertain variables are revealed and Q(x, ξ) is the optimal
In order to apply the scenario tree approximation method
value for the following nonlinear program:
for the stochastic programming problem with recourse, the
min q(u, ξ) uncertain data ξ needs to be discretized and all possible
s.t. realisations of ξ can be represented by a discrete set of
scenarios. Thus, scenario generation methods are required.
W (ξ)u = h(ξ) − T (ξ)x
There are several scenario generation methods in the literature,
u ∈ Rm for this work, we applied a shape based method [51].
t=0
of the loss exceeding the threshold into account and it would
become unstable if there is a sharp and heavy tail in the loss
t=1 distribution.
2) Conditional Value at Risk (CVaR): Conditional Value at
Risk (CVaR) (also called Mean Excess Loss, Mean Expected
t=2 Shortfall, Tail VaR) is proposed in [29] in order to eliminate
the drawbacks of VaR. CVaR is a more consistent risk measure
because of its sub-additivity and the convexity [27] and it is
proven to be a coherent risk measure [52].
Fig. 1. An example of a scenario tree. At stage t = 0 there is one scenario, at
stage t = 1 there are 4 scenarios and at stage t = 2 there are eight scenarios.
CVaR calculates the average value of the loss which exceeds
the VaR value (see Figure 2). Formally, CVaR is defined as
the follows [29]:
Z
φβ (x) = (1 − β)−1 f (x, ξ)p(ξ)dξ
VaR (α) Maximum loss
f (x,ξ)≥αβ (x)
Portfolio Loss It has been proved in [29] that Fβ (x, α) is a convex function
with respect to α and the minimum point of Fβ (x, α) is
Fig. 2. Value at Risk (VaR) and Conditional Value at Risk (CVaR) VaR with respect to α. The CVaR value can be obtained by
minimizing Fβ (x, α) with respect to α.
3) Minimizing CVaR: From the definitions of VaR and
D. Percentile Risk Function CVaR we can see that given a specified probability level β,
1) Value at Risk (VaR): In the real-world situation, portfolio β-CVaR should always be greater or equal to β-VaR. In fact,
managers may only need to reduce the possibility of the we can optimize CVaR and obtain VaR simultaneously by
high loss. Value at Risk (VaR) [23], [24] gives the maximum minimizing the function Fβ (x, α) [53]. Suppose we have the
possible loss α with a specified confidence level β. That is, solution of the minimization of Fβ (x, α), (x∗ , α∗ ), then the
by the end of the investing period, the probability of the loss optimal CVaR value equals to Fβ (x∗ , α∗ ) and the correspond-
exceeding the threshold α is 1 − β (see Figure 2). ing VaR value equals to α∗ .
Formally, let f (x, ξ) be the loss function where x ∈ Z+ is We can minimize the function Fβ (x, α) by introducing an
the decision vector and ξ ∈ R is the uncertain (random) vector. auxiliary function Z(ξ) such that Z(ξ) ≥ f (x, ξ) − α and
The density of the probability distribution of ξ is denoted by Z(ξ) ≥ 0. Formally we have the following:
p(ξ). The probability of the loss function f (x, ξ) not exceeding
a threshold α is given by:
min α + (1 − β)−1 E(Z(ξ))
Z s.t.
Ψ(x, α) = p(ξ)dξ Z(ξ) ≥ f (x, ξ) − α
f (x,ξ)≤α
Z(ξ) ≥ 0
The β-VaR for the loss random variable associated with x
and the specified probability β in (0, 1) is denoted by αβ (x) α∈R
and formally we have the following: Now let us consider the portfolio optimization problem.
Here the uncertain data ξ can be referred to the future
αβ (x) = min{α ∈ R : Ψ(x, α) ≥ β} asset prices. Normally the analytical representation of density
function p(ξ) is not available but instead the scenarios can
However, VaR is inadequate for market risk evaluation. As be generated from the historical observations of each asset
it has been pointed out in [27], VaR does not satisfy the sub- price. The scenario generation can use the property matching
additivity and the convexity and generally it is not a coherent method [54], [55] or even simply Monte Carlo simulations.
risk measure (VaR is only coherent for standard deviation of Suppose we have generated N scenarios from the density p(ξ),
normal distributions). Also VaR is difficult to optimize using yn where n = 1, . . . , N . Function Fβ (x, α) can be therefore
scenarios [28]. Furthermore, VaR does not take the distribution calculated as the follows:
X
N
X min α + (1 − β)−1 pj zj (1)
−1 +
Fβ (x, α) = α + (1 − β) pn (f (x, yi ) − α) j∈Nr
n=1 s.t.
First Stage - Portfolio Selection:
where f (x, yn ) is the portfolio loss function in scenario n wi = wi0 + bi − si , ∀i ∈ A (2)
and it is defined as the negative of the total portfolio return. X X
pn is the probability of scenario n and (f (x, yi ) − α)+ = h+ (si Pi0 ) − (ηs gi + si ρs Pi0 )
i∈A i∈A
max(0, (f (x, yi ) − α)). By introducing the auxiliary func- X X
tion Z(ξ) and we can have the auxiliary variable zn where = (bi Pi0 ) + (ηb fi + bi ρb Pi0 ) (3)
zn ≥ f (x, yn ) − α, zn ≥ 0, n = 1, . . . , N . Therefore the X
i∈A i∈A
minimization of the function Fβ (x, α) can be reduced to the ci = K (4)
simplified form: i∈A
wmin ci ≤ wi ∀i ∈ A (5)
tmin fi ≤ bi ∀i ∈ A (6)
N
X tmin gi ≤ si ∀i ∈ A (7)
min α + (1 − β)−1 pn zn
fi + gi ≤ 1 ∀i ∈ A (8)
n=1
s.t. f i M ≥ bi ∀i ∈ A (9)
zn ≥ f (x, yn ) − α n = 1, . . . , N gi M ≥ si ∀i ∈ A (10)
zn ≥ 0 n = 1, . . . , N bi M ≥ f i ∀i ∈ A (11)
α∈R si M ≥ gi ∀i ∈ A (12)
x∈R n wi , b i , s i ∈ R (13)
ci , f i , g i ∈ B (14)
Second Stage - Recourse:
It has been showed in [29], [56], [57] that such formulation
can provide the numerically stable technique to the problem wij = wi + bji − sji ∀i ∈ A, ∀j ∈ Nr (15)
X j j X
with large number of scenarios. (si Pi ) − (ηs gij + sji ρs Pij )
i∈A i∈A
X j j
X
= (bi Pi ) + (ηb fij + bi ρb Pij ) ∀j ∈ Nr (16)
III. M ODEL S TATEMENT X j
i∈A i∈A
ci = K ∀j ∈ Nr (17)
i∈A
A. Notations
wmin cji ≤ wij ∀i ∈ A, ∀j ∈ Nr (18)
The notations we used in this work are given in Table I. tmin fij ≤ bji ∀i ∈ A, ∀j ∈ Nr (19)
tmin gij ≤ sji ∀i ∈ A, ∀j ∈ Nr (20)
fij + gij ≤ 1 ∀i ∈ A, ∀j ∈ Nr (21)
B. Two-stage stochastic portfolio optimization model with fij M ≥ bji ∀i ∈ A, ∀j ∈ Nr (22)
recourse j j
gi M ≥ si ∀i ∈ A, ∀j ∈ Nr (23)
j j
The model we used for this work is the same with our bi M ≥ fi ∀i ∈ A, ∀j ∈ Nr (24)
previous work [50]. Inspired by [36], the original form of the j j
si M ≥ gi ∀i ∈ A, ∀j ∈ Nr (25)
model was proposed in [48]. Although [48] is formulated as a X (j,e)
V =j
p(j,e) Pi wij ∀j ∈ Nr (26)
two stage model, it did not include a possibility that the costs
i∈A
and values change after the recourse decision is enacted. Hence e∈Nej
the recourse it that model could have no monetary effect, and
Rj = V j − V 0 ∀j ∈ Nr (27)
so would obtain the same decisions as a simpler single stage j
formulation. A contribution of our work is to extend the model zj ≥ −R − α ∀j ∈ Nr (28)
so that values can change after the recourse, and non- trivial zj ≥ 0 ∀j ∈ Nr (29)
recourse decisions can improve the portfolio performance. The X
pj R j ≥ µ (30)
proposed model is divided into two stages. j∈Nr
The objective function (1) calculates the β-percentile CVaR than 0, then the corresponding binary decision variables should
of the portfolio loss at the end of the second stage where α is equal to 1; if the decision variables for buying/selling an
the corresponding optimal VaR value. Eq. (2) is the first stage asset is 0, then the corresponding binary decision variables
asset balance condition and Eq. (15) is the second stage asset should be 0 and vice versa. Equations (26), (27) calculate the
balance condition. Equations (3), (16) are the cash balance portfolio return on each recourse node by using a different set
conditions for the first and second stage respectively. We of evaluate scenarios in order to have a better reflection of
apply a fixed transaction cost and a linear variable transaction changing price scenarios in the reality. Equations (28), (29)
cost to both buying and selling an asset. The idea is that define the excess shortfall zj of the recourse portfolio where
the cash inflows should equal to the cash outflows in both zj = max[0, −Rj − α] for each recourse node. The minimum
stages (i.e. no cash left). Equations (4), (17) are the cardinality portfolio target return µ is given in equation (30). The decision
constraints for the first and second stage respectively where variables wi , bi , si , wij , bji , sji specify the exact amount of the
K is the desired number of the assets held within a portfolio. units for an asset to buy or sell and in a real-world situation,
Equations (5) and (18) put the restrictions on the minimum these decision variables should be integers. As they increase
holding size of an asset in order to prevent very small asset the computational difficulty significantly, we took the same
positions for the first and second stages. Equations (6),(7), method suggested in [48], [58] to relax these decision variables
are the minimum trading conditions for the first stage and as continuous variables.
equations (19),(20), are the minimum trading conditions for
the second stage. The idea is to prevent it from trading a
very small proportion of an asset. Buying and selling the C. Computational complexity
same asset at the same time is not allowed, this is given in The deterministic problems are generally in NP, however,
equation (8) for the first stage and in equation (21) for the the stochastic versions can be of even harder complexity
second stage. The big-M formulations are used in the model classes; for example, it has been shown in [59] that linear
in order to bound the decision variables and the binary decision two-stage stochastic programming problems are #P-hard. For
variables (constraints (9), (10), (11), (12) for the first stage and multistage stochastic programming problems, it is generally
constraints (22), (23), (24), (25) for the second stage). The idea computationally intractable even for the medium-accuracy
is, if the decision variables for buying/selling an asset is greater solutions [60].
IV. T HE P ROPOSED H YBRID C OMBINATORIAL A PPROACH One vector k = (k0 , k1 , ..., kK ) of size K is used to
•
represent the selected K assets of the portfolio where
Exact methods and metaheuristic approaches are two suc-
ki ∈ {1, 2, . . . , Q} and i = 1, . . . , K.
cessful streams for solving combinatorial optimization prob-
• The evaluation of vector k is done by calculating a fitness
lems. Over the last few years, many works have been develope-
function F which is implemented using a standard LP
d on building hybrids of exact methods and metaheuristic ap-
solver. It maps from a list of K integers and a target
proaches. In fact, many real-world problems can be practically
return µ to a real number: F (Z K , µ) → R.
solved much better using hybrid strategies since the advantages
• The probability vector is updated by learning from the
of both types of methods are simultaneously exploited. For
best and the worst solutions obtained from the population
this work, we integrate metaheuristic approaches with exact
at the end of each generation.
methods. The idea is, we divide the two-stage stochastic
• Elitist selection is used in our PBIL-based hybrid algo-
problem into two parts. The first part is to determine the asset
rithm, i.e., we keep the best solution in each generation.
combination while the second part is to calculate the optimal
• The global best solution xgb is recorded such that
weights of the selected assets correspondingly. The first part
F (xgb , µ) ≤ F (xi , µ) for all xi at the given return level
is searched by the hybrid algorithm and the second part is
µ.
solved by a LP solver. In our previous work [50], we proposed
a hybrid genetic algorithm which can solve the problem to a The procedure of PBIL-based hybrid algorithm used in this
good degree of accuracy. However, the full mechanisms of GA work is given as Algorithm 1 and the parameters are given in
is too heavy (i.e. computationally expensive) for the two-stage section VII.
stochastic model especially when a large number of scenarios
are used. In this case, we propose a lighter approach which Algorithm 1: PBIL-based hybrid algorithm for searching
is based on Population Based Incremental Learning (PBIL). It the set of assets
intends to solve the two-stage stochastic model with a larger 1 for i = 1 to Q do
number of scenarios. Local search, hash search, elitist selection 2 vi = 0.5;
and partially guided mutation are also adopted in order to 3 while stopping criteria are not met;
enhance the evolution. 4 do
5 Generating individuals: Create a population of
A. Overview of PBIL
individuals (see IV-D);
Population Based Incremental Learning (PBIL) was origi- 6 for each individual generated do
nally introduced by Baluja [61], [62]. It is one of the simplest 7 Hash search: Search the infeasible solution hash
form of Estimation of Distribution Algorithms (EDAs). It table and bad solution hash table, if it is very
combines genetic algorithms and competitive learning for similar to the entries of the hash table,
function optimization. It evolves the entire population rather re-generating (see IV-E);
than each single individual members. The idea is, a probability
8 Evaluation: Evaluate each individual’s fitness by
vector is used to represent the distribution of all individuals.
using CPLEX LP solver (see IV-F);
After evaluating each individuals, the probability vector is
9 Local Search: Perform the local search for the top
updated by learning from the best and the worst solutions.
20% individuals (see IV-G);
Mutation is also performed on the probability vector in order to
10 Archive: Keep the record of the current best solution,
help preserve diversity. Then the new generation of population
the current worst solution and the infeasible solution
is created based on the updated probability vector. PBIL is
(see IV-H);
closely related to GA, but it is simpler and more efficient since
11 Update: Update v by learning from the current best
it does not require all the mechanisms of a standard GA.
and the current worst solution (see IV-I);
B. Problem representation 12 Mutation: Mutate v by using partially guided
mutation (see IV-J);
In this work, PBIL-based hybrid algorithm is utilized to 13 Elitism: Select the best individual from the current
evolve best values for discrete variables in the stochastic generation and insert it into the next new generation;
model. The search space is different for different benchmark
datasets (characterized by Q, see Section V). The objective is
to find the best K items from Q possible assets for a given
target return µ specified by the investor. The details of problem C. The reduced sub-problem
representation are as follows: In this work, the sub-problems are generated by dropping
• One probability vector v = (v0 , v1 , ..., vQ ) of size Q all the non-selected assets. Each individual is fixed in both the
represents the possibility of each asset to be chosen in first and second stage (i.e. ci = cji = 1 if hybrid algorithm
the portfolio. picks asset i). The recourse are limited to asset rebalancing,
• One binary vector t = (t0 , t1 , ..., tQ ) of size Q is used but not swapping the assets and therefore we can call such sub-
to denote if asset is chosen in the portfolio. problem the reduced sub-problem. As the transaction costs and
the minimum holding constraint are considered in the model, Algorithm 2: Generating individuals
the cost of buying an entirely new asset is probably signifi- 1 for i = 1 to Q do
cantly higher than just adjusting the holding of an existing one. 2 if random(0, 1) < vi then
We use CPLEX to solve a full problem on a small instance 3 ti = 1;
using a small number of scenarios with different transaction 4 else
costs and minimum holding constraint. For each different 5 ti = 0;
set of transaction costs and minimum holding constraint, we
count the number of occurrences that the selection of assets 6 K 0 = 0;
in the second-stage is not the same as in the first-stage (i.e. 7 for i = 1 to Q do
cji 6= ci ). The results are shown in Table II. We can see 8 if ti == 1 then
that when the transaction costs are bigger than 0.4% and the 9 K 0 = K 0 + 1;
minimum holding constraint is bigger than 0.8%, the assets in
the second-stage remain unchanged. For this work, we use 10 if K 0 ≥ K then
the parameter settings ρb , ρs = 0.5% wmin = 1.0% (see 11 randomly choose K among K 0 assets and insert them
section VII) therefore adding or dropping the rebalance-only into k;
condition should not make a significant difference. We do not 12 if K 0 < K then
claim this is an optimal approximation. The reason of using 13 insert K 0 assets into k;
the reduced sub-problem is to reduce the computation time by 14 randomly choose another K − K 0 assets which are
the LP solver. different from the existing K 0 assets and insert them
into k;
TABLE II
T HE NUMBER OF OCCURRENCES THAT cji 6= ci FOR THE SOLUTION OF
THE FULL PROBLEM ON A SMALL INSTANCE USING 100 POSSIBILITIES OF
SCENARIOS WITH DIFFERENT MODEL PARAMETERS
0.065
0.11
0.05
0.06
0.1 0.055
0.045
0.05
0.09
Return
Return
Return
0.045 0.04
0.08
0.04
0.035
0.07 0.035
0.03
0.03
0.06
PBIL without hash search 0.025 PBIL without hash search PBIL without hash search
PBIL with hash search PBIL with hash search PBIL with hash search
0.05 0.02 0.025
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0 0.05 0.1 0.15 0.2 0.25 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14
CVaR CVaR CVaR
0.06
0.023
0.055
0.022
0.05
Return
0.045 0.021
0.04
0.02
0.035
0.019
0.03
PBIL without hash search PBIL without hash search
PBIL with hash search PBIL with hash search
0.025 0.018
0 0.05 0.1 0.15 0.2 0.25 0.008 0.01 0.012 0.014 0.016 0.018 0.02 0.022 0.024 0.026
CVaR
Algorithm 4: Hash search the closest probability. If a better solution is obtained, the
1 for each new individual h generated do current best solution is updated. For each asset, we search na
2 for each key ky1 in HashtableInfeasibleSolution do neighbours (i.e. na closest probability successors). The local
3 if SimilarityCheck(h, ky1) ≥ K − 1 then search applied here is the incomplete neighbourhood search.
4 re-generate individual h; It aims to seek for the possible improvement of the current
5 break ; solution. Figure 4 shows that the local search can indeed help
the algorithm find better global solutions. The details of the
6 for each key ky2 in HashtableBadSolution do local search are given as Algorithm 5.
7 if SimilarityCheck(h, ky2) ≥ K − 2 then
8 re-generate individual h; Algorithm 5: Local search
9 break ;
1 Select the top 20% individuals of the generation;
2 for each selected individual do
3 for i = 1 to K do
4 for j = 1 to na do
(which is used to control the kinds of pivots permitted) and the 5 Replace asset i with a neighbourhood asset to
time allowed for each fitness calculation. That means we do form a new portfolio;
not need to compute the optimal values for every individual. 6 Evaluate the new portfolio;
We only need to calculate the optimal value once for the 7 if The fitness value of the new portfolio is
global best solution after the search of the hybrid algorithm is smaller than the current best solution then
finished. This will help improve the efficiency. 8 Update the current best solution;
G. Local search
After the evaluation is done, top 20% individuals with
the smallest fitness value of the generation are selected and
the local search are applied to them in order to seek for H. Archive
the better solutions and evolve better individuals within a As mentioned in section IV-E, during the evolution, it is
neighbourhood. Each time we replace one asset with a neigh- possible to obtain infeasible solutions. It’s important to keep
bourhood asset and then re-evaluate the new portfolio. The an archive of them. We use a hash table to record all the
neighborhood relation of an asset is defined as the asset with infeasible solutions obtained. Similarly we use a hash table
0.12 0.07 0.055
0.065
0.11
0.05
0.06
0.1 0.055
0.045
0.05
0.09
Return
Return
Return
0.045 0.04
0.08
0.04
0.035
0.07 0.035
0.03
0.03
0.06
PBIL without local search 0.025 PBIL without local search PBIL without local search
PBIL with local search PBIL with local search PBIL with local search
0.05 0.02 0.025
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0 0.05 0.1 0.15 0.2 0.25 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14
CVaR CVaR CVaR
0.06
0.023
0.055
0.022
0.05
Return
0.045 0.021
0.04
0.02
0.035
0.019
0.03
PBIL without local search PBIL without local search
PBIL with local search PBIL with local search
0.025 0.018
0 0.05 0.1 0.15 0.2 0.25 0.008 0.01 0.012 0.014 0.016 0.018 0.02 0.022 0.024 0.026
CVaR
0.095 0.065
0.05
0.09 0.06
0.085 0.055
0.045
0.08 0.05
Return
Return
Return
0.075 0.045 0.04
0.07 0.04
0.035
0.065 0.035
0.06 0.03
0.03
Efficient frontier Efficient frontier Efficient frontier
0.055 In−sample points 0.025 In−sample points In−sample points
Out−of−sample points Out−of−sample points Out−of−sample points
0.05 0.02 0.025
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35
CVaR CVaR CVaR
0.06 0.025
0.055 0.024
0.05 0.023
Return
Return
0.045 0.022
0.04 0.021
TABLE III
I N - SAMPLE STABILITY TEST RESULTS FOR 5 GENERAL MARKET initial cash h = 100000. We assume the probability of each
INSTANCES USING 400 SCENARIOS scenario is equal and therefore pj = 1/Nr , p(j,e) = 1/Nej .
0.1 0.109
0.055
0.05
0.09
Return
Return
Return
0.1085 0.045
0.08
0.04
0.07 0.035
0.108
Random Search Random Search 0.03 Random Search
0.06 GA Mutation Only GA Mutation Only GA Mutation Only
PSO PSO 0.025 PSO
Hybrid Algorithm 0.1075 Hybrid Algorithm Hybrid Algorithm
0.05 0.02
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.284 0.286 0.288 0.29 0.292 0.294 0.296 0 0.05 0.1 0.15 0.2
CVaR CVaR CVaR
0.06
0.05 0.023
0.055
0.045 0.022
0.05
Return
Return
Return
0.04 0.045 0.021
0.04
0.035 0.02
0.035
Random Search Random Search Random Search
0.03 0.019
GA Mutation Only GA Mutation Only GA Mutation Only
0.03
PSO PSO PSO
Hybrid Algorithm Hybrid Algorithm Hybrid Algorithm
0.025 0.025 0.018
0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0 0.05 0.1 0.15 0.2 0.25 0.005 0.01 0.015 0.02 0.025 0.03 0.035
CVaR CVaR CVaR
1030
Evaluation value
Evaluation value
Evaluation value
3800
8000
1025 3600
7500
3400
1020
3200
7000
1015
3000
1600
800
Evaluation value
Evaluation value
1500
750
1400
700
1300
650
1200
600
1100
550 1000
500 900
0 20 40 60 80 100 0 20 40 60 80 100
Generation Generation
[9] K. Lwin and R. Qu, “A hybrid algorithm for constrained portfolio vice network design with rerouting,” Transportation Research Part B:
selection problems,” Applied Intelligence, vol. 39, no. 2, pp. 251–266, Methodological, vol. 60, no. 0, pp. 50 – 65, 2014.
Sep. 2013. [23] P. Jorion, Value at Risk: The New Benchmark for Managing Financial
[10] R. Baldacci, M. Boschetti, N. Christofides, and S. Christofides, “Ex- Risk, 3rd ed. McGraw-Hill Education, 2006.
act methods for large-scale multi-period financial planning problems,” [24] M. Pritsker, “Evaluating Value at Risk Methodologies: Accuracy versus
Computational Management Science, vol. 6, no. 3, pp. 281–306, 2009. Computational Time,” Journal of Financial Services Research, vol. 12,
[11] D. Barro and E. Canestrelli, “Dynamic portfolio optimization: Time no. 2, pp. 201–242, October 1997.
decomposition using the maximum principle with a scenario approach,” [25] RiskMetrics technical manual, 4th ed., J.P. Morgan, New York, 1995.
European Journal of Operational Research, vol. 163, no. 1, pp. 217–229, [26] J. W. Goh, K. G. Lim, M. Sim, and W. Zhang, “Portfolio value-at-
2005, financial Modelling and Risk Management. risk optimization for asymmetrically distributed asset returns,” European
[12] J. Li and J. Xu, “Multi-objective portfolio selection model with fuzzy Journal of Operational Research, vol. 221, no. 2, pp. 397 – 406, 2012.
random returns and a compromise approach-based genetic algorithm,” [27] P. Artzner, F. Delbaen, J.-M. Eber, and D. Heath, “Coherent measures
Information Sciences, vol. 220, no. 0, pp. 507 – 521, 2013, online Fuzzy of risk,” Mathematical Finance, vol. 9, no. 3, pp. 203–228, 1999.
Machine Learning and Data Mining.
[28] H. Mausser and D. Rosen, “Beyond var: from measuring risk to
[13] H. Yano, “Fuzzy decision making for fuzzy random multiobjective linear
managing risk,” in Proceedings of the IEEE/IAFE 1999 Conference
programming problems with variance covariance matrices,” Information
on Computational Intelligence for Financial Engineering (CIFEr 1999),
Sciences, vol. 272, no. 0, pp. 111 – 125, 2014.
1999, pp. 163–178.
[14] P. Gupta, M. Inuiguchi, M. K. Mehlawat, and G. Mittal, “Multiobjective
credibilistic portfolio selection model with fuzzy chance-constraints,” [29] R. T. Rockafellar and S. Uryasev, “Optimization of conditional value-
Information Sciences, vol. 229, pp. 1 – 17, 2013. at-risk,” Journal of Risk, vol. 2, pp. 21–41, 2000.
[15] P. Gupta, M. K. Mehlawat, and A. Saxena, “A hybrid approach to asset [30] B. Golub, M. Holmer, R. McKendall, L. Pohlman, and S. A. Zenios,
allocation with simultaneous consideration of suitability and optimality,” “A stochastic programming model for money management,” European
Information Sciences, vol. 180, no. 11, pp. 2264 – 2285, 2010. Journal of Operational Research, vol. 85, no. 2, pp. 282 – 296, 1995.
[16] P. Kall and S. Wallace, Stochastic programming, ser. Wiley-Interscience [31] C. Vassiadou-Zeniou and S. A. Zenios, “Robust optimization models for
series in systems and optimization. Wiley, 1994. managing callable bond portfolios,” European Journal of Operational
[17] J. Birge and F. Louveaux, Introduction to Stochastic Programming, ser. Research, vol. 91, no. 2, pp. 264 – 273, 1996.
Springer Series in Operations Research and Financial Engineering. U.S. [32] A. Beltratti, A. Consiglio, and S. Zenios, “Scenario modeling for the
Government Printing Office, 1997. management of international bond portfolios,” Annals of Operations
[18] G. Dantzig, Linear programming and extensions, ser. Rand Corporation Research, vol. 85, no. 0, pp. 227–247, 1999.
Research Study. Princeton, NJ: Princeton Univ. Press, 1963. [33] J. M. Mulvey and H. Vladimirou, “Stochastic network programming for
[19] G. B. Dantzig, “Linear programming under uncertainty,” Management financial planning problems,” Manage. Sci., vol. 38, no. 11, pp. 1642–
Science, vol. 50, no. 12 Supplement, pp. 1764–1769, Dec. 2004. 1664, Nov. 1992.
[20] A. J. King and S. W. Wallace, Modeling with Stochastic Programming, [34] J. M. Mulvey and W. T. Ziemba, Handbooks in Operations Research
ser. Springer Series in Operations Research and Financial Engineering. and Management Science. Elsevier, 1995, vol. Volume 9, ch. Chapter
Springer New York, 2012. 15 Asset and liability allocation in a global environment, pp. 435–463.
[21] S. W. Wallace and W. T. Ziemba, Applications of Stochastic Program- [35] J. M. Mulvey, D. P. Rosenbaum, and B. Shetty, “Parameter estimation
ming, 1st ed. Society for Industrial and Applied Mathematics, June in stochastic scenario generation systems,” European Journal of Oper-
2005. ational Research, vol. 118, no. 3, pp. 563 – 577, 1999.
[22] R. Bai, S. W. Wallace, J. Li, and A. Y.-L. Chong, “Stochastic ser- [36] N. Topaloglou, H. Vladimirou, and S. A. Zenios, “A dynamic stochastic
programming model for international portfolio management,” European with an investment horizon and transaction costs,” Omega, vol. 41, no. 2,
Journal of Operational Research, vol. 185, no. 3, pp. 1501–1524, 2008. pp. 406 – 420, 2013, management science and environmental issues.
[37] S. J. Stoyan and R. H. Kwon, “A stochastic-goal mixed-integer program- [59] M. Dyer and L. Stougie, “Computational complexity of stochastic
ming approach for integrated stock and bond portfolio optimization,” programming problems,” Mathematical Programming, vol. 106, no. 3,
Computers & Industrial Engineering, vol. 61, no. 4, pp. 1285 – 1295, pp. 423–432, May 2006.
2011. [60] A. Shapiro and A. Nemirovski, “On complexity of stochastic program-
[38] L.-Y. Yu, X.-D. Ji, and S.-Y. Wang, “Stochastic programming models in ming problems,” Continous Optimization, pp. 111–146, 2004.
financial optimization: A survey,” Advanced Modeling and Optimization, [61] S. Baluja, “Population-based incremental learning: A method for in-
vol. 5, no. 1, 2003. tegrating genetic search based function optimization and competitive
[39] S. J. Stoyan and R. H. Kwon, “A two-stage stochastic mixed-integer learning,” Pittsburgh, PA, USA, Tech. Rep., 1994.
programming approach to the index tracking problem,” Optimization [62] S. Baluja and R. Caruana, “Removing the genetics from the standard
and Engineering, vol. 11, no. 2, pp. 247–275, 2010. genetic algorithm,” Pittsburgh, PA, USA, Tech. Rep., 1995.
[40] S.-E. Fleten, K. Hyland, and S. W. Wallace, “The performance of [63] F. Glover and M. Laguna, Tabu Search. Norwell, MA, USA: Kluwer
stochastic dynamic and fixed mix portfolio models,” European Journal Academic Publishers, 1997.
of Operational Research, vol. 140, no. 1, pp. 37 – 49, 2002. [64] J. L. Shapiro, “The sensitivity of PBIL to its learning rate, and how
[41] J. L. Higle and S. W. Wallace, “Sensitivity analysis and uncertainty in detailed balance can remove it,” in Proceedings of the Seventh Workshop
linear programming,” Interfaces, vol. 33, no. 4, pp. 53–60, 2003. on Foundations of Genetic Algorithms, Torremolinos, Spain, September
[42] D. Barro and E. Canestrelli, “Dynamic portfolio optimization: Time 2-4, 2002, 2002, pp. 115–132.
decomposition using the maximum principle with a scenario approach,” [65] J. E. Beasley, “OR-Library: distributing test problems by electronic
European Journal of Operational Research, vol. 163, no. 1, pp. 217 – mail,” Journal of the Operational Research Society, vol. 41, no. 11,
229, 2005, financial Modelling and Risk Management. pp. 1069–1072, 1990.
[43] A. A. Gaivoronski, S. Krylov, and N. van der Wijst, “Optimal portfo- [66] M. Kaut, S. W. Wallace, H. Vladimirou, and S. Zenios, “Stability analy-
lio selection and dynamic benchmark tracking,” European Journal of sis of portfolio management with conditional value-at-risk,” Quantitative
Operational Research, vol. 163, no. 1, pp. 115 – 131, 2005, financial Finance, vol. 7, no. 4, pp. 397–409, 2007.
Modelling and Risk Management. [67] M. Kaut and S. W. Wallace, “Evaluation of scenario-generation methods
for stochastic programming,” Pacific Journal of Optimization, vol. 3,
[44] L. Escudero, A. Garn, M. Merino, and G. Prez, “A two-stage stochastic
no. 2, pp. 257–271, 2007.
integer programming approach as a mixture of branch-and-fix coor-
[68] D. Wolpert and W. Macready, “No free lunch theorems for optimization,”
dination and benders decomposition schemes,” Annals of Operations
Evolutionary Computation, IEEE Transactions on, vol. 1, no. 1, pp. 67–
Research, vol. 152, no. 1, pp. 395–420, 2007.
82, Apr 1997.
[45] S. Greco, B. Matarazzo, and R. Sowiski, “Beyond markowitz with mul-
tiple criteria decision aiding,” Journal of Business Economics, vol. 83,
no. 1, pp. 29–60, 2013.
[46] Y. Chen and X. Wang, “A hybrid stock trading system using genetic
network programming and mean conditional value-at-risk,” European
Journal of Operational Research, vol. 240, no. 3, pp. 861 – 871, 2015.
[47] L. Yu, S. Wang, Y. Wu, and K. Lai, “A dynamic stochastic programming
model for bond portfolio management,” in Computational Science -
ICCS 2004, ser. Lecture Notes in Computer Science, M. Bubak, G. van
Albada, P. Sloot, and J. Dongarra, Eds. Springer Berlin Heidelberg,
2004, vol. 3039, pp. 876–883.
[48] F. He and R. Qu, “A two-stage stochastic mixed-integer program
modelling and hybrid solution approach to portfolio selection problems,”
Information Sciences, vol. 289, no. 0, pp. 190 – 205, 2014.
[49] M. Rysz, A. Vinel, P. Krokhmal, and P. Eduardo, “A scenario decom-
position algorithm for stochastic programming problems with a class
of downside risk measures,” INFORMS Journal on Computing, vol. 27,
no. 2, pp. 416 – 430, 2015.
[50] T. Cui, R. Bai, A. J. Parkes, F. He, R. Qu, and J. Li, “A hybrid
genetic algorithm for a two-stage stochastic portfolio optimization with
uncertain asset prices,” in Evolutionary Computation (CEC), 2015 IEEE
Congress on, May 2015.
[51] M. Kaut and S. W. Wallace, “Shape-based scenario generation using
copulas,” Computational Management Science, vol. 8, no. 1–2, pp. 181–
199, 2011.
[52] G. C. Pflug, “Some remarks on the value-at-risk and the conditional
value-at-risk,” in Probabilistic Constrained Optimization. Springer US,
2000, vol. 49, pp. 272–281.
[53] S. Uryasev, “Introduction to the theory of probabilistic functions and
percentiles (value-at-risk),” in Probabilistic Constrained Optimization,
ser. Nonconvex Optimization and Its Applications, S. Uryasev, Ed.
Springer US, 2000, vol. 49, pp. 1–25.
[54] K. Høyland and S. W. Wallace, “Generating scenario trees for multistage
decision problems,” Manage. Sci., vol. 47, no. 2, pp. 295–307, Feb. 2001.
[55] K. Høyland, M. Kaut, and S. W. Wallace, “A heuristic for moment-
matching scenario generation,” Computational Optimization and Appli-
cations, vol. 24, no. 2–3, pp. 169–185, 2003.
[56] F. Andersson, S. Uryasev, and S. Uryasev, “Credit risk optimization with
conditional value-at-risk criterion,” Mathematical Programming, vol. 89,
no. 2, pp. 273–291, January 2001.
[57] P. Krokhmal, J. Palmquist, and S. Uryasev, “Portfolio optimization with
conditional value-at-risk objective and constraints,” Journal of Risk,
vol. 4, pp. 11–27, 2002.
[58] M. Woodside-Oriakhi, C. Lucas, and J. Beasley, “Portfolio rebalancing