Professional Documents
Culture Documents
Planning Under Uncertainty: Proactively
Planning Under Uncertainty: Proactively
13.1 Preliminaries
We introduce first some basic definitions on probability spaces that are
needed throughout this section. Additional background material on proba
bility theory can be found, e.g., in Billingsley (1995) and, with emphasis on
stochastic programming, in Kall (1976) and Wets (1989). In this chapter
boldface Greek characters are used to denote random vectors which belong
to some probability space as defined below.
Let Q be an arbitrary space or set of points. A cr-field for Q is a family
S of subsets of Q such that Q itself, the complement with respect to Q
of any set in S, and any union of countably many sets in E are all in E.
The members of E are called measurable sets, or events in the language
of probability theory. The set Q with the cr-field E is called a measurable
space and is denoted by (Q, S).
Let Q be a (linear) vector space and E a cr-field. A probability measure P
on (Q, S) is a real-valued function defined over the family E, which satisfies
the following conditions: (z) 0 < P(A) < 1 for A G E; (ii) P($) = 0 and
P(Q) = 1; and (Hi) if is a sequence of disjoint sets Ak C E and
if G E then P(U^=1Ak) = The triPlet is
called a probability space. The support of (Q, E, P) is the smallest subset of
Q with probability 1. If the support is a countable set then the probability
measure is said to be discrete. The term scenario is used for the elements
of Q of a probability space with a discrete distribution.
A proposition is said to hold almost surely (abbreviated a.s.) or P-
almost surely if it holds on a subset A C Q with P(A) = 1. The expected
value of a random variable Q on (Q, E, P) is the Stieltjes integral of Q with
respect to the measure P:
E[Q I [ Q(w)dP(w)
^(A) JAi
The expected value of the profit function is the Stieltjes integral with re
spect to the distribution function:
E[F(x,w)] = f F(x,w)dP(a>)
rx POO
' = / (p+o> — p~(x — 07)) dP(w) + / p+xdP(w),
Jo Jx
and the mathematical model for the newsboy problem is the following op
timization problem with respect to x,
Minimize E[fi(x,co)]
s.t. E [fi(x, cu)] = 0, i = l,2, (13.3)
x e X C IRn.
are assumed finite for all i = 0,1,..., m unless the set {u? | fo(x, u>) = Too}
has a nonzero probability, in which case E[fifix,w)] = Too. The feasibility
set
X A {x | E[fi(x,a>)] = 0, i = 1,2,... , m} Pl {x | E[fifix, a;)] < Too}
is assumed to be nonempty.
The model (13.3) is a nonlinear programming problem, whose con
straints and objective functions are represented by integrals. Much of
the theory of stochastic programming is concerned with identifying the
Sect. 13.3 Stochastic Programming Problems 375
where gi : IRn x Q -> IR, i — 1, 2,... ,m. This constraint can be cast in
the form of the general model (13.3) by defining as follows
a - 1 if gi(x,w) = 0,
a otherwise.
The objective function may also be of a reliability type, such as P{u> |
gv(x,u)) < 7}, where 7 is a constant.
In summary, an anticipative model selects a policy that leads to some
desirable characteristics of the constraint and objective functionals under
the realizations of the random vector. In the example above it is desirable
that the probability of a constraint violation is less than the prespecified
threshold value a. The precise value of a depends on the application at
hand, the cost of constraint violation, and other similar considerations.
Minimize E[/o(z(a>),u>) | 4]
s.t. E[fi(x(w),w) | ^4] = 0, i = 1,2,..., m, (13.5)
x(w) e X, almost surely.
vations are made about uncertainty, and thus can adapt by taking recourse
decisions. For example, a portfolio manager specifies the composition of a
portfolio considering both future movements of stock prices (anticipation)
and that the portfolio will be rebalanced as prices change (adaptation).
Minimize q(y,co)
s.t. W(co)y = h(co) - T(co)x, (13.9)
y e IR™1.
the first-stage decisions, /(x), and the expected cost of the second-stage
decisions. It is written as follows.
where a1 denotes the transpose of the zth row of the mo x no matrix 4, and
bi is the zth component of the mo-vector b. (a\x) = b^ i = 1,2,..., mo are
linear restrictions on the first-stage variables. This model can be cast in the
general formulation (13.3) simply by denoting fo(x,u>) = f(x) + Q(x,lj),
and = (a\x) — bi.
A formulation that combines (13.9) and (13.10) is the following:
s.t. Ax — b, (13.11)
x G IR"°,
Minimize q(ys,cvs)
s.t. W^s)ys = h^s)-T(ujs)x, (13.13)
ys e IR^1.
/ A
T(a?) ITp)
T(w2) W(u2) (13.19)
Split-Variable Formulation
The system of linear equations in (13.19) can be rewritten in a form that is,
for some algorithms, more amenable to decomposition and parallel compu
tations. In particular, in the absence of the x variables the system (13.19)
becomes block-diagonal. The split-variable formulation replicates the first-
stage variable vector x into a set of vectors xs E IRn° for each ws E Q. Once
a different first-stage decision is allowed for each scenario, the stochastic
program decomposes into S independent problems. Of course, the first-
stage variables must be nonanticipative^ that is they cannot depend on
380 Planning Under Uncertainty Chap. 13
scenarios that have not yet been observed when the first-stage decisions
are made. This requirement is enforced by adding the restrictions that
x1 = x2 = ... — xs. The split-variable formulation is then equivalent to
the original stochastic problem (13.14)-(13.18), for which equation (13.19)
/ A \ / b \
T(uA) IVp)
A b
T(u2) W(w2) fi(w2)
(13.20)
A b
T(wN) W(wN') h(uN)
x1' 0
\yN)
\ I -I / \ 0 /
(13.21)
TK(iJjK)yK 1 + WK(<jjK)yK - hx(uK).
Sect. 13.4 Robust Optimization Problem 381
where b,c,d,e are given vectors and X,B,C are given matrices. Equa
tion (13.23) denotes the structural constraints that are free of noise. Equa
tion (13.24) denotes the control constraints. The coefficients of these con
straints, i.e., the elements of B,C, and e are subject to noise. The cost
vector d is also subject to noise, while A, b, and c are not.
To define the robust optimization problem we introduce an index set
Q = {1,2,..., S}. With each index s € Q we associate the scenario set
{d(s), B(s), C(s), e(s)} of realizations of the control coefficients. Reference
382 Planning Under Uncertainty Chap. 13
to an index s imply reference to the scenario set associated with this in
dex. The probability of the sth scenario is ps, and — 1- Now
the following question is posed: What are the desirable characteristics of a
solution to problem (13.22)-(13.26) when the coefficients of the constraints
4) = £p»U) (13-27)
s€Q
captures the risk preference of the user. A popular choice of utility func
tions, for portfolio management applications, is the logarithmic function
U(£s) = log£s. The general robust optimization model includes a term
g/1, y2,..., yN) in the objective function to denote the dependence of
13.5 Applications
In this section we discuss real-world applications where uncertainty is preva
lent and wherein it is handled using the models introduced above. We give
first (subsection 13.5.1) an illustration of the robust optimization frame
work, using the classic diet problem as an example. The other subsections
discuss models for planning under uncertainty in production and inventory
management (subsection 13.5.2) and models for matrix balancing (subsec
tion 13.5.3), with a new section devoted to models for financial planning
under uncertainty (Section 13.6).
13.5.1 Robust optimization for the diet problem
The well-known diet problem is used here as an example to illustrate the
feature of model robustness. This feature is particularly interesting in the
context of optimization formulations, since feasibility has traditionally been
overemphasized.
The problem is to find a minimum-cost diet that will satisfy certain nu
tritional requirements. The origins of this problem date back to the 1940s
and to the works of G.J. Stigler and G.B. Dant zig where it was soon rec
ognized as a problem of robust optimization, since the nutritional content
of some food products may not be certain. Dantzig was still intrigued by
this ambiguity when he wrote, several decades later:
When is an apple an apple and what do you mean by its cost
and nutritional content? For example, when you say apple
do you mean a Jonathan, or McIntosh, or Northern Spy, or
Ontario, or Winesap, or Winter Banana? You see, it can make
a difference, for the amount of ascorbic acid (vitamin C) can
vary from 2.0 to 20.8 units per 100 grams depending upon the
type of apple. (Dantzig, 1990.)
The standard linear programming formulation assumes an average nu
tritional content for each food product and produces a diet. However, as
consumers buy food products of varying nutritional content they will soon
build a deficit or surplus of some nutrients. This situation may be irrele
vant for a healthy individual over long periods of time, or it may require
remedial action in the form of vitamin supplements.
Sect. 13.5 A pplica tions 385
The weight A is used to trade off feasibility robustness with cost. For
A — 0, and also allowing the index j in (13.31) to take the value j = c,
with aAc = a, we obtain the classic linear programming formulation.
The diets obtained with larger values of A have vitamin C content that
varies very little with respect to the quality of apples. Figure 13.1 illus
trates the error in vitamin C intake due to alternative robust optimization
solutions under different scenarios of vitamin C content. This figure il
lustrates the efficacy of the robust optimization model in hedging against
alternative realizations of the data. For example, if an error of ±2c units
in total vitamin C intake is acceptable, no remedial action will be needed
for the robust optimization dieter—no matter what quality of apples are
included in the diet. On the other hand, the linear programming dieter will
need remedial treatment for several of the available apple qualities (note,
from Figure 13.1, that the error in vitamin C intake exceeds the allowable
margin of ±2e under seven scenarios).
The robust optimization diet is somewhat more expensive than the diet
produced by the linear program. Figure 13.2 shows the increase in the cost
of the diet as it becomes more robust with respect to nutritional content.
This simple example clarifies the meaning of a robust solution and shows
that robust solutions are indeed possible, but at some cost.
13.5.2 Robust optimization for planning capacity expansion
Manufacturing and service firms have to plan for capacity expansion in
order to meet increasing demand for their products and services over time.
Demand, however, is usually highly uncertain. Product demand depends
on general economic conditions, competition, technological changes, and
the general business cycle. Demand may also exhibit seasonal variations,
which are particularly difficult to address in the context of service oper-
386 Planning Under Uncertainty Chap. 13
Notation
We define first the parameters of the model: use k ~ 1,2,..., K to denote
plant sites, h = 0,1,2,..., H to denote plant configurations, s € to
denote scenarios, and t = 1,2,..., T to denote time periods.
Objective function: The objective function that must be maximized has two
terms that account for direct profits from sales of the product and from
indirect profits from diverted demand, and a third term that accounts for
the retooling cost. It takes the form:
K H
Xkhst T zst = dst for all s = 1,2,..., TV, t = 1,2,..., T. (13.34)
k = l h=0
H
Y^ykht = 1 for all k = 1,2, t = l,2,...,T, (13.37)
h=0
Wkot < Vkot for all k = 1,2,..., K, t = 1,2,... ,T. (13.38)
390 Planning Under Uncertainty Chap. 13
Robustness Considerations
In practical capacity expansion applications, the expected cost or profit of
the decision is not the only objective. Because large amounts of capital
and other resources are involved, and the careers of many employees are
The total profit for a given scenario s, accounting for the fixed cost of a
capacity expansion plan, is given by:
T T K H
= (13.40)
t=l t=l fc—1 h,=0
where E[-] and Var[-] denote the expected value and the variance of the
random variable respectively, and A is a user specified parameter. Large
values of A reduce variance at the expense of reduced profits, while smaller
values allow the variance to increase, producing higher returns at a penalty
of increased risks.
Calculation of the variance requires evaluation of a quadratic function
and the resulting optimization program becomes a nonlinear program with
Sect. 13.5 Applications 391
continuous and integer variables. Such problems are very difficult to solve
with currently available software systems, and they are likely to remain
so in the future. Moreover, the use of a variance term as a measure of
risk is inappropriate when the distribution of profits is not symmetric.
m
— Vj for j = 1,2,..., n, (13.44)
2=1
X > 0. (13.45)
When the observation vectors u and v are noisy it is possible that this op
timization problem has no solution. Clearly, if
the optimization problem has no feasible solution. Several approaches
can be pursued in order to overcome this difficulty. Tradition suggests
that the vectors u and v be first scaled using the transformation Ui <—
ui ((Ej uj)/(Ei ut)} for all i so that feasibility is restored. Alternatively,
392 Planning Under Uncertainty Chap. 13
m
— Zj — Vj for j = 1,2,..., n, (13.48)
i=l
x > 0. (13.49)
(13.51)
W) = £
oexp/S)
* -y" + -Ui. (13.53)
J=1
*(0)
(13.54)
«"(0)’
where ’iP' denotes first derivative with respect to (3. Straightforward calcu
lations yield
(13.55)
394 Planning Under Uncertainty Chap. 13
(13.56)
J=1
^75 = ^exP/3. (13.57)
y-+l = Vi - f■
(13.58)
A
Step 2: (Iterative step over columns of the matrix, i.e., equations (13.48).)
For all j = 1, 2,..., n, calculate:
m
(EX+b- Zj ~vi
fl ' ______________ (1 Q
2=1
a#1 = xij^ exp^’ (13.60)
> Zj X- (13.61)
At each point in time the manager has to assess the prevailing market
conditions (such as prices and interest rates) and the composition of the
existing portfolio. The manager has also to assess potential fluctuations in
interest rates, prices, and cashflows. This information is incorporated into a
13.6.1 Notation
The model is developed using cashflow accounting equations. Investment
decisions are in dollars of face value. We define first the parameters of the
model.
£°, £° G IRm vectors of bid and ask prices respectively, at to. These prices
are known with certainty. In order to buy an instrument the buyer
has to pay the bid price, and in order to sell it the owner is asking for
the ask price.
Co + £
i—1
=E i=l
^xi + (13.62)
from sales, less the liability payments. There is one constraint for each
scenario
m m
+ 52
Third-stage constraints: Decisions made at the third stage (i.e., at time £2)
depend on the path (so,si) realized during the period [U,^) and on the
decisions made at U. The constraints are similar to those of the second
stage. The cashflow accounting equation is
m m
Pi(so,si)vi(so) + 52fci(SO,Sl)2/(sO) + 52^2(so,si)i/22(so,si)
i—1 i=l
m
= V2(so,si) + 52C2(so,si)x?(so,si) + L2(so,Si), (13.66)
2=1
for all i e I, and all paths (so,Si) such that so € So and «i G Si.
Objective function: The objective function maximizes the expected utility
of terminal wealth. In order to measure terminal wealth all securities in
the portfolio are marked-to-market, in accordance with recent U.S. Federal
Accounting Standards Board (FASB) regulations that require reporting
portfolio market and book values. The composition of the portfolio and
its market value depend on the scenarios (sq,si). The objective of the
portfolio optimization model is
where 7r(so> si) is the probability associated with the path (so, «i); IT(so, 5i)
denotes terminal wealth; and U denotes the utility function. Terminal
wealth is given by
First stage
Security^^
of 250 airplanes (a carrier the size of Delta Airlines). A study by the West
German Institute for Technology estimated the avoidable cost of air traffic
delays in 1990 due to ground holding alone at 1.5 billion US dollars.
The ground holding policy problem seeks optimal holding policies, based
( A
TV1) MV1)
A
TV2) WV2)
(13.75)
A
T(ws) WVS)
\ I
It is evident from the structure of the constraints matrix that the problem
decomposes by scenario if the non-anticipativity constraints are ignored.
Let 7 G IRM denote the right-hand side of (13.70)-(13.72) and let z E
IRa be the vector of decision variables, i.e.,
Z e IR77, (13.79)
Minimize
xseIRno ,yseIRni
V Psfij^)
J
+ V Psqt^vlj)
J J
(13.80)
(m)G.4o
S.t.
E4- E
je^nAf0
mkiXki
ke6~a\T0
+ E4 ~ E
J6«+ njVi fce6“rWi
mkiySki = ri for a11 « e -^1’ (13-82)
rrb — xfj = 0 for all s E Q and for all (z,J) G ^4q- (13.85)
where wfj > 0 and are constants which are obtained by evaluating the
summations in the minimand of (13.80) when the functions fij and are
quadratic, and then adding up over all scenarios to get the expected value
of the objective function values.
vj - Xji
•'j = 0,'
for all (z,j) G v4o and all s C Q. The dual price 717, £ C {1,2,..., Mi},
associated with the flow conservation constraint for node i C N under
scenario s e Q, is denoted by 7rf. The dual price 717, I C {M + 1,..., M +
AT}, associated with the simple bound constraints for xfj (i.e., the reduced
cost of xfj), is denoted by 7rf-. We now develop the specific projection
formulae for use in the iterative steps of Algorithm 6.4.1. The row-action
algorithm iterates one row at a time on the constraint matrix $. The
precise formulae for the iterative step are obtained by using the equation
corresponding to the chosen matrix row, as given by equations (13.81)-
(13.82). We develop the equations for only a single iterative step of the
algorithm over all constraints, i.e., flow conservation equality constraints,
bounds on the variables and non-anticipativity equality constraints. The
complete algorithm is summarized below as Algorithm 13.8.1.
Projection on Flow Conservation Constraints
First we derive the iterative step of the algorithm when the chosen row
of the matrix $ corresponds to the flow conservation constraint (13.81).
Consider the flows on the incoming- arcs, xski for k C 6~, and the flows
on the outgoing arcs, xfj for j e 6+ for a given node i C A/o under some
scenario s 6 Q.
The generalized projection z of the current iterate z onto the hyperplane
where is the zth column of 4>T, and 7* is the zth component of
7 (see (13.77)-(13.78)), determined by the flow conservation constraint at
node z, is obtained by solving the system (see Lemma 2.2.1):
Applying the iterative step (12.33) to the functional form of the objective
function (13.86) and using the structure of the rows of the constraints
matrix $ that correspond to network flow constraints we get
Using this result in (13.90) and (13.91) gives the desired formulae for updat
ing all primal variables incident to node i. The dual variable for this node
7rf is updated by adding to its current value, i.e., 7rf 7rf -I- (3$. Similar
algebraic manipulations lead to the updating formulae required to apply
the row-action algorithm to rows corresponding to constraints (13.82).
0 = ^. = 4 + 4- (13-94)
(13.95)
VF(z) = + (13.97)
%ij = (13.98)
Noting that the //(s)th row of the constraints matrix has only two nonzero
entries (cf. equation (13.96)) we can write this system as
i.e., the point (xL,x^) is projected upon the point with coordinates equal
to the weighted average of x\j and x8^ with wL and w8j being the weights.
Consider now the effect of repeated projections of the row-action algo
rithm on the non-anticipativity constraints (13.85). We can take advantage
of the almost cyclic control of the algorithm in a way that would not have
been possible with the cyclic control alone. The almost cyclic control of
the row-action algorithm allows repeated projections upon these constraints
alone until convergence—within some tolerance—of the variables x8^ for
any fixed (i, j) E v4o to a limit x*j. We show that x*j can be obtained ana
lytically, rather than using the iterative scheme. This result has important
Sect. 13.8 Iterative Algorithm for Stochastic Network Optimization 409
lJ - xf. = 0,
X/i (13.100)
x,-,- - xf
l'J
= 0.
Let VFij : IRS —> IR6 denote the subvector of the gradient VF corre
sponding to the S replications of the first-stage variable x]j, ... ,xfj and,
similarly, let denote the submatrix of $ consisting of the columns
corresponding to xjj,... ,xfj.
By repeated projection onto these non-anticipativity constraints, such
that the i/th projection is onto the constraint x^ — x^ = 0, we obtain a
sequence of points x" E IR5 satisfying
y
= VF^y) + £ Afc^(fc)), (13.101)
k=l
1 S
xij — Vij +
V s=2
l3
* S 1 A
xij= ya —S (13.103)
W,-•
410 Planning Under Uncertainty Chap. 13
(13.105)
and t = (x
j,
* A2,..., As)T. By inverting H we can solve for t. Since we are
only interested in (not in A2,..., As), we need only calculate the first
row of denoted by h = Using the special structure of H we
easily get
I / | \
h — ------- — I TT--- (w?-, w?-,..., I,
det H \ \s—1 U /
where det H is the determinant of H. The inner product of the first column
of H, which consists of all ones, and the first row of H~1 must equal 1.
Therefore hs = 1. Hence
Note that det H > 0, so that the system (13.104) has a unique solution.
Solving for x*j we get
YXijVij
x*j = {h,y} = ^--------. (13.106)
/ -j wii
bj
s=l
s
(13.107)
s—1
VF(z°) = - ( f ) 7T°.
\1n /
— —y for j) £ Ab s e Q, (13.108)
Wij
(13.110)
(13.111)
(x°kiy+? = (x
* kiy - for all k e 6-, (13.113)
Wki
412 Planning Under Uncertainty Chap. 13
and
((<,)" - wsij(uij - (xsij)v+^) if (^)p+5 >
«.)-+!= J (TT^y + Wlj^ij)^ if (^r+i < ),
|o if 0 < (xfJ )t'+ < Uij.
(13.119)
For all second-stage arcs (z,j) € A-i:
Vs-
V
if (l/o )P+i > vij>
(ysijY+1 0 if < 0, (13.120)
ifO<(y(7r+i <vsi3.
wij
and
- wij<Vij - (ysijY+b if (ytjV+^ > i ip
s \I>4-1 ^+u>^yfj)^ if (yfj)^<<
0 ifO<(y^r+ < v®-.
(13.121)
Step 2: (Iterative step for non-anticipativity constraints.)
For all first-stage arcs (z, j) e .Ao set:
s
(13.122)
Xij =
s=l
Sect. 13.9 Notes and References 413
13.5.3 The robust optimization model for matrix balancing was developed
by Zenios and Zenios (1992). A similar model, for the more general
problem of image reconstruction from projections, was suggested ear
lier by Elfving (1989), who also suggested the use of a single Newton