Professional Documents
Culture Documents
Maurer, Göllmann - 2013 - Theory and Applications of Optimal Control Problems With Multiple Time-Delays-Annotated
Maurer, Göllmann - 2013 - Theory and Applications of Optimal Control Problems With Multiple Time-Delays-Annotated
413
MANAGEMENT OPTIMIZATION
Volume 10, Number 2, April 2014 pp. 413–441
Laurenz Göllmann
Department of Mechanical Engineering
Münster University of Applied Sciences,
Stegerwaldstrasse 39, 48565 Steinfurt, Germany
Helmut Maurer
Institute of Computational and Applied Mathematics
Westfälische Wilhelms-Universität Münster
Einsteinstrasse 62, 48149 Münster, Germany
1. Introduction. Dynamic systems with delays in both state and control variables
play an important role in the modeling of real-life phenomena in chemical and
biological processes, in economics and mechanics and various other fields, cf. [4, 5,
6, 18, 19]. There is an extensive literature on the necessary optimality conditions for
retarded optimal control problems with either pure control constraints or pure state
constraints; cf. [1, 3, 6, 7, 11, 13, 15, 16, 17, 21, 23, 26, 27, 28, 35]. In particular,
Warga [35] presents a general approach for deriving necessary optimality conditions
which is based on optimization theory in function spaces. We must admit that
we could not translate the necessary conditions of Warga [35] in the presence of
a practical example like the one in Section 6. Despite the impressive amount of
theoretical work, only a few numerical examples may be found in the literature,
notably in chemical engineering, cf. [8, 24, 26, 27, 28]. But usually no attempt
has been made to verify the necessary conditions accurately by providing adjoint
variables.
In [10], we have presented a Minimum (Maximum) Principle for delayed opti-
mal control with mixed control-state constraints and have illustrated the theory
on several numerical examples. In this paper, we consider optimal control prob-
lems with multiple time-delays and mixed control-state constraints (Section 2) and
extend the approach in [10] to derive a Minimum Principle (Section 3). Again,
413
414 LAURENZ GÖLLMANN AND HELMUT MAURER
we use the transformation technique introduced in Guinn [11] to convert the de-
layed control problem into a higher-dimensional undelayed control problem. This
approach requires a commensurability condition for the delays which is also crucial
for discretization and optimization methods to solve the delayed control problem
(Section 4). The analysis is restricted to Euler’s method but can be generalized
to higher-order Runge–Kutta methods. We show that the Lagrange multipliers of
the finite-dimensional optimization problems can be identified with the discretized
adjoint (costate) variables thus enabling us to verify all conditions of the Minimum
Principle. Section 5 presents an academic example which possesses an analytical
solution. This allows to test the accuracy of the numerical solution for various step
sizes. Finally, in Section 6 we treat an optimal control problem modeling the innate
immune response which was introduced by Stengel et al. [29, 30]. For cost func-
tionals of L2 –type and L1 –type, optimal controls are determined simultaneously in
all control components both in the undelayed and delayed case. In addition, we
introduce a mixed control-state constraint to reduce the state variable representing
the damage of an organ.
functional in Mayer form by introducing the additional state variable xn+1 defined
by the retarded equation
ẋn+1 = L(t, x(t − r0 ), . . . , x(t − rd ), u(t − r0 ), . . . , u(t − rd )), xn+1 (0) = 0. (8)
Then the objective (7) can be rewritten in Mayer form as J(x̃, u) = g(x(T )) +
xn+1 (T ) with the augmented state variable x̃ = (x, xn+1 ) ∈ Rn+1 .
A pair of functions (u, x) ∈ L∞ ([0, T ], Rm ) × W 1,∞ ([0, T ], Rn ) is called an admis-
sible pair for problem (MDOCP), if the state x and control u satisfy the restrictions
(2)–(6). An admissible pair (û, x̂) is called a locally optimal pair or weak minimum
for (MDOCP), if
J(û, x̂) ≤ J(u, x)
holds for all admissible (u, x) in a neighborhood of (û, x̂) with kx(t) − x̂(t)k, ku(t) −
û(t)k < ε for all t ∈ [0, T ] and ε > 0 sufficiently small. Instead of considering a
weak minimum, we could also work with the more general notion of a Pontryagin
minimum; cf. Milyutin, Osmolovskii [20].
Phi Ha
In view of 0 = r0 < r1 < . . . < rd we have 0 < k1 < . . . < kd . In analogy to the
nondelayed case we define a free Hamiltonian (or Pontryagin) function by
H(t, y0 , . . . , yd , v0 , . . . , vd , λ) := λ∗ f (t, y0 , . . . , yd , v0 , . . . , vd ), λ ∈ Rn , (10)
and an augmented Hamiltonian function involving the mixed control-state constraint
(6) by
H(t, y0 , . . . , yd , v0 , . . . , vd , λ, µ) :=H(t, y0 , . . . , yd , v0 , . . . , vd )
(11)
+ µ∗ C(t, y0 , . . . , yd , v0 , . . . , vd ),
416 LAURENZ GÖLLMANN AND HELMUT MAURER
where Ĥyδ (t) denotes the evaluation of the partial derivatives Hyδ at (t, x̂(t −
r0 ), . . . , x̂(t − rd ), û(t − r0 ), . . . , û(t − rd ), λ̂(t), µ̂(t)),
2. Transversality Condition:
λ̂(T )∗ = λ0 gx (x̂(T )) + ν̂ ∗ ψx (x̂(T )); (14)
3. Minimum Condition for the Free Hamiltonian Function:
d
X
χ[0,T −rδ ] (t)Ĥ(t + rδ )
δ=0
≤ H(t, . . . , u, û(t − r1 ), . . . , û(t − rd ))
(15)
d−1
X
+ χ[0,T −rδ ] (t)H(t + rδ , . . . , û(t + rδ − rδ−1 ), u, û(t + rδ − rδ+1 ), . . .)
δ=1
+ χ[0,T −rd ] (t)H(t + rd , . . . , û(t + rd − r1 ), . . . , û(t + rd − rd−1 ), u)
for all u ∈ Rm satisfying
C(t, x̂(t − r0 ), . . . , x̂(t − rd ),
û(t − r0 ), . . . , û(t − rδ−1 ), u, û(t − rδ+1 ), . . . , û(t − rd ))
≤0 for δ = 0, . . . , d,
where Ĥ(t) denotes the evaluation of the free Hamiltonian H at (t, x̂(t −
r0 ), . . . , x̂(t − rd ), û(t − r0 ), . . . , û(t − rd ), λ̂(t), µ̂(t)),
OPTIMAL CONTROL PROBLEMS WITH MULTIPLE TIME-DELAYS 417
Proof. The proof of Theorem 3.1 uses the transformation technique suggested by
Guinn [11] who derived first order necessary conditions for unconstrained optimal
control problems with a single time-delay in the state variable. Now we apply a
further extension of this transformation method to the multiple delayed problem
(MDOCP). In view of the commensurability condition (9), there exists a constant
h > 0 and integers k1 , . . . , kd such that with k0 := 0 we can rewrite the delays as
rδ = kδ h, δ = 0, . . . , d.
The time interval [0, h] will be considered as the base time interval for the trans-
formed (augmented) control problem assigned to (MDOCP).
We introduce the state variable Ξ∗ = (ξ0∗ , . . . , ξN ∗
−1 ) ∈ R
N ·n
, ξi ∈ Rn , and control
variable Θ∗ = (θ0∗ , . . . , θN
∗
−1 ) ∈ R
N ·m
, θi ∈ Rm , which are defined by
ξi (t) := x(t + ih), θi (t) := u(t + ih), for t ∈ [0, h], i = 0, . . . , N − 1. (19)
The continuity of the state x(t) in [0, T ] implies the following boundary conditions
for the augmented state Ξ(t),
ξi (h) = ξi+1 (0), i = 0, . . . , N − 2,
which can be written as
Vi (ξi+1 (0), ξi (h)) := ξi (h) − ξi+1 (0) = 0, i = 0, . . . , N − 2. (20)
In terms of the new state and control variables Ξ and Θ, the multiple delayed optimal
control problem (MDOCP) is equivalent to the following nondelayed optimal control
problem on the basic time interval [0, h]:
Minimize J(Θ, Ξ) = g(ξN −1 (h)) (21)
subject to
ξ˙i (t) = f (t + ih, ξi−k0 (t), . . . , ξi−kd (t), θi−k0 (t), . . . , θi−kd (t)), t ∈ [0, h],
(22)
i = 0, . . . , N − 1,
Vi (ξi+1 (0), ξi (h)) = 0, i = 0, . . . , N − 2,
(23)
VN −1 (ξN −1 (h)) := ψ(ξN −1 (h)) = 0,
C(t + ih, ξi−k0 (t), . . . , ξi−kd (t), θi−k0 (t), . . . , θi−kd (t)) ≤ 0, t ∈ [0, h],
(24)
i = 0, . . . , N − 1.
418 LAURENZ GÖLLMANN AND HELMUT MAURER
The fixed initial value profiles (3) and (4) are included in this notation by considering
the variables ξ−kd , . . . , ξ−1 and θ−kd , . . . , θ−1 defined by
ξi (t) := x0 (t + ih)
i = −kd , −kd − 1, . . . , −1, t ∈ [0, h].
θi (t) := u0 (t + ih)
Introducing adjoint variables and multipliers for the augmented delayed problem
(21)–(24) by
Λ = (Λ0 , . . . , ΛN −1 )∗ ∈ RN ·n , M = (M0 , . . . , MN −1 )∗ ∈ RN ·p ,
the Hamiltonians (10) and (11) for the augmented control problem become
N
X −1
K(t, Ξ, Θ, Λ) = Λ∗i f (t + ih, ξi−k0 , . . . , ξi−kd , θi−k1 , . . . , θi−kd ), (25)
i=0
K(t, Ξ, Θ, Λ, M ) = K(t, Ξ, Θ, Λ)+
N −1
X (26)
+ Mi∗ C(t + ih, ξi−k0 , . . . , ξi−kd , θi−k0 , . . . , θi−kd )
i=0
Thus in analogy to (19), every locally optimal pair (û(·), x̂(·)) for (MDOCP) defines
a pair
(Θ̂(·), Ξ̂(·)) = (θ̂0 , . . . , θ̂N −1 , ξˆ0 , . . . , ξˆN −1 )
by
ξˆi (t) := x̂(t + ih), θ̂i (t) := û(t + ih), for t ∈ [0, h], i = 0, . . . , N − 1, (27)
which is a minimizer to the augmented problem (21)–(24). Note that the assumed
regularity condition (12) ensures a corresponding regularity condition for the mixed
control-state constraints (24) of the augmented problem.
Pontryagin’s Minimum Principle for delayed optimal control problems cf. [14, 20,
22, 25] assures the existence of a adjoint (costate) function Λ̂ ∈ W 1,∞ ([0, h], RN ·n ),
a multiplier λ0 ≥ 0, a multiplier function M̂ ∈ L∞ ([0, h], RN ·p ) and a vector ν̂ ∈
R(N −1)·n+q , ν̂ = (ν̂0∗ , . . . , ν̂N
∗ ∗ ∗ n q
−2 , ν̂N −1 ) where ν̂0 , . . . ν̂N −2 ∈ R and ν̂N −1 ∈ R ,
such that the following conditions hold for almost every t ∈ [0, h]:
1. Adjoint differential equation:
d
Λ̂(t)∗ = −KΞ (t, Ξ̂(t), Θ̂(t), Λ̂(t), M̂ (t)); (28)
dt
2. Transversality condition:
Λ̂(0)∗ = −ν ∗ VΞα (Ξ̂(0), Ξ̂(h)) (29)
Λ̂(h)∗ = λ0 ΓΞβ (Ξ̂(h)) + ν ∗ VΞβ (Ξ̂(0), Ξ̂(h)) (30)
where
∗
V (Ξα , Ξβ ) = V0 (ξ0β , ξ1α ), . . . , VN −2 (ξ0β , ξ1α ), VN −1 (ξN β
−1 )
∗
= ξ0β − ξ1α , ξ1β − ξ2α , . . . , ξNβ
−2 − ξ α
N −1 , ψ(ξ β
N −1 )
denotes the function defining the boundary constraints, and
β
Γ(Ξβ ) = g(ξN −1 ).
Herein
Ξα = (ξ0α , . . . , ξN
α
−1 ), Ξβ = (ξ0β , . . . , ξN
β
−1 )
OPTIMAL CONTROL PROBLEMS WITH MULTIPLE TIME-DELAYS 419
represent placeholder variables for the boundary states Ξ(0) and Ξ(h). For the
components Λi of the adjoint variable Λ, the transversality condition becomes
N −1
X ∂
Λ̂i (0)∗ = − ν̂j∗ Vj (ξˆj+1 (0), ξˆj (h))
j=0
∂ξiα
N −1
∂ X ∂
Λ̂i (h)∗ = λ0 β
Γ(Ξ̂(h)) + ν̂j∗ Vj (ξˆj+1 (0), ξˆj (h)),
∂ξi j=0 ∂ξiβ
Let us first evaluate the adjoint equations for the components Λ̂j with j = 0, 1, N −
1 − kd :
d
Λ̂j (t)∗ = −K̂ξj (t)
dt
X d h
=− Λ̂kδ +j (t)∗fyδ (t+(kδ +j)h, ξˆkδ +j−k0 (t), . . . , ξˆkδ +j−kδ (t), . . . , ξˆkδ +j−kd (t), . . .)
δ=0
| {z }
j
i
+ M̂kδ +j (t) Cyδ (t + (kδ + j)h, ξˆkδ +j−k0 (t), . . . , ξˆkδ +j−kδ (t), . . . , ξˆkδ +j−kd (t), . . .)
∗
| {z }
j
Note that for j > N − 1 − kd the number of state variables ξj appearing in the
augmented Hamiltonian (26) is reduced due to index shifting by k0 , . . . , kd . In
detail, we summarize up the occurrence of variables ξj within K depending on j:
Taking this fact into account, we are able to rewrite the adjoint equations for every
component Λ̂j , 0 ≤ j ≤ N − 1 in a compact form:
d
Λ̂j (t)∗ = −K̂ξj (t)
dt
Xd h
=− χjδ Λ̂kδ +j (t)∗ fyδ (t + (kδ + j)h, ξˆkδ +j−k0 (t), . . . , ξˆj (t), . . . , ξˆkδ +j−kd (t), . . .)
δ=0
i
+ M̂kδ +j (t)∗ Cxδ (t + (kδ + j)h, ξˆkδ +j−k0 (t), . . . , ξˆj (t), . . . , ξˆkδ +j−kd (t), . . .) ,
where
χjδ := χ{0,...,N −1−kδ } (j)
denotes the characteristic function of the index set {0, . . . , N − 1 − kδ }. As for
j > N − 1 − k0 , the variables ξj do not appear in K, i.e.
K̂ξj = 0, j > N − 1 − k0 ,
we have
d
Λ̂j (t) = 0, j > N − 1 − k0 , t ∈ [0, h]. (37)
dt
Thus, the adjoint variable Λ̂j is constant for j > N − 1 − k0 .
Now we are able to define the adjoint function λ̂ ∈ W 1,∞ ([0, T ], Rn ) and multi-
plier function µ̂ ∈ L∞ ([0, T ], Rp ) for the multiple delayed optimal control problem
(MDOCP) in the following way. For t ∈ [0, T ] there exists 0 ≤ j ≤ N − 1 with
jh ≤ t ≤ (j + 1)h. This gives χjδ = χ{0,...,N −1−kδ } (j) = χ[0,T −rδ ] (t) as
j ∈ {0, . . . , N −1−kδ } ⇐⇒ t ∈ [0, (N −1−kδ +1)h] = [0, N h−kδ h] = [0, T −rδ ].
We now put
λ̂(t) := Λ̂j (t − jh), µ̂(t) := M̂j (t − jh) (38)
and obtain from the previous adjoint equation:
˙ d
λ̂(t) = Λ̂j (t − jh)
dt
Xd h
=− χjδ Λ̂kδ +j (t − jh)∗ fyδ (t + kδ h, x̂(t + (kδ − k0 )h), . . .
δ=0
. . . , x̂(t), . . . , x̂(t + (kδ − kd )h), . . .)
∗
+ M̂kδ +j (t − jh) Cyδ (t + kδ h, x̂(t + (kδ − k0 )h), . . .
i
. . . , x̂(t), . . . , x̂(t + (kδ − kd )h), . . .)
OPTIMAL CONTROL PROBLEMS WITH MULTIPLE TIME-DELAYS 421
d
X h
=− χjδ λ̂(t + kδ h)∗ fyδ (t + kδ h, x̂(t + (kδ − k0 )h), . . .
δ=0
. . . , x̂(t), . . . , x̂(t + (kδ − kd )h), . . .)
∗
+ µ̂(t + kδ h) Cyδ (t + kδ h, x̂(t + (kδ − k0 )h), . . .
i
. . . , x̂(t), . . . , x̂(t + (kδ − kd )h), . . .)
d
X h
=− χ[0,T −rδ ] (t) λ̂(t + rδ )∗ fyδ (t + rδ , x̂(t + rδ − r0 ), . . .
δ=0
. . . , x̂(t), . . . , x̂(t + rδ − rd ), . . .)
i
+ µ̂(t + rδ )∗ Cyδ (t + rδ , x̂(t + rδ − r0 ), . . . , x̂(t), . . . , x̂(t + rδ − rd ), . . .)
d
X
=− χ[0,T −rδ ] (t)Hyδ (t + rδ , x̂(t + rδ − r0 ), . . . , x̂(t), . . . , x̂(t + rδ − rd ), . . .)
δ=0
d
X
=− χ[0,T −rδ ] (t)Ĥyδ (t + rδ )
δ=0
Thus we have found the adjoint equation (13). The transversality condition (33)
for ΛN −1 ,
Λ̂N −1 (h)∗ = λ0 gx (ξˆN −1 (h)) + ν̂ ∗ ψx (ξˆN −1 (h)),
N −1
j + kδ for δ = 0, . . . , d:
Xd h i
χjδ Λ̂kδ +j (t0 )∗ f (t0 + (j + kδ )h, . . . , θ̂j+kδ −k0 (t0 ), . . . , θ̂j (t0 ), . . . , θ̂j+kδ −kd (t0 ))
δ=0
≤
d
X h i
χjδ Λ̂kδ +j (t0 )∗ f (t0 + (j + kδ )h, . . . , θ̂j+kδ −k0 (t0 ), . . . , θj = u, . . . , θ̂j+kδ −kd (t0 ))
δ=0
Redefining the adjoint and multiplier function in (38) with t0 = t − jh, we obtain
d
X h i
χ[0,T −rδ ] (t) λ̂(t + rδ )∗ f (t + rδ , . . . , û(t + rδ − r1 ), . . . , û(t), . . . , û(t + rδ − rd ))
δ=1
≤
d
X h i
χ[0,T −rδ ] (t) λ̂(t + rδ )∗ f (t + rδ , . . . , û(t + rδ − r1 ), . . . , u, . . . , û(t + rδ − rd ))
δ=1
and hence the desired minimum condition (15) for the Hamiltonian H,
d
X
χ[0,T −rδ ] (t)Ĥ(t + rδ )
δ=0
d
X
≤ χ[0,T −rδ ] (t)H(t + rδ , . . . , û(t + rδ − r1 ), . . . , u, . . . , û(t + rδ − rd ))
δ=0
resp., the local minimum condition (16) for the augmented Hamiltonian. Condition
(36) immediately implies the multiplier and complementarity condition (17) in view
of (38).
Remark. A feasible pair (û, x̂) that satisfies the necessary conditions in Theorem
3.1 is called an extremal. An extremal is called normal, if the multiplier λ0 in
Theorem 3.1 is nonzero and, hence, can be normalized to λ0 = 1. It is easy to show
that any extremal for a control problem with free terminal state is normal; cf. the
examples in Sections 5 and 6.
with integers 0 = k0 < k1 < . . . < kd and N . This defines an equidistant discretiza-
tion mesh with grid points ti = ih, i = 0, 1, ..., N . In the following, we consider only
the Euler integration scheme. Using the approximations x(ti ) ≈ xi ∈ Rn , u(ti ) ≈
ui ∈ Rm , we obtain the following nonlinear programming problem (NLP):
Minimize J(u, x) = g(xN ) (40)
subject to
xi − xi+1 + hf (ti , xi−k0 , . . . , xi−kd , ui−k1 , . . . , ui−kd ) = 0, i = 0, .., N − 1, (41)
ψ(xN ) = 0, (42)
C(ti , xi−k0 , . . . , xi−kd , ui−k0 , . . . , ui−kd ) ≤ 0, i = 0, .., N − 1. (43)
Herein the initial value profiles x0 and u0 provide the values
x−i := x0 (−ih), i = 0, .., kd , (44)
u−i := u0 (−ih), i = 1, .., kd . (45)
The optimization variable in (NLP) is defined by the vector
z := (u0 , x1 , u1 , x2 , ..., uN −1 , xN ) ∈ RN (m+n) .
Assuming normality, the Lagrangian function for (NLP) is given by
∗
L(z, λ, µ, νN ) := g(xN ) + νN ψ(xN )
N
X −1
+ λ∗i (xi − xi+1 + hf (ti , xi−k0 , . . . , xi−kd , ui−k0 , . . . , ui−kd ))
i=0
N
X −1
+ µ∗i C(ti , xi−k0 , . . . , xi−kd , ui−k0 , . . . , ui−kd ),
i=0
(46)
with Lagrange multipliers λ = (λ0 , λ1 , . . . , λN −1 ) ∈ Rn·N , λi ∈ Rn (i = 0, . . . , N −
1), for equations (41), µ = (µ0 , µ1 , . . . , µN −1 ) ∈ Rp·N , µi ∈ Rp (i = 0, . . . , N − 1),
for the inequality constraints (43), and νN ∈ Rq for the boundary condtion (42).
Note that the variables x−i (i = 0, . . . , kd ) and u−i (i = 1, . . . , kd ) appearing in
the Lagrangian have fixed values in view of the initial conditions (44) and (45).
For an optimal solution z̄, the Karush-Kuhn-Tucker (KKT) necessary optimality
conditions for (NLP) then imply the existence of Lagrange multipliers (λ̄, µ̄, ν̄N )
satisfying the equations
∂L ∂L
(z̄, λ̄, µ̄, ν̄N ) = 0 (i = 1, ..., N ), (z̄, λ̄, µ̄, ν̄N ) = 0 (i = 0, ..., N − 1).
∂xi ∂ui
For indices i = 1, . . . , N − 1, the first set of equations yields the relations
0 = −λ̄∗i−1 + λ̄∗i
d
X h
+ χjδ hλ̄∗i+kδ fyδ (ti+kδ , x̄i+kδ −k0 , . . . , x̄i+kδ −kd , ūi+kδ −k1 , . . . , ūi+kδ −kd ) (47)
δ=1
i
+ µ̄∗i+kδ Cyδ (ti+kδ , x̄i+kδ −k0 , . . . , x̄i+kδ −kd , ūi+kδ −k1 , . . . , ūi+kδ −kd ) .
subject to
ẋ(t) = x(t − 1)x(t − 2)u(t − 2), t ∈ [0, 3], (53)
x(t) = 1, t ∈ [−2, 0], (54)
u(t) = 0, t ∈ [−2, 0). (55)
In the general form of the control problem (MDOCP) with a Bolza cost functional
we find d = 2 and the delays
r0 = 0, r1 = 1, r2 = 2.
A control-state constraint will be imposed later. For simplicity, let us keep the
notation x and u for the nondelayed state and control variable and denote by yk ,
resp., vk (k = 1, 2) the delayed state and control variables. Then the Hamiltonian
(10) is given by
H(t, x, y1 , y2 , u, v1 , v2 , λ) = x2 + u2 + λ y1 y2 v2 . (56)
For an optimal pair (u, x), the adjoint equation (13) and transversality condition
(14) in Theorem 3.1 yield
λ̇(t) = −Hx (t, x(t), x(t − 1), x(t − 2), u(t), u(t − 1), u(t − 2), λ(t))
− χ[0,2] (t)Hy1 (t + 1, x(t + 1), x(t), x(t − 1), u(t + 1), u(t), u(t − 1), λ(t + 1))
− χ[0,1] (t)Hy2 (t + 2, x(t + 2), x(t + 1), x(t), u(t + 2), u(t + 1), u(t), λ(t + 2))
= −2x(t) − χ[0,2] (t)λ(t + 1)x(t − 1)u(t − 1) − χ[0,1] (t)λ(t + 2)x(t + 1)u(t),
λ(3) = 0.
It immediately follows from (53)–(55) that
x(t) = 1 for t ∈ [0, 2].
The state variable can only be influenced by the control u(t − 2) on the terminal
interval [2, 3]. Hence, it suffices to determine the optimal control u(t) on the interval
[0, 1]. The minimum condition (15) requires the minimization of the expression
H(t, x(t), x(t − 1), x(t − 2), u, u(t − 1), u(t − 2))
+ χ[0,2] (t)H(t + 1, x(t + 1), x(t), x(t − 1), u(t + 1), u, u(t − 1))
+ χ[0,1] (t)H(t + 2, x(t + 2), x(t + 1), x(t), u(t + 2), u(t + 1), u)
with respect to the control variable u ∈ R for t ∈ [0, 3]. As H does not depend
on v1 , the middle term H(t + 1, x(t + 1), x(t), x(t − 1), u(t + 1), u, u(t − 1)) can be
deleted in the preceding sum. Hence, the function
H(t, x(t), x(t − 1), x(t − 2), u, u(t − 1), u(t − 2))
+ χ[0,1] (t)H(t + 2, x(t + 2), x(t + 1), x(t), u(t + 2), u(t + 1), u)
= (x(t))2 + u2 + λ(t)x(t − 1)x(t − 2)u(t − 2)
+ χ[0,1] (t)((x(t + 2))2 + (u(t + 2))2 + λ(t + 2)x(t + 1)x(t)u)
has to be minimized with respect to u for t ∈ [0, 3]. Therefore, the gradient of this
expression with respect to u vanishes:
2u(t) + χ[0,1] (t)λ(t + 2)x(t + 1)x(t) = 0. (57)
For t ∈ [0, 1], we obtain the control
1 1
u(t) = − λ(t + 2)x(t + 1) = − λ(t + 2), t ∈ [0, 1].
2 2
426 LAURENZ GÖLLMANN AND HELMUT MAURER
Summing up our findings, we have obtained the optimal solution (x, u, λ):
e−2 t+2 e4 −(t+2)
for t ∈ [0, 1] : x(t) = 1, u(t) = e − e ,
e2 + 1 e2 + 1
e2t − (2e4 + 8e2 − 2)t + 5e4 + 16e2 + 3 − e4−2t
λ(t) = ,
(e2 + 1)2
for t ∈ [1, 2] : x(t) = 1, u(t) = 0,
e2t−2 − e6−2t 4e2 4(e2 − 1)
λ(t) = − t · + 2 + + 6,
(e2 + 1)2 (e2 + 1)2 (e2 + 1)2
e−2 t e4 −t
for t ∈ [2, 3] : x(t) = 2 e + 2 e , u(t) = 0,
e +1 e +1
−2e−2 t 2e4 −t
λ(t) = 2 e + 2 e .
e +1 e +1
The analytical optimal solution allows us to determine the optimal performance
index explicitly after some lengthy computations:
Z 3
2
J= (x2 (t) + u2 (t)) dt = 3 − 2 ≈ 2.761594156 .
0 e +1
Let us now compare the analytical solution with the numerical results. We solve
the nonlinear optimization problem (40)–(45) obtained by Euler discretization and
use the interior point code IPOPT developed by Wächter et al. [33, 34] and the
LOQO algorithm by Vanderbei [31, 32].
In both cases we apply an error tolerance tol =10−10 . The starting solution is
x(t) ≡ 1 and u(t) ≡ 0. Starting with a rather coarse discretization of N = 600 grid
points, we find the following results for both codes
grid size IPOPT LOQO
N = 600 J = 2.763044 J = 2.763044
CPU: 0.003s CPU: 0.003s
N = 60000 J = 2.761609 J = 2.761609
CPU: 0.406s CPU: 0.414s
N = 480000 J = 2.761596 J = 2.761596
CPU: 3.724s CPU: 3.940s
Compared with the analytically determined value of J = 2.761594156, the nu-
merically obtained performance index J(x, u) = 2.763044158 for the rather coarse
grid of N = 600 points gives a deviation of about 0.05% from the analytical value
J = 2.761594156. The extremely fine discretization with N = 480000 grid points
yields an objective functional of J(x, u) = 2.761596 which is correct in 6 decimals.
Fig. 1 displays the numerical solution for a mesh of N = 600 grid points, which
visually is identical with the solution determined analytically.
Now we impose a mixed type control-state constraint with a delay in the control
variable:
x(t) + u(t − 2) ≥ 0.5. (58)
In terms of the (MDOCP) this implies a the constraint
C(t, x, y1 , y2 , u, v1 , v2 ) = 0.5 − x − v2 ≤ 0. (59)
The regularity condition (2) in Assumption 2 is satisfied in view of
∂C
= (0, 0, −1) 6= 0.
∂(u, v1 , v2 )
428 LAURENZ GÖLLMANN AND HELMUT MAURER
1 0 5
-0.2 4
0.9
-0.4 3
0.8
-0.6 2
0.7 -0.8 1
0.6 -1 0
0 0.5 1 1.5 2 2.5 3 0 0.5 1 1.5 2 2.5 3 0 0.5 1 1.5 2 2.5 3
t t t
Figure 1. Optimal state x(t), control u(t) and adjoint λ(t) deter-
mined numerically (N = 600).
state x control u
+1.1 +0.2
+1.0 +0.0
-0.2
+0.9
-0.4
+0.8
-0.6
+0.7 -0.8
+0.6 -1.0
0 0.5 1 1.5 2 2.5 3 0 0.5 1 1.5 2 2.5 3
t t
+3.0 +0.0
+2.0 -0.2
+1.0
-0.4
+0.0
0 0.5 1 1.5 2 2.5 3 0 0.5 1 1.5 2 2.5 3
t t
In the sequel, we study two types of cost functionals that can be roughly classi-
fied as a L2 –functional and L1 –functional. Stengel et al. [29] consider a L2 –type
functional, where large values of pathogen concentration, poor organ health, and
high doses of therapeutic agents are penalized quadratically. For simplicity, we chose
all penalty weights as ones. Hence, optimal control functions ui ∈ L∞ ([0, T ]), i =
1, .., 4, are to be determined that minimize the cost functional
ZT
J2 (x, u) = x21 (T ) + x24 (T ) + (x21 (t) + x24 (t) + u21 (t) + u22 (t) + u23 (t) + u24 (t)) dt (67)
0
under the constraints (60)–(66). Here, x = (x1 , x2 , x3 , x4 ) denotes the state and
u = (u1 , u2 , u3 , u4 ) the control variable. It can easily be seen that optimal control
functions ui (·) are continuous. In contrast to L2 –functionals, we will also consider
the following L1 –functional that is linear in the control variables:
ZT
J1 (x, u) = x21 (T ) + x24 (T ) + (x21 (t) + x24 (t) + u1 (t) + u2 (t) + u3 (t) + u4 (t)) dt . (68)
0
1
For this L –functional, it will be crucial to impose bounds for the control variables:
0 ≤ ui (t) ≤ umax
i =2 for all t ∈ [0, T ], i = 1, 2, 3, 4. (69)
Then the control variable u appears linearly in the Hamiltonian and, hence, optimal
controls will be of bang-bang or singular type.
Optimal control for the L2 –functional (67). For notational purpose we
introduce the following set of variables representing the delayed variables: y1 stands
for x1 (t − rx1 ) = x(t − r1 ), y3 stands for x3 (t − rx3 ) = x(t − r2 ), and v1 represents
u1 (t − ru1 ) = u1 (t − r3 ). We keep the notation x = (x1 , x2 , x3 , x4 ) and u =
(u1 , u2 , u3 , u4 ) for the nondelayed state and control variables. Then the Hamiltonian
(10) for the delayed control problem (60)–(67) with L2 –functional (67) is given by
H(x, y1 , y3 , u, v1 , λ) = x21 + x24 + u21 + u22 + u23 + u24
+λ1 ((1 − x3 )x1 − v1 ) + λ2 (3A(x4 ) y1 y3 + u2 − (x2 − 2)) (70)
+λ3 (x2 − (1.5 + 0.5x1 )x3 ) + u3 ) + λ4 (x1 − x4 − u4 ),
where λ = (λ1 , λ2 , λ3 , λ4 ) denotes the adjoint variable (costate). Taking into ac-
count the mixed control–state constraint (66), the augmented Hamiltonian (11)
becomes
H(t, x, y1 , y3 , u, v1 , λ, µ) = H(t, x, y1 , y3 , u, v1 , λ) + µ (0.1u4 + x4 − α) (71)
with a scalar multiplier µ. The adjoint equations (13) yield the following advanced
equations:
λ̇1 (t) = − 2x1 (t) − λ1 (t)(1 − x3 (t)) + 0.5λ3 (t)x1 (t) − λ4 (t)
− χ[0,T −rx1 ] (t) · 3λ2 (t + rx1 ) · A(x4 (t + rx1 ))x3 (t + rx1 − rx3 )
λ̇2 (t) =λ2 (t) − λ3 (t)
(72)
λ̇3 (t) =λ1 (t)x1 (t) + λ3 (t)(1.5 + 0.5x1 (t))
− χ[0,T −rx3 ] (t) · 3λ2 (t + rx3 ) · A(x4 (t + rx3 ))x1 (t + rx3 − rx1 )
λ̇4 (t) = − 2x4 (t) − 3λ2 (t)A0 (x4 (t))x1 (t − rx1 )x3 (t − rx3 ) + λ4 (t) − µ(t)
432 LAURENZ GÖLLMANN AND HELMUT MAURER
large interval [0.267, 2.64] with a peak value of x4 (τ ) = 0.9196 at τ = 1.04. This
strongly motivates to consider the mixed control-state (66), 0.1u4 (t) + x4 (t) ≤ α,
with α < 0.5.
For α = 0.3 we obtain the optimal solution shown in Fig 6. It takes a bit longer to
reduce the pathogen to small values, but the organ damage shrinks to a maximum
value of 0.286. There is only one high peak of plasma cells x2 for t ≥ 1. The
numerical results obtained for N = 2000 grid points are
Optimal solutions for the L1 –functional (68). Let us study now the L1 –
functional
ZT
J1 (x, u) = x21 (T ) + x24 (T ) + (x21 (t) + x24 (t) + u1 (t) + u2 (t) + u3 (t) + u4 (t)) dt ,
0
for which the Hamiltonian becomes a linear function of the control variables. The
adjoint equations and transversality condition agrees with those in (72) and (73).
434 LAURENZ GÖLLMANN AND HELMUT MAURER
The evaluation of the minimum condition (15) leads to the switching functions
σ1 (t) = Hu1 [t] + χ[0,T −ru1 ] (t) Hv1 [t + ru1 ] = 1 − χ[0,T −ru1 ] (t) λ1 (t + ru1 ),
σ2 (t) = Hu2 [t] = 1 + λ2 (t),
σ3 (t) = Hu3 [t] = 1 + λ3 (t),
σ4 (t) = Hu4 [t] = 1 − λ4 (t),
(80)
which determine the minimizing controls according to
2 if σk (t) < 0
uk (t) = 0 if σk (t) > 0 , k = 1, . . . , 4. (81)
singular if σk (t) = 0 in Is ⊂ [0, T ]
The following computations show that optimal controls are bang-bang in the nonde-
layed control problem, whereas in the delayed problem, the control u4 has singular
subarcs. It is interesting to note that the control u4 along an active arc of the mixed
constraint (66) behaves like a singular arc as long as it stays in the interior of the
control set [0, 2].
First, the state and control variables are computed for the nondelayed control
problem omitting the constraint (66). Using N = 4000 grid points, the Euler
OPTIMAL CONTROL PROBLEMS WITH MULTIPLE TIME-DELAYS 435
2
Figure 7. L -functional (67) and delays rx1 = 1.0, rx3 = 1.5,
ru1 = 2.0: constrained function 0.1u4 (t) + x4 (t) (right scale) and
multiplier µ(t) (left scale).
The state variables are depicted in Fig. 8 on the reduced time interval [0, 6] as well as
the control variables, resp. the switching functions (80) on the time intervals [0, 1.5],
resp. [0, 2]. The control u2 is identically zero, whereas the controls u1 , u3 , u4 are
bang-bang with only one switch at tk from uk = 2 to uk = 0: t1 = 0.668, t3 =
0.238, t4 = 0.430. Fig. 8 clearly demonstrates that the switching conditions (81) for
the minimizing controls are perfectly satisfied. Drugs u1 , u3 , u4 have to be given in
full doses before discontinuing abruptly at separate switching times.
2 u3 2 u4
1.5 σ3 σ4
1.5
1
1
0.5
0.5
0
-0.5 0
-1 -0.5
-1.5 -1
0 0.5 1 1.5 2 0 0.5 1 1.5 2
time t time t
Now, we compute the optimal solution with delays r1 = rx1 = 1.0, r2 = rx3 = 1.5,
r3 = ru1 = 2.0 and get the following results with N = 1000 grid points:
J1 (x, u) = 14.6579, x1 (T ) = 2.29399 × 10−4 , x2 (T ) = 8.41865 × 10−4 .
Optimal state control and control variables are displayed in Fig. 9. It is surprising
that the pathogen killer u1 is identically zero. The controls u2 and u3 are bang-
bang with only one switch, while the organ healing factor u4 is a rather complicated
control composed by several bang-bang and singular arcs. The bottom of Fig. 9
exhibits the controls u3 and u4 and its associated switching functions σ3 and σ4 on
the reduced interval [0, 3] to demonstrate more clearly that the control law (81) is
satisfied numerically.
Finally, we consider the control problem with delays and impose the mixed
control-state constraint 0.1u4 (t) + x4 (t) ≤ 0.5 as for the L2 –functional. Here, for
N = 2000 grid points we get the results
J1 (x, u) = 14.6632, x1 (T ) = 2.4058 × 10−4 , x2 (T ) = 8.6099 × 10−4 .
Fig. 10 shows optimal controls that are similar to those in Fig. 9: the pathogen
killer u1 is identically zero, the controls u2 and u3 are bang-bang with one switch,
438 LAURENZ GÖLLMANN AND HELMUT MAURER
and the control u4 has bang-bang and boundary arcs. Here, it is noteworthy that
the initial bang-bang arc with u4 = 2 is followed by a boundary arc with active
constraint 0.1u4 (t) + x4 (t) = 0.5.
u1 2 u2
2 u4 u3
1.5 1.5
1 1
0.5 0.5
0 0
0 0.5 1 1.5 2 2.5 3 0 0.5 1 1.5 2 2.5 3
time t time t
Figure 10. L1 –functional (68) with delays rx1 = 1.0, rx2 = 1.5,
ru1 = 2.0 and constraint 0.1u4 + x4 ≤ 0.5. Top row : (a) state
variables x1 and x4 , (b) state variables x2 and x3 . Bottom row :
control variables u1 and u4 , (b) control variables u2 and u3 .
Fig. 11 displays the constraint function 0.1u4 (t) + x4 (t) and the associated mul-
tiplier µ.
0.4
2.0 0.3
0.2
0.0 0.1
0
0 0.5 1 1.5 2 2.5 3
time t
Figure 11. L1 -functional and delays rx1 = 1.0, rx3 = 1.5, ru1 =
2.0: constrained function 0.1u4 + x4 (right scale) and multiplier
µ(t) (left scale).
The Minimum Principle and the numerical approach is illustrated on two exam-
ples. First, we present an academic example that has an analytical solution when
no constraints are present. The comparison of the analytical and numerical solution
demonstrates the excellent performance of the NLP approach. Then we discuss the
optimal control of the innate immune response, where the dynamic model is taken
from previous work by Stengel et al. [29, 30]. In this model, we introduce addi-
tional time delays since the antigen and antibody concentration affects the plasma
cell production after significant duration. Also, a delayed action of the applied drug
is involved as it usually takes some time for an agent to become effective. In the
numerical analysis we consider objective functionals of L2 - and L1 -type. In the
latter case bang-bang-controls can result from L1 -type functionals that are easier
to realize in medical attendance.
REFERENCES
[1] T. S. Angell and A. Kirsch, On the necessary conditions for optimal control of retarded
systems, Appl. Math. Optim., 22 (1990), 117–145.
[2] A. Asachenkov, G. Marchuk, R. Mohler and S. Zuev, Disease Dynamics, Birkhäuser, Boston,
1994.
[3] H. T. Banks, Necessary conditions for control problems with variable time lags, SIAM J.
Control, 6 (1968), 9–47.
[4] Q. Chai, R. Loxton, K. L. Teo and C. Yang, A class of optimal state-delay control problems,
Nonlinear Anal. Real World Appl., 14 (2013), 1536–1550.
[5] Q. Chai, R. Loxton, K. L. Teo and C. Yang, A unified parameter identification method for
nonlinear time-delay systems, J. Ind. Manag. Optim., 9 (2013), 471–486.
[6] W. L. Chan and S. P. Yung, Sufficient conditions for variational problems with delayed argu-
ment, J. Optim. Theory Appl., 76 (1993), 131–144.
[7] F. Colonius and D. Hinrichsen, Optimal control of functional differential systems, SIAM J.
Control Optim., 16 (1978), 861–879.
440 LAURENZ GÖLLMANN AND HELMUT MAURER
[8] S. Dadebo and R. Luus, Optimal control of time-delay systems by dynamic programming,
Optimal Control Appl. Methods, 13 (1992), 29–41.
[9] R. Fourer, D. M. Gay and B. W. Kernighan, AMPL: A Modeling Language for Mathematical
Programming, The Scientific Press, South San Francisco, California, 1993.
[10] L. Göllmann, D. Kern and H. Maurer, Optimal control problems with delays in state and
control variables subject to mixed control-state constraints, Optimal Control Appl. Methods,
30 (2009), 341–365.
[11] T. Guinn, Reduction of delayed optimal control problems to nondelayed problems, J. Opti-
mization Theory Appl., 18 (1976), 371–377.
[12] W. Hager, Runge-Kutta methods in optimal control and the transformed adjoint system,
Numer. Math., 87 (2000), 247–282.
[13] A. Halanay, Optimal controls for systems with time lag, SIAM J. Control, 6 (1968), 215–234.
[14] M. R. Hestenes, Calculus of Variations and Optimal Control Theory, John Wiley & Sons,
Inc., New York-London-Sydney, 1966.
[15] S.-C. Huang, Optimal control problems with retardations and restricted phase coordinates,
J. Optimization Theory Appl., 3 (1969), 316–360.
[16] G. L. Kharatishvili, Maximum principle in the theory of optimal time-delay processes, Dokl.
Akad. Nauk. USSR, 136 (1961), 39–42.
[17] D. Kern, Notwendige Optimalitätsbedingungen und numerische Lösungsmethoden für opti-
male Steuerprozesse mit Retardierungen, Diploma thesis, Westfälische Wilhelms-Universität
Münster, 2005.
[18] R. Loxton, K. L. Teo and V. Rehbock, An optimization approach to state-delay identification,
IEEE Transactions on Automatic Control, 55 (2010), 2113–2119.
[19] R. M. May, Time-delay versus stability in population models with two and three tropic levels,
Ecology, 54 (1973), 315–325.
[20] A. A. Milyutin and N. P. Osmolovskii, Calculus of Variations and Optimal Control, Trans-
lations of Mathematical Monographs, 180, American Mathematical Society, Providence, RI,
1998.
[21] B. S. Mordukhovich and R. Trubnik, Stability of discrete approximations and necessary op-
timality conditions for delay-differential inclusions, Ann. Oper. Res., 101 (2001), 149–170.
[22] L. W. Neustadt, Optimization. A Theory of Necessary Conditions, Princeton University
Press, Princeton, NJ, 1976.
[23] M. N. Oğuztöreli, Time-Lag Control Systems, Mathematics in Science and Engineering, 24,
Academic Press, New York-London, 1966.
[24] S. H. Oh and R. Luus, Optimal feedback control of time-delay systems, AIChE J., 22 (1976),
140–147.
[25] L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze and E. F. Mishchenko, The Mathemat-
ical Theory of Optimal Processes, Translated from the Russian by K. N. Trirogoff, Interscience
Publishers John Wiley & Sons, Inc., New York-London, 1962.
[26] W. H. Ray and M. A. Soliman, The optimal control of processes containing pure time delays–I,
necessary conditions for an optimum, Chemical Engin. Science, 25 (1970), 1911–1925.
[27] M. A. Soliman and W. H. Ray, Optimal control of multivariable systems with pure time
delays, Automatica, 7 (1971), 681–689.
[28] M. A. Soliman and W. H. Ray, On the optimal control of systems having pure time delays and
singular arcs. I. Some necessary conditions for optimality, Int. J. Control (1), 16 (1972),
963–976.
[29] R. F. Stengel, R. Ghigliazza, N. Kulkarni and O. Laplace, Optimal control of innate immune
response, Optimal Control Appl. Methods, 23 (2002), 91–104.
[30] R. F. Stengel and R. Ghigliazza, Stochastic optimal therapy for enhanced immune response,
Mathematical Biosciences, 191 (2004), 123–142.
[31] R. J. Vanderbei, LOQO: An interior point code for quadratic programming, Optim. Methods
Softw., 11/12 (1999), 451–484.
[32] R. J. Vanderbei and D. F. Shanno, An interior-point algorithm for nonconvex nonlinear
programming, Comput. Optim. Appl., 13 (1999), 231–252.
OPTIMAL CONTROL PROBLEMS WITH MULTIPLE TIME-DELAYS 441
[33] A. Wächter, An Interior Point Algorithm for Large-Scale Nonlinear Optimization with Ap-
plications in Process Engineering, Ph.D thesis, Carnegie Mellon University in Pittsburgh,
2002.
[34] A. Wächter and L. T. Biegler, On the implementation of an interior-point filter line-search
algorithm for large-scale nonlinear programming, Math. Program., 106 (2006), 25–57.
[35] J. Warga, Optimal Control of Differential and Functional Equations, Academic Press, New
York-London, 1972.
Received November 2012; 1st revision February 2013; final revision July 2013.
E-mail address: goellmann@fh-muenster.de
E-mail address: maurer@uni-muenster.de
Annotations
01 Phi Ha Page 3
22/7/2020 5:45
Đây chính là điểm mạnh của reduce # of delays
02 Phi Ha Page 3
22/7/2020 5:44