Professional Documents
Culture Documents
Annulus Optimization
Annulus Optimization
I. I NTRODUCTION
This paper considers a class of finite time horizon optimal
control problems with a convex cost, convex state and nonconvex control constraints. There are a variety of optimal
control problems that fall into this class. One interesting
example, which is also the motivating example of this paper,
is the planetary landing problem[11], [8], also known as the
soft landing problem in the optimal control literature[6]. In
planetary landing, an autonomous spacecraft lands on the
surface of a planet by using thrusters, which can produce
a finite magnitude force vector with an upper and nonzero
lower bounds on the magnitude. The nonzero lower bound
constraint exists due to the fact that the thrusters cannot
operate reliably under this bound; if the thrusters are throttled
below this value, they may not restart, leading to failure of
the mission. This constraint on the magnitude makes the
set of feasible controls non-convex. There are also convex
state constraints that ensure, among other things, that the
spacecraft does not go below the surface of the planet.
The non-convexity of an optimal control problem can
be due to: (i) non-convex cost; (ii) nonlinear state dynamics; (iii) non-convex state constraints; and (iv) non-convex
control constraints. Since the systems considered in this
paper have convex cost, linear system dynamics and convex
state constraints, we only have a single source of nonconvexity, namely the control constraints. In this paper, we
introduce a convex relaxation of the control constraints. This
convex relaxation proceeds as follows. The non-convex set
of feasible control inputs is replaced by a convex set of
feasible control inputs by introducing a slack variable. It is
first shown that the optimal solution of the relaxed problem
also defines a feasible solution of the original one when the
optimal state trajectory of the relaxed problem is strictly in
c
2009
California Institute of Technology. Government sponsorship
acknowledged.
Authors are with Jet Propulsion Laboratory, California Institute
of Technology, Pasadena, California, USA. behcet@jpl.nasa.gov,
lars@jpl.nasa.gov.
tf
g0 (u(t)) dt
t0
x(t)
a.e. [t0 , tf ]
(t0 , tf , x(t0 ), x(tf ), (t0 ), (tf )) E := E {0}IR+
(5)
a.e. [t0 , tf ]
i = 1, ..., q,
(3)
< t0
at
6 t 7
6 t 7
EL := 4 xf 5 IR2n+2 : 4 xf 5 =
ax
0
0
>
:
xf
xf
ff
Lt 0
t
+
, (t , x ) IRqe ,
0 Lx
x
(7)
ux
Non-convex set of
feasible controls
Convexification
uy
(4)
Convex set of
feasible controls
ux
t [t0 , tf ],
(10)
(t) := (t) =
6 0,
(t)
t [t0 , tf ].
(11)
a.e. t [t0 , tf ]
(12)
(t)
= A(t)T (t)
(t)
=0
max
H(t, x, , u, , ).
(u, ) V
(15)
0
h0
maximum principle, where we use the fact that U is a
k(t
)
x(t
)
0
constant set, i.e., it does not depend on time or state. However
t0
T
LTx
h0 = 0.
h0 = 0 and Lt
the transversality condition given in (iv) requires further
k(tf )
tf
explanation. The transversality condition, for an optimal
x(tf )
solution of the relaxed problem, states that (see Sec. V.3
(19)
of [2]) the vector ((t0 ), (tf )) defined as follows
Note that, by using Lemma 2 in the Appendix, EL is a
hyperplane with:
h0
h0
h0
,
H((t0 ))
, H((tf ))
, (t0 )
t0
tf
x(t0 )
EL = (s, z) : LTt s = 0, LTx z = 0 .
(20)
h0
(tf )
, (t0 ), (tf )k
This, combined with the equalities in (19), implies that = 0
x(tf )
when the condition given by (8) holds. As a result we have:
must be orthogonal to the manifold defined by the set (, (t), (t)) = 0 a.e. t [t0 , tf ]. Since the solution must
of feasible boundary conditions E at ((t0 ), (tf )). Given satisfy the necessary condition (11), this is a contradiction,
by using which resulted from assuming that there exists an interval
any point (t0 , tf , x(t0 ), x(tf ), (t0 ), (tf )) E,
the condition (4), the corresponding tangent plane to the [t1 , t2 ] such that y(t) = 0 t [t1 , t2 ]. Hence no such
manifold defined by E at that point contains the following interval exists, which implies that for every interval the set of
hyperplane
t [t1 , t2 ] for which y(t) = 0 is countable. This, combined
with the fact that there is a finite number of such intervals
0
EL = EL z IR2 : z =
, IR .
on t [t0 , tf ], implies that the set Z = {t : y(t) = 0, t
1
[t0 , tf ]} is a countable subset of [t0 , tf ] (see Proposition 4.1
Consequently a vector is orthogonal to the tangent hyper- on p.41 of [5]). Since a countable set has a measure zero,
plane of the manifold defined by the set E only if it is also this shows that y(t) 6= 0 a.e. t [t , t ].
0 f
orthogonal to EL . A vector v EL if and only if it can be
Next we will prove that the optimal controls can only be
expressed as follows
on the boundary of the feasible set. The pointwise maximum
0
0
0 1
(z, v) V
{z
}
| {z } |
a
T
Hence, the optimal control pair {u(t), (t)} must satisfy:
for some t , x , , that is,
u(t) = argmax
y(t)T z where Z((t)) := U1 V2 ((t)).
2n+4
q
+1
z Z((t))
EL = {v IR
: v=a
+ T , IR e }.
(21)
Hence, by using Lemma 2 in the Appendix, ((t0 ), (tf )) Since U2 Z((t)) for (t) 1 and U2 = {0}, we have
defined above is orthogonal to EL if and only if Z((t)) = {0}. Hence, when y(t) 6= 0, the optimization
T T ((t0 ), (tf )) = 0, which is equivalent to the equation problem (21) has a cost whose value is not constant over
Z((t)). Since the cost is not constant and it is convex
(16). Now we claim that
on Z((t)), using Theorem 3.1 on p.137 of [3], u(t)
y(t) := B(t)T (t) 6= 0
a.e. t [t0 , tf ].
(18)
Z((t)). This implies that one or both of the following
Since A is a piecewise analytic matrix valued function of hold:
time, is piecewise analytic over [t0 , tf ] (see Theorem 3
u(t) V2 ((t))
(22)
on p.213 of [5]). Consequently, since the product of analytic
u(t)
U
(23)
1
functions is also analytic, y is also piecewise analytic over
Hence if u(t)
/ U1 , we have u(t) V2 ((t)) intU1
for some (t) 1, which then implies that u(t) U1 but
u(t)
/ U2 . Consequently u(t) U1 \ U2 , which implies that
u(t) U. Since x(t) X for t [t0 , tf ], (t0 , tf , x(), u()) is
a feasible solution of the original problem.
IV. C ONVEXIFICATION OF THE O PTIMAL C ONTROL
P ROBLEM
This section introduces convex relaxations of several cases
of the general optimal control problem given by (1). For each
case, we establish that an optimal solution of the relaxed
control problem is also optimal for the original problem.
Corollary 1: Consider the original optimal control problem given by (1) with k = 0 and its convex relaxation given by (5). Suppose that Condition 1 holds. If
(t0 , tf , x(), (), u(), ()) is an optimal solution of the
relaxed problem satisfying Condition 2 and the condition
given by (10), then (t0 , tf , x(), u()) is an optimal solution
for the original problem (1).
The next control problem has an integral cost on the
control effort and it is applicable to many soft landing
applications where fuel use must be minimized [1], [8], [11].
We assume that the set of feasible controls is given by
U = {u IRm : 1 g0 (u) },
(24)
that is g1 = g0 , q = 1, and:
U1 = {u IRm : g0 (u) }.
(25)
(t) =
.
g0 (u(t))
for t I
Rt
R
R
Furthermore since: t0f
(t)dt = I (t)dt+ I g0 (u(t))dt<
R tf
(t)dt, the cost of this new solution is less than the
t0
optimal solution, which is not possible. Hence I must have
a measure zero, that is, g0 (u(t)) = (t) a.e. [t0 , tf ]. This
then implies that the cost function in the relaxed problem
is identical to Rthe cost function
R t in the original problem,
t
since: (tf ) = t0f (t)dt = t0f g0 (u(t))dt. Note that 1
g0 (u(t)) = (t) , which implies that u(t) U a.e. [t0 , tf ].
Then, following from Theorem 1 (t0 , tf , x(), u()) is feasible for the original problem. Since the cost functions are
identical and the optimal control for the relaxed problem is
feasible for the original problem, we know that the cost of
the optimal solution to the relaxed problem is greater than
or equal to the optimal cost of the original problem. Next
A=
0
2 I
I
S
0
, B =E = 0
,
w
=
ge
,
S
=
2
I
2
2
0
ACKNOWLEDGEMENTS
vx and vy
20
15
10
5
0
!20
!10
!5
10
!10
0
5.5
10
20
30
t
4
tf=5.5
3.5
0
!2
A PPENDIX
tf=30
||u||
2.5
2
1.5
1
!4
!4
!2
0.5
5.5
10
20
30
||u||
3
2
1
15
py
0
0
t Energy
Minimum
10
||u||
3
5
2
1
0
!20
20
0
0
R EFERENCES