Approximate Solutions To Nonlinear Optimal Control Problems in Astrodynamics PDF

Hindawi Publishing Corporation
ISRN Aerospace Engineering

Volume 2013, Article ID 950912, 7 pages
http://dx.doi.org/10.1155/2013/950912
Research Article
Approximate Solutions to Nonlinear Optimal
Control Problems in Astrodynamics
Francesco Topputo and Franco Bernelli-Zazzera

Department of Aerospace Science and Technology, Politecnico di Milano, Via La Masa 34, 20156 Milano, Italy
Correspondence should be addressed to Francesco Topputo; francesco.topputo@polimi.it
Received 25 June 2013; Accepted 27 August 2013
Academic Editors: H. Baoyin, C. Bigelow, and D. Yu
Copyright © 2013 F. Topputo and F. Bernelli-Zazzera. This is an open access article distributed under the Creative Commons
Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is
properly cited.
A method to solve nonlinear optimal control problems is proposed in this work. The method implements an approximating
sequence of time-varying linear quadratic regulators that converge to the solution of the original, nonlinear problem. Each
subproblem is solved by manipulating the state transition matrix of the state-costate dynamics. Hard, soft, and mixed boundary
conditions are handled. The presented method is a modified version of an algorithm known as “approximating sequence of Riccati
equations.” Sample problems in astrodynamics are treated to show the effectiveness of the method, whose limitations are also
discussed.
1. Introduction evaluated by using the solutions at the previous iteration, and

therefore, a series of time-varying linear quadratic regulators
Optimal control problems are solved with indirect or direct is treated. This sequence is solved with a state transition
methods. Indirect methods stem from the calculus of vari- matrix approach, where three different final conditions are
ations [1, 2]; direct methods use a nonlinear programming handled: final state fully specified, final state not specified,
optimization [3, 4]. Both methods require the solution and final state not completely specified. These define hard,
of a complex set of equations (Euler-Lagrange differential soft, and mixed constrained problems, respectively.
equations or Karush-Kuhn-Tucker algebraic equations) for The main feature of the presented method is that it
which iterative numerical methods are used. These iterative does not require guessing any initial solution or Lagrange
procedures implement some form of Newton’s method to multiplier. In fact, iterations start by evaluating the state- and
find the zeros of a nonlinear function. They are initiated by control-dependent functions using the initial condition and
providing an initial guess solution. Guessing an appropriate zero control, respectively. The way the dynamics and objective
initial solution is not trivial and requires a deep knowledge function are factorized recalls the state-dependent Riccati
of the problem at hand. In indirect methods, the initial value equations (SDRE) method [7–9]. These two methods possess
of the Lagrange multiplier has to be provided, whose lack of some similarities, although the way they solve the optimal
physical meaning makes it difficult to formulate a good guess. control problem is different. As the method is approximated,
In direct methods, the initial trajectory and control have to be suboptimal solutions are derived. These could be used as first
guessed at discrete points over the whole time interval. guess solutions for either indirect or direct methods.
This paper presents an approximate method to solve
nonlinear optimal control problems. This is a modification
of the method known as “approximating sequence of Riccati 2. The Nonlinear Optimal Control Problem
equations” (ASRE) [5, 6]. It transforms the nonlinear dynam- The optimal control problem requires that, given a set of 𝑛
ics and objective function into a pseudolinear and quadratic- first-order differential equations
like structure, respectively, by using state- and control-
dependent functions. At each iteration, these functions are ẋ = f (x, u, 𝑡) , (1)
2 ISRN Aerospace Engineering
the 𝑚 control functions u(𝑡) must be determined within of Riccati equations. This method reduces problem (8)-(9)
initial, final time 𝑡𝑖 , 𝑡𝑓 , such that the performance index to a series of time-varying linear quadratic regulators that
are defined by evaluating the state- and control-dependent
𝑡𝑓
matrices using the solution of the previous iteration (the first
𝐽 = 𝜑 (x (𝑡𝑓 ) , 𝑡𝑓 ) + ∫ 𝐿 (x, u, 𝑡) d𝑡 (2)
𝑡𝑖 iteration considers the initial condition and zero control).
The initial step consists in solving problem 0, which is
is minimized while satisfying 𝑛 + 𝑞 two-point conditions defined as follows:
x (𝑡𝑖 ) = x𝑖 , 𝜓 (x (𝑡𝑓 ) , 𝑡𝑓 ) = 0. (3) ẋ (0) = 𝐴 (x𝑖 , 𝑡) x(0) + 𝐵 (x𝑖 , 0, 𝑡) u(0) ,

1
The problem consists in finding a solution that represents 𝐽 = x(0)𝑇 (𝑡𝑓 ) 𝑆 (x𝑖 , 𝑡𝑓 ) x(0) (𝑡𝑓 )
a stationary point of the augmented performance index 2
1 𝑡𝑓 (0)𝑇
𝐽 = 𝜑 (x (𝑡𝑓 ) , 𝑡𝑓 ) + ^ 𝑇 𝜓 (x (𝑡𝑓 ) , u (𝑡𝑓 ) , 𝑡𝑓 ) + ∫ [x 𝑄 (x𝑖 , 𝑡) x(0) + u(0)𝑇 𝑅 (x𝑖 , 0, 𝑡) u(0) ] d𝑡.
2 𝑡𝑖
𝑡𝑓 (4) (10)
+ ∫ [𝐿 (x, u, 𝑡) + 𝜆𝑇 (f (x, u, 𝑡) − x)]
̇ d𝑡,
𝑡𝑖 Problem 0 is a standard time-varying linear quadratic reg-
where 𝜆 is the vector of costate and ^ is the multiplier of the ulator (TVLQR), as the arguments of 𝐴, 𝐵, 𝑆, 𝑄, and 𝑅 are
boundary condition. The necessary conditions for optimality, all given except for the time. This problem is solved to yield
also referred to as Euler-Lagrange equations, are x(0) (𝑡) and u(0) (𝑡), 𝑡 ∈ [𝑡𝑖 , 𝑡𝑓 ], where the superscript denotes
the problem that the solution refers to.
𝜕𝐻 𝜕𝐻 𝜕𝐻 At a generic, subsequent iteration, problem 𝑘 has to be
ẋ = , 𝜆̇ = − , = 0, (5)
solved. This is defined as follows:
𝜕𝜆 𝜕x 𝜕u
where 𝐻, the Hamiltonian, is ẋ (𝑘) = 𝐴 (x(𝑘−1) (𝑡) , 𝑡) x(𝑘) + 𝐵 (x(𝑘−1) (𝑡) , u(𝑘−1) (𝑡) , 𝑡) u(𝑘) ,
𝐻 (x, 𝜆, u, 𝑡) = 𝐿 (x, u, 𝑡) + 𝜆𝑇 f (x, u, 𝑡) . (6) 1

𝐽 = x(𝑘)𝑇 (𝑡𝑓 ) 𝑆 (x(𝑘−1) (𝑡𝑓 ) , 𝑡𝑓 ) x(𝑘) (𝑡𝑓 )
2
The differential-algebraic system (5) must be solved together
with the final boundary conditions (3) and the transversality 1 𝑡𝑓 (𝑘)𝑇
+ ∫ [x 𝑄 (x(𝑘−1) (𝑡) , 𝑡) x(𝑘)
conditions 2 𝑡𝑖
𝜕𝜑 𝜕𝜓 𝑇 + u(𝑘)𝑇 𝑅 (x(𝑘−1) (𝑡) , u(𝑘−1) (𝑡) , 𝑡) u(𝑘) ] d𝑡.
𝜆 (𝑡𝑓 ) = [ + ( ) ^] , (7)
𝜕x 𝜕x 𝑡=𝑡 (11)
𝑓
which define a differential-algebraic parametric two-point Problem 𝑘 is again a TVLQR; note that x(𝑘−1) and u(𝑘−1) are
boundary value problem whose solution supplies ^ and the the solutions of problem 𝑘 − 1 achieved at previous iteration.
functions x(𝑡), 𝜆(𝑡), u(𝑡), 𝑡 ∈ [𝑡𝑖 , 𝑡𝑓 ]. Solving problem 𝑘 yields x(𝑘) (𝑡) and u(𝑘) (𝑡), 𝑡 ∈ [𝑡𝑖 , 𝑡𝑓 ].
Iterations continue until a certain convergence criterion
3. The Approximating Sequence of is satisfied. In the present implementation of the algorithm,
Riccati Equations the convergence is reached when
󵄩󵄩 (𝑘) 󵄩
Let the controlled dynamics (1) be rewritten in the form 󵄩󵄩x − x(𝑘−1) 󵄩󵄩󵄩
󵄩 󵄩∞
󵄨 󵄨 (12)
ẋ = 𝐴 (x, 𝑡) x + 𝐵 (x, u, 𝑡) u, (8) = max {󵄨󵄨󵄨󵄨𝑥𝑗(𝑘) (𝑡) − 𝑥𝑗(𝑘−1) (𝑡)󵄨󵄨󵄨󵄨 , 𝑗 = 1, . . . , 𝑛} ≤ 𝜀,
𝑡∈[𝑡𝑖 ,𝑡𝑓 ]
and let the objective function (2) be rearranged as
where 𝜀 is a prescribed tolerance. That is, iterations terminate
1 when the difference between each component of the state,
𝐽 = x𝑇 (𝑡𝑓 ) 𝑆 (x (𝑡𝑓 ) , 𝑡𝑓 ) x (𝑡𝑓 )
2 evaluated for all times, changes by less than 𝜀 between two
(9) consecutive iterations.
1 𝑡𝑓
+ ∫ [x𝑇 𝑄 (x, 𝑡) x + u𝑇 𝑅 (x, u, 𝑡) u] d𝑡,
2 𝑡𝑖
4. Solution of the Time-Varying
where the operators 𝐴, 𝐵, 𝑆, 𝑄, and 𝑅 have appropriate dimen- Linear Quadratic Regulator by
sions. The nonlinear dynamics (8) and the performance the State Transition Matrix
index (9) define an optimal control problem. The initial
state, x𝑖 , is assumed to be given, while the final condition With the approach sketched in Section 3, a fully nonlinear
(𝜓 in (3)) can assume three different forms (see Section 4). optimal control problem is reduced to a sequence of time-
The problem is formulated as an approximating sequence varying linear quadratic regulators. These can be solved
ISRN Aerospace Engineering 3
a number of times to achieve an approximate solution of the with the initial conditions
original, nonlinear problem. This is done by exploiting the
structure of the problem as well as its state transition matrix. 𝜙𝑥𝑥 (𝑡𝑖 , 𝑡𝑖 ) = 𝜙𝜆𝜆 (𝑡𝑖 , 𝑡𝑖 ) = 𝐼𝑛×𝑛 ,
This scheme differs from that implemented in [5, 6], and, in (25)
part, is described in [1]. 𝜙𝑥𝜆 (𝑡𝑖 , 𝑡𝑖 ) = 𝜙𝜆𝑥 (𝑡𝑖 , 𝑡𝑖 ) = 0𝑛×𝑛 .
Suppose that the following dynamics are given:
If both x𝑖 and 𝜆𝑖 were given, it would be possible to
ẋ = 𝐴 (𝑡) x + 𝐵 (𝑡) u, (13) compute x(𝑡) and 𝜆(𝑡) through (22)-(23), and therefore the
optimal control function u(𝑡) with (19). The initial condi-
together with the quadratic objective function tion is assumed to be given, whereas the computation of
𝜆𝑖 depends on the final condition, which, in the present
1
𝐽 = x𝑇 (𝑡𝑓 ) 𝑆 (𝑡𝑓 ) x (𝑡𝑓 ) algorithm, can be defined in three different ways.
2
(14)
1 𝑡𝑓 4.1. Hard Constrained Problem. In a hard constrained prob-
+ ∫ [x𝑇 𝑄 (𝑡) x + u𝑇 𝑅 (𝑡) u] d𝑡,
2 𝑡𝑖 lem (HCP), the value of the final state is fully specified,
x(𝑡𝑓 ) = x𝑓 , and therefore, (14) does not account for 𝑆. The
where 𝑄, 𝑅, and 𝑆 are positive semidefinite and positive value of 𝜆𝑖 can be found by writing (22) at final time
definite time-varying matrices with appropriate dimensions,
respectively. The Hamiltonian of this problem is
x𝑓 = 𝜙𝑥𝑥 (𝑡𝑖 , 𝑡𝑓 ) x𝑖 + 𝜙𝑥𝜆 (𝑡𝑖 , 𝑡𝑓 ) 𝜆𝑖 (26)
1 𝑇
𝐻= [x 𝑄 (𝑡) x + u𝑇 𝑅 (𝑡) u] + 𝜆𝑇 [𝐴 (𝑡) x + 𝐵 (𝑡) u] , (15) and by solving for 𝜆𝑖 ; that is,
2
and the optimality conditions (5) read −1
𝜆𝑖 (x𝑖 , x𝑓 , 𝑡𝑖 , 𝑡𝑓 ) = 𝜙𝑥𝜆 (𝑡𝑖 , 𝑡𝑓 ) [x𝑓 − 𝜙𝑥𝑥 (𝑡𝑖 , 𝑡𝑓 ) x𝑖 ] . (27)
ẋ = 𝐴 (𝑡) x + 𝐵 (𝑡) u, (16)
4.2. Soft Constrained Problem. In a soft constrained problem
𝜆̇ = −𝑄 (𝑡) x − 𝐴𝑇 (𝑡) 𝜆, (17) (SCP), the final state is not specified, and thus 𝑆 in (14) is an
𝑛 × 𝑛 positive definite matrix. The transversality condition (7)
0 = 𝑅 (𝑡) u + 𝐵𝑇 (𝑡) 𝜆. (18) sets a relation between the state and costate at final time
From (18), it is possible to get
𝜆 (𝑡𝑓 ) = 𝑆 (𝑡𝑓 ) x (𝑡𝑓 ) , (28)
−1 𝑇
u = −𝑅 (𝑡) 𝐵 (𝑡) 𝜆, (19)
which can be used to find 𝜆𝑖 . This is done by writing (22)-(23)
which can be substituted into (16)-(17) to yield at final time and using (28)
ẋ = 𝐴 (𝑡) x − 𝐵 (𝑡) 𝑅−1 (𝑡) 𝐵𝑇 (𝑡) 𝜆, x (𝑡𝑓 ) = 𝜙𝑥𝑥 (𝑡𝑖 , 𝑡𝑓 ) x𝑖 + 𝜙𝑥𝜆 (𝑡𝑖 , 𝑡𝑓 ) 𝜆𝑖 ,
(20) (29)
𝜆̇ = −𝑄 (𝑡) x − 𝐴 (𝑡) 𝜆.𝑇
𝑆 (𝑡𝑓 ) x (𝑡𝑓 ) = 𝜙𝜆𝑥 (𝑡𝑖 , 𝑡𝑓 ) x𝑖 + 𝜙𝜆𝜆 (𝑡𝑖 , 𝑡𝑓 ) 𝜆𝑖 .
In a compact form, (20) can be arranged as
Equations (29) represent a linear algebraic system of 2𝑛
ẋ −1
𝐴 (𝑡) −𝐵 (𝑡) 𝑅 (𝑡) 𝐵 (𝑡) x 𝑇 equations in the 2𝑛 unknowns {x(𝑡𝑓 ), 𝜆𝑖 }. The system can be
[ ̇] = [ ][ ]. (21) solved by substitution to yield
𝜆 −𝑄 (𝑡) −𝐴𝑇 (𝑡) 𝜆
−1
Since (21) is a system of linear differential equations, the exact 𝜆𝑖 (x𝑖 , 𝑡𝑖 , 𝑡𝑓 ) = [𝜙𝜆𝜆 (𝑡𝑖 , 𝑡𝑓 ) − 𝑆 (𝑡𝑓 ) 𝜙𝑥𝜆 (𝑡𝑖 , 𝑡𝑓 )]
solution can be written as (30)
× [𝑆 (𝑡𝑓 ) 𝜙𝑥𝑥 (𝑡𝑖 , 𝑡𝑓 ) − 𝜙𝜆𝑥 (𝑡𝑖 , 𝑡𝑓 )] x𝑖 .
x (𝑡) = 𝜙𝑥𝑥 (𝑡𝑖 , 𝑡) x𝑖 + 𝜙𝑥𝜆 (𝑡𝑖 , 𝑡) 𝜆𝑖 , (22)
𝜆 (𝑡) = 𝜙𝜆𝑥 (𝑡𝑖 , 𝑡) x𝑖 + 𝜙𝜆𝜆 (𝑡𝑖 , 𝑡) 𝜆𝑖 , (23) 4.3. Mixed Constrained Problem. In a mixed constrained
problem (MCP), some components of the final state are
where the functions 𝜙𝑥𝑥 , 𝜙𝑥𝜆 , 𝜙𝜆𝑥 , and 𝜙𝜆𝜆 are the compo- specified and some are not. Without any loss of generality,
nents of the state transition matrix, which can be found by let the state be decomposed as x = (y, z), where y are
integrating the following dynamics: the 𝑝 known components at final time, y(𝑡𝑓 ) = y𝑓 , and z
are remaining 𝑛 − 𝑝 elements. The costate is decomposed
̇
𝜙 ̇ 𝜙𝑥𝜆 𝐴 (𝑡) −𝐵 (𝑡) 𝑅−1 (𝑡) 𝐵𝑇 (𝑡) 𝜙𝑥𝑥 𝜙𝑥𝜆 accordingly as 𝜆 = (𝜉, 𝜂). With this formalism, 𝑆 in (14) is
[ 𝑥𝑥
̇ 𝜙𝜆𝜆̇ ] = [ ][ ],
𝜙𝜆𝑥 −𝑄 (𝑡) −𝐴𝑇 (𝑡) 𝜙𝜆𝑥 𝜙𝜆𝜆 (𝑛 − 𝑝) × (𝑛 − 𝑝), and it is pre- and postmultiplied by z(𝑡𝑓 ).
(24) The transversality condition (7) is 𝜂(𝑡𝑓 ) = 𝑆(𝑡𝑓 )z(𝑡𝑓 ).
The MCP is solved by partitioning the state transition the orbital radius, the time unit is such that the orbital period
matrix in a suitable form such that, at final time, (22)-(23) is 2𝜋, and the gravitational parameter is equal to 1. In these
read dynamics, the state, x = (𝑥1 , 𝑥2 , 𝑥3 , 𝑥4 ), represents the radial,
tangential displacements (𝑥1 , 𝑥2 ) and the radial, tangential
y (𝑡𝑓 ) 𝜙 𝜙 y 𝜙 𝜙 𝜉 velocity deviations (𝑥3 , 𝑥4 ), respectively. The control, u =
[ ] = [ 𝑦𝑦 𝑦𝑧 ] [ 𝑖 ] + [ 𝑦𝜉 𝑦𝜂 ] [ 𝑖 ] , (31)
z (𝑡𝑓 ) 𝜙𝑧𝑦 𝜙𝑧𝑧 z𝑖 𝜙𝑧𝜉 𝜙𝑧𝜂 𝜂𝑖 (𝑢1 , 𝑢2 ), is made up by the radial and tangential accelerations,
respectively.
𝜉 (𝑡 ) 𝜙 𝜙 y 𝜙 𝜙 𝜉 The equations of motion are
[ 𝑓 ] = [ 𝜉𝑦 𝜉𝑧 ] [ 𝑖 ] + [ 𝜉𝜉 𝜉𝜂 ] [ 𝑖 ] , (32)
𝜂 (𝑡𝑓 ) 𝜙𝜂𝑦 𝜙𝜂𝑧 z𝑖 𝜙𝜂𝜉 𝜙𝜂𝜂 𝜂𝑖
𝑥1̇ = 𝑥3 ,
where the dependence of the state transition matrix compo-
nents on 𝑡𝑖 , 𝑡𝑓 is omitted for brevity. From the first row of (31), 𝑥2̇ = 𝑥4 ,
it is possible to get 1 (37)
𝑥3̇ = 2𝑥4 − (1 + 𝑥1 ) ( − 1) + 𝑢1 ,
−1 −1 𝑟3
𝜉𝑖 = 𝜙𝑦𝜉 [y𝑓 − 𝜙𝑦𝑦 y𝑖 − 𝜙𝑦𝑧 z𝑖 ] − 𝜙𝑦𝜉 𝜙𝑦𝜂 𝜂𝑖 , (33)
1
𝑥4̇ = −2𝑥3 − 𝑥2 ( − 1) + 𝑢2 ,
which can be substituted in the second row of (31) to yield 𝑟3
−1 −1
z (𝑡𝑓 ) = [𝜙𝑧𝑦 − 𝜙𝑧𝜉 𝜙𝑦𝜉 𝜙𝑦𝑦 ] y𝑖 + [𝜙𝑧𝑧 − 𝜙𝑧𝜉 𝜙𝑦𝜉 𝜙𝑦𝑧 ] z𝑖 with 𝑟 = √(𝑥1 + 1)2 + 𝑥22 . The initial condition is x𝑖 =
(34) (0.2, 0.2, 0.1, 0.1). Two different problems are solved to test
−1 −1
+ 𝜙𝑧𝜉 𝜙𝑦𝜉 y𝑓 + [𝜙𝑧𝜂 − 𝜙𝑧𝜉 𝜙𝑦𝜉 𝜙𝑦𝜂 ] 𝜂𝑖 . the algorithm in both hard and soft constrained conditions.
Equations (33)-(34), together with the transversality condi- Hard Constrained Rendezvous. The HCP consists in
tion 𝜂(𝑡𝑓 ) = 𝑆(𝑡𝑓 )z(𝑡𝑓 ), can be substituted in the second row minimizing
of (32) to compute the component of the initial costate
1 𝑡𝑓 𝑇
−1 𝐽= ∫ u u d𝑡 (38)
𝜂𝑖 (x𝑖 , y𝑓 , 𝑡𝑖 , 𝑡𝑓 ) = [𝜙̃𝜂𝜂 ] w (x𝑖 , y𝑓 , 𝑡𝑖 , 𝑡𝑓 ) , (35) 2 𝑡𝑖
where with the final, given condition x𝑓 = (0, 0, 0, 0) and 𝑡𝑖 = 0,

𝑡𝑓 = 1.
𝜙̃𝜂𝜂 = 𝜙𝜂𝜂 − 𝜙𝜂𝜉 𝜙𝑦𝜉
−1 −1
𝜙𝑦𝜂 − 𝑆 (𝜙𝑧𝜂 − 𝜙𝑧𝜉 𝜙𝑦𝜉 𝜙𝑦𝜂 ) ,
Soft Constrained Rendezvous. The SCP considers the follow-
w (x𝑖 , y𝑓 , 𝑡𝑖 , 𝑡𝑓 ) ing objective function:
−1
= [𝑆 (𝜙𝑧𝑦 − 𝜙𝑧𝜉 𝜙𝑦𝜉 −1
𝜙𝑦𝑦 ) − 𝜙𝜂𝑦 + 𝜙𝜂𝜉 𝜙𝑦𝜉 𝜙𝑦𝑦 ] y𝑖 1 1 𝑡𝑓
(36) 𝐽 = x𝑇 (𝑡𝑓 ) 𝑆x (𝑡𝑓 ) + ∫ u𝑇 u d𝑡, (39)
2 2 𝑡𝑖
−1 −1
+ [𝑆 (𝜙𝑧𝑧 − 𝜙𝑧𝜉 𝜙𝑦𝜉 𝜙𝑦𝑧 ) + 𝜙𝜂𝑧 + 𝜙𝜂𝜉 𝜙𝑦𝜉 𝜙𝑦𝑧 ] z𝑖
with 𝑆 = diag(25, 15, 10, 10), 𝑡𝑖 = 0 and 𝑡𝑓 = 1 (x𝑓 is free).
−1 −1
+ [𝑆 (𝜙𝑧𝜉 𝜙𝑦𝜉 ) − 𝜙𝜂𝜉 𝜙𝑦𝜉 ] y𝑓 . The differential equations (37) are factorized into the form
of (8) as
Once 𝜂𝑖 is know, the remaining part of the initial costate,
𝜉𝑖 , is computed through (33), and therefore, the full initial 0 0 1 0
costate is obtained as a function of the initial condition, given 𝑥1̇ [ 0 0 0 1]
[ ]
final condition, initial and final time; that is, 𝜆𝑖 (x𝑖 , y𝑓 , 𝑡𝑖 , 𝑡𝑓 ) = [𝑥2̇ ] [ ]
[ ]=[ ]
(𝜉𝑖 (x𝑖 , y𝑓 , 𝑡𝑖 , 𝑡𝑓 ), 𝜂𝑖 (x𝑖 , y𝑓 , 𝑡𝑖 , 𝑡𝑓 )). [𝑥3̇ ] [ 1 ]
[𝑓 (𝑥1 , 𝑥2 ) (1 + ) 0 0 2]
[ 𝑥1 ]
[𝑥4̇ ]
5. Numerical Examples [ 0 𝑓 (𝑥1 , 𝑥2 ) −2 0]
⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟ (40)
𝐴(x)
Two simple problems with nonlinear dynamics are consid-
ered to apply the developed algorithm. These correspond to 𝑥1 0 0
the controlled relative spacecraft motion and to the controlled [𝑥2 ] [0 0] 𝑢1
×[ ] [ ]
[𝑥3 ] + [1 0] [𝑢2 ] ,
two-body dynamics for low-thrust transfers.
[𝑥4 ] ⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟⏟
[0 1]
5.1. Low-Thrust Rendezvous. This problem is taken from 𝐵
the literature where a solution is available, for comparison’s
sake [10, 11]. Consider the planar, relative motion of two with 𝑓(𝑥1 , 𝑥2 ) = −1/[(𝑥1 + 1)2 + 𝑥22 ]3/2 + 1. Thus, the problem
particles in a central gravity field expressed in a rotating is put into the pseudo-LQR form (8)-(9) by defining 𝐴(x) and
frame with normalized units: the length unit is equal to 𝐵 as in (40) and by setting 𝑄 = 04×4 and 𝑅 = 𝐼2×2 .
0.3 0.4
0.2 0.2
0.1 0
x2
x4
0 −0.2
−0.1 −0.4
0 0.05 0.1 0.15 0.2 0.25 −0.4 −0.2 0 0.2 0.4
x1 x3
(a) 𝑥1 versus 𝑥2 (b) 𝑥3 versus 𝑥4

0.5
−0.5
−1
u2
−1.5
−2
−2.5
−2 −1 0 1 2 3
u1
(c) 𝑢1 versus 𝑢2
Figure 1: Hard constrained rendezvous.
The two problems have been solved with the developed u = (𝑢1 , 𝑢2 ), corresponds to the radial and transversal
method. Table 1 reports the details of the HCP and SCP, accelerations, respectively [12, 13]. The equations of motion
whose solutions are shown in Figures 1 and 2, respectively. are
In Table 1, 𝐽 is the objective function at the final iteration,
“Iter” is the number of iterations, and the “CPU time” is the 𝑥1̇ = 𝑥3 ,
computational time (this refers to an Intel Core 2 Duo 2 GHz 𝑥2̇ = 𝑥4 ,
with 4 GB RAM running Mac OS X 10.6). The termination
tolerance 𝜀 in (12) is 10−9 . The optimal solutions found 1
𝑥3̇ = 𝑥1 𝑥42 − + 𝑢1 , (41)
replicate those already known in the literature [10, 11], 𝑥12
indicating the effectiveness of the developed method.
2𝑥3 𝑥4 𝑢2
𝑥4̇ = − + ,
𝑥1 𝑥1
5.2. Low-Thrust Orbital Transfer. In this problem, the con-
trolled, planar Keplerian motion of a spacecraft in polar and the objective function is
coordinates is studied. The dynamics are written in scaled
coordinates, where the length unit corresponds to the radius 1 𝑡𝑓 𝑇
𝐽= ∫ u u d𝑡, (42)
of the initial orbit, the time unit is such that its period is 2 𝑡𝑖
2𝜋, and the gravitational parameter is 1. The state, x =
(𝑥1 , 𝑥2 , 𝑥3 , 𝑥4 ), is made up by the radial distance from with 𝑡𝑖 = 0 and 𝑡𝑓 = 𝜋. The initial state corresponds to the
the attractor (𝑥1 ), the phase angle (𝑥2 ), the radial velocity conditions at the initial orbit; that is, x𝑖 = (1, 0, 0, 1). Two
(𝑥3 ), and the transversal velocity (𝑥4 ), whereas the control, different HCPs are solved, which correspond to the final states
0.25 0.2
0.1
0.2
0
0.15
x4
x2
−0.1
0.1
−0.2
0.05 −0.3
0.05 0.1 0.15 0.2 0.25 −0.3 −0.2 −0.1 0 0.1 0.2
x1 x3
(a) 𝑥1 versus 𝑥2 (b) 𝑥3 versus 𝑥4

0.5
−0.5
u2
−1
−1.5
−2
−1.5 −1 −0.5 0 0.5 1
u1
(c) 𝑢1 versus 𝑢2
Figure 2: Soft constrained rendezvous.
x𝑓 = (1.52, 𝜋, 0, 1.52−3/2 ) and x𝑓 = (1.52, 1.5𝜋, 0, 1.52−3/2 ), Table 1: Rendezvous solutions details.
respectively. This setup mimics an Earth-Mars low-thrust Problem 𝐽 Iter CPU time (s)
transfer. The dynamics (41) and the objective function (42)
HCP 0.9586 5 0.375
are put in the form (8)-(9) by defining 𝑄 = 04×4 , 𝑅 = 𝐼2×2 ,
and SCP 0.5660 6 0.426
Table 2: Earth-Mars transfer details.

0 0 1 0
[ ] Problem 𝐽 Iter CPU time (s)
[ 0 0 0 1 ] 𝑥2,𝑓 = 𝜋 0.5298 22 5.425
[ ]
[ 1 ] 𝑥2,𝑓 = 1.5𝜋
𝐴 (x) = [ ]
[𝑥 − 3 0 0 𝑥1 𝑥4 ] ,
4.8665 123 41.831
[ 𝑥 ]
[ 1 ]
[ 2𝑥4 ]
0 0 − 0
[ 𝑥 1 ]
The two HCPs have been solved with the developed
0 0 method. The solutions’ details are reported in Table 2, whose
[0 0] columns have the same meaning as in Table 1. It can be seen
[ ]
𝐵 (x) = [1 0 ]. (43) that more iterations and an increased computational burden
[ 1] are required to solve this problem. The solution with 𝑥2,𝑓 =
0
[ 𝑥1 ] 1.5𝜋 is reported in Figure 3.
90 3
120 60
2
150 30
1
Control
0
180 0
−1
1
−2
210 330
2
−3
240 0 1 2 3 4
300
Time
270
u1
u2
u
(a) Transfer trajectory (b) Control profile
Figure 3: Orbital transfer with 𝑥2,𝑓 = 1.5𝜋.
6. Conclusion [6] T. Çimen and S. P. Banks, “Nonlinear optimal tracking control

with application to super-tankers for autopilot design,” Auto-
In this paper, an approximated method to solve nonlinear matica, vol. 40, no. 11, pp. 1845–1863, 2004.
optimal control problems has been presented, with applica- [7] C. P. Mracek and J. R. Cloutier, “Control designs for the
tions to sample cases in astrodynamics. With this method, nonlinear benchmark problem via the state-dependent Riccati
the nonlinear dynamics and objective function are factor- equation method,” International Journal of Robust and Nonlin-
ized in a pseudolinear and quadratic-like forms, which are ear Control, vol. 8, no. 4-5, pp. 401–433, 1998.
similar to those used in the state-dependent Riccati equation [8] J. D. Pearson, “Approximation methods in optimal control,”
Journal of Electronics and Control, vol. 13, pp. 453–469, 1962.
approach. Once in this form, a number of time-varying linear
[9] A. Wernli and G. Cook, “Suboptimal control for the nonlinear
quadratic regulator problems are solved. A state transition quadratic regulator problem,” Automatica, vol. 11, no. 1, pp. 75–
matrix approach is used to deal with the time-varying linear 84, 1975.
quadratic regulators. The results show the effectiveness of [10] C. Park, V. Guibout, and D. J. Scheeres, “Solving optimal con-
the method, which can be used to either have suboptimal tinuous thrust rendezvous problems with generating functions,”
solutions or to provide initial solutions to more accurate Journal of Guidance, Control, and Dynamics, vol. 29, no. 2, pp.
optimizers. 321–331, 2006.
[11] C. Park and D. J. Scheeres, “Determination of optimal feedback
terminal controllers for general boundary conditions using
References generating functions,” Automatica, vol. 42, no. 5, pp. 869–875,
2006.
[1] A. E. Bryson and Y. C. Ho, Applied Optimal Control, John Wiley [12] A. Owis, F. Topputo, and F. Bernelli-Zazzera, “Radially acceler-
& Sons, New York, NY, USA, 1975. ated optimal feedback orbits in central gravity field with linear
[2] L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze, and E. drag,” Celestial Mechanics and Dynamical Astronomy, vol. 103,
F. Mishchenko, The Mathematical Theory of Optimal Processes, no. 1, pp. 1–16, 2009.
John Wiley & Sons, New York, NY, USA, 1962. [13] F. Topputo, A. H. Owis, and F. Bernelli-Zazzera, “Analytical
[3] J. T. Betts, Practical Methods for Optimal Control and Estimation solution of optimal feedback control for radially accelerated
Using Nonlinear Programming, SIAM, Philadelphia, Pa, USA, orbits,” Journal of Guidance, Control, and Dynamics, vol. 31, no.
2010. 5, pp. 1352–1359, 2008.
[4] B. Conway, “Spacecraft trajecory optimization using direct
transcription and nonlinear programming,” in Spacecraft Tra-
jectory Optimization, pp. 37–78, Cambridge University Press,
Cambridge, UK, 2010.
[5] T. Çimen and S. P. Banks, “Global optimal feedback control
for general nonlinear systems with nonquadratic performance
criteria,” Systems and Control Letters, vol. 53, no. 5, pp. 327–346,
2004.
International Journal of
Rotating
Machinery
The Scientific
Engineering Distributed
Journal of
Journal of

World Journal
Hindawi Publishing Corporation Hindawi Publishing Corporation
Sensors
Sensor Networks
http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014
Journal of
Control Science
and Engineering
Advances in
Civil Engineering
Hindawi Publishing Corporation Hindawi Publishing Corporation
http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014
Submit your manuscripts at

http://www.hindawi.com
Journal of
Journal of Electrical and Computer
Robotics
Engineering
http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014
VLSI Design
Advances in
OptoElectronics
Modelling &
Simulation
Aerospace
Hindawi Publishing Corporation Volume 2014
Navigation and
Observation
http://www.hindawi.com Volume 2014
in Engineering
Engineering
http://www.hindawi.com
International Journal of Antennas and Active and Passive Advances in
Chemical Engineering Propagation Electronic Components Shock and Vibration Acoustics and Vibration
Hindawi Publishing Corporation Hindawi Publishing Corporation Hindawi Publishing Corporation Hindawi Publishing Corporation Hindawi Publishing Corporation
http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014

Approximate Solutions To Nonlinear Optimal Control Problems in Astrodynamics PDF

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Approximate Solutions To Nonlinear Optimal Control Problems in Astrodynamics PDF

Uploaded by

Copyright:

Available Formats

Hindawi Publishing Corporation

ISRN Aerospace Engineering

Francesco Topputo and Franco Bernelli-Zazzera

Correspondence should be addressed to Francesco Topputo; francesco.topputo@polimi.it

Received 25 June 2013; Accepted 27 August 2013

Academic Editors: H. Baoyin, C. Bigelow, and D. Yu

1. Introduction evaluated by using the solutions at the previous iteration, and

x (𝑡𝑖 ) = x𝑖 , 𝜓 (x (𝑡𝑓 ) , 𝑡𝑓 ) = 0. (3) ẋ (0) = 𝐴 (x𝑖 , 𝑡) x(0) + 𝐵 (x𝑖 , 0, 𝑡) u(0) ,

𝐻 (x, 𝜆, u, 𝑡) = 𝐿 (x, u, 𝑡) + 𝜆𝑇 f (x, u, 𝑡) . (6) 1

where with the final, given condition x𝑓 = (0, 0, 0, 0) and 𝑡𝑖 = 0,

(a) 𝑥1 versus 𝑥2 (b) 𝑥3 versus 𝑥4

Figure 1: Hard constrained rendezvous.

(a) 𝑥1 versus 𝑥2 (b) 𝑥3 versus 𝑥4

Figure 2: Soft constrained rendezvous.

Table 2: Earth-Mars transfer details.

Figure 3: Orbital transfer with 𝑥2,𝑓 = 1.5𝜋.

6. Conclusion [6] T. Çimen and S. P. Banks, “Nonlinear optimal tracking control

Hindawi Publishing Corporation

Submit your manuscripts at

You might also like