Professional Documents
Culture Documents
Trajectory Generation
Trajectory Generation
Trajectory Generation
Abstract— A method based on the generating function for values separately when we use those methods. Therefore,
the finite time optimal control problems was proposed recently. calculating optimal trajectories for a set of different boundary
A single generating function can generate a family of optimal conditions imposes a heavy computational burden particular-
inputs which are functions of the state for different boundary
conditions. Therefore, a family of optimal trajectories for ly for an online optimal trajectory generation problem.
different boundary conditions can be obtained by numerical For a nonlinear finite time optimal control problem, the so-
integration along the system dynamic equation. This paper lution of the HJBE is given as a series by solving a sequence
proposes a method to compute a family of optimal trajectories of ordinary differential equations (ODEs) when the terminal
for a nonlinear optimal control problem on a finite time value of the state is zero [8]. Recently, a method to generate
interval by using a pair of generating functions. The proposed
method reduces the online computational effort in calculating optimal trajectories based on the generating functions which
the numerical integration required in the method using a single are the solutions of HJEs is proposed for nonlinear finite time
generating function. It is useful to on-line nonlinear trajectory optimal control problems with the fixed boundary condition
generation problems such as model predictive control. A nu- [9], [10]. The generating function for the HJE can be found
merical example illustrates the effectiveness of the proposed by taking the infimum of the optimal cost function, and it
method.
can be effectively solved by the Galerkin spectral method
I. INTRODUCTION with Chebyshev polynomials [11]. A recursive algorithm
The optimal control problem is a classical mathematical based on the result of [9] is proposed to solve HJEs for
problem. Since it is widely used in engineering, a lot of the generating functions for a nonlinear system [12]. Once a
important results have been obtained in the past decades. generating function is found, it gives a family of optimal
The Hamilton-Jacobi equation (HJE) plays a fundamental inputs for different boundary conditions by the canonical
role in the analysis and control of nonlinear systems. For an transformation. Then a family of the optimal trajectories
infinite time nonlinear optimal control problem, the solution can be obtained by numerical integration along the system
to the Hamilton-Jacobi-Bellman equation (HJBE) can be dynamic equation, whereas the conventional method for an
approximated by the Taylor series in a neighborhood of the optimal control problem can obtain only one trajectory by
origin [1]. The Taylor series converges to the true solution solving the HJE for a value function one time.
under suitable conditions [2]. An approximate stabilizing The authors proposed the double generating functions
solution of the HJE can be obtained by using symplectic method for a linear finite time optimal control problem
geometry and the Hamiltonian perturbation technique for [13], which uses a pair of generating functions to com-
feedback optimal control problems [3]. pute the optimal trajectory. In this paper, this method is
A finite time optimal control problem reduces to a two- extended to the nonlinear case. As in the linear case, it
point boundary-value problem (TPBVP) for ordinary dif- can calculate a family of optimal trajectories for different
ferential equations (ODEs) with respect to a Hamiltonian boundary conditions. It gives the optimal trajectories and
system [4]. There are a lot of methods, e.g. the Riccati inputs as functions of the boundary values of the state and
method [5], the shooting method [6] and its extension [7], the initial and terminal time. In addition, the generating
to solve it. The basic principle of the shooting method is function can be calculated off-line. Therefore, the optimal
computing the trajectories repeatedly so that the exact one trajectories and inputs can be obtained directly without any
satisfying the boundary values is obtained. Therefore, if numerical integration along the system equation required in
we change the desired boundary values, then we need to the single generating function case. Therefore, the proposed
solve the TPBVP again using the shooting method. Similarly method reduces the online computational effort for different
to the shooting method, we have to solve the nonlinear boundary conditions and different time intervals compared
optimal control problems for a set of initial and terminal state to the conventional methods as well as the single generating
function one.
1 Z. Hao is with Department of Mechanical Science and Engineering,
This paper is organized as follows. Section 2 introduces
Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Aichi 464-8603, Japan
hao@haya.nuem.nagoya-u.ac.jp
the preliminaries of nonlinear optimal control problems and
2 K. Fujimoto is with the Department of Aerospace Engineering, Ky- the generating function method. Section 3 discusses how
oto University, Yoshida-Honmachi, Sakyo-ku, Kyoto 606-8501, Japan to generate optimal trajectories for nonlinear optimal con-
k.fujimoto@ieee.org trol problems for different boundary conditions by double
3 Y. Hayakawa is with the Department of Mechanical Science and
Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Aichi 464- generating functions. This section is the main part of this
8603, Japan hayakawa@nuem.nagoya-u.ac.jp paper. A numerical example illustrates the usefulness and
978-1-4799-0178-4/$31.00 ©2013 AACC 6382
effectiveness of the proposed method in Section 4. Section S2 f (x, λ0 ,t) and S3 f (λ , x0 ,t) for the following canonical
5 concludes the paper. transformations:
6385
NO+ (C, A) := NO(C, A) ∩ L + (A). (48) Here, Z3 f (t), Y3 f (t), Z3b (t), Y3b (t), Z2 f (t), Y2 f (t), Z2b (t), and
Y2b (t) are the solutions to (19), (20), (21), (22), (37), (38),
Lemma 1 [13] For differential matrix Riccati equations
(39), and (40) respectively, λ0 is the solution to
(19) and (37), if both (AT , Q) and (−A, BR) are stabilizable
∂ S3b (λ , xt f ,t)
and ∂ S2 f (x, λ0 ,t)
x0 = or x = − , (56)
NO+ (RBT , AT ) = {0}, NO+ (Q, −A) = {0} (49) ∂ λ0
t=t0 ∂λ
t=t f
u
x
x0=−0.1,xtf=0.1 4
in Subsection B of Section 3. −0.1
x0=−0.2,xtf=0.2
−0.2
2
x =−0.4,x =0.4
0 tf
Fig. 4. The case using S3 f and S3b to generate optimal trajectory for
x
u
0
N=2 4
N=3 different boundary conditions and different time intervals
-0.2 N=4 2
bvp4c
-0.4 0
0 0.5 1 0 0.5 1
Time(s) Time(s)
is proposed to generate optimal trajectories for different
(a) The state (b) The input boundary conditions and different time intervals. The optimal
Fig. 1. The optimal trajectory and input generated by S3 f and S3b for trajectories and optimal inputs are given as functions of the
x0 = −0.4, xt f = 0.4 boundary values of the state and the initial and terminal time.
In addition, the generating function can be calculated off-
line. Therefore, it is easy to generate optimal trajectories
State trajectory X Input u
for different boundary conditions and different time intervals
0.4
8
N=2 directly without any numerical integration of the system
N=3
0.2 N=4 dynamic equation. Therefore, the online computational effort
6
0 for different boundary conditions greatly decreases. It is
u
X
4
-0.2
N=2
N=3
expected to be very useful to online nonlinear trajectory
N=4
bvp4c
2 generation problems such as model predictive control.
-0.4
0
0 0.5
Time(s)
1 0 0.5
Time(s)
1
R EFERENCES
(a) The state (b) The input [1] E. G. Al’brecht, “On the optimal stabilization of nonlinear systems,”
PMM-J. Appl. Math. Mech., vol. 25, pp. 1254–1266, 1961.
Fig. 2. The optimal trajectory and input generated by S2 f and S3b for [2] D. L. Lukes, “Optimal regulation of nonlinear dynamical systems,”
x0 = −0.4, xt f = 0.4 SIAM J. Control and Optimization, vol. 7, no. 1, pp. 75–100, 1969.
[3] N. Sakamoto and A. J. van der Schaft, “Analytical approximation
methods for the stabilizing solution of the Hamilton-Jacobi equation,”
IEEE Trans. Autom. Contr., vol. 53, no. 10, pp. 2335–2350, 2008.
State trajectory x Input u
[4] A. Locatelli, Optimal Control: An Introduction, 1st ed. Basel-Boston-
20 40 Berlin: Birkhauser Verlag, 2001, pp. 147–164.
N=2 N=2
10 bvp4c 30 [5] U. M. Ascher, R. M. M. Mattheij, and R. D. Russell, Numerical so-
0 20
lution of boundary value problems for ordinary differential equations.
Tokyo: Prentice-Hall, 1988, pp. 164–180.
x
-10 10
[6] H. B. Keller, Numerical Methods for Two-Point Boundary-Value
-20 0
Problems. Waltham, Massachusetts: Blaisdell, 1968, pp. 39–68.
-30 -10 [7] E. Cristiani and P. Martinon, “Initialization of the shooting method
-40 -20 via the hamilton-jacobi-bellman approach,” Journal of Optimization
0 0.5 1 0 0.5 1
Time(s) Time(s) Theory and Applications, vol. 146, no. 2, pp. 321–346, 2010.
[8] A. P. Willemstein, “Optimal regulation of nonlinear dynamical systems
(a) The state (b) The input on a finite interval,” SIAM J. Control and Optimization, vol. 15, no. 6,
pp. 1050–1069, 1977.
Fig. 3. The optimal trajectory and input generated by S2 f and S3 f for [9] V. Guibout and D. J. Scheeres, “Solving relative two point boundary
x0 = −0.4, xt f = 0.4 value problems: Spacecraft formation flight transfers application,” J.
Guid. Control. Dyn., vol. 27, no. 4, pp. 693–704, 2004.
[10] C. Park and D. J. Scheeres, “Determination of optimal feedback
To illustrate the advantage of the proposed method, we terminal controllers for general boundary conditions using generating
use the a pair of generating functions (S3 f (λ , x0 ,t) and functions,” Automatica, vol. 42, no. 5, pp. 869–875, 2006.
S3b (λ , xt f ,t)) to generate optimal trajectories for different [11] M. Bando and H. Yamakawa, “A new optimal orbit control for two-
point boundary-value problem using generating functions,” Advances
terminal state values and different time intervals as shown in the Astronautical Sciences, vol. 134, pp. 245–260, 2009.
in Fig.4. We can see that all of the generated optimal [12] Z. Hao and K. Fujimoto, “Approximate solutions to the hamilton-
trajectories achieve to the designed boundary values at the jacobi equations for generating functions with a quadratic cost function
with respect to the input,” in Proceedings of the 4rd IFAC Workshop on
designed time endpoint in Fig.4(a). This example illustrates Lagrangian and Hamiltonian Methods for Nonlinear Control, 2012,
the effectiveness and the advantage of the proposed method. pp. 194–199.
[13] Z. Hao, K. Fujimoto, and Y. Hayakawa, “Optimal trajectory generation
for linear systems based on double generating functions,” in To appear
in Proceedings of the 51st IEEE Conference on Decision and Control,
V. CONCLUSIONS 2012.
In this paper, for a nonlinear finite time optimal control [14] H. P. Geering, Optimal Control with Engineering Applications, 1st ed.
Berlin: Springer, Apr. 2007, pp. 24–35.
problem, a method using a pair of generating functions
6387