Professional Documents
Culture Documents
(Asce) As 1943-5525 0000398
(Asce) As 1943-5525 0000398
Abstract: Coning motion has been detected in flight experiments of many spinning projectiles for decades and the stability analysis of this
motion has been extensively studied recently. In this paper, a novel optimal control algorithm based on stability analyses is developed to
suppress the coning motion. A Lagrangian functional is built to compute the optimal control that minimizes the coning motion for a given
control cost. Then a linearized approach, which produces accurate enough control within the range of parameters considered, is proposed to
Downloaded from ascelibrary.org by Beihang University on 02/02/22. Copyright ASCE. For personal use only; all rights reserved.
reduce the computational cost. It is analytically presented that the optimal solution of this linearized optimization is unique at any given values
of control cost. The analytical analysis also indicates that there exists a direct solution of the control that approaches the linearized optimal
control when the control cost is small enough and that this direct solution can be obtained much more efficiently than the linearized optimal
control. The controllability problem associated with this optimal control algorithm is also discussed and the results shed lights on the choice
of numerical parameters in the optimization. These optimal control algorithms are implemented to calculate the optimal control that
minimizes the coning motion induced by initial disturbances to a wrap-around-fin missile. DOI: 10.1061/(ASCE)AS.1943-5525
.0000398. © 2014 American Society of Civil Engineers.
Author keywords: Nonlinear optimal control; Coning motion.
ω qSL2r
M z ¼ mz qSLr þ mz z ð2bÞ
ẋ ¼ fðxÞ þ bu ð1aÞ V
where q, S, and Lr = dynamic pressure, reference area, and refer-
ω ω
ence length, respectively; and my , my y , mm , mz , and mz z = lateral
moment, lateral-damping moment, Magnus moment, pitching
moment, and pitch-damping moment coefficients, respectively.
These coefficients are obtained from wind-tunnel measurements of
wrap-around-fin missiles (Miao et al. 1996), and have been used
successfully in overall designs of missiles, trajectory simulations,
and stability analyses of the coning motion (Lei and Wu 2005; Mao
et al. 2006, 2007).
In the stability analysis, the state vector x is decomposed into a
summation of a base state X and a perturbation state x 0 , that is
x ¼ X þ x0 ð3Þ
Substitute Eq. (3) into the governing Eqs. (1a)–(1d) and linear-
ize around the base state X to reach the linearized governing Eq. (4)
Fig. 1. Schematic of the coning motion; O denotes the center of gravity ẋ 0 ¼ Ax 0 þ bu ð4Þ
of the projectile; OX represents the longitudinal axis; OV denotes the
where A is a linearized operator and a function of the base state X
velocity direction; OY is the vertical axis perpendicular to OX in the
(Mao et al. 2006). The eigenvalue stability analysis of the operator
VOX plane; OZ (which is not shown for clarity) is perpendicular to
A suggests that there exists a critical value for the spinning rate,
the XOY plane, and preserves the right-hand-rule with OX and OY;
denoted as wxc . For wx < wxc, all the eigenvalues of A have nega-
θ is the nutation angle between OV and OX; and ωy and ωz are angular
tive real parts and so this system is asymptotically stable to infini-
speeds in OY and OZ directions, respectively
tesimally small perturbations while for wx > wxc, at least one
Fig. 2. Schematic of a typical wrap-around-fin missile; the attitude of the missile is controlled by cross jets
2005; Mao et al. 2006). The Mach number is set to M ¼ 4. At the wards from t ¼ T to t ¼ 0 and initialized by Eq. (8c). Eq. (8d)
combination of these parameters, the critical spinning rate is wxc ¼ is the recovery of the constraint on the control cost.
7.44 rad=s. To illustrate the control of an unstable coning motion, a In accordance with the definition of the Gateau differential dL
spinning rate in the unstable range ωx ¼ 10 rad=s is adopted. of the Lagrangian, the gradient of the Lagrangian with respect to
the control u, denoted as ∇u L, satisfies
Methodology of Optimal Control Lðu þ ϵδuÞ − LðuÞ
dLðδuÞ ¼ lim ¼ ð∇u L; δuÞ ð10Þ
ϵ→0 ϵ
Nonlinear Optimal Control of the Coning Motion
Therefore
The conning motion stemming from environmental disturbances is
initially excited by ωz since the nonzero value of θ is generated by ∇u L ¼ ½−2λuy − bx1 ; −2λuz − bx2 ; 0T ð11Þ
an integration of ωz and ωy ¼ 0 at θ ¼ 0. Therefore the basic strat-
egy is using control u generated by cross jets to suppress the coning In the optimization process, the control u is updated from step k
motion induced by the initial perturbation xð0Þ ¼ ½0; ωz ð0Þ; 0T . to step k þ 1 as
The objective function to be minimized, that is the magnitude of
the coning motion, is modeled as ukþ1 ¼ uk þ β k Pð∇u LÞk ð12Þ
L ¼ ðθ; θÞ ð5Þ where λ is chosen so that the new control satisfies the magnitude
constraint ðukþ1 ; ukþ1 Þ ¼ C; Pð∇u LÞk = search direction (a
where the scale product is defined as ða; bÞ ¼ ∫ T0 a · bdt, where Fletcher-Reeves conjugate gradient method is adopted to calculate
T = terminal time. The choice of T will be discussed in a sub- this direction); and β k = step length which satisfies the strong Wolf
sequent section. condition
The Lagrangian functional can be correspondingly defined as
Lkþ1 ≤ Lk þ c1 β k ð∇Lk ; P k Þ ð13aÞ
L ¼ L þ ðx ; ẋ − f − buÞ þ λ½C − ðu; uÞ ð6Þ
jð∇Lkþ1 ; P k Þj ≤ c2 jð∇Lk ; P k Þj ð13bÞ
where λ is a Lagrangian multiplier; x ¼ ½x1 ; x2 ; x3 T = adjoint
state; the first term on the right is the objective function; the second where c1 and c2 are constant, and satisfy 0 < c1 dt < c2 < 0.5, with
term is the constraint of the governing equations; and the last term dt denoting the discretized time step. A backtracking line search
is the constraint on the control cost, where C = prescribed control method is adopted to choose an acceptable value of β which is ini-
cost. There is a closely related optimization problem, that is to min- tially set to 1. It has been checked that the result is insensitive to the
imize the control cost with the constraint θðTÞ ¼ 0. This problem is choice of initial value of β.
not considered because even when the nutation angle is controlled Since θ is bounded (θ < 2π), L is also bounded, L ¼ ðθ; θÞ <
to reach zero at a fixed time horizon it could still diverge at a later ð2πÞ2 T. Because the adjoint equation is linear, x1 and x2 are linear
time. The coning motion is observed to diverge slowly and only one functions of x and subsequently the gradient ∇u L, as defined in
or two optimal control process is required during the flight, which Eq. (11), is also bounded. Therefore the Zoutendijk condition is
means the minimization of the coning motion is more meaningful satisfied and so the sequence of the gradient is bounded to zero
than the minimization of the control cost.
Apply integration by parts to the second term to obtain lim infð∇Lk ; ∇Lk Þ ¼ 0 ð14Þ
k→∞
L ¼ L þ x ðTÞ · xðTÞ − x ð0Þ · xð0Þ − ðẋ ; xÞ − ðx ; f þ buÞ which guarantees the convergence of the gradient (Nocedal and
þ λ½C − ðu; uÞ ð7Þ Wright 1999).
As a summary, the six-step procedure of this optimization of
At the minimizer of the Lagrangian functional, the first variation control can be outlined as follows:
of this function with respect to its variables x , x, xðTÞ, and λ are 1. Evolve a random guess of control u with magnitude ðu; uÞ ¼
zero, so C forwards from t ¼ 0 to t ¼ T in Eq. (8a), which is initialized
by the given initial state xð0Þ ¼ ½0; ωz ð0Þ; 0T , and record the
δLðδx Þ ¼ 0; ⇒ ẋ − f − bu ¼ 0 ð8aÞ full trajectory of the state xðtÞ;
T 2. Initialize the adjoint Eq. (8b) with Eq. (8c) and evolve the
∂f adjoint state variables backwards from t ¼ T to t ¼ 0;xðtÞ
δLðδxÞ ¼ 0; ⇒ − ẋ − x þ ½0; 0; 2θT ¼ 0 ð8bÞ
∂x saved in Step 1 is required in Step 2;
3. Calculate the gradient of the Lagrangian with respect to u is obtained. Setting the first variation of the Lagrangian with respect
using Eq. (11); to xðTÞ, λ, and u to zero, one obtains the initial condition of the
4. Update the control and scale it to satisfy the constraint on adjoint equations, the constraint on control cost, and the gradient of
control cost using Eq. (12); the Lagrangian functional with respect to u. They are all in the
5. Evolve the updated control u in Eq. (8a); if the Wolf condition same form as discussed in the nonlinear optimization, as given
is satisfied, turn to Step 6, and otherwise reduce the step length in Eqs. (8c), (8d), and (11), respectively.
β, and turn to Step 4; and From the adjoint Eq. (16) only θ is required and the other two
6. Repeat Steps 2–5 until the control converges. state variables (ωy and ωz ) are absent. Therefore the memory and
Fig. 3 shows a flowchart of this nonlinear optimization pro- computational cost of the forward evolution can be significantly
cedure, where the two iterative loops (highlighted by circles) are reduced by only calculating and saving θðtÞ.
as follows: (1) to update the control, (2) to check the satisfaction The solution of linearized governing Eq. (4) is in the form
of the Wolf condition. Z t
xðtÞ ¼ eAt xð0Þ þ eðt−τ ÞA budτ
0
Linearized Optimal Control of the Coning Motion
In the nonlinear optimization presented previously, three governing Recall that x ¼ ½ωy ; ωz ; θT and u ¼ ½uy ; uz ; 0T . Owing to the
equations are solved in the forward integration (from t ¼ 0 to linear nature of Eq. (4), the contribution of the initial condition xð0Þ
t ¼ T) and the full trajectory of xðtÞ has to be saved for solving and control u to the nutation angle θ are decoupled. Therefore
the three adjoint equations. The constraint of the Wolf condi-
θ ¼ θ0 þ θu ð17aÞ
tion on the step length β results in multiple forward integrations
to obtain an acceptable step length. The requirements of computa-
θ0 ¼ D0 xð0Þ ð17bÞ
tional time and memory in this nonlinear optimization may prevent
obtaining an optimal control signal instantly when a perturbation is
θu ¼ Du u ð17cÞ
observed, or when the coning motion is detected.
Considering the perturbations are small in magnitude when the where θ0 = nutation angle induced by a fixed initial condition xð0Þ;
coning motion is observed in practical flight trials, the linearized θu = nutation angle generated by the control u; and D0 ðtÞ and
governing equations around the zero base state can predict the evo- Du ðtÞ are the development operators of the initial condition
lution of the state variables reasonably accurately. In this section, a xð0Þ and control u to θ, respectively.
linearized optimization algorithm is developed by using the linear- θ0 is constant during the optimization process and the time
ized governing Eq. (3), rather than the nonlinear governing sequence of θu can be obtained as
Eqs. (1a)–(1d) as a constraint. This linearized optimization signifi-
cantly reduces the computational cost and is capable to generate 2 3 2 32 3
θu ðdtÞ d1 0 ··· 0 uy ðdtÞ
optimal control signals instantly in flight experiments. 6 7 6 76 7
Replacing the nonlinear governing equations with the linearized 6 θu ð2dtÞ 7 6 d2 d1 ··· 0 76 uy ð2dtÞ 7
6 7 6 76 7
governing Eq. (6), the Lagrangian functional in the linearized op- 6 .. 7¼6 . .. .. .. 76 .. 7
6 . 7 6 .. . . . 76 . 7
timization can be expressed as 4 5 4 54 5
θu ðTÞ dN dN−1 · · · d1 uy ðTÞ
L ¼ L þ ðx ; ẋ − Ax − buÞ þ λ½C − ðu; uÞ ð15Þ 2 32 3
e1 0 ··· 0 uz ðdtÞ
Setting the first variation of the Lagrangian functional with re- 6 76 7
6 e2 e1 · · · 0 76 uz ð2dtÞ 7
spect to u and x to zero, the linearized governing Eq. (4) and the 6 76 7
þ6 . .. .. .. 7 6 .. 7 ð18Þ
corresponding adjoint Eq. (16) 6 .. . . 7
. 546 . 7
4 5
ẋ ¼ −AT x þ ½0; 0; 2θT ð16Þ eN eN−1 ··· e1 uz ðTÞ
wards. The other two state variables ωy and ωz are not solved since in Appendix.
they are not going to be used in the subsequent steps. As a summary, the procedure of the linearized optimization of
In the adjoint equations only x1 and x2 are used in the optimi- control can be outlined in nine steps, as follows:
zation process [Eq. (11)], and analogously researchers can reduce 1. Set the control to zero and evolve the initial disturbance xð0Þ
the computational cost of the adjoint integration by only calculating using Eq. (4) to obtain θ0 ¼ D0 xð0Þ;
x1 and x2 . 2. Set the initial disturbance to zero and evolve a random guess of
Starting from the solution of the adjoint Eq. (16) control u satisfying the constraint on control cost to obtain the
Z T time sequence of θu using Eq. (18);
x ðtÞ ¼ − eðτ −tÞA ½0; 0; 2θT dτ
T
ð20Þ 3. Calculate the adjoint variables through Eqs. (21a) and (21b);
t 4. Calculate the gradient ∇u L from Eq. (11);
5. Calculate the search direction P~ in accordance with the con-
the time sequence of the adjoint variables can be subsequently jugate gradient method and then compute the modified search
written as direction P~ from Eq. (39) in Appendix;
2 3 2 32 3 6. Substitute P~ as u in Eq. (18) to obtain Du P; ~
x1 ðdtÞ h1 h2 ··· hN 2θðdtÞ 7. Calculate the optimal step length β which minimizes L as
6 x ð2dtÞ 7 6 0 hN 7 6 7
6 1 7 6 h1 ··· 76 2θð2dtÞ 7 detailed in Appendix;
6 .. 7¼6 . .. .. 7
.. 766 .. 7 ð21aÞ
6 7 6 . 7 8. Update the control and scale it to satisfy the constraint on
4 . 5 4 . . . . 54 . 5 control cost using Eq. (40), in Appendix I; correspondingly
x1 ðTÞ 0 0 ··· h1 2θðTÞ update the time sequence of θu ; and
9. Repeat Steps 3–8 until the control converges.
2 3 2 32 3
x2 ðdtÞ g1 g2 ··· gN 2θðdtÞ Fig. 4 shows a flowchart of this linearized optimization pro-
6 x ð2dtÞ 7 6 0 gN 7 6 7 cedure. Compared with the flow chart of nonlinear optimization
6 2 7 6 g1 ··· 76 2θð2dtÞ 7
6 .. 7¼6 . .. .. 7
.. 766 .. 7 ð21bÞ (Fig. 3), there is only one iterative loop in this procedure, which
6 7 6 . 7
4 . 5 4 . . . . 54 . 5 is the update of the control. The loop to choose a step length to
satisfy the Wolf condition is removed since an optimal step length
x2 ðTÞ 0 0 ··· g1 2θðTÞ
can be calculated analytically.
where the entries hi and gi (1 ≤ i ≤ N) are
Uniqueness of the Optimal Control
X
3
hi ¼ − 2Qj1 eiΛj dt Q−1
3j dt ð22aÞ The optimization discussed previously computes the optimal con-
j¼1 trol as a minimizer of the Lagrangian functional. This minimizer
At the equilibrium point where the gradient of this function with Substitute these decomposed forms into Eqs. (25a) and (25b) to
respect to the control vanishes this variation is zero and therefore obtain
Fig. 5 shows a flowchart for this direct solution, where the and the corresponding control cost Cable
loops appearing in the flowcharts of linearized and nonlinear
ðu; DTu θ0 Þ2
optimization algorithms have been removed, and no iterative circles Cable ¼ ðuable ; uable Þ ¼ ðu; uÞ ð37Þ
are involved. ðDTu Du u; uÞ2
By monitoring the CPU time elapsed in the simulation, the lin-
earized optimization consumes less than 10% of the CPU time used The gradient of the Lagrangian functional with respect to u is
in the nonlinear optimizations while the direct solution costs less
2ðu; DTu θ0 Þ2 T 2ðu; DT θ Þ
than 20% computational time used in the linearized optimization. ∇u Lun ¼ 2
Du Du u − T u 0 DTu θ0 ð38Þ
ðDu Du u; uÞ
T ðDu Du u; uÞ
Downloaded from ascelibrary.org by Beihang University on 02/02/22. Copyright ASCE. For personal use only; all rights reserved.
10-7 1014
1012
10-8
Cable (N2s)
Lun (rad2s)
10
10
-9
10
108
10-10
106
10-11 104
10-1 100 101 10-1 10 0
101
(a) T (s) (b) T (s)
Fig. 6. (a) Remaining of the coning motion that cannot be suppressed; (b) optimal control cost Cable obtained at time interval T
L(rad2s)
+ + x ωz(0)=0.02rad/s, linear
10-5
ωz(0)=0.02rad/s, nonlinear
+
x ωz(0)=0.05rad/s, linear
10-6
+ ωz(0)=0.05rad/s, nonlinear
x
-7
10
10-4 10-2 2
100 102
C(N s)
Downloaded from ascelibrary.org by Beihang University on 02/02/22. Copyright ASCE. For personal use only; all rights reserved.
Fig. 7. Minimization of the coning motion at fixed control cost in linear and nonlinear optimizations
1
0
direct solution
0.8 2
C= 6 (N s)
2
C=15 (N s) -0.1
C=20 (N2s)
0.6
uy (N)
0 -0.4
0 2 4 6 8 10 0 2 4 6 8 10
(a) t (s) (b) t (s)
Fig. 8. Optimal control for ωz ð0Þ ¼ 0.01 rad=s [the control has been normalized so that ðu; uÞ ¼ 1]: (a) uy component; (b) uz component
ωz ð0Þ ¼ 0.05 rad=s. Since the environmental disturbance initializ- ðu; uÞ ¼ 1. The control thrust uz ðtÞ, which directly reduces the nu-
ing the coning motion is commonly small in magnitude, it suggests tation angle in the nutation surface VOX, is much smaller than
that the linearized optimization based on perturbation analyses is an uy ðtÞ, indicating that the intuitive control acting on the nutation
accurate alternative to the full nonlinear optimization. surface is not efficient. Since uz ðtÞ directly reduces the nuta-
Figs. 8(a and b) show two components of the optimal control tion angle and uy ðtÞ reduces the rotating speed of the missile
uy ðtÞ and uz ðtÞ. To compare the time variation of the control at around the velocity vector, these results suggest that reducing
various control costs, the control has been normalized so that the rotating speed ωy is more efficient than reducing the nuta-
tion angle θ directly. As expected, the optimal control at small
0.01 control costs [C ¼ 6 N2 · s in Figs. 8(a and b)] overlaps with
uncontrolled the direct solution. At large values of control costs the optimal
2
0.008 C=1 (N s) control becomes significantly wavy and deviates from the direct
2
C=6 (N s) solution.
2
0.006
C=15 (N s) To demonstrate the control effects of the optimal control u
C=20 (N2s) calculated previously, the evolution of the coning motion at three
θ (rad)
0.004
values of control cost is tested (Fig. 9). As the control cost increases
the divergence speed of the nutation angle is reduced significantly.
When the control cost reaches 15 N2 · s, the coning motion is
0.002
completely suppressed except the oscillation at the beginning
time. However, owing to the reaction time of the control devices
0 (modeled as s in a previous section) the oscillation of the nutation
0 2 4 6 8 10 angle at small time cannot be suppressed effectively. Based on
t (s) these numerical observations, a model-free control law can be
designed to generate effective control without solving the gov-
Fig. 9. Developments of the controlled and uncontrolled coning
erning equations or adjoint equations when the coning motion is
motions
detected.
Appendix. Optimal Step Length in the Linearized c4 ¼ a21 ða24 − a1 a26 Þ ð45eÞ
Optimization
A Fletcher-Reeves conjugate gradient method (Nocedal and Wright Therefore there are five candidates for the optimal step length,
1999) is adopted to calculate the search direction P k , which is a β 1 ∼ β 5 , where β 1 ∼ β 4 are the solutions of Eq. (44) and β 5 ¼ ∞.
The following symbols are used in this paper: k = iteration step in optimization;
A = linearized state operator; · = derivative with respect to time; and
b = input operator; ^ = time discretized form of the time-continuous variables
C = constraint on control cost (N2 · s); or operators.
Cable = corresponding control cost of uable (N2 · s);
Du = evolution operator of the control thrust;
D0 = evolution operator of initial perturbations; References
dt = time step (s);
Betts, J. T. (1998). “Survey of numerical methods for trajectory optimiza-
f = state operator;
tion.” J. Guid. Contr. Dynam., 21(2), 193–207.
J 1 = roll moment of inertia (kg · m2 ); Burchett, B., Peterson, A., and Costello, M. (2002). “Prediction of swerv-
J 2 = lateral moment of inertia (kg · m2 ); ing motion of a dual-spin projectile with lateral pulse jets in atmospheric
L = time integration of the nutation angle (rad2 · s); flight.” Math. Comput. Model., 35(7–8), 821–834.
L = Lagrangian functional;
Downloaded from ascelibrary.org by Beihang University on 02/02/22. Copyright ASCE. For personal use only; all rights reserved.