Professional Documents
Culture Documents
History of The Discovery of The Pontryagin Maximum Principle
History of The Discovery of The Pontryagin Maximum Principle
c Pleiades Publishing, Ltd., 2019.
Russian Text
c The Author(s), 2019, published in Trudy Matematicheskogo Instituta imeni V.A. Steklova, 2019, Vol. 304, pp. 7–14.
Abstract—We briefly describe the history of the discovery of the Pontryagin maximum princi-
ple in optimal control theory in the mid-1950s by Academician L. S. Pontryagin in collaboration
with his students—V. G. Boltyanskii and the author of the present paper—who were then young
scientists at the Department of Differential Equations headed by Pontryagin at the Steklov
Mathematical Institute.
DOI: 10.1134/S0081543819010012
ẍ + x = u, |u| ≤ 1.
The synthesis was contained in the Princeton thesis of Bushaw [2] of 1952 and was recognized
by specialists as an outstanding scientific achievement. The description of the synthesis reached
Moscow in 1953, without proof, and, in spite of all efforts, the proof could not be restored for a long
time. Only after the formulation of the Pontryagin maximum principle, Bushaw’s synthesis was
easily interpreted: it turned out to be a straightforward consequence of the maximum principle.
1
2 R. V. GAMKRELIDZE
the 1940s. I think that such radical changes in his creative activity, which influenced all his subse-
quent intellectual life, were due to psychological reasons rather than to simple curiosity or scientific
interest in certain fields of applied mathematics.
By the beginning of the 1950s, Pontryagin became firmly convinced that the era of dominance
of Moscow topology came to an end, and the solution of hard current problems in topology required
completely new methods, including those initiated in the 1940s by French mathematicians. He
decided to abandon topology and use his experience and talents of a mathematician in applied
fields.
His expertise in the theory of dynamical systems was quite deep, and he already had some
experience of research in this field. His publications in this direction included even the famous joint
paper [1] with the well-known Soviet physicist Academician Andronov, in which the authors intro-
duced the concept of structural stability, which later played an important role in the development of
the theory of dynamical systems. Therefore, the first sessions of the seminar were devoted to some
problems of the theory of dynamical systems, including those of asymptotic theory of singularly
perturbed nonlinear oscillations with a small parameter at the highest order derivative.
Soon after the start of its work at the Mathematical Institute, Pontryagin’s seminar became
an attractive center for many leading specialists in applied fields of mathematics, and we often
hosted speakers from various scientific centers of the USSR. Pontryagin got involved in optimization
problems quite accidentally, after a meeting with two military experts in the aerodynamics of flying
vehicles.
They discussed a time-optimal problem for an aircraft that was described by a fifth-order dif-
ferential equation with three scalar control parameters. Two of these parameters were bounded and
appeared in the equations linearly; hence, it was clear from the very beginning that the extremals
of the problem cannot be found by the classical method, i.e., as solutions to the Euler–Lagrange
equations. After a few short and unsuccessful attempts to find a solution to the formulated specific
problem, Pontryagin came to the conclusion that the problem should be posed in a general form, and
one should first find some general principles for its solution that could then be applied to specific
cases. I remember he even said half-jokingly, “We should invent a new calculus of variations.”
Finally, he formulated a time-optimal problem in a form that has now become generally accepted
(sometimes including details of notation).
to the time-optimal problem in question. However, his choice was not so trivial: he started to
directly solve the variational problem and obtained a completely new type of necessary extremality
conditions, which later served as a basis for the discovery of the maximum principle.
He presented the results at the seminar, and now I proceed to give an account of them by
exactly (including the notation) reproducing Pontryagin’s talk. For brevity, I only apply the vector
notation instead of the index notation used by him during the talk.
Given the control equation (1), consider a time-optimal problem with fixed endpoints x0 and x1 :
⎧
⎨ d x(t) = f (x(t), u(t)), u(t) ∈ U, x(t ) = x , x(t ) = x ,
0 0 1 1
dt (2)
⎩
t1 − t0 = min ∀t0 ≤ t ≤ t1 .
The simple duality considerations described below led Pontryagin to the analysis of the following
adjoint (linear homogeneous) equation in an “auxiliary variable,” a covector ψ:
dψ ∂f
= −ψ (x, u). (3)
dt ∂x
The system obtained by combining equations (1) and (3) turned out to be a “clue” to solving the
whole problem and allowed him to formulate the following necessary condition, the original version
of the maximum principle, which is satisfied by an arbitrary solution x(t), u(t) of problem (2).
Original necessary optimality condition. Let x(t), u(t), t ∈ J, be a solution to the time-
optimal problem (2). Then there exists a nonzero covector function ψ(t), t ∈ J, that satisfies the
system
⎧
⎪ d
⎪
⎪ x(t) = f (x(t), u(t)),
⎪ dt
⎨
d ∂f (4)
⎪
⎪ ψ(t) = −ψ(t) (x(t), u(t)),
⎪
⎪ dt ∂x
⎩
x(t0 ) = x0 , ψ(t0 ) = 0, u(t) ∈ U
(of vector differential equations of order 2n with given initial data) and the r finite (scalar ) equations
∂f
ψ(t) (x(t), u(t)) = 0 ∀t ∈ [t0 , t1 ], i = 1, . . . r, (5)
∂ui
which serve to eliminate the r-dimensional parameter u and thus define, generally speaking, a unique
solution to system (4), (5).
Proof. Let δu(t), t ∈ [t0 , t1 ], be an admissible variation of the optimal control u(t) and δx(t)
be the corresponding (first) variation of the optimal trajectory x(t), which satisfy the linear (inho-
mogeneous) “variational equation”
d ∂f ∂f
δx(t) = (x(t), u(t))δx(t) + (x(t), u(t))δu(t), δx(t0 ) = 0. (6)
dt ∂x ∂u
Since the admissible set U is assumed to be open and equation (6) is linear, one can assume that the
variations δu(t) are arbitrary piecewise continuous functions, which is crucial for the whole proof.
Introducing fundamental matrices Φ(t) and Ψ(t) of solutions to the homogeneous part of equa-
tion (6) and to the corresponding adjoint equation,
⎧
⎪
⎪ d ∂f
⎪
⎪ Φ(t) = (x(t), u(t))Φ(t),
⎪
⎨ dt ∂x
d ∂f (7)
⎪
⎪ Ψ(t) = −Ψ(t) (x(t), u(t)),
⎪
⎪ dt ∂x
⎪
⎩
Φ(t) = Ψ−1 (t) ∀t ∈ [t0 , t1 ],
Consider the linear mapping—from the vector space of all variations δu(t) to the space Rn —defined
by formula (8) with t = t1 , {δu(t), t0 ≤ t ≤ t1 } → δx(t1 ). The image of this mapping is a
subspace Π ⊂ Rn , with Γ = x(t1 ) + Π being a plane passing through the terminal point x(t1 ) of the
trajectory x(t). Since the trajectory is optimal, it follows from elementary topological considerations
that Π cannot coincide with the whole space Rn . Hence, its dimension is at most n − 1, and there
exists a nonzero covector χ orthogonal to Π:
∃χ = 0 : χ · δx(t1 ) = 0 ∀δx(t1 ) ∈ Π.
5. SECOND VARIATION
An extremal, i.e., an arbitrary nontrivial solution of system (4), is said to be regular if the
quadratic form of δu,
∗ ∂ 2 f (x(t), u(t))
δu ψ(t) δu, t ∈ [t0 , t1 ], (10)
∂ui ∂uj
is nondegenerate along this extremal for all t ∈ [t0 , t1 ]. The study of the second variation of the
time-optimal trajectory led me to the following main result.
If a trajectory x(t), t0 ≤ t ≤ t1 , is time-optimal and the corresponding extremal ψ(t), x(t) is
regular, then the quadratic form (10) is negative definite for all t ∈ [t0 , t1 ] (for an appropriately
chosen direction of the covector ψ(t1 )):
∗ ∂ 2 f (x(t), u(t))
δu(t) ψ(t) δu(t) ≤ 0 ∀t ∈ [t0 , t1 ]. (11)
∂ui ∂uj
After inequality (11) had been written out, I finally noticed that the scalar function H of ψ, x,
and u,
H(ψ, x, u) = ψf (x, u), (12)
allows one to easily and concisely reformulate all the results obtained by that time in canonical
terms with Hamiltonian H. These results are listed below.
A. The original Pontryagin system (4), together with the system of r finite equations (5) for
obtaining the extremals of the problem, can be expressed as the canonical system with Hamilto-
nian H,
⎧
⎪ dx(t) ∂H
⎪
⎨ dt = ∂ψ (ψ(t), x(t), u(t)),
(13)
⎪
⎪ dψ(t) ∂H
⎩ =− (ψ(t), x(t)u(t)),
dt ∂x
and the system of r scalar equations
∂H
(ψ(t), x(t), u(t)) = 0 ∀t ∈ [t0 , t1 ], i = 1, . . . , r, (14)
∂ui
according to which every point of an extremal is a stationary point for the Hamiltonian H with
respect to the parameter u.
B. The regularity of an extremal in these “canonical terms” is expressed as the nonvanishing of
the Hessian of H with respect to the variables ui along the extremal:
2
∂ H
Hess|u H(ψ(t), x(t), u(t)) = det
def
∂ui ∂uj (ψ(t), x(t), u(t)) = 0 ∀t ∈ [t0 , t1 ], i = 1, . . . , r. (15)
C. Finally, the negative definiteness of the form (10) along a regular extremal implies, according
to (11), the local maximality of the Hamiltonian H with respect to u at every point of the extremal.
Thus, in the regular case, extremals are solutions of the canonical system (13), and, according
to equations (14), every point of an extremal is a stationary point of the Hamiltonian H = ψf (x, u)
with respect to the variable u; hence, inequality (11) implies that along every regular extremal the
Hamiltonian H attains a local maximum in u.
I decided to immediately test the obtained “weak version” of the oncoming (not yet formulated)
maximum principle on the extremals of the Bushaw oscillator, which are “illegitimate” for this
purpose. Surprisingly easily, I obtained the full picture of its synthesis. I quickly phoned Pontryagin
to inform about the progress in the study of our problem. We met the same day, and I gave him a
detailed account of all the results described above.
He accepted the Hamiltonian structure of his equations quite calmly; I think he did not see
any additional arguments for calculating extremals in this fact. However, the condition of local
maximality of H with respect to u along an extremal and the synthesis of the Bushaw oscillator
impressed him no less than me, and he did not try to conceal this fact: being rather reserved by
nature, he was nevertheless completely free of “natural” haughtiness, which is so characteristic of
outstanding talents. In subsequent years, I always remembered this meeting as the greatest reward
for our joint achievements.
H(ψ, x, u) = ψf (x, u)
a bit for rare critical remarks of “pure mathematicians.” However, in spite of all these achievements,
the maximum principle, being a mathematical theorem rather than a “natural-science principle,”
needed a mathematical proof.
Finally, after about eight months of search, the proof was obtained by Boltyanskii, who intro-
duced a new type of variations, the so-called needle variations of the control parameter. These
variations vanish on the whole time axis except for several intervals of small total length, where
they can take arbitrary admissible values. An important property of needle variations is that one
can always take their convex combinations irrespective of the form of the set U , which generates
a sufficiently rich attainability set. Using these variations, Boltyanskii could prove the maximum
principle in full generality, without any additional assumptions on the regularity of the problem.
This was a significant contribution of Boltyanskii to the solution of the problem. The very con-
struction of variations was highly original (especially after simplifications proposed by Pontryagin)
and made a very useful contribution to the arsenal of general methods of mathematical analysis.
The final results were summarized in our joint monograph The Mathematical Theory of Optimal
Processes [3], which was published in 1961. Pontryagin also involved E. F. Mishchenko—one of
his most active young colleagues that time—in writing this book; Mishchenko was the author of
the last chapter of the book, on a statistical problem in optimization theory. In parallel, the book
was published in English [4] in the USA by a division of John Wiley & Sons publishers. The
English translation was edited by L. Neustadt, a well-known American specialist in optimization,
who was one of the first to master the new methods coming from Moscow. Subsequently, Neustadt
maintained close scientific contacts with us during the whole short period of his remaining days.
CONCLUDING REMARKS
The development of optimal control theory that followed the discovery of the Pontryagin maxi-
mum principle can be primarily characterized as a rich harvest due to the successful application of
the maximum principle not only to pure optimization problems but also to a number of classical and
more modern branches of mathematics where the classical calculus of variations was traditionally
used; these are, for example, Riemannian and sub-Riemannian geometries.
It is remarkable that all further development of the theory was aimed at extending the class
of optimization problems under study rather than at generalizing the maximum principle itself.
For example, in such fields as nondifferentiable optimization or infinite-dimensional optimization
(partial differential equations), the corresponding necessary optimality conditions were formulated
(in the first order) by analogy with the maximum principle.
REFERENCES
1. A. A. Andronov and L. S. Pontryagin, “Structurally stable systems,” Dokl. Akad. Nauk SSSR 14 (5), 247–250
(1937).
2. D. W. Bushaw, “Differential equations with a discontinuous forcing term,” PhD Thesis (Princeton Univ., Prince-
ton, NJ, 1952).
3. L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze, and E. F. Mishchenko, The Mathematical Theory of
Optimal Processes (Fizmatgiz, Moscow, 1961) [in Russian].
4. L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze, and E. F. Mishchenko, The Mathematical Theory of
Optimal Processes, Ed. by L. W. Neustadt (Interscience, New York, 1962).
Translated by I. Nikitin