Professional Documents
Culture Documents
Functionals Share Many of The Same Properties As Functions
Functionals Share Many of The Same Properties As Functions
Functionals Share Many of The Same Properties As Functions
2.1 Functions
Consider the function y = f(x). Here, x is the independent input variable. To each x
belonging to the function domain, the function operator f associates a number y
which belongs to the function range. We call y = y(x) the dependent variable. There
is a difference between single-valued functions and multivalued functions. The first
one means that to each input belongs exactly one output. The second case means
that an input can produce multiple outputs. This short review of functions enables
us to move on to functionals.
2.2 Functionals
We begin in Subsection 1.1 by showing that the distance between two points can be
expressed as a functional. Then in Subsection 1.2 we show that the shortest
distance between two points in a plane is a straight line.
1.1 Distance between two points on a given curve
First, we need an expression for the length of a curve between two given points in a
plane.
Suppose that we are given two points Pa and Pb with Cartesian coordinates (a, A)
and (b, B), respectively. Furthermore, suppose that y = y(x), where a ≤ x ≤ b, is a
smooth curve that joins Pa at x = a to Pb at x = b, so that y(a) = A and y(b) = B, as
shown in Figure 1.
You may have derived a formula for the length of this curve in earlier modules. It is
obtained by dividing the interval a ≤ x ≤ b into N intervals of length δx = (b − a)/N.
Let us denote the endpoints of these intervals by x0, x1, . . . , xN , in that order, so
that x0 = a, xN = b and xk+1 − xk = δx for k = 0, 1, . . . , N − 1, as shown in Figure
2.
Figure 2 Subdivision of the interval [a, b] into N equal intervals; in this case N = 5
√ ( )
2
δy
δ s k =√ δ x 2 +δ y 2=δx 1+
δx
However, δy/δx is just the gradient of the segment, which in the limit as N → ∞ is
given by δy/δx = y ′ (xk) (i.e. the derivative dy/dx of y evaluated at x = xk). Therefore
√ '
δ s k =δx 1+ y ( x k )
2
The approximate distance from (a, A) to (b, B) along the curve y = y(x) is given by the
sum of all the segment lengths:
N −1 N−1
k=0 k=0
√
S[ y ]≃ ∑ ❑δ s k = ∑ ❑δx 1+ y ' ( x k )2
b
S[ y ]=∫ ❑ √ 1+ y ¿ ¿
'
The above expression gives the length of the curve y = y(x) between the points Pa
and Pb. The length changes for different functions y(x). For instance, if Pa = (−1, 0)
and Pb = (1, 0), then the distance from Pa to Pb along the straight line joining these
points is 2, whereas the distance is π along the semicircle of which this straight line
is a diameter.
We denote the numerical value of the length of the curve by S[y], which emphasises
the fact that the length depends on the function y(x). We do not write S(y), because
we wish to distinguish S[y] from the real-valued function y(x). The quantity S[y] is
our first example of a functional; it maps functions y(x) that satisfy y(a) = A and y(b)
= B to the length of the curve y = y(x) between x = a and x = b. More generally, a
functional is any map from functions to real numbers. Sometimes the set of
functions on which a functional acts is restricted, as in the example that we have
just considered.
The distance between two points (a, A) and (b, B) along a curve y = y(x) that passes
between them is given by the functional
b
S[ y ]=∫ ❑ √ 1+ y ¿ ¿
'
2.1 Brachistochrone
The problem here is to find the shape of a wire joining two given points, so that a
bead will slide down the wire under gravity from one point to the other (without
friction) in the shortest time (see Figure 6).
The name given to this curve is the brachistochrone, which comes from the Greek
brachystos, meaning shortest, and chronos, meaning time. If the y-axis is vertical,
and the two given points at the ends of the wire are Pa = (0, 0) and Pb = (b, B), and
the particle starts from rest, then it can be shown that the time taken along the
curve y(x) is
√
' 2
1+ ( y )
b
1
T [ y]= ∫ ❑ dx , y (0)=0 , y (b)=B
√2 g 0 y
The problem discussed in this subsection was first considered by Galileo Galilei
(1564–1642) in 1638, but, lacking the necessary mathematical techniques, he
concluded erroneously that the solution is the arc of a circle passing vertically
through Pa. It was Johann Bernoulli (1667–1748) who made the problem famous
when in June 1696 he challenged readers of the scientific journal Acta Eruditorum
to solve it, reassuring them that the curve was well known to geometers. He also
stated that he would demonstrate the solution at the end of the year, provided that
no one else had. In December 1696, Bernoulli extended the time limit to Easter
1697, though by this time he was in possession of the solution by Gottfried Wilhelm
Leibniz (1646–1716), sent in a letter dated 16 June 1696 – Leibniz having received
notification of the problem on 9 June. Isaac Newton (1642–1727) also solved the
problem quickly, apparently on the day of receipt, and published his solution
anonymously.
4 Euler–Lagrange equation
The calculus of variations is thus all about solving optimization problems. The point
is to find the function that optimizes the functional. To understand this method, we
first look at the basic one-dimensional functional, also known as the objective
functional:
b
J [ y ]=∫ ❑ L ( x , y , y ) d x
'
We use L instead of F because the integrand is called the Lagrangian, named after
Joseph-Louis Lagrange, an Italian mathematician who was very important in the
development of calculus of variations.
y (a)=α , y (b)= β
or the first variation test, we need to calculate the gradient of the functional. We
follow the procedure of [2]. We first impose an inner product ⟨ y ; v ⟩ on the
underlying function space. The gradient ∇J[y] is then defined by the basic
directional derivative formula:
⟨ ∇ J [ y ]; v ⟩=
d
dt |
J [ y +tv]
t=0
(3.3)
b
⟨ f ; g ⟩=∫ ❑ f (x) g (x)d x (3.4)
a
What we have to do now is to compute the derivative of J[y + tv] for each fixed y and
v. Let h(t) = J[y + tv]:
b
h( t)=J [ y +tv ]=∫ ❑ L ( x , y+ tv , y +t v ) d x (3.5)
' '
b
d
'
h ( t)∧¿ ∫ ❑ L ( x , y +tv , y ' +t v ' ) d x
dt a
b
d
¿=∫ ❑ L ( x , y +tv , y +t v ) d x
' '
a dt
[ ]
b
∂L
¿=∫ ❑ v ( x , y +tv , y ' +t v ' ) +v ' ∂ L' ( x , y+ tv , y ' +t v ' ) d x
a ∂y ∂y
Where we used the chain rule. Now evaluating the derivative at t = 0 gives:
[ ]
b
∂L( ' ∂L
⟨ ∇ J [ y ]; v ⟩=∫ ❑ v x , y , y ) +v ( x , y , y ' ) d x (3.6)
'
'
a ∂y ∂y
Equation (3.6) is known as the first variation of the functional J[y]. We call
⟨ ∇ J [ y ]; v ⟩=0 the weak form of the variational principle.
What we want to do next is to obtain an explicit formula for ∇J[y]. The first step is to
write equation (3.6) in a different way. Using the L2 -inner product (3.4) we write:
b b
⟨ ∇ J [ y ]; v ⟩=∫ ❑ ∇ J [ y ]v d x=∫ ❑hv d x
a a
[ ]
b b
∂L
∫ ❑hv d x =∫ ❑ v ( x , y , y ' ) + v ' ∂ L' ( x , y , y ' ) d x
a a ∂y ∂y
This is almost the same form, except for the v' . We fix this using integration by
parts. Therefore, let:
∂L
r (x )≡ '
( x , y , y' )
∂y
[ ]
b b
b b
a a
2
'
r (x )∧¿
d ∂L
dx ∂ y
2
' (
( x , y , y') )2
∂ L
¿= (' x , y , y ' ) + y ' ∂ L ' ( x , y , y ' ) + y ' ' ∂ L' 2 ( x , y , y ' )
∂x ∂ y ∂ y∂ y ∂( y )
Applications of the Euler-Lagrange equation