Professional Documents
Culture Documents
1 Conditioned Extremum Points: Conditioned Extremum Point of F With Respect To The Set C
1 Conditioned Extremum Points: Conditioned Extremum Point of F With Respect To The Set C
Seminar 9 2020
and
F = (F1 , ..., Fm ) : A → Rm is a C 1 function .
If c is a condition extremum of f with respect to C, and
rankJ(F )(c) = m,
Here L is called the Lagrange function associated to f , and λ01 , ...λ0m are the
Lagrange multipliers.
In practice, we are interested in point 2, from Theorem 2.21.5 from the lecture notes,
which says that: If c ∈ C and there exists λ0 ∈ Rm such that
1
for all h ∈ Rn \{0n } such that dF (c)(h) = 0M , where
then
c is a conditioned local minimum (respectively maximum)
point o f w.r.t. C.
Step 2 Determine the critical/stationary points of f , i.e all the points a ∈ A s.t.
∂f
(a) = 0, ∀j ∈ {1, ..., n}.
∂xj
C = {c ∈ A : ∇f (c) = 0n }
If f has no critical points, (i.e C = ∅) then f has no local extremum points.
If f has some critical points, (i.e. C 6= ∅) GO TO Step 3
Step 4 For every critical point c ∈ C, analyze the Hessian matrix, hence, analyze Hf (c)
2
– If Hf (c) is POSITIVE DEFINITE, then c is a local minimum point of
f.
– If Hf (c) is NEGATIVE DEFINITE, then c is a local maximum point
of f .
– If Hf (c) is INDEFINITE, then c is not a local extremum point of f , case
in which it is called a a saddle point.
Now, the natural question arising is concerning the nature of the Hessian matrix. We
use Sylvester’s Theorem applied to the quadratic matrix Hf (c). In order to formulate
this theorem, we make the following notations for the determinants for the minors (on
the first diagonal) of this matrix. Thus, for
∂2f ∂2f
2 (a) (a)
2
∂ f
∂x2 ∂x1
∆1 = 2 (a) , ∆2 = ∂∂x2 f1 ...
∂2f
∂x1 ∂x ∂x (a)
1 2 ∂x22
(a)
2 2
∂ f (a) ∂ f ∂2f
∂x21 ∂x2 ∂x1
(a) ... ∂xn ∂x1
(a)
∂2f ∂2f ∂2f
(a) ∂x22
(a) ... (a)
∆n = ∂x1 ∂x2 ∂xn ∂x2
... ... ... ...
∂2f ∂2f 2
∂ f
∂x1 ∂xn (a) ∂x2 ∂xn (a) ... (a)
∂x2
n
Sylvester’s Theorem:
A quadratic matrix of the form Hf (a) is
• INDEFINITE if we are not in the two cases above, and still ∆k > 0 for
all k ∈ {1, ..., n}
3
This theorem can be used in most of the cases, but it cannot be applies when some
of the determinants are 0.
For the case when there exists k ∈ {1, ..., n} such that ∆k = 0, we have to consider
the second order differential function attached to f at the point a, which is
d2 f (a) : Rn → R
given by
d2 f (a)(h) = hT · Hf (a) · h, ∀h ∈ Rn ,
which, if you do the matrix multiplication turns into
n X n
X ∂ 2f
2
d f (a)(h) = (a) · hi · hj , ∀h = (h1 , ..., hn ) ∈ Rn .
i=1 j=1
∂x i ∂x j
this means that the Hf (a) is indefinite, therefore, at Step 4 of the algorithm, the
conclusion is that
a is a saddle point,
hence it is not a local extremum point.
Hf (a) is said to be semidefinite if
d2 f (a)(h) ≤ 0
or ≥ 0 for all h ∈ Rn (it is not stict). In this case the nature of the point a has to be
analyzed by going back to the original form of the function (see some of the following
examples )
4
For all (x, y, z) ∈ R3 it holds ∂f
∂x
(x, y, z) = 4x − y + 2z, ∂f
∂y
(x, y, z) = −x − 1 + 3y 2 and
∂f
∂z
(x, y, z) = 2x + 2z. The stationary points of f are the solutions of the system
4x − y + 2z = 0
−x − 1 + 3y 2 = 0
2x + 2z = 0.
From the last equation it follows that x = −z. By replacing this in the first equaiton
we get y = 2x, equality which, together with the second equation, provides us with
12x2 − x − 1 = 0. Thus x ∈ {− 41 , 13 }. It follows that (− 14 , − 21 , 14 ) and ( 13 , 23 , − 31 ) are the
stationary points of the function.
For all (x, y, z) ∈ R3 it holds
∂ 2f ∂ 2f ∂ 2f
(x, y, z) = 4, (x, y, z) = 6y, (x, y, z) = 2,
∂x2 ∂y 2 ∂z 2
∂ 2f ∂ 2f ∂ 2f
(x, y, z) = −1, (x, y, z) = 2, (x, y, z) = 0,
∂x∂y ∂x∂z ∂y∂z
thus
4 −1 2 4 −1 2
1 1 1 1 2 1
H(f ) − , − , = −1 −3 0 , H(f ) , ,− = −1 4 0 .
4 2 4 3 3 3
2 0 2 2 0 2
Moreover
1 2 1 −13
f( , , − ) = .
3 3 3 27
Since Sylvester’s criterion cannot be applied to the matrix H(f ) − 41 , − 12 , 14 , we will
study its nature by other means. The quadratic from associated to this matrix is
Since Φ(0, 1, 0) = −3 < 0 and Φ(0, 0, 1) = 2 > 0, it follos that this quadratic form (and
its corresponding matrix) is indefinite. Therefore, (− 41 , − 21 , 14 ) is not a local extremum, it
is a saddle point.
b) f : R2 → R, f (x, y) = x2 + y 2 (1 − x)3 .
For all (x, y) ∈ R2 it holds ∂f
∂x
(x, y) = 2x − 3y 2 (1 − x)2 and ∂f
∂y
(x, y) = 2y(1 − x)3 . The
stationary points of f are the solutions of the system
2x − 3y 2 (1 − x)2 = 0
2y(1 − x)3 = 0.
From the last equation we get y = 0 or x = 1. If y = 0, then from the first equation of the
system we get x = 0. We notice that x = 1 does not obey the first equation of the system,
thus it is excluded. In conclusion (0, 0) is the only stationary point of the function.
5
For all (x, y) ∈ R2 it holds
∂ 2f ∂ 2f ∂ 2f
(x, y) = 2 + 6y 2 (1 − x), (x, y) = −6y(1 − x)2 , (x, y) = 2(1 − x)3 ,
∂x2 ∂x∂y ∂y 2
thus
2 0
H(f )(0, 0) = .
0 2
Since 2 > 0 and 4 > 0, it follows that H(f )(0, 0) is positive definite, thus (0, 0) is a local
minimum point. Moreover
f (0, 0) = 0.
Example 2:
Determine all local extrema, their type (minima or maxima) and
the corresponding extreme values of the following functions:
a) f : R3 → R, f (x, y, z) = x3 − 3x + y 2 + z 2 ;
b) f : R2 → R f (x, y) = x4 + y 4 − 4(x − y)2 ;
c) f : R3 → R f (x, y, z) = z 2 (1 + xy) + xy;
d) f : R2 → R f (x, y) = x3 + 3xy 2 − 15x − 12y.
Solution
a) f : R3 → R, f (x, y, z) = x3 − 3x + y 2 + z 2
Step 1: The critical/stationary points For all (x, y, z) ∈ R3 it holds ∂f ∂x
(x, y, z) =
2 ∂f ∂f
3x −3, ∂y (x, y, z) = 2y and ∂z (x, y, z) = 2z. The critical/stationary points of the function
f are the solutions of the system
2
3x − 3 = 0
2y = 0
2z = 0.
Therefore (−1, 0, 0) and (1, 0, 0) are the only stationary points of the function. The set
of the critical points is
C = {(−1, 0, 0), (1, 0, 0)}
For all (x, y, z) ∈ R3 it holds
∂ 2f ∂ 2f ∂ 2f
(x, y, z) = 6x, (x, y, z) = 2, (x, y, z) = 2,
∂x2 ∂y 2 ∂z 2
∂ 2f ∂ 2f ∂ 2f
(x, y, z) = 0, (x, y, z) = 0, (x, y, z) = 0,
∂x∂y ∂x∂z ∂y∂z
6
thus
−6 0 0 6 0 0
H(f )(−1, 0, 0) = 0 2 0 , H(f )(1, 0, 0) = 0 2 0 .
0 0 2 0 0 2
The matrix H(f )(1, 0, 0) positive definite, thus (1, 0, 0) is a local minimum point of the
function.
Since Sylvester’s criterion cannot be applied to the matrix H(f )(−1, 0, 0), we will
study its nation by other means. The quadratic form associated to this matrix is
Since Φ(1, 0, 0) = −6 < 0 and Φ(0, 1, 0) = 2 > 0, it follows that this quadratic form (and
therefore its associated matrix) is indefinite. Thus (−1, 0, 0) is not a local extrema, it is
a saddle point.
By adding the two equations we get x3 = −y 3 , thus x = −y. By replacing this in the first
equation e get x3 − 4x = 0, thus x ∈ {−2, 0, 2}. Therefore the stationary points of the
function are (−2, 2), (0, 0) and (2, −2).
For all (x, y) ∈ R2 it holds
∂ 2f ∂ 2f ∂ 2f
2
(x, y) = 12x2 − 8, (x, y) = 8, 2
(x, y) = 12y 2 − 8.
∂x ∂x∂y ∂y
For the three stationary points we get
40 8 −8 8
H(f )(−2, 2) = H(f )(2, −2) = and H(f )(0, 0) = .
8 40 8 −8
Since the matrix H(f )(−2, 2) = H(f )(2, −2) is positive definite the points (−2, 2) and
(2, −2) are local minimum points. Then
Due to the fact that the determinant of the matrix H(f )(0, 0) is zero, the algorithm
for determining the stationary points provides us with no useful information connected
to the stationary point (0, 0). Nevertheless we notice that f (0, 0) = 0, f (x, x) = 2x4 ,
f (x, 0) = x2 (x2 − 4). It follows that in each neighnborhood of (0, 0) the function f takes
both positive and negative values (for example, for all n ∈ N it holds f ( n1 , n1 ) > 0 and
f ( n1 , 0) < 0). Therefore (0, 0) is not a local extremum, it is a saddle point.
7
For all (x, y, z) ∈ R3 it holds ∂f∂x
(x, y, z) = yz 2 + y, ∂f
∂y
(x, y, z) = xz 2 + x and
∂f
∂z
(x, y, z) = 2z(1 + xy). The stationary points of f are the solutions of the system
y(z 2 + 1) = 0
x(z 2 + 1) = 0
2z(1 + xy) = 0.
From the first two equations it follows that y = 0 andx = 0, respectively. By replacing
these values in the last equation we reach the conclusion that z = 0. Hence (0, 0, 0) is the
only stationary point of the function.
For all (x, y, z) ∈ R3 it holds
∂ 2f ∂ 2f ∂ 2f
(x, y, z) = 0, (x, y, z) = 0, (x, y, z) = 2(1 + xy),
∂x2 ∂y 2 ∂z 2
∂ 2f ∂ 2f ∂ 2f
(x, y, z) = z 2 + 1, (x, y, z) = 2yz, (x, y, z) = 2xz,
∂x∂y ∂x∂z ∂y∂z
thus
0 1 0
H(f )(0, 0, 0) = 1 0 0 .
0 0 2
Since Sylvester’s criterion cannot be applied to the matrix H(f )(0, 0, 0), we study its
nature by other means. Its associated quadratic form is
Since Φ(−1, 1, 0) = −2 < 0 and Φ(0, 0, 1) = 2 > 0, it follows that this quadratic form
(therefore its corresponding matrix) is indefinite. Thus (0, 0, 0) is not a local extremum
point of the function, it is a saddle point.
8
1 2 2 1
H(f )(1, 2) = 6 , H(f )(2, 1) = 6 .
2 1 1 2
The matrix H(f )(−2, −1) is negative definite, H(f )(2, 1) is positive definite while
(f )(−1, −2) ane H(f )(1, 2) are indefinite. Thus (−2, −1) is a local maximum point of f ,
and (2, 1) is a local minimum, while (−1, −2) and (1, 2) are not local extremum point,
they are saddle points. Moreover