Professional Documents
Culture Documents
Ad 0744313
Ad 0744313
Ad 0744313
IWBiBiillPilWi
CO
CO TOPICS !N OPTIMIZATION
^5
BY
MICHAEL OSBORNE
STAN-CS-72-279
APRIL 1972
Reproduced by
■ I ■
NATIONAL TECHNICAL
INFORMATION SERVICE
U S Deparfment of Comm«rc«
Springfield VA22I}1
D D C
ill 3 1372
nins
M
--'-'■ ...-.--.... - „,.:-... .
^
•V.Kjry^T^r'rr^ iTT-'.:'--'r--7^.-y'", V7~t^r»- i ji^^,.:'-!! j«"'"" "T""'-71'7''"" S^CTTT«»nn n-wr^T- '-T-TS-T «giimi'. in J'^Ufff) V
BWMBWWWWWWI ►'wsfftffww^ff'W^^
Unclassified
SIM .intv Classification
DOCUMENT CONTROL DATA -R&D
iSrrurity rlamtirtiion of iltlu, body ol abilrmrl «iirf Mailing tmnofllon muni be «nfrad when Hi» ov»tall rapott It clmttllM)
ONiaiN«rrNC «CTIVITV (Coipont» mulhot) 1«. «EPO«T SECURITY CLASSIFICATION
Unclassified
Stanford University 26. OftOUP
I BFPORT TITLE
Topics in Optimization
Michael Osborne
10 DISTRIBUTION STATEMENT
U ABSTRACT
X
i i
DD.'r..1473 """*
I NOV I
I)
Unclassified
S/N 0101.807.6801 Security Cltstification
Unclassified
Sacurity CU«»lftc>tion
LIMK * LINK • LINK C
KtV WONOt
NOLI NOLI MOLK
'B??w5?(^^^!?''f?HÄ!p"(W*iHBW9'Ww i ■ iiMimmirr-'"———"
TOPICS IN OPTIMIZATION
BY
Michael Osborne
STAN-CS-72-279
April 1972
UL
-— ■
naMaaMMui aa— mjaaiMMMittitt 1
TOPICS IN OPTIMIZATION
Michael Osborne
Computer Science Department
Stanford University
and
Abstract
function, and (b) has a family connection with the conjugate direction
convergence indicated.
^
Introduction
quarter of 1971. In part they ara based on lectures given during the
which appear to be the most important for the development and analysis
extrema, so that convexity and duality theory are not treated in any
great detail. However, the material given is more than adequate for the
■
mmm
r3T»1WW^-T-nT-7i^«ff^TOJ'W^!W?W^,'*,^**B^^^
In the third section, penalty and barrier function methods for non-
with other areas: for example, with the method of regularization for
Ac knowledgment s
O
The material presented here has benefited greatly from discussions
with Roger Fletcher, Gene Golub, John Greenstadt, Shmuel Oren, Mike Powell,
and Mike Saunders. The presentation of the course was shaped more by the
than by any consideration for the audience. In spite of this the level
out the credit students Linda Kauftaan, John Lewis and Margaret Wright
in this respect.
Contents
] i
mm *t*m iiiMttmmm ■
pitl.lilWl|UWWlW.iW
where
Ä 2 l/2
11*11 ^ ty,'1'-} > ^he euclidean vector norm of t .
i-1 1
«MMMMMt
-■- ■ -
PPBfWWWBg^lWy ?"• [Hp^jj^yagg—ppyi -n
■rr .y. -■■ »J -w- ^ in ■■..am
convex).
convex. Then
[\x] > 0 , [x1} c S such that lijn (|\ (x^-x ) -t.jl =0 . Prescribe
[e.} iO . Select t. such that \\t. -t\\ < £^2 , and j = i(d) such
that ||\J(xJ - x0) -tj < £^2 . Then |l\J(xJ -x0) -t|| < ^ =» t e T(S,x0) .
' 3
VxrT(G,x0) .
minimum 3 & > 0 such that f(x ) > f(x0) Vx £N(xn,6) . Consider now
8
m mmmmmiKmm wipppw^^ip^w^mnyppwiywBW"^1 «'■"■■ I"1'! V.-H.M . II,.IIII..I.II twwmpawappwiwwwwww—o
We have
0 <f(xn) -f(x0)
V.i
u
I'lxiunplc: (i) IT x ( G (the jnterior of S ) then 1(3,0 = E .
then 7 s =
(ii) If S - {x;||x-wH = r} ( >*o) ^;$T^0"^ = 0
^ '
In particular if xeT(S,x ) then -xeT(S,xn) . Thus we must have
T
7f(x0)x = 0 Yx suchthat x (x -w) = 0 . Thus Vf(x0) = a(x -w)
for some a .
Ü
Let A be a set in E
n
iMWnfBMWWWW'Wggilga^W»»'!^^ WWWWWWBW^WWMWWMWWi
such that
T
~p ~ > ^- > V = 1,2,... . (1.6)
e
itepit
#
subsequence z . By definition z e T(S,0) , but, by (1.6),
zT y > -^
1 > 0
10
MM aaia i ...■.it-, .m -.^i..^-. -.„. -^ ..^V^V.^1.K.VI. ^.^ -,.^ aaiteiiaaaaaMt anü MMMüiitMifcflfliii
mmmm IPWW ■iiWip»|i»l!fll^.tiP>jllJliMlJ:»^»--N^.v^^iiiijitj|..|Wiji|i|Wi-iMiBiip.il||iWi.Jii.wi,,,|ig^
e^ = min(l, e^ ,
E = min e
k ^2 k-l,^k^ ' k > 1 ,
and
z -e k+l 2 z e
k-Lzll G [
k-1 e
k " ekH e
k ' k+1 E ' 11 5 11 ^+1' ek] '
o , 11 z ll = o
X
If xeC then xT y < —^
11 11 • , ^ vre have
Using (I.?)
Now assume xeN(0, e) , e < E, • Then jj x H e [ e^.-,, £k] for some k>5
ii
i^amMim
mimammatiaiiiamamiikäutik maMüMafaMfeMMÜlMIMIMi -'■■—'^Mi-iinf in-m-^:--^- -■■- A. .... .....■., ■....;.;.- ... ..^L...
,
^^^^*^^^^^^^WWIWWIW^tWWWP^»wiw Mimmn wmun-iM wn ■■■winu i... nn.uBmi mi. HIP.. ■!. -. ->^-^~ . in
e > E whence
However, II f II > -^ kfl k+2
2|| 2 ||
P(z) > ^ (1-^)
7f(0) = -y . Q
Proof: Let x be any point in S . Then min jjx - x jj < |jx1 - xJ| = r .
~-L xe S ~ ~U ~ ~
(x-x*)T(x*-x ) =0 (2.1)
12
■\ % T1 , *
(x-^ )-(x -x0) = C
Figure 2.1 u
that it remains to show that S c R+ . Let xeS then for 0 < 9 < 1 ,
a
so that
.2,1jx-x
_*II2 . ^/_. .*\T/.*
|f+29(x-x )1(x -x0) > 0
and, letting 9 -. 0 , D
whence x c R+ . Q
Ay > 0 (2.2)
15
it is true that
T
then a x > 0 such that A x = a .
Then the result of Parkas lemma is that if (2.2) =» (2.5) then acC .
(x - a) x - 0 (2.10
vTA(x - a) > 0
15.1
(1) Ax = b , x > 0
3. Multiplier relations.
g1(x) > 0 , i- ^ ,
hi(x) = 0 , i cI2
€ B
(ii) ^ > 0 , i o •
Ik
mmmmmmmmmmmmm
(iii) u > 0
constraints.
conditions.
the MPP
min f = -x1 ,
Vg-L = -7g5 = e2
while
7f = -^
Figure 5.1
15
V
L'i>w^tfl.il,Hiyiitt-rfiWajhU,i.r^-.lv,-vr, ., ■■^^.v^^tL.i^^. ^....^„j.^^,**^^.^^.* -■^.—--^ ^k^-^-..-,.-,,..'...-.—■ ....... .,._.■.. ......
m ■ • I,. ,)M "P »i||!W>"iii!»fiJji» l ,|iiilni)M .».wiwv-v ...-wi.i.11,.. i .lim m I.UXIV PDH
^mmm wm wtm
Let
H0 = {x jy^Cx^x = 0 , iel2} ,
G = ;Vg 0 :LcB
0 fe i^0^ ^ ' o^ '
for all f EF .
-Vf(x0)y > 0
tf'i^h ^ ü ,
Vg.1(x0)y > 0 , i r B0 .
{5.2.) holds. On the other hand, if (5.2) holds then 7f(x )y > 0 for V !
all y r G0 n H0 . 3
Remark: Lemma 5-1 shows the difficulty with the above example. Here
T -- right half plane , (Gn 0 H.) the %n axis. By Lemma 1.4 for
every xcT there is a function with a minimum at (1,0) and such that
-yf = x . Thus the conditions of Lemma 5.1 e e not met in this case.
16
and
O^g^) =61U0)+7gi(x0)(xn-x0)+o(llxn-x0ll) , i€B0
so that x e G- n H0 . D
x0 then by Lemma 5.1 -7f(x0) c (GQ 0 HQ) Vf eF0 . .*. , by Lemma l.k,
17
dx(0)
Thus the Kuhn Tucker restraint condition implies (G0 0 HQ) C T(S,X0) .
The result now follows from Lemma 3.2 and Theorem 3.1. G
x-,.
dx(ü)
1 - l,i!, ...,s , for i:jx(9) -x i| < e and
d9
= t .
Proof: Let P(x) = KT(K KT) "1 K where p.(K) = 7k. (x) , i = 1,2, ...,s
dx
= (I-P(x))t (5.3)
"d9"
18
independent.
dx (©)
6k>0) suchthat (i) ^(^(9)) -=g >0, ieB0,and
ie
(ü) gi(xk(e))>o, VBo for
*k(e) eN
(*0,6lP ' The
^g""1611*
I I
of Corollary 5.1 now gives w cT(S,x0) . But, by construction,
19
■'• 0
- \<V(*r) "w^0^ =
2 * 7 w
(x0)t + o(l) as xn -* x0 .
1 2
(ii) w(x ) -w(x ) <- Ijx -x-ll . Select a subsequence of the x such
-q
X - X,.
thai. C if^ 11 / - teT(S,x_) . Then (ii) =» tT7 w(x_)t < 0 which U
l-q'^O'1 J ~ ~u - -U -
c;ives a contradiction. □
20
wmm ■■■■■■■
and v in the case where these are Implied by the Kuhn Tucker conditions.
7l(x0) = 0 . (U.2)
LT_2
t^ x(x0)t > o , n c TCS^XQ) (^.5)
T 2
Remark; If Sn is Lagrange regular at x. then t 7 X(xn)t > 0 Yt
Example; Consider
g
l =
X^+(X2+1)2-1 >0 , gg l-x2-(x2-l)2 >0
7g > 0 7g 0 80
lf2 ' 2f2 >
that restraint condition B
21
^^
WMIUPPPpBUjPIIIPBimP^BBPIB " '. HUIIJI.Himi^Hlwmpiffpw.iaiilJ.il'il ■ rimiijrr.lMVW?*' '^r^-™^!-.--*^^'-*™--^,^ f~v~^™^ r:r ■.-.^r.m.i-B.raJOT
V;i < I5#, such that u. > 0 then 'im, 6 > 0 such that
0 1
Proof: Assume :>1 no n. , 5 > 0 such that (h.h) holds. Then for each
X • X
0
II"^ Z \\ - t ' T(S,xJ . Set G = r u S-Cx) . Then G>0 on S,
I'-q'^oll ~ ~U ieB0 1 ~
G X
T ? ( J
t 7 l(x0)t + lim sup =3—^ < 0 (U.6)
- ~ q .• ||xq-x0|l
A:- G(x ) > 0 , the second term is bounded and nonnegative. Therefore
ü X )
( Q r^
1 1
q -• IKq ' Soil i€B0 ~u ~
Thus
22
VZi*0) = 0
-VII^XQ)
1
... -7h (x0)T
^^ -^1^0)T •*• -7gm^0)T
ul7g1^0^ g
lW
0
J(x0) = g (
m ^)
0 0
^p^o)
(^.10)
minimum of f on S .
25
complementarity).
•u
In particular S, is Lagrange regular at x .
a = 0 . (4.11)
u
b
u 7
(ii) i Si(^o^+aigi^0^ =0
' i = 1,2, ...,ra , and
T 2
min y y Zi*0)y
= 0 ieI arld = 1
subject to 7Bi(x0)y = 0 , irB0 , ^(^Q)^ > 2 ' H2H
Clearly the constraint gradients are linearly independent as
2h
x
= y^^^o^o= min i^^i z0
Now if \ = 0 , (^.IS) shows that conditions (i) - (iii) above are
then \ > 0 . In this case Lemma U.5 shows that the minimum of the MPP
is isolated. O
functions.
whence, if \ < 1 ,
f(x+(l-\)(y-x))-f(x)
25
f(\x+ (i-My)
-*
- (i-M^(^•# + (i-My)(y-x)
******
< f(x)
**
. (5.5)
M
,i
Vg.^)(x* -x0) ^.(x*) >0 (5A)
= f(x0) . D
26
flaalHaijia^aiMg—j....;.^,...,,.-^
-■-••-■M"-j-i"-ii niiiiMi
PPIJ^»WWI|IPII(||»|IIIIIII|JMPPW«WV»!»II.II1P«',?VW1"™.™''«.".I"MIP<'P.-™,|.».I' )■ Tr.iiii.iii ii. „mmm*mi«fmtwr-l~wmwiwm.»WWIweiwiVWl-i'>. »m™-»™
Note that if z1 > z then S z c S Z ' so that u)(z-) —> (i)(z ) and that
~x — „d l *~ 2
if S has an interior then Sw nonempty for z > 0 and small enough
< Mz^+Cl-MwCzg) • Ü
27
* *
Proof; Let 0 < X < 1 , and z", z" > 0 , then ,)
Proof:
T *N
[z ) = inf (f(x) -g(x) z ) ,
T *
= cü(z) - z z
28
> vizj) - ^ z
Theorem 5.1: (Duality theorem). (i) sup 0(z ) < inf f(x) .
z*>0 xeS
holds in (i).
0(2*) = f(x0) . D
29
primal problem minimize the convex function f(x) subject to the concave
constraints gi(x) > 0 > 1 « 1,2, ...,m , and the dual problem maximize
then the dual problem has a solution and the objective function values are
u
equal.
problem
min f(x0)+7f(x0)(x-x0)
[ i
subject to
(i) 8i(x0)+7gi(x0)(x-x0) >0 , i€l1 , and
as the Kuhn Tucker conditions are both necessary and sufficient for a
50
solution to the linear programming problem. That the converse need not
has the solution (l,y) for any y . Thus additional conditions are
problem
minimize aT x subject to Ax-b >0 .
M# M ***** ^
maximize bTu T
subject to Au-a=0 , u>0 .
If the primal has a solution then so does the dual and the objective
then, as g^ concave,
Sa = [x ; ga(x) > 0} c U
51
if ir ir
We have S c S cU . Thus the hyperplane afx ) + VgL~(x ) (x - x ) =0
separates x and S .
by noting that the new constraint must be active. Thus a convex programming
T
(c) Consider the problem of minimizing c x subject to xeS and S
(0) i = 0 .
(v) Stop.
Note that step (i) requires the solution of a linear programming problem.
(d) The cutting plane algorithm generates a sequence of points x^^ with
T
as R0 2 R-i 2 • • • 2 S . Thus the sequence {c x. ] is increasing and
52
Let
c + 7g (
ß^ ß ^)(~ "^ ^0
But
35
Notes
Lemma 1.1+ is due to Gould and Tolle. The proof is due to Hashed
et al.
3. The main result is due to Gould and Tolle. The treatment of the
References
Applicable Analysis.
3U
Theory. Wiley.
0. L. Mangasarian (1969): Nonlinear Programing. McGraw-Hill.
Peter Whittle (1971): Optimization under Constraints. Wiley.
55
mm mmmnmmmimmm iy IJ.IMUPI.B^^WBW^W^^^P—»I .ILL» »JH..W ."»I''»-u , i MIIUIÜJ»!)^
56
——— ■■■—i
Gi(\) = FO^+M^)
sought. At x. , we have
7F(x.)
Figure 1.1
57
We have
0 = = 7F
^i+l^l ^i)ii + Xi ii ^^i^i
u
where x^ = x.+ \.t. is an appropriate mean vulue. Thus
~i ~i i~i
-7F(x.)t. 6|l^(x)ll
(1.2)
LT 2
t^^FCxJt 2„,-
' 117^(^)11
«^ X «MX K*X M«X
and assvune that F bounded below and t 7''F(x)t < K|| t jj , xeR . J
Define
»11^(^)11
X.^T = x. + and
~i+l ~i *i>
=1
^^i^i ^ ^W^ll ' IfeiH '
for i = 1,2,... where 6 >0 . Then {F(x.)} converges, and the limit
58
mmmmmtämm* , , ■■—!■! ■,
■- ■ — - | - --^-■■->^":-."1
IHIIKIIi.MilMiil.wiigilil i^wymi'wJWWPipppiiiBWiwiff.in.-Ji II. wmmmmmmm pppwwgip I,»I|.I,II»I ,i. [imn.*vw^mtwm*"wwm,wmr.**wws^whimwvr*r^,'v'mfn'm!''«!t
2
SllvF(xi)|| /6ll7F(xJA
1 T72 F _ v
F(xi+1) = n^) + —Y^- ^(x.)^ +1^—r -j H ^ii
where x. is a mean value. We have
(»IIVF^H)2 J 51^(^)11
^^l+l) ^ F( C )
ii K K )"■
Thus the sequence {F(x.)} is decreasing and bounded below and therefore
-♦ 0 , i -* • .
59
Lix-^K) = F^-F^i+^i) ,
L^
(i) = if 0
h+1 ~i ^i'**) = •
else choose \. = 1 .
i
(iii) x.^^x^X.^ .
U
fx,} c R . We have a
show that the limit points of [x.} are stationary values of F consider
hO
|bF(x )!| > e for i > i0 . This implies that inf \ = ^0 > 0 as
" -P-i " 0 i ^i
otherwise sup \|f(x ,\ ) = 1 contradicting ^(x.,\.) <l-a . Thus
i -M-i M-i -.i i -
FCx, )-F(X )
-^
\\vfU )|1 < X0a(S
(1.6)
-M-
Remark; There are two aspects of this theorem which are of particular
Theorem 1.3; (i) Let the vector sequence in the Goldstein algorithm
be defined by
\ . (A.)
where A. is positive definite, bounded, and X(A.) = r rrpr > OJ >0 ,
mcuc x
hi
Proof: (i)
7F(xi)Ai7F(xi)T
=
^^i^i
Xmln(Ai)ll7F^i)ll
u,
(ii) ^(x^M =
so that
E
1
i !i
\^*±'V - t " 2^ I ^ if^ a)ll7F(x~)||
llEjl llAjl
- 2 s. U)
(1.9)
U2
. .^ _ . . ,„■.„...^■■■..;-^...^J..V:^..,i ,.^,..:^^-.,.^.th^;^^,^.^^w^^i^^.K^.ft...^^^a:^^^rt^^
PPP^^ II ■■Wmiim^lWW. HI»!!!.!! I,,^., nilJI . ^F^^,
'»l"i..!i.^.»i.—M"W»^'.Hil"».i.H»H|l»|J||iii|i|»Wt^.ji
In particular
\\\\ llAjl
l^llfill) - -* 0 , i -
2 ' - 2 U)
s. =
VF(-i)T =" Ai(7F^i) -VF^*))T
= -A.(72F(x)(x .x*) + o(||x -x*|l))
X «* X **X A# f^X ^M
Thus
X.^T-x = x. +7.s. -x
~i+l ^ -i i~i ~
convergence. D
2
Remark; Theorem 1.5 shows that if y F is positive definite in the
U5
gganMiiaMiiüi ..iJ.^—.i.;..;,,...^.
PIPIPIfim RpppUfpi H'l"1 i WJIinmi II miliuWflü^mmmmwnKIV»"!!"!1.1! W^KVlt T«.'^rpTr7»iJi«»niWf|wIwww^i-Trii^^ ''■,">"'
--
which work well in this particular case should work well in general.
Let F be given by
0 = =
^^i+l^i *i(C^i+^i)+y
whence
hh
F(
ii+hit) ■ F(Ji) - N *T !i+\ iic ii+1 *•! -ic -i
^ijlVl^lfl0*!
! (x^C2x)2
^i+i)-^i) = -i -7^37-
C
~i -i
T
^2
1 (üi gij
f
2
/iC!i
where w. = Cx
(
F(xi+1)
,
j- —
1
,-»
*I ll'2 F(x.)
2
wTc w, ^C'-'-w,
(J w)' Ug g
l n
T 1
w C" w w C w T
(a1+an)£
respectively, whence
^5
itrtWmMiiirilM^ irmniitriiBMti-JiiWiin)^^^
*""" I UPPmp^Pmipiy l Wm\\V iHiPi»|Pii.|||^^j^ii||Wil»»l'Wi|,¥ITI"''J'! »»''h"'.T'ITrt,',l''«l'!?'l'|l'^»nri.i i| 'm.lim»<MHMI ■'" '■' |J''i'-l''''W»,«n.'in|.r>f^"gWtr^-CT^,r.wi^^
F(xi+1) <
Vli H*±)
a +(T
n l
linear.
which
x. = oc v.. + or v
„1 1 „1 n „JI
9
where v, and v are the normalized eigenvectors associated with o.
+ i+1 i
x.^,
~i+l -- oi1 \~1 + an ~nv = (1
v - l.a,
^i I7)aj-1 ~1
v. + (l
v - x
ft4a )ci! v
i n' n ~n
so that
v
^ 1' 1 n7 n
and
0
a'i+l
n
In particular
a,i+1 or a:1-1
n
a'1+1 CC a'1-1
he
J. /^i2'
so that the ratios oct/a assume Just two values for all 1 (depending
and
so that
a:
1
aX ai+1
"n n < 1 , and 7 is independent
where 7 = min ( +1
1+
a:
1+
4i+1
or a'
n n
of i . This inequality shows that the rate of convergence of steepest
descent is linear.
t^ct2 = o . (2.5)
hi
I
ma - ■
W^W^WIIW^WW^<^^^^^^^ff^Wi»»..i»li|».WJ 1 I III ■! »um ill ■■■■..■ i -"" "—
T
Proof: The descent condition gives g. o =0 so it is necessary
=
§i!s ^i+?T!s
= 0
U8
■ -^ .. . -.
-- -■ - -
Proof; From Lenma 2.1 we have T t
gV, =0 , i = 1,2, ...,n .
Thus e^1 is orthogonal to n linearly independent directions and
therefore vanishes identically. G
t± = - H^ (2.8)
U9
-- - ■ -— - ~- ■ -
PfWWWB'.'i'n )> ri.i.-i,^,'"i."^»M>M"u^i''»'j."^v;'-M,'Hr*'»^^^
*icis =
SiHic~s = o
' 8<i
'
by xs) u
6 i {2
«i^s = Ps^s ' < ' ^
where
i =^i+i^i ' li =
!i+i-!i * (2 10)
-
Consider the symmetric updating formula
0 = 1
*\{l±E±li)'tx((S.l±) ' (2,12)
and
= l KJ
Pi i^^-^iHi^ * (2 15
- )
50
T
d y
i i
\ = - T c<l
b
T
H
yi i5:i 5:1 Hi 5:1
and
p y H y
i i i i
ip W ill '
d d H y y H
i i i i i i
H
i+1-Hi Pi.T
!iyi y H y
i i i
+ + H y y H d y H H y d
k{ i^!iS
y
i7^-
y n y
i i i i- i i i- i i i
„i i i i i
where
(2.15)
:i -n Ti "i a
and
y H y
i i i • (2.16)
\ T
=
Example; The particular case p. = 1 , C 0 * i = 1,2, ... gives
the variable metric or DFP formula which is the most frequently used
51
this note that (2.9) can be written (introducing the symmetric square
1/2
root C ' of the positive definite matrix C ).
= Cl
Kls sL ' s = 1,2,...,1-1 . (2.9a)
t u
Defining the matrix f by <.(T) = —=—~s ■, nr , 1 = 1,2, ...,n , and the
1
(tTCt )1/2
A AT
H^T = TPT ,
whence
1 2
H^, ^C-^TPTV / . (2.17)
n+l
In particular, if P •= pi ,
not only for conjugate directions but also for good estimates of the
However, from (2.17), the choice of p variable may well have, scaling
52
--■■-„■.... , ■-^nüinift^ii-umfii
MMfcMMhiBttaMli MMmMMMMBMyai I ■iifi
..■-■_. i^^ii-^.^^^^ i^iwi^ - _^,
p*r',Tr«'*''f'wnww*'»w'fWf^
|^l^|WWyqpilWBgnw^yifTWrwfiWW*ffTBWF"fW
H ornu11
i+3ii+il^i -
p is constant.
and d = xt )
T T
dd1 HyyH
D(p,H)g* = (H+ p =5 f^-) f
dy y-'Hy
Hy/HCy+g)
= Hy + Hg -
"T
y Hy
i yd
i (d - -f^- Hy) .
K
y Hy ~
Whence
H g (7 + C-rvg )v . D (2.19)
* *
Remark; (i) The condition that H g =0 when v ^ 0 gives a
T* 1 *T „ 1*T*
vg =-rg Hy=--g Hg
1 (2.20)
C =
Xg Hg
53
.■„^■■^.,^j-..l,rfinj,.w,,-:,^......,....,.■„..-■^■■. ...■ ■- . .
MiaaüBcataitijaatiaMartBBiaiaMMflMaaMliBB ■-•••,: - ■ 1 •;.
^W.^M^^H'IIF ^^'w^wtimmmmmimmmmmm'QmimmmBm 1 i 11
1 . ^; ffw^pBBffmniwgpi Try" iF<rrjT.Twm TCy Wi'JiyTI TfunrnffBrnw^ f TOI- ■»7-T7f»?r j iTJT'^-^vT-^rT.f«- .^^^j—",»,^^-^.^^^^ ^
*
Provided this value of C is excluded from consideration then x is
in its derivation.
provided
y d
(2.21)
c > (dTH+d)(yTHy)-(dTy)2
where
5I+
1
u = r^d-^Hy^l . (2.^)
T T
\/(d y)(y Hy) ^ VT J
and
T T
so that, by the assunrptions, I + uy is nonsingular. New y v = 0
H+Cf wT = H(I+C1^ vv
) *
* + T
The null spaces of H and H will agree provided I+^TH VV is
0 = det(l+ C;TH+vvT)
T +
= 1+ CTV H v
T T
dy y1Hy
1+ C T V.T1 TI T„+
H+ v = 1+ C T (d1 H d - 2 ^p + ^-f- )
(d y)
= 1 + ^T (dTH+d
T
y H y
y d
C = 3T„+ ,x/ T 1^x2
(d^H^^C/H^-Cy^d)
55
"•■"■'-"
i ii ■ —— - —M—MM
"" ' ''" "J- ' - 'J *mmmm | iwiii m*rmmimmmmmmm»mmmmmmmmmmm~~--
descent condition be satisfied and remains valid even though the exact
Proof; This is a consequence of (2.22) and the above remark which shows
i)
that if H is positive definite, and if the descent condition is
condition is satisfied in each descent step» then the sequence of points 1.;
56
rn m rn '"PT ^TPT1
ddT D y y1 D + a(y d) (D y d1 + d y1 D) + ^(d'-y) d d1
= D + p ^- + add
dy y Dy + a(y1d)£:
T
* a(yT D y) y d yd
D + (d - -~^ Dy)(d - -=—- Dy)J (2.26)
y Dy + aCy^)^ ~ yDy~~ yDy"
demonstrated by noting that at the i-th stage we find the minimum in the
sub space is aleo spanned by H-g-, ...^H-g. . Thus x.+1 depends only on
57
,x
■ JMMJMlliiiiliilifiMiiMfciiMttiihaiaiaaaaaaa
PPPP^WW'^'-'^MiMiitJI^IipiJJJluipii.i.^ wmimvwmH'vwm'i'n^^W'*r^'>'m-.f™rw'"™^"wm*'
x , ...,x and not on the particular updating formula for the inverse
u
(g^Dy)2
g D g = g Dg £-
V Dy
/ *T„ *N2
(g Dg ) u
gDg +gDg
(g Dg )(giDg)
L)
g Dg +g Dg
Thus
O
(2.2?)
«T # # -»(T^ ♦ T^
gDg gDg gDg
nearly orthogonal to the gradient. This must also happen if, for any
58
mmmm MWMifilitM^fihtiirii-'iiiiiiiiMiiaiiiBia^^
^mtm i iilMHIJHIJl.miviiiiii^fiunjiW.MWJ lJ.|.»»t^l'r^»l»»l>TVIIIi»JVW^riHivJ>|»i!FiBa»»lTOWmCT!JrJUü|UlilU|J»lul,i||i|~Mf'ii|.Mt'iMi;i^ wiror-
where
,-11 - H--
D^H)-" „-1.,_!_.
1 x yy
ü
* i^r+V.)^ 1 /..^„-1^-l^JS
1 (yd H^+H'-'-dy1) , (2.29)
p
dxy d'y
T -1
diH ■'•d
V- = " " ' (2.30)
T
dy
w = y - — H^d , (2.31)
and 7 is related to g by
7 = - XllL (2.32)
TX -1
1+CTV H v
(2.33)
59
iaiMiiiiiiiiilHiiit ''M'M'M«»^*1"*'**'*'"^^ 1 ...„.A ■■'*-'■"*---''-—■'-
ipillSPiPPIIII^^
Now
« -^w , (2.5^)
i 1 1 yyT H^dd1^1
p yd d H d
-(I+zdV^I+dz1) (2.35) Ü
where
z = (2.36)
We have
as dTw = 0 .
60
- -»-'-^-'--i--..-^^vV-.^^...^.^>!-^-^.--^^-^-'- ...^^.U^J^M«!
' ''I 'I Wi i'.iiiwi"» W!MWglM»»ff,JiJIIIII,,ll||J|lH|l|4luitl|YuWiiiiiWu.u,ll|U,iilull|l>»J,ilui|^^ «■;'."-"»
^(p^H"1) = (l+2dT)H"1(l+dzT)
GCPJH
-1
)"
1
= D(p,H) + ~ vv7 .
T T
dd1 Hyy^H yyT
1
D(p,H) " > p m ■ ijf H" * (7+^)^
p
s- (yd'H^+H^dy1)
dTy T
d y -"•
******** \
, .y/ H^dd1^1 ^ n tn m
GC^H" )
1
H+(p+T) «a i- (dy^ + Hyd1)
dy dV ','~ "" j
61
■MM mm - -»" ■■ - ■ •
JIJJ,.JJJililW|W|liliJlii.WJflW.iu.,IIIIMiilpp.yii>(.iiia,ii,t. i ,. , ,,.i .■Wn,p^.l.WyMpM,t »„ mmimmmm — ._
Proof; We have
det[A+aaaT-\I} = 1T(\-M
1
det{l + o(A-\l)"1(TTa)(TTa)T}
"- i=l - -
= ft (VM
1
(1+ <y(TTa)T(A - XI)"Va))
i=l
n 2
i
1=1 &. V^ ii-
p for
^ - ^ ^ ••* - ^ ' 1'2'**-»n * f11) !•«* Kj. - C^2HiC1^2 ,
then fä > }Sf) > ••• > p » while if ffl < P then
62
^M . i *i m 1^.———
■ IIIJII m I ■;.. ■ ^WL^jiw^i« mv
I . MI
•'•■"• p—rn ^ i i j-L IIJ^JJ^I . i , i-.iij umigii
Proof; Noting that C^ d = C~^'y = a,we can write the formula for
updating K as
T T
aax Kaa K
K = K+p T T
a a a Ka
T
J = K - and
axKa
aa
J+p
a a
eigenvector. By Lenma 2.6 we have \(J) ■ 0 , and X., < X (j) < \
of J unchanged. Assume that X.(j) < p < ^S+TCJ) then reordering the
63
■-•■■--•
fffyP^^g-l^f ^^WI'^WM^^Jilll^Pg^Pg^^W*1 ^-T-.'?^'>'■''.W, ■ F^WSg»!!- i*..mAjg^psj^BBPgg|ffW^!WWPW»WWrB?Bf!' -t^^^^^-v^^rWi^H^'^V^^^^t^^^^^^^ ^r;r^^Trs7P'^^^?/^y^?JiL^.M^.''^^^yp*''.'!^.!i.ij'i'fft>iP'^
D(p>H) + -J- vvT, and H0 = eH+ (1-9)H* = D(p,H) + -^r vJ1 . Let
yd yd
*
K = C1/2HC1'2 , and define K" , K , IL similarly. Let the eigenvalues
* *
.9 respectively,
of K , K , K , IC be \. , X. , *... , and X..
9
J=l,2, ..,,n. Let 0<0<1. If X. >p then \ > ^1 > ^^ > ^^ > P
6 ,9
while if X. < p then >**<>•* <K <>**< P • If ©^ [0,1] then \.
need not lie in the interval defined by \ and p .
1+e /E 0
C = H = I , p =1
/E e 1
aT Ka V/e
,1/2
) i v = a —1 v
Ka =
K = C T =
aT a 0
61+
Remark; This res-ilt shews that fi' gives the best improvement in the
eigenvalues < p , while H has a similar property for those > p .
This suggests an algorithm in which a choice is made between updating
H to Ö or H depending on some appropriate criterion. Fletcher
suggests that if T > 1 (that is, yTHy > ^C^y ) then H
MM M t»0
should
be used, while if T < 1 then ft is chosen. He has used this criterion
in an implementation of Goldstein's algorithm, and has reported satisfactory
results.
65
MMMMi ■ ■
Wm^m, UliJ|NIJ)l.iyiimiWjpypjiWBJjJBj,,im w _
pi WjWWUBWWHW ■ "". "I;;".-, IIIIUI Dip,!
Notes
his paper 'On steepest descent« in SIAM Control, I965. Theorem 1.5
is abstracted from Goldstein and Price, 'An Effective Algorithm V'
2. For background material see Kcwalik and Osborne. The form of the
JOTA, 1970. The form (2.1U) and the result of Lemma 2.2 are
with Theorems 2.2 and 2.5 are included in his paper already cited.
References
66
-"' ■ ■ i 11 - '
WPWP|IPWPBiip^^wwimwi|8wiMi.iii flUiiNjw^jiJWWiiiwiwiim^ww^^
pp. IU7-151.
A- A. Goldstein and J. Price (1967): An effective algorithm for
minimization. Num. Math., 10, pp. I8U-I89.
67
estimate.
only additional function values. Also we work with the Choleski decompo-
matrix does not differ greatly in cost to multiplication. We felt this -'
This requires the evaluation of F(x + s.) , and this, together with the
_7 ~—~ '
-' NPL Mathematics Division, Report 97, 1970.
68
as
= F'(xi)(t(xi,si,l)-l) • (A.2)
(A.3)
gives
^(^fiA) = 2
i 2i \/
+ 1 l^v^i)
1
\? =:—
(A.10
This analysis provides the basis for our method which is given below.
Algorithm
(i) Calculate \\B^\ , set w = min(l,||si||) , \ = 1 .
69
v
MafcMjj.|-a,M|aj^iJia^yHB||.i ^^ ,■,, ■...^^.., .,...^.... .,,J..,.
wmmmm ■■■■.nijiiiji..,.,!„„.„ mmmmmgmmagmm
go to (ii), end.
\ = 2\ , end.
+
else \ = .5(^ PM •
go to (ii).
Remark
(i) Numerical experience has shown that the value of X predicted in Li
behavior.
such that
*T * *
R R = H (A.6)
70
■MMMlkMIHMa ■UMMflU
—"- ->""'*Mi«i'mrr -»rlHT-irHriHi I ^it^^^^^^gj/jjgj^
^fffßwm*&wwwwwww*i*vi\*v***w:;'»ri ww^ mwmfm*ww rsvpy^TT^-^r
.^«TT
(i) H = (l + uy^rRd+yu^ , or
ftT
rR = RXR+ST w" (A.7)
that
r R 1 r.1
R
(A.9)
Q
1 0
/Fv
Let W(i,j,{p,q}) be the plane rotation such that W(i, J, {p,q])A
hy
(A.10)
Q ="n'w(i,n+l,[nfl,i})
i=n
71
two stages. Our method uses ideas due independently to Stoer, Golub,
Q^ = ||a||en . (A.ll)
If we set
^ =J|Tw(i,n,{i,*}) (A.12)
i=l
Thus R is given by
.i
R* =Q2(Q1T+lHlenbT) (A.15)
where
Q2 = TT W(i,n,{n,i}) . (A.llO
i=n-l
be carried out very cheaply. Depending on the update formula used, the
72
ffPffWPffff'.^'." ■ ■■li|i!fini^iww.i.m»yiiwi,fvi^""-^" BPPIPP W.riT»"1? [Wr'm^mwiVWp*'* "i fT»irmr» W"1* 'WfWW^W-^T^^^miVTW^v/fWim^
for S .
2. Numerical Results
In this section we report the results of numerical experiments
carried out to test some of our techniques. We consider four line search
strategies:
search Interval,
(iv) the method for satisfVlng the Goldstein condition given in the
previous section.
H* = G has been implemented, and the results obtained for each are
given.
(x1-l)(xj-l)
-■iki i+J-1
xi = - Vi * i = 1,2,...,5 •
75
F = 100(x2-x^)2+(l-x1)2+90(xj+-x^)2
+ (l-x5)2+10.1((l-x2)2+(l.xu)2)
+ 19.8(1-x2)(l-xu) ,
Li
and the starting point is
U
(iv) Singular: Powell's singular function is designed to test the
xT = { 5 , - 1, 0 , 1] . o1
„R = {X
r 2^ 2-ll/2
1+X2} ' , :).
0) • ■■
, (
if xn < 0 then yr- arctan
J. cTT
71.
:>)
äa^UbUtaÜkim^M^^i..^^ ^^..-.i^., ^-1. .^^^.^■||| .„^ .. ..,, ■■■■^■'-■-'■aiiiiimnhBifc» ■■ —üifMB 1 tarn ' r,
P!^P^WW!^^WWP^^^gWWfWP^''l^|i^ll|.,w^^mw'^
and set
2 2
F = 100((x -1JOT) + (R-1) ) + X?
xT = [-1,0,0} .
the test for terminating the calculations was based on the size of ||s |1
-8
( \\s. jj < EPS/n with EPS = 10' ). This proved reasonably satisfactory
for all cases except the singular function — in fact in all other cases
the ultimate convergence was clearly superlinear, and the results were
However, in all cases the answers were correct to at least six decimal
cases both H and G foiroulae produce very similar results until the
tained and at the 20-th iteration (in the case of the line search
75
76
-
'J ill,HPWWfflfip»|Pii ■ i. i i. ^i ^mjm.i !■ . , W^fj p J I IM. 1P>.—. J--_»"H^HHW..-»^h..'.-.-
V fv K 1
g
o
•H
■P
\ H
l
\ «5
\
\3
\ -*
|\« \ CO
V
1 o
\
Ä
\
s\ A Vo\
1 \M
k« k
0)
\1 \ J
cJ
^
•y g V
^ » V \ \
i
\ ! ^
r
s
\
^ \r^ \p \ I
C
\ \ K*
«A N
•H
« \ ^
\ 00
CJ
V V i\3
o
A A §\ §\
%
3Ü
> V V \*
x_^ \ \ \ J
-
\ «o
as V
\ \ s V
\ Ü3
s\ Ö\
^\ S^
*
\ \ \ \* \
\ o> \ "^ \m
o \ -*
\cvi
\ K^ \H
Vy
\ Q
§
v" v v 1
H
II
vo\ -* \
CM \
CVJ \
t- \
ON \
\
s\ t(\ \
\
P
O
M ^ k V \
^
K
V
\ "A \ "^
V
\J O
0)
1 0)
10
1 fl>
vt> \
\
-* \
w \
\i
J-\
^~ \\
A w \
ir\ \
•^ \
\
\l
\
.
5
' H
-P
-P
s.
\
\ o>
\\
\ -*
\ t-
1
\ -*
\\ H
t- \ 00
\ t>-
\*
\ K^
\ t^
\H
V
\ w
\ Q
1 I
(0
y
•H O \ \
•^
O
3 >
«
Pi
\
VO \
\
^\
\
\
l/N \
A R\
\J
\
'*
\
ir\ \
\
\l
i §
•H
I
(U
V*
\^ \ 0
\5 1
^ 1
\H
X \ \ V o
*,
^ VO \ i^\
A R\
\ '
u
04 ! CO 1
s
*^ (
ati
tV 05
1
H j •s
S
m
i i
3 '
-8 j g)
a
*n I v
En
•* O 1
X j
! 1 * 1 m x
77
wm^mmmmmmmm'mmm i^qnpipRpipiininpii. i wm WPPWP^PPWBP
78
IX
. ■-- ---
iVURllllllllMU ^BiPffPWBWiBP^ i^u'^ff MyWP^^ ''^^^^.^t1 ^/MW'-rw^i^Tf^"i^^^i!ii^w^^jj}pTT"Tw
min f(x)
(iii) *<•*-
og. < 0 if g.
v
' c>
i < o.
K
i whtre the p.
K , iel. , are fixed positive constants.
i' 1 ' ^
m
e Example; (i) 0 = J^ l/ßi(x) (inverse barrier function).
i=l
m
(ii) 0 = £ (log(l+g (x)) -log g (x))
i=l
Remark: In the second example the term with argument 1+ g.(x) merely
79
M^MMU^M
M^l^m^p^^^^^^^^^^^w.ii» ■iil,.. i.-..-,..,■ n.miwmfwtfp ipu-i^nimninijuminiün.i, I.I.IJ»-HJ .,.. »..■■IU,^,,.
where r > 0 .
•J
Proof; Let r.X < r.J then
gives
(rj-r^g^)) ^(rj-r^CgUj))
8o
:>
i
—~*—-" . . _
|WI»I...»J jm j. m.m.jupMiu, ,i.u muni11,,,, „. i , u,.,,, ,„,,. u. „j.,,,,.,,,.
KtixJ . D
strict.
whence
lim T(x,,r,) > f (1.2)
i -• -1 1
Now let e > 0 be given. Choose XCSQ such that f(x) -f < e/2 (this
whence
llJt» T(x ,r ) < f . a (1.3)
i-« ~
Corollary 1.1; The limit points of [xj are local minima of the TCP.
81
I^I^^^^^^WWWWPWIHWWilWPPB^^ ,
!! WWWBWPIBHIIIWI
The previous results are readily extended to this case and are
summarized in the following theorem.
(iii) lim Q^O = f* » and the limit points of [x^] are local
k
minima of the TCP.
82
-—
2. Multiplier relations (first order analysis)
m
- k
= vti^) - f uj Vg^) » 0 (2.1)
u r Nofce tbat u 0 r 0 k
where i = " k ^- (8^)) • i " ( ^ ^ ' -• '
cases. We assume that B. - {1,2, ...,t} , that the rank of the system
7f (
!hc)T = C (
i !5k)!tik) + ^b)*^ + 0(rk) • (2 2)
-
The linear dependence of the set of vectors 7g1(x ) , ICBQ , gives
85
C2(x ) = C^x )R (2.5)
üik) + Bi2k) =
tcÄ)\(ifk>5'^i^)TWi)T+(c2(St)-ci(5t>R5u^) + o(rk)}
(2.5)
(k)
As uA hounded we conclude from (2.5)
(k)
(i) u^ hounded, and
we have
or
u
7f(x*) - E i ^(x*) . (2.6)
- iCB0
«
Corollary 2.1; If the Kuhn-Tucker conditions do not hold at x ,
k k
then fu.} , {vu) are unbounded.
8^
mm" pppiipmBpi^n^.wc^iTD^^iw.ii^i, ^-»jpy,-"^ »'.".^ «».;. ».i|;..i..w?w UPPPWBH >m« jmawmngawiwiw
Remark; By MP Lemma 5.2, this implies that {u^ ^} is bounded for the
convex programming problem provided S has an interior.
t
i=l
»i VgjC^d • Vf^d+O^) (2.7)
f uk ^ *f^+0^ (2.8)
85
v
-
mm
Bi^^^^^W'^^P|BW^W^^g»W"-H"l'.»l'"U,|g|',-|i'11 miiJi^njlH, HI ipppp i,.iw^.i-A..,;i»..^.,Wr.?^..i;uwiWi....1.L..ii.iii..,,,iii. uijui..... .,>.!. . ,»..,....1.1....,,..,,-
1 og^
r
(11) k 2 W -, + BB
* k -• , leB0 . (But see Example U(ii) p. 100
6
i for qualification.)
2
r +c D c T r
^^ k) -^5cük) Ä> k i(;k) %i€lTt-BR kM^i^V^)
dg 1 0 1
where
86
mmmmmm mmimm |lt.MP:SIJ...JIIM^»»!fm»M^tlj.i,m.i.liw.«WWiW.JU.P»igt^^^
Left
P (5.2)
k = ^^^l^)
as
Remark; The problan of generalizing this result to the case where the
V = ; 76 (
k ^ i ^- = 0 , Vi c B0} c Uk - W ; v - (I - Pk)u , u€En} .
We have
87
PÜP! ffgTffPg^^jBB^figjWg^^g^.^wm^^iitM^Tf^
V. = [v ; e^ v = ej v = 0} = lim V. c V* = (v j e^ v = 0} . The
v € lim V. .
W = U+7VVr ,
M= (uj||u|| = l^eN1} ,
Tl = min t Uu , p = min(0,TO
t€N , ueM ~ ~
Thus
88
.) i
' — -. -—.^ - - - t
^HIPIPBilPimPi !>
ffJ'"'""■'»■ »■ '■lP^'lJ,W^<!ViJi|^Wiilaiil^.wrPliiM^Jii»piw,t«l|yi^WJHi|^ H..W..JIHI ■i.-iaiiygwn»
Proof; We have
Proof; We have
89
FU^^pgg^ppi^^p!»*» m «ism. ««...j. . | .LU-.L. .-. .u-..,^— ^ TT , ....«^p-, r- > ■"•^ -
[U + VD/JX = vy (5.5)
x ~ u"1V(v:ru-1V)"1D'1y (5.7)
as rain |D.. | - • .
1X
i
x = U"1V(I + M)":1>I(VTU"1V)"1VXU'12
+ u"1(i-v(v;ru*1v)'1vTu":L)z . (5.8)
90
■MtaMBMMkM
|^l^(W!W!IIlp>iB!!P^W«plB»l»|IBWr^WW^ .. i J..1JJ1111111., j.mpi
*II2
+ 0(n1aX{llxk-xr,r k||xk-x||}) (lf.l)
Proof; The result follows by taking the scalar product of (2.2) with
(ii) 3 1^ < • and p, > 0 such that k | > ^|v, | for k > It. .
u
This implies that for at least one 1 , i S*^) = ^(ll3^ "x II) •
(1) u -• 0 , k -♦ «B , or
91
t^ct» ..^„M..^:..^^^,,^,.....^^.;,^,.,.^,^^.^^^.
ppppp^^ppawiBppiwiiiiiiMpiiw,!»^ ' ■" pi».ii»n.n».ip»iiiiwii»'j^j".i*"ii»mi!""^ «jgwan i. H.iiuuiwimi»!». mmiiP«iu*m\mmMmmm<.t imm 'Mimmmmmmiimmmmmmm^mmKl
(i) of Section 3. This follows directly from Wolfe's fonn of the duality
^VV
~ ~ =^ t ui ^^
~ - i=l ~ $
x(x u#) = f(x )
~^ ~ * ^ ^h) (
^5)
U
i ^k/^^
whence
e v(x.)
i ~"V =^/r~^uf
"*k' i •
This gives
H^-xl =0(rV2) .
u. = r, / &g.(x. )
i k' i ^k'
which gives
92
= r
h^ k/ui
and
115,-xl =0(rk)
m .
=0
^-^WbJ
1=1
'
and
u =
i/|^^ -rk ' i=l>2,...,m . (^)
differential equations
— "
dx
0
dr
H(x,u) (^.5)
du I |
e
_ dr _ u. " -
93
p
7 X(x,u) 7ß.(x)T... 7^(x)-
yt
m ™—2 7g
72 i^)
ög
H(x,u) = ik.S)
_m o_^
Vg
m^)
C Kf*
%
D = (^•7)
w
m
where
■■ ■ Wi (4.8)
then
T
7 x(x,u) - tfe-^x) " ^(^
u yg
i i^^
DH = (M)
L V^m^)
9U
In this case
- -
dx -
0
dr
DH (^.10)
du
w
dr
u"-
whence
x, - x
~k
(U.n)
'[:]
= "J (DH) dr
u -u 0
We have
=
V^ "2 ' W/ög? = 2g
i
In particular, it follows that DH(x.) is nonsingular for k large enough,
i
95
i
ox
0
dr
1 (U.32)
(DK)"
=
du 2r^
•
dr •
•
■
1/2
whence, changing to r ' as independent variable,
dx
0
= (DH)-1 (U.15)
u
du
Thus we have
0
x(r) -x
= rVVCx*)]"1 + oCr^2) . (U.ll^)
u(r) -u
NG*
= 1
" dx "
" 0
dr
= J"1 (^•15)
du
e
_ dr .
96
■ ■'- - - -■ ■ -"-- ■ -
""S^ '!'VM»I'?"Ww"">«w |p """- L
'■■' '■ I »."".'.»«■'Jim - lij.^!.ilwlnilJt»,"-J-'.iiJH,"-.ipi».iff--Ji.'Jm-)m'».'i m.mp.ii i .1 .ij.n MIM g;»;ppi||»^BBiwpiiggiWj»wiwllW|iwipp^
x,ueCP" ). Thus
x(r) -x
respect to r . We obtain
definite for r
small enough, and Corollary 3.5 can be used to give
du
the solution to (^.17). Note that (^.l?) results if ■— is eliminated
fron {h.2D). i i
Proof; The argument is essentially the same for both barrier functions,
but is simplest for the log function. Thus only this case is considered
here.
so
For the log penalty function u = r-J&AxJ) ^at i
97
f(xk)-f(/)= E i^ry^x,)*
ü v y
od^-/«)
icB0 i -k
= tr^odj^-xjl) . (U.18)
If strict ccmplementarity does not hold then for at least one ieB»
r
k
11m —T—v = 0 .
k-»09 g ?5?
g (
^ i 5c' ''si(i "iht'Ü'*0'"5i'!5 I" iS 0
0lat-i ID at most, It
follows that r, = o(|jx -x ||) and hence, from (^.18), that
f(x ) - f(x ) = o(j|xk-x !|) . Thus the SO condition does not hold.
for some K > 0 . This shows that r = S0(||x(r) -x ||) so that, by (U.lB);
not hold then the strict order condition cannot. The condition that
J(x ) be nonsingular is required only for the second part of the theorem.
98
Figure ^.1
= 0 = l+r
99
HM^MMtftMMMMMHMUMI ajaaiiaij»J»..^.a-v.^ji.-,,.^.^-.,>J.....:..-,,
i mi. riiuM». HIP,.PWII».-P.Umm-"
i
"- ! mm> WWPWWPPWWW [■wiiiii>iiiu,i.iiiiij.s«mpii ii IILHIIIHIV in ma ii ii ' ■ ■"
x1 = r^-r+OCr5/2) ,
x2 = r1/2+r+0(r5/2) .
-2x1 1
= 0 = l-r< 2 +
x,
dx1
x2 - x;L 1
x = r -2r2+0(r5) ,
7
2 '
x = r + r +0(r^) .
(ii) minimize x.
0
subject to x? - x1 > 0 , ^ > •
are active at zero, strict complementarity does not hold. Note that
100
^-yca- *' iM.'.j..i wi-ugy 11 tuyi irrr-rtr* ^aw»ww*wpyw?Bwrww ■pnyrygffwiicff^^g^p'w vv1'-' >■ 9w*&*%mgmmiQm
mmtm^mmimmmmiaimmmmmm I^W»»MWi>B»«»?^w»>WWg,».,^r-'IWW'l!M^
1
1^-4) 4;
ar = 0 = l+r
whence
x2 = 3r/2 , u2 = (Ur)1/5 , r ^| = U .
0 =-. -r
1
^-
ar 0 == 1- JlX2-Xl
whence
Xl = (r/2)V* , u1 = x , r4 =i
äg
= 3r/2 , u . (ar)1/2 , r ^f = a
a
4
101
l^HaMMaM MaWiiwMuttitaAi
|
w^m^^mmm^^K^mmmfwm^m^mn^timKf^ VWI ""'^M HBM in L u. HHIWL I I im^.i
T = -x1 X
^(l-X^-Xg 2j
3(1-x1)2 "1
= 0 = -l+r<
5x7
ÖT / 1 ll
^" ^^d-x/.x/'x^y
= 0
whence
X2 , 2V2 ^A r5A .
STi
D 0
= 0 =
1 J'Hl'xl)2 \
"1'r\(i-x1)5.x2/
102
LU, I 1..1111111I-1 I>J j ■■»II> up i^ i,^. .m.Kini i. uu,.n .j. ..n ii i ii i... _., ,
""U1-^-^*"2)
vnence
x1 = 1 - 6r ,
x2 = lOSr5 .
The above examples confirm the predicticns of our analysis, and for
more rapidly (i.e., for earlier members of the sequence) with the log
barrier function.
gives
-i > r
k-0
whence
^(^-C^^O as r^ -* 0
function.
103
,■■1,1 illlinill .JIJI.^L.li^ ^^JW,,B1.,.l.llw..>,mp^w^j|| mppMp^Mp m ). .■M.|.i.i..i ii. ,1- ii .,.>. J ■im-im I. i m »i.»....^^,,,, II,I IIJI .ij.i.n.mup,,^.^ „„.y, „, mi, ,v,mi„^„^mv^ri
VK
by
0<(1) =
^ =log(k -log(g (x)) ,
l(i)
E 0^ • (Iu20)
J=l
t ^•i) i r
k
i-1
-r i
ftx^-fCx*) = -r
(1+.21)
The right hand side of (^.21) tends to zero as i -• • , and this suggests
m
Q(x,\) = f(x) + J] \ v(k -log(g (x))) .
d J
J=l
lOU
for i = 1,2,... «md fixed r . Then the limit points of {x^ '}
1 1
7f(x^)-r £ i±K ^ m " m W^i^
* 0 , (^.22)
VQ(x(i),\(i)) = 0 (1^.23)
, .x i-1 ,
\\J = —rn—
(i)
TT —-T7\ nrr ' j=i,2,...,m .
kj -10g(gj(x )) S=2 (T + ^Cg^X^O)
(1^.21+)
fiv' > 0 , s = 1,2, ...,i-l , K\ ' can be made arbitrarily small for
105
■
ipipiimPMpnii^viiMPvpnninwiiivR
—4_
(U.20).
Lemma U.3:
(^.25)
where
U-.^+
■j "j g ,^<,^
~j ögj 'i
. (1+.26)
and
(i) -ii ,
py gj -* o . (^.27)
(U.29)
^
gives
106
MMMHi
so that
06^
"Si ^ ag
^ + g
■■*(• )
(i)
g ^G - 0 , g. -0 , i = 1,2,... . G
d «gj 0
condition is satisfied.
J ög
6
'1 'J
<(i)
(1^.31)
1
so that w^. ^ -«0 as r -«0 for jeB^
0 . Now
0
0
bi~t 1 1
[D^V )^)]-
(i)
\ 'I •
•
. m 'Sn .
+ smaller terms
107
where P^1^
is diagonal, and P^' = l/pj1^ - 1 , r
•JJ --"J -' k^0'
j = 1,2,.,,tm . This result implies that for k large enough
1|
Vü*II ^ K^ u k
JeB, j «j^ •
Thus eventually the largest components of >r ' will be those associated
with the members of the log sequence, the inactive constraints should be
k-1 k-1
where p = r, u , r, is the usual barrier parameter, and u is
108
J i
k+1
12? J :
i ~r*k+l"i
'vlUi ^-(W
c^g
i
s^As-^rft^i <^
where p. is a bound for 4^- (x.) , k = 1,2, ... .
JU
X
2-Xl=rk ' ^l^kVl '
so that
0 1 c k
ß ß
k - 2 k-l " 2
condition ßn = 0 is
ßk = -k + (l-(|)k) .
From this it follows that u = 0(r.) , and hence that x^. = 0(r ) .
Example. Show that the error estimate (U.l) is valid in this case.
L« 109
wmmwM'-i'iJWJiiiiiuii^niPiiiMiiiiiiiiimniKmnniiiiinixuM ni.MOiwiP
to
v 7 X(x ,u )v #
m = rain 1—a—- , Vv such that 7g.(x )v = 0 , Vi e B. .
V V V ~ ^ ~
of the problem
[C1(/)TC1(x*) - ^Al^ = 0
p / O
where A is a diagonal matrix. A.. =—^ / max —5 , i = 1, ...,t
11
Ög2 /i<j<t Ög2 J
110
tm jjgmjn ■ M.^.^:^..^,tikJ,i--^...UTAn;|[rWt|1||..|.[.|riini<|1|.[.|n..|i...|...[||<|.. ..
w~-
•V«-.-.- ST^HmHWIPIWWl fWHWHWWWIWW "'"' ■«^»■iiim.w,i;iii»iHi,.iiiw.i>i«ii,i»|).,ii>iMiiHui,iiiiNijiiiitii,iiti,.niii.i.iw. P1I!I«!"!','»."I1I,I'|].IW»1IH"I,1M1K3:
In this latter case we have shown that our measure of the cost of a
for the log family indicate that these will be somewhat more expensive
than the above estimate except when all constraints are active.
least in the examples that have been worked out. The use of this
algorithms.
(say f ).
Ill
equality constraint „J
and
(iw} nonincreasing. If {x } -• x then x solves
the BCP.
F X F X F
F(V^)
jr r < ( „s' r' < (v~s'
S^r) a^Ss') < (V\)
~r' s'
Thus (compare Lemma 1.2) the results for the sequences follow us before.
We have
min F(x,\) < min F(x,\) = min f(x) = f . (5.1+)
x ~ xcS ~ xeS
112
x cS =» f(x ) > f ,
but, by (5.1+),
so that
u
i " ' Xr ^J sgn(h )
i ' ±
= 1'2>"'><i. (5-5)
T
Proof: Define the matrix B(x ) by /c.(B(x)) =7h. (X) ,
so that
115
Now B(x ) has full rank for ||x -x || small enotigh. Thus
•«.i
ur = (B(>:r)T B(xr))-1 B(xr)T 7f(xr)T
for the active constraints (do this I). In fact, the strict complementarity
{u } -u , and \ djt2
_ • , i = 1,2,.. .,q , as r -♦ « , then the
~r ~ r
dh^
Proof; This is essentially the same as that of Lemmas 5.1 and 3.2. D
Example; Derive the analogues of Lemmas 5.1 and 5*5 which apply when
Ilk
(5.9)
so that
d2
Thus X ^-| - • , h. - 0 if |u. I > 0 . Strict cctrplonentarity
(l + ^^lh.fa sgn(hi) = - u*
so that
115
also indicate that the condition number of the Hessian will become
Theorem 5.h: In the IGP let f(x) be convex, and g.(x) , iel ,
minimizes
provided ^ > \
Ö *
then
116
tmmm M^MM,
-V»•.'■-,■ *■ i,im«v>-B ■",
=f(
^ X ^-o*
^-^o icB
< f(x0) - (b*-!)
= f(x*)-l
whence
117
. ■- ■■ ..„.L^^.—^ . - .
- flU
fg^H^^J.u.-i Hii^Hiii^iw.Mm i ,..J., . '^■^.-'■■'Jiv.iii.i-yviuiiMi^ -A™™, h^n *'-vi' n gn^—w ^■^r-i« P^...I+...,I.—
q
7f(x) - f ui(W,1))7hi(x) = 0 (6.2)
W X
where ui(W,Tl) = ~ ii ^" * Provided ||x(W,T0 - *II «"^
/ x
One sequential strategy for making x(W,Ty -♦ x* is to force
M* «W **0 . i
v c*x.
Let irr- be the matrix with components -rzr- , 1 = 1, ...,n ,
W fixed. We have
72 P "ri - -^n^i< ^
where all quantities are evaluated at x(W,Tl) • Defining the diagonal
118
|jpw^iy"J:WTF.ia!».»i<tiuiJ.iiiiiipvM,Mi.ittiitijl|).|jii..^, .IJI^I. urownpimnuBpipim J-*~'9J',i''>-^r-.-"rr——- &r^'rj"Wlfinv*'r**^ft7?*ryr^nr?i<y*ivnr~
2 2
Vv^ Ww^ W i
matrix V by ii == ii H = ii ^ '
^ = *ii Sh^ = 1'2—^ ' ^ the
matrix B by K.(B) = 7h. , i = 1,2, ...,q , then (6A) can be written
(72X+BVBT) ^ = - BV (6.5)
2 T
Choose V- to make U = 7 X+BVQB positive definite, and set
v = v v Then, by if min v is
i " o * ^"^» ii s11^101611*^ large
^ ~ -U'1B(BTU"1B)"1+0(V"1) . (6.6)
Consider now the use of Newton's method for solving (6.5). This
-u <& R« TJT öx t_ .
(6.7)
But, by (6.6),
,T äx
B - I + 0(V'1) (6.8)
3Ü
so that
119
"—"--—"■-^■■■''-n II --■ ■
wmMmmmmimmmmmw-*""* mm '" ' ' ■' '" WPilW ^.'HI'IIIMWHIWHIIHIHI« iiiiiiiiiiii;iB,iij!iii .iiii|iijiipiw.iiiujipy]|ii].ij|iiii,ii<<iiiii.iiiiiii., inii)ni»g.mi|iiM
Algorithm (BCP)
1
(i) Initialize T/ ^ , W^ .
IF A^C^Cx,)) <DBCR*ABS(h.(xk_1))
(v) K := K+l.
i
(vi) GO TO (ii). -,
Remark; The idea behind the algorithm is that the correcticn (6.9)
120
suggests a correction
by the options
=
(i) if g. -• 0 at a satisfactory rate then 1\. \+&± * and
(ii) if the convergence rate is too slow, then decrease the barrier
parameter.
and will drive the contribution to (6.14-) from this source rapidly to
zero by (U.55). Note that the boundedness of the barrier terms requires
a feasible point.
121
Algorithm (ICP)
IF ABStg.^)) <DECR*ABS(g1(xk_1))
THEN
«if1' ■ -/^ >
(k+l)
(ßiTi'^i
(v) K := K+l.
(vi) GO TO (ii)
and the vh.(x ) , iel , are linearly independent then S(x) has a
122
-■i---:-;-"'----"--■■„- drnn-ir.. iaC^l-..^U-^L-JL-^^..,.U^..-^A.1 ^^rf-lr^ ^Jb^ . :■.■, .--^.r ■-■.. ' ^..-^ ,- ■■—^.. - , .„,, .t J.,... rJ.L'. i^
WW^^W^'W "MWMMIN llil '.' • ■liii.uiili^.lp. m.. ..,-, „..^ I'nm "in min) ji.i ii Mi . III.III|IWII.
Proof: We have
h(x'(')T(7u(x*)-2W7h(x*))
7f(x ) -u 7h(x )
= 0
7h(x*)V(x )
+ 27h(x*)TW7h(x*) (6.15)
where terms which vanish at x have been ignored. Corollary 3.2 can
2 *
now be applied to show 7 S(x ) is positive definite. We set
V = 7h(x*')T , U = 72l(x*,u*) - 7u(x*)T7h(x*) - 7h(x*)T7u(x*) , and note
0**0 MM ** ** ** ** 9* m* r* m*
that
T T 2 , * *\.
min t Ut = min tA 7 £(x ,u )t = m > 0
^=0, Ht|l=i ~ ^=0, Htll=i ~
»ff
as the second order sufficiency conditions hold at x . G
\
125
in Uf^lUll^l^ ,,■,.,.,;:,„.....J.-;;ltt_,n„...^,.
r
i ■
Theorem 6.1; Let the conditions of Lemma 5.1 hold at x and set
(ii) There is a close connection between the penalty function (6.15) and
2
the algorithm based on (6.1) in the case i|f(h) = h . At a minimum of P
we have (as vP -■ 0 )
12l4
i^LaaafliM^fa^^^—^.^,.,...,..-.^■.■■■..: g -iilitu-i ■rrn '■ ■Mr'«!■■■■— ^..;J.-v^.'^.'..-J..:. ■--^.^^■■.^Mv, m^,^ n-y, ■ r n■ i,- ■ ■ ,...: r_'. .._.'.-_^.:.'.„. .v 'u.-..:^-n.:^...^L^
I'M mniiiiiwijpwywwwpwiiiiii i i IIWIHWI)'!.» iww, nmi]§mmwmimmmwm\-jw'mmwmwmw*wnm%w\m<mmMi.mmwtm)f piiiiunnmnn
where
h(t)+7h(t)(x-t) = 0 , (6.21)
Proof; Any point satisfying the constraints (6.21) has the form
x = t-(B(t)T)+h(t)+A(t)z (6.22)
u = B(t)+{7f(t)T-ff(B(t)T)+h(t)} . D
I I
125
minijnizing w.r.t. z
u
a T T
7f Az +| z A A z
whence
Ü
z = - i (ATA)"1AT7fr . (6.2U)
cannot deviate far from t (cf. remark (ii) following MP Corollary 5«l)» *^ |
the problan min £(x,w(x)) where w(t) is the vector of multipliers for
x ~
the problem
a
2
min f(t) +Vf(t)(x-t) + | ||x-t||
x '
resulting algorithms.
126
Notes
1., 2. See Fiacco and McCormlck's book. Also the paper 'Penalty
section).
U. The log family is due to Osbome and Ryan. The importance of the
Zangwill.
2
6. The algorithm for the BCP is due to Powell in the case | = h
References
Madison.
127
,J
I
128
filtttMU
'"-'"-— ■--- ■■-"
■ - ■-■ m mmm MUMM BUI
|pHiifiHiiPt^-^.Ti» i.^rrtwwF'T—■ ■»■■— \mi ■ ,.,mmmnim» trr}*-Tt:r~*.:.,.,l,l,*immr--yi,i>mr-rwwrr>,S. J.I')'-i"iif..M^ P.'.' •wiwvm-iw; T*TTVTWPT^'■"""""TOTTHT-WI-fWITTIWTft.^"^^11JWW '1'" ■ JUJ■■■I■W■ W|'j'IWM,-yWS-^ -^TPW'Tf ff l|■ W I'jW If WVTH'V—^•WWV'ff'TT-T-""T■nnrTt^lHTH—Tti.^
Bibliography
129
150
-
^uiT.ww^*>H»i..IiJ,W^Li*wii.i»!JM.^,p*.-^.#rtpwT^-v-WBB-- »iLmi^v^m^w^trvn^ jin.ij»mw,mwyJ.f w- ^mmmmim/WW ^.'l-H -n-uiiwi ■■ —I u^ I .^MIM^W i» M-X^^UJ.M ,m. . n K W'vmiJS.JqFinjy*
Bui Trong Lieu and Hoard, P. (1966). La m^thode des centres dans un
espace topologique. Num^r. Math., 8 (l), pp 56-67.
131
I—,H.I- -i.-!.—!..!'!- ■ ^'XI"."L..I.. m^mmm^^im*.: . —• T"-■■-■» ^!"M .T-—»r—wi-c-^-f ■
152
135
gjMMiriaMjag^g-a,,-.—,,—.-„,.,....,.....,^..„—
nT^rvi^T-T^^gT^-W-^yTtrwv^f^TTnr^
154
^JICiHIIBIUMlMUMKjMMaUifi
||pWPP^iWM^|iiWJ!»'r''l'W<IIIJy»l!F. I»1'IP|MJIIWIIIII^IUILII|1PI!|^»1.,W|IW>II.|II»H,BI|^M^I''«WIH"MPI"'J'I|.B .-H UPTB'»!»!'!!".'"Hili.imiiyiUmMCT.'ilHI.illH iiiUliW|^»,».>.PH|.|.iiiHIII-i].iHllii IIIIIIIIH^PIJ.HIIIJIWJIIIIIIIIII ! uni iji.i.np.
f 155
-"'•"""Tr ' " ' ■■'■ ' '_
liiir-T T'"irms: If^wwtfWii 'yr^r*riwv!,»A'mjn"mmi
Kelley, J. E., Jr. (i960). The cutting plane method for solving
convex programs. J. Soc. Ind. Appl. Math., 8 (4), pp 703-712.
156
—— m—ii ■ — mmimmmmmmm—mmmmm
157
■^■■■,J.--..^-,-.^—^ .. . ...
rw»w-Tyf')^l'.lw'"g',WWf^,w' v.^ffivm-'•'<;.iw.'T>'r ^.-TiP-wTl"i""W'"TT-wiw"w.^.w-F^'"wl.^l.'mn-,'i'>in' "' im'i^u'i'■.■■".'■■'WTI»..HI"I.BI,HI ^PI ,fi«ii H MM II' I.II«I ^.*m-*.•l'.y■^r*•w*^•'^'w••*rl•*":l"^^" n'>vi"\.n"H'm\\wm'^^ w*mwm\ttm%^t*fimxiHli
138
J i
159
IkO
M
—-■-TMMaMllllMH.Itollil ■ 1 I —^■-