Professional Documents
Culture Documents
Heidari Et Al-1971-Water Resources Research
Heidari Et Al-1971-Water Resources Research
DiscreteDifferentialDynamicPrograming
Approach
to Water Resources
SystemsOptimization
MANOUTCYIEYIR
HEIDARI
VENTECHOW
PETAR
V. KOKOTOVI
AND DALE D. MEREDITH
Abstract. The optimization of operating policies of multiple unit and multiple purpose
water resources systems by traditional dynamic programing with the use of high speed
digital computers encounters two major difficulties' memory requirements and computer
time requirements. This paper presents an iterative method that can ease the above diffi-
culties considerably. The method starts with a trial trajectory satisfying a specific set of
initial and final conditions and applies Bellman's recursive equation in the neighborhood of
this trajectory. At the end of each iteration step a locally improved trajectory is obtained
and used as the trial trajectory in the next step. The method has proved particularly
effective in the case of so-called 'invertible' systems.The merits of the proposed approach
are demonstrated through its application to a four-unit, two-purpose water resourcessystem.
To save computer time the example is restricted to deterministic inflows.
---Trial
Trajectory
a(O)+B s(0),
a(O) + Bs2(0)
'.'""-., : a(N)
+BsI(N)
a(O)+$s3 (0)'
---e--/--:-:.....-,--e--:'-...:
7.;/,
.-..-.;..-e
a(N'*
Bs2(N'
I I i I I I I I I I
0 I 2_ $ 4 5 6 7 8 9 I0
Stage (n)
,[u'(n)J,,
etc. Retrieve
Fk
t ,
[s'(n)]k=
2,
[s'(n-I)
n:1, ,u'(n-I),
N n-l]J
----, = ,,,[s(n-- 1),u(n-- 1),n- 1] (12)
t
is nonsingularfor every n, u(n) U(n), and
JF'=n=o
" R[s'(n),
u'(n)
, n] s(n) s(n). Assumingthat (12) is an invertible
system, one can solve the decision variables
in terms of the state variables
Choose
o-t , t=l,Z,---, T [s'(n)]k=
[sn)]k.I
t [u'(n)]k=
[uili(n
)]k_l u,(n-- 1)= t[,,[s(n),s(n-- 1),n-- 1]
FormA si(n),i--I,2,---,T
m n: O, I,---, N
n =0,1,---, N u2(n-- 1) = [s(n),s(n-- 1),n-- 1]
f
Form
subdomain
O(n),
n=O,I,---,
N,J
by[s'(n)]
+Asi(n)
, i--I,;),
--, Tm
Let all D(n), n--O,I,---, N, be u,(n-- 1)= t[,,[s(n),s(n-- 1),n-- 1]
corridorCk
Optimizeby dynamicprograming
for states in Ck
u,,(n- 1) = t['m[S(n),
s(n- 1), n- 1] (13)
In the caseof water resourcessystemsit will
Fig. 3. Flow chart showing steps of the DDDP be demonstratedthat the assumptionmentioned
approach. above is not restrictive. The ith componentof
Dynamic Programing 277
the vector equation (12) for a water resources - s'(n-I)+8sl(n-i) . s'(n)+ 8 sl(n)
systemmay be written as
u3
si(n) -- si(n- 1) q- y(n- 1)
s'(n-I)+ 8 s2(n-I) s'(n)+ $ s2(n)
-- u(n-- 1) -- L(n-- 1) (14)
where s,(n) is the storageat stagen, y(n -- 1)
is the inflow during the time period starting at
stagen -- I and lastinguntil stagen, u, (n -- 1) s'(n-I)+8 s3(n-I) s'(n) + 8 s3(n)
is release,and L, (n -- 1) is lossdue to seepage
n-P_. n-I n n+l
and evaporationin the sametime period.Since,
for a systemconsistingof m components suchas Stcge (n)
(14) Fig. 4. Possible decision paths leading to state
s'(n) q- $s_(n)from stage n -- I for a system with
m-- land T-- 3.
-- --1 or nonzero (15)
1(n) L3
in) g,[s,(N), a,(N)] = 0
s,(N)
_a,(N) (23)
otherwise
The desired state vectors of the initial and final
stagesfor i -- 1, 2, 3, 4 were assumedto be
i
4s4
i
(n) 5 5
To Farm
t,mo
I Trajectory Opt
imal
Trajectory
IOE ial Trajectory
$ rial Trajectory
2
/ '-'\
-' 8
6I./ /-- . %
%.ESERVO,R
:% / /// \\RESERVOIR
:5
o
2 \/-Trial
Iraiectory
,,/
0 xx_
//TrioI
Trajectry
m _
%,. ...... _
4 6 8 I0 12 0 2. 4 6 I I
Stage(Time),n Stage(Time),n
;Optimal
Trajectory ,Trial
Trajectory2
1-"' /T__r
icI
Trajectry
:5 II ////
.....IXXX
6 Trial
4
Trajectory
II'RESEOIR
.........
4L'xxOptimal
Trajectory//
/ o 2
xx /Trial XTriol
x Trajectory
5 [
x_ / Trajectoryg 0
4 6 8 I0 12 0 g 4 6 8 I0
Stage(Time),n Stage(Time),n
Fig. 6. Trial trajectories1, 2, and 3 and optimal trajectory of systemin Figure 5, using
ff-l.0, o' = 0.0, and = --1.0. ,.,
28O t-IEIDARI ET AL.
410, Optimum Benefit, F :401.5 After the required iterations for the three
trial trajectoriesthe reductionof a,, t -- 1, 2, 3,
does not produce any improvement in the
return. This result may be attributed to three
factors: (1) the optimal trajectory of this sys-
tem followsfull integerstates; (2) the trial tra-
,, jectories in Figure 6 are chosen so that they
follow full integer states; and (3) the valuesof
,, t -- 1, 2, 3, for all stagesare set at full inte-
gers. In a separate try, trial trajectory i is
Triol
Benefit,
':368,1 aj = 1.3 a2 = 0 a3 = --1.3 (26)
for all stagesstarting with iteration 1, and the
360/
550
Triol
Benefit,
F'=562.5
_
idea of reducing a, t -- 1, 2, 3, is employed.
After a total of 18 iterations in four corridors,
the states shown by solid circles in Figure 8
, rio;Beyefitl,
54C0 2
F' .0
54 4 6 8 I0 12
are obtained, producing a return of 399.06 as
comparedto the optimal return of 401.3. Thus
Iterations, k one concludesthat when the optimal valuesof r,
ig. 7. Total beneft6F as function of number are unknown,the resultmay be considered only
of ierfions for trial rajectories 1, 2, and 3.
an approximation to optimum.
SUmmARY A) CONC,USONS
jectories1, 2, or 3. All three solutionsconverge
to the optimal trajectory as shownin Figure 6. The major factors that inspired the DDDP
Figure 7 showsthe rate of convergencefor the approach were the inherent drawbacks of
- I iV____
RESERVO
0 2 4 6 8 IO 12 0 2 4 6 8 I0 12
Stage.( Time), n Stage ( Time), n
. Trj
a
2 T_r
j_e
o cJ
ory2
4
I I I I I I I I I 0 I I i I f i
4 6 8 I0 1.2 0 2- 4 6 8 I0 12
Stage (Time), n Stage (Time), n
Fig. 8. Near-optimal trajectory shown by solid circles and optimal trajectory of system in
Figure 5, using trial trajectory I and x -- -]-1.3, , -- 0.0, and a- --1.3.
Dynamic Programing 281
sum of the returns for N time periods Series by the Department of Civil Engineering,
due to u(n) and s(n); University of Illinois, Urbana.
F*, optimum sum of the returns for N
time periods; REFERENCES