Professional Documents
Culture Documents
Dynamic Programming Method in Problems of Discrete Optimizations
Dynamic Programming Method in Problems of Discrete Optimizations
xt = f(xt−1,vt),
The state x0 is the initial state of the trajectory T, the state xn is its final
state. The states x2, x3,…, xn−1 are T intermediate states of the given trajectory.
A trajectory is called complete if its initial state is the initial state of the system Ω,
and its final state is one of the end ends of this system. Thus, for the complete
trajectory Т = {x0, x1, x2, …, xn}, x0 = x0 and xn ∈ F. The trajectory Т = {x0, x1,
x2, …, xn} is called the final x-trajectory if x0 = x and xn ∈ F. The final x-
trajectory, observed by its initial state of reaching the state x of the system Ω, ends
at one of the endpoints of the system. The trajectory T = {x0, x1, x2, …, xn} was
originally called an x-trajectory if x0 = x0 and xn = x. The initial x-trajectory,
corresponding to its final states x, starts from the initial state of the system.
On the multiple basis of the system Ω, we have a binary reachability relation
P(x, y): the translation that arbitrary states x and y is the settlement of this place if
there is a trajectory T that has its initial state x and its final state y. revealing that
the invasive binary resistance of the resistance of reflexivity (for any state x, P(x,
x) occurs) and transitivity (for any x, y, z from what P(x, y) and P(y, z) ) that P(x,
z)) also holds. An additional assumption is that the relation P(x, y) is
antisymmetric (from P(x, y) and P(y, x) follows x = y). The antisymmetry of the
binary reachability relation means that in any of its trajectories the system cannot
realize cycles (return in the state in which it was already). It has the expressiveness
of reflexivity, transitivity and antisymmetry, the binary relation P(x, y) is a partial
order relation.
A state x of the system Ω is said to be reachable if P(x0, x) takes place. scattering
that the initial state x0 is reachable by membership. The state x of the system Ω is
said to be productive if P(x, w) takes place to consider the final state w. The final
state of the family productivity system. Unreachable and unproductive system
states can be excluded from investigation; further, as a rule, it will be considered
that each state of the system is achievable and productive. When detecting systems
generated by emerging complex problems, the question arises of identifying those
that are both achievable and productive, a separate analysis is possible. Getting rid
of unattainable and unproductive withdrawals of increased volume performs
calculations.
Let Т = {x0, x1, x2,…, xn} be an arbitrary trajectory of the system Ω, while x t = f
(x t − 1,v t), where v t ∈ V (x t − 1), t = 1, 2,… , n.
Cost C(T) of the trajectory
n
C(T) = ∑s(xt−1, vt).
m=1
Thus, the cost of the trajectory is the sum of the step-by-step payments, a
unique place in the implementation of the trajectory.
The main task is to build a complete trajectory, the economic benefits of
maintaining the total cost, i.e. total total income or total expense. Both extreme
problems according to the method of solution. To define the problem, we
formulate the formulation.
Problem A. For the system Ω, find a complete trajectory T of incredible
cost.
Additionally, we enter the accessibility of the set of tasks A(x), where x belongs to
D\F.
Problem A(x). For the system Ω and its state x, find the resulting x-
trajectory of high cost.
The solutions of the formulated extremal problems A and A(x) are called the
exceptional complete trajectory and the exceptional final x-trajectory, respectively.
The method of dynamic development is based on a fundamental feature,
formulated by mathematical mathematician Richard Bellman, the sentence:
В(x) = 0, x ∈ F. (1.1)
“The optimal property of having the property that whatever the intended
state and decision at the initial moment, subsequent decisions should constitute the
nature of the behavior relative to the state that exists at the basis of the first
solutions. ".
В*(x0) = 0. (1.3)
In the system Ω, an arbitrary state x immediately precedes the state y if the pair {x,
y} is a trajectory of the system Ω. The control that transfers the system Ω from
state x directly to state y will be denoted by v[x, y]. The set of system states
immediately preceding the state y will be denoted by Г(у). Based on Theorem 1.2,
we write a relation that allows us to organize the process of calculating the values
of the function B*(y) for all states y other than the initial one:
Note that
BA = minB*(x). (1.5)
x∈F
The calculation of the values of the function B*(y) based on the recurrence
relations (1.3)–(1.4) is performed step by step in the following order. At the first
stage, the value B*(x0) = 0 is fixed. Then, at each next stage, the calculation of the
next value of the function B* is performed for an arbitrary state y such that B*(y)
is unknown, but the values of B*(x) for all states x immediately preceding state y
have already been found.
The method for solving problem A based on relations (1.3)–(1.5) is the
direct Bellman method. In contrast, the method based on relations (1.1)–(1.2) is
called the inverse Bellman method. The functions B*(x) and B(x) will be called the
Bellman functions for forward and backward counting, respectively. The
implementation of both the inverse and direct Bellman methods for solving
Problem A requires, according to the upper bound, the number of elementary
operations that depends squarely on the number of states of the system Ω.
Sometimes the direct and inverse Bellman methods, implying that the problem is
solved using dynamic programming relations, are called the methods of direct and
backward counting, respectively.