Professional Documents
Culture Documents
CR 1384
CR 1384
them as:
In case of second derivative, it suffices to derivate further:
1) Static branch: v j = f j ( x j (t ), α j (t ), t )
69
∂ 2 yk (t ) ∂ 2 y k (t ) and vice versa. Indeed, let us suppose, without loss of generality that
y j '' = = =0 the system output depends on the addition node through a generic
∂y j (t )
' 2
∂y j (t − τ ) 2
function with memory as follows:
Let us continue showing the formulas derived in case of static n
branches. We move from the supposition that derivatives respect yk = h(uΣ ), uΣ = u1 + ... + un = ∑ u j
j =1
with node v j are correct, that means:
f
v j (t − τ ) = f j ( x j (t − τ ),α j (t − τ ), t − τ ) Original SFG Adjoint SFG
∂yk (t ) ∂ 2 yk (t )
v j '(τ ) = , v j ''(τ ) =
∂v j (t − τ ) ∂v j (t − τ ) 2
Static
∂yk (t ) ∂yk (t ) ∂vk (t − τ )
x j '(τ ) = = = v j '(τ ) ⋅ f j '(t − τ ) + f k '(τ ) 2 ⋅ vk ''(τ )
∂x j (t − τ ) ∂v j (t − τ ) ∂x j (t − τ )
vk (t ) = ∂f k
f k '(τ ) =
while for second derivative it can be deduced: = f k ( xk (t ),α k (t ), t ) ∂xk xk ( t −τ ),α k ( t −τ ),( t −τ )
∂ y k (t )
2
∂ ∂yk (t ) ∂ fk
2
x j ''(τ ) = = = f k ''(τ ) =
∂x j 2 (t − τ ) ∂x j (t − τ ) ∂x j (t − τ ) ∂xk 2 xk ( t −τ ),α k ( t −τ ),( t −τ )
∂ ∂yk (t ) ∂vk (t − τ )
= =
∂x j (t − τ ) ∂v j (t − τ ) ∂x j (t − τ )
∂ 2 yk (t ) ∂vk (t − τ ) ∂yk (t ) ∂ 2vk (t − τ )
= + =
∂x j ∂v j (t − τ ) ∂x j (t − τ ) ∂v j (t − τ ) ∂x j 2 (t − τ )
Unit Delay
v '(τ − 1) τ >0
vk (t ) = xk '(τ ) = k
where we named: 0 τ =0
x (t − 1) t >0
= k v ''(τ − 1) τ >0
f j '(t − τ ) = f j ' ( x j (t − τ ),α j (t − τ ), t − τ ) 0 t =0 xk ''(τ ) = k
0 τ =0
f j ''(t − τ ) = f j '' ( x j (t − τ ),α j (t − τ ), t − τ )
Regarding to unit delay branch we can write, under the same afore
mentioned hypotheses:
Output
∂yk (t ) ∂yk (t )
x j '(τ ) = ∂x (t − τ ) = ∂v (t − τ + 1) = v j '(τ − 1)
j j yk ' = δ jkδ (τ )
yk (t ) = xk (t )
x ''(τ ) = ∂ 2
y (t ) ∂ 2 yk (t ) yk '' = 0
k
= = v j ''(τ − 1)
j
∂x j (t − τ ) 2
∂v j (t − τ + 1) 2
70
ance with formulas of previous sections, the following notation
∂2 y ∂y ∂2 y
holds: xi′′ = 2 , xi′′ = , xij′′ = . From that, equation (1)
∂xi ∂xi ∂xi ∂x j
and inverse function theorem, it derives that:
∂2 y ∂ ∂y ∂xi ∂ 2 y ∂xi ∂y ∂ 2 xi
= = + =
∂xi ∂x j ∂xi ∂xi ∂x j ∂xi 2 ∂x j ∂xi ∂xi ∂x j (2)
( )f ( ′′
= xi′′ ⋅ 1 ′ + xi′ ⋅ − f ′ = xij′′
f )
Figure 1. Addition and distribution nodes connecting topologically
linear SFGs. ∂2 y ∂ ∂y ∂x j ∂ 2 y ∂x j ∂y ∂ x j
2
= = + =
∂x j ∂xi ∂x j ∂x j ∂xi ∂x j 2 ∂xi ∂x j ∂x j ∂xi (3)
Similar considerations can be made for second derivatives:
∂ 2 yk ∂ ∂yk ∂ ∂yk
( )′′
= x j′′ ⋅ f ′ + x j′ ⋅ f ′ = x ji′′
f
v j '' = = = =
∂u j 2 ∂u j ∂u j ∂u j ∂uΣ It can be underlined that:
∂ 2 yk ∂uΣ ∂ 2 yk • xij′′ ≠ x ji′′ : in fact xi , x j are dependent and Schwarz’s
= = = vΣ '', ∀j = 1..n
∂uΣ 2 ∂u j ∂uΣ 2 theorem can not be applied.
Concerning the distribution node, we can underlie, without loss of
• xij′′ and x ji′′ can be calculated by means of first and full
generality, that there is a general function with memory describing
the dependence of desired output on the n outgoing branches: second derivatives relative to the original graph ( x j′′ , x j ) or
y = h ( u1 , u2 ,...., un ) . Then, u1 = u2 = ... = un = u holds, where u is
to sub-graphs of it ( f ′, f ′′ ).
the value of the incoming branch. This let us state:
∂yk (t ) n
∂yk (t ) ∂u j (t − τ )
u '(τ ) = =∑ = 3.2 Case 2: only independent variables xi , x j
∂u(t − τ ) j =1 ∂u j (t − τ ) ∂u(t − τ )
Conditions of Schwarz’s theorem are assumed to be valid: this is
n
∂yk (t ) n
=∑ = ∑ u j '(τ ) supported by the independence between xi , x j . Therefore xij′′ and
j (t − τ ) j =1
j =1 ∂u
∂ 2 y k (t ) n
x ji′′ are expected to be identical. Anyway, we need further topo-
u ''(τ ) = = ∑ u j ''(τ )
∂u(t − τ ) 2
j =1 logical information for our purposes: that is the knowledge of a
In conclusion, it can be affirmed that the set of transformations in “common” node between xi , x j , namely xk . This node is depend-
Table 1 are valid for generic topology SFGs. ent on both xi and x j , through two different functions, i.e.
71
( ) ( )
xi = F x j = f −1 x j , and the former being completely independ- xi = 0 . Then, we can rewrite (5) and (6) as in the following com-
pact way:
ent from x j . This let us deduce the formulation of mixed deriva-
tives we are interested to:
( ) ⋅ 1 + ( g′ g ′) − x′ ⋅ f ′′ ( f ′) + g ′′ ( g ′) ⋅ g′
xij′′ = xi′′⋅ 1
′
f
i 2 2
∂2 y ∂ ∂y ∂xi ∂ ∂y ∂xi ∂y ∂ ∂xi
= xij′′ = = +
∂xi ∂x j ∂xi ∂xi ∂x j ∂xi ∂xi ∂x j ∂xi ∂xi ∂x j
( g′ h′) + x′ ⋅ ( f ′′ f ′) − ( h′′ h′) g′
x′′ji = x′′j ⋅ f ′ +
j
∂y ∂y ∂y ∂y
Observing that = ⋅ f′= − , we shall have:
∂xi ∂x j ∂xi ∂xi
x1 x2 x3 x4
where xi′′ is the full second derivatives of the graph deduced from
4. CONCLUSIONS
the original one by splitting xi into xi and xi . We can now afford
SFG techniques have been successfully employed here to calculate
to implement what done in case 2, and write:
first and second derivatives in dynamical non-linear systems.
Firstly, a suitable adjoint graph has been derived without using
∂2 y
=
∂ 2 y ∂xk
⋅
∂xi ∂xi ∂xi ∂xk ∂xi g ( ) ′′
( )∂x
= 1 ′ xi′′ − xi′ ⋅ g ′ ⋅ k
g ∂x
i
Lee’s theorem to yield first and full second derivatives of an output
of the original graph respect with one of its nodes automatically.
This let previous SFG based approach for sensitivity calculation [2]-
where xk is the first common node between xi and xi , and [5] be considered as particular case thereof. Secondly an helpful
xk = g ( xi ) , xk = g ( xi ) . Summarising, we get the final formula: formulation for mixed second derivatives has been deduced in order
to make them dependent on quantities attainable by means of suit-
able adjoint graphs for first and full second derivatives, even though
( ) ( )( )
f ′′
xij′′ = xi′′⋅ 1 f ′ + x′j ⋅ − f ′ − 1 f ′
∂2 y
∂x
i i∂x
+ xi′′
a proper graph description of such dependencies actually lacks.
Derivation of that is the first issue for future works. Then, applica-
2 tion of proposed approach to adapting problems where calculation
∂ y ∂ 2 y ∂xk ∂2 y of Jacobean and Hessian based information is involved, with special
= ⋅ = ⋅ g′ (5)
∂xi ∂xi ∂xi ∂xk ∂xi ∂xi ∂xk care to dynamical neural systems, is also being studied. Finally, the
( ) ( )
∂2 y ′′ authors are investigating how to extend the present approach to the
= 1 ′ xi′′ − xi′ ⋅ g ′ case of multirate systems, as recently done [6] for calculation of first
∂xi ∂xk g g
derivatives.
∂x j ∂xk ( )
∂ y = 1 xi′′ − xi′ ⋅ g ′′
g ′ ( ) g ′
Approach to On-Line Gradient Calculation”, Neural Computation,
vol. 12, no. 8, August 2000.
[6] F. Rosati, P. Campolucci, and F. Piazza, “ Learning in Linear
and Non-Linear Multirate Digital Systems by Signal Flow Graphs”,
( )
where x j = f ( xi ) , xk = h x j , xk = g ( xi ) hold. Same consid-
IEEE ISCAS, vol.2, pp. 6-9, May 2001.
erations as above can be made. In particular, (6) reduces to (3) when
72