Download as pdf or txt
Download as pdf or txt
You are on page 1of 6

Copyright to IFAC Nonlinear Control Systems Design,

Enschede, The Netherlands, 1998

NONLINEAR FILTERING USING PIECEWISE-GAUSSIAN


APPROXIMATION OF A POSTERIORI DENSfIY

Stepanov O.A.

State Research Center of Russia -


Central Scientific & Research Institute "Elektropribor"
30, Malaya Posadskaya Str., St. Pecersburg, 197046, Russia
Tel. (812) 232-59-15, Fax (812) 232-33-76, E-mail: elprib@erbi.spb.SU

Abstract. An algorithm using piecewise-Gaussian approximation of a posteriori


probability density function is proposed for special discrete-time nonlinear fIltering
problems. Copyright © 1998 IFAC

Keywords: nonlinear fIlter, discrete time, probability density function, optimal


estimation.

I. INTRODUCTION set of e for a discrete set of ej . In this case it is


not possible to take into account the local
properties of the non1inear functions in the
Among the wide variety of non1inear fIltering vicinity of ej . This can be done by the Gaussian
problems it is possible to single out the problem in sum algorithms based on poly-Gaussian
which dynamics equations of the n-dimensional approximation of a PPDF for the state vector
state vector Xi and/or measurement equations Xi = (xT ,e)T (Alspach and Sorenson., 1971).
are nonlinear only relative to the constant r- These algorithms also use the KF bank but the
dimensional subvector e (Lainiotis, 1976; dimension of the fIlters is equal to n + r. The
Maybeck, 1982; Kulbavy, 1992). number of fIlters in the Gaussian sum algorithms,
Some algorithms were earlier suggested to as a rule, increases in time and it is a drawback of
solve this problem. The algorithms using the such algorithms. Different methods are used to
partitioning method are most known among them eliminate this drawback. Thus, Gaussian
(Lainiotis, 1976). The main point of this approximation of a PPDF was periodically
algorithm is separation of a discrete set for the carried out in (Dmitriev and Shimilevich, 1979).
subvector e and representation of a posteriori In (Yarlicov and Mironov, 1993), the number of
probability density function (PPDF) for the vector KF is fixed that is acceptable when the forcing
Xi as a weighed sum of Gaussian densities noises in the dynamics equations are small. It is
conditional to e. The parameters of these obvious that in the general case the efficiency of
such algorithms may decrease when these
conditional densities are sought through the use of
methods are used.
a bank of n-dimensional Kalman fIlters, each of
There is another algorithm which can be used
them corresponding to a fixed value of e The
to estimate a constant vector. It is the so called
values of weighting coefficients determined reduced sufficient statistics fIlter (Anderson and
through the use of the same KF are proportional
Itlis, 1996; Kulhavy, 1992). This algorithm
to a PPDF for the subvector e at the points ej . approximates the PPDF through the use of a
The disadvantage of such algorithms is that weighed sum of a fixed number of the basis
despite some peculiar methods used, the number function (partial densities). It is essential that only
of KF must be great enough in order to provide the weighting coefficients change in time while the
their efficiency (Maybeck, 1982). It is a parameters of the partial densities do not change
consequence of the substitution of a continuous (Anderson and Itlis, 1996). In order to achieve
363
«good» approximation it is necessary to increase boundaries if the integration domain is not
the number of partial densities. Such approach specify. Thus, the solution of the estimation
cannot take sufficient account of the local problem is reduced to determination of integrals
properties of nonlinear functions either. (1.4)-(1.7).
In this paper an algorithm based on piecewise-
Gaussian approximation of a posteriori density,
free of the drawbacks mentioned above, is applied 3. PIECEWISE-GAUSSIAN
to a particular case of the flltering problems APPROXIMATION OF A POSTERIORI
discussed here. DENSITY

Let us derive piecewise-Gaussian


2. STATEMENT OF THE PROBLEM approximation for a posteriori density
f(8,x; / Yj)which can be used to develop
Let us assume that the discrete-time flltering economical, from the viewpoint of computation,
problem of the vector Xj = (8 T ,xT)T described algorithms for determination of integrals (1.4) -
by equations (1.7). The expression for a posteriori density of the
8i =8; (l.l) vector (8 T ,xi?, in which Xj = (XJ. .. XJ)T is
written a s \.
Xi =<I>,xi-l + \jI,(8)+ Wi (1.2) f(O,xj /yj~~cf(O)f(xj /8)f(Yj /8,xj)
by the measurements (2.1)
where c is the normalization constant.
Yi =s,(8)+H,xi + Vi (1.3) Let us introduce the domain 0 of the most
is to be solved. probable values of 0 , i.e., the domain in which a
In expressions (l.l) - (1.3) 8, X;. are r- and n- priori density [(8) is substantially different from
dimensional vectors; \jI j (8) H s,(8) are nand m- zero. Now, subdivide 0 into M noncrossing
dimensional nonlinear functions; <I>j ,H j are the similar domains oj
known matrices of a corresponding dimension; w;. M
Vi are mutually independent zero-mean Gaussian Uoj =0 . (2.2)
noise sequences with known covariance matrices j=l
Qj and R j , respectively, which are all
The oj are chosen so that within each
independent of the initiial state Xo, namely subdomain it would be possible to provide the
f(Wj) = N(wj;O,Qj), f(vj) = N(Vj;O,Rj) , acceptable accuracy of the linearized description

E{WjV~} = O,E{WjxJ} = O,E{vjxJ} = 0. of the functions \jI j (8) and s,(8) in the following
form:
The vector xo is also assumed to be zero-mean \jIi(0)~\jIi(8{)+~{(8-8{); (2.3)
Gaussian, i.e. f(xo) = N(xo;O,Pt). From here
s,(8) ~ sj(8{) + Sf (0 - 8{) (2.4)
on N (0; li, A) is used as a notation for the where
density of the Gaussian vector a with the mean
li and covariance matrix A. Special consideration
must be given to the peculiar character of the
stated problem due to the fact that equations
(1.2), (1.3) are nonlinear only with respect to the
vector 8 while the dependency on Xi is linear.
Let us fmd the optimal (minimum variance)
Bayes estimates. These estimates and the
corresponding conditional covariance matrices of
their errors in the problem under consideration
are known to be defmed by the following relations with the elements
(Sage and Melse, 1972) :
representing the line number; t = L n - the column
Xj=JX;/(Xj/Y;)dx;, (l.4)

Sj = JOf(O / y;)d:l , (1.5) number; mxr matrix si. = 8s (8!) is defmed in a


j

~T
I

p;x J x;)(x - x;) f(x; /


= (x; - T y;)dxj ,(1.6)
similar way. The last two multipliers of expression
(2.1) may be represented as

p;e J(8-8 )(0-8;) f(8/y;)d:j,


= ~
j
~ T
(1.7)
M
f(xj /8)f(Yj /8,xj) =

where y; = (yl- .. yJ)T. = LY j(8)f j (Xj / 8)f j (y j / 8,x;) (2.5)


Hereinafter we assume the differentials to be j=l
the correspondent product of the vector
components, and the integrals to have infinite where

364
8 eo/ fIlters (Sage and Melse, 1972) with due account of
8eQi" the form of matrices &( ,ii/ given in (2.12),

and Ij(xj 18), Ij(Yj 18,Xj) are the (2.13) and the form of matrices Q;, Po defmed as
densities corresponding to (1.2), (1.3) derived by
the use of (2.3), (2.4). Let us substitute expression
(2.5) into (2.1) and then multiply and divide each
Q=[O Q}Po =[~. ~1
summand by the coefficient c£ deflned as From expression (2.6) and the representation
j j j
c£ =lj(Yj)= H 1(8)/j(8,x;)/j(y;l8,x;)djdx j
I (Yj) = I (y; I Y;-I)l (yj I Yi-l)'" Ij (Yl)

(2.6) it follows that the recurrent relations


Then for 1(8,xj I Yj) the representation . . exp{-D,5(v{)T(15/)-l v f}
M cl = cl_ 1 m /2 - ' 1/2 (2.14)
1(8'Xi / Yi) = c ~> j(8'1:( 1 j (8, xi I Yi )(2.7) 21t IDII
j=1 where
in which vI =Yj -ii/x!/;_l -sj(8{)+S!8{;(2.15)
1;(8,Xi /Yj)= 1(8)/j(x; 1 8 ),.lj(Yj 1 8 ,Xj),
cl
15/ = ii/pl;_l(ii()T +Rj (2.16)
k
can be used for evaluation of c{
j = 1. M will hold true.
It is clear that these densities are Gaussian The normalization constant in (2.8) can be
densities as they correspond to the j-th linear defmed as
M
description of the functions 'V j (8) and si8) in the
form of (2.3), (2.4) with the Gaussian character of
/ = ffLY /8)c/ Nj(Xi;,tj,P;j)d)tttj (2.17)
j=1
the density 1(8). After integrating expression
j For integration in (2.17) let us represent
(2.7) with respect to Xj-l for I (8, x; I Y;) it is ·· Nj(-
dens111es x ; ·,x::'jp-j)
j ,; as
not difficult to derive
M
1(8,x; I Yj) = c; LY j(8'1:( N j (x; ;ij,p/) (2.8)
N j (Xj;i/ ,1'/) = 1 j (Xj I 8,y; )1 j (8 I Yj) (2.18)
j=1
where
where c; is the normalization constant, • • A ' 9 '
11 (8 I Yi) = N 1 (8;8f ,(Pj )1), (2.19)
Nj(xj;ij,p/) are the partial Gaussian c
densities corresponding to the j-th description of Ij(xj /8,Yj) = Nj(Xj;x( (8),[(l}X)jr )(2.20)
the functions 'V j(8) and si8) with are the Gaussian densities whose parameters are
easy to fmd by the following rules of
i j = (eT ,xT)T and determination of conditional Gaussian densities
_. [(P9)j (P9X)j] (Sage and Melse, 1972)
P/ = / / (2.9) x(8)=x( +(l}x9)j[(l}9)jrl(8-e{); (2.21)
/ (pr)jr (p/)j
The parameters of these partial densities can [(l~X)jrC = (P/)j _ (ljx9)j[(lj9)jr 1(lj9X)j
be found using the bank of KF for the n+r-
dimensional state vector Xj. The equations for (2.22)
Taking into account (2.18) and integrating in
those KF are detennined as [8]: (2.17) flrst with respect to Xi and then with respect
::'j -::'j K-j( H-j::'j (j . 8 j) to 8 it is not difficult to derive the following
Xj - x j / i _ 1 + j Yj - i x j / i _1 - si 8 j ) + si i
piecewise-Gaussian approximation for a
(2.10)
posteriori density
M
(2.11) I(x; I Y;) = L ~{(8)N j (X; ;i/ ,i/) (2.23)
where j=l

(p{ = [~j Orxn] (2.12)


in which III (8) are determined as
'Vi <l>j .
1l~(8)= ~j'
{j 8eO,l/
(2.24)
ii/=[s/ H;]. (2.13) / 0, 8eO,l/
The gain coefficients i / and the co variance where

matrices p/ are determined by the using ,,1


..-/
= C!!~ ciai
/ ~ / I'
(2.25)
standard covariance equations for the Kalman j=l

365
with cl evaluated in accordance with expression M
P/ = LJ.l{ {(l~X)i - (Pi»1)i[(p;9)ir1(Pi9X)i +
(2. 14), and al are found by the following i=l
expression + xl (X/>T + xl (b/)T [(Pi9 )ir1(p;9X)i +
al = f / i (8 / Yi)43 (2.26) + (p;xe)i(p;9)ir 1b/ (X/>T +
nj + (p;xe)i[(p;9)ir1J/ [(p;9)i](p;9X)i} - xixl.
Note that here the weighting coefficients J.l{ do (3.10)
not satisfy the nonnalization conditions, which is Hence we have derived a set of relations (2.14)
a consequence of the representation of the density - (2.17), (2.23) - (2.26), (3.3) - (3.10) which provide
as separate 'pieces' of partial densities. for evaluation of the parameters of piecewise-
Gaussian approximation of a posteriori density,
the corresponding estimates ei,Xi and the
4. ALGORITHM FOR CALCULATING
ESTIMATES AND COVARIANCE covariance matrices Ij9 , Ijx .
MATRICES The bank of n+r-dimensional KF (2.10), (2.11)
generated by th~ linearized description of the
Now let us fmd the expressions for the functions "'i(6),'Si~)in the fonn of (2.3), (2.4) is
estimates Xi = (8 T,XJ)T relying on the the basis of the algorithm. In this bank the partial
representation resulting from (2.23) fllters allow for the evaluation of the the partial
M estimates e{, xl and the co variance matrices p/
e = LfJ J.l{6Ni(Xi ; &/,~j)d3dxi '
j (3.1) which appeared in the relations brought above.
i=l The values v{ and the corresponding covariance

"II J.li Xi N
M
j ~j ,P;
- i
matrices jjl required in the determination of the
Xi - L.
A -
J. (Xi
- '. Xi )43dxi · (3.2)
i=l multiplier cl in expression (2.25) for the

Let us introduce the following notation: weighting coefficients J.l{ are also evaluated in
hI = f 8/ (6 / Yi )d3 ;
j (3.3) these ftlters according to f onnulas (2.15), (2.16).
nj
5. PECULIARITIES OF THE
d/ = f86T/i(8/Yi)43; (3.4) ALGORITHM
nj
It is very essential that in piecewise-Gaussian
"b/ = J(8 - e{)/ i (6 / Yi)d3 ; (3.5) approximation of a posteriori density proposed
n) the linearized representation of functions is only
used in bounded domains, with 6 E 0.1 . So
J/ = f (8 - e{)(8 - e{)T /j (8/ Yi)d3 (3.6) changing the size of these domains it is possible to
nj achieve a high accuracy in the description of a
posteriori density in each of them, and,
Taking into account expressions (2.18)-(2.22), consequently, the accuracy in the description of
(2.26), (3.3) - (3.6) and integrating in (3.1), (3.2)
the density for all the values 8 En as well. It is
with respect to Xi' and then to 8 it is not difficult
for the estimates to derive the following this circumstance that distinguishes this algorithm
expressions: from the algorithm in which a posteriori density is
M
approximated through the use of the relation
e = LJ.l{hl
i (3 .7)
M
/(Xi / Yi) = LJ.l{ (8)N i (Xi;&/ ,i'/) (4.1)
i=l
i=l
M similar to (2.23) but here the equation
= L(xlal + (p;9X)i[(p;X)ir 1b/) .(3.8)
Xi
i=l J.l{ = c( If cl
I i=l
The similar manipulations perfonned in is used to determine theweighting coefficients.
fmding co variance matrices p;9, p;x will result in Such approximation of a posteriori density can
be derived if, for example, the approximate
representation of a posteriori density is assumed
to have the fonn of
M
(3.9) /(8)=_1 L
Nj(e;e{,p9)
M J=
. 1

366
and in evaluating weighting coefficients each j-th where Vol(n j ) is the same for all n j size of
linearized description is assumed to be true over
the whole domain of probable values. But as it is e
domains; j is a certain point belonging to nj.
not true in reality, the algorithm resulting from
approximation (4.1) may introduce significant
In particular, the point e j may be, for example,
represented by the point
errors even when n j is small (see condition (4.2)
below). Note that poly-Gaussian approximation ej = arg m~ f j (8 / Yi)
eeo'
proposed in (yarlicov and Mironov. 1993) used
for flltering problems like (1.1) - (1.3) is reduced, Here it is important to notice that inaccurate
in fact, to approximation (4.1) and thus, the evaluation of ~f at the present step by no means
algorithm resulting from it also has the drawback
pointed above. affects the accuracy in determining the weighting
coefficients at the subsequent steps. In other
In view of the bounded size of n j in which the words, when going from one step of the
linearized description holds true it is necessary to processing to another the errors are not
evaluate the integral for the bounded domains in accumulated and that is undoubtedly an
accordance with formulas (2.26), (3.3), (3.4). It is advantage of this algorithm. The described
not difficult to see that the vector li/ and the simplified scheme of the realization of the
algorithm proves to be the most effective in the
matrix (i/
are represented by of ,bf ,d/ and they cases when the density being multimodal at the
themselves are only introduced in order to make initial stages of estimation, changes to unirnodal
the records of the resulting relations (3.8), (3.10) density,
simpler. Evaluation of the integrals like these Some notes should be made about the choice
involves difficulties and finally makes the of the points 8f , where the functions \If i (8) ,
algorithm for finding estimates and covariance
Si (8) are to be linearized. One of the possibilities
matrices more complicated. At the same time it is
necessary to emphasize the following points. As a is a fixed allocation of the points in each n j for
rule, in practice the problems under discussion all i= I, 2.... For example, the centers of the
show high a posteriori accuracy in estimation of j
corresponding domains n may be chosen as 8! .
the parameters q . If nf is taken to designate the In another variant the values of the prediction of
domain of the values of a posteriori density J(8/y j) 8 generated in each partial ruter are taken as
substantially different from zero, then starting
from a certain i* the following condition is usually e! as it is done in the extended KF (Sage and
met: Melse, 1972). As e is constant, the values of the
nf EnD' i>i* (4.2) prediction will simply be the same as the partial
where n D is the domain in which the estimates e{_l. In spite of some definite
approximation of the form (2.3), (2.4) provides an advantages of this choice of linearization points
acceptable accuracy in representing the functions due to the advantages of the extended KF in
'V; (8), Si (8). Dividing the a priori domain n so comparison with the linearized KF, it suffers one
serious disadvantage, namely, in certain situations
that nj in (2.2) could satisfy the condition all the estimates may come together in a single
n j ~ nD (4.3) partial domain. Thus, practically, the subsequent
steps will make use of a single linearization point,
it is not difficult to understand that with i>i* a
in fact, a posteriori density will be replaced by its
posteriori density will practically be the same as
Gaussian approximation. It is essential that that
the Gaussian density generated by one of KF
may occur before the condition (4.2) is satisfied.
whose weighting coefficients become dominant.
In its turn that may introduce significant
The number of this ruter designated by j*, the
approximation errors. Taking into consideration
expressions for the sought estimates and
the above said it seems appropriate to use a
covariance matrices with i>i* may be written as
certain intermediate variant in choosing
~
8; 8r
~ ~ '. ,x;
~ ~ '.
~ xl ; P;e ::: (P;e )1'. , p; ~ P;1'. . linearization points. In this variant the estimates
In this case it is clear that , in fact, the generated in the partial ruters at the previous step
problem is reduced to fixing the moment when the are assigned to be the linearization points if the
weighting coefficients of one of the fIlters become estimates themselves are not beyond the bounds
dominant and to location this fIlter so that it of nj. If this condition is not met linearization is
could be used later. To realize this simplified
version of the algorithm it will suffice to evaluate performed in a certain, close to the e{ point
the values ~f represented according to (2.25) by belonging to the boundary of the domain n j .
The proposed algorithm based on piecewise-
c{ and oj. For calculation oj it is also possible Gaussian approximation can be used in practice
to use approxime formulas of the type to solve various problems for which this model
proves to be acceptable. For example, it is good
for describing the problem of inertial system
alignment when the level of the course errors is

367
very high (Dmitriev, et al. 1997). This problem is Hollowell J. (1990). HELI/SITAN. A Terrain-
peculiar because of the linear character of the referenced Navigation Algorithm for
equation for measuring (1.3) with nonlinearity Gelicopters. Proceedings IEEE, PLAN'S-90,
only in Eqs. (1.2). Models (1.1) - (1.3) are also 616-625.
suitable for terrain aided navigation problem Hostetler L.D., Andreas R. D. (1983) Nonlinear
(Hollowell , 1990; Hostetler, 1983; Shorsi and Kalman Filtering Techniques for Terrain-
Bar-Itzhack, 1995) when errors of a navigational Aided Navigation. IEEE Trans. on Automatic
system or a trajectory of a vehicle are described by Control, V.AC-28, N3, 315-322.
quasidetenninistic sequences. Application of the Kulhavy R. (1992). Recursive Nonlinear
considered algorithms to the solution of this type Estimation: Geometry of a Space of Posterior
of problems turns out to be the most expedient Densities. Automatica, Vol. 28, 313-323,.
when, on the one hand, the function Lainiotis D.G. (1976) Partitioning: A Unifying
'Vi(9),Si(9) in the domain of a priori uncertainty Framework for Adaptive Systems, I:
is not highly variable and it is possible to restrict Estimation. Proceedings IEEE, Vol. 64, N 8,
ourselves to a small number of summands in 1126-1140.
representation (2.30) and, on the other hand, Maybeck P.S. (1982) Stochastic Models,
when one summand is not enough to provide the Estimation and Control. Vol. 2. Academic
effective work of the extendend KF. Press. New York.
Sage A.P., Melse '·J.L: (1972) Theory Estimation
wich Application '. .Co Communication and
REFERENCES Control. McGraw-Hill, N.Y.
Shorsi G., Bar-Itzhack I.Y. (1995). Satellite
Alspach D. L., Sorenson H. W. (1971). Recursive autonomous navigation based on magnetic
Bayessian Estimation Using Gaussian Sum field measurements. Journal of Guidance,
Approximations. Automatica, Vol. 7, N4, 465- Control, and Dynamics, Vol.18, N4, 843-850.
479. Dmitriev c.P., Shimilevich L.I. (1979). Extended
Anderson K., Itlis R. A (1996). Distributed Kalman Filter Using Repeated Linearization
Bearing-only Tracking Algorithm Using and its Application for Terrain-aided
Reduced Sufficient Statistics. IEEE Trans. On Navigation. Automatica and Telemechanics,
Aerospace and Electronic Systems, Vol. 32, N4, 50-55 (in Russian).
NI,339-349. Yarlicov M.S., Mironov M.A. (1993). Markovian
Dmitriev S. P., Stepanov O. A., Shepel S. V. estimation theory of random processes. Radio
(1997). Nonlinear Filtering Methods and Communication, Moskow. (in Russian).
Application in INS Alignment. IEEE Trans. on
Aerospace and Electronic Systems, Vol. 33, NI,
260-272.

368

You might also like