Download as pdf or txt
Download as pdf or txt
You are on page 1of 6

SET THEORETIC SIGNAL PROCESSING*

P. L. Combettes
Department of Electrical Engineering
City College and Graduate School
City University of New York
New York, NY 10031, USA

ABSTRACT 0 Signal reconstruction: h is the original form of a


signal which is partially known, e.g., from Fourier,
In signal processing, estimation and design problems Radon, or Wavelet transform information.
have traditionally been solved via optimization meth- e Spectral estimation: h is the spectral distribution
ods. Such an approach often leaves little flexibility to of a random process which has been observed over
incorporate constraints. In the set theoretic frame- a finite support.
work, each constraint on the problem is associated
with a set in the solution space and their intersec- On the other hand, in design problems, the objective
tion represents the set of solutions. The main feature is to create an object h that satisfies certain specifica-
of this framework, which places emphasis on feasibility tions. Let us mention in particular:
rather than optimality, is to provide great flexibility in
the incorporation of statistical as well as nonstatistical 0 Filter design: h is the set of coeflhcients of a filter
constraints. In signal processing, set theoretic meth- (or filter bank).
ods have been applied to several classes of estimation 0 Pulse design: h is a pulse shape for a digital com-
and design problems. In this paper we give a general munication system.
overview of the field and discuss recent advances. We
conclude the paper with areas of on-going research and 0 Window design: h is a window for data analysis.
open problems. 0 Beamforming: h contains the parameters of an ar-
ray of sensors that produces a desired beam pat-
INTRODUCTION tern.

Most signal processing problems fall into two main On an abstract level, estimation and design problems
categories: estimation and design problems. In esti- can be reduced to the synthesis of an (object in a space
mation problems, the goal is to produce estimates of Z according to some criterion describing what is, in the
an object h from the measurements of signals physi- user’s belief, a good solution. Traditionally, the crite-
cally or mathematically related to it and some a priori rion has been optimality with respect to some objec-
knowledge. Such problems include: tive. The systematic quest for optimal solutions, which
is now well rooted in the scientific culture, originated
System identification: h characterizes a system, in the late 1940’s. It has been motivated to a large
e.g., impulse response, coefficients of a parametric extent by the conjunction of technological advances in
model (ARMA, Volterra, etc.), poles and zeros. computing machinery as well as progresses in branches
Bearing estimation: h is a vector containing the of applied mathematics such optimization theory, nu-
directions of arrivals of sources impinging on an merical analysis, and statistics. Naturally, optimal es-
array of sensors. timators and designs also rule in signal processing and
we can mention in particular the prominence of least-
Signal restoration: h is the original form of a signal squares filter designs, maximum likelihood estimates
which has been blurred and corrupted by noise. in system identification or array processing, maximum
entropy spectral estimates, Bayesian signal recoveries.
*This work was supported by the National Science
Foundation under grant MIP-9308609. Optimal estimation and design procedures have un-
doubtedly provided satisfactory solutions in a wide

- 1 -
range of applications. However, a reservation that can ther ignored or of secondary importance. Its main fea-
be formulated vis-&-vissuch approaches is that the cri- ture is to provide great flexibility in the incorporation
terion of optimality is inherently subjective and that of statistical as well as nonstatistical constraints. Of
different criteria may yield different solutions. More- course, the requirement for a single estimate is relaxed
over, the resulting optimization problem is usually not and the solution is a set of equally valid solutions. In
easily modifiable to incorporate whatever a p o r i in- this regard, it has been charged that the validity of set
formation or constraint may be available about the theoretic methods could be questioned on the grounds
problem. In practice, one tends to select an optimality that they do not yield a unique solution but a set of
criterion on grounds which are seldom related to ra- solutions and are therefore suboptimal. Let us first
tional and practical goals reflecting the specificities of remark that, although it may be gratifying to have ob-
the problem at hand. One generally favors standard tained the “optimal” solution, optimality claims were
procedures that provide, without too much effort, a seen to be subjective. At best, they result in a qualita-
solution. For instance, the maximum likelihood (ML) tive selection of a feasible solution. Moreover, from a
estimator has enjoyed widespread use on account of conceptual standpoint, demanding that only one solu-
its “desirable” properties such as normality and effi- tion be acceptable for problems which are notoriously
ciency. Nonetheless] the practical value of these prop- affected by uncontrollable factors (e.g. noise, uncer-
erties is quite debatable and, moreover, they are only tain data formation model) seems somewhat unwise.
asymptotic. In addition] the ML estimator may not Let us also note that methods which yield unique solu-
be defined or may lead to irrelevant results. Finally, tions are usually iterative and their solution depends
computational tractability generally imposes standard on the stopping rule. Certainly, such a criterion does
assumptions, such as normality of the random pro- not define a unique solution.
cesses involved, and leaves almost no room for the in- The set theoretic framework has been applied to nu-
corporation of a priori knowledge. On that last point, merous estimation and design problems in signals pro-
the Bayesian framework is better suited. However, it cessing, e.g., signal restoration, image reconstruction,
requires to model the object h to be estimated as the signal extrapolation, array processing, seismic signal
realization of a random variable to which a distribution processing, spectral estimation, system identification,
function is assigned (in most situations, there is how- deconvolution, filter design, window design, antenna
ever nothing random about h, which may for instance array design, pulse shape design [3, 4, 8, 17, 18, 21,
be a fixed but unknown physical constant). Moreover, 23, 261. More recent applications can be found in the
not all U przori information can be easily described in areas of geophysical signal processing [a], signal recon-
probabilistic terms and the resulting prior distribution struction from wavelet maxima [6], radar signal anal-
is usually too complex t o yield a tractable minimiza- ysis [19] and biomedical image processing [ZO].
tion of the resulting conditional expectation. At all
events, a Bayesian technique is fundamentally subjec- The purpose of this paper is to give a general overview
tive since the specification of the prior distribution and of the field of set theoretic signal processing and to dis-
of the loss function is left to the user. In the extreme, cuss recent advances in specific areas. We shall first
the user could reach any conclusion he desires through discuss set theoretic formulations and then the prob-
biased choices of these specifications. lem of the synthesis of set theoretic solutions. The
last section is devoted to areas of on-going research
In the set theoretic framework, the governing principle and open problems.
is to produce solutions which satisfy all the constraints
about the problem [8]. These constraints arise from SET THEORETIC FORMULATIONS
data measurements and a priori knowledge in estima-
tion problems, and from technical specifications in de- Let ( @ i ) i e ~be the family of constraints available on
sign problems. Each constraint 9,defines a so-called the problem and let Z be the solution space. According
property set Si in the solution space Z, namely to (l),these constraints define a collection of property
sets (Si)iEr in Z. The pair (E,(Si)iEr) is called a set
S; = { U E Z [ a satisfies qi}. (1) theoretic formulation. In this set theoretic format, the
problem can be stated as
The problem is then to produce a feasible solution,
i.e., to find a point in the intersection of all the prop- Find u E S = n S , . (2)
erty sets. The set theoretic framework departs radi- ;€I
cally from the conventional approach which relies on It should be noted that the set theoretic formulation is
optimal decision theory and in which feasibility is ei- not unique since several options are usually available

- 2 -
to select the solution space, For instance, the identifi- 0 The smallest feasibility set should be obtained in
cation of an autoregressive system can be carried out (2) to guarantee the quality of solutions.
in the space of regression coefficients or in the space of
0 Since each set is a confidence region, the solution
reflection coefficients; a signal deconvolution problem
set S should contain the true object h with a rea-
can be posed in the time domain or in the frequency
sonable level of confidence.
domain. In general one chooses the solution space in
which the available constraints are most easily repre- As in [lo], let us note that in the caste of deterministic
sented and give “nice))sets, in particular convex ones. information, the more sets the bett,er since the fea-
In the case of deterministic constraints, the construc- sibility sets becomes smaller in (2). However, in the
tions of sets is fairly straightforward. For instance, if case of statistical information, this iis not necessarily
one wants t o synthesize a signal whose energy does not true. Indeed, for a given probability that S contains
exceed a bound E , the corresponding set is the closed h , the confidence level on (and therefore the size of)
ball the individual sets much increase as tjhe number of sets
si
= { a E 2 I lla112 5 E } . (3) increases; in some cases intersecting more larger sets
If the signal is constrained to have band-limit ] - B , B [ , may not yield a smaller intersection than intersecting
the corresponding set is the closed vector space fewer smaller sets, This leads to the rather counter-
intuitive conclusion that some statistical constraints
Si = (U E 8 I (VY E R) ( F u ) ( v )= 0 if [VI 2 B}, may have to be discarded in order to obtain a small
(4) set of solutions at a given probability that h E S .
where F denotes the Fourier transform operator. Like-
wise, in the design of an N-tap low-pass filter, the lin- FEASIBILITY PROBLEM
ear phase constraint leads to the subspace
= { U E R~ 1 (vi E { I , + - . , N ) )ai = aN+I-i}. Overview. Solving a signal processing problem with
set theoretic formulation (Z, ( S i ) i E I amounts
) to solv-
(5) ing the feasibility problem (2). This basic problem
In signal processing problems, stochastic information
is often available in the form of probabilistic attributes arises in many branches of applied mathematics and
of the uncertainty that surrounds the data formation
it has therefore a long history [5, 8, 151. Because it
model, e.g., modeling errors or measurement noise. can usually not be solved in one step:, most feasibility
These attributes can be amplitude bounds, moments, algorithms are iterative:
mixed moments, distributions, spectral and polyspec- 0 Projection methods [l,8, 111: a,+l is computed in
tral properties, etc. In this case, the constraint \ki terms of the simultaneous projections of a, onto
takes the form of a statistical test and the resulting some of the sets. Although such methods exist for
set is a confidence region in the solution space [8]. As metric spaces, they have been most developed in
an example, suppose that the observed data t are re- the context of a Hilbert space in, which the sets
lated to the true object h via the model (Sj)iEr are closed and convex.
2 = T ( h )+ U , (6) 0 Subgradient projection methods [l, 51: the sets
where T is a known operator and U the uncertainty. take the form Si = { a E Z 1 $ $ ( U ) 5 0}, where
For a probabilistic property 4
i of U , the set of interest gi is a convex functional and 2 a Hilbert space. At
is’ iteration n , the set Si is activated lby projecting a,
onto the hyperplane Hi(u,), which is the intersec-
Si = { a E B I 2 - T(a) is consistent with Qi}.(7) tion of E with a hyperplane of E x IR tangent to
The problem is then to determine the confidence level the graph of g; at the point (U,,, g,i(a,,)).
to be placed on each such set. As discussed in [lo], 0 Minimization method: In a Banach space Z, the
this involves three conflicting objectives: feasibility problem (2) can be posed as that of min-
imizing a functional CP whose set (of minimizers is
0 Since the complexity of algorithms for finding a contained in S. For instance, if I is finite, take
feasible solution increases with the number of sets, ( w i ) i e I C]O, 11 with CiEr
w; = 1. Then the func-
few should be used. tional
~

‘In t h e so-called bounded error models, the information


on takes the form of amplitude bounds and yields simple
U
sets when T is linear [14, 22, 251.

- 3 -
admits S as a set of minimizers (d denotes the point in S for every a0 E Z (strong convergence also
distance of 5 ) . Another example is the functional holds if additional conditions are imposed on the sets)
[8]. A straightforward extension of (10) is obtained by
(VU E E) @ ( a ) = sup d(a, Si). (9) introducing relaxation coefficients (A,),?o in [ E , 2 - E ]
iEJ
(0 < E 5 1) as follows
Standard optimization methods can then be used
to solve (a), depending on the properties of Q, (con- (Vn E N) an+l = an + L ( P i , ( a n ) - an). (11)
vexity, continuity, subdifferentiability) and of E. In other words, instead of taking the exact projec-
Random search methods [13]: For 5 = Rk,this tion P;,,(an)of a, onto Sin, one takes a relaxed one,
method consists in generating points at random i.e., a point an+l on the segment between a, and
over an adaptively reduced search region until a its reflection 2P;,,(a,) - a , with respect to Sin.If
feasible point is found. The main advantage of 1 < An < 2, u,+l is called an overrelaxation, if A, 1,
this Monte Carlo approach is its basic simplicity a,+l is called a projection (unrelaxed iteration), and,
and the lack of assumptions about the geometrical if 0 < A, < 1, a,+i is called an underrelaxation. Re-
complexity of the sets. Because of computational laxed POCS (11)possesses the same convergence prop-
cost, it is not appropriate for large scale applica- erties as (10).
tions. Despite its widespread popularity in signal processing
Approximation methods: One builds a sequence applications, POCS has three major shortcomingsn2
(Sn),yo of supersets of the feasibility set S con-
0 Serial structure: A salient feature of POCS is its
verging in some sense to S . This approach has
serial algorithmic structure: at each iteration only
been taken is certain problems with bounded error,
where the only information used to construct prop- one of the sets can be activated. Clearly, such a
erty sets is a magnitude bound on the perturbation structure does not lend itself naturally to an im-
of the system (noise, modelling errors) 114, 22, 251. plementation on an architecture with parallel pro-
cessors.
The feasibility set is refined at each new observa-
tion of the data process and S, is usually taken to 0 Inconsistent formulations: In some problems, the
be a bounding ellipsoid or hyperparallelepiped. set theoretic formulation may be inconsistent m
a result of two or more incompatible constraints.
Recent developments. The theory of projection al- In such instances, it would be desirable to obtain
gorithms has been the focus of a lot of research recently an approximate feasible solution. On that score,
in the case when ( S i ) i c ~are closed and convex subsets POCS falls short. First of all, it is clear that
of a Hilbert space 2. Throughout this section, Z has (11) cannot converge if S = 0 since the tail of
-
scalar product < .(.>, norm (1 (1, and distance d. A the sequence will keep oscillating from one set to
sequence (an),20 C E is said to converge to a E E the next indefinitely. At best, the periodic subse-
- - converges t o zero and weakly
strongly if (Ilu,, a((),>o quence (amn+i)n>o (1 5 i 5 m) converges weakly
if, for every b in Z, (< a,-alb >)n>O converges to zero. to a point in Si. This solution is unsatisfactory
Strong convergence implies weak convergence and, in since it is guaranteed to satisfy only the ith con-
finite dimensional spaces, the converse is true. straint, which constitutes a poor approximation of
Let P;(a) denote the projection of a point a onto Si, a feasible solution.
-
i.e., the unique point in Si such that ] ! U Pi(a>ll = 0 Speed of convergence: A long recognized problem
inf(lla - bl\ 1 b E Si}. In most signal processing appli- with POCS is its slow convergence. Conceptually,
cations, the method of successive projections (POCS) POCS can be accelerated by properly relaxing the
projections at each iteration. Unfortunately, even
(Vn E N) a,+l = Pi,(a,), i, = n(modm)-t-1, (10)
for simple set theoretic formulations, there is no
where m is the total number of sets (assumed finite systematic method for determining so as
here) has been used to solve (2). In this method, the to speed up the iterations in (11).
sets are activated cyclically: a1 is the projection of 'Alternative serial projection schemes have been de-
a0 onto Sl, a2 is the projection of a1 onto Sz, ..., am vised in which the sets are activated in a noncyclic manner
is the projection of a,-1 onto Sm, a,+l is the pro- [5, 81. They will not be discussed here since they have
jection of a, onto SI, ..., akm+i is the projection of rarely been used in signal processing and have essentially
ukm+;-1 onto Si, etc. POCS converges weakly to a the same inherent shortcomings as POCS.

- 4 -
Recently, several authors have investigated the follow- Since the extrapolation parameter L, in (15) is
ing method of parallel projections (MOPP) that over- never less than 1, the condition A, < 2 is by-passed
comes the shortcomings of POCS [l, 9, 11, 121 and large overrelaxations are allowed. These ex-
trapolated relaxations contribute to significantly
(vn E N) a n + l = an + An(xwi,npi,n(an)-an), accelerating convergence to a feasible solution in
;€I, numerical applications. Thus, (14) was found to
( 12) accelerate convergence by sever#alorders of magni-
(0 < E 5 1) and
where (An)n,o C]E,+a[ tude in [12].
C wi,n = 1 and (Vt E I ) Wi,n > 0. (13) NEW DIRECTIONS
iEI,

Practically, iteration n of MOPP in (12) is performed In this section we point out certain areas of on-going
as follows. First, one selects the sets to be activated; research as well as open problems that deserve further
In contains the indices of these sets. Then, for every i investigation.
in In,one computes the projection P;(on) of a,, onto Alternative set theoretic frameworks. In the
Si and forms any strictly convex combination cn = above framework, we use sets to represent informa-
xiGI* wi,,Pi(a,) of these projections. The position tion and classes of signals. There m<aybe more flexible
of the new iterate a,+l on the ray emanating from a, ways to represent and exploit information, especially
and going through c, is determined by the relaxation when it is imprecise or inaccurate. The first alterna-
parameter A,. By comparing (11) and (12), it is clear tive set theoretic framework that cornes to mind is that
that POCS is a particular case of MOPP, obtained by of fuzzy sets [16], which has already been used in [7].
letting I, = {n (mod m) 1). + A conceptual concern with this approach, however, is
The superiority of MOPP over POCS is threefold: that it attempts to model inaccurate information with
an exact mathematical being, namely a membership
Parallel structure: at each iteration, the projec- function. Another concern is that it essentially brings
tions onto the selected sets can be computed si- the problem back to the optimization of some aggre-
multaneously on concurrent processors. It should gation of membership functions, the very type of sub-
also be pointed out that MOPP is more flexible jective problem we were trying to get away from in the
than a method in which all the sets would be acted first place. More general set theories could be investi-
upon at each iteration (i.e., In = I in (12)). This gated such as semi-sets theory [24].
allows a better distribution of the computational Optimal use of information. In many problems a
load among the processors. lot of constraints are available, especially of the statis-
Inconsistent problems: It is shown in [9] that when tical nature. It is usually unclear, however, how these
S = 8,(A,), c [ ~ , -2 E], and the weights are constraints are interrelated and how to select those
constant over the iterations, say wi,n = w;,then that will prove the most efficient in effectively reduc-
MOPP converges weakly to a point that minimizes ing the size of the solution set. This problem should
@ in (8), that is, a weighted least-squares solution be studied further.
to the inconsistent feasibility problem. From a sig- Algorithms. With MOPP, exact projections must
nal recovery point of view, such a solution is clearly be computed at each iteration. In many applications
more acceptable and useh1 than that generated by this constitutes an obstacle as exact projections can
MOSP, whose properties are elusive. be difficult if not impossible to compute. We are cur-
Convergence: As shown [12], MOPP converges for rently investigating algorithms in which the exact pro-
relaxations in the adapted range jections can be replaced by approximate ones a t each
iteration. This approach is expected to bring together
(Vn EM) E 5 An 5 ( 2 - ~ ) L n , (14) projections methods as well as subgradient projection
methods. Another point that deserves investigation
where the extrapolation coefficient Ln is defined
is the extension of MOPP to nonconvex problems, a
by context in which local convergence iresults should be
establishable.
Applications. Although set theoretic methods have
been widely used in certain areas of signal process-
ing such as signal and image recovery, they have only

- 5 -
enjoyed limited use in others such as spectral estima- [13] P. L. Combettes and H. 3. Trussell, “Set theoretic
tion a n d array signal processing. It should also be estimation by random search,” IEEE Transactions on
pointed out that the set theoretic approach has not Signal Processing, vol. 39, no. 7, pp. 1669-1671, July
been used in detection problems, which represent an 1991.
important field of applications besides estimation and [14] J. R. Deller, Jr., M. Nayeri, and S . F. Odeh, “Least-
design problems. square identification with error bounds for real-time
signal processing and control,” Proceedings of the
REFERENCES IEEE, vol. 81, no. 6, pp. 815-849, June 1993.
[15] F. Deutsch, “The method of alternating orthogonal
[l] H. H. Bauschke and J. M. Borwein, “On projection projections,” in Approximation Theory, Spline Func-
algorithms for solving convex feasibility problems,” tions and Applications, (S. P. Singh, Editor), pp. 105-
submitted for publication. 121. The Netherlands: Kluwer, 1992.
[a] I. T. Bjarnason and W. Menke, “Application of the [16] D. Dubois and H. Prade, Fuzzy Sets and Systems:
POCS inversion method to cross-borehole imaging,” Theory and Applications. New York: Academic Press,
Geophysics, vol. 58, no. 7, pp. 941-948, July 1993. 1980.
[3] 0. M. Bucci, G. Franceschetti, G. Mazzarella, and [17] H. Elmikati and A. A. Elsohly, “Extension of pro-
G. Panariello, “Intersection approach to array pattern jection method to nonuniformly linear antenna ar-
synthesis,” IEE Proceedings-H, vol. 137, no. 6, pp. rays,” IEEE Transactions on Antennas and Propa-
349-357, December 1990. gation, vol. 32, no. 5, pp. 507-512, May 1984.
[4] J. A. Cadzow, “Signal enhancement - a composite [18] N. E. Hurt, “Signal enhancement and the method of
property mapping algorithm,” IEEE Transactions on successive projections,” Acta Applicandae Mathemat-
Acoustics, Speech, and Signal Processing, vol. 36, no. icae, vol. 23, no. 2, pp. 145-162, May 1991.
1, pp. 49-62, January 1988. [19] 0. P. Kenny and B. Boashash, “Time-frequency anal-
[5] Y. Censor, “Iterative methods for the convex feasibil- ysis of backscattered signals from diffuse radar tar-
ity problem,” Annals of Discrete Mathematics, vol. gets,” IEE Proceedings-F, vol. 140, no. 3, pp. 198-208,
20, pp. 83-91, 1984. June 1993.
[6] A. E. Qetin and R. Ansari, ”Signal recovery from [20] S. Oh, C. Ramon, R. J. Marks 11, A. C. Nelson, and M.
wavelet transform maxima,” IEEE Transactions on G. Meyer, “Resolution enhancement of biomagnetic
Signal Processing, vol. 42, no. 1, pp. 194-196, Jan- images using the method of alternating projections,”
uary 1994. . IEEE Transactions on Biomedical Engineering, vol.
[7] M. R. Civanlar and H. J. Trussell, “Digital signal 40, no. 4, pp. 323-328, April 1993.
restoration using.fuzzy sets,” IEEE Transactions on [21] R. D. Palmer, 3. R. Crus, and D. S. Zrnid, “Enhanced
Acoustics, Speech, and Signal Processing, vol. 34, no. autoregressive moving average spectral estimation a p
4, pp. 919-936, August 1986. plied to the measurement of Doppler spectral width,”
[8] P. L. Combettes, “The foundations of set theoretic IEEE Transactions on Geoscience and Remote Sens-
estimation,” Proceedings of the IEEE, vol. 81, no. 2, ing, vol. 29, no. 3, pp. 358-368, May 1991.
pp. 182-208, February 1993. [22] F. C. Schweppe, Uncertain Dynamic Systems. Engle-
[9] P. L. Combettes, ”Inconsistent signal feasibility prob- wood Cliffs,NJ: Prentice-Hall, 1973.
lems: Least-squares solutions in a product space,” [23] H. J. Trussell and M. R. Civanlar, “The feasible so-
IEEE Transactions on Signal Processing, accepted lution in signal restoration,” IEEE Transactions on
for publication. Acoustics, Speech, and Signal Processing, vol. 32, no.
[lo] P. L. Combettes and T. J. Chaussalet, “Selecting sta- 2, pp. 201-212, April 1984.
tistical information in set theoretic signal processing,” [24] P. Vop5nka, Mathematics in the Alternative Set The-
these proceedings. ory. Leipzig: Teuber, 1979.
[ll] P. L. Combettes and H. Puh, “Iterations of paral- [25] E. Walter (Guest Editor), “Parameter identifications
lel convex projections in Hilbert spaces,” Numerical with error bounds,” Mathematics and Computers in
Functional Analysis and Optimization, vol. 15, nos. Simulation, vol. 32, no. 5-6, December 1990.
3-4, pp. 225-243, 1994.
[26] D. C. Youla and H. Webb, “Image restoration by the
[la] P. L. Combettes and H. Puh, “A fast parallel pro- method of convex projections: Part 1 - theory,” IEEE
jection algorithm for set theoretic image recovery,” Transactions on Medical Imaging, vol. 1, no. 2, pp.
Proceedings of the IEEE ICASSP Conference. Ade- 81-94, October 1982.
laide, Australia, April 19-22, 1994 (also submitted for
publication).

- 6 -

You might also like