Loupias 2000

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 4

WAVELET-BASED SALIENT POINTS FOR IMAGE RETRIEVAL

E. Loupias. N. Sebe’ S. Bres” J. -M. Jolion*


+ Leiden Institute of Advanced Computer Science * Laboratoire Reconnaissance de Formes
Leiden University, The Netherlands et Vision, INSA Lyon, France
nicu @ wi.leidenuniv.nl { loupias,sbresj olion ] @rfv .insa-1yon .fr

ABSTRACT detectors based on local derivatives [6,5]. Another


approach is based on local neighborhood properties [lo].
The use of interest points in content-based image retrieval Comer detectors are in general designed for robotics and
allows image index to represent local properties of the shape recognition and therefore, they have drawbacks
image. Classic corner detectors can be used for this when are applied to natural image retrieval.
purpose. However, they have drawbacks when applied to
various natural images for image retrieval, because visual Visual focus points need not be corners: visual
features need not be comers and corners may gather in meaningful feature is not necessarily located in a comer
small regions. In this paper, we present a salient point point. For instance in Figure 1, the fur is too smoothed to
detector that extract points where variations occur in the be detected by a comer detector such as Harris’ [5].
image, whether they are corner-like or not. The detector is
based on wavelet transform to detect global variations as
well as local ones. The wavelet-based salient points are
evaluated for image retrieval with a retrieval system using
texture features. In this experiment our method provides
better retrieval performance comparing with other point
detectors. ~ ~~~~

(a) Fox image (b) 100 corners (Harris)


Figure 1. Image with smoothed edges
1. INTRODUCTION Corners may gather in small regions: in various
natural images, regions may well contain textures (trees,
We are interested in content-based image retrieval in shirt patterns, etc.), where a lot of corners are detected (cf.
general image databases. The query is an image (iconic Figure 2). As the number of points is preset to limit the
search), and the retrieved images should be similar to the indexing computation time, most of the corners are in the
query. We assume that high-level concepts (objects, same textured region. .
feelings, etc.) cannot be extracted automatically from the
image without specific knowledge, and so use an image
similarity based on low-level features (such as color,
texture and shapes).
An image is “summarized” by a set of features, the
image index, to allow fast querying. Local features are of
interest, since they lead to an index based on local
properties of the image. This approach is also attractive (a) Dutch image (b) 100 corners (Harris)
for sub-image search. Figure 2. Image with texture in the Dutch dress
The feature extraction is limited to a subset of image
pixels, the interest points, where the image information is With comer detectors, both examples lead to an
supposed to be the most important [9,2,12,1]. This paper incomplete representation, where some parts of the image
focuses on the selection of points that are significant to are not described in the index.
compute features for indexing. For these reasons, comer points may not represent the
Comer detectors are commonly used for indexing most interesting subset of pixels for image indexing.
[9,12]. They are usually defined as points where gradient Indexing points should be related to any visual
is high in multiple orientations. This definition leads to “interesting” part of the image, whether it is smoothed or

518
0-7803-6297-7/00/$10.0002000 IEEE
corner-like. To describe different parts of the image, the The wavelet representation gives information about the
set of interesting point should not be clustered in few variations in the signal at different scales. In our retrieval
regions. context, we would like to extract salient points from any
From now on, we will refer to these points as salient part of the image where “something” happens in the signal
points, which are not necessarily corners. We will avoid at any resolution. A high wavelet coefficient (in absolute
the term interest points, which is ambiguous, since it was value) at a coarse resolution corresponds to a region with
previously used in the literature as corner. Wavelet high global variations. The idea is to find a relevant point
representations, which express image variations at to represent this global variation by looking at wavelet
different resolutions, are attractive to extract salient points. coefficients at finer resolutions.
Previous point detectors make use of multiresolution
representation. Chen et al. consider two different Since we use wavelets with a compact support, we know
resolutions to extract comers [3]. In image retrieval from which signal points each wavelet coefficient at the
context, contrast-based points are extracted in [ 2 ] . scale 2’ was computed. We can study the wavelet
However, a lot of points are also extracted in textured coefficients for the same points at the finer scale 2”’.
regions because these regions are contrasted. Points are Indeed there is a set of coefficients at the scale 2’+’
extracted with a specific wavelet in [I] but, since only a computed with the same points as a coefficient W2Jf ( n )
given scale is used, different resolutions features cannot be at the scale 2’ (see [ 7 ] for details). We call this set of
detected.
coefficients the children C(W2,f ( n ) ) of the coefficient
2. FROM WAVELETS TO SALIENT POINTS Wz,f( n ) . The children set in one dimension’ is:

The wavelet transform is a multiresolution representation


c(w,, f ( n > ) = f ( k ) , 2n s k I 2n + 2 p -I 1,
that expresses image variations at different scales. For where 0 I n < 2’N, with N the length of the signal and p
wavelet theory, see [ 8 ] . For wavelet description and the wavelet regularity.
algorithms, see [ 111.
A wavelet is an oscillating and attenuated function (its Each wavelet coefficient W2,f ( n ) is computed with
integral is equal to zero). We study the image f at the
2 - ’ p signal points. It represents their variation at the
scales (or resolutions) ?h,%, ... 2’, j~ 2 and j I -1. The
wavelet detail image W2,f is the convolution of the scale 2’. Its children coefficients give the variations of
some particular subsets of these points (with the number of
image with the wavelet function dilated at different scales. subsets depending on the wavelet). The most salient subset
is the one with the highest wavelet coefficient at the scale
2’+‘, that is the maximum in absolute value of
C(W2,f(n)). In our salient point extraction algorithm, we
consider this maximum, and look at its highest child.
Applying recursively this process, we select a coefficient
W 2 - l f ( n )at the finer resolution ?h(cf. Figure 4). Hence,
this coefficient only represents 2p signal points. To select
a salient point from this tracking, we choose among these
2p points the one with the highest gradient. We set its
(a) Cameraman image (b) Haar transform saliency value as the sum of the absolute value of the
wavelet coefficients in the track:
Figure 3. Haar transform
-I
Here we consider orthogonal wavelets, which lead to a saliency = (C‘k’(V’2,/(n)l, 0 I n < 2’N, -log* N Ij I-1
complete and non-redundant representation of the image. k=l
A wavelet can also have a compact support: its value is
zero outside a bounded interval. The simplest orthogonal The tracked point and its saliency value are computed
compactly supported wavelet is the Haar wavelet (see for every wavelet coefficient. A point related to a global
Figure 3), which is the discontinuous step function. variation has a high saliency value, since the coarse
Daubechies proposed wavelets, with any regularity p wavelet coefficients contribute to it. A finer variation also
0,> 1), that are also orthogonal and compactly supported
t41:
I For clarity we use one-dimensional signals. Extension to two

dimensions and signals with length not restricted to a power of 2,


in addition to algorithm complexity, are discussed in [7].

519
leads to an extracted point, but with a lower saliency
value. We then need to threshold the saliency value, in
relation to the -desired number of salient points. We first
obtain the points related to global variations; local
variations also appear if enough salient points are
requested.

(a) 100 Haar salient points for the Fox image

(b) 100 Haar salient points for the Dutch image


~ ~~

(a) 100 Haarsalient points (b) Tracked coefficients Figure 5. Haar salient points examples

4. EVALUATION FOR IMAGE RETRIEVAL

The best way to evaluate points detectors for image


retrieval is to compare retrieval results obtained with each
detector. The retrieval system is constituted by the
(c) Spatial support of tracked coefficients indexing (points extraction and computation of local
features to build image indexes) and the querying (based
Figure 4. Salient points extraction , on a similarity measure between indexes).
The salient points extracted by this process depend on Different retrieval features and image databases are used
the wavelet we use. Haar is the simplest wavelet function, in [7] to compare points detectors. Here we present results
so the fastest for execution. Some localization drawbacks with an image retrieval system2 based on texture features
can appear with Haar due to its non-overlapping wavelets [13]. Gabor features are computed for regions around the
at a given scale. This can be avoided with the simplest extracted points for 3 scales and 8 orientations. Maximum
overlapping wavelet, Daubechies 4. However, this kind of amplitudes are used to build a set of histograms, which is
drawback is not likely in natural images. the image index.
We use a database of 1505 various natural images. Each
3. EXAMPLES image belongs to an instinctive (and subjective) category
(animals, flowers, landscapes, buildings, cities.. .). Very
The salient points detected with the Haar transform are heterogeneous categories and images too different from
presented for the images used in Figure 1 and Figure 2 (cf: the rest of the category are removed from the test set.
Figure 5). For each image the detected points are Finally, we have a test set of 577 images in 9 classes.
superimposed on the original image to evaluate salient We present the recall-precision graph, computed from
points location. different numbers of return images n. The system retrieves
Salient points are detected for smoothed edges (cf: r images that belong to the same class C as the query
Figure 5.a) and are not gathered in textured regions ( c j (Y I n). There are Nc images in the class C of the query.
Figure 5.b). Hence they lead to a more complete image Then P = r I n is the precision and R = r I N , the recall
representation than comer detectors. Similar behavior can for this query. We use each test set image as a query, and
be observed with Daubechies 4 wavelets. use the average recall and precision for the graph (4
Repeatability of the detection under typical alterations is Figure 6).
a common evaluation criterion for corner detectors.
Repeatability of our detector is comparable to other
detectors. However this criterion may not be relevant in
our context, because features stability is more important
than geometric stability for image retrieval.
* This retrieval system is available through KIWI, the Key-
points Indexing Web Interface: http://telesun.insa-lyon.fr/kiwi

520
65 Wavelets are also attractive to extract image features for
60 indexing. These local features would be more related to
55 our salient points.
-
Q 50
6. ACKNOWLEDGMENTS
.- 45
3 40 E. Loupias' guest period in Leiden University was
p' 35 supported by the RhGne-Alpes Region, France
30 (EURODOC grant). Thanks to Dr. D.P. Huijsmans and
25 L--- I Prof. F. Peters, Leiden University, for discussions we had
20 J about this topic.
0 5 10 15 20
Recall (%) 7. REFERENCES
Figure 6. Retrieval results [l] S. Bhattacharjee and T. Ebrahimi, "Image Retrieval Based
We observe that the wavelet-based salient points on Structural Content ", Workshop on Image Analysis for
Multimedia Interactive Services, Heinrich-Hertz-Institut (HHI)
perform better than other detectors for these features and
Berlin, Germany, May 3 1 - June 1 1999.
this database. Daubechies 4 has better performances than [2] S. Bres and J.-M. Jolion, "Detection of Interest Points for
Haar but is computationally more expensive. Random Image Indexation ", 3rd Int. Con$ on Visual Information
points are also used in the experiment: we randomly select Systems, VisuaZ99, Amsterdam, The Netherlands, June 2-4 1999,
points, and compute the Gabor features around these pp. 427-434.
points. Their good result can be explained by their [3] C.-H. Chen, J.-S. Lee and Y.-N. Sun, "Wavelet
spreading in the image. For that reason they lead to a more Transformation for Gray-level Comer Detection ", Pattern
complete representation of the image than some detectors. Recognition, 1995, Vol. 28, No. 6, pp. 853-861.
Obviously, the random points are very unlikely to be [4] I. Daubechies, Orthonormal bases of compactly supported
"

wavelets ", Communications on Pure and Applied Mathematics,


located in corners or edges point, but they are spread 1988, Vol. 41, pp. 909-996.
enough to represent these variations in the index. Good [5] C . Harris and M. Stephens, " A Combined Comer and Edge
result of random points for indexing was observed with Detector ", Proc. of 4th Alvey Vision Conference, 1988, pp. 147-
other databases and other local features [7]. These 151.
experiments show that the points spreading can be as [6] L. Kitchen and A. Rosenfeld, "Gray-Level Comer
important as the points location for image indexing Detection ", Pattern Recognition Letters, December 1982, Vol.
(depending on the features). However, wavelet-based 1, NO. 2, pp. 95-102.
salient points, which are simultaneously spread and 171 E. Loupias and N. Sebe, " Wavelet-based Salient Points for
Image Retrieval ", RR 99.11, Laboratoire Reconnaissance de
located, perform better than random points.
Formes et Vision, INSA Lyon, November 1999. On-line
http://rfv.insa-1 yon.fr/-loupias/points/
5. DISCUSSION [8] S. Mallat, "A Theory for Multiresolution Signal
Decomposition : The Wavelet Representation ",IEEE Trans. on
We presented a salient point detector based on wavelets. PAMI, July 1989, Vol. 11, NO. 7, pp. 674-693.
The wavelet-based salient points are interesting for image 191 C. Schmid and R. Mohr, "Local Grayvalue Invariants for
retrieval, because they are located in many visual features Image Retrieval ", IEEE Trans. on PAMI, May 1997, Vol. 19,
(whether they are corner-like or not), without gathering in NO. 5, pp. 530-535.
textured regions. We presented a retrieval experiment with [lo] S. Smith, J. Brady, "SUSAN - A New Approach to Low
Level Image Processing ", International Journal of Computer
Gabor features where our method performs better than
Vision, May 1997, Vol. 23, No. 1 , pp. 45-78.
other point detectors from the literature. [ I l l E. Stollnitz, T. DeRose and D. Salesin, "Wavelets for
W e used the Haar transform for point extraction, which Computer Graphics: A Primer, part 1 ", IEEE Computer
is simple but may lead to bad localization. Daubechies Graphics and Applications, May 1995, Vol. 15, No. 3, pp. 76-
wavelets avoid this drawback, but are not symmetric. 84.
Since orthogonality is not required in our approach, we [I21 T. Tuytelaars and L. Van Gool, "Content-based Image
could extend it to other wavelets that are compactly Retrieval Based on Local Affinely Invariant Regions ", 3rd Int.
supported and symmetric. Conf on Visual Information Systems, Visual99,Amsterdam, The
Since points performance for indexing depends on the Netherlands, 2-4 June 1999, pp. 493-500.
image database, detector choice for a specific database [I31 C. Wolf, "Content based Image retrieval using Interest
Points and Texture Features ", RR 99.09, Laboratoire
should be investigated, as well as random points relevance Reconnaissance de Formes et Vision, INSA Lyon, 1999. On-
for local features extraction. line demo : http://telesun.insa-lyon.fr/kiwi

52 I

You might also like