Professional Documents
Culture Documents
Local Invariant Descriptor For Image Matching
Local Invariant Descriptor For Image Matching
net/publication/224612443
Conference Paper in Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on · April 2005
DOI: 10.1109/ICASSP.2005.1415582 · Source: IEEE Xplore
CITATIONS READS
7 684
4 authors:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Weiqiang Wang on 20 May 2014.
2
C ( x , σ I , σ D ) = σ D G (σ I ) ∗ ⎢ ⎥ interesting point and then building smoothed orientation
⎣⎢ L x L y ( x , σ D ) L y ( x, σ D )⎥
2
⎦ histograms. The local region is divided in 4 × 4 sub-
regions, each with 8 orientations, which results in a
Where σ I is the integration scale, σ D the derivation scale, descriptor of dimension 128=4 × 4 × 8. The 128-
L ( x , σ ) = G (σ ) ∗ I ( x ) different levels of resolutions created dimensional descriptor is normalized to unit length to
by convoluting the Gaussian kernel G (σ ) with the eliminate the effects of illumination change.
image I ( x ) and x = ( x , y ) . Given an image I ( x ) the Given 128-dimensional descriptors, we employ
Principal Components Analysis (PCA) [12] to project
∂
derivates can be defined by L x ( x , σ ) = G (σ ) ∗ I ( x ) . The them to low dimensional space. The number of Principal
∂x Components (n) is empirically selected. Here we use n=20,
multi-scale Harris detector is used to locate interesting which get comparable results with the 128-dimensional
n
points at different resolution L(x, σ n ) , with σ n = k σ0 , descriptor. The dimension of our descriptor is low, which
results in significant space and speed benefits.
σ 0 is the initial scale factor.
Characteristic scale. For an interesting point, the 4. ROBUST MATCHING
characteristic scale can be defined as that at which the
result of a differential operator is maximized [17]. To robustly match a pair of images, we first determine
Different differential operators are comparatively point-to-point correspondence. We select for each
evaluated in [8]. Laplacian obtains the highest percentage descriptor in the first image the most similar one in the
of correct scale detection. We use Laplacian to verify for second image based on the Euclidean distance. If the
each of the candidate points found on different levels if it Euclidean distance is below a threshold η , the
forms a local maximum in the scale direction. correspondence is kept. All point-to-point correspond-
Lap ( x , σ n ) > Lap ( x , σ n −1 ) ∧ Lap ( x , σ n ) > Lap ( x , σ n +1 ) ences form a set of initial matches. We refine the initial
matches using RANdom SAmple Consensus (RANSAC).
Lap ( x , σ n ) > threshold
RANSAC has the advantage that it is largely insensitive to
Orientation. The invariance to rotation can be obtained outliers. We use fundamental matrix as the transformation
by assigning a consistent orientation to each local region. of RANSAC in our experiments.
One can use the method in [4] to get a stable estimation of
the dominant direction. 5. EXPERIMENTS
Fig.1 shows two images with detected scale invariant
regions. The threshold of multi-scale Harris and Laplacian We validate our algorithm by image matching
are 1000 and 10, respectively. 15 resolution levels are experiments and an image retrieval application.
used for scale representation. The factor k is 1.2
and σ I = 2σ D . 5.1. Image matching experiments
Fig. 2 shows the matching result for a real world scene,
which includes significant scale, rotation change as well as
a change in the viewing angle.
6. CONCLUSION
Acknowledgements
(b)
This work is supported by National Hi-Tech Development
Programs of China under grant No. 2003AA142140.
References
(c)
1. Schmid, C., Mohr, R.: Local Grayvalue Invariants for Image
Retrieval. IEEE PAMI, 19 (1997) 530–534
2. Lowe, D.G.: Object Recognition from Local Scale-Invariant
Features. In: ICCV, (1999) 1150–1157
3. Y. Ke and R. Sukthankar: “PCA-SIFT: A more distinctive
representation for a local image descriptors”, CVPR04.
4. Mikolajczyk, K.: Detection of Local Features Invariant to (d)
Affine Transformations. PhD thesis, INRIA, (2002)
5. Mikolajczyk, K., Schmid, C.: An Affine Invariant Interest
Point Detector. In: ECCV, (2002) 128-142
6. Schaffalitzky, F., Zisserman, A.: Multi-view Matching for
Unordered Image Sets, or “How Do I Organize My Holiday
Snaps?". In: ECCV, (2002) 414-431
7. Tuytelaars, T., Van Gool, L.: Wide Baseline Stereo Matching
(e)
Based on Local Affinely Invariant Regions. In: BMVC, (2000) Fig. 6. The first column shows some of the query images.
412-425 The second column shows the most similar images in the
8. Mikolajczyk, K., Schmid, C.: Indexing Based on Scale database, all of them are correct
Invariant Interest Points. In: ICCV, (2001) 525–531
9. Fergus, Perona, Zisserman. Object class recognition by
unsupervised scale invariant learning. In: CVPR 2003.