Feature Point Detection and Matching: Matching With Features Matching With Features

1
Feature Point
Detection and Matching
Wide Baseline Matching
Images taken by cameras that are far apart make the
correspondence problem very difficult
Feature-based approach: Detect and match feature
points in pairs of images
Matching with Features
Detect feature points
Find corresponding pairs
Problem 1:
Detect the same point independently
in both images
no chance to match!
We need a repeatable detector
2
Problem 2:
For each point, correctly recognize the
corresponding point
?
We need a reliable and distinctive descriptor
Properties of an Ideal Feature
Local: features are local, so robust to occlusion and clutter (no
prior segmentation)
Invariant (or covariant) to many kinds of geometric and
photometric transformations
Robust: noise, blur, discretization, compression, etc. do not have
a big impact on the feature
Distinctive: individual features can be matched to a large
database of objects
Quantity: many features can be generated for even small objects
Accurate: precise localization
Efficient: close to real-time performance
Problem 1: Detecting Good Feature
Points
Feature Detectors
Hessian
Harris
Lowe: SIFT (DoG)
Mikolajczyk & Schmid:
Hessian/Harris-Laplacian/Affine
Tuytelaars & Van Gool: EBR and IBR
Matas: MSER
Kadir & Brady: Salient Regions
and many others
3
Harris Corner Point Detector
C. Harris, M. Stephens, A Combined Corner and Edge Detector, 1988
Harris Detector: Basic Idea
We should recognize the point by looking
through a small window
Shifting a window in any direction should
give a large change in response
Harris Detector: Basic Idea
flat region:
no change in
all directions
edge:
no change along
the edge direction
corner:
significant change
in all directions
Harris Detector: Derivation
Change of intensity for a (small) shift by [u,v] in image I:
Intensity
Shifted
intensity
Weighting
function
or
Weighting function w(x,y) =
Gaussian 1 in window, 0 outside
e
+ + =
) , ( ) , (
2
,
0 0
0 0
)] , ( ) , ( )[ , ( ) , (
y x N y x
y x
y x I v y u x I y x w v u E
4
Harris Detector
Approximate using Taylor series expansion of I:
=
=
=
+ + =
y x
y x
y x
y
y x
x
y x I y x I y x w C
y x I y x w B
y x I y x w A
Bv Cuv Au v u E
,
,
2
,
2
2 2
) , ( ) , ( ) , (
) , ( ) , (
) , ( ) , (
2 ) , (
| | ( , )
A C u
E u v u v
C B v
( (
=
( (

x y x I I
x
c c = / ) , (
y y x I I
y
c c = / ) , (
| |
(
+ ~
+ + ~ + +
v
u
I I y x I
v I u I y x I v y u x I
y x
y x
) , (
) , ( ) , (
Plugging this into previous formula, we get:
Harris Corner Detector
| | ( , ) ,
u
E u v u v M
v
(
~
(

In summary, expanding E(u,v) in a Taylor series, we have, for
small shifts, [u,v], a bilinear approximation:
2
2
,
( , )
x x y
x y x y y
I I I
M w x y
I I I
(
=
(
(

where M is a 2 2 matrix computed from image derivatives:

Note: Sum computed over small neighborhood around given pixel
x y x I I
x
c c = / ) , (
y y x I I
y
c c = / ) , (
| | ( , ) ,
u
E u v u v M
v
(
~
(

Intensity change in shifting window: eigenvalue analysis
max
,
min
eigenvalues of M
Eigenvector =
direction of the
slowest change in E
Eigenvector associated with
max
=
direction of the fastest change in E
(
max
)
-1/2
(
min
)
-1/2
Ellipse E(u,v) = const
Selecting Good Features
1
and
2
both large
Image patch
SSD surface
5
large
1
, small
2
SSD surface
small
1
, small
2
SSD surface
2
Corner
1
and
2
both large,

1
~
2
;
E increases in all
directions
1
and
2
are small;
E is almost constant
in all directions
Edge
1
>>
2

Edge
2
>>
1

Flat
region
Classification of
image points using
eigenvalues of M:
Measure of corner response:
( )
2
det trace R M k M =
1 2
1 2
det
trace
M
M

=
= +
k is an empirically-determined constant; e.g., k = 0.05
6
2
Corner
Edge
Edge
Flat
R depends only on
eigenvalues of M
R is large for a corner
R is negative with large
magnitude for an edge
|R| is small for a flat
region
R > 0
R < 0
R < 0 |R| small
Harris Corner Detector: Algorithm
Algorithm:
1. Find points with large corner
response function R
(i.e., R > threshold)
2. Take the points of local maxima of R
(for localization) by non-maximum
suppression
Harris Detector: Example
Compute corner response R =
1
2
k(
1
+
2
)
2
7
Find points with large corner response: R > threshold
Take only the points of local maxima of R
Interest points extracted with Harris (~ 500 points)
8
Harris Detector: Summary
Average intensity change in direction [u,v] can be
expressed (approximately) in bilinear form:

Describe a point in terms of eigenvalues of M:
measure of corner response:

A good (corner) point should have a large intensity change
in all directions, i.e., R should be a large positive value
| | ( , ) ,
u
E u v u v M
v
(
~
(

( )
2
1 2 1 2
R k = +
Harris Detector Properties
Rotation invariance
Ellipse rotates but its shape (i.e., eigenvalues)
remains the same
Corner response R is invariant to image rotation
But not invariant to image scale
Fine scale: All points will
be classified as edges
Coarse scale: Corner
9
Harris Detector: Scale
R
min
= 0
R
min
= 1500
Quality of Harris detector for different scale
changes
Repeatability rate:
# correct correspondences
# possible correspondences
C. Schmid et al., Evaluation of Interest Point Detectors, IJCV 2000
40
Invariant Local Features
Goal: Detect the same interest points regardless
of image changes due to translation, rotation,
scale, viewpoint
41
Geometry
Rotation
Similarity (rotation + uniform scale)

Affine (scale dependent on direction)
valid for: orthographic camera, locally planar
object
Photometry
Affine intensity change (I a I + b)
Models of Image Change
10
Scale Invariant Detection
Consider regions (e.g., circles) of different sizes
around a point
Regions of corresponding sizes will look the same
in both images
Problem: How do we choose corresponding
circles independently in each image?
Solution:
Design a function on the region (circle) that is
scale invariant, i.e., the same for corresponding
regions, even if they are at different scales

Example: Average intensity. For corresponding
regions (even of different sizes) it will be the same
scale = 1/2
For a point in one image, we can consider it as a
function of region size (circle radius)
f
region size
Image 1
f
region size
Image 2
Key idea:
scale = 1/2
f
region size
Image 1
f
region size
Image 2
Take a local maximum of this function
Observation: Region size, for which the maximum is
achieved, should be invariant to image scale
s
1
s
2
Important: This scale invariant region size
is found in each image independently!
11
Automatic Scale Selection
)) , ( (
1
o x I f
m
i i
Lindeberg et al., 1996
Common definition of f:
Laplacian (or DoG)
)) , ( (
1
o x I f
m
i i
Function responses for increasing scale
Scale trace (signature)
)) , ( (
1
o x I f
m
i i
)) , ( (
1
o x I f
m
i i
12
)) , ( (
1
o x I f
m
i i
)) , ( (
1
o x I f
m
i i
)) , ( (
1
o x I f
m
i i
)) , ( (
1
o x I f
m i i
)) , ( (
1
o x I f
m
i i
'
13
)) , ( (
1
o x I f
m i i
)) , ( (
1
o x I f
m
i i
'

)) , ( (
1
o x I f
m i i
)) , ( (
1
o x I f
m
i i
'

)) , ( (
1
o x I f
m i i
)) , ( (
1
o x I f
m
i i
'

)) , ( (
1
o x I f
m i i
)) , ( (
1
o x I f
m
i i
'
14
)) , ( (
1
o x I f
m i i
)) , ( (
1
o x I f
m
i i
'

)) , ( (
1
o x I f
m i i
)) , ( (
1
o' ' x I f
m
i i
Normalize: rescale to fixed size
A good function for scale detection has one stable,
sharp peak

For many images, a good function is one that
responds to contrast (i.e., sharp local intensity
change)
f
region size
bad
f
region size
bad
f
region size
Good !
15
Scale Invariant Detectors
Harris-Laplacian
1

Find local maxima of:
Harris corner detector in space
(image coordinates)
Laplacian in scale
1
K.Mikolajczyk, C.Schmid. Indexing Based on Scale Invariant Interest Points, ICCV 2001
2
D.Lowe. Distinctive Image Features from Scale-Invariant Keypoints, IJCV, 2004
scale
x
y
Harris

L
a
p
l
a
c
i
a
n

SIFT keypoints
2

Find local extrema of:
Difference of Gaussians in
space and scale
scale
x
y
DoG

D
o
G

Blob Detection in 2D
Laplacian of Gaussian: Circularly symmetric
operator for blob detection in 2D
2
2
2
2
2
y
I
x
I
I
c
c
+
c
c
= V
Blob Detection in 2D: Scale Selection
Laplacian-of-Gaussian = blob detector

2
2
2
2
2
y
I
x
I
I
c
c
+
c
c
= V
f
i
l
t
e
r

s
c
a
l
e
s

img1 img2 img3
Bastian Leibe
Blob Detection in 2D
We define the characteristic scale as the scale
that produces peak of Laplacian response
characteristic scale
Slide credit: Lana Lazebnik
16
Example
Original image
at the size
Kristen Grauman
Original image
at the size
Kristen Grauman
Kristen Grauman Kristen Grauman
17
Kristen Grauman Kristen Grauman
Kristen Grauman
) ( ) ( o o
yy xx
L L +
o1
o2

o3

o4

o5

List of
(x, y, )
scale
Scale Invariant Interest Points
Interest points are local maxima in both position
and scale
Squared filter
response maps
18
SIFT Detector Example Result
Image credit: Lana Lazebnik
SIFT Detector Example Result
SIFT Detector Details
Difference-of-Gaussian (DoG)
is an approximation of the
Laplacian-of-Gaussian (LoG)

=
SIFT Detector
19
SIFT Detector
85
SIFT Detector Algorithm Summary
Detect local maxima in
position and scale of
squared values of
difference-of-Gaussian
Fit a quadratic to
surrounding values for
sub-pixel and sub-scale
interpolation
Output = list of (x, y, o)
points
Blur Resample Subtract
87
Scale Invariant Detectors
K.Mikolajczyk, C.Schmid. Indexing Based on Scale Invariant Interest Points, ICCV 2001
Experimental evaluation of detectors w.r.t. scale change
Repeatability rate:
# correspondences
# possible correspondences
Affine Invariant Detection
Previously we considered:
similarity transform (rotation + uniform scale)
Now we go on to invariance under an
affine transformation (rotation + scale + skew)
Circle projects to an ellipse in general
20
Harris-Affine Detector
Tuytelaarss Affine-Invariant Detector
Take a local intensity extremum as initial point
Go along every ray starting from this point and stop when
extremum of function f is reached
T.Tuytelaars and L.V.Gool. Wide Baseline Stereo Matching Based on Local,
Affinely Invariant Regions, BMVC 2000
0
1
0
( )
( )
( )
t
o
t
I t I
f t
I t I dt
}
f
points along the ray
We will obtain approximately
corresponding regions
Remark: we search for scale
in every direction
The regions found may not exactly correspond, so we
approximate them with ellipses
Geometric Moments:
2
( , )
p q
pq
m x y f x y dxdy =
}
Fact: moments m
pq
uniquely
determine the function f
Taking f to be the characteristic function of a
region (1 inside, 0 outside), moments of order up to
2 allow us to approximate the region by an ellipse
This ellipse will have the same moments of
order up to 2 as the original region
Ellipses, computed for corresponding
regions, also correspond
21
Algorithm Summary (detection of affine invariant regions):
Start from a local intensity extremum point
Go in every direction until the point of extremum of some
function f
Curve connecting the points is the region boundary
Compute geometric moments of orders up to 2 for this region
Replace the region with ellipse
T.Tuytelaars, L.V.Gool. Wide Baseline Stereo Matching Based on Local,
Affinely Invariant Regions, BMVC 2000
Feature Point Descriptors
After detecting points (and patches) in each image,
Next question: How to match them?
?
Point descriptor should be:
1. Invariant
2. Distinctive
Local Features: Description
1) Detection: Identify the
interest points

2) Description: Extract feature
vector for each interest point

3) Matching: Determine
correspondence between
descriptors in two views
] , , [
) 1 ( ) 1 (
1 1 d
x x = x
] , , [
) 2 ( ) 2 (
1 2 d
x x = x
22
Geometric Transformations
e.g. scale,
translation,
rotation
Photometric Transformations

Figure from T. Tuytelaars ECCV 2006 tutorial
Raw Patches as Local Descriptors
The simplest way to describe
the neighborhood around an
interest point is to write down
the list of intensities to form a
feature vector

But this is very sensitive to
even small shifts or rotations
Making Descriptors Invariant to
Rotation
Find local orientation
Dominant direction of gradient:
Compute description relative to this orientation
1
K.Mikolajczyk, C.Schmid. Indexing Based on Scale Invariant Interest Points. ICCV 2001
2
D.Lowe. Distinctive Image Features from Scale-Invariant Keypoints. Accepted to IJCV 2004
23
SIFT Descriptor:
Select Major Orientation
Compute histogram of local
gradient directions computed
at selected scale in
neighborhood of a feature
point relative to dominant local
orientation

Compute gradients within sub-
patches, and compute
histogram of orientations using
discrete bins

Descriptor is rotation and scale
invariant, and also has some
illumination invariance (why?)
0
2 t
SIFT Descriptor
0 2 t
SIFT Descriptor

Compute gradient orientation histograms on 4 x 4 neighborhoods
over 16 x 16 array of locations in scale space around each keypoint
position, relative to the keypoint orientation using thresholded image
gradients from Gaussian pyramid level at keypoints scale
Quantize orientations to 8 values
4 x 4 array of histograms
SIFT feature vector of length 4 x 4 x 8 = 128 values for each keypoint
Normalize the descriptor to make it invariant to intensity change

D.Lowe. Distinctive Image Features from Scale-Invariant Keypoints, IJCV 2004
Sensitivity to Number of Histogram Orientations

24
Feature Stability to Noise
Match features after random change in image scale &
orientation, with differing levels of image noise
Find nearest neighbor in database of 30,000 features
Feature Stability to Affine Change
Match features after random change in image scale &
orientation, with 2% image noise, and affine distortion
Find nearest neighbor in database of 30,000 features
Distinctiveness of Features
Vary size of database of features, with 30 degree
affine change, 2% image noise
Measure % correct for single nearest neighbor match
SIFT Descriptor Performance

Empirically found to have good performance in terms of
invariance to image rotation, scale, intensity change, and to
moderate affine transformations, illumination changes,
viewpoint, occlusion
1
D.Lowe, Distinctive Image Features from Scale-Invariant Keypoints, IJCV 2004
2
K.Mikolajczyk, C.Schmid, APerformance Evaluation of Local Descriptors, CVPR 2003
Scale = 2.5
Rotation = 45
0
25
Feature Detection and Description
Summary
Stable (repeatable) feature points can currently
be detected that are invariant to
Rotation, scale, and affine transformations, but not to
more general perspective and projective
transformations

Feature point descriptors can be computed, but
are noisy due to use of differential operators
are not invariant to projective transformations
How to Improve Feature Detectors
and Descriptors?
Invariant to large changes in camera
viewpoint, i.e., projective invariance
Robust to image noise by using descriptor
based on integral invariants instead of
differential features
Incorporate color and texture into descriptor
Feature Matching
?
Wide-Baseline Feature Matching
Standard approach for pairwise matching:
For each feature point in image A
Find the feature point with the closest descriptor
in image B

From Schaffalitzky and
Zisserman 02
26
Compare the distance, d1, to the closest
feature, to the distance, d2, to the second
closest feature
Accept if d1/d2 < 0.6
If the ratio of distances is less than a threshold,
keep the feature
Why the ratio test?
Eliminates hard-to-match repeated features
Distances in SIFT descriptor space seem to be non-
uniform


Because of the high dimensionality of
features, approximate nearest neighbors are
necessary for efficient performance
See ANN package, Mount and Arya
http://www.cs.umd.edu/~mount/ANN/

Dealing with Noisy Data: RANSAC
How to find the best fitting data to a global
model when the data are noisy especially
because of the presence of outliers, i.e.,
missing features and extraneous features?

RANSAC (Random Sample Consensus)
Method
Iteratively select a small subset of data and fit model to
that data
Then check all of data to see how many fit the model
RANSAC for Estimating Homography
1. RANSAC loop:
i. Select four feature pairs (at random)
ii. Compute homography H (exact)
iii. Compute inliers where SSD(p
i
', Hp
i
) <
2. Keep largest set of inliers
3. Re-compute least-squares H estimate using
all of the inliers
27
How Many Iterations?
If data contains q percent outliers, p
points/subset, and want probability r that at
least one subset is correct, then number of
iterations needed is

Example: p = 5, q = 40%, r = 0.99, then m = 57
) ) 1 ( 1 log(
) 1 log(
p
q
r
m

=
Summary
Interest point detection
Harris corner detector
SIFT (Laplacian of Gaussian, automatic scale selection)

Compute descriptors
Rotation according to dominant gradient direction
Histograms for robustness to small shifts and translations
(SIFT descriptor)

Match descriptors

Feature Point Detection and Matching: Matching With Features Matching With Features

Uploaded by

Copyright:

Available Formats

You might also like

Feature Point Detection and Matching: Matching With Features Matching With Features

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Feature Point Detection and Matching: Matching With Features Matching With Features

Uploaded by

Copyright:

Available Formats

1

where M is a 2 2 matrix computed from image derivatives:

Automatic Scale Selection

Automatic Scale Selection

Automatic Scale Selection

Automatic Scale Selection

You might also like