Lecture 10: Feature-based Image Matching (2)
Representing a Text as a “Bag of Words”
We hold these truths to be self-evident, that all men are created equal,
that they are endowed by their Creator with certain unalienable Rights,
that among these are Life, Liberty and the pursuit of Happiness. That to
secure these rights, Governments are instituted among Men, deriving
their just powers from the consent of the governed, That whenever any
Form of Government becomes destructive of these ends, it is the Right of
the People to alter or to abolish it, and to institute new Government, laying
its foundation on such principles and organizing its powers in such form,
as to them shall seem most likely to effect their Safety and Happiness.
Prudence, indeed, will dictate that Governments long established should
not be changed for light and transient causes; and accordingly all
experience hath shewn, that mankind are more disposed to suffer, while
evils are sufferable, than to right themselves by abolishing the forms to
which they are accustomed. But when a long train of abuses and
usurpations, pursuing invariably the same Object evinces a design to
reduce them under absolute Despotism, it is their right, it is their duty, to
throw off such Government, and to provide new Guards for their future
security.
[Figure: word cloud with self-evident, Liberty, truths, happiness, endowed, Creator, unalienable, pursuit, Life]
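A minimal sketch of the text case, assuming simple lowercase tokenization and an illustrative stop-word list (both are choices made here, not taken from the slides). It shows that a bag of words keeps only word counts and discards word order.

```python
import re
from collections import Counter

def bag_of_words(text, stop_words=frozenset()):
    """Count words, ignoring order, case, and punctuation."""
    # Keep hyphenated words such as "self-evident" as one token.
    tokens = re.findall(r"[a-z]+(?:-[a-z]+)*", text.lower())
    return Counter(t for t in tokens if t not in stop_words)

# Illustrative stop-word list (an assumption for this example).
STOP = frozenset({"we", "to", "be", "that", "are", "by", "the", "of",
                  "and", "with", "all", "these", "they", "their"})

sample = ("We hold these truths to be self-evident, that all men are created "
          "equal, that they are endowed by their Creator with certain "
          "unalienable Rights, that among these are Life, Liberty and the "
          "pursuit of Happiness.")
bow = bag_of_words(sample, stop_words=STOP)
print(bow.most_common(5))
```

Two texts can then be compared through their count vectors alone, which is the idea carried over to images.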
Representing an Image as a “Bag of Visual Words”
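A minimal sketch of the image case, assuming a visual vocabulary is already available as a small array of cluster centers (for example from k-means over training descriptors, which is an assumption here). Each descriptor votes for its nearest visual word, and the image is summarized by the resulting histogram. The 2-D descriptors are toy values for readability.

```python
import numpy as np

def bovw_histogram(descriptors, vocabulary):
    """Assign each descriptor to its nearest visual word and count votes.

    descriptors: (N, D) array, one row per local feature.
    vocabulary:  (K, D) array, one row per visual word (cluster center).
    """
    # Squared Euclidean distance between every descriptor and every word.
    diff = descriptors[:, None, :] - vocabulary[None, :, :]
    dist2 = (diff ** 2).sum(axis=2)
    words = dist2.argmin(axis=1)
    return np.bincount(words, minlength=len(vocabulary))

vocabulary = np.array([[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]])
descriptors = np.array([[0.1, -0.1], [0.9, 1.2], [1.1, 0.8], [4.7, 5.2]])
hist = bovw_histogram(descriptors, vocabulary)
print(hist)
```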
Feature descriptors
[Figure: descriptor computation; legible labels include gradient, dx, dy, |dx|, |dy|, Dxy, Dyy, Filters, Blob Response]
“Bag of Visual Words” Matching
Pairwise Comparison
Which of the following statements are true?
(a) A bag of visual words representation is robust against partial occlusions of an
object.
(b) The SIFT descriptor can only be calculated for SIFT keypoints. Similarly,
the SURF descriptor can only be calculated for SURF keypoints.
(c) Both SIFT and SURF descriptors only depend on image gradients.
(d) The SIFT descriptor is more robust against image rotation since it uses an
orientation histogram.
Geometric mapping
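Notation:
Homogeneous coordinates; reference image: $\bar{\mathbf{x}} = (x, y, 1)^T$
Inhomogeneous coordinates; target image: $\mathbf{x}' = (x', y')^T$
Translation
$$\mathbf{x}' = \begin{bmatrix} 1 & 0 & t_x \\ 0 & 1 & t_y \end{bmatrix} \bar{\mathbf{x}}$$
Euclidean transformation (rotation and translation)
$$\mathbf{x}' = \begin{bmatrix} \cos\theta & -\sin\theta & t_x \\ \sin\theta & \cos\theta & t_y \end{bmatrix} \bar{\mathbf{x}}$$
Scaled rotation (similarity transform)
$$\mathbf{x}' = \begin{bmatrix} s\cos\theta & -s\sin\theta & t_x \\ s\sin\theta & s\cos\theta & t_y \end{bmatrix} \bar{\mathbf{x}}$$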
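A small sketch of the similarity transform, assuming points are stored as rows of an (N, 2) array. The helper names are illustrative. Setting s = 1 gives the Euclidean case, and additionally setting theta = 0 gives pure translation.

```python
import numpy as np

def similarity_matrix(s, theta, tx, ty):
    """2x3 matrix of a scaled rotation followed by a translation."""
    c, d = s * np.cos(theta), s * np.sin(theta)
    return np.array([[c, -d, tx],
                     [d,  c, ty]])

def apply_2x3(M, points):
    """Map (N, 2) reference points to (N, 2) target points."""
    # Append a 1 to each point to get homogeneous coordinates (x, y, 1).
    homog = np.hstack([points, np.ones((len(points), 1))])
    return homog @ M.T

# Rotate by 90 degrees, scale by 2, then shift by (1, 0).
M = similarity_matrix(2.0, np.pi / 2, 1.0, 0.0)
mapped = apply_2x3(M, np.array([[1.0, 0.0]]))
print(mapped)
```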
Geometric mapping
Affine transformation
$$\mathbf{x}' = \begin{bmatrix} a_{00} & a_{01} & a_{02} \\ a_{10} & a_{11} & a_{12} \end{bmatrix} \bar{\mathbf{x}}$$
Motion of planar surface in 3D under orthographic projection
Parallel lines are preserved
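The affine model has six parameters, so three non-collinear correspondences determine it, and more can be used in a least-squares fit. A minimal sketch, assuming noise-free synthetic correspondences and using `numpy.linalg.lstsq`. The function name is illustrative.

```python
import numpy as np

def estimate_affine(src, dst):
    """Least-squares 2x3 affine matrix A with dst ~ A @ (x, y, 1)."""
    homog = np.hstack([src, np.ones((len(src), 1))])
    # Solve homog @ A.T = dst for A.T (shape 3x2).
    A_T, *_ = np.linalg.lstsq(homog, dst, rcond=None)
    return A_T.T

A_true = np.array([[1.2, 0.3, 5.0],
                   [-0.1, 0.9, -2.0]])
src = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
dst = np.hstack([src, np.ones((4, 1))]) @ A_true.T
A_est = estimate_affine(src, dst)

# Parallel edges of the unit square stay parallel after mapping.
edge_bottom = dst[1] - dst[0]
edge_top = dst[3] - dst[2]
print(A_est, edge_bottom, edge_top)
```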
Geometric mapping
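Motion of planar surface in 3D under perspective projection
Homography
$$\tilde{\mathbf{x}}' = \begin{bmatrix} h_{00} & h_{01} & h_{02} \\ h_{10} & h_{11} & h_{12} \\ h_{20} & h_{21} & h_{22} \end{bmatrix} \bar{\mathbf{x}}$$
Inhomogeneous coordinates (after normalization)
$$x' = \frac{h_{00}x + h_{01}y + h_{02}}{h_{20}x + h_{21}y + h_{22}}, \qquad y' = \frac{h_{10}x + h_{11}y + h_{12}}{h_{20}x + h_{21}y + h_{22}}$$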
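A short sketch of applying a homography, assuming points are rows of an (N, 2) array. It makes the normalization step explicit: the first two homogeneous components are divided by the third. The matrix values are arbitrary illustrative numbers.

```python
import numpy as np

def apply_homography(H, points):
    """Map (N, 2) points with a 3x3 homography and normalize."""
    homog = np.hstack([points, np.ones((len(points), 1))])
    mapped = homog @ H.T
    # Divide by the third homogeneous component.
    return mapped[:, :2] / mapped[:, 2:3]

# A nonzero bottom-row entry makes the divisor depend on x.
H = np.array([[1.0, 0.0, 0.0],
              [0.0, 1.0, 0.0],
              [0.5, 0.0, 1.0]])
out = apply_homography(H, np.array([[2.0, 4.0]]))
print(out)
```

With the bottom row equal to (0, 0, 1) the divisor is always 1 and the mapping reduces to an affine transformation.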
Probability of valid correspondence
Use a small number of correspondences
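One way to see why a small sample helps: if each correspondence is valid with probability p, a random sample of k correspondences is entirely valid with probability p^k, and after S independent trials at least one such sample has been drawn with probability 1 - (1 - p^k)^S. The sketch below solves this for S. The function name and the 0.99 target are assumptions chosen for illustration.

```python
import math

def ransac_trials(p_valid, sample_size, p_success=0.99):
    """Trials needed so that, with probability p_success, at least one
    sample consists only of valid correspondences (requires p_valid < 1)."""
    p_good_sample = p_valid ** sample_size
    return math.ceil(math.log(1.0 - p_success) / math.log(1.0 - p_good_sample))

# With half of the correspondences valid:
trials_affine = ransac_trials(0.5, 3)       # affine model, 3 correspondences
trials_homography = ransac_trials(0.5, 4)   # homography, 4 correspondences
print(trials_affine, trials_homography)
```

The required number of trials grows quickly with the sample size, so the minimal sample for the chosen model is the usual choice.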
RANSAC with Affine Model
RANSAC with Homography
SURF features & affine RANSAC
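A compact sketch of RANSAC with an affine model on synthetic correspondences. The tolerance, trial count, seeds, and data are assumptions chosen for illustration, and real feature matches (for example from SURF) would replace the synthetic arrays.

```python
import numpy as np

def fit_affine(src, dst):
    """Least-squares 2x3 affine matrix mapping src to dst."""
    homog = np.hstack([src, np.ones((len(src), 1))])
    A_T, *_ = np.linalg.lstsq(homog, dst, rcond=None)
    return A_T.T

def ransac_affine(src, dst, n_trials=200, tol=1.0, seed=0):
    """Return (affine matrix, inlier mask) found by RANSAC."""
    rng = np.random.default_rng(seed)
    homog = np.hstack([src, np.ones((len(src), 1))])
    best = np.zeros(len(src), dtype=bool)
    for _ in range(n_trials):
        # Minimal sample: 3 correspondences fix the 6 affine parameters.
        idx = rng.choice(len(src), size=3, replace=False)
        A = fit_affine(src[idx], dst[idx])
        err = np.linalg.norm(homog @ A.T - dst, axis=1)
        inliers = err < tol
        if inliers.sum() > best.sum():
            best = inliers
    if best.sum() < 3:
        raise ValueError("no consensus set found")
    # Refit on all inliers of the best hypothesis.
    return fit_affine(src[best], dst[best]), best

# Synthetic data: 20 exact correspondences plus 10 gross outliers.
rng = np.random.default_rng(1)
src = rng.uniform(0, 100, size=(30, 2))
A_true = np.array([[0.9, -0.2, 10.0],
                   [0.2, 0.9, -5.0]])
dst = np.hstack([src, np.ones((30, 1))]) @ A_true.T
dst[20:] += rng.uniform(20.0, 60.0, size=(10, 2))
A_est, inliers = ransac_affine(src, dst)
print(inliers.sum())
```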
Pairwise Comparison
Which of the following statements are true?
(a) RANSAC is resilient against missing features, extraneous features, and noisy
correspondences in a bag of visual words matching scenario.
(b) RANSAC can only be applied if the number of inliers is larger than the number of
outliers.
(c) For a fixed number of iterations in RANSAC, using a model with a larger number
of parameters always increases the probability of success.
Comparing Feature Histograms
Speed up by comparing histograms of features:
pairwise image comparison only for similar histograms
Histogram intersection, with query histogram $Q$ and histogram of database entry $D$ (each with $n$ bins):
$$\frac{\sum_{i=1}^{n} \min(Q_i, D_i)}{\sum_{i=1}^{n} D_i}$$
[Swain, Ballard 1991]
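A minimal sketch of histogram intersection, normalized by the total count of the database histogram. The bin counts are made-up illustrative values.

```python
import numpy as np

def histogram_intersection(query, database):
    """Sum of bin-wise minima, divided by the database histogram total."""
    query = np.asarray(query, dtype=float)
    database = np.asarray(database, dtype=float)
    return np.minimum(query, database).sum() / database.sum()

score = histogram_intersection([2, 0, 3, 1], [1, 1, 3, 0])
print(score)
```

A score of 1 means every count in the database histogram is covered by the query histogram. Only database entries with a high score would then be passed on to the more expensive pairwise comparison.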
[Figure: tree illustration with k=3; panel label: Query]
Querying: Hard Binning vs. Soft Binning
[Figure: descriptor d1 at node 1, shown for hard binning and for soft binning]
SURF features
6-level tree
1M leaf nodes
3269 query images
100 top tree results
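A hedged sketch of the difference between hard and soft binning at a single tree node. Hard binning gives the whole vote of a descriptor to its nearest child center. The soft variant shown here spreads the vote over the m nearest centers with Gaussian weights, which is one common choice and an assumption of this example, not necessarily the scheme used in the experiment above. All names and values are illustrative.

```python
import numpy as np

def hard_bin(descriptor, centers):
    """All of the vote goes to the single nearest center."""
    dist = np.linalg.norm(centers - descriptor, axis=1)
    votes = np.zeros(len(centers))
    votes[dist.argmin()] = 1.0
    return votes

def soft_bin(descriptor, centers, m=2, sigma=1.0):
    """Vote is shared among the m nearest centers (assumed Gaussian weights)."""
    dist = np.linalg.norm(centers - descriptor, axis=1)
    nearest = np.argsort(dist)[:m]
    weights = np.exp(-dist[nearest] ** 2 / (2.0 * sigma ** 2))
    votes = np.zeros(len(centers))
    votes[nearest] = weights / weights.sum()
    return votes

# d1 lies close to the boundary between the first two centers.
centers = np.array([[0.0, 0.0], [2.0, 0.0], [10.0, 0.0]])
d1 = np.array([0.9, 0.0])
hard = hard_bin(d1, centers)
soft = soft_bin(d1, centers)
print(hard, soft)
```

A small perturbation of d1 can flip the hard assignment to the neighboring bin, while the soft votes change only gradually.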
Applications
MATLAB help: Local Feature Detection and Extraction