0% found this document useful (0 votes)

37 views

SIFT - The Scale Invariant Feature Transform

SIFT is a scale-invariant feature transform algorithm that detects and describes local features in images. It finds distinctive keypoints that are invariant to scale, rotation, and illumination changes. The algorithm involves several steps: (1) it detects scale-space extrema in the difference-of-Gaussian transformed image pyramid, (2) precisely localizes the keypoints and filters out low contrast ones, (3) assigns orientations based on local image gradients, and (4) builds descriptors for the keypoints based on histograms of local image gradient orientations and magnitudes. SIFT features have been shown to perform well for image matching and recognition tasks due to their robustness to various image transformations and ability to discriminate between features.

Uploaded by

Ismael Coulibaly

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views

SIFT - The Scale Invariant Feature Transform

Uploaded by

Ismael Coulibaly

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 62

SIFT - The Scale Invariant

Feature Transform

Distinctive image features from scale-invariant keypoints. David

G. Lowe, International Journal of Computer Vision, 60, 2 (2004),
pp. 91-110

Presented by Ofir Pele.

Based upon slides from:

- Sebastian Thrun and Jana Koeck
- Neeraj Kumar
Correspondence
Fundamental to many of the core vision problems
Recognition
Motion tracking
Multiview geometry
Local features are the key

Images from: M. Brown and D. G. Lowe. Recognising Panoramas. In Proceedings of the

the International Conference on Computer Vision (ICCV2003 (
Local Features:
Detectors & Descriptors
Detected Descriptors
Interest Points/Regions

<0 12 31 0 0 23 >

<5 0 0 11 37 15 >

<14 21 10 0 3 22 >
Ideal Interest Points/Regions
Lots of them
Repeatable
Representative orientation/scale
Fast to extract and match
SIFT Overview
Detector
1. Find Scale-Space Extrema
2. Keypoint Localization & Filtering
Improve keypoints and throw out bad ones

3. Orientation Assignment
Remove effects of rotation and scale
4. Create descriptor
Using histograms of orientations
Descriptor
SIFT Overview
Detector
1. Find Scale-Space Extrema
2. Keypoint Localization & Filtering
Improve keypoints and throw out bad ones

3. Orientation Assignment
Remove effects of rotation and scale
4. Create descriptor
Using histograms of orientations
Descriptor
Scale Space
Need to find characteristic scale for feature
Scale-Space: Continuous function of scale
Only reasonable kernel is Gaussian:

L ( x, y , D ) = G ( x, y , D ) * I ( x, y )

[Koenderink 1984, Lindeberg 1994]

Scale Selection
Experimentally, Maxima of Laplacian-of-Gaussian gives
best notion of scale:

Thus use Laplacian-of-Gaussian (LoG) operator:

G
2 2

Mikolajczyk 2002
Approximate LoG
LoG is expensive, so lets approximate it
Using the heat-diffusion equation:

G G ( k ) G ( )
G =
2

k
Define Difference-of-Gaussians (DoG):
( k 1) 2 2G G ( k ) G ( )
D( ) ( G ( k ) G ( ) ) * I
DoG Efficiency
The smoothed images need to be computed in
any case for feature description.
We need only to subtract two images.
DoB Filter (`Difference of Boxes')
Even faster approximation is using box filters (by
integral image)

Bay et al., ECCV 2006

Integral Image Computation

A B
C D= B+ C - A
Integral Image Usage
Scale-Space Construction
First construct scale-space:

increasing

G ( 2 ) * I

G( ) * I G ( 2k ) * I

G ( k ) * I ( )
G 2k 2 * I
First octave ( )
G k 2 * I Second octave
Difference-of-Gaussianss
Now take differences:
Scale-Space Extrema
Choose all extrema within 3x3x3 neighborhood.
Low cost only several usually checked

(
D k 2 )
D( k )

D( )
SIFT Overview
Detector
1. Find Scale-Space Extrema
2. Keypoint Localization & Filtering
Improve keypoints and throw out bad ones

3. Orientation Assignment
Remove effects of rotation and scale
4. Create descriptor
Using histograms of orientations
Descriptor
Keypoint Localization & Filtering
Now we have much less points than pixels.
However, still lots of points (~1000s)
With only pixel-accuracy at best
At higher scales, this corresponds to several pixels in base
image
And this includes many bad points

Brown & Lowe 2002

Keypoint Localization
The problem:

True Extrema

Detected Extrema

Sampling x
Keypoint Localization
The Solution:
Take Taylor series expansion:

D T 1 T 2 D T
D( x ) = D + x + x 2 x
x 2 x

Minimize to get true location of extrema:

1
D
2
D
x = 2
x x

Brown & Lowe 2002

Keypoints

(a) 233x189 image

(b) 832 DOG extrema
Keypoint Filtering - Low Contrast
Reject points with bad contrast

D( x ) is smaller than 0.03 (image values in [0,1])

Keypoint Filtering - Edges
Reject points with strong edge response in one
direction only
Like Harris - using Trace and Determinant of
Hessian

Point constrained

Point detection
Point detection Point can move along edge
Keypoint Filtering - Edges
To check if ratio of principal curvatures is below some threshold, r, check:

Tr ( H ) (r + 1)
2 2
<
Det ( H ) r
r=10
Only 20 floating points operations to test each keypoint
Keypoint Filtering

(c) 729 left after peak value threshold (from 832)

(d) 536 left after testing ratio of principle curvatures
SIFT Overview
Detector
1. Find Scale-Space Extrema
2. Keypoint Localization & Filtering
Improve keypoints and throw out bad ones

3. Orientation Assignment
Remove effects of rotation and scale
4. Create descriptor
Using histograms of orientations
Descriptor
Ideal Descriptors
Robust to:
Affine transformation
Lighting
Noise
Distinctive
Fast to match
Not too large
Usually L1 or L2 matching
SIFT Overview
Detector
1. Find Scale-Space Extrema
2. Keypoint Localization & Filtering
Improve keypoints and throw out bad ones

3. Orientation Assignment
Remove effects of rotation and scale
4. Create descriptor
Using histograms of orientations
Descriptor
Orientation Assignment
Now we have set of good points
Choose a region around each point
Remove effects of scale and rotation
Orientation Assignment
Use scale of point to choose correct image:

L ( x , y ) = G ( x , y , ) * I ( x, y )
Compute gradient magnitude and orientation
using finite differences:

m ( x, y ) = ( L( x + 1, y ) L( x 1, y ) ) 2 + ( L( x, y + 1) L( x, y 1) ) 2
1 ( L ( x, y + 1) L ( x, y 1) )
( x, y ) = tan
( L( x + 1, y ) L( x 1, y ) )
Orientation Assignment
Create gradient histogram (36 bins)
Weighted by magnitude and Gaussian window ( is 1.5 times
that of the scale of a keypoint)
Orientation Assignment
Any peak within 80% of the highest peak is used
to create a keypoint with that orientation
~15% assigned multiplied orientations, but
contribute significantly to the stability
Finally a parabola is fit to the 3 histogram values
closest to each peak to interpolate the peak
position for better accuracy
SIFT Overview
Detector
1. Find Scale-Space Extrema
2. Keypoint Localization & Filtering
Improve keypoints and throw out bad ones

3. Orientation Assignment
Remove effects of rotation and scale

4. Create descriptor
Using histograms of orientations
Descriptor
SIFT Descriptor
Each point so far has x, y, , m,
Now we need a descriptor for the region
Could sample intensities around point, but
Sensitive to lighting changes
Sensitive to slight errors in x, y,
Look to biological vision
Neurons respond to gradients at certain frequency and
orientation
But location of gradient can shift slightly!

Edelman et al. 1997

SIFT Descriptor
4x4 Gradient window
Histogram of 4x4 samples per window in 8 directions
Gaussian weighting around center( is 0.5 times that of the scale of
a keypoint)
4x4x8 = 128 dimensional feature vector

Image from: Jonas Hurrelmann

SIFT Descriptor Lighting changes
Gains do not affect gradients
Normalization to unit length removes contrast
Saturation affects magnitudes much more than
orientation
Threshold gradient magnitudes to 0.2 and renormalize
Performance
Very robust
80% Repeatability at:
10% image noise
45 viewing angle
1k-100k keypoints in database
Best descriptor in [Mikolajczyk & Schmid 2005]s
extensive survey
3670+ citations on Google Scholar
Typical Usage
For set of database images:
1. Compute SIFT features
2. Save descriptors to database
For query image:
1. Compute SIFT features
2. For each descriptor:
Find a match
3. Verify matches
Geometry
Hough transform
Matching Descriptors
Threshold on Distance bad performance
Nearest Neighbor better
Ratio Test best performance
Matching Descriptors - Distance
L2 norm used by Lowe
SIFTDIST: linear time EMD algorithm that adds
robustness to orientation shifts
Pele and Werman, ECCV 2008
Ratio Test

Image 2 Image 1

False 2nd
best match
Best Match

True 2nd
best match
Fast Nearest-Neighbor Matching to
Feature Database
Hypotheses are generated by approximate nearest neighbor
matching of each feature to vectors in the database
SIFT use best-bin-first (Beis & Lowe, 97) modification to k-d
tree algorithm
Use heap data structure to identify bins in order by their
distance from query point

Result: Can give speedup by factor of 1000 while finding

nearest neighbor (of interest) 95% of the time
3D Object Recognition
Only 3 keys are needed for
recognition, so extra keys
provide robustness
Recognition under occlusion
Test of illumination Robustness
Same image under differing illumination

273 keys verified in final match

Location recognition
Image Registration Results

[Brown & Lowe 2003]

Cases where SIFT didnt work
Large illumination change
Same object under differing illumination
43 keypoints in left image and the corresponding closest
keypoints on the right (1 for each)
Large illumination change
Same object under differing illumination
43 keypoints in left image and the corresponding closest
keypoints on the right (5 for each)
Non rigid deformations
11 keypoints in left image and the corresponding closest
keypoints on the right (1 for each)
Non rigid deformations
11 keypoints in left image and the corresponding closest
keypoints on the right (5 for each)
Conclusion: SIFT
Built on strong foundations
First principles (LoG and DoG)
Biological vision (Descriptor)
Empirical results
Many heuristic optimizations
Rejection of bad points
Sub-pixel level fitting
Thresholds carefully chosen
Conclusion: SIFT
In wide use both in academia and industry
Many available implementations:
Binaries available at Lowes website
C/C++ open source by A. Vedaldi (UCLA)
C# library by S. Nowozin (Tu-Berlin)
Protected by a patent
Conclusion: SIFT
Empirically found2 to show very good performance, robust to
image rotation, scale, intensity change, and to moderate affine
transformations

Scale = 2.5
Rotation = 450

1
Mikolajczyk & Schmid 2005
A note regarding invariance/robustness
There is a tradeoff between invariance and
distinctiveness.
For some tasks it is better not to be invariant
Local features and kernels for classification of
texture and object categories: An in-depth
study - Zhang, Marszalek, Lazebnik and Schmid. IJCV 2007.
11 color names - J. van de Weijer, C. Schmid, Applying
Color Names to Image Description. ICIP 2007
Conclusion: Local features
Much work left to be done
Efficient search and matching
Combining with global methods
Finding better features
SIFT extensions
Color
Color SIFT - G. J. Burghouts and J. M. Geusebroek.
Performance evaluation of local colour invariants.
Comput. Vision Image Understanding, 2009
Hue and Opponent histograms - J. van de Weijer,
C. Schmid. Coloring Local Feature Extraction.
ECCV 2006
11 color names - J. van de Weijer, C. Schmid,
Applying Color Names to Image Description. ICIP 2007
PCA-SIFT
Only change step 4 (creation of descriptor)
Pre-compute an eigen-space for local gradient
patches of size 41x41
2x39x39=3042 elements
Only keep 20 components
A more compact descriptor
In K.Mikolajczyk, C.Schmid 2005 PCA-SIFT
tested inferior to original SIFT
Speed Improvements
SURF - Bay et al. 2006
Approx SIFT - Grabner et al. 2006
GPU implementation - Sudipta N. Sinha et al. 2006
GLOH (Gradient location-orientation
histogram)

SIFT

17 location bins
16 orientation bins
Analyze the 17x16=272-d
eigen-space, keep 128 components

SIFT - The Scale Invariant Feature Transform
No ratings yet
SIFT - The Scale Invariant Feature Transform
62 pages
SIFT - The Scale Invariant Feature Transform
No ratings yet
SIFT - The Scale Invariant Feature Transform
62 pages
Comparis i On
No ratings yet
Comparis i On
68 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
510 pages
computer_vision_2_feature_extraction_3_students
No ratings yet
computer_vision_2_feature_extraction_3_students
105 pages
9-2e. SIFT-21-08-2024
No ratings yet
9-2e. SIFT-21-08-2024
66 pages
SIFT
No ratings yet
SIFT
33 pages
SIFT - Distinctive Image Features From Scale-Invariant Keypoints
No ratings yet
SIFT - Distinctive Image Features From Scale-Invariant Keypoints
16 pages
SIFT Transform
No ratings yet
SIFT Transform
50 pages
Distinctive Image Features From Scale-Invariant Keypoints
No ratings yet
Distinctive Image Features From Scale-Invariant Keypoints
26 pages
Sift Detector and Descriptor: (Scale Invariant Feature Transform)
No ratings yet
Sift Detector and Descriptor: (Scale Invariant Feature Transform)
34 pages
Sift Preprint
No ratings yet
Sift Preprint
28 pages
Scale Invariant Feature Transform by David Lowe Short Explanation of The Approach by Michela Lecca
No ratings yet
Scale Invariant Feature Transform by David Lowe Short Explanation of The Approach by Michela Lecca
22 pages
Scale Invariant Feature Transform (SIFT)
No ratings yet
Scale Invariant Feature Transform (SIFT)
24 pages
Article
No ratings yet
Article
27 pages
Recognition Local Features
No ratings yet
Recognition Local Features
41 pages
Featuredescriptor
No ratings yet
Featuredescriptor
45 pages
Distinctive Image Feature From Scale-Invariant Keypoints: David G. Lowe, 2004
No ratings yet
Distinctive Image Feature From Scale-Invariant Keypoints: David G. Lowe, 2004
27 pages
Scale Invariant Feature Transform: Tom Duerig
No ratings yet
Scale Invariant Feature Transform: Tom Duerig
30 pages
Local Features Tutorial:: (C) 2004 F. Estrada & A. Jepson & D. Fleet
No ratings yet
Local Features Tutorial:: (C) 2004 F. Estrada & A. Jepson & D. Fleet
25 pages
Harris InterestPoints Andfeatures 1
No ratings yet
Harris InterestPoints Andfeatures 1
64 pages
Scale Invariant Feature Transform (SIFT) : CS 763 Ajit Rajwade
No ratings yet
Scale Invariant Feature Transform (SIFT) : CS 763 Ajit Rajwade
52 pages
Illumination Scale Rotation
No ratings yet
Illumination Scale Rotation
16 pages
Document from Sindhu Reddy...??
No ratings yet
Document from Sindhu Reddy...??
94 pages
Recognizing Pictures at An Exhibition Using SIFT
No ratings yet
Recognizing Pictures at An Exhibition Using SIFT
5 pages
CHAP 7 Features Recognition and Classification
No ratings yet
CHAP 7 Features Recognition and Classification
94 pages
Topic: Sift (Scale Invariant Feature Transform) Method For Key Location Detection
No ratings yet
Topic: Sift (Scale Invariant Feature Transform) Method For Key Location Detection
6 pages
Unit II - Chapter 4 - Feature Detection
No ratings yet
Unit II - Chapter 4 - Feature Detection
56 pages
Feature Matching: "What Stuff in The Left Image Matches With Stuff On The Right?"
No ratings yet
Feature Matching: "What Stuff in The Left Image Matches With Stuff On The Right?"
62 pages
Module 3.1 Morphology
No ratings yet
Module 3.1 Morphology
97 pages
Improved SIFT Algorithm Image Matching
No ratings yet
Improved SIFT Algorithm Image Matching
7 pages
Comp Vis Week 5
No ratings yet
Comp Vis Week 5
49 pages
Object Recognition From Local Scale-Invariant Features (SIFT)
No ratings yet
Object Recognition From Local Scale-Invariant Features (SIFT)
24 pages
Bai09 Descriptors
No ratings yet
Bai09 Descriptors
81 pages
Technical Paper Summary Utkarsh Gupta
No ratings yet
Technical Paper Summary Utkarsh Gupta
2 pages
cv2021 Lec2 Features I - 1600 - PDF - Gdrive.vip
No ratings yet
cv2021 Lec2 Features I - 1600 - PDF - Gdrive.vip
68 pages
Computer Vision
No ratings yet
Computer Vision
6 pages
Features
No ratings yet
Features
60 pages
Lecture 5 -Camera calibration
No ratings yet
Lecture 5 -Camera calibration
14 pages
Object Recognition From Local Scale-Invariant Features
No ratings yet
Object Recognition From Local Scale-Invariant Features
8 pages
Feature Detection: Jayanta Mukhopadhyay Dept. of Computer Science and Engg
No ratings yet
Feature Detection: Jayanta Mukhopadhyay Dept. of Computer Science and Engg
54 pages
Object Recognition From Local Scale-Invariant Features
No ratings yet
Object Recognition From Local Scale-Invariant Features
8 pages
Acknowledgement: Sift:Scale Invariant Feature Transform
No ratings yet
Acknowledgement: Sift:Scale Invariant Feature Transform
25 pages
Akash Mahanty 1
No ratings yet
Akash Mahanty 1
18 pages
Object Recognition From Local Scale-Invariant Features
No ratings yet
Object Recognition From Local Scale-Invariant Features
8 pages
ECE181B Proj02 Report
No ratings yet
ECE181B Proj02 Report
12 pages
Feature Description & Extraction: FAST (Features From Accelerated Segment Test)
No ratings yet
Feature Description & Extraction: FAST (Features From Accelerated Segment Test)
11 pages
A Comparison of FAST, SURF, Eigen, Harris, and MSER Features
No ratings yet
A Comparison of FAST, SURF, Eigen, Harris, and MSER Features
6 pages
Sift
No ratings yet
Sift
28 pages
4.01 08 2022 - FeatureDescriptors
No ratings yet
4.01 08 2022 - FeatureDescriptors
46 pages
Scale-Invariant Feature Transform
No ratings yet
Scale-Invariant Feature Transform
19 pages
Asift Asift
No ratings yet
Asift Asift
11 pages
Lecture 4 1 Feature Descriptors
No ratings yet
Lecture 4 1 Feature Descriptors
30 pages
CV#7 SIFT Scale Invariant Feature Transform
No ratings yet
CV#7 SIFT Scale Invariant Feature Transform
70 pages
lecture13
No ratings yet
lecture13
12 pages
An Implementation of SIFT Detector and Descriptor: Andrea Vedaldi University of California at Los Angeles
No ratings yet
An Implementation of SIFT Detector and Descriptor: Andrea Vedaldi University of California at Los Angeles
7 pages
An Implementation of SIFT Detector and Descriptor: Andrea Vedaldi University of California at Los Angeles
No ratings yet
An Implementation of SIFT Detector and Descriptor: Andrea Vedaldi University of California at Los Angeles
7 pages
Scale Estimation and Keypoint Description: Li Yicheng
No ratings yet
Scale Estimation and Keypoint Description: Li Yicheng
10 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Geometric functions in computer aided geometric design
From Everand
Geometric functions in computer aided geometric design
Oscar Ruiz
No ratings yet