0% found this document useful (0 votes)

29 views

Local Features Tutorial:: (C) 2004 F. Estrada & A. Jepson & D. Fleet

This document summarizes a tutorial on local image features for object recognition. It describes the Scale Invariant Feature Transform (SIFT) approach, which detects keypoints that are invariant to changes in scale, rotation, and illumination. SIFT extracts 128-dimensional feature vectors for keypoints based on local image gradients. These features can be matched across images to recognize objects or track features for applications like 3D reconstruction. The tutorial outlines the main steps of SIFT: scale-space extrema detection of keypoints, orientation assignment, and generation of descriptors for matching.

Uploaded by

thanhminh300

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views

Local Features Tutorial:: (C) 2004 F. Estrada & A. Jepson & D. Fleet

Uploaded by

thanhminh300

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

(c) 2004 F. Estrada & A. Jepson & D.

Fleet

Local Features Tutorial:

Nov. 8, 04

Local Features Tutorial References: Matlab SIFT tutorial (from course webpage) Lowe, David G. Distinctive Image Features from Scale Invariant Features, International Journal of Computer Vision, Vol. 60, No. 2, 2004, pp. 91-110

Local Features Tutorial

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Local Features Tutorial

Previous week: recognition

View based models for object

- The problem: Build a model that captures general properties of eye appearance that we can use to identify eyes (though the approach is general, and does not depend on the particular object class). - Generalized model of eye appearance based on PCA. Images taken from same pose and normalized for contrast. - Demonstrated to be useful for classication, key property: the model can nd instances of eyes it has never seen before.

Local Features Tutorial

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Local Features Tutorial

Today: Local features for object recognition - The problem: Obtain a representation that allows us to nd a particular object weve encountered before (i.e. nd Pacos mug as opposed to nd a mug). - Local features based on the appearance of the object at particular interest points. - Features should be reasonably invariant to illumination changes, and ideally, also to scaling, rotation, and minor changes in viewing direction. - In addition, we can use local features for matching, this is useful for tracking and 3D scene reconstruction.

Local Features Tutorial

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Local Features Tutorial

Key properties of a good local feature: - Must be highly distinctive, a good feature should allow for correct object identication with low probability of mismatch. Question: How to identify image locations that are distinctive enough?. - Should be easy to extract. - Invariance, a good local feature should be tolerant to Image noise Changes in illumination Uniform scaling Rotation Minor changes in viewing direction Question: How to construct the local feature to achieve invariance to the above? - Should be easy to match against a (large) database of local features.

Local Features Tutorial

(c) 2004 F. Estrada & A. Jepson & D. Fleet

SIFT features

Scale Invariant Feature Transform (SIFT) is an approach for detecting and extracting local feature descriptors that are reasonably invariant to changes in illumination, image noise, rotation, scaling, and small changes in viewpoint. Detection stages for SIFT features: - Scale-space extrema detection - Keypoint localization - Orientation assignment - Generation of keypoint descriptors. In the following pages well examine these stages in detail.

SIFT features

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Scale-space extrema detection

Interest points for SIFT features correspond to local extrema of dierence-of-Gaussian lters at dierent scales. Given a Gaussian-blurred image L(x, y, ) = G(x, y, ) I (x, y ), where G(x, y, ) = 1/(2 ) exp
2 (x2 +y 2 )/ 2

is a variable scale Gaussian, the result of convolving an image with a dierence-of-Gaussian lter G(x, y, k ) G(x, y, ) is given by D(x, y, ) = L(x, y, k ) L(x, y, )
Scale-space extrema detection

(1)
6

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Which is just the dierence of the Gaussian-blurred images at scales and k .

Figure 1: Diagram showing the blurred images at dierent

scales, and the computation of the dierence-of-Gaussian images (from Lowe, 2004, see ref. at the beginning of the tutorial)

The rst step toward the detection of interest points is the convolution of the image with Gaussian lters at dierent scales, and the generation of dierence-ofGaussian images from the dierence of adjacent blurred images.

Scale-space extrema detection

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Scale-space extrema detection

The convolved images are grouped by octave (an octave corresponds to doubling the value of ), and the value of k is selected so that we obtain a xed number of blurred images per octave. This also ensures that we obtain the same number of dierence-of-Gaussian images per octave. Note: The dierence-of-Gaussian lter provides an approximation to the scale-normalized Laplacian of Gaussian 22G. The dierence-of-Gaussian lter is in eect a tunable bandpass lter.

Scale-space extrema detection

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Scale-space extrema detection

Figure 2: Local extrema detection, the pixel marked is

compared against its 26 neighbors in a 3 3 3 neighborhood that spans adjacent DoG images (from Lowe, 2004)

Interest points (called keypoints in the SIFT framework) are identied as local maxima or minima of the DoG images across scales. Each pixel in the DoG images is compared to its 8 neighbors at the same scale, plus the 9 corresponding neighbors at neighboring scales. If the pixel is a local maximum or minimum, it is selected as a candidate keypoint.
Scale-space extrema detection 9

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Scale-space extrema detection

For each candidate keypoint: - Interpolation of nearby data is used to accurately determine its position. - Keypoints with low contrast are removed - Responses along edges are eliminated - The keypoint is assigned an orientation To determine the keypoint orientation, a gradient orientation histogram is computed in the neighborhood of the keypoint (using the Gaussian image at the closest scale to the keypoints scale). The contribution of each neighboring pixel is weighted by the gradient magnitude and a Gaussian window with a that is 1.5 times the scale of the keypoint. Peaks in the histogram correspond to dominant orientations. A separate keypoint is created for the direction corresponding to the histogram maximum,
Scale-space extrema detection 10

(c) 2004 F. Estrada & A. Jepson & D. Fleet

and any other direction within 80% of the maximum value. All the properties of the keypoint are measured relative to the keypoint orientation, this provides invariance to rotation.

Scale-space extrema detection

(c) 2004 F. Estrada & A. Jepson & D. Fleet

SIFT feature representation

Once a keypoint orientation has been selected, the feature descriptor is computed as a set of orientation histograms on 4 4 pixel neighborhoods. The orientation histograms are relative to the keypoint orientation, the orientation data comes from the Gaussian image closest in scale to the keypoints scale. Just like before, the contribution of each pixel is weighted by the gradient magnitude, and by a Gaussian with 1.5 times the scale of the keypoint.

Figure 3: SIFT feature descriptor (from Lowe, 2004)

SIFT feature representation 12

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Histograms contain 8 bins each, and each descriptor contains an array of 4 histograms around the keypoint. This leads to a SIFT feature vector with 4 4 8 = 128 elements. This vector is normalized to enhance invariance to changes in illumination.

SIFT feature representation

(c) 2004 F. Estrada & A. Jepson & D. Fleet

SIFT feature matching

- Find nearest neighbor in a database of SIFT features from training images. - For robustness, use ratio of nearest neighbor to ratio of second nearest neighbor. - Neighbor with minimum Euclidean distance expensive search. - Use an approximate, fast method to nd nearest neighbor with high probability.

SIFT feature matching

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Recognition using SIFT features

- Compute SIFT features on the input image - Match these features to the SIFT feature database - Each keypoint species 4 parameters: 2D location, scale, and orientation. - To increase recognition robustness: Hough transform to identify clusters of matches that vote for the same object pose. - Each keypoint votes for the set of object poses that are consistent with the keypoints location, scale, and orientation. - Locations in the Hough accumulator that accumulate at least 3 votes are selected as candidate object/pose matches. - A verication step matches the training image for the hypothesized object/pose to the image using a least-squares t to the hypothesized location, scale, and orientation of the object.
Recognition using SIFT features 15

(c) 2004 F. Estrada & A. Jepson & D. Fleet

SIFT matlab tutorial

Gaussian blurred images and Dierence of Gaussian images

Range: [0.11, 0.131] Dims: [959, 2044]

Figure 4: Gaussian and DoG images grouped by octave

SIFT matlab tutorial 16

(c) 2004 F. Estrada & A. Jepson & D. Fleet

SIFT matlab tutorial

Keypoint detection

Figure 5: a) Maxima of DoG across scales. b) Remaining

keypoints after removal of low contrast points. C) Remaining keypoints after removal of edge responses (bottom).
SIFT matlab tutorial 17

(c) 2004 F. Estrada & A. Jepson & D. Fleet

SIFT matlab tutorial

Final keypoints with selected orientation and scale

Figure 6:
orientation.

Extracted keypoints, arrows indicate scale and

SIFT matlab tutorial

(c) 2004 F. Estrada & A. Jepson & D. Fleet

SIFT matlab tutorial

Warped image and extracted keypoints

Figure 7: Warped image and extracted keypoints. The hough transform of matched SIFT features yields
SIFT matlab tutorial 19

(c) 2004 F. Estrada & A. Jepson & D. Fleet

the transformation that aligns the original and warped images:

Computed affine transformation from rotated image to original image: >> disp(aff); 0.7060 -0.7052 128.4230 0.7057 0.7100 -128.9491 0 0 1.0000 Actual transformation from rotated image to original image: >> disp(A); 0.7071 -0.7071 128.6934 0.7071 0.7071 -128.6934 0 0 1.0000

SIFT matlab tutorial

(c) 2004 F. Estrada & A. Jepson & D. Fleet

SIFT matlab tutorial

Matching and alignment of dierent views using local features.

Orignial View Reference View

Range: [0, 1] Dims: [384, 512]

Aligned View

Reference minus Aligned View

Range: [0.0273, 1] Dims: [384, 512]

Range: [0.767, 0.822] Dims: [384, 512]

Two views of Wadham College and ane transformation for alignment.

SIFT matlab tutorial 21

Figure 8:

(c) 2004 F. Estrada & A. Jepson & D. Fleet

SIFT matlab tutorial

Object recognition with SIFT

Image Model Location

Range: [0, 1] Dims: [480, 640]

Range: [0.986, 0.765] Dims: [480, 640]

Image

Model

Location

Range: [0, 1] Dims: [480, 640]

Range: [1.05, 0.866] Dims: [480, 640]

Image

Model

Location

Range: [0, 1] Dims: [480, 640]

Range: [1.07, 1.01] Dims: [480, 640]

Figure 9: Cellphone examples with dierent poses and occlusion.

SIFT matlab tutorial

(c) 2004 F. Estrada & A. Jepson & D. Fleet

SIFT matlab tutorial

Object recognition with SIFT

Image Model Location

Range: [0, 1] Dims: [480, 640]

Range: [0.991, 0.992] Dims: [480, 640]

Image

Model

Location

Range: [0, 1] Dims: [480, 640]

Range: [1.05, 0.963] Dims: [480, 640]

Image

Model

Location

Range: [0, 1] Dims: [480, 640]

Range: [0.988, 1.05] Dims: [480, 640]

Figure 10: Book example, what happens when we match similar

features outside the object?
SIFT matlab tutorial 23

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Closing Comments

- SIFT features are reasonably invariant to rotation, scaling, and illumination changes. - We can use them for matching and object recognition among other things. - Robust to occlusion, as long as we can see at least 3 features from the object we can compute the location and pose. - Ecient on-line matching, recognition can be performed in close-to-real time (at least for small object databases).

Closing Comments

(c) 2004 F. Estrada & A. Jepson & D. Fleet

Closing Comments

Questions: - Do local features solve the object recognition problem? - How about distinctiveness? how do we deal with false positives outside the object of interest? (see Figure 10). - Can we learn new object models without photographing them under special conditions? - How does this approach compare to the object recognition method proposed by Murase and Nayar? Recall that their model consists of a PCA basis for each object, generated from images taken under diverse illumination and viewing directions; and a representation of the manifold described by the training images in this eigenspace (see the tutorial on Eigen Eyes).

Closing Comments

A Survey of Content-Based Image Retrieval Systems Using Scale-Invariant Feature Transform (SIFT)
No ratings yet
A Survey of Content-Based Image Retrieval Systems Using Scale-Invariant Feature Transform (SIFT)
5 pages
SIFT Feature Matching
No ratings yet
SIFT Feature Matching
12 pages
Asift Asift
No ratings yet
Asift Asift
11 pages
Illumination Scale Rotation
No ratings yet
Illumination Scale Rotation
16 pages
SIFT White
No ratings yet
SIFT White
55 pages
Face Recognition Using The Most Representative Sift Images: Issam Dagher, Nour El Sallak and Hani Hazim
No ratings yet
Face Recognition Using The Most Representative Sift Images: Issam Dagher, Nour El Sallak and Hani Hazim
10 pages
CS131 Computer Vision: Foundations and Applications Practice Final (Solution) Stanford University December 11, 2017
No ratings yet
CS131 Computer Vision: Foundations and Applications Practice Final (Solution) Stanford University December 11, 2017
15 pages
A Comparison of FAST, SURF, Eigen, Harris, and MSER Features
No ratings yet
A Comparison of FAST, SURF, Eigen, Harris, and MSER Features
6 pages
Topic: Sift (Scale Invariant Feature Transform) Method For Key Location Detection
No ratings yet
Topic: Sift (Scale Invariant Feature Transform) Method For Key Location Detection
6 pages
Recognizing Pictures at An Exhibition Using SIFT
No ratings yet
Recognizing Pictures at An Exhibition Using SIFT
5 pages
Document from Sindhu Reddy...??
No ratings yet
Document from Sindhu Reddy...??
94 pages
Improved SIFT-Features Matching For Object Recognition: Emails: (Alhwarin, Wang, Ristic, Ag) @iat - Uni-Bremen - de
No ratings yet
Improved SIFT-Features Matching For Object Recognition: Emails: (Alhwarin, Wang, Ristic, Ag) @iat - Uni-Bremen - de
12 pages
Binary Image
No ratings yet
Binary Image
91 pages
Digital Image Processing
No ratings yet
Digital Image Processing
88 pages
CVML Mulakat Notlari
No ratings yet
CVML Mulakat Notlari
8 pages
Experience Implementing An Elliptical Head Tracker in Matlab
No ratings yet
Experience Implementing An Elliptical Head Tracker in Matlab
7 pages
Spatial Feat Embedding
No ratings yet
Spatial Feat Embedding
4 pages
Updated DIP UNIT4-2024
No ratings yet
Updated DIP UNIT4-2024
29 pages
Real Time Doors and Windows Recognition in Opencv Using
No ratings yet
Real Time Doors and Windows Recognition in Opencv Using
8 pages
Experiment 9 IVP - REGION - N292 - GROWING
No ratings yet
Experiment 9 IVP - REGION - N292 - GROWING
5 pages
Assignment 2: Introduction To Machine Learning Prof. B. Ravindran
100% (1)
Assignment 2: Introduction To Machine Learning Prof. B. Ravindran
3 pages
Acknowledgement: Sift:Scale Invariant Feature Transform
No ratings yet
Acknowledgement: Sift:Scale Invariant Feature Transform
25 pages
Computer Vision
No ratings yet
Computer Vision
6 pages
Scale Invariant Feature Transform: Unveiling the Power of Scale Invariant Feature Transform in Computer Vision
From Everand
Scale Invariant Feature Transform: Unveiling the Power of Scale Invariant Feature Transform in Computer Vision
Fouad Sabry
No ratings yet
Project Report On Contour Detection
No ratings yet
Project Report On Contour Detection
10 pages
Sift Detector and Descriptor: (Scale Invariant Feature Transform)
No ratings yet
Sift Detector and Descriptor: (Scale Invariant Feature Transform)
34 pages
Improved SIFT Algorithm Image Matching
No ratings yet
Improved SIFT Algorithm Image Matching
7 pages
Features
No ratings yet
Features
36 pages
Kernel-Based Hand Tracking
No ratings yet
Kernel-Based Hand Tracking
9 pages
Potential-of-SIFT-SURF-KAZE-AKAZE-ORB-BRISK-AGAST-and-7-More-Algorithms-for-Matching-Extremely-Variant-Image-Pairs
No ratings yet
Potential-of-SIFT-SURF-KAZE-AKAZE-ORB-BRISK-AGAST-and-7-More-Algorithms-for-Matching-Extremely-Variant-Image-Pairs
8 pages
CV - Unit 2
No ratings yet
CV - Unit 2
30 pages
Module 3.1 Morphology
No ratings yet
Module 3.1 Morphology
97 pages
Leica Photogrammetry Suite2
No ratings yet
Leica Photogrammetry Suite2
33 pages
Summary of SIFT
No ratings yet
Summary of SIFT
8 pages
Fingerprint Image Enhancement Using Directional Morphological Filter
No ratings yet
Fingerprint Image Enhancement Using Directional Morphological Filter
4 pages
CHAP 7 Features Recognition and Classification
No ratings yet
CHAP 7 Features Recognition and Classification
94 pages
Scale Invariant Feature Transfrom: A Seminar On
No ratings yet
Scale Invariant Feature Transfrom: A Seminar On
8 pages
Image Classification Using Edge Detection Withe Fuzzy Logic: Par: Chettouh Safieddine Sohbi Salem Hammoudi Mabrouk
No ratings yet
Image Classification Using Edge Detection Withe Fuzzy Logic: Par: Chettouh Safieddine Sohbi Salem Hammoudi Mabrouk
16 pages
In-Flight Characterization of Image Spatial Quality Using Point Spread Functions
No ratings yet
In-Flight Characterization of Image Spatial Quality Using Point Spread Functions
38 pages
Sift Preprint
No ratings yet
Sift Preprint
28 pages
Spatial Data Mining: Three Case Studies: Shashi Shekhar, University of Minnesota
No ratings yet
Spatial Data Mining: Three Case Studies: Shashi Shekhar, University of Minnesota
18 pages
Report On Image Enhancement
No ratings yet
Report On Image Enhancement
25 pages
Face Recognition Using SURF Features and SVM Classifier: Bhaskar Anand
No ratings yet
Face Recognition Using SURF Features and SVM Classifier: Bhaskar Anand
8 pages
Vehicle Recognition
No ratings yet
Vehicle Recognition
6 pages
Sift
No ratings yet
Sift
8 pages
Unit IV Visual Realism
100% (2)
Unit IV Visual Realism
73 pages
Classifier Combination For Face Localization in Color Images ICIAP 2005
No ratings yet
Classifier Combination For Face Localization in Color Images ICIAP 2005
8 pages
Object Recognition From Local Scale-Invariant Features
No ratings yet
Object Recognition From Local Scale-Invariant Features
8 pages
Q1 - SIFT - Distinctive Image Features From Scale-Invariant Keypoints
No ratings yet
Q1 - SIFT - Distinctive Image Features From Scale-Invariant Keypoints
20 pages
A Principal Component Regression Strategy For Estimating Motion Eng Opt 2008
No ratings yet
A Principal Component Regression Strategy For Estimating Motion Eng Opt 2008
8 pages
Akash Mahanty 1
No ratings yet
Akash Mahanty 1
18 pages
Dem Generation From High Resolution Satellite Imagery Using Parallel Projection Model
No ratings yet
Dem Generation From High Resolution Satellite Imagery Using Parallel Projection Model
6 pages
Analysis and Classification of Feature Extraction Techniques: A Study
No ratings yet
Analysis and Classification of Feature Extraction Techniques: A Study
6 pages
Gis Lecture Note
No ratings yet
Gis Lecture Note
45 pages
Active Shape Model Segmentation Using Local Edge Structures and Adaboost
No ratings yet
Active Shape Model Segmentation Using Local Edge Structures and Adaboost
9 pages
Type of Vehicle
No ratings yet
Type of Vehicle
4 pages
Midterm Exam: CS 231a Spring 2015-2016 Monday, May 9th, 2016
No ratings yet
Midterm Exam: CS 231a Spring 2015-2016 Monday, May 9th, 2016
10 pages
MFound HW3
No ratings yet
MFound HW3
4 pages
Computer Vision Fundamental Matrix: Please, suggest a subtitle for a book with title 'Computer Vision Fundamental Matrix' within the realm of 'Computer Vision'. The suggested subtitle should not have ':'.
From Everand
Computer Vision Fundamental Matrix: Please, suggest a subtitle for a book with title 'Computer Vision Fundamental Matrix' within the realm of 'Computer Vision'. The suggested subtitle should not have ':'.
Fouad Sabry
No ratings yet
Pyramid Image Processing: Exploring the Depths of Visual Analysis
From Everand
Pyramid Image Processing: Exploring the Depths of Visual Analysis
Fouad Sabry
No ratings yet
Analytical
No ratings yet
Analytical
4 pages
Densitometry and Spectrophotometr Y: Spectrophotometer Response
No ratings yet
Densitometry and Spectrophotometr Y: Spectrophotometer Response
3 pages
WPR3
No ratings yet
WPR3
1 page
Recommended_Textbooks_RTR_Sept2023
No ratings yet
Recommended_Textbooks_RTR_Sept2023
2 pages
Computer Vision NOTES
No ratings yet
Computer Vision NOTES
15 pages
A Survey On Adaptive Edge-Enhanced Color Interpolation Processor - Jaya Praka V.M 254-258
No ratings yet
A Survey On Adaptive Edge-Enhanced Color Interpolation Processor - Jaya Praka V.M 254-258
5 pages
Detailed Lesson Plan in Arts 3
No ratings yet
Detailed Lesson Plan in Arts 3
5 pages
Vision Lec 10
No ratings yet
Vision Lec 10
23 pages
History of Board of Radiologic Technology in The Philippines
No ratings yet
History of Board of Radiologic Technology in The Philippines
4 pages
PC60 Application Software Guide
No ratings yet
PC60 Application Software Guide
22 pages
Introduction To Dr. Ghulam Gilanie Janjua: o o o o o o
No ratings yet
Introduction To Dr. Ghulam Gilanie Janjua: o o o o o o
47 pages
Synod Logo Brand Manual
No ratings yet
Synod Logo Brand Manual
47 pages
Test Method: Chromatic Adaptation
No ratings yet
Test Method: Chromatic Adaptation
4 pages
Tamiya Colour Chart
100% (1)
Tamiya Colour Chart
4 pages
100 Color Combination Ideas and Examples Canva
No ratings yet
100 Color Combination Ideas and Examples Canva
5 pages
Chương 7 - Trắc nghiệm kiến thức - Attempt review
No ratings yet
Chương 7 - Trắc nghiệm kiến thức - Attempt review
12 pages
138 Question DIP
No ratings yet
138 Question DIP
18 pages
CG ch-8 (The Graphic Pipeline)
No ratings yet
CG ch-8 (The Graphic Pipeline)
22 pages
Automatic Number Plate Recognition
No ratings yet
Automatic Number Plate Recognition
5 pages
Vizio E65u-D3 CNET Review Calibration Results
No ratings yet
Vizio E65u-D3 CNET Review Calibration Results
3 pages
Addition and Subtraction (Coloring)
100% (2)
Addition and Subtraction (Coloring)
52 pages
GLSL Tutorial 9 Shader Introduction
No ratings yet
GLSL Tutorial 9 Shader Introduction
14 pages
Vector Vs Pixel
No ratings yet
Vector Vs Pixel
3 pages
Am H606 VM
No ratings yet
Am H606 VM
3 pages
Geneva Color Mixing Formulas v01 (Reformatted 27 Jan 2017)
100% (1)
Geneva Color Mixing Formulas v01 (Reformatted 27 Jan 2017)
2 pages
Rohini 89299003921
No ratings yet
Rohini 89299003921
3 pages
Morphological Reconstruction & Top-Hat and Bottom-Hat (Group-B)
No ratings yet
Morphological Reconstruction & Top-Hat and Bottom-Hat (Group-B)
33 pages
Media Sticker v.200826
No ratings yet
Media Sticker v.200826
2 pages
Color Catalog
No ratings yet
Color Catalog
19 pages
Computer Graphics - Shading
No ratings yet
Computer Graphics - Shading
41 pages