SlideShare a Scribd company logo
28/12/2014 Object Recognition
http://www.cs.rochester.edu/~nelson/research/recognition/recognition.html 1/8
   
Object Recognition Research
Randal C. Nelson
Department of Computer Science
University of Rochester
Appearance­based object recognition methods have
recently demonstrated good performance on a
variety of problems. However, many of these
methods either require good whole­object
segmentation, which severely limits their
performance in the presence of clutter, occlusion, or
background changes; or utilize simple conjunctions of low­level
features, which causes crosstalk problems as the number of objects is
increased. We are investigating an appearance­based object
recognition system using a keyed, multi­level context representation,
that ameliorates many of these problems, and can be used with
complex, curved shapes. Pictures on this page are from a training
database we have used in system tests.
Specifically, we utilize distinctive intermediate­level features in this
case automatically extracted 2­D boundary fragments, as keys, which
are then verified within a local context, and assembled within a loose
global context to evoke an overall percept. The system demonstrates
extraordinarily good recognition of a variety of 3­D shapes, ranging
from sports cars and fighter planes to snakes and lizards with full
orthographic invariance. We have performed a number of large­scale
experiments, involving over 2000 separate test images, that evaluate
performance with increasing number of items in the database, in the
28/12/2014 Object Recognition
http://www.cs.rochester.edu/~nelson/research/recognition/recognition.html 2/8
presence of clutter, background change, and occlusion, and also the
results of some generic classification experiments where the system is
tested on objects never previously seen or modeled. To our
knowledge, the results we report are the best in the literature for full­
sphere tests of general shapes with occlusion and clutter resistance. 
The basic idea is to represent the visual
appearance of an object as a loosely structured
combination of a number of local context
regions keyed by distinctive key features, or
fragments. A local context region can be
thought of as an image patch surrounding the key feature and
containing a representation of other features that intersect the patch.
Now under different conditions (e.g. lighting, background, changes in
orientation etc.) the feature extraction process will find some of these
distinctive keys, but in general not all of them. Also, even with local
contextual verification, such keys may well be consistent with a
number of global hypotheses. However, the fraction that can be found
by existing feature extraction processes is frequently sufficient to
identify objects in the scene, once the global evidence is assembled.
This addresses one of the principle problems of object recognition,
which is that, in any but rather artificial conditions, it has so far
proved impossible to reliably segment whole objects on a bottom­up
basis. In the current system, local features based on automatically
extracted boundary fragments are used to represent multiple 2­D
views (aspects) of rigid 3­D objects, but the basic idea could be
applied to other features and other representations. 
28/12/2014 Object Recognition
http://www.cs.rochester.edu/~nelson/research/recognition/recognition.html 3/8
The basic recognition strategy is to utilize a
database (here viewed as an associative memory)
of key features embedded in local contexts, which
is organized so that access via an unknown key
feature evokes associated hypotheses for the
identity and configuration of all known objects that
could have produced such an embedded feature.
These hypotheses are fed into a second stage
associative memory, keyed by configurations,
which lumps the hypotheses into clusters that are mutually consistent
within a loose global context. This secondary database maintains a
probabilistic estimate of the likelihood of each cluster based on
statistics about the occurrence of the keys in the primary database.
The idea is similar to a multi­dimensional Hough transform without
the space problems. In our case, since 3­D objects are represented by
a set of views, the configurations represent two dimensional
transforms of specific views. Efficient access to the associative
memories is achieved using a hashing scheme on parameters of the
keying features, followed by verification of the local context. As
mentioned above, this local verification step gives the voting features
sufficient power to substantially ameliorate well known problems
with false positives in Hough­like voting schemes. Details on
associative memory
A fundamental component of the approach is the use of distinctive
local features we call keys. A key is any robustly extractable part or
feature that has sufficient information content to specify a
configuration of an associated object plus enough additional, pose­
insensitive (sometimes called semi­invariant) parameters to provide
efficient indexing. The local context amplifies the power of the
feature by providing a means of verification. This local verification
step is critical, because the invariant parameters of the key features
are relatively weak evidence, leading to a proliferation of high­
scoring false hypotheses if used alone. This is a well known problem
28/12/2014 Object Recognition
http://www.cs.rochester.edu/~nelson/research/recognition/recognition.html 4/8
with voting schemes, but can be alleviated if the voting features are
sufficiently powerful. In the current implementation we have utilized
keys based on extracted boundary fragments, both straight and
curved, but the method is by no means limited to such keys, and we
are looking at several complementary feature types. Details on keys
used. 
In order to use the system with an object,
its appearances must be stored in the
associative memory. Currently, this is done
by obtaining a number of uncluttered
images of the object from different
directions. About 100 views are needed to
cover the entire viewing sphere for the
curve­based keys we have used. For each view, key features are
extracted, and a number of the strongest are stored in the memory
with associated information about the object and view that produced
them, and their relationship to an arbitrarily specified 2­D
configuration (position, orientation, scale) for that view.
To recognize an object, that is to answer the question "what object is
in this image?", key features together with their local contexts are
extracted from the image, and fed into the associative memory. All
matches are retrieved, and for each match, the associated information
is used to compute a hypothesis about the identity, view, and
configuration of a possible object. This hypothesis is fed to a second,
"working" associative memory, where current hypotheses are stored.
If any matches are found, the evidence associated with them is
updated to reflect the new information. Otherwise a new hypothesis is
entered. The accumulation is not a flat voting process, but depends on
the frequency of occurrence of the feature over the entire database,
with uncommen features providing more evidence. The evidence
combination scheme is Bayesian if the features are independent (they
are not, but we don't have a better model, and the results are better
28/12/2014 Object Recognition
http://www.cs.rochester.edu/~nelson/research/recognition/recognition.html 5/8
than flat voting). The hypothesis memory is them examined, and the
configuration with the most evidence selected as the most probable
answer. 
To find an object of known characteristics in a
scene, that is to answer the question of the form
"where is the dog in this image?", the same
procedure is followed, except that key feature
matches are filtered on the basis of whether the
came from a view of a dog. This actually provides a rather powerful
mechanisms for partially indexed retrieval, since the filtering can
occur on any combination of attributes that we care to associate with
the features, either in the database, or from the image, e.g. "animal",
or "pink cup". Details of algorithm.
The approach has several advantages. First, because it is based on a
merged percept of local features rather than global properties, the
method is robust to occlusion and background clutter, and does not
require prior segmentation. This is an advantage over systems based
on principal components template analysis, which are sensitive to
occlusion and clutter. Second, entry of objects into the memory is an
active, automatic procedure. Essentially, the system explores the
object visually from different viewpoints, accumulating 2­D views,
until it has seen enough not to mix it up with any other object it
knows about. Third, the method lends itself naturally to multi­modal
recognition. Because there is no single, global structure for the model,
evidence from different kinds of keys can be combined as easily as
evidence from multiple keys of the same type. The only requirement
is that the configuration descriptions evoked by the different keys
have enough common structure to allow evidence combination
procedures to be used. This is an advantage over conventional
alignment techniques, which typically require a prior 3­D model of
the object. Finally, the probabilistic nature of the evidence
combination scheme, coupled with the formal definitions for semi­
28/12/2014 Object Recognition
http://www.cs.rochester.edu/~nelson/research/recognition/recognition.html 6/8
invariance and robustness allow quantitative predictions of the
reliability of the system to be made. 
We have run several large­scale performance tests,
involving, altogether, over 2000 separate test images. In
these experiments we investigate variation in performance
with respect to increasing database size, clutter, and
occlusion. In forced choice experiments using clean test
images from a 24 object database, we obtain 97%
classification accuracy. Performance with 75% clutter and 25%
occlusion is in the 90%+ range. We have developed a statistical
model for predicting the performance in a variety of situations from a
few basic measurements of score distributions for clean test images
and pure clutter. We also ran a generic recognition experiment, where
the system was trained on several objects in each of several several
classes, (e.g. planes, snakes, cars), and asked to classify example
objects from the same generic classes, but not in the training set.
Details of experiments. 
References
Andrea Selinger and Randal C. Nelson, ``A Perceptual Grouping
Hierarchy for Appearance­Based 3D Object Recognition'', Computer
Vision and Image Understanding, vol. 76, no. 1, October 1999, pp.83­
92. Abstract, gzipped postscript (preprint)
Randal C. Nelson and Andrea Selinger ``Large­Scale Tests of a
Keyed, Appearance­Based 3­D Object Recognition System'', Vision
Research, Special issue on computational vision, Vol. 38, No. 15­16,
Aug. 1998. Abstract, gzipped postscript (preprint)
Randal C. Nelson and Andrea Selinger ``A Cubist Approach to Object
Recognition'', International Conference on Computer Vision
28/12/2014 Object Recognition
http://www.cs.rochester.edu/~nelson/research/recognition/recognition.html 7/8
(ICCV98), Bombay, India, January 1998, 614­621. Abstract, gzipped
postscript, also in an extended version with more complete
description of the algorithms, and additional experiments.
Randal C. Nelson, Visual Learning and the Development of
Intelligence, In Early Visual Learning, Shree K. Nayar and Tomaso
Poggio, Editors, Oxford University Press, 1996, 215­236. Abstract,
Randal C. Nelson, ``From Visual Homing to Object Recognition'' , in
Visual Navigation, Yiannis Aloimonos, Editor, Lawrence Earlbaum
Inc, 1996, 218­250. Abstract,
Randal C. Nelson, ``Memory­Based Recognition for 3­D Objects'',
Proc. ARPA Image Understanding Workshop, Palm Springs CA,
February 1996, 1305­1310. Abstract, gzipped postscript
Randal C. Nelson, ``3­D Recognition Via 2­stage Associative
Memory'', University of Rochester, Dept of Computer Science TR
565, January 1995. Abstract, gzipped postscript
Recap of Links in Text
Associative Memory
Key Features
Recognition Algorithm
Full Publication List
Back to research page
28/12/2014 Object Recognition
http://www.cs.rochester.edu/~nelson/research/recognition/recognition.html 8/8
Back to Randal Nelson's home page

More Related Content

DOC
Zhou_HCI_CAVIAR.doc
butest
 
PDF
Www.cs.berkeley.edu kunal
Kunal Kishor Nirala
 
PPTX
deadlock detection using Goldman's algorithm by ANIKET CHOUDHURY
अनिकेत चौधरी
 
PPTX
Object recognition
saniacorreya
 
PDF
Memory based recognition for 3 d object-kunal
Kunal Kishor Nirala
 
PPTX
Object recognition
akkichester
 
PDF
Face Recognition Techniques - An evaluation Study
Eswar Publications
 
PDF
Image Based Facial Recognition
ijtsrd
 
Zhou_HCI_CAVIAR.doc
butest
 
Www.cs.berkeley.edu kunal
Kunal Kishor Nirala
 
deadlock detection using Goldman's algorithm by ANIKET CHOUDHURY
अनिकेत चौधरी
 
Object recognition
saniacorreya
 
Memory based recognition for 3 d object-kunal
Kunal Kishor Nirala
 
Object recognition
akkichester
 
Face Recognition Techniques - An evaluation Study
Eswar Publications
 
Image Based Facial Recognition
ijtsrd
 

Similar to Object recognition kunal (20)

PDF
3 d recognition via 2-d stage associative memory kunal
Kunal Kishor Nirala
 
PPT
Part2
khawarbashir
 
PPTX
Iccv2009 recognition and learning object categories p1 c01 - classical methods
zukun
 
PPT
Mit6870 orsu lecture2
zukun
 
PDF
Object segmentation by alignment of poselet activations to image contours
irisshicat
 
PDF
IRJET - Automatic Attendance Provision using Image Processing
IRJET Journal
 
PDF
Face Recognition System and its Applications
IRJET Journal
 
PDF
Ijarcet vol-2-issue-4-1383-1388
Editor IJARCET
 
PDF
Criminal Identification using Arm7
IRJET Journal
 
PDF
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
zukun
 
PDF
Object recognition with cortex like mechanisms pami-07
dingggthu
 
PDF
Object Capturing In A Cluttered Scene By Using Point Feature Matching
IJERA Editor
 
PDF
Face Recognition and Increased Reality System for Mobile Devices
ijtsrd
 
PDF
Ck36515520
IJERA Editor
 
PDF
Built-in Face Recognition for Smart Phone Devices
IRJET Journal
 
PDF
Lecture 08 larry zitnick - undestanding and describing scenes
mustafa sarac
 
PDF
IRJET - Design and Development of Android Application for Face Detection and ...
IRJET Journal
 
PDF
2D Features-based Detector and Descriptor Selection System for Hierarchical R...
gerogepatton
 
PPTX
Object recognition
Geraldyne Gengania
 
PPTX
Introduction to Object recognition
Ashiq Ullah
 
3 d recognition via 2-d stage associative memory kunal
Kunal Kishor Nirala
 
Iccv2009 recognition and learning object categories p1 c01 - classical methods
zukun
 
Mit6870 orsu lecture2
zukun
 
Object segmentation by alignment of poselet activations to image contours
irisshicat
 
IRJET - Automatic Attendance Provision using Image Processing
IRJET Journal
 
Face Recognition System and its Applications
IRJET Journal
 
Ijarcet vol-2-issue-4-1383-1388
Editor IJARCET
 
Criminal Identification using Arm7
IRJET Journal
 
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
zukun
 
Object recognition with cortex like mechanisms pami-07
dingggthu
 
Object Capturing In A Cluttered Scene By Using Point Feature Matching
IJERA Editor
 
Face Recognition and Increased Reality System for Mobile Devices
ijtsrd
 
Ck36515520
IJERA Editor
 
Built-in Face Recognition for Smart Phone Devices
IRJET Journal
 
Lecture 08 larry zitnick - undestanding and describing scenes
mustafa sarac
 
IRJET - Design and Development of Android Application for Face Detection and ...
IRJET Journal
 
2D Features-based Detector and Descriptor Selection System for Hierarchical R...
gerogepatton
 
Object recognition
Geraldyne Gengania
 
Introduction to Object recognition
Ashiq Ullah
 
Ad

More from Kunal Kishor Nirala (6)

PDF
Object class recognition by unsupervide scale invariant learning - kunal
Kunal Kishor Nirala
 
PDF
An automatic algorithm for object recognition and detection based on asift ke...
Kunal Kishor Nirala
 
PDF
Object oriented-systems-development-life-cycle ppt
Kunal Kishor Nirala
 
PDF
Object oriented and classical software engineering 8th edition v413 hav
Kunal Kishor Nirala
 
PDF
C socket programming for..
Kunal Kishor Nirala
 
PDF
Uml diagram types with e..
Kunal Kishor Nirala
 
Object class recognition by unsupervide scale invariant learning - kunal
Kunal Kishor Nirala
 
An automatic algorithm for object recognition and detection based on asift ke...
Kunal Kishor Nirala
 
Object oriented-systems-development-life-cycle ppt
Kunal Kishor Nirala
 
Object oriented and classical software engineering 8th edition v413 hav
Kunal Kishor Nirala
 
C socket programming for..
Kunal Kishor Nirala
 
Uml diagram types with e..
Kunal Kishor Nirala
 
Ad

Recently uploaded (20)

PDF
FLEX-LNG-Company-Presentation-Nov-2017.pdf
jbloggzs
 
PDF
Principles of Food Science and Nutritions
Dr. Yogesh Kumar Kosariya
 
PDF
EVS+PRESENTATIONS EVS+PRESENTATIONS like
saiyedaqib429
 
PDF
오픈소스 LLM, vLLM으로 Production까지 (Instruct.KR Summer Meetup, 2025)
Hyogeun Oh
 
PPTX
Edge to Cloud Protocol HTTP WEBSOCKET MQTT-SN MQTT.pptx
dhanashri894551
 
PPTX
Fluid Mechanics, Module 3: Basics of Fluid Mechanics
Dr. Rahul Kumar
 
PDF
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
PDF
Unit I Part II.pdf : Security Fundamentals
Dr. Madhuri Jawale
 
PPTX
EE3303-EM-I 25.7.25 electrical machines.pptx
Nagen87
 
PDF
LEAP-1B presedntation xxxxxxxxxxxxxxxxxxxxxxxxxxxxx
hatem173148
 
PPTX
database slide on modern techniques for optimizing database queries.pptx
aky52024
 
PPTX
Victory Precisions_Supplier Profile.pptx
victoryprecisions199
 
PDF
Software Testing Tools - names and explanation
shruti533256
 
PPTX
ANIMAL INTERVENTION WARNING SYSTEM (4).pptx
dodultrongaming
 
PPTX
IoT_Smart_Agriculture_Presentations.pptx
poojakumari696707
 
PDF
settlement FOR FOUNDATION ENGINEERS.pdf
Endalkazene
 
PPTX
Module_II_Data_Science_Project_Management.pptx
anshitanarain
 
PPTX
Module2 Data Base Design- ER and NF.pptx
gomathisankariv2
 
PPTX
Simulation of electric circuit laws using tinkercad.pptx
VidhyaH3
 
PDF
6th International Conference on Artificial Intelligence and Machine Learning ...
gerogepatton
 
FLEX-LNG-Company-Presentation-Nov-2017.pdf
jbloggzs
 
Principles of Food Science and Nutritions
Dr. Yogesh Kumar Kosariya
 
EVS+PRESENTATIONS EVS+PRESENTATIONS like
saiyedaqib429
 
오픈소스 LLM, vLLM으로 Production까지 (Instruct.KR Summer Meetup, 2025)
Hyogeun Oh
 
Edge to Cloud Protocol HTTP WEBSOCKET MQTT-SN MQTT.pptx
dhanashri894551
 
Fluid Mechanics, Module 3: Basics of Fluid Mechanics
Dr. Rahul Kumar
 
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
Unit I Part II.pdf : Security Fundamentals
Dr. Madhuri Jawale
 
EE3303-EM-I 25.7.25 electrical machines.pptx
Nagen87
 
LEAP-1B presedntation xxxxxxxxxxxxxxxxxxxxxxxxxxxxx
hatem173148
 
database slide on modern techniques for optimizing database queries.pptx
aky52024
 
Victory Precisions_Supplier Profile.pptx
victoryprecisions199
 
Software Testing Tools - names and explanation
shruti533256
 
ANIMAL INTERVENTION WARNING SYSTEM (4).pptx
dodultrongaming
 
IoT_Smart_Agriculture_Presentations.pptx
poojakumari696707
 
settlement FOR FOUNDATION ENGINEERS.pdf
Endalkazene
 
Module_II_Data_Science_Project_Management.pptx
anshitanarain
 
Module2 Data Base Design- ER and NF.pptx
gomathisankariv2
 
Simulation of electric circuit laws using tinkercad.pptx
VidhyaH3
 
6th International Conference on Artificial Intelligence and Machine Learning ...
gerogepatton
 

Object recognition kunal