Computer Vision
Computer Vision
• Filter outputs
– essentially form a dot-product between a pattern and an image,
while shifting the pattern across the image
– strong response -> image locally looks like the pattern
– e.g. derivatives measured by filtering with a kernel that looks like a
big derivative (bright bar next to dark bar)
Convolve this image To get this
• Digital libraries
– Find me the pic of JFK and Marilyn Monroe embracing
– NCMEC
• Surveillance
– Warn me if there is a mugging in the grove
• HCI
– Do what I show you
• Military
– Shoot this, not that
What are the problems in recognition?
• Which bits of image should be recognised together?
– Segmentation.
• How can objects be recognised without focusing on detail?
– Abstraction.
• How can objects with many free parameters be
recognised?
– No popular name, but it’s a crucial problem anyhow.
• How do we structure very large modelbases?
– again, no popular name; abstraction and learning come into this
History
History-II
Segmentation
• Assemblies of primitives
– fit parametric forms
– Issues
• what primitives?
• uniqueness of representation
• few objects are actual primitives
• Indexed collection of images
– use interpolation to predict appearance between images
– Issues
• occlusion is a mild nuisance
• structuring the collection can be tricky
People
• Skin is characteristic; clothing hard to segment
– hence, people wearing little clothing
• Finding body segments:
– finding skin-like (color, texture) regions that have nearly straight,
nearly parallel boundaries
• Grouping process constructed by hand, tuned by hand
using small dataset.
• When a sufficiently large group is found, assert a person is
present
Returned data set
Tracking