Computer Vision

Computer vision is the study of how to extract information from images and videos to understand and interact with the visual world. It has many useful applications such as 3D reconstruction, object recognition, and automated surveillance. The field involves understanding both the physics of image formation and human visual perception. Key challenges include segmentation, tracking objects over time, and building representations of 3D scenes from 2D images.

Uploaded by

Harish Paruchuri

Why study Computer Vision?

• Images and movies are everywhere

• Fast-growing collection of useful applications
– building representations of the 3D world from pictures
– automated surveillance (who’s doing what)
– movie post-processing
– face finding
• Various deep and attractive scientific mysteries
– how does object recognition work?
• Greater understanding of human vision
Properties of Vision

• One can “see the future”

– Cricketers avoid being hit in the head
• There’s a reflex --- when the right eye sees something going
left, and the left eye sees something going right, move your
head fast.
– Gannets pull their wings back at the last moment
• Gannets are diving birds; they must steer with their wings, but
wings break unless pulled back at the moment of contact.
• Area of target divided by the rate of change of area gives time to contact (up to a constant factor).
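
The time-to-contact cue can be made concrete with a small sketch. It assumes a constant closing speed and an image area that scales as 1/Z² with distance Z, in which case A / (dA/dt) works out to half the time to contact. The function name `time_to_contact` and the finite-difference estimate of the rate are illustrative choices, not something given in the slides.

```python
def time_to_contact(area_prev, area_curr, dt):
    """Estimate time to contact from two image-area measurements taken dt apart.

    Assumes constant closing speed and image area proportional to 1/Z^2,
    so that time to contact = 2 * A / (dA/dt).
    """
    rate = (area_curr - area_prev) / dt  # finite-difference estimate of dA/dt
    if rate <= 0:
        return float("inf")  # target is not growing in the image: no contact predicted
    return 2.0 * area_curr / rate


# Hypothetical target closing at 1 unit/s, seen at distances 10.0 and 9.99.
ttc = time_to_contact(1.0 / 10.0**2, 1.0 / 9.99**2, dt=0.01)
```

Note that the estimate needs only image measurements: neither the true size of the target nor its distance appears in the formula.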
Properties of Vision

• 3D representations are easily constructed

– There are many different cues.
– Useful
• to humans (avoid bumping into things; planning a grasp; etc.)
• in computer vision (build models for movies).
– Cues include
• multiple views (motion, stereopsis)
• texture
• shading
Properties of Vision

• People draw distinctions between what is seen

– “Object recognition”
– This could mean “is this a fish or a bicycle?”
– It could mean “is this George Washington?”
– It could mean “is this poisonous or not?”
– It could mean “is this slippery or not?”
– It could mean “will this support my weight?”
– Great mystery
• How to build programs that can draw useful distinctions based
on image properties.
Part I: The Physics of Imaging

• How images are formed

– Cameras
• What a camera does
• How to tell where the camera was
– Light
• How to measure light
• What light does at surfaces
• How the brightness values we see in cameras are determined
– Color
• The underlying mechanisms of color
• How to describe it and measure it
Part II: Early Vision in One Image

• Representing small patches of image

– For three reasons
• We wish to establish correspondence between (say) points in
different images, so we need to describe the neighborhood of
the points
• Sharp changes are important in practice --- known as “edges”
• Representing texture by giving some statistics of the different
kinds of small patch present in the texture.
– Tigers have lots of bars, few spots
– Leopards are the other way
Representing an image patch

• Filter outputs
– essentially form a dot-product between a pattern and an image,
while shifting the pattern across the image
– strong response -> image locally looks like the pattern
– e.g. derivatives measured by filtering with a kernel that looks like a
big derivative (bright bar next to dark bar)
[Figure: an image, a derivative kernel, and the result of convolving the image with that kernel]
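
As a minimal sketch of the filtering idea, the code below slides a small derivative-like pattern across an image and takes a dot product at each position. The helper name `correlate_valid`, the toy image, and the kernel values are assumptions made for illustration. Strictly speaking, convolution flips the kernel before sliding it; this sketch does not flip, which matches the dot-product description.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def correlate_valid(image, kernel):
    """Dot product of the kernel with every image window it fully overlaps."""
    windows = sliding_window_view(image, kernel.shape)  # shape (H', W', kh, kw)
    return np.einsum("ijkl,kl->ij", windows, kernel)


# Toy image: dark on the left, bright on the right (a vertical edge).
image = np.zeros((5, 6))
image[:, 3:] = 1.0

# Horizontal derivative pattern: dark bar next to bright bar.
kernel = np.array([[-1.0, 0.0, 1.0]])

response = correlate_valid(image, kernel)  # strong only where the edge is
```

The response is zero in the flat regions and positive at the window positions that straddle the edge, which is the "strong response means the image locally looks like the pattern" behaviour.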

Texture

• Many objects are distinguished by their texture

– Tigers, cheetahs, grass, trees
• We represent texture with statistics of filter outputs
– For tigers, bar filters at a coarse scale respond strongly
– For cheetahs, spots at the same scale
– For grass, long narrow bars
– For the leaves of trees, extended spots
• Objects with different textures can be segmented
• The variation in textures is a cue to shape
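
One hedged way to read "statistics of filter outputs" is to take the mean squared response of each filter in a small bank. The two-filter bank, the striped test image, and the names `filter_response` and `texture_descriptor` below are illustrative assumptions; practical systems use many more filters at several scales and orientations.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def filter_response(image, kernel):
    """Dot product of the kernel with every image window it fully overlaps."""
    windows = sliding_window_view(image, kernel.shape)
    return np.einsum("ijkl,kl->ij", windows, kernel)


def texture_descriptor(image, kernels):
    """Mean squared response of each filter: one number per filter."""
    return np.array([np.mean(filter_response(image, k) ** 2) for k in kernels])


horizontal_change = np.array([[-1.0, 1.0]])   # responds to vertical bars
vertical_change = np.array([[-1.0], [1.0]])   # responds to horizontal bars

# Synthetic texture made of vertical stripes.
stripes = np.zeros((8, 8))
stripes[:, ::2] = 1.0

descriptor = texture_descriptor(stripes, [horizontal_change, vertical_change])
```

For this striped image the first entry is large and the second is zero, so the descriptor separates vertical stripes from horizontal ones even though both images have the same mean brightness.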
Shape from texture
Part III: Early Vision in Multiple Images

• The geometry of multiple views

– Where could a point appear in camera 2 (3, etc.) given where it appeared in
camera 1 (1 and 2, etc.)?
• Stereopsis
– What we know about the world from having 2 eyes
• Structure from motion
– What we know about the world from having many eyes
• or, more commonly, our eyes moving.
Part IV: Mid-Level Vision

• Finding coherent structure so as to break the image or movie into big units
– Segmentation:
• Breaking images and videos into useful pieces
• E.g. finding video sequences that correspond to one shot
• E.g. finding image components that are coherent in internal
appearance
– Tracking:
• Keeping track of a moving object through a long sequence of
views
Part V: High Level Vision (Geometry)

• The relations between object geometry and image geometry
– Model based vision
• find the position and orientation of known objects
– Smooth surfaces and outlines
• how the outline of a curved object is formed, and what it looks
like
– Aspect graphs
• how the outline of a curved object moves around as you view it
from different directions
– Range data
Part VI: High Level Vision (Probabilistic)
• Using classifiers and probability to recognize objects
– Templates and classifiers
• how to find objects that look the same from view to view with
a classifier
– Relations
• break up objects into big, simple parts, find the parts with a
classifier, and then reason about the relationships between the
parts to find the object.
– Geometric templates from spatial relations
• extend this trick so that templates are formed from relations
between much smaller parts
3D Reconstruction from multiple views

• Multiple views arise from

– stereo
– motion
• Strategy
– “triangulate” from distinct measurements of the same thing
• Issues
– Correspondence: which points in the images are projections of the
same 3D point?
– The representation: what do we report?
– Noise: how do we get stable, accurate reports?
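
The triangulation strategy can be sketched with the standard linear (DLT) method, assuming the correspondence problem is already solved and both 3x4 camera matrices are known. The camera matrices, the test point, and the function names are hypothetical values chosen for illustration. The slides do not say which triangulation method they have in mind.

```python
import numpy as np


def project(P, X):
    """Project a 3D point X with a 3x4 camera matrix P to image coordinates."""
    x = P @ np.append(X, 1.0)
    return x[:2] / x[2]


def triangulate(P1, P2, x1, x2):
    """Linear triangulation: each view contributes two equations A @ X = 0."""
    A = np.stack([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]            # right singular vector for the smallest singular value
    return X[:3] / X[3]   # homogeneous to Euclidean coordinates


# Two hypothetical cameras one unit apart along the x axis.
P1 = np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = np.hstack([np.eye(3), np.array([[-1.0], [0.0], [0.0]])])

X_true = np.array([0.5, 0.2, 4.0])
X_est = triangulate(P1, P2, project(P1, X_true), project(P2, X_true))
```

With noise-free measurements the point is recovered to numerical precision. With noisy measurements the two rays no longer meet exactly and the SVD returns a least-squares compromise, which is where the stability question raised above comes in.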
Part VII: Some Applications in Detail

• Finding images in large collections

– searching for pictures
– browsing collections of pictures
• Image based rendering
– often very difficult to produce models that look like real objects
• surface weathering, etc., create details that are hard to model
• Solution: make new pictures from old
Some applications of recognition

• Digital libraries
– Find me the pic of JFK and Marilyn Monroe embracing
– NCMEC (National Center for Missing & Exploited Children)
• Surveillance
– Warn me if there is a mugging in the grove
• HCI
– Do what I show you
• Military
– Shoot this, not that
What are the problems in recognition?
• Which bits of image should be recognised together?
– Segmentation.
• How can objects be recognised without focusing on detail?
– Abstraction.
• How can objects with many free parameters be
recognised?
– No popular name, but it’s a crucial problem anyhow.
• How do we structure very large modelbases?
– again, no popular name; abstraction and learning come into this
History
History-II
Segmentation

• Which image components “belong together”?

• Belong together = lie on the same object
• Cues
– similar colour
– similar texture
– not separated by contour
– form a suggestive shape when assembled
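
The "similar colour" cue alone can be sketched as clustering pixel values. The code below is a deliberately small two-cluster k-means on grey-level intensity, with the centres initialised at the darkest and brightest pixel so the result is deterministic. The function name `two_means` and the toy image are assumptions; the slides do not prescribe a clustering method, and real segmenters combine several of the cues listed above.

```python
import numpy as np


def two_means(values, iterations=10):
    """Split pixel intensities into two groups by alternating assign/update steps."""
    values = np.asarray(values, dtype=float)
    centers = np.array([values.min(), values.max()])  # deterministic start
    for _ in range(iterations):
        # Assign each pixel to the nearer centre.
        labels = np.abs(values[..., None] - centers).argmin(axis=-1)
        # Move each centre to the mean of its assigned pixels.
        for k in range(2):
            if np.any(labels == k):
                centers[k] = values[labels == k].mean()
    return labels, centers


# Toy image: a dark region on the left, a bright region on the right.
image = np.array([[0.10, 0.12, 0.90, 0.88],
                  [0.11, 0.09, 0.91, 0.89]])
labels, centers = two_means(image)
```

Pixels with similar intensity end up with the same label, giving a left/right split of the toy image. Note that nothing here uses position, texture, or contours.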
Computer Vision - A Modern Approach
Set: Introduction to Vision
Slides by D.A. Forsyth
Matching templates

• Some objects are 2D patterns

– e.g. faces
• Build an explicit pattern matcher
– discount changes in illumination by using a parametric model
– changes in background are hard
– changes in pose are hard
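
A common way to discount a simple illumination change is normalized cross-correlation, which is unaffected by a gain and offset applied to the image brightness. The sketch below assumes that affine brightness model, which may be simpler than the parametric model the slides refer to. The random test scene and the function name are illustrative.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def normalized_cross_correlation(image, template):
    """Score in [-1, 1] for every window; invariant to brightness gain and offset."""
    windows = sliding_window_view(image, template.shape)
    w = windows - windows.mean(axis=(2, 3), keepdims=True)  # remove window offset
    t = template - template.mean()                          # remove template offset
    numerator = np.einsum("ijkl,kl->ij", w, t)
    denominator = np.sqrt((w ** 2).sum(axis=(2, 3)) * (t ** 2).sum())
    return numerator / np.maximum(denominator, 1e-12)       # guard flat windows


rng = np.random.default_rng(0)
scene = rng.random((20, 20))
template = scene[5:10, 7:12].copy()   # pattern cut from row 5, column 7

brighter_scene = 2.0 * scene + 0.5    # simulated illumination change
scores = normalized_cross_correlation(brighter_scene, template)
best = np.unravel_index(scores.argmax(), scores.shape)
```

The best score still falls at the location the template was cut from, despite the brightness change. Changes in background and pose are not handled by this normalization, consistent with the slide's remark that they are hard.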
Relations between templates

• e.g. find faces by

– finding eyes, nose, mouth
– finding assembly of the three that has the “right” relations
Representing the 3D world

• Assemblies of primitives
– fit parametric forms
– Issues
• what primitives?
• uniqueness of representation
• few objects are actual primitives
• Indexed collection of images
– use interpolation to predict appearance between images
– Issues
• occlusion is a mild nuisance
• structuring the collection can be tricky
People
• Skin is characteristic; clothing hard to segment
– hence, people wearing little clothing
• Finding body segments:
– finding skin-like (color, texture) regions that have nearly straight,
nearly parallel boundaries
• Grouping process constructed by hand, tuned by hand
using small dataset.
• When a sufficiently large group is found, assert a person is
present
Returned data set
Tracking

• Use a model to predict next position and refine using next image
• Model:
– simple dynamic models (second order dynamics)
– kinematic models
– etc.
• Face tracking and eye tracking now work rather well
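
The predict-then-refine loop can be illustrated with a one-dimensional Kalman filter under a constant-velocity model. The Kalman filter is a choice made for this sketch, not something the slides name, and the noise variances, the initial covariance, and the synthetic measurements are all assumed values.

```python
import numpy as np


def track_constant_velocity(measurements, dt=1.0, process_var=1e-4, meas_var=0.1):
    """Track [position, velocity] from position measurements, one per frame."""
    F = np.array([[1.0, dt], [0.0, 1.0]])   # dynamic model: position += velocity * dt
    H = np.array([[1.0, 0.0]])              # only position is observed
    Q = process_var * np.eye(2)             # assumed process noise
    R = np.array([[meas_var]])              # assumed measurement noise
    x = np.zeros(2)                         # initial state guess
    P = 100.0 * np.eye(2)                   # large initial uncertainty
    estimates = []
    for z in measurements:
        # Predict the next state from the model.
        x = F @ x
        P = F @ P @ F.T + Q
        # Refine the prediction using the new measurement.
        residual = z - H @ x
        S = H @ P @ H.T + R
        K = P @ H.T @ np.linalg.inv(S)
        x = x + K @ residual
        P = (np.eye(2) - K @ H) @ P
        estimates.append(x.copy())
    return np.array(estimates)


# Synthetic object moving 2 units per frame, measured without noise.
positions = 2.0 * np.arange(30)
estimates = track_constant_velocity(positions)
```

After a few frames the filter has inferred the velocity, which was never measured directly, and its predictions land close to each new measurement. In an image tracker the measurement would come from searching the next frame near the predicted position.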
The nasty likelihood
