0
0
Course H02A5A/H02K5A • The course does not assume prior imaging experience, image
processing, or graphics
Course Holder: Dirk Vandermeulen
[email protected] PRIMARY TEXT
Teaching Assistant: Jeroen Bertels
[email protected]
1 2 3
SECONDARY TEXT
4 5 6
10 11 12
WHY VISION?
• Images and video are everywhere! THE GOAL OF COMPUTER VISION HUMAN PERCEPTION HAS ITS SHORTCOMINGS…
To bridge the gap between pixels and “meaning”
Leonardo da Vinci,
16th Century AD Stimulus Response
This work is in the public domain
No response Response Cat image by CNXOpenStax is licensed
under CCBY4.0; changes made
This work is in the public domain (end point)
Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !16 4/3/2018 Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !17 4/3/2018 Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !18 4/3/2018
16 17 18
2 ½-D sketch 3-D model
Input image Edge image
• Generalized Cylinder • Pictorial Structure
Brooks & Binford, 1979 Fischler and Elschlager, 1973
This image is CC01.0 public domain This image is CC01.0 public domain
Image is public
domain
David Lowe, 1987
Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !22 4/3/2018 Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !23 4/3/2018 Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !24 4/3/2018
22 23 24
Train
Person
frequency
Airplane
Image is public domain
Deformable Part Model
Image is public domain
Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !25 4/3/2018 Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !26 4/3/2018 Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !27 4/3/2018
25 26 27
Steel drum Steel drum
The Image Classification Challenge: The Image Classification Challenge:
1,000 object classes 1,000 object classes
www.image-net.org
1,431,167 images 1,431,167 images
Deng, Dong, Socher, Li, Li, & Fei-Fei, 2009 Russakovsky et al. IJCV 2015 Russakovsky et al. IJCV 2015
Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !28 4/3/2018 Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !29 4/3/2018 Fei-Fei Li & Justin Johnson & Serena Yeung Lecture 1 - !30 4/3/2018
28 29 30
• The next slides show some examples of what current vision systems
can do
Scene
reconstruction
NASA Mars Rover
Photo Explorer
Input Relative camera positions and
orientations
photographs Point cloud
Sparse correspondence
Pollefeys et al. Goesele et al. System for interactive browsing and exploring large collections of photos of a scene.
Computes viewpoint of each photo as well as a sparse 3d model of the scene.
31 32 33
LaneHawk by EvolutionRobotics
“A smart camera is flush-mounted in the checkout lane, continuously watching for
“How the Afghan Girl was Identified by Her Iris Patterns” Read the story
© Computer Vision Course CSE 576 (S. Seitz, R. Szeliski) © Computer Vision Course CSE 576 (S. Seitz, R. Szeliski) © Computer Vision Course CSE 576 (S. Seitz, R. Szeliski)
40 41 42
• Mobileye
• Vision systems currently in high-end BMW, GM, Volvo models
• By 2010: 70% of car manufacturers. Segmentation, detection, recognition
Slide content courtesy of Amnon Shashua
• Video demo © Computer Vision Course CSE 576 (S. Seitz, R. Szeliski) © Computer Vision Course CSE 576 (S. Seitz, R. Szeliski)
46 47 48
NASA'S Mars Exploration Rover Spirit captured this westward view from atop
a low plateau where Spirit spent the closing months of 2007.
MICRO21 uses NeuralVision®, a form of artificial intelligence, to locate,
pre-classify, and display up to 200 white blood cells. It’s all automatic you
Vision systems (JPL) used for several tasks don’t even have to be present. All that’s left is for you to review the results
and print the report.
• Panorama stitching
NASA’s Mars Spirit Rover http://www.robocup.org/
• 3D terrain modeling http://en.wikipedia.org/wiki/Spirit_rover
• Obstacle detection, position tracking
• For more, read “Computer Vision on Mars” by Matthies et al.
© Computer Vision Course CSE 576 (S. Seitz, R. Szeliski) © Computer Vision Course CSE 576 (S. Seitz, R. Szeliski)
49 50 51
BONE-AGE/SKELETAL MATURITY
DETERMINATION USING HAND X-RAYS
BONE-AGEING USING HAND X-RAYS DENTAL AGE ESTIMATION
Age-ranked radial bones (2 individuals) from a reference data base
52
f i g u r e 1 – r E P r E SE N TAT I VE E x A M
LOW Er L E F T T h I r d MOL Ar S I N E A
d EV E LO PME N TA L STAG E.
T h i rd m o la rs a re d e p i c te d wi th i n
b o u n d i n g b ox.
52 53 54
T h e lowe r lef t t h i rd m o la r wa s s u
b e i n sta g e 2 , b u t i t wa s ex p e c te
so f twa re wo u ld n ot b e a b le to re
stage due to its severe (out of p
Multiple Sclerosis Lesion Quantification
T1 grey matter
CHROMOSOME CLASSIFICATION EXAMPLE: CHROMOSOME CLASSIFICATION
T2
PD
white matter
automatic
A typical solution to an image analysis or object recognition problem often consists of three
parts:
1. Image preprocessing and segmentation: enhancement of characteristic features, removal of
artifacts.
MS lesions
2. Feature extraction: calculation of image object properties.used in the recognition step.
3. Model matching: matching of image objects to model database
Note that not necessarily every object recognition strategy involves all these three steps
consecutively.
55 56 57
COMPUTER VISION
FEATURE
EXTRACTION SYLLABUS OVERVIEW IMAGE FILTERING
• Introduction
Segmentation/Delineation
• Image Filtering
• Object Detection
• Face Detection
• Instance Recognition
• Object Recognition
206 Computer Vision: Algorithms and Applications (September 3, 2010 draft)
58 59 60
Tutorial Computing
Supervised Unsupervised Semi-Supervised Reinforcement
Learning Learning Learning Learning
[http://poseidon.csd.auth.gr/LAB_RESEARCH/Latest/imgs/
SpeakDepVidIndex_img2.jpg]
Sensory Augmented
Group video frames into shots
[Figure by J. Shi]
andRecognition
[Figure by Wang & Suter]
Figure-ground
Object
Perceptual
1980s 1990s to early 2000s 2000-2010…
Visual
[Figure by Grauman & Darrell]
Computer Vision
Object-level grouping
Course 6.S191: Lex Fridman: January
K. Grauman, B. Leibe References: [81] Intro to Deep Learning [email protected] 2017
64 65 66
Course 6.S191: Lex Fridman: January Course 6.S191: Lex Fridman: January Course 6.S191: Lex Fridman: January
Reference: https://www.youtube.com/watch?v=_33K1zTtoow Intro to Deep Learning [email protected] 2017 References: [66, 69, 89] Intro to Deep Learning [email protected] 2017 References: [81, 89] Intro to Deep Learning [email protected] 2017
67 68 69
Famous Computer Vision Datasets Let’s Build an Image Classifier for CIFAR-10 Why Deep Learning? Scalable Machine Learning