
DD2423 Image Analysis and Computer Vision

IMAGE FORMATION
Mårten Björkman

Computational Vision and Active Perception


School of Computer Science and Communication

November 8, 2013

1
Image formation

Goal: Model the image formation process


• Image acquisition
• Perspective projection
– properties
– approximations
• Homogeneous coordinates
• Sampling
• Image warping

2
Image formation

Image formation is a physical process that captures scene illumination through a lens system and relates the measured energy to a signal.

3
Basic concepts

• Irradiance E: Amount of light falling on a surface, in power per unit area (watts per square meter). If the surface tilts away from the light, the same amount of light strikes a bigger surface (foreshortening → less irradiance).
• Radiance L: Amount of light radiated from a surface, in power per unit area per unit solid angle. Informally: "brightness".
• Image irradiance E is proportional to scene radiance L.

4
Light source examples

Left: forest image, with the sun behind the observer (left panel) and the sun opposite the observer (right panel).
Right: field with a rough surface, with the sun behind the observer (left panel) and the sun opposite the observer (right panel).
5
Digital imaging

Image irradiance E × area × exposure time → intensity

• Sensors read the light intensity, which may be filtered through color filters, and digital memory devices store the image either in an RGB color space or as raw data.
• An image is discretized: sampled on a discrete 2D grid → an array of color values.

6
Image acquisition - From world point to pixel

• World points are projected onto a camera sensor chip.
• Camera sensors sample the irradiance to compute energy values.
• Positions in camera coordinates (in mm) are converted to image coordinates
(in pixels) based on the intrinsic parameters of the camera:
- size of each sensor element,
- aspect ratio of the sensor (xsize/ysize),
- number of sensor elements in total,
- image center of sensor chip relative to the lens system.

7
Steps in a typical image processing system

• Image acquisition: Capturing visual data with a vision sensor.
• Discretization/digitization - Quantization - Compression: Converting data into discrete form; compressing for efficient storage/transmission.
• Image enhancement: Improving image quality (low contrast, blur, noise).
• Image segmentation: Partitioning an image into objects or constituent parts.
• Feature detection: Extracting pertinent features from an image that are important for differentiating one class of objects from another.
• Image representation: Assigning labels to an object based on information provided by descriptors.
• Image interpretation: Assigning meaning to image information.

8
Pinhole camera or “Camera Obscura”

9
Pinhole camera and perspective projection

• A mapping from a three-dimensional (3D) world onto a two-dimensional (2D) plane, as in the previous example, is called perspective projection.
• A pinhole camera is the simplest imaging device that captures the geometry of perspective projection.
• Rays of light enter the camera through an infinitesimally small aperture.
• The intersection of the light rays with the image plane forms the image of the object.

10
Perspective projection

11
Pinhole camera - Perspective geometry

[Figure: pinhole geometry. The image plane lies at focal length f in front of the optical center, with the optical axis along Z. A world point P = (X, Y, Z) (world coordinates) projects to p = (x, y, f) (image coordinates).]

• The image plane is usually modeled in front of the optical center.
• The coordinate systems in the world and in the image domain are parallel. The optical axis is perpendicular to the image plane.

12
Lenses

• Purpose: gather light from a larger opening (aperture).
• Problem: only light rays from points on the focal plane intersect the same point on the image plane.
• Result: blurring in front of or behind the focal plane.
• Focal depth: the range of distances with acceptable blurring.

13
Imaging geometry - Basic camera models

• Perspective projection (general camera model)
  All visual rays converge to a common point - the focal point.
• Orthographic projection (approximation: distant objects, center of view)
  All visual rays are perpendicular to the image plane.

[Figure: perspective projection, with all rays converging at the focal point, vs. orthographic projection, with all rays perpendicular to the image plane.]

14
Projection equations

[Figure: projection geometry seen from the side, showing image coordinate y at focal length f for a world point at (Y, Z).]

• Perspective mapping
$$\frac{x}{f} = \frac{X}{Z}, \qquad \frac{y}{f} = \frac{Y}{Z}$$
• Orthographic projection
$$x = X, \qquad y = Y$$
• Scaled orthography - $Z_0$ constant (representative depth)
$$\frac{x}{f} = \frac{X}{Z_0}, \qquad \frac{y}{f} = \frac{Y}{Z_0}$$
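These three mappings are simple enough to sketch directly in code; below is a minimal illustration (assuming NumPy; the helper names are mine, not from the course):

```python
import numpy as np

def perspective(P, f):
    """Perspective mapping: x = f*X/Z, y = f*Y/Z."""
    X, Y, Z = P
    return np.array([f * X / Z, f * Y / Z])

def orthographic(P):
    """Orthographic projection: depth is simply dropped."""
    X, Y, _ = P
    return np.array([X, Y])

def scaled_orthographic(P, f, Z0):
    """Scaled orthography: every point shares one representative depth Z0."""
    X, Y, _ = P
    return np.array([f * X / Z0, f * Y / Z0])

P = (3.0, -2.0, 8.0)          # world point in camera coordinates (meters)
print(perspective(P, f=480))  # [ 180. -120.]
```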

15
Perspective transformation

• A perspective transformation has three components:
- Rotation - from world to camera coordinate system
- Translation - from world to camera coordinate system
- Perspective projection - from camera to image coordinates
• Basic properties which are preserved:
- lines project to lines,
- collinear features remain collinear,
- tangencies,
- intersections.

16
Perspective transformation (cont)

[Figure: a set of parallel lines in the scene, projected through the camera centre onto the image plane, meets at a vanishing point.]

Each set of parallel lines meets at a different vanishing point, the vanishing point associated with that direction. Sets of parallel lines on the same plane lead to collinear vanishing points; this line is called the horizon for that plane.

17
Homogeneous coordinates

• Model points (X, Y, Z) in the ℝ³ world by (kX, kY, kZ, k), where k ≠ 0 is arbitrary, and points (x, y) in the ℝ² image domain by (cx, cy, c), where c ≠ 0 is arbitrary.
• Equivalence relation: (k₁X, k₁Y, k₁Z, k₁) is the same point as (k₂X, k₂Y, k₂Z, k₂).
• Homogeneous coordinates imply that we regard all points on a ray (cx, cy, c) as equivalent (if we only know the image projection, we do not know the depth).
• Possible to represent "points at infinity" with homogeneous coordinates (X, Y, Z, 0) - intersections of parallel lines.

18
Computing vanishing points
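As a minimal sketch of the idea (assuming NumPy; names are mine), a vanishing point is simply the projection of the point at infinity (Dx, Dy, Dz, 0) for a 3D line direction D, using the aligned-camera projection matrix from the next slide:

```python
import numpy as np

f = 480.0
P = np.array([[f, 0, 0, 0],   # projection matrix for a camera
              [0, f, 0, 0],   # aligned with the world frame
              [0, 0, 1, 0]], dtype=float)

def vanishing_point(direction):
    """Project the point at infinity (Dx, Dy, Dz, 0) of a line direction."""
    D = np.append(np.asarray(direction, dtype=float), 0.0)  # w = 0
    u = P @ D
    return u[:2] / u[2]  # normalize the third component to one

# All lines with direction (1, 0, 1) meet at this image point:
print(vanishing_point([1.0, 0.0, 1.0]))  # [480.   0.]
```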

19
Homogeneous coordinates (cont)

In homogeneous coordinates the projection equations can be written

$$\begin{pmatrix} cx \\ cy \\ c \end{pmatrix} = \begin{pmatrix} f & 0 & 0 & 0 \\ 0 & f & 0 & 0 \\ 0 & 0 & 1 & 0 \end{pmatrix} \begin{pmatrix} kX \\ kY \\ kZ \\ k \end{pmatrix} = \begin{pmatrix} fkX \\ fkY \\ kZ \end{pmatrix}$$

Image coordinates are obtained by normalizing the third component to one (divide by $c = kZ$):

$$x = \frac{cx}{c} = \frac{fkX}{kZ} = f\frac{X}{Z}, \qquad y = \frac{cy}{c} = \frac{fkY}{kZ} = f\frac{Y}{Z}$$

20
Transformations in homogeneous coordinates

• Translation
$$\begin{pmatrix} X \\ Y \\ Z \end{pmatrix} \to \begin{pmatrix} X \\ Y \\ Z \end{pmatrix} + \begin{pmatrix} \Delta X \\ \Delta Y \\ \Delta Z \end{pmatrix}$$
$$\begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix} \to \begin{pmatrix} 1 & 0 & 0 & \Delta X \\ 0 & 1 & 0 & \Delta Y \\ 0 & 0 & 1 & \Delta Z \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix}$$
• Scaling
$$\begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix} \to \begin{pmatrix} S_X & 0 & 0 & 0 \\ 0 & S_Y & 0 & 0 \\ 0 & 0 & S_Z & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix}$$

21
Transformations in homogeneous coordinates II

• Rotation around the Z axis
$$\begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix} \to \begin{pmatrix} \cos\theta & -\sin\theta & 0 & 0 \\ \sin\theta & \cos\theta & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix}$$
• Mirroring in the XY plane
$$\begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix} \to \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix}$$

22
Transformations in homogeneous coordinates III

Common case: Rigid body transformations (Euclidean)

$$\begin{pmatrix} X' \\ Y' \\ Z' \end{pmatrix} = R \begin{pmatrix} X \\ Y \\ Z \end{pmatrix} + \begin{pmatrix} \Delta X \\ \Delta Y \\ \Delta Z \end{pmatrix}$$

where $R$ is a rotation matrix ($R^{-1} = R^T$). In homogeneous coordinates this is written

$$\begin{pmatrix} X' \\ Y' \\ Z' \\ 1 \end{pmatrix} = \begin{pmatrix} & & & \Delta X \\ & R & & \Delta Y \\ & & & \Delta Z \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix}$$
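A minimal sketch of building and applying such a rigid-body transform (assuming NumPy; the function name is mine):

```python
import numpy as np

def rigid_transform(theta, t):
    """4x4 homogeneous matrix: rotation by theta around Z, then translation t."""
    c, s = np.cos(theta), np.sin(theta)
    T = np.eye(4)
    T[:3, :3] = [[c, -s, 0],
                 [s,  c, 0],
                 [0,  0, 1]]
    T[:3, 3] = t
    return T

T = rigid_transform(np.pi / 2, t=[1.0, 2.0, 3.0])
P = np.array([1.0, 0.0, 0.0, 1.0])   # homogeneous world point
print(T @ P)                          # [1. 3. 3. 1.]

# R is orthonormal, so its inverse is just its transpose (R^-1 = R^T):
print(np.allclose(np.linalg.inv(T[:3, :3]), T[:3, :3].T))  # True
```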

23
Perspective projection - Extrinsic parameters

Consider world coordinates $(X', Y', Z', 1)$ expressed in a coordinate system not aligned with the camera coordinate system:

$$\begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix} = \begin{pmatrix} & & & \Delta X \\ & R & & \Delta Y \\ & & & \Delta Z \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} X' \\ Y' \\ Z' \\ 1 \end{pmatrix} = A \begin{pmatrix} X' \\ Y' \\ Z' \\ 1 \end{pmatrix}$$

Perspective projection (a more general form later):

$$c \begin{pmatrix} x \\ y \\ 1 \end{pmatrix} = \begin{pmatrix} f & 0 & 0 & 0 \\ 0 & f & 0 & 0 \\ 0 & 0 & 1 & 0 \end{pmatrix} A \begin{pmatrix} X' \\ Y' \\ Z' \\ 1 \end{pmatrix} = PA \begin{pmatrix} X' \\ Y' \\ Z' \\ 1 \end{pmatrix} = M \begin{pmatrix} X' \\ Y' \\ Z' \\ 1 \end{pmatrix}$$
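As a sketch, the full chain M = PA can be composed and applied like this (assuming NumPy; the extrinsic values below are made up for illustration):

```python
import numpy as np

f = 480.0
P = np.array([[f, 0, 0, 0],
              [0, f, 0, 0],
              [0, 0, 1, 0]], dtype=float)

A = np.eye(4)               # extrinsics: here R = I and a pure translation
A[:3, 3] = [0.0, 0.0, 5.0]  # world origin lies 5 m in front of the camera

M = P @ A                   # full 3x4 camera matrix

Pw = np.array([1.0, -1.0, 3.0, 1.0])  # world point, homogeneous
u = M @ Pw
print(u[:2] / u[2])         # image coordinates relative to the center: [ 60. -60.]
```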

24
Intrinsic camera parameters

Due to imperfect placement of the camera chip relative to the lens system,
there is always a small relative rotation and shift of center position.
25
Intrinsic camera parameters

A more general projection matrix allows:

• Image coordinates with an offset origin
• Non-square pixels
• Skewed coordinate axes
• The five variables below are known as the camera's intrinsic parameters

$$K = \begin{pmatrix} f_u & \gamma & u_0 \\ 0 & f_v & v_0 \\ 0 & 0 & 1 \end{pmatrix}, \qquad P = \begin{pmatrix} K & \mathbf{0} \end{pmatrix} = \begin{pmatrix} f_u & \gamma & u_0 & 0 \\ 0 & f_v & v_0 & 0 \\ 0 & 0 & 1 & 0 \end{pmatrix}$$

Most important is the focal length $(f_u, f_v)$. Normally, $f_u$ and $f_v$ are assumed equal and the parameters $\gamma$, $u_0$ and $v_0$ close to zero.
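A minimal sketch of projecting through the intrinsics (assuming NumPy; the parameter values below are hypothetical):

```python
import numpy as np

# Hypothetical intrinsics: square pixels (fu = fv), no skew,
# principal point at the center of a 640x480 image.
fu, fv, gamma, u0, v0 = 480.0, 480.0, 0.0, 320.0, 240.0
K = np.array([[fu, gamma, u0],
              [ 0,    fv, v0],
              [ 0,     0,  1]])
P = np.hstack([K, np.zeros((3, 1))])   # P = (K | 0)

Xc = np.array([1.0, 1.0, 4.0, 1.0])    # point in camera coordinates, homogeneous
u = P @ Xc
print(u[:2] / u[2])                    # pixel coordinates: [440. 360.]
```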

26
Example: Perspective mapping

27
Example: Perspective mapping in stereo

28
Mosaicing

29
Exercise

Assume you have a point at (3 m, −2 m, 8 m) with respect to the camera's coordinate system. What are the image coordinates, if the image has size (w, h) = (640, 480), its origin is in the upper-left corner, and the focal length is f = 480?

30
Exercise

Assume you have a point at (3 m, −2 m, 8 m) with respect to the camera's coordinate system. What are the image coordinates, if the image has size (w, h) = (640, 480), its origin is in the upper-left corner, and the focal length is f = 480?

Answer:
$$x = f\frac{X}{Z} + \frac{w}{2} = 480 \cdot 3/8 + 640/2 = 500$$
$$y = f\frac{Y}{Z} + \frac{h}{2} = -480 \cdot 2/8 + 480/2 = 120$$
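The answer can be verified in a few lines of Python (a minimal sketch):

```python
f, w, h = 480.0, 640.0, 480.0
X, Y, Z = 3.0, -2.0, 8.0    # point in camera coordinates (meters)

x = f * X / Z + w / 2        # shift origin to the upper-left corner
y = f * Y / Z + h / 2
print(x, y)                  # 500.0 120.0
```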

31
Approximation: affine camera

32
Approximation: affine camera

• A linear approximation of perspective projection

$$\begin{pmatrix} x \\ y \\ 1 \end{pmatrix} = \begin{pmatrix} m_{11} & m_{12} & m_{13} & m_{14} \\ m_{21} & m_{22} & m_{23} & m_{24} \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix}$$

• Basic properties (see the sketch below):
– linear transformation (no need to divide at the end)
– parallel lines in 3D are mapped to parallel lines in 2D
– angles are not preserved!
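A minimal sketch of the parallel-line property (assuming NumPy; the affine matrix below is made up for illustration):

```python
import numpy as np

# A hypothetical affine camera: the last row is (0, 0, 0, 1), so no division is needed.
M = np.array([[0.5, 0.1, 0.0, 10.0],
              [0.0, 0.6, 0.1,  5.0],
              [0.0, 0.0, 0.0,  1.0]])

def affine_project(P):
    x, y, _ = M @ np.append(np.asarray(P, dtype=float), 1.0)
    return np.array([x, y])

# Two parallel 3D segments remain parallel in the image:
a = affine_project([1, 0, 4]) - affine_project([0, 0, 4])
b = affine_project([2, 1, 7]) - affine_project([1, 1, 7])
print(a[0] * b[1] - a[1] * b[0])  # 0.0 -> the 2D directions are parallel
```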

33
Planar Affine Transformation

[Figure: planar affine transformations of an image: original, flipped x-size, shifted and scaled, sheared.]


34
Summary of models
 
Projective (11 degrees of freedom):
$$M = \begin{pmatrix} m_{11} & m_{12} & m_{13} & m_{14} \\ m_{21} & m_{22} & m_{23} & m_{24} \\ m_{31} & m_{32} & m_{33} & m_{34} \end{pmatrix}$$

Affine (8 degrees of freedom):
$$M = \begin{pmatrix} m_{11} & m_{12} & m_{13} & m_{14} \\ m_{21} & m_{22} & m_{23} & m_{24} \\ 0 & 0 & 0 & 1 \end{pmatrix}$$

Scaled orthographic (6 degrees of freedom):
$$M = \begin{pmatrix} r_{11} & r_{12} & r_{13} & \Delta X \\ r_{21} & r_{22} & r_{23} & \Delta Y \\ 0 & 0 & 0 & Z_0 \end{pmatrix}$$

Orthographic (5 degrees of freedom):
$$M = \begin{pmatrix} r_{11} & r_{12} & r_{13} & \Delta X \\ r_{21} & r_{22} & r_{23} & \Delta Y \\ 0 & 0 & 0 & 1 \end{pmatrix}$$

All of these are just approximations, since they all assume a pinhole, which is supposed to be infinitesimally small.

35
Sampling and quantization

• Sample the continuous signal at a finite set of points and quantize the registered values into a finite number of levels.
• Sampling distances ∆x, ∆y and ∆t determine how rapid spatial and temporal variations can be captured.

36
Sampling and quantization

• Sampling due to limited spatial and temporal resolution.
• Quantization due to limited intensity resolution.

37
Factors that affect quality

• Quantization: Assigning, usually integer, values to pixels (sampling the amplitude of a function).
• Quantization error: Difference between the real value and the assigned one.
• Saturation: When the physical value moves outside the allocated range, it is represented by the end-of-range value.
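A minimal sketch of quantization with saturation (assuming NumPy; the 8-bit range is my choice):

```python
import numpy as np

def quantize(signal, levels=256, lo=0.0, hi=1.0):
    """Clip to [lo, hi] (saturation), then assign integer levels."""
    clipped = np.clip(signal, lo, hi)        # out-of-range -> end-of-range value
    return np.round((clipped - lo) / (hi - lo) * (levels - 1)).astype(np.uint8)

E = np.array([-0.2, 0.0, 0.5, 1.0, 1.3])     # measured values, two out of range
q = quantize(E)
print(q)                                     # [  0   0 128 255 255]
print(E - q / 255.0)                         # quantization error per sample
```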

38
Different image resolutions

39
Different number of bits per pixel

40
Image warping

Resample image f(x, y) to get a new image g(u, v), using a coordinate transformation: u = u(x, y), v = v(x, y).
Examples of transformations:

41
Image Warping

• For each grid point in the (u, v) domain, compute the corresponding (x, y) values. Note: the transformation is inverted to avoid holes in the result.
• Create g(u, v) by sampling from f(x, y), either by:
– Nearest-neighbour look-up (noisy result)
– Bilinear interpolation (blurry result)

$$f(x+s, y+t) = (1-t)\bigl((1-s)\,f(x,y) + s\,f(x+1,y)\bigr) + t\bigl((1-s)\,f(x,y+1) + s\,f(x+1,y+1)\bigr)$$
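A minimal sketch of this inverse-warping scheme with bilinear interpolation (assuming NumPy; the transformation is a simple scaling, chosen for illustration):

```python
import numpy as np

def bilinear(f, x, y):
    """Sample image f at a non-integer position (x, y)."""
    x0, y0 = int(np.floor(x)), int(np.floor(y))
    s, t = x - x0, y - y0
    return ((1 - t) * ((1 - s) * f[y0, x0]     + s * f[y0, x0 + 1]) +
                 t  * ((1 - s) * f[y0 + 1, x0] + s * f[y0 + 1, x0 + 1]))

def warp(f, scale=2.0):
    """Create g(u, v) by looking up f at the inverse-transformed position."""
    h, w = f.shape
    g = np.zeros((int(h * scale), int(w * scale)))
    for v in range(g.shape[0]):
        for u in range(g.shape[1]):
            x, y = u / scale, v / scale      # inverted transform: no holes in g
            if x < w - 1 and y < h - 1:      # stay inside f's support
                g[v, u] = bilinear(f, x, y)
    return g

f = np.arange(9, dtype=float).reshape(3, 3)
print(warp(f))
```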
42
Nearest Neighbor vs. Bilinear Interpolation

43
Summary of good questions

• What parameters affect the quality in the acquisition process?
• What is a pinhole camera model?
• What is the difference between intrinsic and extrinsic camera parameters?
• How does a 3D point get projected to a pixel with a perspective projection?
• What are homogeneous coordinates and what are they good for?
• What is a vanishing point and how do you find it?
• What is an affine camera model?
• What is sampling and quantization?

44
Readings

• Gonzalez and Woods: Chapter 2
• Szeliski: Chapters 2.1 and 2.3.1

45
