UNITII of Computer Vision
UNITII of Computer Vision
The science of
? facilitating human-like
capability of seeing and
understanding the
environment to a
machine or computer.
1
http://www.bmva.org/visionoverview
Computing structure from
binocular vision
Courtesy: www.cs.gmu.edu/~kosecka/lect4.ppt
4
3
2
1
1 2 3 4
P p : Projection
Center of projection
O
p
Projection plane
Projective Geometry
Real Space and Projective
Space (2D)
p(kx,ky,k)
p(x,y) p(x,y,1)
y
x
o
y
z
x
o
Homogeneous
p(x,y) p(kx,ky,k) Coordinate system
Homogeneous Representation
p(x,y) l(a/c,b/c,1)
y
x
o
A point in P 2 A line in P2 b
c
a
o
Point containment in P2
Points and lines in P2
l:ax+by+c=0 l1 l2
x1(x1,y1) p(x,y)
y y
x2(x2,y2)
x x
o o
Duality principle:
To any theorem of 2-dimensional projective geometry
there corresponds a dual theorem, which may be
derived by interchanging the role of points and lines
in the original theorem.
Points and lines in a plane
l(ka,kb,kc)
ax+by+c=0
p(x,y) l(a/c,b/c,1)
b
z c
a
o
x Tl 0 lT x 0
x l l' l x x'
In projective geometry, a scaling factor refers to a value that
scales or resizes a point, line, or other geometric object while
preserving its projective properties.
l1:ax+by+c1=0
y X(x,y,1)
x
o l2:ax+by+c2=0
(b, -a, 0)
y
c
x
o
Ideal Point
In projective geometry, points at infinity cannot be
assigned Cartesian coordinates like regular points in
Euclidean geometry. Instead, ideal points are
introduced to handle these cases.
Intersection point
(b, -a, 0)
A direction !
Ideal points
Line at infinity
l(a/c,b/c,1)
=0
This concept helps maintain the property that any two lines
in projective space intersect at exactly one point.
Projection of parallel lines
from any arbitrary plane
Canonical projection plane
(CPP)
Vanishing Point
(b, -a, 0)
y
z
x
o
Ideal Plane
Vanishing points
Vanishing Points
corresponding to parallel
lines of a plane lie on a line,
called vanishing line.
Vanishing
Line
Vanishing Points
A real life example
Vanishing points
A journey toward infinity ….
xTl=0, and lTx=0
x=l x l’, and l=x x x’
Summary
A point in a 2-D projective space represents a ray
passing through origin of an implicit 3D space.
Requires additional dimension for representation.
O
p
Pinhole camera
Z Principal axis
Y
World coordinate
Focal
P (X,Y,Z)
length f o
p(x,y) Image coordinate
C
Image plane
Camera Center
Principal plane X
(XY-plane)
Pinhole Camera:
Mapping from P3P2
O Principal
point
X
Projection Matrix under the
offset
Y
P
Z p
C
R,t
Xc
O X
CCD Camera
A charge-coupled device (CCD) is a light-
sensitive integrated circuit that captures
images by converting photons to electrons.
No. of pixels
per unit length
General Projective Camera
3
3
11 d.o.f
5
Extrinsic parameters: R, t
Intrinsic parameters: K
Properties of projective
camera matrix P=[M | p4]
Rank of P: 3; # of extrinsic params: 6
Size: 3x4; # of intrinsic params: 5
d.o.f.=11;
For n correspondences
Principal point
Y
P
Z p
C
R,t Principal plane
O X Xc
Properties of projective camera matrix
P=[M | p4]
Properties of projective camera matrix
P=[M | p4]
Properties of projective
camera matrix P=[M | p4]
The projective camera matrix is a 3x4 matrix that describes the mapping of a
pinhole camera from 3D points in the world to 2D points in an image. It has
the following properties:
The first three columns of the matrix define the camera's intrinsic parameters,
which are the focal length, the principal point, and the skew.
The last column of the matrix defines the camera's extrinsic parameters,
which are the camera's position and orientation in the world.
The image of a point at infinity is a line that passes through the camera
center.
Cameras with Lenses: Cameras with lenses use optical lenses to focus
light from the scene onto a photosensitive medium (like film or a digital
sensor). Lenses manipulate the way light rays converge to create a
sharp, properly oriented image.
Image Quality:
Pinhole Camera: Pinhole cameras produce images with
soft focus and uniform sharpness across the image due to
the nature of light diffraction through the small aperture.
However, they generally lack high detail and sharpness.
Cameras with Lenses: Cameras with lenses are widely used for
professional photography, cinematography, scientific imaging,
and everyday photography due to their versatility and ability to
produce high-quality images.
The concept of "cameras at infinity" is often used in computer graphics
and computer vision to simplify certain calculations and modeling tasks.
Use Cases: Cameras at infinity are particularly useful for scenes where
depth is not critical, and you want to simplify calculations. They are
often used in architectural rendering, certain types of technical
drawings, and other scenarios where maintaining accurate perspective
isn't essential
It's important to note that "cameras at infinity" is a mathematical
construct and not a physical reality.
For n points
Summary
o Pinhole camera model provides the projection
matrix which maps a 3D point to an image point.
o Projection matrix:
o 3x4
o Dof: 11
o 5 intrinsic parameters and 6 extrinsic parameters.
o Minimum 6 point correspondences required for
estimation
o Affine projection matrix
o Last row [0 0 0 1]T
o Dof:8
o Minimum 4 point correspondences required to estimate.
Summary (contd.)
o Geometry encoded in a projection matrix
o P=[ M | p4] or P=[p1 p2 p3 p4] or P=[r1T ; r2T ; r3T]
o Camera Center: -M-1p4
o
For affine projection matrix: Right zero of M (A direction).
o Vanishing points
o X-axis: p1
o Y-axis: p2
o Z-axis: p3
o Image of world origin: p4
o Special planes passing through the camera center
o Principal plane: r3TX=0
o Direction of optical axis: <r31, r32, r33>
o Principal point: M [r31 r32 r33]T
o Plane formed with x-axis of image coordinate system: r1TX=0
Summary (contd.)
o Geometric derivatives from Projection Matrix:
P= [M|p4]
o Projection ray formed at image point x.
o Direction ratio: M-1x
o A point on the ray:
o Camera center (-M-1p4)