ShapeThoughts-FFL-Final
ShapeThoughts-FFL-Final
net/publication/283500791
Thoughts on Shape
CITATIONS READS
2 188
1 author:
SEE PROFILE
All content following this page was uploaded by Frederic Fol Leymarie on 05 November 2015.
Thoughts on shape
. Introduction
What is shape?
Shape is the information, the structure induced by the presence of object el-
ements in a space. An object manifests its presence by bits of data we refer to as
“outline,” to be defined later. Such an outline can be obtained, for example, by a
sampling process, such as when a laser camera scans the surface of a 3D object.
Space itself is defined by its associated measuring sticks, or lack thereof, and the
number of dimensions needed to enumerate its extent.
Shape is the “glue” between object and space, revealed by an interrogation of
outlines. We interrogate an object by probing its outline. This inquisitive process
we call a “transform” as it not only is a process which attempts to identify and
characterise the object’s structure, but in doing so modifies, perturbs, the object
and space itself.
Complexity in forms emerges by the grouping, assembly, concatenation of ob-
ject elements. This complexity can be understood by defining a language for shape.
Such a language could include atomic structures, the alphabet or a code, which
when put together, via a grammar, create a word, and assembly of words create
sentences. The alphabet and grammar can then be thought of as the essence of
what the organisation of shape is.
In the following, I first define a few concepts to initiate our discussion by giv-
ing meanings to “space,” “object,” “outline,” “shape,” “form,” “structural field,”
“shape probing,” “representation.” I proceed to explore some of the most pop-
ular transforms which can reveal the nature of shape and then specify classical
geometries: the science of measure on space and its objects which leaves invari-
ant certain properties under specific transforms. I then depart from the tradition
and propose a dichotomy between two main classes of transform sequences, I call
“horizontal” and “vertical,” which do not require invariance to specify and classify
useful probes for shapes. While horizontal transform sequences refer to transforms
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.2 (351-411)
which operate along an outline, vertical ones act away from it. Along the way I pro-
vide a number of examples from the field of computer vision illustrating various
transforms and their belonging to one class or the other. Finally I consider a few
examples of “shape grammars” which permit to construct and analyse shape at
varying degrees of complexity.
. Preliminaries
Definition 2.1 (Space) Entity having extent spanning a number of dimensions; i.e.,
such that we can refer to any constitutive locus – positions – of that entity. Space can be
homogeneous, i.e., such that its dimension set is everywhere similar, or heterogeneous,
i.e., such that its dimension set may vary locally, regionally. For example, consider
the “classical” Euclidean space in 3D which is homogeneous, isotropic and of infinite
extent, versus Einstein’s relativistic space which is heterogeneous as a function of mass
(of objects).
Definition 2.2 (Object) A region of a given space, such as the Euclidean 3D space,
E3 , having some distinguishing characteristic. For example, a (homogeneous) region,
x of E3 , whose density, δ, is above some fiducial value, φ, i.e.: δ(x) > φ; refer to
the concept of solid shape and tolerances for shape by Koenderink (Koenderink
1990: Ch. 2).
Definition 2.3 (Outline) (i) Samples of an object delimiting its outer limits. For ex-
ample, consider the lines (curves) marking the edges of a pattern in an image. Also,
(ii) a sketch in which object features (not necessarily boundaries) are marked (without
shading; e.g., a line-drawing of a figure).
Definition 2.4 (Shape (in isolation)) Description of the outline of an object.
Such a “description” might simply consist in a quantitative study of the outline
itself; e.g., the ordered enumeration of the contour points.
Alternatively, a “description” of some of the distinctive features extracted from the
outline can define “shape;” e.g., the 3 corners of a triangle.
Thus, this description will depend on the method used to probe the object’s
outline. As such, one may say that “shape is operationally defined” (Koenderink
1990: 15).
Definition 2.5 (Form) Qualitative description of the shape of an object; e.g., the
“round form” of the sphere or of an ellipsoid.
Thus, “space” is a place-holder for content, while the notion of “object” corre-
sponds to (i) the physical body (assumed) present in the scene, or (ii) the region
of space under scrutiny. The notion of “shape” corresponds to the description of
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.3 (411-457)
the “available” outline of this object.1 For example, it may be derived from some
projection of the object in the scene onto the image (or retinal) plane. Finally the
notion of “form” is a more abstract concept. Many shapes may have the same form.
Shape is the “content” of form; i.e., it gives a more precise meaning or identity to
a form. I think of form as a kind of “meta-shape,” a concept which is useful for
object recognition.
Shape can also be defined within a context or environment. For example, shape
can be extended to embody the influence of “gravity,” i.e., the orientation of an
object with respect to an horizon or reference frame, and its relative position with
respect to that horizon. Consider for example a “diamond” shape which is distin-
guished from a square or parallelogram only by its orientation with respect to an
horizon or ground plane. This is a famous “phenomenon” first identified by Ernst
Mach in the late 19th century and later “refined” by Goldmeier in the 1930’s (Ley-
ton 2001: Fig. 8.3). Refer also to the related notions of “visual weight” and “visual
direction” from the Gestalt theory of perception: a measure of perceptual forces
exercised by an object on and via its visual context (Arnheim 1974).
The study of the shape of an object in isolation or within a context, a scene, can be
unified via the concept of a “structural field.”
Concept 2.1 (Shape as a structural field) Shape is the structure of the field existing
“around” an object. This field may consists of geometric entities – such as curvatures,
singularities (of some appropriate mappings), gauge figures (e.g., Dupin’s indica-
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.4 (457-507)
trix) – which may exist in association with each sample of the object being scrutinised.
In order to identify or measure such entities, we will have to “probe” the field.
Concept 2.2 (Probing shape and the uncertainty principle) Since “shape” will also
depend on the probing method, and not simply on the object’s structure, we have here
an example of a variation on the “uncertainty principle” of Heisenberg in action. We
will be as precise about our identification and measure of the structural field of an
object as the selected probing method permits to.
We will have to seek for probing methods which, although they may modify (and
not just identify or measure) the object’s field, preserve as much as possible this
structure, or emphasise certain of its features. Also, we will have to identify which
structures are most relevant for shape analysis. This selection can only be per-
formed by considering jointly the effects of probing the field. Note also that, if we
want to take into account the context for an object, say its orientation relative to
a ground plane, the probing method will need to be defined to also consider this
influence. Various probing methods will emphasise various features in the form of
a “representation.”
A “machine” is a pair of functions acting on a given input set (of parameters, data)
(Leyton 2001: §1.15, 30): such that (i) one function specifies the states of the ma-
chine being transformed, and (ii) the other function specifies a new output set
produced. The concept of a “machine” emphasises that “shape” can be cast as a
dynamic phenomenon.3
In the following section, important examples of probes used to reveal the
shape of an object are first considered. Such probes are called “shape transforms”
or simply “transforms.” After some preliminary definitions, a number of impor-
tant (practical) properties for shape transforms are considered. This preliminary
study will set the stage to consider sequences of shape transforms and construct a
bi-partite classification (in §4) inspired by applications in computer vision used to
reveal the shape of objects.
Definition 3.1 (Domain and range) Let us call the space (or region of space) con-
taining the outline or representation being probed, the domain, while the space (re-
gion) containing the resulting outline or representation is called range. Let us denote
the former by D and the latter by R.
Remark 3.1 (Spaces for domain and range) The domain and range of a shape
transform may lie in the same space or not. E.g., in the case of a “projection,” the
range lies in a subspace with respect to the original space of the domain.
Note, however, that in many practical situations, both the domain and range will
lie in the same space, i.e., the transform is an instance of a general deformation:
1–to–1 mappings of a space onto itself.
Question 3.1 (What is the purpose of a transform?) Typically, one uses the concept
of a transform to simplify, or break down, or map to a more explicit representation,
the problem being attacked. A simplification may have different meanings depending
what are the goals to be achieved. A transform may make more explicit some prop-
erty(ies) of the shape (or the domain); e.g., it could amplify the presence of linearities
of a planar object such as a square; a classical example here is the Hough transform.4
Alternatively, a transform may look more like a filter which removes some undesir-
able properties; e.g., smoothing a surface with convolution kernels, this to attenuate
noise or to interpolate data. A classical example here is the Fourier transform which
first maps a signal or image from the spatial domain to the frequency range where
certain spatial frequencies can be selected; e.g., in the context of low, high or band pass
filtering.
In other words, the composition of the transform F with itself yields F again
(and forever):
F◦F ≡F,
or, in terms of the corresponding operator, ρ,
ρ2 = ρ .
Remark 3.5 (Fixed points of idempotent transforms) Note that, all the points in
the range of an idempotent transform, i.e. F(x) = y ∈ R, are fixed points.
Jean Lagarde put forward the following informal argument in favour of idempo-
tency (Lagarde 1990: Ch. 2):
If the goal of the operator is to recover a valid interpretation from initial data,
then applying the process one more time on the supposedly valid interpretation
should not change it.
images, D ≡ I , and has for its range a binary feature map corresponding to the
edges and lines originally in I(x, y) ⊂ I . Note that R ≡ F(I) is also a valid im-
age, i.e., F(I) ⊂ I . However, applying F once more should not change the result:
(F ◦ F)(I) = F(I).9
Convolution kernels used in image processing often are selected because they share
this property. E.g., smoothing and gradient filters based on the Gaussian kernel
are more efficiently performed by using the separability of the Gaussian function
(Burt 1981, 1983).
A linear transform permits one to use the principle of superposition, i.e., arbitrary
linear combinations of data can be transformed separately or altogether.
Note that a linear transform always possess a matrix representation, A, which
can be taken as the operator. In particular, the composition of two linear trans-
forms corresponds to matrix multiplication.
Affine transform. A linear transform without any preferred origin. Any affine
transformation can be written as a (non–singular) linear transform followed by
a translation.
An affine transform does not preserve distances or angles, but it does preserve
parallelism and ratios of lengths (i.e., ordering of points).
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.10 (793-860)
Lie group transforms and Hamiltonian flows. Lie group transforms are linear
differential ones which obey a particular constraint, i.e., the Jacobi identity. Essen-
tially, the latter states that flows (associated with tangent vector fields) commute,
i.e., loosely speaking, if one travels along two flows, L1 followed by L2 , on a man-
ifold, M, one ends at the same point than if one had travelled first along L2 and
then along L1 . More on this topic and its relation to vision problems can be found
in the works of William C. Hoffman (Hoffman 1966, 1978, 1984) and his followers
(Ferraro & Caelli 1988; Papathomas & Julesz 1989; Hansen 1992).
Furthermore, certain classes of global deformations (of a continuum) can be
expressed in terms of Lie derivatives, i.e., certain linear operators on vector fields
(associated to flows) which indicate how differential forms change along integral
curves on these vector fields. Lie transforms are instances of “horizontal transform
sequences,” as we shall see later (§4.1).
The forces fc and fe acting on the boundary of O, i.e., on the surface ∂O, are
represented through the use of a spatial tensor field, T, called stress (or Cauchy
stress). Note that, forces like “pressure,” “tension” or “shear” are all special
instances of stresses.
In computer vision a simple model for deformable curves and surfaces un-
der tension, called “snakes,” has been used to represent outlines and track them
in dynamic imagery such as video sequences (Kass et al. 1988; Terzopoulos et
al. 1987; Leymarie 1990; Leymarie & Levine 1993), as well as to simulate wave
propagation constrained by a potential field (Leymarie & Levine 1992b).
– Conservation laws Various conservation laws are defined within the realm of
continuous mechanics. These are general laws which are usually helpful in
imposing constraints on the deformation F. Examples of such laws – which are
all, really, different instances of the same fundamental conservation principle –
are:
– Local and global conservation of mass.
– Conservation of energy (i.e., of the sum of potential and kinetic energies).
– Conservation of momentum, linear and angular, also called momentum bal-
ance laws.
– Conservation of power (i.e., the power expanded equals the sum of stress
power and rate of change of kinetic energy).
Lemma 3.1 (Huygens’ principle) Let φq0 (t) be the wavefront of the point after time
t. For every point q of this front, consider the wavefront after time t1 : φq (t1 ). Then,
the wavefront of the point q0 after time t + t1 : φq0 (t + t1 ), will be the envelope16 of the
fronts φq (t1 ) such that q ∈ φq0 (t).
The wavefront is thus the boundary of the set of all points q to which “infor-
mation” from a given source can travel in time less than or equal to t (Arnold
1989: 249).
Huygens’ metaphor can be transfered to the discrete domain without diffi-
culty, e.g. by considering the Minkowski sum of small discrete spherical sets with
a discrete front at time t. This is precisely the realm of 3D Mathematical Mor-
phology, where the spherical sets are called structural elements (Serra 1982). The
problem is one of inefficiency: Minkowski sums with isotropic structural ele-
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.14 (1023-1080)
ments – required to ensure Euclidean results – have too much overlap. Consider
then the alternative viewpoint of Pierre de Fermat’s principle (circa 1660), which
states that:
Lemma 3.2 (Fermat’s principle) Information travels along rays from a point q0 to a
point q in the shortest possible time.
Rays are such that their gradient vector is normal to the wavefront, i.e., the direc-
tion of the ray p = ∇S, where S is the (optical) “path length” from a source q0 to
q, a function such that Sq0 (q) = t. The (level) set of all such ray paths thus coin-
cide with the wavefront φq0 (t). Rays and wavefronts are related precisely through
the refractive index ν, which provides the (local) metric structure of the medium:
dφ · ∇S = ν (Stavroudis 1972).17
Eikonal equation of geometric optics. Consider the special case where the index is
taken to be isotropic and homogeneous. Then, the direction of motion of the rays
and the wavefront coincide (Arnold 1989); thus
dφ · ∇S = dφ ∇S = 1
Note that, considering a(light)
ray travelling at constant speed c, since the index is
fixed, we also have that p = ∇S = c, leading to the classical “eikonal” equation
for the wavefront (Stavroudis 1972):
2 2 2
2
dφ = ∂φ + ∂φ + ∂φ = 1/c2 (1)
∂x ∂y ∂z
Without loss of generality, we can take the constant c = 1, that is, we propagate
wavefronts along straight rays with unit (relative) speed. Thus, the normal field to
any wavefront is identically ∇S and each wavefront moves along this field by a unit
distance step for each time step. We can therefore identify time with distance for an
accurate simulation of wavefront propagation. Given an initial set of wavefronts or
set of generators Q = {q0 : φq0 = 0} we can simulate wave propagation, according
to the eikonal equation (1), by computing point-to-set (Euclidean) distances:
L(q, Q) = min q – r (2)
r∈Q
Note that, we may consider the set of all wavefronts, Φ, from an initial time (dis-
tance) t = 0 to a final time of propagation t = T ≡ Lmax , where Lmax is the
maximal distance travelled by the wavefront. The set Φ then fills-in a portion of
space according to the distance function (2).
We call this volume of space filled with distance values by wave propagation,
“Distance Hypersurface,” following Goodman who introduced the term in 2D
(Goodman 1964).18 This is equivalent to an embedding of the wavefronts in a
4D space, where the fourth axis is taken for time or distance. The Distance Hyper-
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.15 (1080-1141)
surface is the graph of τ = Φ(x, y, z), τ being taken along the time (or distance)
axis.
Definition 3.4 (Regular distance hypersurface) A hypersurface, τ = Φ(x, y, z), is
called a Regular Distance Hypersurface over a region of 3D space E3 , if at each
point of it is a “local distance hypersurface.” An hypersurface is a local distance
hypersurface at a point q ∈ E3 if there is:
1. a spherical neighbourhood ℵ of q,
2. a smooth surface which divides ℵ into two sub-regions, and
3. a constant l,
such that L = Φ(x, y, z) – l is the directed (or signed) distance from for each point
in ℵ.
This graph takes single values almost everywhere, according to equation (2), ex-
cept at those points where wavefronts meet. This is where regularity is lost and
singularities in the distance mapping occur. The latter define the symmetry points
constitutive of a kind of 3D skeleton or symmetry set (§4.2). If we think in terms
of the physical metaphor of light rays, this is where such rays meet to define the
“burning points” or caustics of Archimedes.19
Another way to interpret the above concept is via the Implicit Function The-
orem. If the gradient ∇Φ = 0 at some point q, then it implies the regularity of
Φ in a neighbourhood of q. This only requires that Φ is one time continuously
differentiable on the respective open set. Equivalently, having a Lipschitz contin-
uous gradient ensures that the “symmetry points” stay away from the distance
hypersurface for some distance (Wolter 1985, 1992).20
The process of computing such a distance hyper-surface for a discrete dataset,
is commonly called “distance transform” or “map” in image processing. When us-
ing the Euclidean metric, such as in equation (2) above, it is called “Euclidean Dis-
tance Transform” (EDT) (Danielsson 1980; Leymarie & Levine 1992a; Leymarie &
Kimia 2000).
the early works of Blum (Blum 1967, 1973), Montanari (Montanari 1969), and
more recent works of Kimia et al. (Kimia 1991; Siddiqi & Kimia 1996; Siddiqi et
al. 1999a, b; Kimia 2003). Shock transforms are instances of “vertical transform
sequences,” as we shall see later (§4.2).
Having described a number of commonly used shape transforms, I will now
consider a classical way of classifying them via “geometries.”
I follow here the classical definition of a geometry as the study of those properties of
subsets of a region, D, which are invariant under the application of some specified
transform. For example, one can consider affine, Euclidean, projective, or symplectic
geometries for shape.
Norman and Todd have argued that (Norman & Todd 1992: 106):
There is a growing amount of evidence to suggest that human vision involves
a similar type of stratification in which the most perceptually salient aspects of
an object’s structure are those that remain invariant over the largest number of
possible transforms.
From Euclidean geometry, we may “descend” in the hierarchy to reach affine ge-
ometry which can be portrayed as Euclidean geometry without a metric or without
a “machinery to measure distances and angles” (Koenderink 1990: 58).
geometry: they do not define a class of object being invariant under an affine
transformation. In fact the concepts of squares and circles make no sense in affine
geometry; but parallelograms and ellipses do.
Paraphrasing Norman and Todd (Norman & Todd 1992: 96):
In stereopsis, the mapping between the two views has been modelled as an affine
transform, to recover the gradient of horizontal disparity (Jones & Malik 1992).
Having looked at different transforms for shape in isolation, including the “clas-
sical” geometry stratification based on invariance as proposed by Klein, I now
propose a more general bi-partite classification for sequences of transforms which
are applied to the outline and various shape representations of an object. This clas-
sification which does away with the invariance constraint, is inspired in particular
by the early filters developed in Mathematical Morphology, based on idempotence,
by the works of Zucker et al. on the application of differential transforms for the
recovery of shape, and by Michael Leyton’s work on the association of a history for
shape, based on those properties which vary under deformations.
A transform sequence involves at least a pair of transforms, where, typically,
the first one acts directly on the outline of an object, resulting in a first representa-
tion, while the second one then acts on this representation to make more explicit
special selected types of features.
I will distinguish such concatenations of transforms as to whether they operate
(i) along the outline of a shape or tangentially, or (ii) away from it or perpendic-
ularly. I call the two classes “horizontal” and “vertical” transform sequences. This
represents a fundamental dichotomy in the analysis of shape transforms. While
the horizontal class is inspired by a more classical approach to geometry based on
calculus and the study of those properties which remain intrinsic to the object, i.e.,
such that the analysis can be carried on (or along) the outline itself, independently
of the surrounding (or embedding) space, the vertical class is inspired by more
modern thinking in mathematics, such as singularity theory, and is based on an
analysis of the outline under transformation together with its surrounding space.
In the following I provide a number of examples from the field of computer
vision which illustrate how the two classes are used to solve similar problems.
Example 4.1 (1-D manifold on the plane: Co–circularity mapping and curvature)
Let a tangent field on a curve segment, α : I ⊂ R → R2 be defined; where this curve
segment represents, e.g., part of the outline (boundary) of a planar object (or of its
projection in the image plane).
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.21 (1433-1495)
Tangents are co-circular if they lie on a common circular arc of radius r. So one
can take a transform from the tangent space to the co–circularity space for pairs of
tangents.24
Furthermore, co-circularity is directly related to the concept of curvature as one
takes tangents being closer and closer to each other. In the limit, one can define a
real map on the interval, I, used to parameterised the curve to define the concept of
curvature, k, a scalar field:
∆θ 1
k : I → R, k = lim = ,
∆s→0 ∆s r
where s ∈ I is the parameter and θ is the angle between the tangents (or their corre-
sponding normals). Thus, curvature measures deviation from flatness, or collinear-
ity, for successive tangents.
The corresponding (limiting) circle is called the osculating circle (Pogorelov
1958: 57). It has second order contact with the curve α, i.e., both the curve and the cir-
cle agree (locally) in position, tangency and curvature (or up to second derivatives). If
s is the arc–length parameter, then k is equal to the norm (length) of the acceleration
vector α (Pogorelov 1958: 50):
k(s) = α (s) .
In conclusion, if a circle exists such that a pair of tangents (to α) are also tangent
to this circle, then they are said to be co–circular. The (osculating) circle is thus used to
“connect” or relate the pair of tangents in a “horizontal manner,” i.e., along the curve
through a simple interpolating shape, a circle, or through a derived “co–circularity
support,” i.e., a “directed” neighbourhood or thick trace (Parent & Zucker 1989).
Example 4.2 (2-D manifold: Parallel transport) Let V be a tangent vector field over
a region B of the surface M. How can we relate (or move) a given tangent vector v at
point p ∈ M with another tangent vector w at point q ∈ M? One possibility is through
the notion of (Levi–Civita) parallel transport where a tangent vector, e.g., v, is moved
along a curve α ⊂ M such that it stays parallel with respect to the tangent space TM .
Thus, v may rotate, but only with TM (this depending on the intrinsic or Gaussian
curvature of M).
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.22 (1495-1543)
In computer vision, such a notion of parallel transport has been used by Sander
and Zucker in the context of the recovery of the trace of surface points (Sander &
Zucker 1990).25 But, note that a surface can curve in more than one way at any given
point. In particular, for smooth surfaces one can show that at any point of the man-
ifold (normal) curvatures range from a minimum, kmin , to a maximum, kmax , called
principal curvatures. Thus, the object used for relating tangent vectors on M cannot
simply be an osculating sphere (the extension to 3D of the 2D osculating circle above),
but rather needs to be a more flexible object. In the context of differential geometry, the
“canonical” object used is the osculating paraboloid, S, a parabolic quadric surface
having second order contact with the manifold M (Pogorelov 1958: 87).
Remark 4.1 (Inadequacy of parallel transport) Sander notes that parallel trans-
port is not the “ideal” way of relating tangent vectors of a surface for the following
reasons (Sander 1988: 38–39):
– Which curve/path to choose on the paraboloid? There is potentially an infinity
of them and, although parallel transport is well-defined for a given curve (i.e.,
it is unique), it gives, in general (when K = 0), different results for different
curves. I.e., the holonomy angle varies with the path, and this is a function of the
Gaussian (intrinsic) curvature of the paraboloid.
– How to transport “directions” (associated with a given initial tangent vector, v,
and its corresponding normal curvature) so as to preserve them? Clearly, arbitrary
paths will not preserve such directions. An interesting choice might be lines of
(principal) curvature, but, then there is the problem of “navigating” through sin-
gularities of the tangent field, i.e., through umbilics (points where all curvatures
are equal: kmin = kmax ).
Example 4.3 (2-D manifold: Compatibilities via frame bundles) The local infor-
mation available in a cross section, σ, of the frame bundle via the first and second
fundamental forms, IS and II S ,27 of the interpolating paraboloid, S, enables the direct
generation of the frame ξα (q) corresponding to moving ξα from p to q.28 This moving
frame will tell “what the local information at” q “looks like according to the informa-
tion at” p (Sander 1988: 41). Thus, it gives “support” to the information at q, “under
the assumption of an underlying smooth surface” (i.e., S).
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.23 (1543-1596)
This is again an horizontal transform sequence where frames are “moved” accord-
ing to an interpolating surface which best approximates (through some optimisa-
tion measure) the actual surface M we wish to recover (the trace of).
Example 4.4 (2-D manifold: “Classical” Shape from Shading (SfS)) Shape from
shading consists in “inferring geometric structure of a scene from measurements of
its photometric structure” (Langer & Zucker 1994).
The “classical” model studied in the context of this problem, is based on the “Im-
age Irradiance Equation” (IIE) which states that luminance, or image intensity I, is
proportional to some reflectance function or map, R : I(x) = R(N(x)), where R is
assumed dependent on the surface normal, N, i.e., dependent on the surface orienta-
tion. Here, the surface trace is the geometric structure one wishes to recover from the
image data, i.e., it is an “inverse optics problem.” A special case (the “classical” one) is
when the surface is assumed Lambertian,29 i.e., the light source is assumed punctual
and remote (far), and the albedo (the amount of light reflected, a material property),
ρ, is assumed constant over the surface. Then, the IIE can be expressed as (Langer &
Zucker 1994):
I(x) = ρ Is L · N(x) + ρ Ia ,
where L and Is are the direction and intensity of the distant point source, respectively,
and Ia is the intensity of the ambient illumination. The (constant) term ρIa is usually
ignored in the calculations. The term ρIs L is assumed known (or is approximated
empirically). Then, assuming one faces a Monge patch, i.e., the surface topography
can be represented as the graph of a function, z = f (x, y), which is always possible
locally for smooth surfaces, the (unit) normal vector N can be expressed as a function
of the gradient of the surface, i.e., in terms of the slopes, p = ∂z/∂x and q = ∂z/∂y, of
the surface in the x and y directions, respectively:
N · N = vx ∧ vy = (1, 0, p) ∧ (0, 1, q) = (–p, –q, 1) ,
where the vectors vx and vy are tangent vectors to the surface in the x and y direc-
tions. Similarly, for a distant source, we can represent its direction L via a unit vector
(–ps , –qs, 1) expressed in terms of the slopes ps , qs , of a (wave) surface perpendicu-
lar to the light ray. Thus the IIE, in the Lambertian–single–distant–light–source case,
reduces to (Horn 1986: 219):
1 + ps · p + qs · q
I(x, y) = R(p, q) = ,
1 + p2 + q2
which has been normalised by a factor ρI0 . This equation can be seen as a nonlinear
first order PDE.
Different ways of solving numerically the reduced IIE equation above have been
used in the computer vision literature. All of these are instances of horizontal trans-
form sequences where one, starting from an initial contour, usually taken as an
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.24 (1596-1648)
extrema of I, grows solutions along the surface by (numerically) integrating the IIE.
Essentially, one takes small steps from a known height/position30 of the surface
and recovers surface height at the new position.31 To “validate” the calculations
one can integrate solutions on the calculated surface, this along closed paths from
and to the initial (known) starting position; thus, the difference in height should
vanish.32
Example 4.5 (Motion field & optical flow) One may consider a motion (vector)
field as the “object” of interest. The layout of the field is transformed, e.g., under as-
sumptions of smoothness, thus retrieving a smooth vector field possibly corresponding
to some translation and/or rotation of a real physical object in the scene.33
A common hypothesis made in computer vision is that the motion field, which as-
signs a velocity vector to each point in an image, corresponds to the optical flow which
is the apparent motion of the brightness pattern.34 Note that this correspondence is an
idealisation; in many (practical) situations, the motion field and the optical flow are
not equivalent. Nevertheless, optical flow is believed to be a potentially important
source of information towards the retrieval of the true motion field or of some of its
features.
Under the (very restrictive) assumption that brightness varies smoothly one can
derive the Optical Flow Constraint Equation (OFCE) (Horn 1986: 282), which is
nothing but the 1–jet for I:35
Ix u + I y v + It = 0 ,
where u = dx/dt and v = dy/dt are the optical flow components, and the (partial)
derivatives Ix , Iy and It are estimated from the image. We may re-write this equation
as follows:
∂I
+ ∇I · (u, v) = 0 .
∂t
This can be shown to correspond to a conservation law for an incompressible
medium, where the image intensity, I, corresponds to the density of the medium (Fitz-
patrick 1988). In this context, the assumption of incompressibility for a density image
is analogous to the assumption that image brightness of a surface point is independent
of the motion of that point (a very restrictive assumption indeed).
Note that the OFCE is a local constraint on brightness velocities. In order to re-
cover a “consistent” (global) optical flow, one has to filter or transform somehow
the initial flow field given by the values (u, v) at each image point. E.g., one may
convolve the field with an averaging kernel. Or one may update information on the
basis of the available local geometric structure (i.e., tangent direction, curvature)
within a relaxation network. The latter is reminiscent of ex. 4.3, where compatibil-
ities are (here) in terms of tangent bundles, i.e., the problem can be cast as one of
selecting consistent sections σ through the tangent bundle of the “motion surface”
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.25 (1648-1706)
(i.e., motion field). Thus the flow field is integrated along the underlying surface
(or “base”), and we can picture it as an horizontal transform sequence.36
I covered in the above a number of illustrative examples of horizontal trans-
form sequences from the field of computer vision, from contour tracing to shape
from shading and shape from motion. In the next section, I consider the “comple-
mentary” vertical ones, where the transforms operate in the space away from the
outline of an object, rather than along it.
region, VR) is made from the loci of the R2 plane closest to one single outline
point. A related concept is that of the Cut Locus, the closure of the set containing
all points that have at least two shortest segments to the outline (Wolter 1992).
Ridges of a distance surface: Consider the surface generated by applying a distance
transform to the object’s form. Map distance as a height function. Then consider
the loci of points on ridges of this topographical map (Kotelly 1963; Blum 1973;
Leymarie & Levine 1992b).
Mid-chord symmetry and process inferring symmetry axis (PISA): An alternative
to the centre of contact circles, as is used to trace the MA, is to consider instead (i)
the mid-chord point defined between the two contact points on the outline (Blum
1973; Brady & Asada 1984), or (ii) the mid-point of the shortest subtented arc of
the circle (Leyton 1987).
Shock graph: Consider the radius function associated to the contact circles. It defines
a (1D) vector field along the trace of the MA. The singularities of this function
when linked by the intermediary MA segments define a more compact graph
representation of the MA. These singularities, the sources, relays and sinks of
the radius function, correspond to endpoints, junctions and saddle points of the
MA (Kimia 2003).
Now let us consider the boundary of the object as either a piecewise 1D manifold
or a discrete set of samples. Then the MSG is a transform which maps this bound-
ary onto the medial symmetry set in one of the forms given above. This transform
acts along normal rays to the boundary of the shape. That is, it acts along the nor-
mal bundle. Points where normal rays meet correspond to symmetry points. This
is an instance of a vertical transform sequence.
Concept 4.1 (Wolter’s topological shape theorem of the MA) Franz-Erich Wol-
ter has shown, in his early works, how the MA of a solid O (i.e., such that the outline
defines an inside and an outside for the object under scrutiny) “contains the essence
of the topological type” of O (Wolter 1992; Wolter et al. 2004). In summary, Wolter
has proven for the cases of smooth outlines and, in 2D, for piecewise-smooth outlines,
that the MA is the deformation retract of the solid O; and, therefore, it preserves
the homotopy type of O, which can be retrieved, for example, from the (topological)
graph structure of the MA.
Remark 4.2 (Relation between the MSG and co-circularity mapping) In the case
of the MSG, two points are put in relation with each other by searching where their
normal rays meet. This locus defines the contact circle.
In the case of the co–circularity mapping, the osculating circle is used to relate
pairs of tangents of boundary points by using the circular shape as an interpolant (ex.
4.1, §4.1).
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.27 (1760-1818)
Thus, the same object, a circle in close contact with the outline, is used in two
different manners, for relating pair of points of the boundary which share a “local
symmetry,” albeit of different nature.
Remark 4.3 (Planar and space curves) In the above we considered only outlines
tracing planar curves in R2 . The discussion also applies to space curves in R3 , al-
though the normal bundle is then more complex: it involves both the normals and
bi–normals (Bruce & Giblin 1992).
Example 4.7 (MSG on curved surfaces) Wolter et al. have extended the concept of
a medial symmetry graph from the R2 plane to curved surfaces embedded in R3
(Rausch et al. 1996; Kunze et al. 1997; Wolter & Friese 2000). They use the no-
tion of the cut locus for this purpose, where shortest paths are now defined along
shortest geodesics from an outline defined on the curved surface.39 In essence, this is
an extension from the Euclidean space (flat 2D manifolds, i.e., with zero curvature)
to the Riemannian space (curved 2D manifolds).
The above notion of medial symmetry sets for 1D outlines is naturally extended to
2D outlines, i.e., to the traces of surfaces of 3D objects.
Example 4.8 (Medial symmetry sets in 3D) Medial symmetries of 3D objects can
be captured by replacing contact disks by contact spheres.
Again, one may view these medial symmetries in different flavours. For example,
a wave might propagate from the surface boundary along the normal rays to the
boundary. Singularities of this normal mapping correspond to the symmetry set.
Most of the definitions used for the 2D case have been extended to 3D, with the
exceptions of mid-chord symmetries and Leyton’s PISA.
The main difference between medial symmetry sets in 2D and 3D, is that, in 3D,
for outlines sampling the surfaces of volumetric object, the resulting set is not a
graph in general, but rather a hypergraph made of intersecting medial surfaces; in
other words the medial set has one more dimension. To benefit from the use of a
graph representation, e.g., in recognition tasks, a number of additional transforms
are available.
Example 4.9 (Medial symmetry graphs in 3D) Medial symmetries of 3D objects
constrained to take the form of a 3D (non-necessarily planar) graph. Below we give
the main definitions to be found in the literature.
Generalised cylinders and 3D curve skeletons: A generalised cylinder is defined by
(i) an axis, (ii) a radius functional, often called “sweep function,” and (iii) a set
of rules to define branching points and behaviours. The axis is typically a 3D space
curve. The radius functional specifies the trace of the outline perpendicularly to
the axis, and may vary as it “sweeps” along the axis.40 The set of rules defines what
happens at the end points of an axis: in which directions other axes with their own
radius functional may branch-out. While generalised cylinders are useful in gen-
erating pre-specified objects, 3D curve skeletons are an attempt to define from the
outline an equivalent representation. These come under different names, repre-
senting slightly different definitions which we organise in two main categories: (i)
curve skeletons (Borgefors et al. 1999), skeleton graphs (Sundar et al. 2003) or
ik-skeletons (Wade & Parent 2002) are obtained by thinning a discrete distance
field, such as resulting from an object voxelisation, and (ii) Reeb graphs (Hilaga
et al. 2001) also called Level Set Diagrams (Verroust & Lazarus 2000) are ob-
tained by a simulated flooding of the object as a solid, tracing midpoints of level
sets as the object is filled-in.
Medial and shock scaffolds: The 3D MA of Blum has been generalised by Ley-
marie, Kimia and Giblin by defining a unique 3D graph structure from which
the MA itself can be recovered (Leymarie & Kimia 2001; Leymarie 2003; Gib-
lin & Kimia 2004; Leymarie & Kimia 2007). This is an extension of the concept
of a shock graph in 2D and it comes into two main types: (i) the Medial Scaf-
fold (MS ) which links the singularities of the radius function associated to the
MA for two types of medial curves: axial shock curves where three or more MA
surfaces intersect, and rib curves which are associated with surface ridges of the
object’s outline; (ii) the Shock Scaffold (SC ) which extends the MS by defining
additional curves linking the singularities of the radius function along the MA
surfaces.
Critical nets and molecular graphs: Consider each 3D Voronoi region, VR, of the
Voronoi diagram, VD, and its decomposition in a set of k-faces which are home-
omorphic to a closed ball, i.e., such that each VR can be continuously deformed
to a closed ball of same dimensions (Bredon 1993). The collection gives a finite
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.29 (1877-1926)
Example 4.10 (Shape from shading: Case of diffuse light source conditions)
Paraphrasing from the work of Langer and Zucker (Langer & Zucker 1994):
Under diffuse lighting conditions, surface luminance depends primarily on the
amount of sky visible from each surface patch, while surface normal and mutual
illumination are of secondary importance. This dependence – on the visible sky –
is modelled in terms of a surface aperture function: the percentage of incident di-
rections visible in a local hemisphere above “ground” or above a surface patch. An
approximately linear constraint between image intensity and surface aperture is
found.
While in the case of “classical SfS” (§4.1), where one assumes a point light source
(at infinity), the local geometry constraints lie along the surface, in the case of the
diffuse lighting problem, the local geometric constraints are found in the ambient
light above the surface.
In this “SfS on a cloudy day,” once surface apertures are estimated from an initial
image, a recursive marching algorithm is used to refine depths for a viewer as-
sumed seeing the scene from above. The entire space is discretized via an Eulerian
(fixed) grid and depth is propagated one node at a time; the method resembles
a blanket slowly falling on the scene which starts folding locally according to the
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.30 (1926-1985)
surface aperture values. All computations take place in the ambient space above
the object’s surface, yet another instance of a vertical transform sequence.
The Dε distance function of Kovacs et al. is computed for each frame of a video
sequence, and can be applied in each frame directly to the available outlines. It is
comparable to other “annular” medial symmetry schemes (Kelly & Levine 1995;
van Tonder & Ejima 2000, 2003), and this group also falls in the class of vertical
transform sequences. Note that as ε goes to zero, the method may converge to the
shock graph representation, where the special medial nodes along the MA are
those for which the associated distance (or radius) function goes through a critical
point (i.e., a source, relay or sink).
where α(s, t) = (x(s, t), y(s, t)) is the position vector of the outline, s is the length
parameter (not necessarily arclength), Υ(·) is the deformation functional, and N is the
unit normal vector (typically oriented “inward” with respect to the object’s outline).
The following deformation functional: Υ ≡ β0 + β1 κ, where κ stands for the
curvature of a curve segment α, is often considered in differential geometry (curve
shortening and smoothing), physics of interfaces, and computer vision (Catté et al.
1992; Kimia et al. 1995; Sethian 1999).
Example 4.13 (Alan Turing’s morphogens) The most famous RDTS was proposed
by Alan Turing (circa 1950) to model the changing spatial concentrations of chemical
substances he called “morphogens” (Turing 1952). By varying the relative influence
of reaction versus diffusion, different regular patterns observable in nature have been
simulated, such as the stripes of zebras and the spots of leopards (Murray 2004).
Definition 5.1 (Shape grammar) (i) It consists in a set of rules which permits to
replace one outline by another, by giving a “shape–interpolation,” i.e., a plausible de-
velopmental history. (ii) Alternatively, it is a set of rules which permits to add/delete
(or cut and paste) parts/components to an outline as a function of its derived shape.
Remark 5.1 (The inverse problem of vision) Note that, for the representation and
description of an object, toward its recognition, a shape grammar will prove particu-
larly useful if it is possible to “invert” the rules, or at least identify them, when given
a particular outline of an object. This would then permit to identify the process which
gave the “final” shape, or alternatively would give cues in identifying parts or object
components.
Since its proposal in the late 1980’s the RBC theory has been implemented in a
few systems aiming at performing object recognition. Bergevin and Levine im-
plemented the first comprehensive system based on RBC called PARVO (Primal
Access Recognition of Visual Objects) (Bergevin & Levine 1993). Wu and Levine
restricted Biederman’s set of 36 geons to only 7 defined as parameterised subsets
of superellipsoids for potentially faster processing (Wu & Levine 1997). Of related
scope is the earlier work by Pentland et al. using parameterised object primitives to
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.33 (2084-2143)
model parts (Pentland & Scarloff 1991). An account of the various approaches and
issues in using geons until the mid-1990’s can be found in (Dickinson et al. 1997).
More recent work can be found in (Siddiqi & Kimia 1995; Siddiqi et al. 2001; Sing
& Hoffman 2001).
Remark 5.2 (Dynamic shape and genetic grammars) One can generalise shape
grammars within the context of cellular automata where some randomisation is in-
troduced in the manifestation of the “rules” leading to “dynamic” shapes; for example
see the works of Wolfram et al. (Wolfram 1994) and more recent studies in biological
pattern genesis (Deutsch & Dormann 2005).
Another possible generalisation is in the context of genetic programming where
mutations and the natural mixing of a pool of “genes” (possibly representing shape
components or features) is used to obtain evolving “natural” shapes; for example see
the early works of Dawkins on biomorphs (Dawkins 1986), and of Latham and Todd
on genetic art (Todd & Latham 1992), and more recent works in arts and design
(Bentley 1999) and biological sciences (Kumar & Bentley 2003).
In the following I consider and compare two important grammars which emerged
in the field of visual perception analysis in the 1980’s, this to illustrate some of the
issues one has to face in designing a “shape language.”
Curvature extrema and symmetry axes are put into correspondence through the
following principle.44
Thus, this “duality” between curvature extrema and symmetry axes gives a rule for
relating them. Furthermore, symmetry axes are postulated as the “records of de-
formational processes.” E.g., the longer an axis is, the more significant the attached
process shall be. A process in this context may be thought of as an internal or ex-
ternal force acting on the shape. Significance may be taken in terms of magnitude
or duration in time.
Corollary 5.1 ((Leyton 1988)) Each curvature extrema implies a process whose
trace is the unique symmetry axis associated with, and terminating at, that ex-
tremum.
Thus, a process explains the extrema.
Corollary 5.2 (Asymmetry rule (Leyton 1988)) Processes are understood as creat-
ing greater curvature variation, i.e., greater information in the sense of mathematical
information theory.46
Remark 5.3 (Simplest shape is the circle) If one uses the “asymmetry rule” above to
extrapolate backward in time, the “ultimate starting point for the shape must have
been the circle.” The latter possess the least amount of curvature variation, hence
information.
Definition 5.2 (Types of curvature extrema) Four types of curvature extrema are
defined: local minimum and maximum, denoted by m and M, respectively, each being
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.35 (2205-2289)
possibly of negative (–) or positive (+) curvature. The semantic interpretation of these
extrema is as follows:
M+ : “Protrusion” m– : “Indentation”
m+ : “Squashing” M– : “Internal resistance”
Leyton’s Process–grammar
The process–grammar consists of two types of extrapolation rules: Continuation
and Bifurcation rules, denoted by C and B, respectively. Continuation implies
that no new extremum is created during the (deformation) process (but an ex-
tremum may change in nature), while a bifurcation implies that a new, interven-
ing, extremum must be introduced. The (rewrite) rules of the grammar can be
summarised as follows:
CM + : M+ → 0M + 0 “Protrusion continues”
Cm– : m– → 0m– 0 “Indentation continues”
Cm+ : m+ → 0m– 0 “Squashing continues until it indents”
CM – : M – → 0M + 0 “Internal resistance continues until it protrudes”
BM + : M + → M + m+ M + “Protrusion bifurcates”
Bm– : m– → m– M – m– “Indentation bifurcates”
Bm+ : m+ → m+ M + m+ “Protrusion introduced”
BM – : M – → M – m– M – “Indentation introduced”
NB: Continuation at protrusion, M + , and indentation, m– , are structurally trivial:
i.e., no change in the extremum’s nature occurs. As such they do not need be con-
sidered explicitly in the grammar. Thus, only the last six rules are to be used in
practice.
Leyton also suggested that deblurring of the boundary (coordinates) could provide
a way of finding a single interpretation or path through the shape–space (Ley-
ton 1989). Leyton claims that deblurring is the appropriate shape transform to
use because it “embodies the size–is–time heuristic;” and in particular: “deblur-
ring incrementally moves the boundary in the direction of increased curvature
variation.”
Remark 5.4 (Figure–ground reversal) Figure–ground reversal is simply dealt with
by introducing a duality operation, D, which interchanges extrema labels as follows:
M+ ↔ m– : protrusion ↔ indentation
M– ↔ m+ : squashing ↔ internal resistance
Codon–grammar
The codon–grammar consist of two types of rules:
Level–preserving: Replace a codon by a codon.
Level–increasing: Replace an extremum with a codon or replace a codon with a
pair of consecutive codons.
The codon–grammar consists of eighteen rewrite rules (see Leyton 1988: Tbl. 2 for
a list of these).
3. The codon substitutions are cutting and pasting operations, while the Process-
grammar rewrite rules are developmental, i.e., concerned with growth.
Note that both the Process and Codon grammars can be implemented on the basis
of horizontal or vertical transform sequences, to identify curvature extrema and
their relationships. An example of an horizontal approach is the curvature mor-
phology system of Leymarie and Levine which maps an outline to a set of valleys
and peaks of curvature where the scale of an extremum is characterised by the size
of a valley (mapping to a concavity for the outline) or a peak (a convexity for the
outline) (Leymarie & Levine 1989). An example of a vertical approach is the work
of Tek and Kimia based on the 2D MA where the collection of axes can be used
to rank order a set of curvature extrema (sitting at one end of each axis) by a no-
tion of significance or scale: how important each extremum is in relation to other
nearby ones and the overall shape (Tek & Kimia 2001). Significance is directly as-
sociated to symmetry axes in how long they are or how large an object’s sub-part
they capture; as such this system provides an implementation of some of Leyton’s
ideas, and in particular his “size is time” heuristic (5.1).
An important open question when designing a practical shape language, is
whether a discrete form of a family of transforms for shape is more adequate than
a continuous one, such as those provided by the Reaction-Diffusion Transform Se-
quences (RDTS; §4.3.1). Note that a discrete, rule-based approach provided by
a shape grammar such as Leyton’s or the Codon system, has the appeal of being
more easily implementable in a reasoning system; e.g., via an expert-system. Do-
ing reasoning in a continuous shape–space derived from an approach such as the
RDTS does not appear, at first glance, as easy, since the space is not explicitly as
well “compartmentalised.” A similar issue occurs in neuroscience where the lead-
ing paradigm is one where perception relies on discrete time events and processing
(VanRullen & Koch 2003).
. Conclusion
In the above “thoughts” I have tried to characterise the meaning of what “shape”
is and how it can be related to fundamental mathematical concepts, via the notion
of transforms: mathematical constructs permitting to characterise the probing of
the structural field of objects, thus revealing their shape. I have also given many ex-
amples based on my understanding of visual perception and computer vision, to
illustrate and better focus the effort. A particular emphasis was the definition of a
bi-partite classification of sequences of transforms, whether they first act along the
outline or away from it: what I call horizontal and vertical transform sequences.
This dichotomy is useful as a mean to compare and distinguish popular shape
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.38 (2428-2457)
transforms, as was illustrated with examples from the field of computer vision,
and it could be extended to other “visual” fields. For example, in CAD-CAM (Pa-
trikalakis & Maekawa 2002), one can distinguish methods based on the differential
geometry of curves and surfaces as well as of intersections – essentially sets of hor-
izontal transform sequences – from the methods based on offsets for curves and
surfaces – i.e., vertical transform sequences. How the horizontal versus vertical
approaches precisely compare, when applied to similar or identical problems –
e.g., when identifying corners or curvature extrema – is a study which remains
to be conducted. Furthermore, such a dichotomy may also be useful to study the
human visual system itself. The hypercolumn system of receptive fields first iden-
tified by Hubel and Wiesel (Heeger et al. 1996) can be understood as the basis of a
(differential) geometric engine used to generate tangent bundles, curvature maps,
and so on, from outline samples (Zucker 1987); in other words, we can portrait (at
least part of) the hypercolumn system of the visual cortex as implementing certain
horizontal transform sequences. Kovács et al. have observed in human subjects
differential contrast sensitivity maps for 2D shapes which are consistent with a
medial function representing the percentage of outline samples equidistant from
the observation point within a tolerance level (Kovács et al. 1998). This has lead
them to hypothesize a medial MA-based shape processing method for human
vision, an instance of vertical transform sequences.
This “essay” is also the beginning of a survey in the context of visual analysis
with the goal of addressing the very nature of shape understanding. Most topics
surveyed in this manuscript would benefit from a more detailed description, with
illustrative examples. Also, some topics have only been touched upon: in particu-
lar the last topic of shape grammars, and the genesis of shape, which would benefit
from a comparative study of the different approaches now available in computing,
such as “shock grammars” (Siddiqi & Kimia 1996; Sebastian et al. 2004). The lat-
ter can be seen as recent developments in computing inspired from the original
approach to morphogenesis developed by the mathematician René Thom in the
1970’s, the founder of catastrophe theory (Thom 1989), which lead to the emer-
gence of singularity theory and its applications (Arnold 1989; Bruce & Giblin 1992;
Poston & Stewart 1996).
Notes
. One can also talk of the shape of a region of space, even if empty (φ = 0), but defined by
some outline: e.g., the region of space in between objects such as the voids between galaxies in
the universe (Weygaert 2003; Jones et al. 2005).
. Leyton talks about “transfer structure” rather than “structural field,” notions I consider
equivalent for the purpose of my somewhat imprecise study.
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.39 (2457-2533)
. Refer to Leyton for an explanation of how memory and shape can be tied up together (Leyton
2001).
. The Hough transform maps edgels of a binary image to a parameter space. In the case of a
square one would use the slope and intercept of the equation of a straight line as the parameters.
Edgels which fall on nearby line segments will cluster in the parameter space.
. CAD-CAM: Computer Aided Design and Manufacturing.
. This is similar to the Hough Transform where lines in a binary image are mapped to points in
the parameter space.
. A “bias” in favour of invertible transforms for shape may have emerged in the signal process-
ing field, whose original concerns about (lossless) data transmission are very relevant, e.g., for
communication systems.
. The structural element is typically defined as a set of pixels or voxels, similarly to the input
dataset (an image array). Set operations, in particular Minkowski’s addition and subtraction,
are then used to construct higher order operators.
. E.g., this rules out the well-known Marr–Hildreth detector of zero–crossings of the Laplacian
of an image I, which creates “shadow edges,” i.e., edges of edges, if applied twice.
. Unless we change the constraint set or perturb the data set, of course – which is what one
may try to do to reach a global minimum.
. Adapted boundaries: hyper–surfaces on which one of the coordinate is constant.
. This is a restriction of a conformal map (i.e., which preserves local angles) which in the
literature is applied as a local constraint to more general functions, such as in complex analysis.
. A real square matrix A is positive definite if vT Av > 0. This is a useful property to solve
a linear system of equations, Ax = y by a (Cholesky) decomposition in a (upper) triangular
matrix.
. For more details refer to Arnold’s (Arnold 1989: Ch. 8) and Guillemin’s (Guillemin & Stern-
berg 1984) manuscripts.
. For a good introduction on this topic, refer to Gurtin’s manuscript (Gurtin 1982).
. The original statement gives a pair of solutions in general, only one of which is of practical
interest: the envelope at an increasing distance from the original sources of propagation.
. In differential geometry, ν is called the gauge figure or indicatrix: i.e., the set of (possible)
velocity vectors in every direction at q ∈ E3 (Koenderink 1990). In general, ν can vary in space,
which proves necessary to model propagation at interfaces or in non-homogeneous medium.
In the domain of visual perception, a varying index could be used to model contextual prop-
agation to favour certain orientations versus others (like horizontal versus vertical proximity
relationships between boundary elements).
. See also the work of Kotelly on Blum’s transform (Kotelly 1963). For a more recent applica-
tion of the distance surface τ = Φ(x, y) to the 2D skeletonization problem, when simulating a
grassfire on a square lattice, see the work of Leymarie and Levine (Leymarie & Levine 1992b).
. Another way of thinking of this concept, is in term of the system of normals to smooth
surfaces in Euclidean space. In neighbourhoods of such smooth surfaces, their normals (rays)
form a smooth fibration – they constitute tightly packed fibres (beams) filling-in space, hence
creating a regular distance hypersurface – but, at some distance from the surface, the various
normals begin to intersect one another (Arnold 1989: 480).
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.40 (2533-2604)
. A real function f : I → R such that ∃ a constant K ≥ 0, such that f (x) – f (y) ≤ K x – y
for all x, y in the interval I, is called a Lipschitz continuous function: i.e., it is limited in how fast
it can change.
. Historical note: This generalisation of the concept of Euclidean geometry was proposed by
Bernhard Riemann in 1854 in his lecture to qualify for the post of lecturer in Gauss’s department
at the University of Göttingen: “Über die Hypothesen welche der Geometrie zu Grunde liegen”
(Lord & Wilson 1984).
. For a more recent account on the field of morphometrics, refer to the works of Fred Book-
stein et al. (1992, 1997).
. Two (mathematical) objects are said to be homotopic if one can be continuously deformed
into the other (after Henri Poincaré, circa 1900 (Collins 2004)).
. For the “classic” application to the computer vision problem of the recovery of the trace of
curve points in an image, refer to the work of Parent & Zucker (1989).
. This work can be seen as an extension to surfaces of the ideas of Parent and Zucker developed
for curves (Parent & Zucker 1989).
. A “frame bundle” on a manifold M is a set of orthonormal (coordinate) frames defined at
every point p ∈ M: it determines the possible choices for an orthonormal basis of the tangent
space of M, TM ; it also reflects the ambiguity of choosing coordinates on M. The choice of a
(local) cross section of the frame bundle gives a “moving frame” used to calculate “tensors”
such as curvatures.
. The first fundamental form defines the inner product on the tangent space (of pairs of tan-
gent vectors): IS (v, u) = v · u. The second fundamental form defines a symmetric billinear form
on the tangent space: II S (v, u) = (v) · u, where is the “shape operator” giving the (nega-
tive) derivative of the unit normal vector field of a surface (a kind of curvature). The ratio II S /IS
computes normal curvature to the surface.
. A frame on a 2-D manifold M is an orthonormal coordinate system, such that two of its
axes span the tangent space TM at any given point p ∈ M. Note that at each point of M, ∃ an
infinite number of such frames (an equivalence class) through rotation around the normal third
axis. Hence the need for the notion of bundle of frames.
. “An ideal Lambertian surface is one that appears equally bright from all viewing directions
and reflects all incident light, absorbing none” (Horn 1986: 212). The luminance of a Lambertian
surface depends only on the angle between the surface normal and the incident light ray.
. For I(x, y) = Imax one can set the height z = zmax (i.e., either the brightest or darkest spot in
a image) to some arbitrary constant.
. Refer to Berthold Horn’s manuscript for more details on the methods of “characteristic
strips” and “variational relaxation” for solving this classical shape from shading problem (Horn
1986: Ch. 11).
. This is an example of the Jacobi identity for Lie transforms (§3.1.5).
. For work along these lines, where a geometrical image transformation is put in corre-
spondence with the physical motion of the imaged objects, see e.g., the work of Fitzpatrick
(1988).
. Historical note: Gibson et al. coined the term “optical flow;” e.g., refer to (Gibson et al.
1959) for some of their early ideas on the subject.
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.41 (2604-2692)
. A n–jet is a Taylor series expansion of which one keeps only the terms up–to (and including)
nth –order derivatives.
. This is an extension to the optical flow problem of the ideas on “shading flows and scenel
bundles” of Zucker et al. (Breton et al. 1992).
. I.e., it resembles a stick figure or skeleton for an anthropomorphic form.
. Using the “closure” requirement in the definition insures to capture limit points as ends of
SS branches. For example, in computing the MA of an ellipse, this ensures the two focal points
are included as end points.
. The result can as well be “stated by using maximal geodesic circles, [making it a closer
cousin] to the Euclidean maximal circle concept” (private communication from F.-E. Wolter).
. The origins of generalised cylinders can be traced back at least to the 18th century geome-
ter Gaspard Monge who defines “pipe” surfaces as the envelope of the set of spheres of radius
r centred on a space curve defining the axis or “spine” of the pipe (Monge 1850). Pipe sur-
faces are used in CAD-CAM for shape reconstruction, blending of surfaces, surface transitions,
numerically-controlled (NC) verification (Patrikalakis & Maekawa 2002: 353).
. A CW-complex, i.e., a “closure finite weak topology cell complex,” is a finite collection of
non-empty regions called “cells” homeomorphic to a closed ball, such that (i) the cells have
pairwise disjoint interiors, and (ii) the boundary of each cell is the union of other cells (of
lower dimensions) (Edelsbrunner & Shah 1997). CW-complices, a key tool in algebraic topology
(Bredon 1993), are a “nice” class of topological spaces which are general in scope and simple in
construction.
. Critical points of a function f on a manifold M are found where the gradient vanishes, i.e.,
∇f = 0. Such a function is said to be “Morse” if its critical point are non-degenerate, i.e., they
are isolated.
. Such “molecular” shape descriptions have recently started being used in Computer Graph-
ics, for modelling objects via meta-balls, that is, networks of spheres of varying radii, connected
into a 3D graph representation (Hart 1999).
. This was first proved only in the case of regular, simply connected, planar curve. It was then
extended to deal with the more interesting case of first order discontinuities, such as corners,
creases and cusps, in Hayes & Leyton (1989).
. The 3D version of this theorem states that “At each curvature extremum on a principle line of
curvature, there is a unique symmetry sheet that terminates at that extremum” (Yuille & Leyton
1990). Technically speaking, this argument is always valid (for 2D and 3D problems) when con-
sidering the symmetry set (SS) (Bruce & Giblin 1992; Giblin & Kimia 2004). When considering
only the MA (subset of the SS) some axes may be missing in correspondence to surface ridges,
i.e., lines of curvature extrema of the boundary. Recall that the MA sheets terminate at centres
of curvature associated to ridges, rather than on the bounding surface itself.
. Here information is identified with variety. Refer to Howard Resnikoff who has shown that
the measurement of curvature variation has the same form as the measurement of information in
information theory (Resnikoff 1989).
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.42 (2692-2819)
References
Arnheim, R. (1974). Art and Visual Perception: A Psychology of the Creative Eye. Berkeley, CA:
University of California Press. New version; expanded and revised edition of the 1954 origi-
nal.
Arnold, V. I. (1989). Mathematical Methods of Classical Mechanics, volume 60 of Graduate Texts
in Mathematics. Springer-Verlag, 2nd edition.
Bader, R. F. W. (1990). Atoms in Molecules – A Quantum Theory, volume 22 of International
Series of Monographs on Chemistry. Oxford, UK: Clarendon Press.
Bamberg, P., & Sternberg, S. (1988). A Course in Mathematics for Students of Physics, volume 1.
Cambridge University Press.
Bentley, P. J. (Ed). (1999). Evolutionary Design by Computers. Morgan Kaufmann.
Bergevin, R., & Levine, M. D. (1993). Generic object recognition: Building and matching coarse
descriptions from line drawings. IEEE Transactions on Pattern Analysis and Machine Intelli-
gence, 15, (1), 19–36.
Biederman, I. (1987). Recognition-by-components: A theory of human image understanding.
Psychological Review, 94, 115–147.
Biederman, I. (1995). Visual object recognition. In S. F. Kosslyn & D. N. Osherson (Eds.), An
Invitation to Cognitive Science, volume 2 of Visual Cognition, Chapter 4 (pp. 121–165). MIT
Press.
Blank, A. A. (1978). Metric geometry in human binocular perception: Theory and fact. In E. L.
J. Leeuwenberg & H. F. J. M. Buffart (Eds.), Formal Theories of Visual Perception, Chapter 4
(pp. 83–102). John Wiley and Sons.
Blum, H. (1967). A transformation for extracting new descriptors of shape. In W. Wathen-Dunn
(Ed.), Models for the Perception of Speech and Visual Form (pp. 362–380). Cambridge, MA:
MIT Press. Proceedings of a Symposium held in Boston, MA, November 1964.
Blum, H. (1973). Biological shape and visual science. Journal of Theoretical Biology, 38, 205–287.
Bookstein, F. L. (1992). Morphometric Tools for Landmark Data: Geometry and Biology. Cam-
bridge University Press.
Bookstein, F. L. (1997). Landmark methods for forms without landmarks: Morphometrics of
group differences in outline shape. Medical Image Analysis, 1(3), 225–243.
Borgefors, G. et al. (1999). Computing skeletons in 3D. Pattern Recognition, 32(7), 1225–1236.
Brady, M., & Asada, H. (1984). Smoothed local symmetries and their implementation. Interna-
tional Journal of Robotic Research, 3(3), 36–61.
Bredon, G. E. (1993). Topology and Geometry, Volume 139 of Graduate Texts in Mathematics.
Springer-Verlag.
Breton, P, Iverson, L., Langer, M., & Zucker, S. W. (1992). Shading flows and scenel bundles: A
new approach to shape from shading. In G. Sandini (Ed.), Proceedings of the 2nd European
Conference on Computer Vision (ECCV), Volume 588 of Lecture Notes in Computer Science
(pp. 135–150). Santa Margherita, Ligure, Italy. Springer-Verlag.
Bruce, J. W., & Giblin, P. J. (1992). Curves and Singularities: A Geometrical Introduction to
Singularity Theory, 2nd edition. Cambridge University Press, UK.
Burt, P. J. (1981). Fast filter transforrns for image processing. Computer Graphics and Image
Processing (CGIP), 16, 20–51.
Burt, P. J. (1983). Fast algorithms for estimating local image properties. Computer Vision, Graph-
ics and Image Processing (CVGIP), 21, 368–382.
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.43 (2819-2958)
Catté, F., Lions, P.-L., Morel, J.-M., & Coll, T. (1992). Image selective smoothing and edge
detection by nonlinear diffusion. SIAM Journal on Numerical Analysis, 29(1), 182–193.
Collins, G. P. (2004). The shapes of space. Scientific American, 291, 94–103.
Danielsson, P.-E. (1980). Euclidean distance mapping. Computer Graphics and Image Processing
(CGIP), 14, 227–248.
Dawkins, R. (1986). The Blind Watchmaker. W. W. Norton and Company. Reissued in 1990 (UK)
and 1996 (USA).
Deutsch, A., & Dormann, S. (2005). Cellular Automaton Modeling of Biological Pattern Forma-
tion. Modeling and Simulation in Science, Engineering and Technology. Birkhäuser.
Dickinson, S. J., Bergevin, R., Biederman, L., Eklundh, J.-O., Munck-Fairwood, R., Jain, A. K., &
Pentland, A. (1997). Panel report: The potential of geons for generic 3-D object recognition.
Image and Vision Computing, 15(4), 277–292.
Ebert, D. S., Musgrave, F. K., Peachey, D., Perlin, K., & Worley, S. (2002). Texturing and Model-
ing – A Procedural Approach. Morgan Kaufmann, 3rd edition.
Edelsbrunner, H., & Shah, N. R. (1997). Triangulating topological spaces. International Journal
of Computational Geometry and Applications, 7(4), 365–378.
Ferraro, M., & Caelli, T. M. (1988). Relationships between integral transform invariances and
Lie group theory. Journal of the Optical Society of America – A, 5(5), 738–742.
Ferraro, P., Godin, C., & Prusinkiewicz, P. (2005). Toward a quantification of self-similarity in
plants. Fractals, 13(2), 91–109.
Fitzpatrick, J. M. (1988). The existence of geometrical density image transformations corre-
sponding to object motion. Computer Vision, Graphics and Image Processing (CVGIP), 44(2),
155–174.
Giblin, P. J., & Kimia, B. B. (2004). A formal classification of 3D medial axis points and their
local geometry. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 26(2),
238–251.
Gibson, E. J., Gibson, J. J., Smith, O. W., & Flock, H. (1959). Motion parallax as a determinant
of perceived depth. Journal of Experimental Psychology, 8(1), 40–51.
Gibson, J. J. (1950). The Perception of the Visual World. Boston, MA: Houghton Mifflin Cie.
Reprinted in 1974 by Greenwood Press, Westport, CT.
Goodman, A. W. (1964). A partial differential equation and parallel plane curves. American
Mathematical Monthly, 71, 257–264.
Guillemin, V., & Sternberg, S. (1984). Symplectic Techniques in Physics. Cambridge University
Press, UK.
Gurtin, M. E. (1982). An Introduction to Continuum Mechanics, volume 158 of Mathematics in
Science and Engineering Series. Harcourt Publishers.
Hansen, O. (1992). On the Use of Local Symmetries in Image Analysis and Computer Vision.
PhD thesis, Laboratory of Image Analysis, Institute of Electronic Systems, Aalborg Univer-
sity, Denmark.
Hart, J. C. (1999). Using the CW-complex to represent the topological structure of im-
plicit surfaces and solids. In Proc. Implicit Surfaces ’99 (pp. 107–112). Eurograph-
ics/SIGGRAPH, ACM.
Hayes, P J., & Leyton, M. (1989). Processes at discontinuites. In Proceedings of the 11th Joint
Conference on Artificial Intelligence, volume 2 (pp. 1267–1272). Detroit, MI: American Asso-
ciation for A.I.
Heeger, D. J., Simoncelli, E. P., & Movshon, J. A. (1996). Computational models of cortical vi-
sual pracessing. In Proceedings of the National Academy of Science, volume 93 (pp. 623–627).
Washington, DC.
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.44 (2958-3082)
Hilaga, M., Shinagawa, Y , Kohmura, T., & Kunii, T. L. (2001). Topology matching for fully
automatic similarity estimation of 3D shapes. In Proc. SIGGRAPH 2001 (pp. 203–212). Los
Angeles, CA. ACM.
Hoffman, W. C. (1966). The Lie algebra of visual perception. Journal of Mathematical Psychology,
3, 65–98. Errata in vol. 4, pp. 348–349, 1967.
Hoffman, W. C. (1978). The Lie transformation group approach to visual neuropsychology. In
E. L. J. Leeuwenberg & H. F. J. Buffart M. (Eds.), Formal Theories of Visual Perception, Chapter
2 (pp. 27–66). John Wiley and Sons.
Hoffman, W. C. (1984). Figural synthesis by vectorfields: Geometric neuropsychology. In P. C.
Dodwell & T. Caelli (Eds.), Figural Synthesis, Chapter 8 (pp. 249–282). Lawrence Erlbaum
Associates.
Horn, B. K. P. (1986). Robot Vision. Electrical Engineering and Computer Science Series. Cam-
bridge, MA: MIT Press.
Hummel, J. E., & Biederman, I. (1992). Dynamic binding in a neural network for shape recog-
nition. Psychological Review, 99, 480–517.
Johnson, C. K. (1999). Crystallographic topology 2: Overview and work in progress. In V. Alex-
iades& G. Siopsis (Eds.), Trends in Mathematical Physics, Studies in Advanced Mathematics.
AMS, Providence, RI.
Johnson, C. K., Burnett, M. N., & Dunbar, W. D. (1996). Crystallographic topology and
its applications. Oak Ridge National Laboratory. Preprint available: http://www.ornl.gov/
sci/ortep/topology/preprint.html
Jones, B. J. T., Martínez, V. J., Saar, E., & Trimble, V. (2005). Scaling laws in the distribution of
galaxies. Reviews of Modern Physics, 76, 1211–1267.
Jones, D. G., & Malik, J. (1992). Determining three dimensional shape from orientation and
spatial frequency disparities. In G. Sandini (Ed.), Proceedings of the 2nd European Conference
on Computer Vision (ECCV), volume 588 of Lecture Notes in Computer Science (pp. 661–669).
Santa Margherita, Ligure, Italy. Springer-Verlag.
Kass, M., Witkin, A., & Terzopoulos, D. (1988). Snakes: Active contour models. International
Journal of Computer Vision (IJCV), 1(4), 321–331.
Kelly, M. F., & Levine, M. D. (1995). Annular symmetry operators: A method for locating and de-
scribing objects. In Proceedings of the 5th International Conference on Computer Vision (ICCV)
(pp. 1016–1021).
Kimia, B. B. (1991). Toward a Computational Theory of Shape. PhD thesis, Electrical Engineering,
McGill University, Montreal, Canada.
Kimia, B. B. (2003). On the role of medial geometry in human vision. Journal of Physiology,
97(2–3), 155–190.
Kimia, B. B., Tannenbaum, A. R., & Zucker, S. W. (1995). Shapes, shocks, and deformations I:
The components of two-dimensional shape and the reaction-diffusion space. International
Journal of Computer Vision, 15(3), 189–224.
Klein, F. (1893). Vergleichende Betrachtungen über neuere Geometrische Forschungen (“Erlan-
gen Programm”). Mathematische Annalen, 43, 63–100.
Koenderink, J. J. (1990). Solid Shape. A.I. Series. Cambridge, MA: MIT Press.
Koenderink, J. J., & van Doorn, A. J. (1976). Geometry of binocular vision and a model for
stereopsis. Biological Cybernetics, 21(1), 29–35.
Kotelly, J. C. (1963). A mathematical model of Blum’s theory of pattern recognition. Research
report AFCRL-63-164, U.S. Air Force Cambridge Research Laboratories.
Kovács, L., Feher, A., & Julesz, B. (1998). Medial-point description of shape: A representation
for action coding and its psychological correlates. Vision Research, 38, 2323–2333.
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.45 (3082-3203)
Kumar, S., & Bentley, P. J. (Eds). (2003). On Growth, Form and Computers. Academic Press.
Kunze, R., Wolter, F.-E., & Rausch, T. (1997). Geodesic Voronoi diagrams on parametric sur-
faces. In CGI ’97 (pp. 230–237). IEEE, Computer Society Press.
Lagarde, J. W. (1990). Constraints and their satisfaction in the recovery of local surface structure.
Master’s thesis, Electrical Engineering, McGill University, Montreal, Canada.
Langer, M. S., & Zucker, S. W. (1994). Shape-from-shading on a cloudy day. Journal of the Optical
Society of America – A, 11(2), 467–478.
Leymarie, F. (1990). Tracking and describing deformable objects using active contour mod-
els. Technical report CIM-90-9, McGill University, Dept. of Electrical Engineering, Montreal,
Canada. www.lems.brown.edu/∼leymarie/meng/.
Leymarie, F., & Levine, M. D. (1989). Shape features using curvature morphology. In W. A.
Pearlman (Ed.), Proc. of the SPIE Conf. on Visual Communications and Image Processing IV,
volume 1199 (pp. 390–401).
Leymarie, F., & Levine, M. D. (1992a). Fast raster scan distance propagation on the discrete
rectangular lattice. Computer Vision, Graphics and Image Processing – Image Understanding
(CVGIP-IU), 55(1), 84–94.
Leymarie, F., & Levine, M. D. (1992b). Simulating the grassfire transform using an active con-
tour model. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 14(1),
56–75.
Leymarie, F., & Levine, M. D. (1993). Tracking deformable objects in the plane using an active
contour model. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 15(6),
617–634.
Leymarie, F. F. (2003). Three-Dimensional Shape Representation via Shock Flows. PhD thesis,
Brown University, Providence, RI, USA. www.lems.brown.edu/∼leymarie/phd/.
Leymarie, F. F., & Kimia, B. B. (2000). Discrete 3D wave propagation for computing morpho-
logical operations from surface patches and unorganized points. In J. Goutsias, L. Vincent, &
D. Bloomberg (Eds.), Math. Morphology and its Applications to Image and Signal Processing,
volume 18 of Computational Imaging and Vision Series (pp. 351–360). Kluwer Academic.
Leymarie, F. F., & Kimia, B. B. (2001). The shock scaffold for representing 3D shape. In C. Arcelli,
L. P. Cordella, G. S. d. B. (Eds.), Visual Form, number LNCS 2059 (pp. 216–229). Springer-
Verlag.
Leymarie, F. F., & Kimia, B. B. (2007). The medial scaffold of 3D unorganized point clouds. IEEE
Transactions on Pattern Analysis and Machine Intelligence. Accepted for publication.
Leyton, M. (1984). Perceptual organization as nested control. Biological Cybernetics, 51, 141–
153.
Leyton, M. (1985). Generative systems of analyzers. Computer Vision, Graphics and Image Pro-
cessing (CVGIP), 31(2), 201–241.
Leyton, M. (1987). Symmetry-curvature duality. Computer Vision, Graphics and Image Process-
ing (CVGIP), 38, 327–341.
Leyton, M. (1988). A process grammar for shape. Artificial Intelligence Journal, 34(2), 213–247.
Leyton, M. (1989). Inferring causal history from shape. Cognitive Science, 13, 357–387.
Leyton, M. (2001). A Generative Theory of Shape. Number LNCS 2145. Springer-Verlag.
Lindenmayer, A. (1968). Mathematical models for cellular interactions in development: Parts I
and II. Journal of Theoretical Biology, 18, 280–315.
Lord, E. A., & Wilson, C. B. (1984). The Mathematical Description of Shape and Form. Ellis
Horwood Series in Mathematics and its Applications. Statistics and Operational Research.
Halsted Press.
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.46 (3203-3346)
Sebastian, T. B., Klein, P. N., & Kimia, B. B. (2004). Recognition of shapes by editing their shock
graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(5), 550–571.
Serra, J. (1982). Image Analysis and Mathematical Morphology, volume 1. Academic Press.
Sethian, J. A. (1999). Level Set Methods and Fast Marching Methods: Evolving Interfaces in Com-
putational Geometry, Fluid Mechanics, Computer Vision, and Materials Science. Cambridge
Monograph on Applied and Computational Mathematics, 2nd edition. Cambridge Univer-
sity Press, UK.
Siddiqi, K., & Kimia, B. (1995). Parts of visual form: Computational aspects. IEEE Transactions
on Pattern Analysis and Machine Intelligence, 17(3), 239–251.
Siddiqi, K., & Kimia, B. B. (1996). A shock grammar for recognition. In Proceedings of the
Conference on Computer Vision and Pattern Recognition (CVPR’96), (pp. 507–513).
Siddiqi, K., Kimia, B. B., Tannenbaum, A., & Zucker, S. W. (2001). On the psychophysics of the
shape triangle. Vision Research, 41(9), 1153–1178.
Siddiqi, K., Kimia, B. B., Tannenbaum, A., & Zucker, S. W. (1999a). Shocks, shapes, and wiggles.
Image and Vision Computing, 17(5–6), 365–373.
Siddiqi, K., Shokoufandeh, A., Dickinson, S. J., & Zucker, S. W. (1999b). Shock graphs and shape
matching. International Journal of Computer Vision, 35(1), 13–32.
Sing, M., & Hoffman, D. D. (2001). Part-based representations of visual shape and implica-
tions for visual cognition. In T. F. Shipley & P. J. Kellman (Eds.), From Fragments to Objects:
Segmentation and Grouping in Vision, Chapter 9 (pp. 401–459). Elsevier Science.
Soille, P. (2004). Morphological Image Analysis, corrected 2nd edition. Springer.
Stavroudis, O. N. (1972). The Optics of Rays, Wavefronts, and Caustics, volume 38 of Pure and
Applied Physics. Academic Press.
Sundar, H., Silver, D., Gagvani, N., & Dickinson, S. J. (2003). Skeleton based shape matching
and retrieval. In Proc. of IEEE International Conference on Shape Modeling and Applications
(pp. 130–142). Seoul, Korea.
Tek, H., & Kimia, B. B. (2001). Boundary smoothing via symmetry transforrns. Journal of
Mathematical Imaging and Vision, 14(3), 211–223.
Terzopoulos, D., Witkin, A., & Kass, M. (1987). Symmetry-seeking models for 3D object recog-
nition. International Journal of Computer Vision (IJCV), 1(3), 211–221.
Thom, R. (1989). Structural Stability and Morphogenesis. Advanced book classics. Addison Wes-
ley.
Thompson, D. W. (1992). On Growth and Form. Dover Publications, complete revised edition
of the 1917 original edition.
Todd, J. T., & Reichel, F. D. (1989). Ordinal structure in the visual perception and cognition of
smoothly curved surfaces. Psychological Review, 96(4), 643–657.
Todd, S., & Latham, W. (1992). Evolutionary Art and Computers. Academic Press.
Turing, A. (1952). The chemical basis of morphogenesis. Philosophical Transactions of the Royal
Society of London, 237(641), 37–72.
van Tonder, G. J., & Ejima, Y. (2000). The patchwork engine: Image segmentation from image
symmetries. Neural Networks, 13(3), 291–303.
van Tonder, G. J., & Ejima, Y. (2003). Flexible computation of shape symmetries within the
maximal disk paradigm. IEEE Transactions on Systems, Man, and Cybernetics, Part B, 33(3),
535–540.
VanRullen, R., & Koch, C. (2003). Is perception discrete or continuous? Trends in Cognitive
Sciences, 7(5), 207–213.
Verroust, A., & Lazarus, F. (2000). Extracting skeletal curves from 3D scattered data. The Visual
Computer, 16(1), 15–25.
JB[v.20020404] Prn:23/10/2006; 9:07 F: AICR6714.tex / p.48 (3499-3558)
Wade, L., & Parent, R. E. (2002). Automated generation of control skeletons for use in anima-
tion. The Visual Computer, 18(2), 97–110.
Weygaert, R. v. d. (2003). The cosmic foam: Stochastic geometry and spatial clustering across
the universe. In E. Feigelson & G. Babu (Eds.), Statistical Challenges in Modern Astronomy III
(pp. 175–196). Springer-Verlag.
Wolfram, S. (1994). Cellular Automata and Complexity: Collected Papers. Addison-Wesley.
Wolter, F.-E. (1985). Cut Loci in Bordered and Unbordered Riemannian Manifolds. PhD thesis,
Technical University of Berlin, Department of Mathematics, Germany.
Wolter, F.-E. (1992). Cut locus and medial axis in global shape interrogation and representation.
Sea grant report, MIT, Cambridge, MA.
Wolter, F.-E., & Friese, K.-I. (2000). Local and global geometric methods for analysis interroga-
tion, reconstruction, modification and design of shape. In Proceedings of Computer Graphics
International (CGI’00) (pp. 137–151). Geneva, Switzerland. IEEE Computer Society.
Wolter, F.-E., Peinecke, N., & Reuter, M. (2004). Geometric Modeling of Complex Shapes and En-
gineering Artifacts, volume 1 of Encyclopedia of Computational Mechanics, Chapter 16. Wiley.
Wu, K., & Levine, M. D. (1997). 3-D shape approximation using parametric geons. Image Vision
and Computing, 15(2), 143–158.
Yuille, A., & Leyton, M. (1990). 3-D symmetry-curvature duality theorems. Computer Vision,
Graphics and Image Processing (CVGIP), 52, 124–140.
Zucker, S. W. (1987). The emerging paradigm of computational vision. Annual Review of Com-
puter Science, 2, 69–89.