PDF Nonlinear Computational Geometry 2010th Edition Ioannis Z Emiris Download
PDF Nonlinear Computational Geometry 2010th Edition Ioannis Z Emiris Download
com
https://ebookgate.com/product/nonlinear-
computational-geometry-2010th-edition-ioannis-z-
emiris/
https://ebookgate.com/product/computational-intelligence-in-
games-1st-edition-z-chen/
https://ebookgate.com/product/computational-algebraic-and-
analytic-geometry-ams-special-sessions-on-computational-algebrac-
and-analytic-geometry-for-low-dimensional-varieties-mika-seppala/
https://ebookgate.com/product/analysis-geometry-nonlinear-
optimization-and-applications-1st-edition-panos-m-pardalos/
https://ebookgate.com/product/handbook-of-discrete-and-
computational-geometry-2nd-edition-rajeev-bansal/
31st International Symposium on Computational Geometry
SoCG 2015 1. Aufl Edition Strupat
https://ebookgate.com/product/31st-international-symposium-on-
computational-geometry-socg-2015-1-aufl-edition-strupat/
https://ebookgate.com/product/computational-methods-for-
modelling-of-nonlinear-systems-1st-edition-a-torokhti-and-p-
howlett-eds/
https://ebookgate.com/product/elasticity-and-geometry-from-hair-
curls-to-the-nonlinear-response-of-shells-1st-edition-basile-
audoly/
https://ebookgate.com/product/nonlinear-dynamic-modeling-of-
physiological-systems-ieee-press-series-on-biomedical-
engineering-1st-edition-professor-vasilis-z-marmarelis/
https://ebookgate.com/product/improper-riemann-integrals-2nd-
edition-ioannis-m-roussos/
The IMA Volumes
in Mathematics
and its Applications
Volume 151
Series Editors
Fadil Santosa Markus Keel
Nonlinear Computational
Geometry
Editors
Ioannis Z. Emiris Frank Sottile
Lab of Geometric & Algebraic Algorithms Department of Mathematics
Department of Informatics Texas A&M University
& Telecommunications College Station, TX 77843
National and Kapodistrian University USA
of Athens http://www.math.tamu.edu/~sottile
Panepistimiopolis, 15784
Greece
http://cgi.di.uoa.gr/~emiris/index-eng.html
Thorsten Theobald
FB 12 - Institut für Mathematik
Johann Wolfgang Goethe-Universität
Robert-Mayer-Str. 10
D-60325 Frankfurt am Main
Germany
http://www.math.uni-frankfurt.de/~theobald/
Series Editors
Fadil Santosa
Markus Keel
Institute for Mathematics
& its Applications
University of Minnesota
Minneapolis, MN 55455
USA
ISSN 0940-6573
ISBN 978-1-4419-0998-5 e-ISBN 978-1-4419-0999-2
DOI 10.1007/978-1-4419-0999-2
Springer New York Dordrecht Heidelberg London
Series Editors
Fadil Santosa, Director of the IMA
Markus Keel, Deputy Director of the IMA
v
PREFACE
Ioannis Z. Emiris
Department of Informatics & Telecommunications
National and Kapodistrian University of Athens
http://cgi.di.uoa.gr/∼ emiris/index-eng.html
Frank Sottile
Department of Mathematics
Texas A&M University
http://www.math.tamu.edu/∼ sottile
Thorsten Theobald
Institut für Mathematik
Johann Wolfgang Goethe-Universität
http://www.math.uni-frankfurt.de/∼ theobald/
CONTENTS
Foreword . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
xi
SPECTRAL TECHNIQUES TO EXPLORE POINT CLOUDS
IN EUCLIDEAN SPACE, WITH APPLICATIONS TO
COLLECTIVE COORDINATES IN STRUCTURAL BIOLOGY
FRÉDÉRIC CAZALS∗ , FRÉDÉRIC CHAZAL† , AND JOACHIM GIESEN‡
Contents
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.1 Geometric data analysis and spectral point cloud processing . . . . . . . . . . . . . . 2
1.2 Spectral methods and alternatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 An application in structural biology: Protein folding . . . . . . . . . . . . . . . . . . 5
1.4 Notations and paper overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2 PCA and MDS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.1 PCA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.2 MDS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3 Localization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3.1 Neighborhood criteria . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3.2 Dimension detection using PCA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
4 Turning non-linear . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
4.1 Maximum variance unfolding (MVU) . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
4.2 Locally linear embedding (LLE) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
4.3 ISOMAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
4.4 Laplacian eigenmaps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
4.5 Hessian eigenmaps (HLLE) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
4.6 Diffusion maps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
5 Applications in structural biology: the folding problem . . . . . . . . . . . . . . . . . . . . . 15
5.1 Folding: from experiments to modeling . . . . . . . . . . . . . . . . . . . . . . . . . . 16
5.2 Energy landscapes and dimensionality reduction . . . . . . . . . . . . . . . . . . . . . 16
5.2.1 Potential and free energy landscape . . . . . . . . . . . . . . . . . . . . . . 16
5.2.2 Enthalpy-entropy compensation, energy funnel, ruggedness and frustration . . 16
5.2.3 Cooperativity and correlated motions . . . . . . . . . . . . . . . . . . . . . 18
5.3 Bio-physics: Pre-requisites . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
5.3.1 Molecular dynamics simulations . . . . . . . . . . . . . . . . . . . . . . . . 19
5.3.2 Models, potential energy landscapes and their ruggedness . . . . . . . . . . . 19
5.3.3 Morse theory and singularity theory . . . . . . . . . . . . . . . . . . . . . . 20
5.3.4 Free energy landscapes and reaction coordinates . . . . . . . . . . . . . . . . 20
5.3.5 Folding probability pf old . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
5.4 Inferring reaction coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
5.4.1 Reaction coordinates? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
5.4.2 Contacts based analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
5.4.3 Dimension reduction based analysis . . . . . . . . . . . . . . . . . . . . . . 26
5.4.4 Morse theory related analysis . . . . . . . . . . . . . . . . . . . . . . . . . . 27
6 Outlook . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
∗ INRIA Sophia-Antipolis, 2004 route des Lucioles, BP 93, 06902 Sophia Antipolis,
France ([email protected]).
† INRIA Saclay, Parc Orsay Université, 4 rue Jacques Monod, 91893 Orsay Cedex,
France ([email protected]).
‡ Institut fuer Informatik, Ernst-Abbe-Platz 2, D-07743 Jena, Germany
([email protected]).
I.Z. Emiris et al. (eds.), Nonlinear Computational Geometry, The IMA Volumes in 1
Mathematics and its Applications 151, DOI 10.1007/978-1-4419-0999-2_1,
© Springer Science + Business Media, LLC 2010
2 FRÉDÉRIC CAZALS, FRÉDÉRIC CHAZAL, AND JOACHIM GIESEN
1. Introduction.
with R(D̂k , Dd ) the correlation coefficient taken over all entries of matrices D̂k and Dd .
The closer to zero this variance, the better the approximation.
TECHNIQUES TO EXPLORE POINT CLOUDS IN EUCLIDEAN SPACE 5
where pij is the j’th component of the point pi ∈ P . From the matrix P
one canonically derives two positive semi-definite matrices,
(1) the covariance matrix C = P P T , and
(2) the Gram matrix G = P T P .
The
Pn covariance matrix is a (d × d)-matrix and can also be written as C =
T
i=1 pi pi , whereas the Gram matrix has dimension n × n and can also be
TECHNIQUES TO EXPLORE POINT CLOUDS IN EUCLIDEAN SPACE 7
written as G = pTi pj . Both matrices are intimately linked also via their
eigenvectors and eigenvalues. We have the following observation.
Observation 1. The matrices C and G have the same non-zero
(positive) eigenvalues (and thus the same rank).
Proof. Let v ∈ Rd be an eigenvector of C with eigenvalue λ > 0, then
T
P v is an eigenvector of G also with eigenvalue λ as can be seen from the
following simple calculation:
GP T v = P T P P T v = P T Cv = λP T v.
Similarly, if u ∈ Rn is an eigenvector of G with eigenvalue µ > 0, then P u
is an eigenvector of C with eigenvalue µ.
to a rotation, i.e., completely preserving the pairwise inner products also preserves the
pairwise distances kpi − pj k.
TECHNIQUES TO EXPLORE POINT CLOUDS IN EUCLIDEAN SPACE 9
data set, that is closer to an affine subspace. The unfolding aims at max-
imizing the distance between non-neighboring points (after some choice of
neighborhood) while preserving the distances between neighboring points.
Technically MVU proceeds as follows: let D = (dij = kp− pj k2 ) be the
symmetric (n × n)-matrix of pairwise distances. Choose a suited neigh-
borhood for each point in P (Weinberger and Saul choose the symmetric
κ-nearest neighbors) and let the indicator variable nij be 1 if either pi is in
the neighborhood of pj or pj is in the neighborhood of pi , and 0 otherwise.
From D an unfolding, a positive semi-definite (n × n)-matrix K = (kij )
(interpreted as the Gram matrix of the unfolded point set) is computed
through the following semi-definite program (SDP)
Maximize the trace of K subject to
(1) K is positive semi-definite
Xn
(2) kij = 0
i,j=1
(3) kii − 2kij + kjj = dij for all (i, j) with nij = 1.
From K a lower dimensional embedding can be computed as described for
MDS.
4.2. Locally linear embedding (LLE). LLE is a method intro-
duced in [45, 46] that intends to take into account the local linearity of the
underlying manifold M to perform the reduction of dimension. In a first
step, LLE discards pairwise distances between widely separated points by
building a neighborhood graph G (see Section 3). The goal of this first step
is to connect only close points of P so that the neighbors of each vertex
pi in G are contained in a small neighborhood of pi which is close to the
tangent space of the underlying manifold M at pi . To take this local lin-
earity into account, LLE computes for each vertex pi of the graph its best
approximation as a linear combination of its neighbors. More precisely,
one computes a sparse matrix of weights Wi,j that minimize the quadratic
error
n
X X
ε(W ) = kpi − Wi,j pj k2
i=1 j∈N (pi )
where N (pi ) is the set of the vertices that are connected to pi in G. This
is a simple least square problem. Solving it with the additional constraint
X
∀i, Wi,j = 1
j∈N gb(pi )
X X
Φ(p̂i ) = kp̂i − Wi,j p̂j k2
i j
kp −p k2
the weights is given by the Gaussian kernel Wi,j = exp(− i 4σ j ), where
σ is a user-defined parameter4 . Second the graph G is embedded into Rk in
such a way that the close connected points stay as close as possible. More
precisely the points pi are mapped to the points p̂i ∈ Rk that minimize
X
φ(P̂ ) = kp̂i − p̂j k2 Wi,j .
i,j
Fig. 1. An example of a graph G (the weights are given by the heat kernel ap-
proximation, see text) with points that are close or far to each other with respect to the
diffusion metric: the points x and y are close to each other while the points x and z are
far away because G is “pinched” between the two parts containing x and z. So there
are few paths connecting x to z.
–the so-called rotamers [86, 73], whence a priori an exponential number of conformations.
7 As will be shown with Eq. (2), a free energy landscape is obtained from the potential
Unfolded
conformations
Similarity
to native state
Partially folded
conformations
Folded
ie native state
Coordinates
(cartesian, internal)
8 The fact that the kinetic pathway follows the thermodynamic one is non trivial,
in the Taylor series expansion of V is the quadratic one. Diagonalizing the corresponding
quadratic form yields the so-called normal modes, whose associated eigenvectors are
collective coordinates. See for example [97].
11 Given a molecular dynamics simulation, one may investigate the correlations be-
tween the atomic fluctuations —with respect to a reference conformation. Both PCA
and MDS have been used for this problem: in [78, 59], the average covariance matrix of
the positional fluctuations is resorted to, while [83] computes the average Gram matrix.
See also [89] for a characterization of pairwise atomic correlations based of Pearson’s
coefficients and relatives.
TECHNIQUES TO EXPLORE POINT CLOUDS IN EUCLIDEAN SPACE 19
tion, the system is Hamiltonian, and a large number of mathematical results apply [94].
We shall get back on this issue in the outlook.
TECHNIQUES TO EXPLORE POINT CLOUDS IN EUCLIDEAN SPACE 21
we may partition the parameters as x = (q, s). Denote V (x) the poten-
tial energy of the system. By focusing on q and averaging out the other
parameters, one defines the free energy landscape, which is the kinetically
relevant one, by:
Z
V (q, s)
W (q) = −kT ln exp − ds. (2)
kT
(a) (b)
0
q q0
q q
W (q) W (q)
qA q∗ qB q qA q∗ qB q
Fig. 3. Potential energy landscape, with separatrices of the saddle in red: an order
parameter may not be a good reaction coordinate. Adapted from [65].
with peq (x) the equilibrium probability of state x and p(T P ) the fraction
of time spent on transition paths. Transition states are naturally defined
TECHNIQUES TO EXPLORE POINT CLOUDS IN EUCLIDEAN SPACE 23
(a)
F
P
A B
q∗ q PB
(b) F
B s
q∗ q
PB
Free energy
Reaction coordinate #2
Transition State
F
U
Reaction coordinate #1
Fig. 5. Crossing the Transition State on a rugged energy landscape: the system
moves from one watershed (state U ) to a neighboring watershed (state F ) by crossing
the energy landscape pass. Adapted from [69].
Practically, the difficulty with pf old and related quantities are several
[69]. First, the concept is bound to simple landscapes corresponding to two
states folding processes. Most importantly, estimating pf old requires sam-
pling the TSE, which either requires long simulations –usually out of reach,
or some form of importance sampling to favor the rare events corresponding
to crossings of the TSE.
5.4. Inferring reaction coordinates. In the following, we review
some of the most successful techniques to analyze transitions. We focus on
the methodological aspects, and refer the reader to the original papers for
a discussion of the insights gained, including connexions with experimental
facts. As it can be seen from [69] for example, assessing the relevance of a
particular coordinate can be rather controversial.
5.4.1. Reaction coordinates? In order to prove efficient to investi-
gate folding, funnels such as that of Fig. 2 must be made quantitative, that
is, one needs to specify what the axis account for. The variables parame-
terizing the axis are called reaction coordinates, and a quantitative energy
landscape is displayed on Fig. 5. We now discuss several ways to design
such coordinates.
5.4.2. Contacts based analysis. Following the work of Go [80], a
natural way to tackle Levinthal’s paradox consists of introducing a bias in
the energy function towards native contacts, i.e. contacts observed in the
folded state. More precisely, two residues which are not adjacent along the
primary sequence of the protein form a native contact if they are spatially
close in the protein’s native state. Such pairs of residues are associated
a favorable interaction energy, while the remaining ones are associated a
TECHNIQUES TO EXPLORE POINT CLOUDS IN EUCLIDEAN SPACE 25
0.1 0
0.5 3
Pf old = 1.0
0.8 1
Fig. 6. Folding process down a folding funnel: fraction of native contacts Q in-
creases, free energy F (Q) crosses a barrier, pf old increases. Adapted From [69].
aligned.
TECHNIQUES TO EXPLORE POINT CLOUDS IN EUCLIDEAN SPACE 27
Euclidean distance, from which the k nearest ones according to the lRMSD
are selected.
To finish up this review, one should mention methods which do not
provide a simplified embedding of the landscape, but resort instead to a
clustering of the nodes in parameter space [81, 79]. Nodes within the same
watershed should belong to the same cluster, from which a Configuration
Space Network (CSN) can be built. In some cases, quantitative informa-
tions (e.g. free energies) can even be retrieved.
Remark 5.1. Having discussed dimensionality reduction techniques,
one comment is in order. If one does not know a priory which are the slow
variables, integrating Eq. (2) is not possible. This accounts for a three-
stage strategy which consists of performing a simulation, performing a di-
mensionality reduction to infer candidate reaction coordinates, and probing
them using pf old .
5.4.4. Morse theory related analysis. Energy landscapes govern
the folding process of proteins, but also the behavior of a number of physical
systems such as clusters of atoms, ions or simple molecules [67, 99]. For
some of these systems which exhibit a small number of stable crystalline
geometries and a large number of amorphous forms, exploring the landscape
exhaustively is impossible. Yet, a qualitative analysis can be carried out
by focusing on selected critical points. In [88, 60, 63], sequences of triples
minimum - saddle - minimum are sought, and super-basins are built from
their concatenation. In a related vein, the relative accessibility of potential
energy basins associated to minima is investigated in [61], so as to define the
so-called disconnectivity graph (DG). More precisely, two constructions are
performed in [61]. The first one, based on the canonical mapping, focuses
on the relative height of energy barriers, which governs transitions between
states, thus encoding the kinetic behavior of the system. The second one,
based on the canonical mapping, probes the potential energy surface at pre-
defined values of the energy, thus encoding global topological properties of
the landscape. Mathematically, constructing either DG is tantamount to
tracking the topological changes of the set V −1 (]∞, v]) when increasing v.
As such changes occur at critical values only [91], the graph built when all
critical values are available is called the contour tree15 . In [61], a discrete
set of energies are used to probe the topological changes of the level sets,
though.
If one focuses on the relative accessibility of basins, one problem aris-
ing is that the DG built does not have any privileged embedding —the
vertical axis encodes an energy, but the horizontal one does is meaning-
less. To complement the topological information by a geometric one, the
15 Consider the level sets of a Morse function f , and call a connected component of
a level set f −1 (h) a contour. Further contract every contour to a point. The graph
encoding the merge/split events between these contours is called the Reeb graph, or the
contour tree if the domain is simply connected [76].
28 FRÉDÉRIC CAZALS, FRÉDÉRIC CHAZAL, AND JOACHIM GIESEN
REFERENCES
[1] P.K. Agarwal, S. Har-Peled, and H. Yu. Embeddings of surfaces, curves, and
moving points in euclidean space. In ACM SoCG, 2007.
[2] D. Agrafiotus and H. Xu. A self-organizing principle for learning nonlinear
manifolds. PNAS.
30 FRÉDÉRIC CAZALS, FRÉDÉRIC CHAZAL, AND JOACHIM GIESEN
[26] J. Giesen and U. Wagner. Shape dimansion and intrinsic metric from samples
of manifolds with high co-dimension. In Proc. of the 19th Annual symp.
Computational Geometry, pp. 329–337, 2003.
[27] D. Givon, R. Kupferman, and A. Stuart. Extracting macroscopic dymamics.
Nonlinearity, 17:R55–R127, 2004.
[28] A. Globerson and S. Roweis. Metric learning by collapsing classes. In NIPS,
2005.
[29] Jihun Ham, Daniel D. Lee, Sebastian Mika, and Bernhard Schölkopf. A
kernel view of the dimensionality reduction of manifolds. In ICML ’04: Pro-
ceedings of the twenty-first international conference on Machine learning,
p. 47, New York, NY, USA, 2004. ACM.
[30] Gloria Haro, Gregory Randall, and Guillermo Sapiro. Stratification learn-
ing: Detecting mixed density and dimensionality in high dimensional point
clouds. In B. Schölkopf, J. Platt, and T. Hoffman, editors, Advances in Neu-
ral Information Processing Systems 19, pp. 553–560. MIT Press, Cambridge,
MA, 2007.
[31] T. Hastie and W. Stuetzle. Principal curves. J. Amer. Stat. Assoc., 84:502–
516, 1989.
[32] Matthias Hein and Markus Maier. Manifold denoising. In NIPS, pp. 561–568,
2006.
[33] Matthias Hein and Markus Maier. Manifold denoising. In B. Schölkopf,
J. Platt, and T. Hoffman, editors, Advances in Neural Information Process-
ing Systems 19, pp. 561–568. MIT Press, Cambridge, MA, 2007.
[34] I. Horenko, J. Schmidt-Ehrenberg, and C. Schutte. Set-oriented dimension
reduction: localizing principal component analysis vie hidden markov models.
In LNBS in Bio-informatics. 2006.
[35] B. Kégl. Intrinsic dimension estimation using packing numbers. In Advances
in Neural Information Processing Systems 17. MIT Press, Cambridge, MA,
2002.
[36] R.I. Kondor and J. Lafferty. Diffusion kernels on graphs and other discrete
structures.
[37] S. Lafon and A.B. Lee. Diffusion maps and coarse-graining: A unified frame-
work for dimensionality reduction, graph partitioning and data set parame-
terization. IEEE PAMI, 28(9):1393–1403, 2006.
[38] M.C. Law and A.K. Jain. Incremental nonlinear dimensionality reduction by
manifold learning. IEEE Trans. on pattern analysis and machine intelli-
gence, 28(3), 2006.
[39] J.A. Lee and M. Verleysen. Nonlinear Dimensionality Reduction. Springer,
2007.
[40] Elizaveta Levina and Peter J. Bickel. Maximum likelihood estimation of
intrinsic dimension. In Lawrence K. Saul, Yair Weiss, and Léon Bottou,
editors, Advances in Neural Information Processing Systems 17, pp. 777–
784. MIT Press, Cambridge, MA, 2005.
[41] Nathan Linial, Eran London, and Yuri Rabinovich. The geometry of graphs
and some of its algorithmic applications. In IEEE Symposium on Founda-
tions of Computer Science, pp. 577–591, 1994.
[42] J. Mao and A.K. Jain. Artificial neural networks for feature extraction and
multivariate data projection. IEEE Trans. Neural Networks, 6(2), 1995.
[43] E. Meerbach, E. Dittmer, I. Horenko, and C. Schutte. Multiscale modelling
in molecular dynamics : Biomolecular conformations as metastable states.
Lecture notes in physics, 703, 2006.
[44] F. Memoli and G. Sapiro. Distance functions and geodesics on point clouds,
2005.
[45] S.T. Roweis and L.K. Saul. Non linear dimensionality reduction by locally
linear embedding. Science, 290:2323–2326, 2000.
32 FRÉDÉRIC CAZALS, FRÉDÉRIC CHAZAL, AND JOACHIM GIESEN
[46] S.T. Roweis and L.K. Saul. Think globally, fit locally: Unsupervised learning
of low dimensional manifolds. Journal of Machine Learning Research, 4:119–
155, 2003.
[47] J.B. Tenenbaum and V. de Silva. Sparse multi-dimensional scaling using land-
mark points. In preparation.
[48] J.B. Tenenbaum, V. de Silva, and J.C. Langford. A global geometric frame-
work for nonlinear dimensionality reduction. Science, 290:2319–2323, 2000.
[49] R. Tibshirani. Principal curves revisited. Statistics and Computing, 2:183–190,
1992.
[50] M. Trosset. Applications of multidimensional scaling to molecular conformation.
Computing Science and Statistics, (29):148–152, 1998.
[51] L.J.P. van der Maaten, E.O. Postma, and H.J. van den Herik. Dimension-
ality reduction: a comparative review. 2007.
[52] Kilian Q. Weinberger and Lawrence K. Saul. Unsupervised learning of image
manifolds by semidefinite programming. In CVPR (2), pp. 988–995, 2004.
[53] Kilian Q. Weinberger, Fei Sha, and Lawrence K. Saul. Learning a kernel
matrix for nonlinear dimensionality reduction. In ICML ’04: Proceedings of
the twenty-first international conference on Machine learning, p. 106, New
York, NY, USA, 2004. ACM.
[54] K.Q. Weinberger and L.K. Saul. An introduction to nonlinear dimensionality
reduction by maximum variance unfolding. In AAAI, 2006.
[55] K.Q. Weinberger and L.K. Saul. Unsupervised learning of image manifolds
by semidefinite programming. International Journal of Computer Vision,
70(1):77–90, 2006.
[56] Li Yang. Building connected neighborhood graphs for isometric data embedding.
In KDD, pp. 722–728, 2005.
[57] P. Zhand, Y. Huang, S. Shekhar, and V. Kumar. Correlation analysis of
spatial time series datasets. In Pacific Asia Conf. on Knowledge Discovery
and Data Mining, 2003.
[58] Hao Zhang, Oliver van Kaick, and Ramsay Dyer. Spectral mesh processing.
Computer Graphics Forum (to appear), 2008.
[59] A. Amadei, A.B.M. Linssen, and H.J.C. Berendsen. Essential dynamics of
proteins. Proteins: Structure, Function, and Genetics, 17(4):412–425, 1993.
[60] K.D. Ball, R.S. Berry, R.Kunz, F-Y. Li, A. Proykova, and D.J. Wales.
From topographies to dynamics on multidimensional potential energy sur-
faces of atomic clusters. Science, 271(5251):963 – 966, 1996.
[61] O. Becker and M. Karplus. The topology of multidimensional potential energy
surfaces: Theory and application to peptide structure and kinetics. The
Journal of Chemical Physics, 106(4):1495–1517, 1997.
[62] O.M. Becker. Principal coordinate maps of molecular potential energy surfaces.
J. of Comp. Chem., 19(11):1255–1267, 1998.
[63] R. Stephen Berry, Nuran Elmaci, John P. Rose, and Benjamin Vekhter.
Linking topography of its potential surface with the dynamics of fold-
ing of a proteinmodel. Proceedings of the National Academy of Sciences,
94(18):9520–9524, 1997.
[64] Robert B. Best and Gerhard Hummer. Chemical Theory and Computation
Special Feature: Reaction coordinates and rates from transition paths. Pro-
ceedings of the National Academy of Sciences, 102(19):6732–6737, 2005.
TECHNIQUES TO EXPLORE POINT CLOUDS IN EUCLIDEAN SPACE 33
[65] P.G. Bolhuis, D. Chandler, C. Dellago, and P.L. Geissler. Transition path
sampling: Throwing ropes over rough mountain passes, in the dark. Annual
review of physical chemistry, 53:291–318, 2002.
[66] P.G. Bolhuisdagger, C. Dellago, and D. Chandler. Reaction coordinates of
biomolecular isomerization. PNAS, 97(11):5877–5882, 2000.
[67] C.L. Brooks, J. Onuchic, and D.J. Wales. Statistical thermodynamics: taking
a walk on a landscape. Science, 293(5530):612 – 613, 2001.
[68] L. Chavez, J.N. Onuchic, and C. Clementi. Quantifying the roughness on
the free energy landscape: Entropic bottlenecks and protein folding rates. J.
Am. Chem. Soc., 126(27):8426–8432, 2004.
[69] Samuel S. Cho, Yaakov Levy, and Peter G. Wolynes. P versus Q: Structural
reaction coordinates capture protein folding on smooth landscapes. Proceed-
ings of the National Academy of Sciences, 103(3):586–591, 2006.
[70] P. Das, M. Moll, H. Stamati, L. Kavraki, and C. Clementi. Low-
dimensional, free-energy landscapes of protein-folding reactions by nonlinear
dimensionality reduction. PNAS, 103(26):9885–9890, 2006.
[71] Payel Das, Corey J. Wilson, Giovanni Fossati, Pernilla Wittung-
Stafshede, Kathleen S. Matthews, and Cecilia Clementi. Characteri-
zation of the folding landscape of monomeric lactose repressor: Quantitative
comparison of theory and experiment. Proceedings of the National Academy
of Sciences, 102(41):14569–14574, 2005.
[72] R. Du, V. Pande, A.Y. Grosberg, T. Tanaka, and E.I. Shakhnovich. On the
transition coordinate for protein folding. J. Chem. Phys., 108(1):334–350,
1998.
[73] R.L. Dunbrack. Rotamer libraries in the 21st century. Curr. Opin. Struct. Biol.,
12(4):431–440, 2002.
[74] H.A. Scheraga et al. A united-residue force field for off-lattice protein-structure
simulations. i. functional forms and parameters of long-range side-chain inter-
action potentials from protein crystal data. J. of Computational Chemistry,
18(7):849–873, 1997.
[75] A. Fersht. Structure and Mechanism in Protein Science: A Guide to Enzyme
Catalysis and Protein Folding. 1999.
[76] A.T. Fomenko and T.L. Kunii. Topological Modeling for visualization. Springer,
1997.
[77] D. Frenkel and B. Smit. Understanding molecular simulation. Academic Press,
2002.
[78] A.E. Garcia. Large-amplitude nonlinear motions in proteins. Physical Review
Letters, 68(17):2696–2699, 1992.
[79] D. Gfeller, P. De Los Rios, A. Caflisch, and F. Rao. Complex network
analysis of free-energy landscapes. Proceedings of the National Academy of
Sciences, 104(6):1817–1822, 2007.
[80] Nobuhiro Go and Hiroshi Taketomi. Respective Roles of Short- and Long-
Range Interactions in Protein Folding. Proceedings of the National Academy
of Sciences, 75(2):559–563, 1978.
[81] Isaac A. Hubner, Eric J. Deeds, and Eugene I. Shakhnovich. Understand-
ing ensemble protein folding at atomic detail. Proceedings of the National
Academy of Sciences, 103(47):17747–17752, 2006.
[82] G. Hummer. From transition paths to transition states and rate coefficients. J.
Chemical Physics, 120(2), 2004.
[83] T. Ichiye and M. Karplus. Collective motions in proteins: A covariance analysis
of atomic fluctuations in molecular dynamics and normal mode simulations.
Proteins: Structure, Function, and Genetics, 11(3):205–217, 1991.
[84] C.L Brooks III, M. Gruebele, J. Onuchic, and P. Wolynes. Chemical
physics of protein folding. Proceedings of the National Academy of Sciences,
95(19):11037–11038, 1998.
34 FRÉDÉRIC CAZALS, FRÉDÉRIC CHAZAL, AND JOACHIM GIESEN
[85] S.E. Jackson. How do small single-domain proteins fold? Fold Des., 3(4):R81–
91, 1998.
[86] J. Janin, S. Wodak, M. Levitt, and B. Maigret. Conformations of amino
acid side chains in proteins. J. Mol. Biol., 125:357–386, 1978.
[87] T. Komatsuzaki, K. Hoshino, Y. Matsunaga, G.J. Rylance, R.L. Johnston,
and D. Wales. How many dimensions are required to approximate the po-
tential energy landscape of a model protein? J. Chem. Phys., 122, February
2005.
[88] R.E. Kunz and R.S. Berry. Statistical interpretation of topographies and dy-
namics of multidimensional potentials. J. Chem. Phys., 103:1904–1912, Au-
gust 1995.
[89] O.F. Lange and H Grubmller. Generalized correlation for biomolecular dy-
namics. Proteins, 62:1053–1061, 2006.
[90] C. Levinthal. Are there pathways for protein folding? Journal de Chimie
Physique et de Physico-Chimie Biologique, 65:44–45, 1968.
[91] John W. Milnor. Morse Theory. Princeton University Press, Princeton, NJ,
1963.
[92] E. Paci, M. Vendruscolo, and M. Karplus. Native and non-native interactions
along protein folding and unfolding pathways. Proteins, 47(3):379–392, 2002.
[93] J. Palis and W. de Melo. Geometric Theory of Dynamical Systems. Springer,
1982.
[94] M. Pettini. Geometry and Topology in Hamiltonian Dynamics and Statistical
Mechanics. Springer, 2007.
[95] E. Plaku, H. Stamati, C. Clementi, and L.E. Kavraki. Fast and reliable
analysis of molecular motion using proximity relations and dimensionality
reduction. Proteins: Structure, Function, and Bioinformatics, 67(4):897–
907, 2007.
[96] G. Rylance, R. Johnston, Y. Matsunaga, C-B Li A. Baba, and T. Komat-
suzaki. Topographical complexity of multidimensional energy landscapes.
PNAS, 103(49):18551–18555, 2006.
[97] M. Tirion. Large amplitude elastic motions in proteins from a single-parameter,
atomic analysis. Phys. Rev. Lett., 77:1905–1908, 1996.
[98] Monique M. Tirion. Large amplitude elastic motions in proteins from a single-
parameter, atomic analysis. Phys. Rev. Lett., 77(9):1905–1908, Aug 1996.
[99] D.J. Wales. Energy Landscapes. Cambridge University Press, 2003.
[100] L. Yang, G. Song, and R. Jernigan. Comparison of experimental and com-
puted protein anisotropic temperature factors. In IEEE Bioinformactics and
biomedecine workshop, 2007.
RATIONAL PARAMETRIZATIONS,
INTERSECTION THEORY, AND NEWTON POLYTOPES∗
CARLOS D’ANDREA† AND MARTÍN SOMBRA‡
Key words. Parametric curve, implicit equation, Newton polytope, tropical geom-
etry, Intersection Theory.
3t2
3t
C 99K C , t 7→ , . (1)
1 + t3 1 + t3
I.Z. Emiris et al. (eds.), Nonlinear Computational Geometry, The IMA Volumes in 35
Mathematics and its Applications 151, DOI 10.1007/978-1-4419-0999-2_2,
© Springer Science + Business Media, LLC 2010
36 CARLOS D’ANDREA AND MARTÍN SOMBRA
1,5
1,0
0,5
−2 −1 0 1
0,0
−0,5
−1,0
−1,5
3t 3t2
−2 = , 1=
1 + t3 1 + t3
admits a solution for t ∈ C or not, which is a harder task.
Depending on which kind of operation one needs to perform on a cer-
tain parametric variety, it may be convenient to dispose of the parametric
representation or of the implicit one. Efficiently performing the passage
from one representation to the other is one of the central problems of
Computational Algebraic Geometry. In the present text we will mostly
concentrate in one these directions: the implicitization problem, consisting
in computing equations for an algebraic variety given in parametric form.
INTERSECTION THEORY AND NEWTON POLYTOPES 37
Suppose that the Zariski closure Im(ρ) of the image of this map is a hyper-
surface or equivalently, that the Jacobian matrix ( ∂ρ∂tj (t))i,j has maximal
i
n−1
rank n − 1 for generic t ∈ C . The ideal of this parametric (or unira-
tional) hypersurface is generated by a single irreducible polynomial and the
problem consists in computing this “implicit equation”.
This problem is equivalent to the elimination of the parameter vari-
ables from some system of equations. For instance, to compute the implicit
equation of the folium from the parametrization (1), one should eliminate
the variable t from the system of equations
(1 + t3 ) x − 3t = 0 , (1 + t3 ) y − 3t2 = 0, (3)
This elimination task can be effectively done either with Gröbner bases or
with resultants [3] but in practice, this can be too expensive. For instance,
for a ∈ N consider the parametrization
It is not hard to check by hand that the implicit equation of the image of
this map is
2 − xa−1 y a − xa y a+1 = 0.
60
50
40
30
20
10
10 20 30
The Newton polytopes of the defining polynomials are relatively small: the
real interval [48, 63] and the singleton {32}. The exponents are rather
large, but in any case the implicit equation can be computed via the
Sylvester resultant. It’s Newton polygon is the triangle with vertices
(32, 0), (0, 48), (0, 63).
This example was studied by I. Emiris and I. Kotsireas, who succeeded
in determining the polygon by analysing the behavior of the resultant under
specialization, thus showing that it is sometimes possible to access to the
Newton polytope without computing the implicit equation [8]; see also [7]
for further applications of this method.
The recent irruption of Tropical Geometry in the mathematical
panorama has boosted the interest in the problem. The tropical variety
40 CARLOS D’ANDREA AND MARTÍN SOMBRA
B ⊂ Z2 1)
×2
×2
2) 3)
P(B)
ord0 (ρ) = (−1, −1) , ord1 (ρ) = (−1, 0) , ord∞ (ρ) = (2, −1)
and ordvi (ρ) = (0, 1) for each of the two zeros v1 , v2 of the equation
t2 − 5t + 2 = 0, while ordv (ρ) = (0, 0) for v 6= 0, ±1, ∞, v1 , v2 . Figure 5
illustrates the family B and the associated polygon P(B).
Theorem 2.1 tells us that this polygon is ind(ρ) times the actual New-
ton polygon of the curve. It is easy to check that the constructed polygon
is non contractible, in the sense that it is not a non trivial integer multiple
of another lattice polygon. We conclude that the parametrization is bira-
tional (that is, ind(ρ) = 1) and that N(C) = P(B). These results can be
Another random document with
no related content on Scribd:
PLATE CLXXXVIII.
GLADIOLUS CAMPANULATUS.
Bell-flowered Gladiolus.
SPECIFIC CHARACTER.
ZINNIA VERTICILLATA.
Double Zinnia.
SPECIFIC CHARACTER.
GERANIUM ASTRAGALIFOLIUM.
Astragalus-leaved Geranium.
SPECIFIC CHARACTER.
1. The Empalement.
2. The Chives and Pointal.
3. The Chives spread open and magnified.
4. The Seed-bud, Shaft, and Summits, magnified.
By the Kew Catalogue we are informed, that this species of Geranium,
was introduced to this country, in the year 1788, by Mr. F. Masson. It is, like
many of this branch of the extended family of Geranium, rather a tender
Green-house plant; and will not flower, in perfection, without the assistance
of the Hot-house. It loses its foliage after flowering, and remains in a state of
inaction for at least three months; during which period, it should be watered
but seldom, and that sparingly. To propagate it, the only mode is, by cutting
small portions of the root off, and putting them into the strong heat of a hot-
bed, about the month of March; as hitherto, it has not perfected any seeds
with us, and the plant produces no branch, except the flower-stem may be so
denominated. Our drawing was made from the Clapham Collection, in July,
this year. This species has been considered by Professor Martyn, (see his
edition of Mill. Dict. article Pelargonium 2.) as the same with G. pinnatum,
and G. prolificum of Linn. Sp. Plan. But, however, the specific characters in
Linnæus, of those species, may agree with our figure, the G. Astragalifolium
of Jacquin and Cavanilles, they are, unquestionably, all different plants;
drawings of the two former we have, and will be given in due course.
PLATE CXCI.
PLATYLOBIUM SCOLOPENDRUM.
GENERIC CHARACTER.
SPECIFIC CHARACTER.
ANTHOLYZA FULGENS.
Refulgent-flowered Antholyza.
SPECIFIC CHARACTER.
GERANIUM LINEARE.
Linear-petalled Geranium.
SPECIFIC CHARACTER.
HEMEROCALLIS ALBA.
White Day-Lily.
SPECIFIC CHARACTER.