PHD Digioia
PHD Digioia
Arturo di Gioia
2005
UNIVERSITA’ DEGLI STUDI DI TRENTO
Dottorato di ricerca in Modellazione, Conservazione e Controllo dei Materiali e
delle Strutture
XVII Ciclo
Commissione esaminatrice:
prof. Luigi Mongiovı̀, Università degli Studi di Trento
prof. Ján Sládek, Slovak Academy of Sciences
prof. Aurelio Muttoni, Ecole Politechnique Federale de Lausanne
Summary - Sommario
3
With reference to the Stokes flow problem, a thorough analysis of the ill-
conditioning issues inherent in the existing formulations applied to 3D problems
with complex geometries is carried out. The Mixed Velocity-Traction equation
formulation, first introduced by Frangi and Tausch [11], is finally described.
This formulation does not suffer from the ill-conditioning issues of the classical
formulation, allowing to analyze complete MEMS devices in reasonable times
and with standard computational resources.
... to Emanuela.
Acknowledgments
First of all, I would like to thank my advisors, for their helpful and patient
guidance. I would also like to thank the group of Solid and Structural Mechanics,
prof. Davide Bigoni, prof. Antonio Cazzani, prof. Marco Rovati, dr. Roberta
Springhetti, dr. Massimiliano Gei, together with prof. Walter Drugan and
with my colleagues and friends Michele Brun, Massimiliano Margonari, Andrea
Piccolroaz, Katia Bertoldi, Giulia Franceschini, Daniele Veber, Sergia Colli,
Lorenzo Magnarelli and Paola Bonetti, for the friendly and at the same time
stimulating environment they provided.
My thank goes also to all the people who indirectly contributed to this work
providing me helpful tips, dr. Claudio Fontanari, dr. Julien Langou, prof. Keith
Davey, prof. Marcus Grote and prof. Naoshi Nishimura.
My gratitude goes to my friends Claudio, Stefano, Rosanna, Andrea, Ari-
anna, Alessandro, Francesca, Barbara, Francesco, Alessandro ”the king”, Mi-
haela and Gigi ”the king of tarots”, who survived my horribly cooked Saturday
dinners, and to Gerolamo, Alessandro ”twenty minutes”, Mauro and Gianluca,
who taught me that when all you have is a snowboard, every obstacle starts
resembling a jump.
Finally, I would like to thank my family for their love and support, and
Emanuela, for her endless love and patience.
9
Contents
1 Introduction 1
1.1 The Boundary Element Method . . . . . . . . . . . . . . . . . . . 1
1.2 The MEMS devices . . . . . . . . . . . . . . . . . . . . . . . . . . 2
ii
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
iii
CONTENTS
iv
Chapter 1
Introduction
The present work deals with some recent advancements in Boundary Element
techniques for large-scale problems, with particular reference to Fast Multipole
accelerated formulations for the analysis of Micro-Electro-Mechanical structures
(MEMS). This thesis also deals with a mixed boundary-domain formulation for
evolutive problems.
the Boundary Element Method quickly became a major issue, expecially when
dealing with three-dimensional problems with complex geometries. The para-
dox was clear. The Boundary Element Method was relegated to a niche position
in the numerical methods world, however in that niche it had the possibility to
easily outperform its competitors in terms of accuracy and ease of use. Unfor-
tunately the exploitation of this potential advantage was seriously compromised
by its lack of scalability.
During the last ten years many efforts have been directed towards the devel-
opment of several numerical techniques able to grant a proper scalability to the
Boundary Element Method, among which is the Fast Multipole Method (FMM-
BEM), extensively used in the present work. These techniques allow to fully
exploit the power of boundary formulations, allowing them to gain a widespread
acceptance as a specialized tool for the solution of several classes of problems.
2
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
with the large-scale 3D external electromagnetic and fluid flow problems. Due
to the complex geometry of MEMS devices and the presence of movable parts,
the usage of domain based techniques is severely limited by the excessive com-
putational requirements, since the problems size can quickly grow up to tens of
hundreds of millions of domain DOFs. Equivalent problems, when tackled by
boundary integral equation approaches, can be discretized using a much smaller
number of DOFs and are therefore solvable on average workstations by means
of the FMM-BEM. Moreover, the considerably smaller (if not null) remeshing
requirement upon any rigid or quasi-rigid boundary motion is another major
advantage of the Boundary Element Method over the Finite Element Method
for that class of problems.
Figure 1.1: Large Force Electrostatic MEMS Actuator Segment, Photo Courtesy
of MEMX inc. (www.memx.com)
3
Introduction
4
Chapter 2
ρ (x)
∇ · E (x) = (2.1)
0
where ρ is the charge density and 0 is the permittivity.
Since E is curl-free it can be expressed as the gradient of a an electrostatic
potential V
∆V (x) = 0 (2.4)
With reference to Figure 2.1 a general internal boundary value problem can
be defined by the Laplace equation (2.4) imposed on the closed domain Ω, with
boundary conditions
over ΓV and
over ΓE .
E E
V
E
6
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
E
E
11
V
E
described in Figure 2.2, with the only modification in the direction of the normal
vector to the boundary surface Γ.
In MEMS analysis the electrostatic forces acting on the fixed and movable
parts for an assigned set of voltages are fundamental design parameters. The
model problem is that of m perfect conductors immersed in a dielectric medium
of dielectric constant = r 0 with assigned constant potential Vα over each
cosed surface Γα , α = 1, . . . , m. The resulting external problem with Dirichlet
boundary conditions is depicted in Figure 2.3. Being t (x) the electrostatic
traction acting at each boundary point
En2 (x)
t (x) = n (x) (2.7)
2
the total force
Z
Fα = t (x) dS (x) (2.8)
Γα
7
The Laplace equation
are sought.
m
1
nm
n1 Vm
V1
n2 ,
V2
2
8
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where
1 r · ny
K (x, y) = ∇y G (x, y) · n (y) = − (2.12)
4π r3
and
0 x 6∈ Ω
c = 1 x∈Ω (2.13)
1
2 x∈Γ
is the free-term coefficient resulting from the Cauchy Principal Value integral
involving the strongly singular kernel K.
Equation (2.11) simplifies if pure Dirichlet or Neumann boundary conditions
are applied.
In case of Dirichlet boundary conditions
V (x)|Γ = V (x) (2.14)
the resulting equation is a Fredholm integral equation of the first kind
Z
G (x, y) En (y) dS (y) = g (x) (2.15)
Γ
with r.h.s.
Z
g (x) = cV (x) + K (x, y) V (y) dS (y) (2.16)
Γ
with r.h.s.
Z
g (x) = G (x, y) E n (y) dS (y) (2.19)
Γ
9
The Laplace equation
if x ∈ ΓE , where
Z Z
g (x) = G (x, y) E n dS (y) − K (x, y) V (y) dS (y) (2.23)
ΓE ΓV
The existence and uniqueness of the solution of (2.15) and (2.21) are always
guaranteed, while (2.18) is a singular operator, and in order for the r.h.s. to
belong to its range, the solvability condition
Z
E n (x) dS (x) = 0 (2.24)
Γ
must be satisfied. In that case, the double layer operator admits infinite solu-
tions, which differ by a set of constant potentials.
V (x)|Γα = V α (2.25)
10
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
and
Z
(1 − c) V α β=α
K (x, y) dS (y) = (2.27)
Γβ 0 β 6= α
x∈Γα
allows to directly evaluate the electric flux as a function of the applied voltages.
and
p
(En )
X
En (x; En,1 , . . . , En,p ) = Ni (x) En,i (2.31)
i=1
11
The Laplace equation
where
Z
(En )
[G]ij = G (xi , y) Nj (y) dS (y) (2.33)
Γ
and
Z
(V )
[K]ij = K (xi , y) Nj (y) dS (y) . (2.34)
Γ
The structure of the system matrix depends upon the applied boundary
conditions. The discretization of (2.15) leads to a linear system of the form
The evaluation of the matrix coefficients (2.33) and (2.34) is a widely in-
vestigated topic, and various numerical and analytical techniques are available,
which deal with the singularity of the involved kernels. When the source point is
far enough from the integration region, numerical quadrature rules can be safely
employed. In the present work, the singular and quasi-singular integrations are
evaluated by means of the analytical integrations described in Milroy et. al.
[26] which, while being slower than semianalytical techniques or purely numer-
ical techniques based on element subdivision and coordinate transformations,
provide higher accuracy.
The described Collocational formulations are characterized by system ma-
trices which are unsymmetric and fully populated. Advanced formulations such
12
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
Ax = b (2.37)
13
The Laplace equation
in the form
xk = x0 + zk = x0 + Vk y (2.38)
r0 = b − Ax0
(2.39)
Kk ≡ span r0 , Ar0 , A2 r0 , . . . , Ak−1 r0
14
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
and, since Vk+1 is a matrix with orthonormal columns, the minimal residual
norm is obtained as the solution of the (k + 1) × k linear least-squares problem
which can be solved using the QR factorization of the matrix [H k , βe1 ]. The
solution of (2.43) can be written in the form
0 = q (A) = α0 I + α1 A + . . . + αn An (2.46)
15
The Laplace equation
Ax = b (2.48)
or by right preconditioning
provided that the resulting system matrices of the preconditioned systems have
better properties than that of the original one. A trivial observation is that
the best system matrix is the identity matrix. In that case, iterative solvers
converge in just one iteration. Therefore, the best preconditioner is the inverse
of the system matrix. Two preconditioners can be used in sequence, i.e. a right
preconditioner is applied, and then the resulting matrix is left preconditioned.
The goal of any preconditioner is to approximate the inverse of the system
matrix at its best, while achieving a positive balance between the time spent
during its setup and the time saved due to the improved convergence of the
iterative solver.
The right choice of preconditioner strongly depends on the nature of the
underlying system. The simplest preconditioner which should always be used
for BEM matrices is the right Jacobi preconditioner
1 1 1
M2−1 = diag , ,..., (2.51)
a11 a22 ann
16
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
which has the effect to scale the unknowns vector, also scaling the correspond-
ing columns in the system matrix. This is expecially useful for problems with
mixed boundary conditions, where the unknows vector contains quantities of dif-
ferent nature (for example, potentials and fluxes) and thereby the corresponding
columns contain coefficients which can differ by orders of magitude.
The block Jacobi preconditioner is an extension of the diagonal precondi-
tioner, based on the inverses of block diagonal submatrices. The leaf precon-
ditioner which will be used in the last Chapter is a variant of this class of
preconditioners. Since it depends on the matrix partitioning introduced by the
Fast Multipole method, it will be described later in this work.
Classical preconditioners are often tuned for sparse matrices, and are there-
fore useless, or computationally inefficient, for BEM matrices.
According to some recent investigations (Chen [6], Giraud et al. [13]) the
SParse Approximate Inverse (SPAI) preconditioner (Benzi et al. [4], Grote et al.
[16]) appears to be a viable choice for the solution of dense, diagonally dominant
problems.
The SPAI preconditioner has been developed for sparse matrices. It consists
in the construction of a sparse matrix M −1 such that ||M −1 A − I|| is small in
some norm (usually the Fredholm norm is adopted). For an assigned sparsity
pattern, the determination of the values for the coefficients of the sparse approx-
imate inverse M −1 reduces to the solution of n least-squares problems whose
size is at most equal to m, where n is the problem size and m is the number of
nonzero coefficients per row in the sparse matrix A.
The SPAI preconditioner can be adapted to a dense system by simply ex-
tracting the most significant values from the original matrix A, therefore giving
rise to a sparse approximation à which is used to calculate the preconditioner.
The efficiency and consequently the setup cost of the SPAI preconditioner de-
pend on the parameters which control both the extraction of the sparse approxi-
mation from the original matrix A, and the sparse inverse from the approximate
matrix à (see e.g. Margonari [24]).
The described preconditioners are based on the algebraic structure of the sys-
tem matrix. Other classes of preconditioners specifically exploit the analytical
properties of the starting integral equations (Steinbach et al. [35], Christiansen
et al. [8]).
Another possibility is given by the adoption of an iterative solver which
allows a variable preconditioner, such as the FGMRES (Saad [33]) or the GM-
RESR (van der Vorst [39]). The advantage given by these solvers is that the
computation of an approximate inverse M −1 can be avoided. Instead, the vec-
tor z which should be obtained by the application of the preconditioner to a
17
The Laplace equation
known vector y
z = M −1 y (2.52)
Mz = y (2.53)
18
Chapter 3
3.1 Introduction
The matrix-vector multiplication z = Aw in a generic BEM problem corresponds
to the evaluation of the left-hand-side of eqn. (2.29) for a set of source points
xi , i = 1, . . . , n
Z
Ii = G (xi , y) En (y; En,1 , . . . , En,p ) dS (y) +
ΓV
Z (3.1)
− K (xi , y) V (y; V1 , . . . , Vq ) dS (y)
Γ En
where the approximated functions V and En depend on the nodal values con-
tained in the vector w. The domain of integration is given by the union of a
Introduction to the Fast Multipole Method
Z
I (x) = K (x − y) φ (y) dS (y) , (3.2)
Γ
20
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
the first step consists of performing a kernel expansion with respect to a pole
y0 which separates the x-dependence from the y-dependence
X
K (x, y) = kn(1) (x − y0 ) kn(2) (y − y0 ) . (3.3)
n
(1) (2)
The choice for the functions kn and kn is not unique. In order to expand
the kernel as in eqn. (3.3), the Fast Multipole Method uses harmonic expansions,
which are solutions of the Laplace equations, while, for example, the Panel
Clustering method (Hackbush et. al. [17]) uses Taylor expansions.
Usually the expansion (3.3) holds if
|x − y0 | > |y − y0 | (3.4)
(2)
The function kn satisfies a relationship of the form
X
kn(2) (y − y1 ) = km(2)
(y − y0 ) cR
n,m (y1 , y0 ) (3.5)
m
which allows a pole shift. For example, if eqn. (3.3) is obtained through a Taylor
expansion, then eqn. (3.5) reduces to the binomial expansion of a polynomial.
(1)
Moreover the function kn can be further expressed in the form
X
kn(1) (x − y0 ) = (3)
km (x − x0 ) cSm,n (x0 , y0 ) (3.6)
m
(3)
where cSm,n (x0 , y0 ) are constants and km are usually entire functions. Just like
for the expansion (3.3), also (3.6) generally holds if
21
Introduction to the Fast Multipole Method
where
Z
Mn (y0 ) = kn(2) (y − y0 ) φ (y) dS (y) (3.11)
Γ
where
X
Lm (x0 ) = cSm,n (x0 , y0 ) Mn (y0 ) (3.13)
n
can be derived.
With reference to Figure 3.1, suppose we have to evaluate the set of integrals
m Z
X
I (xi ) = K (xi , y) φ (y) dS (y) i = 1, . . . , n (3.16)
j=1 Γj
22
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
Direct integrations
xi j
y0
L2I
M2L 2M
x0
- The value of the integrals is recovered by using the L2I relationship (3.12)
(O(n) operations).
The total computational cost of the operation is thus of order O(m + n). In
order to be numerically evaluated, the required multipole and local expansions
must be truncated at a certain order p. The quality of approximation is directly
related to the order of expansion.
23
Introduction to the Fast Multipole Method
- Two lists of points are prepared. The first one contains a representative
point for each integration panel (for example, the centroid of the panel).
24
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
The second one contains all the source points. This differentiation could
appear cumbersome at first sight, since usually both source and field points
belong to the same region. However, for example, a noteworthy situation
in which the source points can be located in a distinct region from the in-
tegration domain occurs for the evaluation of internal quantities by means
of integral equations (Yoshida et al. [42]);
- The root cell containing all the points belonging to the two lists is defined;
- A given cell of level l contains a certain number of source points, and
another number of field points. If the number of field points is greater than
a certain threshold M , then the cell is a branch cell, so it is subdivided
into eight parts and the children cells are analysed. If it contains less than
M field points, than it is a field leaf. Any field leaf containing at least
one source point is also a source leaf. If the cell contains no field points,
but it contains more than one source point than it is a source-only cell.
A source-only cell is a source leaf if its level is equal to the field depth
of the oct-tree, otherwise it is a source branch and it must be recursively
subdivided in order to obtain a set of source leaves at the oct-tree depth.
The described strategy results in a structure in which any source point be-
longs to a source leaf, and any field panel belongs to a field leaf (some leaves
can be at the same time source leaves and field leaves). Moreover the described
structure is adaptive, i.e. the local depth (level of the field cells in a certain sub-
region) of the oct-tree is proportional to the mesh density. A sample quad-tree
structure (the 2D counterpart of the oct-tree) with depth 3 is shown in Figure
3.2. The naming convention for the cells is level-row-column. For example, 3B4
is the name of the cell at row B and column 4 at level 3. Figure 3.3 shows an
oct-tree built on top of a surface mesh of the Stanford bunny (from the Stanford
University Computer Graphics Laboratory).
During the construction of the oct-tree the field depth is a required parameter
in order to properly obtain the source leaves. However, this value is known only
when the field part of the oct-tree is completely determined. From the practical
point of view, the construction algorithm must be split in two parts. A first
recursive scheme evaluates the field depth without considering any source point
(fake oct-tree), while successively the real oct-tree is generated.
Once the oct-tree is built, it must be set up to be useful for the Fast Multipole
algorithm. That is, for each field cell a reference point for the evaluation of
multipole moments must be chosen. Similarly, a reference point for the local
expansions must be provided for each source cell. Usually, the center of the
25
Introduction to the Fast Multipole Method
Branch cell
Leaf cell
Level 0
1 2
Level 1
1 2 3 4
A
C
Level 2
1 2 3 4 5 6 7 8
A
B
C
D
Level 3
E
F
G
H
Leaves
26
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
27
Introduction to the Fast Multipole Method
cell is chosen for this task. However, a better choice is given respectively by
the centroids of the field points and of the source points belonging to the cell.
In order to determine if a field cell lies in the far-field region of a source cell,
the conditions (3.4) and (3.7) must be checked in the worst case. To do so, it
is necessary to know the field radius of each field cell, which is the maximum
distance between any point of any integration panel belonging to the cell and
the reference point for the multipole moments. Similarly, the source radius of
each source cell is the maximum distance between any source point belonging
to the cell and the reference point for the local expansion.
The far-field condition between a source cell i and a field cell j is thus
(j)
d > rs(i) + rf (3.17)
where d is the distance between the reference point for the local expansion in
(i)
cell i and the reference point for multipole moments in cell j, rs is the source
(j)
radius in cell i and rf is the field radius in cell j.
28
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
These conditions are mutually exclusive, and simply state that for a source
cell at level l it is desirable to have in list 2 wherever possible a field
cell at level l − 1 instead of the set of its children at level l. This is a
modification to the classical definition of list 2 which will be useful in
order to reduce the number of required M 2L operations during the Fast
Multipole multiplication.
It is worth emphasizing some properties of the defined lists. List 1 and List
2 are typical of non adaptive oct-tree structures (all leaves are at the same level
since the algorithm is stopped at the depth of the bigger leaf), while List 3 and
List 4 can be non-empty only when the adaptive algorithm described in the
previous section is employed.
List 1, List 3 and List 4 are defined only on source leaves. Moreover, List 1
and List 4 contain only field leaves. List 2 is defined on both source leaves and
source branches.
In order to simplify the description of the Fast Multipole algorithm it is
convenient to define two lists which are functions of the four lists just introduced.
The interaction list of a source cell, be it a branch or a leaf cell, is given by the
union of List 2 and the list of field cells in List 1, List 3 and List 4 which satisfy
the far-field condition. The interaction list reduces to List 2 alone for source
branch cells. The near-field list of a source leaf cell is given by the list of field
cells in List 1, List 3 and List 4 which do not satisfy the far-field condition.
The definition of these lists allows, for each source leaf cell, to define a com-
plete covering of the integration domain by means of field cells. This covering
formally defines a near-field part, given by the near-field list of the source cell
itself, and a far-field part, given by the union of the interaction lists of the cell
and of all the branch cells belonging to its family, as shown in Figure 3.4 for cell
3G3 of the sample oct-tree of Figure 3.2.
29
Introduction to the Fast Multipole Method
Z
Ii = G (xi , y) En (y; En,1 , . . . , En,p ) − K (x, y) V (y; V1 , . . . , Vq ) dS (y)
Γ
(3.18)
∞ n −→ −→
1 1 X X −→ −→
G (x, y) = = Sn,m Ox Rn,m Oy Oy < Ox (3.19)
4πr 4π n=0 m=−n
where Rn,m and Sn,m are defined in terms of the associated Legendre functions
30
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
∂
K (x, y) = G (x, y) ni (y) =
∂yi
∞ n −→ −→
1 X X ∂ −→ −→
= Sn,m Ox ni (y) Rn,m Oy Oy < Ox.
4π n=0 m=−n ∂yi
(3.21)
The integral over the surface Γk belonging to the kth field leaf can the be
approximated by eqn. (3.10) which reduces to
∞ X
n −−→
(Γk )
X
Ii = Sn,m Oxi Mn,m (O) (3.22)
n=0 m=−n
31
Introduction to the Fast Multipole Method
2. Starting from the field leaves the multipole moments in a generic field
branch are evaluated by collecting the contributions of its children through
the M 2M formula (3.26), where the reference point shift is from the cen-
troid of fields in the child to the centroid of fields in the considered cell.
This step is repeated up to the cells at level 1;
4. The L2I formula (3.24) is applied to each source point, using the local
expansion of the corresponding source leaf. This step allows to evaluate
the far-field of each integral.
32
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
The first two steps (multipole moment collection in field leaves and multipole
moment evaluation in field branches) are referred to as upward pass, since the
field part of the oct-tree is spanned from leaves to root. The last three steps
(evaluation of local expansions, evaluation of far-field integral and evaluation of
near-field intagral) is instead called downward pass, since the source part of the
oct-tree is spanned from root to leaves.
33
Introduction to the Fast Multipole Method
which does not increase with the mesh size, the near-field matrix size is of order
O(n×m). For this reason the computational cost of the near-field matrix-vector
multiplication is linearly dependent on the problem size.
For these reasons the total computational complexity of the adptive Fast
Multipole technique described above is of order O(n logα n), which from the
practical point of view is an almost linear behaviour.
34
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
contain the free-terms), can be obtained from the remaining coefficients. This is
obviously not possible when the Fast Multipole Method is employed, therefore
the required solid angles must be explicitly evaluated (see e.g. Mantic [23]).
35
Introduction to the Fast Multipole Method
Spheres DOF Tmv (s) Rel Tmv nmv Rel Tmv · nmv
2 11768 1.67 1.00 7 1.00
4 23552 3.36 2.01 9 2.58
8 47098 6.92 4.14 11 6.51
16 94206 14.00 8.38 14 16.76
32 188430 28.10 16.57 17 40.24
64 376838 58.31 34.91 20 99.74
Spheres Oct-tree cells Near-field matrix size (MB) Tot (s) Tnf (s)
2 1045 47.7 1.47 5.98
4 2095 107.7 5.75 15.21
8 4187 209.5 26.93 45.65
16 8265 398.2 136.53 189.12
32 16340 981.9 531.20 713.21
64 33098 1791.1 2133.01 2633.93
36
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
ment solutions obtained with Femlab, v3.1, using 10-nodes tetrahedral elements.
The advantage of the Boundary Element method during the modelling stage
is clear, since the problems solved differ only by a rigid translation of the sur-
face mesh corresponding to the central plate. Moreover, the presence of sharp
edges requires a rather fine boundary discretization to properly approximate
the normal flux, which is singular. The corresponding Finite Element meshes
can quicky become huge, with the consequent resource requirements.
The geometry of the problem is summarized in Table 3.3, together with some
details of the FEM and BEM meshes solved.
A fundamental parameter in the comparison between the FEM and the BEM
discretizations is the number of boundary elements obtained by the restriction
to the boundary of the FEM mesh. As a matter of fact, the finest of the FEM
meshes used defines less than 20000 triangles on the boundary, although with
a higher polynomial degree than the corresponding piecewise constant elements
used with the BE approach.
The x and z components of the total force acting on the central plate are
plotted as functions of the plate’s offset in Figure 3.6. The first graph clearly
shows that the correct evaluation of the x component of the force requires a very
detailed discretization over the short edge of the central plate. The z component
is less demanding in terms of DOFs, although even the finest FEM mesh solved
leads to a visible error. This benchmark confirms the better accuracy delivered
by the BEM over the FEM for comparable surface meshes.
More carefully designed FEM meshes could lead to better results for the
same number of DOFs, but on the other hand they would require a significant
amount of modelling work, expecially for complex geometries like the ones of
37
Introduction to the Fast Multipole Method
Total force, x
8
4
Fz
2
BEM 1
1 BEM 2
BEM 3
FEM 1
0 FEM 2
FEM 3
FEM 4
-1
-1 -0.8 -0.6 -0.4 -0.2 0
Central plate offset
Total force, z
160
140
120
100
80
Fz
60
BEM 1
40 BEM 2
BEM 3
FEM 1
20 FEM 2
FEM 3
FEM 4
0
-1 -0.8 -0.6 -0.4 -0.2 0
Central plate offset
38
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
Gemetry
Plate size 1 × 0.1 × 0.1
Gap between plates 0.1
External enclosure size 4×2×2
Meshes, DOF
BEM 1 1802
BEM 2 25756
BEM 3 244467
FEM 1 22731
FEM 2 89441
FEM 3 287266
FEM 4 914308
MEMS. Moreover, the required mesh sizes could quickly grow to the order of
hundreds of millions of unknowns. The corresponding FMM-BEM meshes would
be significantly smaller and well within the reach of a typical workstation.
39
Introduction to the Fast Multipole Method
Dimensions (µm)
Finger gap 2.88
Finger length 39.96
Finger overlap 19.44
Tether length 151
Tether width 1.1
Thickness 1.96
Substrate gap 2
converged, the comparison with a reference mesh which has been further refined
at the finger tip, for a total of 313064 DOFs, is reported in the first graph.
The complete structure, characterized by 15 fingers for each side of the rotor,
is then solved. One of the two stators has an assigned potential Vs1 = 1, while
the rotor and the second stator are grounded. The induced electrostatic field is
thus concentrated along the fingers of the charged stator and the corresponding
fingers of the rotor, therefore the meshes are refined only in that region. Five
meshes have been used, for a total of 9602, 29802, 97366, 376748 and 795532
degrees of freedom, respectively.
The x and y components of the total force acting on the rotor are reported
in Figure 3.10 as functions of the y displacement of the rotor from the central
position, together with the force acting on 15 separate fingers.
This example confirms the need of very refined meshes in order to properly
approximate the singularities of the electrostatic field.
40
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
41
Introduction to the Fast Multipole Method
5e-11
4e-11
Force
3e-11
2e-11
Mesh 1
1e-11 Mesh 2
Mesh 3
Mesh 4
Reference mesh
0
-0.3 -0.2 -0.1 0 0.1 0.2 0.3
Displacement
6e-09
4e-09
2e-09
Force
-2e-09
-4e-09
Mesh 1
-6e-09 Mesh 2
Mesh 3
Mesh 4
-8e-09
-0.3 -0.2 -0.1 0 0.1 0.2 0.3
Displacement
42
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
-3e-10
-4e-10
Force
-5e-10
-6e-10
-7e-10
-8e-10
-9e-10
-1e-09
-0.3 -0.2 -0.1 0 0.1 0.2 0.3
Displacement
8e-08
6e-08
4e-08
2e-08
Force
-2e-08
-4e-08
-6e-08 Mesh 1
Mesh 3
-8e-08 Mesh 5
Mesh 3 - 15 separate fingers
Mesh 4 - 15 separate fingers
-1e-07
-0.3 -0.2 -0.1 0 0.1 0.2 0.3
Displacement
43
Introduction to the Fast Multipole Method
44
Chapter 4
A FMM accelerated
Boundary Element
technique for scalar
evolutive problems in 3D
This chapter presents a Boundary Element procedure for the solution of evo-
lutive problems based on the use of the time-independent kernels of Laplace
equation. The resulting time-dependent mixed boundary-domain integral equa-
tions are solved using standard finite difference time-marching schemes, and a
Fast Multipole accelerated solver.
4.1 Introduction
The classical Boundary Element Method, both in the collocation and the varia-
tional versions, has essentially failed to live up to the initial high expectations,
especially in dynamics, for various reasons some of which are briefly summarized
in what follows. (i) The BEM produces fully populated matrices which make
the application of direct solvers unrealistic for large scale problems; moreover
the generation of such matrices is numerically costly and hence iterative solvers
do not bring much benefit to the method as is. (ii) The BEM applied to dy-
namic analyses with the appropriate time-dependent kernels yields a series of
A FMM accelerated Boundary Element technique for scalar evolutive
problems in 3D
still unresolved issues related to instabilities (see e.g. Frangi et al. [10] and
Peirce et al. [29]) and lengthy convolutions.
This has motivated numerous investigations on boundary-domain methods
(BDM) employing static kernels. In particular, a number of techniques, such
as the dual-reciprocity method (DRM) and the multiple-reciprocity method
(MRM) have been developed aiming at eliminating the need to compute costly
domain integrals by transforming them to equivalent boundary integrals (see
e.g. the review in Ingber et al. [18]).
With respect to point (i), the adoption of Fast Multipole accelerated iterative
solvers seem to be changing the situation drastically. Indeed they allow to reduce
the operation count per iteration to approximately O(n) (to be compared with
O(n2 ) of the classical approach), where n is the problem size. The FMM can
be considered as an efficient tool for evaluating the contributions to the integral
equations relevant to regions far away from the collocation point, the near field
contributions being evaluated by means of classical BEM tools.
Moreover, a recent contribution (Ingber et al. [18]) comparing different
approaches for the evaluation of domain integrals, has pointed out the superior
performance of truly BDM approaches over DRM and MRM, both in terms of
accuracy and computing time.
In what follows both the scalar parabolic equation and the scalar wave equa-
tion are treated by applying the BDM-FMM to evaluate directly and efficiently
the relevant integrals.
46
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
47
A FMM accelerated Boundary Element technique for scalar evolutive
problems in 3D
for x ∈ Γu ,
Z Z Z
1
− Gu̇Ω,0 dV + KuΓ,0 dS − GpΓ,0 dS − cuΓ,0 =
Ω κ Γp Γu
Z Z (4.10)
= + Gp̄B,0 dS − K ūB,0 dS
Γp Γu
for x ∈ Γp , and
Z Z Z
1
− Gu̇Ω,0 dV + KuΓ,0 dS − GpΓ,0 dS =
Ω κ Γp Γu
Z Z (4.11)
= cūΩ,0 + Gp̄B,0 dS − K ūB,0 dS
Γp Γu
for x ∈ Ω \ Γ.
Once the initial quantities are known, it is possible to start the time-marching
scheme. For a given time knot tk+1 eqn. (4.7) writes
Z Z Z
1
− Gu̇Ω,k+1 dV + KuΓ,k+1 dS − GpΓ,k+1 dS =
Ω κ Γp Γu
Z Z (4.12)
= cūΓ,k+1 + Gp̄B,k+1 dS − K ūB,k+1 dS
Γp Γu
48
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
for x ∈ Γu ,
Z Z Z
1
− Gu̇Ω,k+1 dV + KuΓ,k+1 dS − GpΓ,k+1 dS − cuΓ,k+1 =
Ω κ Γp Γu
Z Z (4.13)
= + Gp̄B,k+1 dS − K ūB,k+1 dS
Γp Γu
for x ∈ Γp , and
Z Z Z
1
− Gu̇Ω,k+1 dV + KuΓ,k+1 dS − GpΓ,k+1 dS − cuΩ,k+1 =
Ω κ Γp Γu
Z Z (4.14)
= Gp̄B,k+1 dS − K ūB,k+1 dS
Γp Γu
for x ∈ Ω \ Γ.
Since at the solution of step k + 1 the quantities at kth step are known, (4.8)
can be rewritten in the form
Z tk+1
u (x, tk+1 ) = u̇ (x, τ ) dτ + uk (x) (4.15)
tk
49
A FMM accelerated Boundary Element technique for scalar evolutive
problems in 3D
∂u
u (x, t)|Γu = ū (t) p (x, t)|Γp = (x, t) = p̄ (t) ∀t > 0 (4.19)
∂n Γp
Paralleling the steps taken in the parabolic case, the second Green identity
with the static fundamental solution is applied to eqn. (4.17), leading to the
integral equation
Z Z
1
− 2 G (x, y) ü (y, t) dV (y) − K (x, y) u (y, t) dS (y) +
cs Ω Γ
Z (4.20)
+ G (x, y) p (y, t) dS (y) = cu (x, t) .
Γ
Even in this case the free term must be expressed in terms of ü and of the
initial conditions using the relations
Z t
u̇ (x, t) = ü (x, τ ) dτ + u̇0 (x)
0
Z t
u (x, t) = u̇ (x, τ ) dτ + u0 (x) =
0 (4.21)
Z tZ τ
= ü (x, τ̃ ) dτ̃ dτ +
0 0
+ tu̇0 (x) + u0 (x)
50
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
the unassigned quantities at the starting time, and then another set of equations
embedding a suitable time marching scheme.
The equation at step 0 writes
Z Z Z
1
− 2
Gü Ω,0 dV + Ku Γ,0 dS − GpΓ,0 dS =
Ω cs Γp Γu
Z Z (4.22)
= cūΓ,0 + Gp̄Γ,0 dS − K ūΓ,0 dS
Γp Γu
for x ∈ Γu ,
Z Z Z
1
− 2
Gü Ω,0 dV + Ku Γ,0 dS − GpΓ,0 dS − cuΓ,0 =
Ω cs Γp Γu
Z Z (4.23)
= + Gp̄Γ,0 dS − K ūΓ,0 dS
Γp Γu
for x ∈ Γp , and
Z Z Z
1
− 2
Gü Ω,0 dV + Ku Γ,0 dS − GpΓ,0 dS =
Ω cs Γp Γu
Z Z (4.24)
= cūΩ,0 + Gp̄Γ,0 dS − K ūΓ,0 dS
Γp Γu
for x ∈ Ω \ Γ.
At time knot tk+1 eqn. (4.20) writes
Z Z Z
1
− 2
Gü Ω,k+1 dV + Ku Γ,k+1 dS − GpΓ,k+1 dS =
Ω cs Γp Γu
Z Z (4.25)
= cūΓ,k+1 + Gp̄Γ,k+1 dS − K ūΓ,k+1 dS
Γp Γu
for x ∈ Γu ,
Z Z Z
1
− 2
GüΩ,k+1 dV + KuΓ,k+1 dS − GpΓ,k+1 dS − cuΓ,k+1 =
Ω cs Γp Γu
Z Z (4.26)
= + Gp̄Γ,k+1 dS − K ūΓ,k+1 dS
Γp Γu
51
A FMM accelerated Boundary Element technique for scalar evolutive
problems in 3D
for x ∈ Γp , and
Z Z Z
1
− 2
Gü Ω,k+1 dV + Ku Γ,k+1 dS − GpΓ,k+1 dS − cuΩ,k+1 =
Ω cs Γp Γu
Z Z (4.27)
= Gp̄Γ,k+1 dS − K ūΓ,k+1 dS
Γp Γu
for x ∈ Ω \ Γ.
As in the case of the parabolic equation, a discretization of eqn. (4.21),
rewritten in the form
Z tk+1
u̇ (x, tk+1 ) = ü (x, τ ) dτ + u̇k (x)
tk
Z tk+1
u (x, tk+1 ) = u̇ (x, τ ) dτ + uk (x) =
tk (4.28)
Z tk+1 Z τ
= ü (x, τ̃ ) dτ̃ dτ +
tk tk
+ (tk+1 − tk ) u̇k (x) + uk (x)
is required.
A widely used discretization technique for (4.28) is the Newmark family of
time-marching schemes
52
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where
Z
1
[GΩ ] f = G (x, y) f (y) dV (y)
c2s
ZΩ
[GΓu ] f = G (x, y) f (y) dS (y)
Γu
Z
GΓp f = G (x, y) f (y) dS (y) (4.31)
Γp
Z
[KΓu ] f = K (x, y) f (y) dS (y)
Γu
Z
KΓp f = K (x, y) f (y) dS (y)
Γp
for x ∈ Γu ,
− [GΩ ] üΩ,k+1 + (1 + α) KΓp uΓ,k+1 − [GΓu ] pΓ,k+1 − cuΓ,k+1 =
= (1 + α) [KΓu ] ūΓ,k+1 − GΓp p̄Γ,k+1 + (4.33)
− α cuΓ,k + [GΓu ] + GΓp pΓ,k − [KΓu ] + KΓp uΓ,k
for x ∈ Γp , and
− [GΩ ] üΩ,k+1 + (1 + α) KΓp uΓ,k+1 − [GΓu ] pΓ,k+1 − cuΩ,k+1 =
= (1 + α) [KΓu ] ūΓ,k+1 − GΓp p̄Γ,k+1 + (4.34)
− α cuΩ,k + [GΓu ] + GΓp pΓ,k − [KΓu ] + KΓp uΓ,k
53
A FMM accelerated Boundary Element technique for scalar evolutive
problems in 3D
- the center of the k internal elements in Ω, where the value of the internal
unknown field (u̇Ω or üΩ ) is unknown.
and
Z
(uΓ )
KΓ,ij = K (xi , y) Nj (y) dS (y) (4.36)
Γ
54
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
over a generic kth boundary element, while in the latter case involves the volume
integral
Z −→
(Ω)
Mn,m(O) = Rn,m Oy f (y) dV (y)
Ω
(4.39)
u̇Ω parabolic equation
f =
üΩ scalar wave equation
over a generic kth internal element. As a consequence the total Fast Multipole
integration strategy must be specialized for the two kinds of integrals, and
requires the construction of separate oct-trees. Except for this modification,
the matrix-vector multiplication algorithm is analogous to the one described for
the potential problem.
55
A FMM accelerated Boundary Element technique for scalar evolutive
problems in 3D
56
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
57
A FMM accelerated Boundary Element technique for scalar evolutive
problems in 3D
0.5
0
Potential, Flux
-0.5
-1
-1.5
-2
-2.5
-3
0 1 2 3 4 5 6 7 8 9
Time
0.5
0
Potential, Flux
-0.5
-1
-1.5
-2
-2.5
-3
0 1 2 3 4 5 6 7 8 9
Time
Figure 4.2: Simple wave propagation: fine internal mesh, different time-step
lengths
58
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
0.5
Potential, Flux
-0.5
-1
-1.5
-2
-2.5
0 1 2 3 4 5 6 7 8 9
Time
0.5
Potential, Flux
-0.5
-1
-1.5
-2
-2.5
0 1 2 3 4 5 6 7 8 9
Time
Figure 4.3: Simple wave propagation: coarser meshes, fixed time-step coefficient
59
A FMM accelerated Boundary Element technique for scalar evolutive
problems in 3D
0.5
0
Potential, Flux
-0.5
-1
-1.5
-2
-2.5
-3
0 1 2 3 4 5 6 7 8 9
Time
0.5
0
Potential, Flux
-0.5
-1
-1.5
-2
-2.5
-3
0 1 2 3 4 5 6 7 8 9
Time
Figure 4.4: Simple wave propagation: coarser meshes, fixed time-step length
60
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
The graphs in Figure 4.3 show the solution obtained with coarser meshes. All
the solutions are obtained with a fixed time-step coefficient k = 1, corresponding
to time-step lengths ranging from ∆t = 0.0125 for the finest mesh to ∆t = 0.1
for the coarsest one. While the results obtained with the aligned mesh are
satisfactory, those relative to the oblique mesh are acceptable only with the
finest mesh. In fact, one notes that the wave reflections do not happen at
the correct times, as a consequence of the excessively coarse discretization of
the internal domain; moreover, the flux history experiences stronger oscillations
than in the case of the aligned mesh. Despite these drawbacks, the maximum
value of the potential and that of the flux (purged from the oscillations) are
correctly approximated even with the coarser meshes.
Finally, the graphs in Figure 4.4 are relevant to a fixed time-step ∆t =
0.00125 with different spatial discretizations. The results obtained with the
oblique mesh confirm that the shift in reflection times is due to the lack of
accuracy in the internal discretization, more than to the adoption of excessively
large time-steps.
61
A FMM accelerated Boundary Element technique for scalar evolutive
problems in 3D
0.2
Potential
-0.2
-0.4
-0.6
0 5 10 15 20
Time
62
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
Three Boundary Element meshes, together with two Finite Element meshes,
have been studied. Details on the solved discretizations are given in Table 4.2.
DOF
Internal Boundary
BEM 1 6640 1909
BEM 2 6640 52392
BEM 3 53120 52392
FEM 1 13375 -
FEM 2 98890 -
63
A FMM accelerated Boundary Element technique for scalar evolutive
problems in 3D
0.4
0.35
0.3
Temperature
0.25
0.2
0.15
0.1
BEM 1
BEM 2
0.05 BEM 3
FEM 1
FEM 2
0
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4
Time
Figure 4.8 shows the time history of temperature at the central point of the
bottom plate. The BEM results are in good agreement with the ones obtained
with FEM, with the exception of a slight underestimation of the maximum value
of the temperature obtained with the coarsest BEM mesh.
64
Chapter 5
Two existing formulations based on integral equations for the external Stokes
problem are tested and compared, in order to determine their accuracy and
their weaknesses. The issues related to the iterative solution of singular and ill-
conditioned BEM problems are then analysed, and the necessary justification
for the necessity of the Mixed Velocity-Traction equation approach is provided.
Figure 5.1: Fluid flow regimes for several MEMS applications (Karniadakis et.
al. [20])
The low Reynolds number allows to neglect the effects of turbolence, giving
rise to a linear problem, which can be solved by means of integral equations.
While the classical Boundary Element Method can not guarantee the required
scalability to solve large-scale problems arising in MEMS analysis, the usage of
an accelerated solver employing a fast matrix-vector multiplication technique
like the Fast Multipole Method or the Wavelet Compression technique (see e.g.
Wang [40], Tausch [36]) allows to solve problems which are out of reach at a
reasonable cost using the Finite Element Method or the Finite Volume Method.
66
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where u is the fluid velocity, µ is the viscous drag coefficient and P is the
hydrostatic pressure, which is not dependent on the rate of deformation, due to
the incompressibility of the fluid.
The steady-state Stokes equations, which can be recovered from the Navier-
Stokes equations by neglecting the time-dependent terms and the terms related
to turbolence, can be written in the form
µ∆u − ∇P = 0
(5.2)
∇·u = 0
where the first equation is the 3D equilibrium equation, while the second one is
the continuity equation, which reduces to an incompressibility constraint.
With reference to Figure 5.2, a set of m rigid bodies Ωα with surface Γα
are immersed in a viscous fluid, which in turn can be enclosed by an external
envelope of surface ΓE .
m
1
nm
n1 um
u1
,
u2 n2
2
67
The Boundary Element Method for the exterior Stokes flow problem
while confined flows must match the velocity of the enclosing boundary
u|ΓE = V E + ΩE × X = g (E) (x) (5.5)
We define with G the set of right hand sides obtained by any possible com-
bination of rigid body motions over the closed surfaces Γα .
This problem is known as the resistance problem. Two different integral
formulations are available in literature to solve the resistance problem, and will
be described in the next sections.
Moreover, the possible flows affecting MEMS devices can be roughly divided
in two categories, as illustrated in Figure 5.3. Pressure driven flows are gener-
ated when the relative distance between the immersed parts varies during the
motion (e.g. in parallel plate resonators), and the total force is thus mainly
determined by the pressure difference between the opposite sides of the bodies.
Shear driven flows are instead characterized by a relative slide between the im-
mersed parts (e.g. in comb drive resonators and torsional accelerometers), and
the total force is hence determined by the shear surface tractions.
68
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
u u u
u
By taking the limit to a boundary smooth point and inserting the Dirichlet
boundary conditions of rigid body motion described in the former section we
obtain
Z XZ
1 α (α)
Gij (x, y) tj (y) dS (y) = ui (x)+ − Kij (x, y) uj (y) dS (y) (5.10)
Γ 2 α Γ α
69
The Boundary Element Method for the exterior Stokes flow problem
thus the boundary integral equation for the external flow reduces to a Fredholm
Boundary Integral Equation of the first kind
Z
Gij (x, y) tj (y) dS (y) = −gi (x) (5.13)
Γ
where the unknown potential t (x) is the surface traction exerted by the fluid
flow over the rigid surfaces Γα .
Let us define the two integral operators
Z
V f i (x) = Gij (x, y) fj (y) dS (y)
ZΓ (5.14)
K f i (x) = Kij (x, y) fj (y) dS (y) .
Γ
The corresponding traction equation for both the internal and the external
problem can be recovered by a proper combination of x-derivatives of (5.6),
which lead to
Z Z
ti (x) = Sij (x, y) tj (y) dS (y) − Dij (x, y) uj (y) dS (y) (5.18)
Γ Γ
70
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where
3 r,i r,j r,k nxk
Sij (x, y) = − (5.19)
4π r2
and Dij is an hypersingular term (see e.g. Aliabadi [2]) which satisfies
Z
Dij (x, y) gj (y) dS (y) = 0 ∀g ∈ G. (5.20)
Γ
and
1
S+ t = 0 (5.24)
2
is defined.
Moreover, since
Z
S f (x) = Sij (x, y) tj (y) dS (y) =
Γ
Z (5.26)
= − Kji (y, x) ti (y) dS (y) = − K ∗ f (x)
Γ
71
The Boundary Element Method for the exterior Stokes flow problem
72
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where
3 (x − x0 )i (x − x0 )j (x − x0 )k nk (x)
Kij (x, x0 ) = r = kx − x0 k
4πµ r5
(5.32)
where the integral exists in the Cauchy Principal Value sense. Equation (5.31)
cannot describe an arbitrary solution of (5.2) since it can be shown that (5.33)
has a null-space given by the set G of rigid body motions
1
N K− = G = span ψ 1,(α) , . . . , ψ 6,(α)
2 (5.34)
k,(α) k+3,(α)
ψj = δjk ψj = jkl (x − xα )l k = 1, 2, 3
where
1 ijk (x0 − xα )k
rij (x0 , xα ) = (5.36)
8πµ r5
73
The Boundary Element Method for the exterior Stokes flow problem
is called Rotlet, and the internal singularities allow to complete the range of the
double-layer operator, therefore providing an equation yielding a unique solution
which satisfies the Stokes equations and the prescribed boundary conditions.
The strength of the internal singulatities can be arbitrarily expressed as
linearly dependent upon the double-layer density:
Z
(α) k,(α)
Fk = φj (x) ψj (x) dS
Γ
Zα (5.37)
(α) k+3,(α)
Mk = φj (x) ψj (x) dS
Γα
The total drag forces exerted on a sphere and on a cube subject to a pure
translation in an unbounded domain are evaluated.
74
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
Figure 5.4 shows the error in the evaluation of the drag force using different
meshes and formulations. The reference value is represented by the analytical
one in the case of the spherical body, and by the value obtained with a 150000
DOF mesh using the single-layer formulation for the cubic body.
Sphere Cube
1 0.1
SLPC
DLPC
DLCL
0.1 DLPL
Error
Error 0.01
SLPC
0.01 DLPC
DLCL
DLPL
DLPPL
0.001 0.001
100 1000 10000 100 1000 10000
DOF DOF
Figure 5.4: Error in the evaluation of the total drag force on a translating sphere
and cube ; SL - Single-Layer; DL - Double-Layer; PC - Piecewise Constant; CL
- Continuous Linear; PL - Piecewise Linear; PPL - Partially Piecewise Linear
75
The Boundary Element Method for the exterior Stokes flow problem
where
Z Z
SG
V hk
= Gij (x, y) dS (y) dS (x) (5.40)
Γh Γk
and
Z
g SG
h
= g (x) dS (x) (5.41)
Γh
Apart from the eventual loss of precision due to the adopted integration
strategy, the matrix V SG is still symmetric.
Things radically change if the Collocational formulation (5.13) is considered.
Despite the fact that the fundamental solution is symmetric
76
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where the coefficients of the unsymmetric matrix V C are grouped in 3×3 blocks
in the form
Z
C
V hk = Gij x(h) , y dS (y) (5.46)
Γk
77
The Boundary Element Method for the exterior Stokes flow problem
78
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
It can be shown that x = AD b is a solution of Ax = b if b ∈ R Ai , and
also that the Drazin solution is the unique Krylov solution.
Therefore, in exact arithmetic, the GMRES method converges to the Drazin
solution of the n × n singular system
in at most n − α iterations.
Another issue which arises when solving the discretized version of eqns.
(5.45) and (5.39) is the fact that the discrete right hand side of
could not belong to R([V ]), and the deviation from the correct range is often
much larger than the machine precision, expecially for coarse meshes.
The effect of this inaccuracy is that the discretized singular system has no
solution, and the GMRES method cannot converge to any vector {t}, since a
positive lower limit for the norm of the residual is defined by the out-of-range
component of the right hand side. Moreover, the Krylov space
r0 = u − V t0
(5.54)
Kk ≡ span r0 , V r0 , V 2 r0 , . . . , V k−1 r0
is malformed, since g 6∈ R(G) and therefore so does the Krylov space Kk (A, b).
As a consequence of these issues, the GMRES method keeps increasing the
Krylov space up to its completion, which happens at a generic kth step. At
this point, the (k + 1)th Krylov vector is linearly dependent upon the former
ones, the resulting Hessemberg matrix is singular and the associated least square
problem can not be solved, determining the failure of the iterative solver.
In order to recover a solution, the right hand side should be projected onto
R(V ) leading to the modified linear system
V t = −g̃ (5.55)
where
" #
X
∗(α) ∗(α)
g̃ = I − t ⊗t g (5.56)
α
and the vectors t∗(α) form an orthonormal basis for the null-space N (V T ). This
ensures that a solution can be actually reached.
79
The Boundary Element Method for the exterior Stokes flow problem
80
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
which can not be further simplified. The resulting algorithm is therefore prac-
tically useless, due to its computational requirements.
The only way to solve this issue is dropping the last term in (5.57), with the
implicit acceptance of a numerical error whose value is of the order of the norm
of the matrix D.
The better way to completely remove any convergence issue due to the sin-
gularity of A is to modify the operator itself in order to obtain a non singular
system which solution is the Drazin solution of the original one. Recalling that
the Drazin solution is the only solution of
Ax = b b ∈ R(A) (5.60)
where the augmentation term is a projector over the null-space of the adjoint
operator, provides the Drazin inverse solution of the singular operator A.
81
The Boundary Element Method for the exterior Stokes flow problem
is equivalent to the unique solution shared by the two decoupled singular systems
Ax = b
!
X
∗(α) ∗(α)
(5.65)
t ⊗t x = 0.
α
G −1 ≡ {t|V t ∈ G} (5.66)
be the space containing all the solutions of the single-layer equation with r.h.s.
belonging to G. Since V is singular,
∀t ∈ G̃ −1 , ∀tα ∈ N (V ) t 6= tα (5.69)
G̃ −1 ∩ N (V ) ≡ 0. (5.70)
82
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
Thus subspace G̃ −1 has the same dimensions of G, and its intersection with
the null-space contains only the zero element.
The subspace G̃ −1 is not unique. For example
⊥
G̃ −⊥ = G −1 ∩ N (V ) (5.71)
is the space containing all the null-space free solutions of the single-layer oper-
ator.
Similarly
G̃ −D = G −1 ∩ R (V ) (5.72)
is the space containing all the Drazin inverse solutions of the single-layer oper-
ator.
⊥
If V is symmetric, then the two latter spaces coincide, since R (V ) ≡ N (V ) .
−1
The solution of V t = −g, with t ∈ G̃ , exists and is unique. Now let B
be a second operator with the properties
N (V + B) ≡ 0 (5.73)
and
∃G̃ −1 |G̃ −1 ⊆ N (B). (5.74)
Then, let t̂ be the unique solution of the single-layer operator belonging to
G̃ −1 . Clearly
(A + B) t̂ = At̂ = −g (5.75)
since t̂ ∈ N (B) for property (5.74), therefore t̂ is a solution of the modified
system
(A + B) t = −g. (5.76)
Moreover, for property (5.73), t̂ is the unique solution of (5.76).
Therefore, if the augmentation of the single-layer operator is performed by
means of a projector onto its own null-space, instead of onto the null-space of
its adjoint operator,
" #
h i X
Ṽ {t} = V + β t∗(α) ⊗ t∗(α) {t} = − {g} ∈ G (5.77)
α
the desingularized system still provides a valid solution of the single-layer oper-
ator, different from the corresponding Drazin inverse solution.
83
The Boundary Element Method for the exterior Stokes flow problem
84
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
However, the numerical issues observed here are typical of the velocity operator
of the Stokes flow problem, since the single-layer operators derived from both
the Laplace and the compressible elasticity fundamental solution lead to optimal
convergence over the same geometry.
The convergence history plotted in Figure 5.10 for both the CBEM approach
and the SGBEM approach with 52 Gauss-Hammer points shows that the stan-
dard desingularization technique alone is not able to remove all the stagnation
zones. As a matter of fact, the presence of slender members, which numerically
behave like almost independent bodies, leads to a severe ill-conditioning of the
resulting desingularized matrix.
This kind of behavior suggests the presence of a numerical null-space spanned
by the left eigenvectors associated to the lowest singular values. The effect of
the ill-conditioning on the convergence rate of the GMRES method is almost
independent of the formulation adopted.
Numerical experiments show that pressure driven flows are more sensitive
to the conditioning of the system matrix than shear driven flows. A simple
observation which corroborates this evidence is that the main issues related
85
The Boundary Element Method for the exterior Stokes flow problem
Convergence History
1
CBEM
SGBEM_1
0.01 SGBEM_7
SGBEM_52
1e-04 CBEM - NS
SGBEM_1 - NS
1e-06 SGBEM_7 - NS
SGBEM_52 - NS
Error
1e-08
1e-10
1e-12
1e-14
1e-16
0 20 40 60 80 100 120 140
Iterations
CBEM
1
0.01
0.0001
1e-06
1e-08
1e-10
1e-12
1e-14
Convergence history
<w_k,n*>/||w_k||
1e-16
0 20 40 60 80 100 120
k
86
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
87
The Boundary Element Method for the exterior Stokes flow problem
Convergence History
1
CBEM
0.01 SGBEM_52
CBEM - AS
0.0001 SGBEM_52 - AS
1e-006
1e-008
Error
1e-010
1e-012
1e-014
1e-016
1e-018
0 50 100 150 200 250
Iterations
88
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
1e-10
1e-12
1e-14
1e-16
1e-18
0 50 100 150 200 250
Iterations
Condition number
CBEM
SGBEM_52
1/c
0 5 10 15 20 25 30
Eigenvectors
Total force
138
CBEM
SGBEM_52
137
Force
136
135
134
0 5 10 15 20 25 30
Eigenvalues
89
The Boundary Element Method for the exterior Stokes flow problem
90
Chapter 6
The Mixed
Velocity-Traction
formulation for the external
Stokes flow
In this chapter, a new boundary integral formulation for the solution of the ex-
ternal Stokes flows with Dirichlet boundary conditions is proposed. The Mixed
Velocity-Traction (MVT) formulation allows to obtain an optimal rate of con-
vergence, therefore correcting any ill-conditioning issue related to the thin body
effect. The integral formulation is presented together with a proper discretiza-
tion technique, which allows to retain at discretized level the fundamental prop-
erties of the original integral operator. Several examples show the performance
and reliability of the proposed formulation for the evaluation of drag forces on
complex MEMS devices.
6.1 Introduction
As seen in the previous chapter, the AS approach, while formally retrieving
a non-singular formulation, usually does not address the performance issues
related to the thin body effect. The Mixed Velocity-Traction (MVT) equation
approach is a formulation first introduced by Frangi et al. [11] which enables
The Mixed Velocity-Traction formulation for the external Stokes flow
t = t⊥ + cα tα (6.2)
and is satisfied by the null-space free component t⊥ of the solution of the velocity
equation.
Since (6.4) is a homogeneous equation, any of its non-trivial solutions lies in
its null-space, which can thus be defined as
1 ∗
N −K = G −⊥ (6.5)
2
where
G −⊥ ≡ t | V t = −g, t⊥ , tα = 0
(6.6)
is the space spanned by the null-space free inverse solutions of the velocity
equation, when the r.h.s. is given by members of G.
Moreover, the null-space of the external double-layer operator is
1
N −K = G (6.7)
2
92
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where γ is a length which can be used to tune the √ conditioning of the resulting
equation. From the practical point of view γ = γ̃ A, where γ̃ is a nondimen-
sional parameter, and A is a characteristic size of the adopted mesh.
At this point, it is necessary to show that the solution t of (6.9) for a given
r.h.s. g ∈ G is unique, and it is also a solution of the corresponding velocity
equation (6.1).
The uniqueness of the solution of (6.9) is verified if the corresponding ho-
mogeneous equation
γ 1 ∗
[V t] (x) + − K t (x) = 0 (6.10)
µ 2
93
The Mixed Velocity-Traction formulation for the external Stokes flow
An application of the Green’s formula for the Stokes flow leads to the fol-
lowing identity (the equivalent of Betti theorem for incompressible elasticity)
Z 2 Z
µ ∂ ûi ∂ ûj
(x) + (x) dV (x) = û (x) · fˆ (x) dS (x) (6.14)
2 Ωα ∂xj ∂xi Γα
The left hand side is nonnegative, and the right hand side can be rewritten
in the form
Z Z
γ 2
û (x) · fˆ (x) dS (x) = − kû (x)k dS (x) (6.15)
Γα µ Γα
û = 0 (6.16)
must hold, and a generic solution of (6.10) must consequently belong to the
null-space N (V ). However, if t̂ ∈ N (V ), then
1 ∗
− K t̂ (x) = 0 (6.17)
2
which proves that the unique solution of the MVT equation with r.h.s. belonging
to G is a solution of the single-layer equation.
It is important to stress that the behaviour of the MVT solution at dis-
cretized level heavily depends on the ability to preserve the correct null-space
of the traction operator in the actual implementation. The three sources of
94
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
errors are (i) the discretization technique employed, (ii) the approximation of
the involved integrations (Gauss quadrature, Fast Multipole Method) (iii) the
finite precision of floating point arithmetics in computers. While the two latter
sources of error are somehow unavoidable, a correct choice in the discretization
scheme is required to avoid the total loss of accuracy of the MVT solution.
While the theoretical MVT solution does not depend on the value of the
weighting coefficient γ, the conditioning of the MVT matrix heavily depends
upon it. Indeed, for γ = 0, the single-layer formulation is recovered, and c−1
0 = 0.
−1 A+γB B A+γB
If γ → ∞, then again cγ → 0, since σmax → γσmax → ∞ and σmin →
σ ∗ ≤ σmax
A
. Therefore, it exists an optimal value γ̂ such as c−1 γ̂ = max c−1
γ .
The goal of a successful MVT implementation is to ensure that the solution of
the mixed equation for γ = γ̂ is sufficiently close to the solution of the velocity
equation, despite the loss of accuracy introduced by the discretization process.
The right choice for the discretization technique is a crucial issue in a successful
implementation of the MVT formulation. In order to unify the various dis-
cretization algorithms which will be described, let us introduce the variational
version of equation (6.9).
γ 1
t̃, [V t] + t̃, − K∗ t = − t̃, g ∀t̃ ∈ E (6.20)
µ 2
95
The Mixed Velocity-Traction formulation for the external Stokes flow
where
Z Z
t̃, [V t] = t̃i (x) Gij (x, y) tj (y) dS (y) dS (x)
Γ
Γ Z
1 ∗
t̃, −K t = t̃i (x) ti (x) dS (x) +
2 Γ
Z Z (6.21)
∗
+ t̃i (x) Kij (x, y) tj (y) dS (y) dS (x)
Γ Γ
Z
t̃, g = t̃i (x) gi (x) dS (x) .
Γ
n
(t)
X
t (x; t1 , . . . , tn ) = Ni (x) ti (6.22)
i=1
The test functions t̃ must belong to the same vector space Eh , therefore n
linearly independent equations can be derived from (6.20).
96
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where the (i, j)th element of the 3 × 3 submatrices corresponding to hth source
element and kth field element is
Z Z
G
A hk ij = Nh (x) Nk (x) Gij (x, y) dS (y) dS (x)
ZΓ ZΓ
∗
G
B hk ij = Nh (x) Nk (x) Kij (x, y) dS (y) dS (x) + (6.25)
Γ Γ
Z
1
− δij Nh (x) dS (x)
2 Γ
and
Z
G
g (x) dS (x) ∈ GnG .
g τh i
= (6.28)
τh
therefore this formulation still provides a correct MVT solution which is inde-
pendent of the value of γ.
97
The Mixed Velocity-Traction formulation for the external Stokes flow
For the piecewise constant approximation xk is the centroid of the kth tri-
angular element.
The resulting system of equations is
γ C
[W ] A + B {t} = − [W ] g C
C
(6.32)
µ
where
Z
C
A hk ij = Nk (x) Gij (xh , y) dS (y)
ZΓ
C ∗ 1 (6.33)
B hk ij = Nk (x) Kij (xh , y) dS (y) − δhk δij
Γ 2
C
g h i = gi (xh ) ∈ GnC
The most crucial issue with the Collocation approach is that the matrix
corresponding to the discretized traction operator is not the transpose of the
collocational discretized double-layer operator. From the theoretical point of
view, the existence of a null-space for the Collocational traction equation can
not be proved. Numerical experiments confirm that the resulting matrix is not
singular, and the solution of the corresponding MVT equation shows an heavy
dependence of the value of γ.
98
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where
Z
Q
A hk ij = Nh (x) Gij (x, yk ) dS (x)
ZΓ
Q ∗ 1 (6.36)
B hk ij = Nh (x) Kij (x, yk ) dS (x) − δhk δij
Γ 2
Q
g h i = gi (xh ) ∈ GnQ
and the matrix W is still
[W ] = diag (w1 , . . . , wn ) . (6.37)
The matrix obtained with the Qualocation discretization of the traction
operator B Q is the transpose of the matrix arising from the Collocation dis-
cretization of the double-layer operator. Indeed
Z
∗Q ∗
K hk ij
= Nh (x) Kij (x, yk ) dS (x) =
Γ
Z
= Nh (y) Kji (xk , y) dS (y) = (6.38)
Γ
T
= K C kh ji = K C hk ij
which leads to
Q C !T
1 1
K∗ − = K− . (6.39)
2 2
Due to the fact that any rigid body motion lies in the null-space of the
Collocational double-layer operator, its adjoint operator possesses a null-space
of the same dimension. Therefore an appropriate subspace G̃ −1 can be defined
and property (5.74) is fullfilled. The consequence is that the solution obtained
with the Qualocation approach is theoretically independent of the value of γ.
In practise, it shows a much lighter dependence than the corresponding Collo-
cational solution, allowing to obtain a well-conditioned MVT matrix with very
little impact on accuracy.
99
The Mixed Velocity-Traction formulation for the external Stokes flow
where
−→ ∂
Pij = δij − Ox
j ∂xi
(6.41)
∂
Qi = .
∂xi
the ith component of the double integral over the source element Γα and the
field element Γβ
Z Z
(Γα ,Γβ )
Ii = Gij (x, y) φj (y) dS (y) dS (x) (6.43)
Γα Γβ
100
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where M S and M V are respectively scalar and vector multipole moments defined
by the φ2M formulae
Z −→
V
Mj,n,m (O) = Rn,m Oy φj (y) dS (y)
Γβ
Z −→ −→ (6.45)
S
Mn,m (O) = Rn,m Oy Oy φj (y) dS (y) .
Γβ j
n0 =0 m0 =−n0
0
n n −−→
S
Mn,m 0
(O ) =
X X
Rn0 ,m0 O0 O Mn−n
S (6.46)
0 ,m−m0 (O) +
n0 =0 m0 =−n0
−−→
− O0 O Mj,n−n
V
0 ,m−m0 (O) .
j
where
∂
R
Fij,n,m (−x−
→ −−→ −−→
0 x) = δij Rn,m (x0 x) − (x0 x)j Rn,m (−
x−
→
0 x)
∂xj
(6.48)
−−→ ∂
GRi,n,m (x0 x) = Rn,m (−
x−
→
0 x) .
∂xi
101
The Mixed Velocity-Traction formulation for the external Stokes flow
The scalar and vector local expansion coefficients LS and LV are defined by
the M 2L formulae
0
∞
X n
X −−→
n
LVj,n,m (x0 ) = V
(−1) Sn+n0 ,m+m0 Ox0 Mj,n 0 ,m0 (O)
n0 =0 m0 =−n0
0
∞ n −−→
(−1) Sn+n0 ,m+m0 Ox0 MnS0 ,m0 (O) + (6.49)
X X n
LSn,m (x0 ) =
n0 =0 m0 =−n0
−−→
V
− Ox0 Mj,n 0 ,m0 (O) .
j
− (−
x− →
V
0 x1 )j Lj,n0 ,m0 (x0 ) .
102
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
The Fast Multipole implementation for the velocity operator thus involves
four scalar arrays, instead of the only one needed by by the scalar integral
equation associated to the Laplace problem. Let alone this difference, the ap-
proximation of the matrix-vector multiplication is carried on with an identical
algorithm.
The evaluation of the far-field part of the traction integral involves a proper def-
inition of multipole moments and local expansions based on the decomposition
of the Stresslet kernel, which is formally indentical to the single-layer traction
kernel for incompressible elasticity. The derivation of the FMM formulation
for the hypersingular equation for compressible elasticity, which involves the
traction kernels, is derived in Yoshida et al. [42]. The incompressible elasticity
formulation cannot be derived by the compressible formulation by simply set-
ting ν → 1/2, due to the presence of the pressure term in the Stresslet kernel,
which thus requires an ad hoc derivation of the corresponding FMM algorithm.
∗
The Stresslet kernel Kij appearing in the traction operator can be rewritten
in the form
∗ δik ∂ 2 ∂ ∂
Kij (x, y) = +µ Gji + µ Gjk nk (x) =
8π ∂xj r ∂xk ∂xi
(6.53)
δik ∂ 2 ∂ ∂
= − +µ Gji + µ Gjk nk (x)
8π ∂yj r ∂xk ∂xi
∞ n
∗,n,m −
1 X X → −→
∗
Kij (x, y) = Kij Ox, Oy (6.54)
8π n=0 m=−n
103
The Mixed Velocity-Traction formulation for the external Stokes flow
where
∗,n,m −
→ −→ ∂ −→ −→
Kij Ox, Oy = δij nk (x) S n,m Ox Rn,m Oy +
∂xk
−→ ∂ −→ ∂ −→
+ Ox nk (x) S n,m Ox Rn,m Oy +
i ∂xk ∂yj
∂ −→ −→ ∂ −→
− nk (x) S n,m Ox Oy Rn,m Oy +
∂xk i ∂yj
−→ −→ (6.55)
∂
+ nj (x) S n,m Ox Rn,m Oy +
∂xi
−→ ∂ −→ ∂ −→
+ nk (x) Ox S n,m Ox Rn,m Oy +
k ∂xi ∂yj
∂ −→ −→
∂ −→
− nk (x) S n,m Ox Oy Rn,m Oy .
∂xi k ∂yj
The ith component of the double integral over the source element Γα and
the field element Γβ
Z Z
(Γ ,Γ ) ∗
Ii α β = Kij (x, y) φj (y) dS (y) dS (x) (6.56)
Γα Γβ
where
−→ ∂ −→
ASij,n,m Ox = δij nk (x) S n,m Ox
∂xk
−→ ∂ −→
S
Bi,n,m Ox = S n,m Ox (6.58)
∂xi
−→ −→ −→ −→
S ∂ ∂
Ci,n,m Ox = Ox nk (x) + Ox nk (x) S n,m Ox
i ∂xk k ∂xi
104
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
Finally, the integral (6.57) can be expressed also as a function of local ex-
pansion coefficients by means of the L2I formula
0
Z ∞ n
(Γ ,Γ ) 1 X X
−−→ V
Ii α β = AR
ij,n0 ,m0 (x0 x) Lj,n0 ,m0 (x0 ) dS (x) +
8π Γα n0 =0 m0 =−n0
0
Z ∞ n
1 X X
R −−→ V
+ Bi,n 0 ,m0 (x0 x) nj (x) Lj,n0 ,m0 (x0 ) dS (x) +
8π Γα n0 =0 m0 =−n0
0
Z ∞ n
1 X X
R −−→ S
+ Ci,n 0 ,m0 (x0 x) Ln0 ,m0 (x0 ) dS (x)
8π Γα n0 =0 m0 =−n0
(6.60)
where
−−→ ∂
AR
ij,n0 ,m0 (x0 x) = δij nk (x) Rn0 ,m0 (−
x−→
0 x)
∂xk
−−→ ∂
R
Bi,n 0 ,m0 (x0 x) = Rn0 ,m0 (−x−
→
0 x) (6.61)
∂xi
−−→ ∂ ∂
R
Ci,n 0 ,m0 (x0 x) = (−
x−
→
0 x)i nk (x) + (−x−→
0 x)k nk (x) Rn0 ,m0 (−
x−
→
0 x)
∂xk ∂xi
105
The Mixed Velocity-Traction formulation for the external Stokes flow
are different, and undergo different transformations, so they must be kept sepa-
rate. Moreover, since the scalar local expansion coefficients depend even on the
vector moments, they also need to be kept separate.
F = 6πµRV (6.62)
and is considered a standard benchmark for the validation of Stokes flow solvers.
The MVT solution, obtained with different values of the weighting coefficient γ̃,
is compared with the solution of the corresponding single-layer equation. The
problem is solved with three different meshes, respectively of 144 DOFs, 624
DOFs and 2400 DOFs. The results obtained with both the Collocation and the
Qualocation approaches are listed in the following table.
106
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
log γ̃ -2 -1 0 1 2 3 4
Collocation 18.74 18.73 18.64 17.84 12.47 3.18 0.36
Qualocation 18.74 18.74 18.74 18.74 18.74 18.74 18.73
The Qualocation approach proves its superior algebraic qualities. Even with
a piecewise constant discretization, the null-space of the aprroximated traction
operator is preserved, and the results are independent on the value of γ̃. On the
contrary, the Collocation discretization leads to a non-singular traction operator,
and the corresponding MVT solution tends to zero as γ̃ increases.
The plot of the inverse of the condition number in Figure 6.1 shows that an
optimal conditioning can be reached with a value γ̃ ' 1, well within the range
in which the qualocation approach provides a perfect solution.
107
The Mixed Velocity-Traction formulation for the external Stokes flow
108
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
6.3. Once again the optimal condition number is obtained when γ̃ ' 1. In this
case the Collocation approach provides an highly deteriorated solution for that
value of γ̃, while the Qualocation sapproach still delivers an acceptable solution
from the engineering point of view.
log γ̃ -3 -2 -1 0 1 2 3
Collocation 923.84 922.93 915.51 870.26 676.96 374.46 23.84
Qualocation 924.16 924.42 925.63 921.23 906.99 891.26 836.76
109
The Mixed Velocity-Traction formulation for the external Stokes flow
110
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
Figure 6.5: Comb finger resonator, convergence history and total force
111
The Mixed Velocity-Traction formulation for the external Stokes flow
112
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
In this case the structure on the left and the substrate are fixed and the
rotor on the right moves in the direction orthogonal to the parallel plates with
unit velocity. The holes in the rotor structure are required by the production
process. Results are collected in Figure 6.9. F/(µLU ) is again the nondimen-
sional force with L = 100 µm. Each matrix-vector product of the GMRES takes
approximately 18s, 75s and 548s for the three meshes, respectively.
The presence of several slender fingers has been identified as a major source
of ill-conditioning for the standard formulation. In order to verify the MVT
performance in this tough case, the structures in Figure 6.10 have been anal-
ysed. Essentially they consist of a collection of the basic unit but without holes.
Results concerning the convergence history are collected in Figure 6.11 in four
different cases with 1, 5 10 and 15 units and variable level of refinement. The
MVT technique easily addresses also this kind of analyis.
The change in slope of the convergence plot after 200 iterations is due to the
restart parameter of the GMRES solver which has been indeed set to 200.
113
The Mixed Velocity-Traction formulation for the external Stokes flow
Figure 6.8: Parallel plate unit: geometry, mesh1 and detail of mesh3
6.6 Conclusions
The Fast Multipole accelerated Boundary Element Method proves to be a highly
efficient technique for the evaluation of the drag forces exerted on MEMS de-
vices. Moreover, though being limited to the range of low Reynolds and low
Knudsen numbers, it is highly competitive in comparison with the Finite Ele-
ment and Finite Volume methods. The performance of boundary integral for-
mulations based on the single-layer velocity operator is however undermined
by the intrinsic ill-conditioning of the resulting algebraic systems of equations
which leads to stagnation in convergence. The Mixed Velocity-Traction equation
approach allows to overcome this issue, ensuring a proper conditioning without
affecting the quality of the solution.
114
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
Figure 6.9: Parallel plate unit: convergence history for GMRES solver and
global force on the rotor in the direction of movement
115
The Mixed Velocity-Traction formulation for the external Stokes flow
Figure 6.10: Simplified parallel plate resonator: single unit MEMS and 15 unit
MEMS
116
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
Figure 6.11: Simplified parallel plate resonator: convergence history for the
GMRES solver
117
The Mixed Velocity-Traction formulation for the external Stokes flow
118
Appendix A
Recursive evaluation of
spherical functions
The evaluation of the kernel expansions involved in the Fast Multipole algorithm
for both scalar and vector problems require the determination of the numerical
values of the functions Rn,m and Sn,m , together with their derivatives. The ex-
pressions given in (3.20) can be computed using simple recursive relations which
work in cartesian coordinates (Nishimura et al. [28]). Using these relations the
required quantities can be evaluated as functions of the position x = {x1 , x2 , x3 }
with norm r = ||x||.
Evaluation of RN,M (x):
- R0,0 (x) = 1
- for N = 1, 2, . . . , P
x1 + ix2
RN +1,N +1 (x) = RN,N (x) (A.1)
2 (N + 1)
- For M = 0, 1, . . . , P , N = M, M + 1, . . . , P − 1
(2N + 1) x3 RN,M (x) − r2 RN −1,M (x)
RN +1,M (x) = (A.2)
(N + M + 1) (N + 1 − M )
- For N = 1, 2, . . . , P M = 1, . . . , N ,
M
RN,−M (x) = (−1) RN,M (x) . (A.3)
Recursive evaluation of spherical functions
- for N = 1, 2, . . . , P
- For M = 0, 1, . . . , P , N = M, M + 1, . . . , P − 1
- For N = 1, 2, . . . , P M = 1, . . . , N ,
M
SN,−M (x) = (−1) SN,M (x) . (A.6)
120
Appendix B
outside the domain and the corresponding element does not exist. If the line
crosses the boundary one or more times (line l2 in Figure B.1), then the nearest
intersection to the candidate point is checked. Being n the outward normal to
the boundary, if d · n > 0 the point is in the domain, otherwise it is outside.
l2
l1
P
n Q3
2
Q1 1
Q2
The intersection of the line with the boundary must be checked at element
level. With reference to Figure B.1, let P be a point in space, and Q1 , Q2 and
Q3 the vertices of a triangular element; q = Q2 − Q1 and r = Q3 − Q1 are the
vectors running along two edges of the element, and v = Q1 − P is the vector
which points to the first vertex from the candidate point. The intersection
between the half-line and the plane containing the element is written in α, η1
and η2 coordinates, where η1 and η2 are the master element natural coordinates
and α is a coordinate running along db = d/ kdk.
The position of the intersection point is
v3 (q1 r2 − q2 r1 ) − v2 (q1 r3 − q3 r1 ) + v1 (q2 r3 − q3 r2 )
α =
D
d3 (r1 v2 − r2 v1 ) − d2 (r1 v3 − r3 v1 ) + d1 (r2 v3 − r3 v2 )
η1 =
D (B.1)
−d3 (q1 v2 − r2 v1 ) + d2 (q1 v3 − q3 v1 ) − d1 (q2 v3 − q3 v2 )
η2 =
D
D = d3 (q1 r2 − q2 r1 ) − d2 (q1 r3 − q3 r1 ) + d1 (q1 r2 − q2 r1 )
If D = 0 then the half-line and the plane containing the element are parallel.
122
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
and must be evaluated on cubic cells, for an arbitrary position in the three-
dimensional space of the source point x.
If the singularity lies at the origin of the reference system, the analytical
solution over a regular parallelepiped with edges lenghts a, b and c, as shown in
Figure B.2, reduces to
Z a Z b Z c
1
dy3 dy2 dy1 =
p
0 + y22 + y32
0 0 y12
1 ab 1 ac 1 bc
= − c2 tan−1 − b2 tan−1 − a2 tan−1 + (B.3)
2 cd 2 bd 2 ad
1 1 1
− ac log a2 + c2 − bc log b2 + c2 − ab log a2 + b2 +
2 2 2
+ +bc log (a + d) + ac log (b + d) + ab log (c + d)
where
p
d= a2 + b2 + c2 (B.4)
123
The internal mesh for the DBEM formulation
c
d
b
a
I I1 I2
= - +
I3 I4 I5
- - +2 +
I6 I7 I8
+2 +2 -3
Figure B.3: Volume integration with source point in a generic position in space
124
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
where I1 , . . . , I8 are the singular integrals over the domains shown in Figure B.3.
The proposed analytical solution fails if the distance between the singularity
and the integration domain is excessive, due to the numerical errors typical
of the subtraction of large quantities in (B.6). In analogy with the analytical
solutions in Milroy et. al. [26] for the surface integrals, it is convenient to set a
reference distance, over which numerical integrations can be employed.
125
The internal mesh for the DBEM formulation
126
Bibliography
[1] Abousleiman Y and Cheng AH-D. Boundary element solution for steady
and unsteady Stokes flow. Comput. Methods Appl. Mech. Engrg., 117:1–13,
1994.
[2] Aliabadi MH. The Boundary Element Method, Applications in Solids and
Structures, volume 2. Wiley, 2002.
[5] Bonnet M. Boundary Integral Equation Methods for Solids and Fluids.
John Wiley & Sons, 1995.
128
Fast Multipole accelerated Boundary Element techniques for large-scale
problems, with applications to MEMS
[22] Mammoli AA and Ingber MS. Stokes flow around cylinders in a bounded
two-dimensional domain using a multipole-accelerated boundary element
methods. Int. J. Numer. Meth. Engng., 44:897–917, 1999.
[23] Mantivc V. A new formula for the C-matrix in the Somigliana identity.
Journal of Elasticity, 33:191–201, 1993.
[32] Pozrikidis C. Boundary integral and singularity methods for linearized vis-
cous flow. Cambridge University Press, Cambridge, 1992.
129
BIBLIOGRAPHY
[34] Saad Y and Shultz MH. Gmres: A generalized minimal residual algorithm
for solving nonsymmetric linear systems. SIAM J. Sci. Stat. Comput.,
7:856–869, 1986.
[35] Steinbach O and Wendland WL. The construction of some efficient precon-
ditioners in the Boundary Element Method. Advances in Computational
Mathematics, 9:191–216, 1998.
[36] Tausch J. Sparse BEM for potential theory and Stokes flow using variable
order wavelets. Computational mechanics, 32:312–318, 2003.
[37] Tausch J and White J. Second Kind Integral Formulations of the Capaci-
tance Problem. Adv. Comput. Math., 9:217–232, 1998.
[38] Valente FP and Pina HLG. Iterative solvers for BEM algebraic systems
of equations. Engineering Analysis with Boundary Elements, 22:117–124,
1998.
[39] van der Vorst HA and Vuik C. GMRESR: a family of nested GMRES
methods. Num. Lin. Alg. Appl., 1:369–386, 1994.
[40] Wang X. FastStokes: a fast 3-D fluid simulation program for Micro-Electro-
Mechanical Systems. PhD thesis, Massachusetts Institute of Technology,
2002.
[41] Wrobel LC. The Boundary Element Method, volume 1, Applications in
Thermo-Fluids and Acoustics. John Wiley & Sons, 2002.
[42] Yoshida K, Nishimura N, and Kobayashi S. Application of Fast Multipole
Galerkin boundary integral equation method to elastostatic crack prob-
lems in 3D. International Journal for Numerical Methods in Engineering,
50:525–547, 2001.
130