Advance Calculus Based Apllication
Advance Calculus Based Apllication
Numerical Mathematics
and
Advanced Applications
Proceedings of ENUMATH 2003
the 5th European Conference
on Numerical Mathematics and
Advanced Applications
Prague, August 2003
~ Springer
Editors
Miloslav Feistauer
Vit Dolejsf
Petr Knobloch
Karel Najzar
Charles University Prague
Facuhy of Mathematics and Physics
Department of Numerical Mathematics
Sokolovska 83, 186 75 Praha 8
Czech Republic
email:
feist, dolejsi, knobloch, [email protected]
These proceedings collect the major part of the lectures given at ENU-
MATH2003, the European Conference on Numerical Mathematics and Ad-
vanced Applications, held in Prague, Czech Republic, from 18 August to 22
August, 2003.
The importance of numerical and computational mathematics and sci-
entific computing is permanently growing. There is an increasing number of
different research areas, where numerical simulation is necessary. Let us men-
tion fluid dynamics, continuum mechanics, electromagnetism, phase transi-
tion, cosmology, medicine, economics, finance, etc. The success of applications
of numerical methods is conditioned by changing its basic instruments and
looking for new appropriate techniques adapted to new problems as well as
new computer architectures.
The ENUMATH conferences were established in order to provide a fo-
rum for discussion of current topics of numerical mathematics. They seek to
convene leading experts and young scientists with special emphasis on con-
tributions from Europe. Recent results and new trends are discussed in the
analysis of numerical algorithms as well as in their applications to challenging
scientific and industrial problems.
The first ENUMATH conference was organized in Paris in 1995, then
the series continued by the conferences in Heidelberg 1997, Jyvaskyla 1999
and Ischia Porto 2001. It was a great pleasure and honour for the Czech
numerical community that it was decided at Ischia Porto to organize the
ENUMATH2003 in Prague. It was the first time when this conference crossed
the former Iron Courtain and was organized in a postsocialist country.
The ENUMATH2003 was organized by the Faculty of Mathematics and
Physics of the Charles University in cooperation with the Department of
Mathematics of the Institute of Chemical Technology in Prague. The Charles
University, the oldest university in the Middle Europe, was founded in 1348.
In the middle ages, mathematics was studied in Prague at the Artistic Fac-
ulty, later at the Philosophical Faculty and in the 20th century it belonged to
the Faculty of Natural Sciences till 1952, when the Faculty of Mathematics
and Physics was founded. As follows from historical sources, already in the
15th century the students of the Charles University had the opportunity to
be trained in "Computational Mathematics". Kfisfan from Prachatice, who
vi Preface
M. Feistauer
V. DolejSi
P. Knobloch
K. Najzar
editors
Table of Contents
Plenary Lectures
Numerical Analysis of Finite Element Methods
for Eddy Current Problems. Applications to
Electrode Simulation
Summary. The objective of this work is to introduce and numerically solve a 3D-
mathematical model for steady thermoelectrical behavior of electrodes in a metal-
lurgical electric furnace. The mathematical model couples the time-harmonic eddy
current model with the heat transfer equations in a bounded 3D-domain. An impor-
tant part of the paper deals with the analysis and numerical solution of the eddy
current model in a bounded domain.
1 Introduction
Si O 2 + 2C = Si + 2CO.
This reaction takes place in submerged arc furnaces which use three-phase
alternating current. A simple sketch of the furnace can be seen in Figure 1. It
consists of a cylindrical pot containing charge materials and three electrodes
disposed conforming an equilateral triangle.
Electrodes are the main components of reduction furnaces and their pur-
pose is to conduct the electric current which enters the electrode through the
"contact clamps" (see Figure 1). The electric current goes down crossing the
column length comprised between the contact clamps and the lower end of the
column generating heat by Joule effect. At the tip of the electrode an electric
arc is produced, reaching temperatures of about 2500 °C which are needed for
the reduction chemical reactions to take place.
Classical electrodes extensively used in industry include pure graphite, pre-
baked and S¢derberg electrodes. The latter are the most used in ferro-silicon
industry and they are composed by paste consisting of a carbon aggregate
and a tar binding which are fed into a steel casing; the casing have steel
4 A. Bermudez et al.
Contact clamps: (
current entrance
fins attached to its inner part, which are placed radially in the cylinder. The
great amount of heat generated by Joule effect is partially employed to bake
the paste; this is a crucial process during which the initially soft/liquid non-
conductive paste at the top of the electrode becomes a solid conductor. The
advantages of S¢derberg electrodes with respect to pure graphite or prebaked
electrodes are that they are built in larger sizes and cost less. However, as the
electrode is consumed, it has to be slipped and the steel casing moves with the
carbon body so it melts and pollutes silicon. This is why they cannot be used
to obtain silicon metal or silicon with metallurgical quality, which is used as
alloying of other metals as aluminum. Thus prebaked electrodes have been for
many years the only alternative for commercial silicon metal production.
In the early nineties, the Spanish company Ferroatlantica S.L. built a new
compound electrode named ELSA ([14]) which serves for the production of
silicon metal. It seems to be the solution for all silicon furnaces because its
cost can be up to one third the price of a prebaked electrode.
ELSA electrode consists of a central column of baked carbonaceous mate-
rial, graphite or similar, surrounded by a S¢derberg-like paste (see Figure 2).
There is a steel casing without fins that contains the paste until it is baked
at the contact clamps zone. Two different slipping systems exist, one for the
casing and another one for the central column; the combination of both sys-
tems is necessary so as to slip the casing as little as possible and also to carry
out the correct extrusion of the carbon electrode. Then, unlike in the case of
S¢derberg electrodes, the casing is not consumed and it is possible to produce
silicon with metallurgical quality. The result is that the furnace operation is
similar to that of prebaked electrodes, but the compound electrode is less ex-
pensive. The disadvantage is that slipping velocity is not free as in prebaked
electrodes, jJecause the paste has to be baked before leaving the casing, so it is
necessary a minimum period of time between slippages. Thus, baking of paste
is a crucial point in the working of this type of electrodes.
Methods for Eddy Current Problems 5
Nipple
Support
system
".
In general, the design and control parameters of electrodes are very complex
and numerical simulation plays an important role at this point. Modeling the
involved phenomena in a computer allows us to analyze the influence of chang-
ing a parameter without the need of expensive and difficult tests. Thus, during
the last 20 years, an important number of mathematical models and computer
programs have been developed in order to simulate the thermo electrical be-
havior of classical electrodes (see for instance [15, 17, 18]). In particular, the
mathematical models based on cylindrical symmetry have been the most ex-
tensively used. However, ELSA electrode works in a different manner from the
classical electrodes. While classical electrode has only a constitutive material,
compound electrode combines a good electric current conductor as graphite
with a paste which becomes a good conductor only at high temperatures. Not
only the core of graphite is important in the movement of the column but also
in the distribution of current inside the electrode. Moreover, unlike S¢derbeg
electrodes, the non existence of fins gives a geometrical axisymmetry (see Fig-
ure 3).
This is why we first developed a finite element method based on cylindrical
symmetry to compute the electric current and temperature distribution in
a radial section of the electrode [5]. While the axisymmetric model has given
valuable information on important electrode parameters, the assumption of
cylindrical symmetry makes necessary to neglect the following facts:
The electromagnetic effect caused on one electrode by the two others, that
is the so called "proximity effect". This arises because the magnetic field
generated by each electrode induces eddy currents in the two others.
Thermal ,boundary conditions are not axisymmetric. Indeed, the tempera-
ture of the air around the electrode is greater on the surfaces oriented toward
the furnace center.
6 A. Bermudez et al.
Steel casing Graphite Steel casing
ELSA SODERBERG
The current entrance in the electrode through the contact clamps is not ax-
isymmetric. The current is transferred to the contact clamps through copper
bus tubes which in its turn are connected to three transformers with differ-
ent phases. Then, in each electrode, half of the clamps receive current from
one transformer while the other ones are connected to a second transformer.
These points can only be considered by using a pure three-dimensional model.
Moreover, 3D-models are always needed to simulate S0derberg electrodes be-
cause the presence of fins breaks cylindrical symmetry (see Figure 3). Thus, we
have developed a three dimensional thermo electrical model which is enough
general to model any kind of electrodes and even the complete furnace. In this
paper, we describe two different mathematical models and analyze them from
mathematical and numerical points of view.
The electromagnetic problem is obtained from the time-harmonic Maxwell
equations assuming the frequency is low enough as to neglect the term in-
volving the displacement current in Ampere's law. This is the so-called eddy
current model. Because of many interesting applications in electrical engineer-
ing, numerical simulation of eddy current problems have led to a great number
of publications in recent years (see for instance [1, 2, 3, 10, 11, 12, 13]). We
notice that Maxwell equations concern the whole space, but we are interested
in solving the problem in a bounded domain, so we have to define suitable
boundary conditions and this need represents the main difficulty to study the
problem in a bounded domain. Thus, we start introducing the eddy current
problem in the whole furnace, including the electrodes and the air around, and
defining natural and essential boundary conditions. In a second step we change
this model, by introducing realistic boundary conditions, to compute the elec-
tromagnetic fields in only one electrode. Finally, we couple the electromagnetic
model with a thermal one. Coupling between Maxwell and heat transfer equa-
tions is due to Joule effect which is the source term in the heat equation, and
to the fact that thermoelectrical parameters depend on temperature.
The outline of the paper is as follows: In Section 2 we deal with the math-
ematical and numerical analysis of the electromagnetic problem in a bounded
3D domain which includes conductors and dielectrics. We introduce a weak
Methods for Eddy Current Problems 7
formulation which involves the magnetic field in the conductor domain and
a scalar magnetic potential in the dielectric one. This hybrid formulation is
discretized by using Nedelec edge finite elements for the magnetic field and
standard piecewise linear continuous elements for the magnetic potential. The
resulting discrete problems are studied and error estimates are obtained under
mild smoothness assumptions on the solution. Section 3 is devoted to propose
and analyze a finite element method to solve the electromagnetic problem only
in one electrode. We introduce a weak formulation of the problem in terms of
the magnetic field and deal with boundary conditions directly related with
the intensities which enter the domain. Lagrange multipliers are introduced
to impose these "non standard" boundary conditions and the resulting mixed
formulations are studied following classical techniques. In Section 4, we couple
the electromagnetic problem with the thermal one and give a result concern-
ing existence of solution. We end the paper by reporting, in Section 5, some
numerical results obtained for ELSA and Soderberg electrodes.
In order to consider all the facts which are neglected in the axisymmetric
models, we start proposing a model to solve the eddy current problem in
a bounded domain like the one presented in Figure 4, which includes not
only conductors (the electrodes and wires supplying the electric current), but
dielectrics as well (the air).
curlH= J, (1)
iWfLH + curlE = 0, (2)
divB = 0, (3)
divD = p, (4)
with
B = fLH, D = EE, J = (TE, (5)
where H, J, B, E, and D are the complex amplitudes associated with the
magnetic field, the current density, the magnetic induction, the electric field
and the electric displacement, respectively; p is the electric charge density, fL
is the magnetic permeability, E is the electric permittivity and (T is the electric
conductivity.
8 A. Bermudez et al.
fD
Exn=g (6)
Hx n=f (7)
Methods for Eddy Current Problems 9
where H~01/2(~)3 denotes the dual space of H;~2(~)3 which, in its turn, is
the space of functions defined on TD that extended by 0 on 8[2 \ TD belong to
Hl/2(8[2)3. We assume that j.t, E, 0" E LOO([2), and that there exist constants,
!:!:.' f, and IZ, such that
j.t(x) ~ !:!:. > 0, E(X) ~ f > 0, a.e. in [2,
O"(x) ~ IZ > 0, a.e. in [2c' O"(x) = ° in [2D.
Then, multiplying the equation (2) by a test function of the space Vo,
integrating in [2, and using Green's formula, (1), (6), and (7), we obtain the
following weak formulation in terms of the magnetic field H.
Problem MP.- To find H E V such that
Once the magnetic field H is known, the current density J and the electric
field E can be computed in conductors, namely, J = curl Hand E = (lJ) In.
(I "e
These are the magnitudes actually needed in most applications and satisfy the
Maxwell equations (1)-(5) and boundary conditions (6)-(7) (see Theorem 3.2
in [7]).
Then, for (j E W(QD)' we have that grad(j E H(curl, DD) if and only if
(j E 8, in which case curl (grad (j) = 0 (see Lemma 3.11 in [4]). Then, for
all G E V there exist (j E 8 such that Gin = grad (j.
D
We introduce the following notation: for GeE L2 (De)3 and G D E L2 (DD)3 ,
we denote by (GeIG D) the field G E L 2(D)3 defined a.e. by
W o := {
(G,- - tJr
tJr) E W: grad - x n -1/2 (~) 3} .
. Hoo
= 0 m
Methods for Eddy Current Problems 11
~ d:i
gra '¥ x n =
f 2n
. H-l/2(J:)3
1 1~ 1
00 D'
pHh . Gh + 1~
nc (J"
curl Hh . curl Gh = /, g x n . Gh x n
rc
\lGhEV~.
The following lemma shows that each curl-free vector field in N h([JD) ad-
mits a multiple-valued potential in 8 h (see [7]).
grad JS h n = fj on r
1 1 ..!.
X D,
iw
Jl c
j1H h· G h +
Jl c (7
curlHh· curlGh + iw
JJlr
D
j1gradJSh · gradtT;h
Theorem 4. Let us assume that the solution (H, JS) of problem HP satisfies
HEW (curl, flc) and grad JS E W (flD)3, with r E (~, 1]. Then, problem
DHP is well posed, it has a unique solution (Hh,JS h ), and
The model described in the previous section presents some drawbacks. First, it
is highly complex and its numerical solution takes a lot of time. On the other
hand, it is difficult to obtain the boundary data f from realistic data such as
intensities or potentials, which usually are the only data we know. Then, we are
going to propose an alternative approach which consists in solving the eddy
current problem in one electrode which is a particular bounded conducting
domain. We are going to analyze a weak formulation of this problem in terms
of the magnetic field, considering realistic boundary conditions from the point
of view of applications. In particular, following Bossavit [12], we will consider
boundary conditions directly related with the input current intensities which
enter the electrode. We will impose these boundary conditions by means of
Lagrange multipliers and study the resulting mixed formulations.
Since we only consider the conducting domain, we will get an important
saving in computer time when compared with the model of the whole furnace,
and we will still be able to consider some important effects which are not taken
into account by the axisymmetric models, although not the proximity effect.
We consider a bounded conducting domain D having a Lipschitz-conti-
nuous and connected boundary. However, it is not necessary that D be simply
connected. Let 8D be the boundary of the domain D which splits into two
parts: 8D = t;, u t;. The surface r;, corresponds to the tip of the electrode
where the electric arc arises. In its turn, the rest of the electrode boundary
splits as follows:
- -0 -1 -N
r;=~ u~ U .. ·u~ ,
where ~n, n = 1, ... , N, are the parts of the boundary connected to the wires
supplying electric current to the electrode, and SO = ~ \ (~1 U .. :.-u ~N) is the
remaining part (see Figure 6). We also assume ~n n r;, = 0 and ~n n ~m = 0,
m, n = 1, ... , N, m # n.
Our goal is to solve the eddy current equations (1)~(5) subject to the
following boundary conditions:
where the only data In, n = 1, ... ,N, are the current intensities through each
wire.
Methods for Eddy Current Problems 15
I; (electric arc)
Condition (10) is the natural one to model the free current exit on the
electrode tip. Conditions (11) and (13) take into account the input intensities
and the fact that there is no current flow through r;D, respectively. Conditions
(12) and (14) have been proposed by Bossavit [12] in a more general setting.
They will appear as natural boundary conditions of the weak formulation of our
problem. The former implies the assumption that the electric current is normal
to the surface on the current entrance, whereas the latter means that the
magnetic field is tangential to the conductor surface. Of course, condition (14)
is not always fulfilled, but it is a good approximation in our model problem.
Next, we analyze a weak formulation of this problem in terms of the mag-
netic field and propose a finite element method for its numerical solution.
To obtain a weak formulation of the eddy current problem (1)-(5) with bound-
ary conditions (10)-(14) in terms of the magnetic field, we notice that the
boundary condition (14) implies that the tangential component of E on the
boundary of [2 is a gradient. In particular, we obtain that E x n = -V¢ x n
on 8[2 for some scalar function ¢ with ¢ = 0 on ~, because of (10).
Moreover, because of (12), ¢Irn must be constant. Then, multiplying the
equation (2) by a test function G such that curl G . n = 0 on r;D and
J
iw 1 -+1-
n
pH· G 1 curl H· curl G
n(J
- = O.
a(H, G) := iw 1 -+ 11-
n
fLH· G
n(}
curlH· curlG.-
Let £ be the following closed subspace of H~b2(r;):
a(H,G) = 0 VG E W(O).
b(G,v):= (curlG.n,vl r .
J
Theorem 6. Given I E eN, let HEX be the solution of problem PI. Then,
there exists a unique), E £ such that (H,)') is the only solution of problem
MPI. Furthermore, the following estimate holds:
Methods for Eddy Current Problems 17
Theorem 7. Given I E reP, problem DMPI attains a unique solution (Hh, Ah)
Furthermore, if the solution (H, A) of problem MPI satisfies H E Hr (curl, D)
with 1/2 < r :::; 1, then the following error estimate holds true:
References
16. Howison, S.D., Rodrigues, J.F., Shillor, M. (1993): Stationary solutions to the
thermistor problem, J. Math. Anal. Appl., 1742, 573-588.
17. Innva::r, R., Fidje, K., Sira, T. (1987): 3-dimensional calculations on smelting
electrodes. Reprinted in MIC-Model. Identif. Control 8, 103-115.
18. Innva::r, R., Olsen, L. (1980): Practical use of mathematical models for Soderberg
electrodes. In: Electric Furnace Conference Proceedings, vol. 38. Iron & Steel
Society, Warrendale, PA, 40-47.
19. Nedelec, J.-C. (1980): Mixed finite elements in ]R3, Numer. Math., 35, 315-34l.
20. Salgado, P. (2002): Mathematical and numerical analysis of some electromag-
netic problems. Application to the simulation of metallurgical electrodes. Ph.
D. Thesis, Universidade de Santiago de Compostela, Spain.
Space Decomposition Preconditioners
and Parallel Solvers
Radim Blaheta
Institute of Geonics AS CR
Studentski 1768, 70800 Ostrava- Pomba, Czech Republic
[email protected]. cz
Summary. This paper uses the general framework of space decomposition - sub-
space correction for providing an overview of Schwarz-type preconditioners. The con-
sidered preconditioners are one-level and two-level Schwarz methods based on an
overlapping domain decomposition, a two-level method with the coarse grid space
created by aggregation and a new two-level method with interfaces in the coarse
grid space. Beside the description and analysis, we discuss also some implementation
details, the use of inexact subproblem solvers etc. The efficiency of preconditioners
is illustrated by numerical examples.
1 Introduction
This paper concerns the numerical solution of large scale linear systems arising
from the finite element (FE) solution of boundary value problems. Especially,
the attention is payed to that numerical methods, which are suitable for im-
plementation on high performance parallel computers.
This motivates the interest in space decomposition (SD) pre conditioners
described in Section 2. These preconditioners provide a general framework,
which has many specific applications. This paper concentrates on Schwarz-type
preconditioners, which are usually based on the overlapping domain decompo-
sition (DD). This class itself involves many variants of the basic technique and
some of them will be reported in this paper. There are also many other use-
ful decompositions, which can be used for the construction of preconditioners.
Let us mention two examples: the hierarchical decomposition of FE spaces (cf.
[13, 20, 10]) or displacement decomposition for elasticity problems (cf. e.g. [4]
and the references therein).
An overview of the standard one-level and two-level overlapping DD meth-
ods can be found in Sections 3 and 4. These methods are also called Schwarz
methods according to the pioneering work of H.A. Schwarz from 1870, cf. [19].
But the real and rapid development of these methods, motivated by the inter-
est in parallel computing, starts in the second half of 1980's.
In Section 5, we describe a less standard two-level Schwarz method with
the auxiliary global problem created by aggregation of unknowns. This method
is useful in many cases, when application of the standard two-level methods
Space Decomposition Preconditioners and Parallel Solvers 21
requires a lot of extra work involved in creating an auxiliary coarse grid and
relating this grid to the original one.
Section 6 is devoted to the description of a new two-level method, which
uses non-overlapping DD and interaction of subdomain problems only via
a special coarse grid space with interfaces.
Some aspects of implementation of the preconditioners on parallel comput-
ers, the use of inexact subproblem solvers and use of nonsymmetric multiplica-
tive or hybrid pre conditioners are discussed in Section 7. The precondition-
ers, which are not symmetric positive definite, can be efficiently implemented
within the generalized preconditioned conjugate gradient method (GPeG), cf.
[4] and the references therein. In Section 8, we provide some numerical results
illustrating the efficiency of the described methods and corresponding solvers.
In the final section, we summarize the results and mention some topics not
covered in the paper.
2 Abstract SD preconditioners
Our aim is the solution of abstract discrete symmetric elliptic problem in the
following form
finduEV: a(u,v)=l(v) \/vEV, (1)
where V is a finite dimensional subspace of a Hilbert space V = V(f?) of
functions defined in a domain f? C Rd (d = 2,3), a is a bounded symmetric
positive definite (SPD) bilinear form on V and 1 is a bounded linear functional
on V.
To be more specific, we shall consider elliptic boundary value problems
with the bilinear form
ouov
I: k
( d
a(u, v) = if ij ax ax dx (2)
f"l i,j=1 2 J
defined in the Sobolev space Hl(f?) equipped with the seminorm 1·IH1(f"l) and
the norm 11·IIH1(f"l)' We assume that K = (k ij ) is a symmetric positive definite
d x d matrix, which guarantees the existence of positive constants 11:1, 11:2, such
that
(3)
Sometimes, we shall assume that V c HI (f?) is such that
find u E V: Au = b, (5)
(6)
9=0
for k = 1, ... ,m do
9 <-- 9 + hAk1 Rk Zk
end
(7)
(8)
k=2 k=2
This preconditioner can be again symmetrized to the form
m
i.e. GSHA is equal to identity on range R(PI ) and to the additively pre-
conditioned operator GAA on range R(I - H). Thus, we can expect that
the hybrid preconditioner will be more efficient than the additive one, cf.
also [17].
Further in this paper, we shall exploit the following two assumptions, which
characterize the considered decomposition:
Note that IlviiA = V(v, V)A. A trivial upper bound is Kl = m. But we are
interested in m independent bounds for K I , which can be found e.g. by
considering the angles of subspaces Vk. If
Sometimes, one of the subspaces, say VI, plays an exceptional role. Then
it may be useful to consider E1 = (Ski, k, Ii-I) and the estimate
(17)
Remark 1. The estimate of .Amin(GAA) is well known as Lions' lemma, see [16].
The proof of Theorem 1 as well as some historical remarks can be found e.g.
in books [13, 20, 18].
r'
Theorem 2. For the multiplicative preconditioner, we get
Remark 2. The proof of the estimate (19) is more technical, see [7]. The com-
plete proof can be found e.g. in [20, 18]. The estimate (20) easily follows from
(19).
Space Decomposition Preconditioners and Parallel Solvers 25
n= nr u ... u n~ (21)
now induces a decomposition (6) of the finite element space V Vh into the
subspaces VI"'" Vm ,
Vk = {v E V: v = 0 in st \ nO . (22)
V=LVk, vk=Ih(8kv).
k
- Property (A1):
k k k TenZ, TETh
see e.g. [10]' Lemma 7.4.17 or [12]. Moreover, T c T", can belong to at most
me subdomains stZ. Thus,
where aT(vi, Vi) denotes the restriction of the bilinear form a to T. Each
T E T,. belongs to at most me sub domains and therefore L VaT(Vi,Vi)
has at most me nonzero terms. The Cauchy-Bunyakowski-Schwarz (C.B.S.)
inequality thus gives
Hence,
D
Normally, the overlap 0 is kept proportional to the size of subdomain H s , i.e.
o= {3Hs, where (3 is the proportionality constant. In this case, a combination
of Theorems 1, 2, 3 says that cond(GAA) and cond(GsMA) deteriorate with
the increasing number of subdomains (Hs -+ 0,0 -+ 0). A remedy can be found
in adding an auxiliary coarse grid FE space Va to the space decomposition with
local FE spaces (22), see the next section.
v = Va + V1 + ... + Vm , (24)
Theorem 4. The decomposition (24) fulfils the assumptions (AI), (A2) with
the constants
(25)
- Further,
m m m
L
k=1
Ilvkll~ :s: "'2 L
k=1
IVkl~l(Sl~)
Let V = Vh = span{ <p~, ... ,<p~} and let the index set {I, ... ,n} be decomposed
into groups G 1 , ... ,GN. Then it is possible to define aggregated basis functions
1/;k and the space Va C V as follows,
Space Decomposition Preconditioners and Parallel Solvers 29
(28)
We shall assume that the aggregations are regular, i.e. there is a constant j3
such that each supp 1,Uk contains a ball with diameter j3H, where
V = Va + V1 + ... + Vm . (29)
Theorem 5. The decomposition (29) fulfils the assumptions (A1), (A2) with
the constants
(30)
Sketch of the proof. We can follow the same procedure as in the proof of The-
orem 4 but instead of L2 -projection to Vh , we shall consider an averaging
operator Q.
- Let us consider the mapping Q: Vh ~ Va defined as follows:
for v E Vh , Qv =
N
L ak1,Uk,
k=l
ak = 1
isupp 1,Uk i
1
supp 1/lk
v(x)dx. (31)
-
iQ v iHl(r.?) ::; 0"1 V(H
h i v iH1(r.?) , (32)
- Then
- Further
30 R. Blaheta
m m m
- The above estimates give (30). The estimate of KI is the same as in Theo-
rem 4. 0
V = Va + VI ... + Vm = Va + W , (35)
Fig. 2. A coarse grid with interfaces (left) and its refinement with the domain
decomposition (right)
Fig. 3. Fine grid and domain decomposition (left) and aggregation out of the inter-
faces between the sub domains (right)
Ko = 1/(1-,,), (36)
where" = cos(Vo, W o) < 1 for any Wo C W such that V = Vo EEl Woo
k=1
~ 2
L.....llvkllA = IIVollA2 + IlwollA2::: 1
-1-llvo + wollA2 12
= -1-llvIlA'
k=O - , - ,
o
Note that the constant, < 1 appeared also in the analysis of the hybrid
preconditioner G H. We simply get
because 2:;;'=1 Pk is now again (,)A orthogonal projection. Note also that
G H = G M in this case. More details can be found in [5].
As a consequence of Theorem 6, the investigation of convergence of the
two-level Schwarz method with interfaces can be reduced to investigation of
the constant, = cos(Vo, Wo). This investigation of the strengthened C.B.S.
inequality can be performed locally on macroelements. It will be illustrated for
a simple 2D problem from Fig. 2 in the following theorem. More results will
be given in a forthcoming paper [5].
Theorem 7. Let us consider the problem (1) with orthotropic bilinear form (2)
on the domain n c R2 and the situation similar to Fig. 2. It means that there
is a coarse grid TH with coarse rectangular triangles inside the subdomains and
special rectangular macroelements and fine triangles along the boundaries of the
subdomains, see Fig. 2, 4. There is also a refined triangulation Tit, which con-
sists from congruent rectangular isosceles triangles with two axiparallel sides.
These triangles arise from TH by division of each coarse triangle and each
boundary macroelement into four congruent triangles, see Fig. 4.
Let Vo be the FE space corresponding to TH and W be the union of subspaces
Vk defined in (34). Let Wo contain the functions from W, which are zero in
the nodes belonging to TH . The nonzero coefficients of the bilinear form (kl1
and k 22 )) are assumed to be constant on the macroelements from TH .
Then
~
~~
T3 : T
T4 T4
~ TJ :T2
Fig. 4. The coarse triangles with refinement (a) and boundary macroelements with
3 triangles and their refinement into 4 triangles (b), (c)
Proof. The C.B.S. constant can be investigated locally on the inner and bound-
ary macro elements. If
aE( v, w) = 1'"
E.
ov -;:;-dx
L.J k ii -;:;-
,
ow
uXi uXi
and
aE(v,w) :::; "YEVaE(V, v) vaE(w,w) ,
for each macroelement E and functions v and w, which are restrictions of
v E Va and w E Wo to the macroelement E, then
"Y <
- maX"YE·
E
ow ow ow ow
-;:;-- IT!
uX2
= --;:;--
UX2
IT2 , -;:;-- IT!
uXl
= -;:;--
uXl
IT2 ,
34 R. Blaheta
i.e. IE = V
kl1
kl1
+ k22
.
Au = b, u,b E R n , (39)
8 Numerical results
The efficiency of various SD pre conditioners described in this paper can be
compared by solving a simple model problem
[}2u [}2u
kll [} 2
Xl
+ k22 [} X 2 = f in D = (0,2) x (0,3) ,
2
U = 0 on [}D,
discretized by linear triangular FE. We use the uniform grid with the mesh
size h = 1/30, sub domains Dk = (0,2) X (Xk' Xk+l) and overlap 0 = 2h. The
subproblems are solved exactly. The required numbers of iterations for the
accuracy c = 10- 3 and various additive (AP) and hybrid (HP) SD precondi-
tioners can be seen in Table 1. The hybrid preconditioners are used in nonsym-
metric form in combination with GPCG[l]. The coarse grid is either nested
coarse triangular grid, aggregation with clustering 2 x 2 square macroelements
or the same aggregation with interface. For each number of subdomains, the
first column shows the numbers of iterations for kll = k22 = 1, the second one
for kll = 1, k22 = 1/100 and the last one for kll = 1, k22 = 100.
Table 1. The numbers of iterations for the relative accuracy E = 10- 3 . AP de-
notes additive preconditioner, HP denotes nonsymmetric hybrid pre conditioner +
GPCG[I].
Number of subdomains
Type Coarse grid space 4 8 12 16 24
AP one level 16 4 26 22 4 38 23 5 51 31 5 61 37 6 78
AP nested, H = 3h 7 7 16 8 7 20 8 7 23 7 7 26 8 7 31
HP nested, H = 3h 6 6 13 6 7 16 6 7 20 6 7 22 6 6 27
AP regular agg's 2h 13 8 19 15 8 23 16 9 27 17 9 30 17 9 36
HP regular agg's 2h 10 7 15 11 7 19 11 7 22 11 8 25 11 8 29
AP agg's 2h + into 14 8 22 14 8 30 14 8 36 14 9 40 14 9 46
HP agg's 2h + into 8 4 14 7 5 16 8 5 19 8 5 22 8 5 25
Table 2. Results from the solution of the large-scale 3D elasticity problem with the
relative accuracy E: = 10- 4 . #P denotes the number of exploited processors, m is the
number of subdomains.
No coarse grid Aggregation 2 x 2 x 2 Aggregation 5 x 5 x 5
#m #P #It T [s] #P #It T [s] #P #It T [s]
2 2 104 428 3 45 268 3 56 243
3 3 113 321 4 47 242 4 59 176
4 4 121 268 5 51 240 5 62 144
5 5 128 234 6 53 236 6 65 127
6 6 133 211 7 55 244 7 67 114
7 7 136 193 8 57 265 8 70 105
9 Concluding remarks
References
1. Axelsson, 0., Blaheta, R. (2001): Two simple derivations of universal bounds for
the C.B.S. inequality constant. Report 0133, Dept. Math., University Nijmegen
2001. To appear in App!. Math.
2. Bj¢rstadt, P.E., Dryja, M., Vainikko, E. (1997): Additive Schwarz methods with-
out sub domain overlap and with new coarse spaces. In: Glowinski, R. et a!.
(eds) Domain Decomposition Methods in Sciences and Engineering. Proc. 8th
Int. Conf. on Domain Decomposition Methods. J. Wiley
3. Blaheta, R. (1988): A multilevel method with overcorrection by aggregation for
solving discrete elliptic problems. J. Compo App!. Math., 24, 227-239
38 R. Blaheta
Thierry Gallouet
1 Introduction
In the industrial context, efficient numerical simulators are often developped
after a long "trial and error" procedure. The efficiency of the simulators may
be evaluated, for instance, by the fact that the solution satisfies some natu-
ral constraints and that it is in agreement with experimental data. In some
cases, estimates on the approximate solutions allow to obtain the convergence
of some sequences of approximate solutions as the discretization size tends
to O. However, it is not easy to give the answer to the following question:
"What problem has a unique solution which is the limit of the approximate
solutions?" .
This paper will focus on the problem of boundary conditions needed in the
discretization of nonlinear hyperbolic equations or systems of equations; this
problem is not yet clearly understood in many cases. Two different cases will
be presented: a two phase flow in a pipeline and a two phase flow in a porous
medium.
A "simple" model for a two phase flow in a pipeline (see [8]' for instance) leads
to a 3 x 3 system of conservations laws. The unknown w is a function from
(0,1) x R+ in R 3 , solution of the following system:
and liquid) and the third one is the momentum equation for the mixture. The
expression of the given function F: R3 ~ R3 is quite complicated. It takes
into account thermodynamical laws and a hydro dynamical law. System (1)
is hyperbolic: for any W E R 3 , the Jacobian matrix DF(w) is diagonalizable
in R. The three eigenvalues can be ordered: .A.l(W) < .A.2(W) < .A.3(W). In
real situations, the first eigenvalue, .A.l(W) is negative and the third, .A.3(W),
is positive (they correspond to some "pressure waves" which are related to
a "sound velocity"). The second eigenvalue, .A. 2 (w), corresponds to some mean
velocity between the two phases and can change sign. One can also note that
the field related to this second eigenvalue is quite complicated because it is
not, in general, a genuinely nonlinear field or a linearly degenerate field. In
petroleum engineering, the wave associated to this second eigenvalue is a "void
fraction wave"; engineers require a good representation of this wave in the
numerical simulations.
°
In order to discretize Problem (1), (2) and some boundary conditions, which
will be introduced later, let h = f:t
(with N E N*) be the mesh size and k >
be the time step (assumed to be constant, for the sake of simplicity). The
discrete unknown are the values wi E R3 for i E {I, ... , N} and n E N. The
discretization of the initial condition leads to
For the computation of wi for n > 0, one uses an explicit, 3-points scheme:
For i E 1, ... , N -1, one takes F21-! = g(wi, wi+1)' where 9 is the numerical
flux. It has to satisfy, in particular, the classical consistency condition, namely
g(a, a) = F(a), and needs to be chosen in order to obtain some stability
properties for the numerical scheme under a so called CFL condition on the
time step (see Sect. 3 for the study of a scalar model). In the case of two
phase flow in a pipeline, the classical numerical fluxes such as the Godunov
flux (see [9]) or the Roe flux (see [11]) may not be implemented, because of
computational difficulties. A convenient choice is obtained with a simplified
Roe flux, namely g(a,b) = g(a)tg(b) + ~IA(a,b)l(a - b), where A(a, b) is some
appoximation of the Jacobian matrix, depending on a and b, but not satisfying
the so called Roe condition, see [8].
Remark 2. In fact, for the simulation of a two phase flow in a pipeline, the
magnitude of the so-called fast eigenvalues, A1 and A3, is much greater than
that of A2; the choice in [8] is to use an implicit scheme with respect to the fast
eigenvalues, whereas the eingevalue A2, which corresponds to the void fraction
wave, is handled with an explicit second order discretization, since the void
fraction wave needs to be simulated precisely (see [8] for details).
In order to compute Fr 2
(and similarily F;:,{+l) a good way is to know, or
2
to determine, some artificial value Wo E R3 (and wN+1 E R 3 ) and to take
Fr2
= go(wo,w]') (and F;:'{+l2 = gl(WN,WN+1))' The numerical fluxes go and
gl can be chosen equal to g, but this is not at all necessary (see the convergence
result of Sect. 3); in fact, there are numerous situations where one should take
go and gl different from g. Indeed, the scheme is often very sensitive to the
computation of the boundary fluxes and it is often worthwhile to use a more
42 T. Gallouet
precise, but also more expensive numerical flux (such as the Godunov flux, for
instance) for the computation of the boundary fluxes than for the computation
of the interior fluxes. The difficulty is now to determine these artificial values,
Wo and w N+1'
Remark 3. In some cases, the choice of Wo
and w N+1 is quite easy. A well
known example is given by the wall-boundary condition for the Euler equa-
tions (with a perfect gas state law or a more general state law). For the sake
of simplicity, let us mention the one-dimensional case; the generalization to
a multi-dimensional case is quite easy. The Euler equations may be written
the form (1), corresponding to conservation of mass, momentum and energy,
with w = (p, pu, E) t, where p is the density of the fluid, u its velocity, and
E its energy. The wall-boundary condition at x = 0 is u = 0, and the only
component to compute for the boundary condition is the second component of
F'l which is equal here to the pressure at x = 0 (since u = 0 at the wall), say
2
Pl' The value wr
may be computed from the values pr, ur
and pr. A natural
Po Uo Po = pr. The flux F'l
2
choice for Wo is to take = pr, = -ul' and (that is
the value Pl) is then obtained with F'l = go(wo,
2 2
wr) 2
and a convenient choice
of the numerical flux go. We suggest to choose go as the Godunov flux (or as
a linearized Godunov flux, see [3] for instance). Numerical tests which were
performed in [3] show that this choice is very satisfactory, even in the difficult
case of a strong depressurization at the boundary. These tests also show that
the pressure obtained with the Roe flux is not so satisfactory and neither is
the choice Pl =
2
pr
which may seem natural (in particular, in 2D simulations,
using a dual mesh obtained with a finite element primal mesh).
In most cases, however, the choice of Wo
and wN+1 is not so easy. A possible
method, which is described in [4], is now layed out, for a fixed n and go given:
1. Compute DF(wr), its eigenvalues {AI, A2, A3} and a basis of R 3 ,
{'P1,'P2,'P3}, such that DF(wr)'Pi = Ai'Pi, i = 1,2,3.
2. Write wr on the basis {'PI, 'P2, 'P3}, namelywr= (};1'P1 + (};2'P2 + (};3'P3,
3. Let P be the number of positive eigenvalues, compute Wo = (31 'PI + (32'P2 +
(33'P3 and F'l = go(w o , wI), where the three unknowns (31, (32, and (33 are
2
determined by the P equations stating the boundary conditions (note that
these equations involve the components of F'l) and by the 3 - P equalities
2
(3i = (};i for Ai < O.
This method is not always satisfactory. In the case of severe slugging for
the simulation of two phase flow in a pipeline, the method seems to perform
well at x = 0, where the eigenvalues ),,1 and ),,2 are always positive and the two
boundary conditions (gas and liquid fluxes) are convenient. However, at x = 1,
the second eigenvalue sometimes becomes negative and one needs a second
boundary condition (the first one is a condition on the pressure). A natural
condition seems to be Ql = 0, where Ql is the second component of the flux F,
that is the liquid flux, but this condition does not lead to good results. Other
possible choices of this additional boundary condition at x = 1 were tested and
did not give good results. A possible interpretation of this problem is the fact
that the sign of ),,2 is computed with wN' Roughly speaking, it is "too late"
when ),,2 (wN) becomes negative (see Sect. 3 for the study of a simple scalar
°
case). Indeed, good results (in agreement with experiments) are obtained with
the unilateral condition Ql ~ (whatever the sign of ),,2(wN )). It consists in
using the preceeding method (for the boundary condition at x = 1) and in
replacing, in the numerical scheme (4), the second component of F;':;'+~ by its
positive part. Then, if ),,2 (w N) < 0, two boundary conditions are given at x = 1
(pressure and Ql = 0) and if ),,2(W N) ~ 0, one boundary condition is given at
x = 1 (pressure) but, in (4), the second component of F;':;'+~ is replaced by its
positive part.
In the following section, we will try to understand the sense of this bound-
ary condition in a simplified scalar case.
-Ii
°
As in the previous section, let h = (with N E N*) be the mesh size and
k > be the time step (assumed to be constant, for the sake of simplicity).
The discrete unknowns are now the values ui E R for i E {I, ... , N} and
n E N. In order to define the approximate solution a.e. in (0,1) x R, one sets
Uh,k(X, t) = ui for x E ((i -l)h, ih), t E (nk, (n + l)k), i E {I, ... , N}, n E N.
The discretization of the initial condition leads to
(9)
C1: g is non decreasing with respect to its first argument and nonincreasing
with respect to its second argument,
C2: g(s, s) = f(s), for all s E [A, BJ,
C3: g is Lipschitz continuous.
Let L be a Lipschitz constant for g (on [A, B]2) and ( > 0. If (0,1) is
replaced by R, it is well known (see e.g. [4]) that, if k :::; (1- ()~, the approx-
imate solution Uh,k, that is the solution defined by (7)-(9) (with i E Z), takes
its values in [A, B] and converges towards the unique entropy weak solution of
(5)-(6) in Lfoc(R x R+) as h ---+ 0.
In the case x E (0,1) instead of x E R, one assumes the same conditions
on g, namely (C1)-(C3). In order to complete the scheme, one has to define
n 2
and f~+l·
2
f ~n -- go (-n
u ,un). -n - ll(n+l)k -(t)dt
1 , U - k nk U
(10)
n
f N+~ -- gl (n =n ). = _ 1 1(n+1)k =(t)dt
UN' U , , u - k nk U ,
Then a convergence theorem can be proven as in the case x E R, see [13]:
B a.e. on (0, I), A ::; u,u ::; B a.e. on R+. Let gO,g1: [A,B]2 --> R,
satisfying (Cl)-(C3). Let L be a common Lipschitz constant for g, go and g1
(on [A,B]2) and let ( > o. Then, ifk::; (1-()~, the equations (7)-(10) define
an approximate solution Uh,k which takes its values in [A, B] and converges
towards the unique solution of (11) in Lfoc([O, 1] x R+) for any 1 ::; p < 00, as
h --> 0:
u E LOO((O, 1) x (0,00)),
roo r1
[(u _ ",)±cpt + sign±(u - ",)(f(u) - f("'))CPxl dxdt
Jo Jo oo
+M 1o
(u(t) - ",)±cp(O, t)dt +M 1
1 0
00
Remark 4.
1. It is interesting to remark that this convergence result is also true if the
function 9 depends on i and n, provided that L is a common Lipschitz
constant for all these functions.
2. The definition (11) of solution of (5)-(6) with the "weak" boundary con-
ditions u and u at x = 0 and x = 1 is essentially due to F. Otto, see
[10].
3. It is interesting also to remark that if one replaces, in (11), the two en-
tropies (u - "')± by the sole entropy lu - "'I, one has an existence result
(since lu - "'I = (u - "')+ + (u - "')-) but no uniqueness result, see [13] for
a counter-example to uniqueness.
4. This convergence result can be generalized to the multidimensional case,
see Sect. 5 and [13].
One considers here Equation (5), with initial condition (6) and weak boundary
condition u and u
at x = 0 and x = 1, that is in the sense of (11), in the
particular case l' > O. In this case, the main example of numerical flux is
46 T. Gallouet
9 = go = gl, g(a, b) = f(a), which leads to the well known upstream scheme.
With this choice of go and gl, using the notations of Sect. 3.1, the boundary
conditions are taken into account in the form:
with un = f J~~+l)k 11(t)dt. One may apply the general convergence theorem.
The approximate solutions converge (as h ----) 0) towards the solution of (11).
In this case, the approximate solutions, as well as the solution of (11), do not
depends on U.
In the case l' < 0 the main example is 9 = go = gl, g(a, b) = f(b), which
also leads to the upstream scheme. The boundary conditions are taken into
account in the following way:
and does not lead to the desired results. Note also that f~+!, given by (17),
is a discontinuous function of uN.
The second method, described in Sect. 2, uses the fact that the liquid flux
cannot be negative at x = 1. Since the liquid flux at x = 1 is f(l) - fN+! and
48 T. Gallouet
since, in this case, ga (un, u 1) = f (un). The fact that u~ :S f3 is true for all n
if T is not too large. If T is too large, the convergence result can be applied
with (21) instead of (16).
+M 1 00
o
0
- Boundary condition at x = °
(recall that u is given by (20)): u(O, t) = a or
u(O, t) :::: (3. In fact, if T is not too large, one has u(O, t) = a.
- Boundary condition at x = 1: u(l, t) ::; Urn or u(l, t) = 1.
Thanks to Theorem 1, it is possible to give other choices for f~+~ for which
the approximate solutions obtained with this new choice of f~+~ converge
towards the same function u, which is the unique solution of (22). Indeed, let
h: [0,1] ----* R be a nondecreasing function such that h::; f and h(l) = f(l)
and take:
f~+l =
2
h(uN)' (23)
One may construct a function gl satisfying (C1)-(C3) such that h(s) =
gl (s, 1), for all s E [0, 1], and then use Theorem 1. Let L be a common Lipschitz
constant for g and gC and gl (on [0,1]2) and let ( > 0. If k ::; (1 - ()£, the
approximate solution Uh,k, that is the solution defined by (7)-(9), with the
boundary fluxes (23) and (21) (and Uo = 0, U = 1 and u given by (20)),
takes its values in [0,1] and converges towards the unique solution of (22) in
Lfoc([O, l] x R+) for any 1 ::; p < 00, as h ----* 0.
Turning back to the complete system described in Sect. 2, the analysis of
this simplified model for two phase flows in pipelines may also suggest another
way to take into account the boundary condition at x = 1 (with a given
numerical flux gl):
1. Compute DF(wN)' its eigenvalues {AI, A2, >d and a basis ofR3 ,
{'P1,'P2,'P3}, such that DF(WN)'Pi = Ai'Pi, i = 1,2,3.
2. Write wN on the basis {'PI, 'P2, 'P3}, namely wN = a1'P1 + a2'P2 + a3'P3·
50 T. Gallouet
A second example is given by the modelization of a two phase flow, oil and
water (for instance), in a porous medium. Phases are immiscible. Compress-
ibility and capillarity effects are neglected. The model is obtained using the
conservation of mass for each phase and Darcy's law. This study is limited to
the one dimensional case. In this case the pressure can be eliminated and the
problem is reduced to a single equation, namely (5) with:
11 =a,
2
n ~ 0. (26)
At x = 1, The boundary condition is quite complicated. A simple example
is (see [7] for a more complete study):
r 1 = h(u'N)a (27)
N+'2 h(u'N) + h(u'N)
Then the approximate solution is given with (7)-(9), g given by (25), and
(26)-(27).
sign(u(O, t) - um)(J(u(O, t)) - f(K)) :S 0, VK E [um, u(O, t)], for a.e. t E R+,
sign(u(l, t) - 1)(J(u(l, t)) - f(K)) ~ 0, VK E [1, u(l, t)], for a.e. t E R+,
with [a, b] = {ta + (1 - t)b, t E [0, I]} and sign(s) = 1 for s > 0, sign(s) = -1
for s < 0, sign(O) = 0.
This gives u(O, t) = Um or u(O, t) = 1 and u(l, t) :S Um or u(l, t) = l.
In particular, at x = 0, one has f(u(O, t)) = a (only water is injected) and,
at x = 1, f(u(l, t)) < a if u(l, t) < Um (which states that there is some oil
production).
°
f: R -> R Lipschitz continuous) and v E C 1 (R2 X [0, T]) -> R2 with
div(v(-, t)) = in R2 for all t E [0, T]. The unknown is u: Q x (0, T) -> R.
Let Uo E £OO(Q) and 11 E £OO(aQ x (0, T)). Let A, B E R be such that
A :S Uo :S B a.e. on Q and A :S 11:S B a.e. on aQ x (0, T). Following the work
of [10]' an entropy weak solution of (29) with the initial condition Uo and the
(weak) boundary condition 11 is a solution of (30):
11
o
T
n
[(u - K)±c.pt + sign±(u - K)(J(U) - f(K))V· gradc.p]dxdt
T
+M r r (11(t) - K)±c.p(X, t)d"((x)dt (30)
Jo Jan
+ L(uo - K)±c.p(X, O)dx ~ 0,
VK E [A,B], Vc.p E C;(Q x [O,T),R+),
where d-y(x) stands for the integration with respect to the one dimensional
Lebesgue measure on the boundary of Q and M is such that
Boundary Conditions for Hyperbolic Equations or Systems 53
where Ilvll oo = sUP(X,t)EnX[O,T]lv(x, t)1 (and 1·1 denotes here the Euclidean
norm in R2).
Remark 6.
°
a solution of (29) (on a weak form), u satisfies some entropy inequalities in
fl x (0, T), namely Iu- lilt +div(v(j(max(u, Ii)) - f(min(u, Ii)))) ::::; for all
Ii E R, but also on the boundary of ofl and on t = 0. u satisfies the initial
°
condition (u(·,O) = uo) and u satisfies partially the boundary condition.
For instance, if l' > and u is regular enough, then u(x, t) = u(x, t) if
x E ofl, t E (0, T) and v(x, t) . n(x, t) < 0, where n is the outward normal
vector to ofl.
2. Let M ::::: 1. It is interesting to remark that u is solution of (30) if and only
if u is solution of (30) where the term fn(uo - 1i)±cp(X, O)dx is replaced by
M fn(uo - 1i)±cp(X, O)dx.
11 iT 1
u E LOO(D x (0, T) x (0,1)),
For this step, one chooses M not only greater than the Lipschitz constant
of Ilvlloof on [A, B], but also greater than the Lipschitz constant (on [A, B]2)
of the numerical fluxes associated to the edges of the meshes (the equivalent
of L in Theorem 1). This choice of M is possible since the unique solution
of (30) does not depend on M provided that M is greater than the Lipschitz
constant of Ilvlloof on [A, B] and since it is possible to choose numerical fluxes
(namely, Godunov flux, for instance) such as the Lipschitz constant of these
numerical fluxes is bounded by the Lipschitz constant of Ilvlloof (then, the
present method leads to an existence result with M only greater than the
Lipschitz constant of Ilvlloof on s E [A, B], passing to the limit on approximate
solutions given with these numerical fluxes).
STEP 4: UNIQUENESS OF THE SOLUTION OF (31). In this step, the "dou-
bling variables" method of Krushkov is used to prove the uniqueness of the
solution of (31). Indeed, if u and ware two solutions of (31), the doubling
variables method leads to:
+ 11 11 lin
(32)
(f(max(u, w)) - f(min(u, w)))v· grad<pdxdtdood(3 2: 0
Vip E Cl(D x [0, T), R+),
Taking <p(x, t) = (T - t)+ in (32) (which is, indeed, possible) gives that u
does not depend on a, v does not depend on (3 and u = v a.e. on D x (0, T).
As a result, u is also the unique solution of (30).
STEP 5: CONCLUSION. Step 4 gives, in particular, the uniqueness of the
solution of (30). It also implies that the nonlinear weak-* limit of sequences
of approximate solutions is a solution of (30) and, therefore, it guarantees the
existence of the solution of (30). Furthermore, since the nonlinear weak-* limit
of sequences of approximate solutions does not depend on a, it is quite easy
to deduce that this limit is "strong" in LP(D x (0, T)) for any p E [1,00) (see
[4], for instance) and, thanks to the uniqueness of the limit, the convergence
holds without extraction of subsequences.
Boundary Conditions for Hyperbolic Equations or Systems 55
References
1. Bardos, C., LeRoux, A.Y., Nedelec, J.C. (1979): First order quasilinear equations
with boundary conditions. Comm. Partial Differential Equations, 9, 1017-1034
2. Brenier, Y., Jaffre, J. (1991): Upstream differencing for multiphase flow in reser-
voir simulation. SIAM J. Num. Ana. 28, 685-696
3. Buffard, T., Gallouet, T., Herard, J.M. (2000): A sequel to a rough Godunov
scheme: Application to real gases. Computers and Fluids, 29, 813-847
4. Eymard, R., Gallouet, T., Herbin, R. (2000): Finite volume methods. Handbook
of numerical analysis, Vol. VII, 713-1020. North-Holland, Amsterdam
5. Kagan, A.M., Linnik, Y.V., Rao, C.R. (1973): Characterization Problems in
Mathematical Statistics. Wiley, New York
6. Eymard, R., Gallouet, T. (2003): H-convergence and numerical schemes for ellip-
tic equations. SIAM Journal on Numerical Analysis, 41, Number 2, 539-562
7. Eymard, R., Gallouet, T., Vovelle, J.: Boundary conditions in the numerical ap-
proximation of some physical problems via finite volume schemes. Accepted for
publication in "Journal of CAM"
8. Faille, 1., Heintze, E. (1999): A rough finite volume scheme for modeling two
phase flow in a pipeline. Computers and Fluids, 28, 213-241
9. Godunov, S. (1976): Resolution numerique des problemes multidimensionnels de
la dynamique des gaz. Editions de Moscou
10. Otto, F. (1996): Initial-boundary value problem for a scalar conservation law. C.
R. Acad. Sci. Paris Ser. I Math. 8, 729-734
11. Roe, P.L. (1981): Approximate Riemann solvers, parameter vectors, and differ-
ence schemes. J. Compo Phys., 43, 357-372.
12. Patault, S., Q.-H. Tran, Q.H. (1996): Modele et schema numerique du code
TACITE-NPW, tech. report, rapport IFP 42415
13. Vovelle, J. (2002): Convergence of finite volume monotone schemes for scalar
conservation laws on bounded domains. Num. Math., 3, 563-596
Fictitious Domain Methods in Shape
Optimization with Applications In
Free-Boundary Problems
Summary. This paper deals with a class of 2D shape optimization problems with
a 'flux' cost functional and a fictitious domain formulation of state constraints. These
constraints are given by nonhomogeneous Dirichlet boundary problems in bounded,
doubly connected domains. This approach is used for the numerical realization of
free-boundary problems of Bernoulli type.
1 Introduction
This paper deals with a particular shape optimization problem with a fictitious
domain (FD) formulation of the state equation. Solvers which are based on
FD formulations represent nowadays one of efficient tools for solving large
scale algebraic systems arising from discretizations of state problems. The
fact that the new FD problem is solved in a domain n with a simple shape
(a box, e.g.) enables us to construct uniform partitions of n and consequently
to use fast solvers and special preconditioning techniques. FD solvers have
additional advantages when used in shape optimization. To see that let us
recall the standard approach in shape optimization which is based on boundary
variations of admissible domains. Let us suppose that a linear state problem is
solved by a standard finite element method and a gradient type method is used
for the minimization of the cost functional. Then the following steps have to be
performed after every change of the shape: (i) remeshing the new configuration;
(ii) assembling the new stiffness matrix and the right-hand side of the linear
algebraic system; (iii) solving this new system. As a result the computational
process is not efficient. As we shall see, FD solvers utilizing nonfitted meshes
completely avoid step (i) and partially avoid step (ii) since the stiffness matrix
remains the same for every admissible domain. The FD formulation that we use
in this paper is based on the dualization of Dirichlet conditions by boundary
Lagrange multipliers ([6]' [8]). It turns out that this variant is appropriate in
shape optimization problems for the following reasons: the Lagrange multiplier
being part of the solution is equal to the conormal derivative on the searched
boundary of the solution to the original state problem. The conormal derivative
of the state appears in expressions for the shape derivative of cost functionals
Fictitious Domain Methods in Shape Optimization 57
where:
- 0 is a family of admissible domains which consists of doubly connected
domains in JR.2 contained in a box D. The components of the boundary ow
are denoted by fo and f f(w). We shall suppose that fo is fixed and the
same for all wE 0 while f f(w) is variable and defines the shape of w (in our
presentation f f (w) is exterior to f 0 but one can consider also the opposite
situation);
- o~(w) denotes the conormal derivative of u on f f(w), where u solves the
UVA
Dirichlet state problem in w:
- div (AV'u(w)) = f in w,
{ u(w) = 9 on fo, (P(w))
u(w) = 0 on f f(w),
58 J. Haslinger et al.
The symbol II 11-1/2,rj(w) stands for the dual norm in H-l/2(r f(w)).
We shall suppose that the outer components r f (w) of the boundary OW, W E 0
are parametrized by 27r-periodic functions 'Y: [0,27r] -> ]R2. We shall use the
following notations: r f(w) := r, meaning that r f(w) is the range of 'Y. Further
w, denotes the doubly connected domain between ra and r,. The family of
admissible domains can be specified by a class S to which 'Y belongs.
Definition 1. A function 'Y: [0,27r] -> ]R2 belongs to S if:
(51) 'Y E C~7r' i.e. 'Y is 27r-periodic, twice continuously differentiable on [0, 27r];
Fictitious Domain Methods in Shape Optimization 59
(S2) 3a,/'1,/'2 > OVt E [0,27r]: I/"(t) 1~ a, Ih'lloo:S /'1, 1b"1100:S /'2;
(S3) /' is positively oriented;
(S4) W, C 0 = (0,1)2;
(S5) 3d> 0: dist(fo,f,) ~ d, dist(f"aO) ~ d;
(S6) 3h > 0 V/, E S Vt E [0, 27r]: 3Bi , Bo open discs of radius h such that
Bi C w" Bo C 0 \w" /,(t) E Bi nBo'
The unit square 0 which contains W, for all /' E S will serve as the fictitious
domain in the FD formulation. Let us introduce the following spaces:
1
Vg={vEHo(O) v=gonf o, v=Oonf,},
A A
Vo := Vg with 9 = O.
Here TO: HJ(D) ---* Hl/2(fo), T'Y: HJ(D) ---* Hl/2(f "I) stand for the trace
mappings and (, )ro' (, )r., denote the respective duality pairings. Using re-
sults of [2] one can easily prove
Theorem 1. There exists a unique solution (U,AO,A'Y) to (p(w'Y)). In addi-
tion, u:= ul solves (P(w'Y)) and A'Y = {){)u on f'Y.
w., VA
Remark 4. The equality A'Y = {){)u on f'Y is due to the zero extension of f
VA
outside of W'Y. For any other extension one only knows that A'Y = [!lv:] r ., '
where [ ]r., denotes the jump of the corresponding quantity across f 'Y"
min -211IA'Y -
'YESL QII~1/2 'r ., ' (fill)
where A'Y E H- 1/ 2(f'Y) is the last component of the solution to (p(w'Y)) and
SL ~ S is a compact subset of S.
To prove the existence of optimal shapes in (fill) one has to show that the
solutions to (p( w'Y)) depend continuously on variations of 'Y E S. One of the
difficulties that we face in any shape optimization problem is the fact that the
functions have their own, variable domain of definition. To handle this difficulty
Fictitious Domain Methods in Shape Optimization 61
we pass to reference, shape independent function spaces. This will be done for
the trace space H1/2(Ly) and its dual H-1/2(Ly). Using the parametrization
of r" we shall replace H1/2(r,,) by the reference space Hit2 and H-1/2(r,,)
-1/2
b y H 211" .
The space of 21T-periodic, square integrable functions will be denoted by
L~11"' The reference trace space Hit2 is defined as follows (see [13]):
II rp 1 2 '=1111 2
1/2,211" . rp L~~
+JJ I 211" 211"
Irp(t)-rp(sW dtds
sin((t - s)/2W .
o 0
Next we shall define the trace mapping T, from HJ(D) into the reference space
Hit2. To this end we introduce the following spaces on r", 'Y E S:
H}P(r,,) = {rp E L2(r,,) I rp0'Y E Hit2}
with the norm
IlrpI11/2,p := Ilrp °'Ylh/2,211"
and the standard Sobolev space H1/2(r,,) endowed with the norm ([15]):
+ r211" r211"
Jo Jo
Irp°'Y(t) -rpO'Y(sWI'Y'(t)II'Y'(s)ldtds
b(t) - 'Y(S)12
making use of the parametrization of f". The relation between H~/2 (f,,) and
H1/2 (f,,) follows from the next lemma.
Lemma 1. The spaces H~/2(f,,) and H1/2(f,,) coincide as sets and are topo-
logically equivalent, uniformly with respect to 'Y E S, i. e. there exist positive
constants C1, C2 such that
Let i,,: H1/2 (f,,) ----> H~/2 (f,,) be the identity mapping and I" :
H~/2 (r,,) ----> Hit2 be an isometry defined by I" (rp) := rp 0 T Then the trace
mapping T,: HJ(D) ----> Hit2 is given by
The trace mapping T, enjoys properties which are useful in the existence anal-
ysis for (IP').
62 J. Haslinger et al.
(3)
,n
T. v A
-+
T.,v A
tn
• Hl/2
2" (4)
T.,0,<P
C
= <P \..I
v<p
E Hl/2
2"
and
(5)
where c > 0 does not depend on I E S. In addition, supp E,<p is contained in
the h-neighborhood of f,.
For the proof of the previous lemmas we refer to [10].
In a similar way one can associate with any p" E H- 1 / 2 (f,) a unique
e ement p"- E H-
I 2"
1/ 2
:
where p, := I:; *i:; * p" and (, )2" denotes the duality pairing between H~1/2
an d H 2".
l/2
Since
Fictitious Domain Methods in Shape Optimization 63
(IF') ref
- 1/2
where A, E H:;1"( is the last component of the solution to (P(w'))ref"
A
(6)
(7)
making use of (4) and (8). From this and (9) we see that (u, Ao, .\"1) solves
(P(W'Y)) ref" In addition, the whole sequence in (8) tends to (U,Ao,'\'Y). Strong
convergence of {un} follows from
yielding strong convergence of {Aon} to Ao. For the proof of (7) which is more
technical we refer to [10J. 0
The dual norm defining the cost functional is hard to evaluate. For this
reason we use another functional which also measures the distance between
flu and the target Q and which is easy to compute. To this end we introduce
ul/A
the new norm in H1/2(r "I):
The mathematical analysis for (iP) is done in [9]. It is worth noticing that
the existence of optimal shapes in (iP) is ensured under weaker assumptions on
O. Instead of C 2 ,2 in (IF) one needs C 1 ,1 boundaries in (iP), only. This is due
to the fact that the cost functional in (iP) is expressed by a volume integral.
In this section we briefly describe the discretization of (iP). For more details we
refer to [9]. Instead of the family of admissible domains 0 we shall introduce
a new family 01< which contains domains with boundaries defined by a finite
number d := d(f£) of parameters, e.g. splines. For WI< E 01< given we define
a discretization of (i-'(w,)) as follows:
For the detailed mathematical analysis of (p(w",)):, in particular for the ver-
ification of the LBB-condition we refer to [6]. Having AH-y at our disposal we
define the discretization of (A(Wk)):
~iKn ~IZh(AHJli,n'
where A is the stiffness matrix and Iffi o, Iffi,,(I'>,) are matrices coupling the primal
variable u with the Lagrange multipliers Ao, A", respectively. Notice that only
the matrix Iffi" and the right hand side f depend on I'>, but not A! Therefore, A
needs to be assembled only once. We finally arrive at the following non-linear
mathematical programming problem:
(IF')~
This algorithm uses ideas of the simplex method [16] and the CRS (Con-
trolled Random Search) algorithm [17]. It starts with a population P of
N points (N » n) which are chosen at random in the search space X
(n := dim(X)). A new trial point x is generated from a simplex S (a set
of n + 1 linearly independent points of a population P in X) by the following
operation called reflection:
x = g - Y(z - g),
where z is one (randomly chosen) vertex of the simplex S, g is the center
of gravity of the remaining n vertices of the simplex and Y is a randomized
multiplicative factor. Thus the new point x is obtained from the reflection of
the point z with respect to g. Let Xmax be the point with the largest objective
function value among the N points currently stored in P. If f(x) < f(x max )
then Xmax := x, i.e. the worst point in the population is replaced by the new
trial point. The process continues until a stopping criterion is fulfilled.
The main modification of the original CRS algorithm consists in randomiz-
ing the multiplicative factor Y. The best results in most tested examples were
obtained with Y distributed uniformly in [0, a[ with a ranging from 4 to 8 and
N = max(5n, n 2 ). For more details on this algorithm we refer to [14].
u* = 0 on r o,
(13)
u*=l, 8u* )
8v=Qonr f (w*,
The discrete family 01< consists of all doubly connected domains WI< whose
variable component of the boundary is realized by piecewise second order
Bezier curves. The discretization parameter '" is related to the number of
Bezier segments. The space Vh C HJ(D) is realized by continuous, piecewise
linear functions over a uniform partition ih of D. Further, AHa, AH.., are spaces
of piecewise constant functions over partitions THo, TH.., of polygonal approxi-
mations I'o, I', of r o, r" respectively (see Fig. 2). Finally, the duality pairings
[, ]0, [, ], are realized by the L 2 (I'o), L 2 (I', )-scalar products, respectively. In
order to satisfy the LBB-condition, the partitions THo, TH.., are constructed in
such a way that H o ~ 3h, H, ~ 3h. For more details on the practical realiza-
....... (..,
--0-ro
I!!lElI WI< A A
• nodes of THo,TH..,
Fig. 2. FE partitions
tion we refer to [9]' [11]. Both external and internal problems were computed
with the following data: D = (0,10)2, h = 10/64, the number of Bezier seg-
ments '" = 10 and r o is L-shaped. The examples are computed for different
values of the target Q by using two cost functionals, namely:
(a) J 1 (w) as in the definition of (iF);
2 _ Icn l2
Il vll- 1/2,[0,1] - n=~oo 1 + Inl'
00
The results obtained by the MCRS method after 2000 function evaluations are
shown in Figs. 3-4.
Fictitious Domain Methods in Shape Optimization 69
1or:-------~----___, 1Or------.----------,
°0~----------~,0 °0~----------~,0
10r:------------___, 1Or:----------------,
ro
°0~----------~,0 °0~----------~,0
.
mmwEOrr 2"1 Jr f IOu(w)
av - QI 2 d (J',
(14)
-Llu = a on w,
{ u = 9 on To, (15)
u = a on Tf
with To and T f as in Section 2. We assume that the searched component of
the boundary Tp := Tf of every admissible domain w p, p E II, is the union
of a fixed number k of adjoining arcs Tp,i, Tp = U7=1 Tp,i, such that each arc
Tp,i can be described by a quadratic Bezier curve. More precisely, given an
ordered set of distinct control nodes X1, ... ,Xk, Xi = (~i,rJi)' i = 1, ... ,k,
the i-th Bezier arc Tp,i is determined by the triple (mi-1, Xi, mi) with mi =
!(Xi +Xi+1), i 1, ... , k. Here we set Xk+1 = Xo and mo = mk. Thus Tp,i can
=
be parametrized by
Defining B: [0,1] ---* jR1X3 (a (lx3) matrix), t f-+ ![(1 - t)2, 1 + 2t(1 - t), t 2]
and (~i-1 rJ i _ 1)
Xi = ~i rJi
~i+1 rJi+1
this parametrization can be compactly written as
-L1JL = 0 on wP '
{ JL = 0 on ro, (17)
JL = g~ - Q on rp
and u is the solution of (15) on wp. Let us introduce the shorthand notation
and let 8Xi E jR3x2 hold the perturbation of the coordinates of the nodes Xi-I,
Xi, Xi+l specified by 8p. If we insert
We conclude this paper by solving a dam problem [4], [5]. The classical bound-
ary variation approach has been used in [1] and [7]. The vertical wall n made
72 J. Haslinger et al.
10,.,..---------------,
°0~-~---------~1O
1. Found free boundaries 2. Minimization history
of a non-homogeneous material separates two water levels of height Yl, Y2. One
wants to find a curve separating the wet and dry part of D. The mathematical
model leads to the following free-boundary problem:
u = Yi - Y on fi' i = 1,2,
°
u = on f cp U f (y,
(15(rp))
au -1
all - on f o·
!
The FD formulation of (15(rp)) is now straightforward:
r kY'iL.Y'vdxdy=- Inr k av
In ay
dxdy+(.Acp,v)r", Vv E Vo, (p(rp))
where lIy is the y-component of 1I. In computations the cost functional (21)
is replaced by the L2(f cp)-norm. As before the spaces Vg and H-l/2(f cp) are
approximated by continuous, piecewise linear and piecewise constant functions,
respectively and the unknown component f cp by piecewise, second order Bezier
curves. The example was solved with the following data: n = (0,1.62) x (0,4),
h = 1/32, Yl = 3.22, Y2 = 0.84, k = 1 and the number of Bezier segments
K = 6. The free boundary found by the MCRS method after 2000 function
evaluations is shown in Fig. 7.
Conclusions
The variant of the FD method presented in this paper provides an efficient
computational tool for the numerical realization of shape optimization prob-
lems. Its main features are the following:
74 J. Haslinger et al.
4,--------,
3.5
1.5
0.5
References
1. D. Begis and R. Glowinski (1975): Application de la methode des elements finis
a l'approximation d'un probleme de domaine optimale. Methodes de resolution
des problemes appro eMs , Applied Mathematics & Optimization, 2, 130-169.
2. F. Brezzi and M. Fortin (1991): Mixed and hybrid finite element methods.
Springer-Verlag, New York.
3. M. Flucher and M. Rumpf (1997): Bernoulli's free-boundary problem, qualitative
theory and numerical approximation, J. Reine Angew. Math., 486, 165-204.
4. C. Baiocchi and A. Friedman (1977): A filtration problem in a porous medium
with variable permeability, Ann. Mat. Pura Appl., 114, 377-393.
Fictitious Domain Methods in Shape Optimization 75
Contributed Papers
Domain Decomposition Method for a Class of
Non-Linear Elliptic Equation with Arbitrary
Growth Nonlinearity and Data Measure
Summary. In this paper we show the existence of weak solutions for a nonlinear
elliptic equations with arbitrary growth of the non linearity and data measure. A nu-
merical algorithm to compute a numerical approximation of the weak solution is
discribed and analysed. In a first step a super-solution is computed using a domain
decomposition method. A numerical example is presented and commented.
1 Introduction
This work deals with weak solutions of the following quasi-linear elliptic prob-
lem with Dirichlet boundary conditions:
where G, F : [0,1] x IR ---+ [0, +oo[ are measurable and continuous with respect
to u' and u, f is a given finite non negative measure on (0,1).
The main goal is to present a numerical analysis of this weak solutions
and to study their existence and uniqueness. Such problems arise from biolog-
ical, chemical and physical systems and various methods have been proposed
to study existence, uniqueness, qualitative properties and numerical simula-
tion of such solutions (see [8]). When f is regular, it is proved in [9] that
if F) has a nonnegative super-solution in W~,oo then (1) has a solution in
n
Wo,oo W 2 ,p. Many authors dealt with this problem when f is irregular and
G is sub-quadratic with respect to u' namely:
IG(t, r)1 :::; c(g(t) + IrI2), g(t) E Ll(O, 1), c > 0 (2)
They showed that, if G satisfies (2), (1) has a solution u E HJ(O, 1) provided
that (1) has a super-solution in Wl,oo(O, 1) see [6]' [5]. The case where the
super-solution itself is irregular has been treated in [2], if the super-solution
80 N.E. Alaa, J.R. Roche
belongs to HJ (0, 1) then (1) has a solution in HJ (0,1) provided that G satisfies
(2).
In this work we are particularly interested in situations where f is irregular
and where the growths of G with respect to u' and F with respect to u are
arbitrary. Let us make some precisions about the model problem:
{ u E Wl;~(O, 1) nCo[O, 1]
-u"(t)+G(t,u'(t))=F(t,u(t))+f in 'D'(O, 1) (11)
(replace in (11) = by 2: for a weak super-solution and by :::; for a weak sub-
solution)
Remark 1. In (11) u E Wl~'coo (0, 1), using (9) we have G(t, u'(t)) and
F(t, u(t)) E L[oc (0, 1). Hence every term in (11) makes sense.
Theorem 1. Assume that (7)-(10) and f E Mi(O, 1) hold. Assume that there
exists a weak solution w to the problem,
{
Un+l E W~,OO(O, 1) (15)
- u~+1 + Gn +1 (t, u~+1) = F(u n ) + f in V'(O, 1)
where Uo = w. To have estimates when passing to the limit, we give the
following three lemmas, see [1].
Lemma 2. Let a(t) E L[oc(O, 1), v E WI~';(O, 1) n Co [0, 1] such that
for all 0 < a < b < 1, where d(t; a, b) = min(b - t, t - a) and c(a, b) is
a constant depending on a and b.
Lemma (3), will provide Wl~';'(O, 1) estimates for the approximate solution
Un. But this estimate does not allow us to pass to the limit in the nonlinear
terms. We need the strong convergence of Un in Wl~;'(O, 1). We obtain this
result from the following Lemma.
Domain Decomposition for a Non-linear Elliptic Equation 83
°
that
Un+1 ::; Un ::; W in [0,1] for all n 2: (22)
By lemma (3) Un is bounded in WI~~(O, 1) n C o [O,l] independently of n.
Therefore, there exists a subsequence, still denoted by (un) for simplicity, Un
such that converges to U strongly in Loo(O, 1) if n ---+ 00. Also u~+l converges
to u' strongly in Lfoc(O,l) and a.e. in (0,1). Then from lemma (3) we have
u~+1 converging to u' strongly in LZ;:c(O, 1), and
°
Since G(t,.) and F(t,.) are continuous with respect to the two last argu-
ments, we have for all < a < b < 1
G(t,u~+1)' F(t,u n ) converges to G(t,u') , F(t,u) a.e. t E (0,1). (24)
On the other hand, for a.e t E (a, b)
IG(t,u~+1(t)l::; max IG(t,r)1 = e(t) (25)
Irl :'0 c' (a,b)
and
max IF(t, s) 1= 8(t) (26)
Isl:'Omax( IlwIIL~(O,l)' 11!£IIL~(O,l»)
and e, 8 E Lfoc (0, 1) from (9). Using Lebesgue's dominate convergence Theo-
rem (see [7]), we also have;
G(t,u~+l)' F(t,u n ) converges to G(t,u'), F(t,u) in Ll(a,b) respectively
(27)
Now, we can pass to the limit in (15), and if 'P E V (0, 1) with sup 'P C [a, b]
then
°= n-+oolim (-u~+1 + G(u~+1) - F(u n ) , 'P) = (-u"
- F(u) , 'P) + G(u')
(28)
where (.,.) denotes the duality pairing between V'(O, 1) and V(O, 1). The the-
orem follows.
84 N.E. Alaa, J.R. Roche
3 Numerical method
In this section we present the numerical method to solve equation (1). Formally
the iterative method brings out a sequence of numerical solutions of (15) in
HJ(O,I) with a first guess which is a super-solution of (1), in our case, a
solution of problem (12).
Then the algorithm can be formulated in the following way:
1) Find w E HJ(O, 1) such that:
(30)
Both problems (Pd and (P2 ) are nonlinear, and if (PI) has a solution,
in theorem (1) we prove that the solution of (P2 ) is also a solution of the
equation (1). Problem (P2 ) has a unique solution and the numerical calculation
is straightforward by Newton method. To solve the nonlinear equation (Pd,
which presents some interesting difficulties, we construct a sequence w k such
that w k is a solution of a linear problem and w k converges to w.
Let wD = 0, we define wk + I = wk + 0 where 0 is the solution of the
following linear problem:
8F~~k) 0 = + F(wk ) + f
°
-0" - (w k )" in (0,1)
{ (31)
0(0) = 0(1) =
Then at each iteration we have to solve the linear problem (P3 ). To achieve
this, we consider a weak formulation of the problem and use the finite element
method.
To simplify the text we reformulate (P3 ) in the following way: find v E
HJ (a, b) such that:
+ c(t) v(t)
(P4 ) { - v(t)"
v(a) =
=
v(b) = °
h(t) in (a, b) (32)
where h E MB(a, b), the set of finite measure in (a, b), and c(t) E L 2 (a, b),
without restriction on its sign. We assume Coo = IlcIILOO(a,b) is bounded.
According to Lemma 1, problem (P4 ) has a solution in a domain (a, b)
small enough.
If V = HJ(O, 1) then the weak formulation of (P4 ) is:
and
{ -(v~+l)"(t) + c(t) v~+l(t) = h(t) in (a, b)
(37)
v~+l(oo) = v~(oo); v~+l(b) = 0
Now to prove the convergence of the Schwarz overlapping domain decom-
position algorithm applied to problem (P4 ) we consider two problems:
and:
- V2(t)" + c(t) V2(t) = h(t) in (0:,1)
(P4.2) { v2(OO) = vI(OO), v2(b) = 0 (39)
Proof:
Let dk = v~ - v in (a, (3) and ek = v~ - v in (a, b). We prove the following
inequality:
86 N.E. Alaa, J.R. Roche
(41)
k 1 sin(~(t-a)) . ... ..
then cp(t) = Ie + ((3)1 sIn
. (~((3
Coo - a
)). ThIS solutIOn IS umque and posItIve
if ((3 - a) < 2 ~. If we consider the difference z = cp - d k +2 it is easy to
prove that z :::: 0 and dk+2 ::; cp ::; lek+l((3)I. If now z = cp + dk+2 we have
also z:::: 0 and -le k+1((3)1 ::; cp ::; dk+2(t) , \:It in (a, (3). Then the inequality
Ild k+2 1loo ::; lek+l((3)1 ::; Il ek+llloo holds.
To prove that le k +1 ((3) I ::; --y IWlloo with --y < 1, we consider the equation:
Then le k+1 ((3) I ::; <jJ((3) ::; --y Idk(a)1 with --y = si.n~~ ~~ - (3)]. The coef-
sm~ -a
ficient --y is smaller than 1 only if a < (3.
In conclusion we have IW+211oo < Ildklloo if a < (3 and ((3 - a), (b - a) <
min(::, 2~). In the same way we prove that Ile k+21loo < Ilekll oo if a < (3
and ((3 - a), (b - a) < min({!, 2~).
We conclude that the Schwarz overlapping domain decomposition method
applied to problem (P4) converges.
4 Numerical Results
The algorithm introduced in the previous section has been implemented nu-
l
merically for the model problem (3) with p = q = 3 and f(t) a Dirac in 3.
Domain Decomposition for a Non-linear Elliptic Equation 87
2.315 - , - - - - - - - - - ,•. . . . - - - - - - - - - - - - - - - - - - - ;
I
/
i'
2.084
1.852
1.621
1/
1.389
1.158
0.926
0.695
0.463
/
0.232
/
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
- - (:omputed solution
+ computed super-solution
References
1. Alaa, N. (1989): Etude d'equations elliptiques non-lineaires a dependance convexe
en Ie gradient et a donnees measures, These de Doctorat, Universite de Nancy I
2. Alaa, N., Pierre, M. (1993): Weak solution of some quasi-linear elliptic equations
with data measures, SIAM J. Math. Anal., 24, 23-35
3. Alaa, N., Iguernane, M. (2002): Weak periodic solutions of some quasi-linear
parabolic equations with data measure, J. Inequal. Pure Appl. Math. 3
4. Baras, P., Pierre, M. (1984): Criteres d'existence de solutions positives pour des
equations semi-lineaires, Annales Fourier Grenoble, 24, 1985-2006
5. Bensoussan, A., Boccardo, L., Murat, F. (1988): On a nonlinear partial differen-
tial equation having natural growth terms and unbounded solution, Ann. Inst.
Henri Poincare, 5, 347-364
6. Boccardo, L., Murat, F., Puel, J.P. (1989): Existence results for some quasi-linear
parabolic equations, Nonlinear Analysis Theory Method and Applications, 13,
373-392
7. Brezis, H. (1983): Analyse fonctionnelle theorie et applications, Masson
88 N.E. Alaa, J.R. Roche
8. Levin, S.A., Hallam, Th. G., Gross, L.J. (1989): Applied Mathematical Ecology,
Biomathematics 18, Springer Verlag
9. Lions, P. L.(1980): Resolution de problemes elliptiques quasilineaires, Arch. Ra-
tional Mech. An!', 74, 335-353
10. Quarteroni, A.,Valli, A.(1999): Domain decomposition Methods for Partial Dif-
ferential Equations, Oxford Science Publications
11. Witomski, P. (1983): Sur la resolution numerique de quelques problemes non-
lineaires, These d'Etat, Universite Scientifique et medicale de Grenoble
Variants of Relaxation Schemes and the Lattice
Boltzmann Model Relaxation Systems
Summary. In the low Mach number limit of the Lattice Boltzmann type models
one obtains the incompressible Navier-Stokes equation. This is achieved by asymp-
totic analysis. Moreover in the course of this analysis, the Lattice Boltzmann Model
reduces to a relaxation system which can be discretized using relaxation schemes.
We present two variants of the relaxation schemes characterized by local approxima-
tion of characteristic speeds and a multidimensional flux approximation. These are
applied to relaxation systems. Their performance will be discussed with reference to
test cases of isothermal incompressible flow.
1 Introduction
(1)
where
S[u] = ~ (
2
28xu yu v) .
8 yu + 8 x v
8 + 8x
28yv
Since Q[q], q and s are not needed to derive INS, they will be ignored.
From the momentum equation in (2) one can deduce that as £ - . 0, p
approaches a constant p and can be written as p = p(1 + 3£2a p ) to obtain (')(1)
terms. Hence
1
8t p + 3£2a div u =- div pu, (4)
Relaxation Schemes and Lattice Boltzmann Model 91
(5)
Since 2 div S[u] = (L1 + V div )u, we obtain from (4) and (5) the INS equations
as limiting system for a = 1
. T
divu = 0, OtU + dlvu (>9 u + Vp = "3L1u, (6)
°
where the Reynolds number is related to the relaxation time by Re = 3/T.
For < a < 1, we obtain the incompressible Euler equations
By neglecting the lower order terms in the above equations (3) and (4) and
setting p == 1, we can introduce a simplified relaxation system:
1 .
OtP + -2- dlvu = 0, OtU + dive + Vp = 0,
10 <>
(7)
2 1
Ote + Va[u] + -SE[U]
10 2 <>
= ---(e - u
E1+<>T
(>9 U),
where
10 2 <>
SE[U] = S[U]- Tva[u].
with a = (~) E 1R!. Obviously the limit equations for this system are again
the INS equations with Reynolds number Re = l/T.
Considering the nonstiff advection parts in (7) separately for u and e, we
obtain a hyperbolic system with characteristic speeds ±a and ±b in x and y
directions:
OtU + dive = 0, (8)
As we will see in the last section, a is chosen depending on the local speed. In
the x-direction (8) can be written as:
(9)
92 M.K. Banda
I
(a- + a+)I )(u)=o
8 X x
(10)
3 Numerical Schemes
To discretize the equations in space a uniform grid in x and y with grid points
(Xi, Yj) with spacing h is used. Consider the linear system (8). For the x-
direction 8 x ± au are the characteristic variables associated with the charac-
teristic speeds ±a while for the y-direction the characteristic variables associ-
ated with the characteristic speeds ±b are 8 Y ± bu. Similarly the characteristic
variables associated with the x-direction for the HLL type scheme in equation
(10) are
w+ = 2a+ (8 X
(a+ - a-)
- a-u) and W- = (a:~:-) (8 X - a+u). (11)
+~ (a;j(8 + au) -
X
a;+1j (8 X - au)),
in the case of second order method. If minmod slope limiting is applied then
atj are given by atj(z) = ~minmod( Zij - Zi-1j, Zi+1j - Zij)'
For the HLL type scheme the numerical fluxes at cell edges are
Relaxation Schemes and Lattice Boltzmann Model 93
We denote by Fh, F~ the discretization of the convective parts div E> and
\7a[u] in equation (7), respectively. Using the numerical fluxes above one ob-
tains
And for the HLL type scheme we get the following convective term:
or equivalently
To treat only the limit equations (E = 0) any explicit high order Runge-Kutta
method [11] can be used in combination with a Poisson solver and the limiting
(relaxed) spatial discretization.
Further one would want to discretize the relaxation system (13) for all
ranges of the parameter E. This allows the study of the numerical passage
from the Boltzmann to the INS regime. An implicit-explicit (IMEX) Runge-
Kutta method of the type recently developed in [12] is used as suggested in
[6]. We denote the time step by k and use superscript n to denote the time
iterations. For the second order time discretization a two stage IMEX Runge
Kutta method [12] which guarantees second order accuracy in the stiff limit is
chosen. For E -+ 0 this scheme suggests a formulation for a second order time
discretization of INS equations i.e. the projection is taken at every step:
Step 1:
Step 2:
with" = 1- V2/2 and 5 = 1-1/2", which gives a scheme for the incompress-
ible Euler equations (0 < a < 1) and INS equations for a = 1.
Relaxation Schemes and Lattice Boltzmann Model 95
Step 2:
The cases for E = 0 will be tested in this section. The space discretization in
equation (13) will be applied. For slope limiting the van Leer limiter is used.
Example 1: Taylor-Vortex Flow - Accuracy test
Relaxation schemes will be first tested on incompressible Euler equations,
i.e. 1.1 = T = 0.0 augmented with smooth periodic initial data. The test admits
the following exact solution [13]:
Table 1. Error-norms for the Incompressible Euler Problem with the initial condition
in 14 at t = 2.0 using relaxation schemes, v = 0.0.
Table 2. Error-norms for the Incompressible Euler Problem with the initial condition
in (15) at t = 0.7 using relaxation schemes, v = 0.0.
where () is the shear layer width parameter and 0 is the strength of the initial
perturbation. The strength coefficient 0 = 0.05 is kept unchanged. The results
of vorticity profiles are displayed using 20 equidistant contours.
In figure 1 vorticity profiles of solutions obtained by applying different
schemes without recourse to slope limiters are shown.
Fig. 1. Thick Shear ((2 = 30) Layer Results for Euler case E = 0: Staggered Central
Scheme [16J (a), Godunov Based Scheme [15J(b); Relaxation Scheme with TVD time
integration (c); and Relaxation Scheme with relaxed DIRK time integration (d).
Further a refinement of the grid to N = 256 was made. The same compu-
tation was repeated with the van Leer limiter applied to the relaxation-based
schemes. The results are shown in figure 2. In figure 3 a v velocity cut at
x = 0.5 is shown.
98 M.K. Banda
t;;
9°·2
0.1 ~O.1
UJ
0 0
(d)O
0
(C)
Fig. 2. Comparison of Thick Shear Layer Results for the incompressible Euler case,
multidimensional vs. dimension-by-dimension approach: (a) second-order DIRK, (b)
second-order TVD, (c) second-order multidimensional with DIRK, (d) second-order
multidimensional with TVD.
The relaxation system was also tested on the thin shear layer, p = 80,
problem. The Navier-Stokes equation was considered for t = 1.0 using a grid
of N = 256. A comparison of the time evolution of total kinetic energy and
enstrophy in the incompressible Euler equation up to time t = 2.0 was made.
Figure 4 presents the evolution of the decay of the total kinetic energy of the
flow and a history of the mean enstrophy.
In all the three examples we observe that the relaxed schemes (E = 0)
perform reasonably well inspite of their simplicity. Much as the DIRK formu-
lation suggests the projection structure, the direct TVD formulation has better
qualitative results. The DIRK formulation tends to be more dissipative. Never-
theless they both tend to achieve their expected convergence rate. Further we
Relaxation Schemes and Lattice Boltzmann Model 99
V cuts at T = 1 .2, N = 128 for Second Order Schemes V cuts at T = 1.2, N =256 with DIRK time integration
0.3r---~--~--~---r----, 0.3r---~--~--~---r--___,
0.2 0.2
-0.1 -0.1
-0.3:----::'::----::'-0---:-:------='0-----' -0.3~-___,:'::----:'-:"---:-:------=,O-----'
(a)0 0.2 0.4 0.6 0.8 (b) 0 0.2 04 06 08
Fig. 3. The cut at x = 0.5 of v at t = 1.2 for the Thick Shear Layer Problem
computed with the staggered central scheme (,Central'), the Godunov projection
method (,Centered'), the relaxation-based scheme with DIRK time integration (,Cen-
tered DIRK') and the relaxation-based scheme with TVD time integration (,Centered
TVD') (Left). The cut at x = 0.5 of vat t = 1.2 for the Thick Shear Layer Problem
computed with the Godunov projection method (,Centered'), the relaxation-based
scheme ('2nd Order') and the multidimensional relaxation-based scheme (,2nd Order
Mult.') (Right).
Kinetic Energy for Incompressible Navier-Stokes Equation Enstrophy for Incompressible Navier-Stokes Equation
0.48
- Centered DIRK
100,--~---~-r===;c:::e::;nt=ere=d;:;:D"'IR;:;;K:=il
. - . - Centered TVD . - . - Centered TVD
- - Centered 90 - - Centered
0.47
70
0.45
~
60
0.44
~
UJ
50
0.43
.............
40
0.42 30
0.41~-----:_'::_----;----c':_-----' 20
1.5
(a) 0 05 n~e 1.5 (b) 0
0.5 1
Time
Fig. 4. Comparison of total kinetic energy and enstrophy at t = 2.0, Re = 10000 for
the thin shear layer problem computed with the Godunov projection method (,Cen-
tered'), the multidimensional relaxation-based scheme with TVD time integration
(,Centered TVD') and the multidimensional relaxation-based scheme with DIRK
time integration (,Centered DIRK').
100 M.K. Banda
i;
9°·2
-g
~ 0.1
w
00
l-----="':':--~~~~
0.2 0.4 0.6 O.B
(b) Contour Plot L =-59.9293 H = 59.9293
Fig. 5. Comparison of Thin Shear Layer Results for Navier-Stokes case: (a) second-
order multidimensional DIRK scheme, (b) second-order multidimensional TVD
scheme.
observed that for the DIRK scheme there is no significant difference between
the multidimensional and dimension-by-dimension scheme. A comparison with
other schemes shows that relaxations schemes are less dissipative than central
schemes while on some occasion they are very close to the Godunov scheme.
The resolution of the solution for thin shear layer problems shows that the
scheme has a lot of potential for improvement. The implementation of the
HLL formulation and relaxing schemes (E -I- 0) is underway to investigate how
this can be achieved.
Acknowledgement: This work was supported by a DFG Grant KL 1105/9.
References
1. Chen, S., Doolen, G.D. (1998): Lattice Boltzmann Method for fluid Flows. Ann.
Rev. Fluid Mech., 30, 329-364
2. Jin, S., Xin, Z. (1995): The Relaxation Schemes for Systems of Conservation Laws
in Arbitrary Space Dimensions. Comm. Pure Appl. Math., 48, 235-277
3. Klar, A. (1998): An Asymptotic-Induced Scheme for Nonstationary Transport
Equations in the Diffusive Limit. SIAM J. Num. Anal., 35, 1073-1094
4. Naldi, G., Pareschi, L., Toscani, G. (to appear): Relaxation schemes for PDEs and
applications to second and fourth order degenerate diffusion problems. Surveys
in Mathematics for Industry.
5. LeVeque, R.J., Pelanti, M. (2001): A Class of Approximate Riemann Solvers and
their Relation to Relaxation Schemes. J. Compo Phys., 172, 572 - 591
6. Banda, M.K., Klar, A., Pareschi, L., Seai'd, M. (2001): Lattice-Boltzmann type re-
laxation systems and high order relaxation schemes for the incompressible N avier-
Stokes equation. preprint.
Relaxation Schemes and Lattice Boltzmann Model 101
Introduction
small velocities and reduced CPU-time because the CFL time step limitation
only applies to small eigenvalues.
But the VFRoe scheme also suffers from a few drawbacks. The first one
is that the scheme is still CPU-time consuming because it needs to compute
numerically the eigenvalues of the Jacobian matrix of the system. The second
one is that theses eigenvalues are not necessarily real, that is, the system is
not always hyperbolic. It is observed in [4] that the system is hyperbolic only
if the slip between the phases is not too large.
This is why the authors of [2] designed an explicit relaxation scheme. The
approach [2] is close in spirit to Jin & Xin's [11] but differs in the fact that
only the genuine non-linearities are relaxed, namely the pressure and the hy-
drodynamic laws. In a first step, the genuine nonlinearities are shown by means
of a Lagrangian change of coordinates. Then, these are relaxed. Finally, the
equations are brought back to the Eulerian frame. The resulting relaxation
system is automatically hyperbolic and has all its fields linearly degenerated.
This scheme is less CPU-time consuming than the VFRoe-type one because
the most complex (algorithmically speaking) step is the computation of only
two relaxation coefficients. The relaxation coefficients can be easily devised so
as to guaranty two properties: first, the stability of the first order asymptotic
system computed thanks to the Chapman-Enskog expansion and, second, the
positivity of the mass fractions. As a first attempt to design a fast and rough
numerical scheme for two-phase flows, these results were encouraging.
The goal of this paper is to work out a semi-implicit version of the explicit
scheme [2]. In essence, the extension is possible because of the linear degeneracy
property of the relaxation system.
The paper is organized as follows. In section 1, we introduce the two-phase
flow model together with the boundary conditions and the characteristics of
this model. In section 2, we develop the second order explicit relaxation scheme
with the computation of new relaxation coefficients and boundary conditions.
In section 3, we present the semi-implicit scheme and section 4 is devoted to
the numerical results.
Most of the results presented here are extracted from [1], [2] and [3].
1.1 Equations
In the flow, the mixture is characterized by its density p, its velocity v and its
gas (resp. liquid) mass fraction Y (resp. X = 1 - Y). The model is governed
by the following system of conservation laws:
(1)
104 M. Baudin et al.
°
for all x E JR and t ;::: where the unknown is u := (p, pv, pY). Here the source
term S includes gravity and wall friction terms which are given functions of
the unknown. The two following functions of the unknown u
O"(u) = pY(l- Y)<l!(u), P(u) = p(u) + pY(l - Y)<l!(U)2, (2)
are highly non-linear closure laws. The natural phase space associated with
such variables then reads:
n= {u = (p, pv, pY) E JR3; p> 0, v E JR, Y E [0, I]}.
Considering the pressure p, we will consider a perfect gas and a compress-
ible liquid. The pressure law can be put in the general form p = p(p, pY) and
is a smooth function. We consider a general algebraic hydrodynamic law of the
type
VL - VG = <l!(u), \j u E n, (3)
in order to close (1). In (3), the mapping <l! is assumed to be smooth enough.
In practical situations, <l! turns out to be nonlinear in the unknown u (see [4],
[12] for instance).
°
situations, i.e., for usual values of u, the system is hyperbolic and has three
real eigenvalues >'1 < >"2 < >"3 with >"1 < and >"3 > 0.
The interesting property of the system is that the eigenvalues >"1,3 cor-
respond to the acoustic waves (or "pressure waves") and propagate at fast
speeds thanks to the compressibility of the fluid. The eigenvalue >"2, which has
a variable sign, corresponds to the kinematic waves and propagates at slow
speeds with the fluid. These properties imply that the large eigenvalues are
10-100 times bigger than the small eigenvalue, i.e. 1>"1,31» 1>"21.
A Time Semi-implicit Relaxation Scheme 105
We relax all the genuine non-linearities of the equilibrium system (1) which
appear by means of a Lagrangian change of coordinates. Then, we introduce
two new state variables E and II which are intended to coincide respectively
!
with u(u) and P(u) in the limit of the relaxation parameter A. Finally, back in
Eulerian coordinates, we propose as a relaxation model the following system:
at p + Ox (pv) = 0,
at (pv) + Ox (pv 2 + II) = S(u),
at (pII) + Ox (pIIv + a2 v) = Ap(P(U) -II), (4)
at (pY) + Ox (pYv - E) = 0,
at (pE) + Ox (pEv - b2 Y) = Ap(U(U) - E),
where a and b are two real positive parameters that we call "relaxation coef-
ficients". The above relaxation system will be given hereafter the convenient
abstract form:
The pipeline is made of I cells, denoted (Mi)i=l,I. Let Xi be the center of the
cell and L1x its length. We also denote xi+1/2 = (Xi + xi+d/2 the interface
between two cells, Xl/2 = °the inlet boundary interface and x I +1/2 = L
the outlet boundary interface. Let L1t n = t n+1 - t n be the time step. Let
u7 R;; U(Xi, tn) be the discrete unknown. The numerical scheme is based on
the following splitting method.
1. Relaxation. We take A = 00 and solve the ODE system atV = AR(v) by
projecting the variables on the equilibrium variety i.e. we set IIf := P(u7)
and Ef := u(u7)·
2. Evolution. We take A = ° and solve the system atV + axQ(v) = S(v) on
one iteration in order to go to the time t = tn+l.
106 M. Baudin et al.
(9)
where ai-I, (3i and Ii+! are 5 x 5 matrices involving partial derivatives of
the numerical flux.
This consists, at each time step, in solving a linear system A ov = b thanks
to, for example, a Gauss method. The matrix A is a tridiagonal by block
matrix with each block of size 5 x 5: the resulting matrix is a band matrix
with 9 extra-diagonal terms.
3. Finally, we can update the conservative variable of the cell i by v~+l =
vi + OVi.
Now, the remaining question is: how to construct the matrices a, (3 and
,? It is easy (see [6]) to compute these matrices when the explicit scheme
is a Roe-type scheme. The main objective is therefore to put the relaxation
explicit scheme under Roe's form. Such a rewriting is based on a shock curve
decomposition and is possible when the solution of the Riemann problem is
made of shocks and contact discontinuities (which is the case of the relaxation
system). This computation is detailed in [1, 3].
Implicit projection We follow the ideas of Chalons [5]. The use of a linearly
implicit scheme implies that the projection step is giving steady states that
are not accurate. The solution is to link the evolutions of the relaxed variables
associated with an implicit field to the evolutions of the equilibrium variables.
This modifies the linear system to be solved in the mathematical step. It
enables us to reduce the size of each block from 5 x 5 to 4 x 4 and therefore
reduce the size of the global linear system. Numerical experiments shows that
not only the steady solutions are more accurate (as already shown in [5]), but
even transient solutions are more accurate.
The slope limiters used to build the second order explicit scheme leads to a non
differentiable expression. That is why we choose a simplified version in which
we do not differentiate the nonlinear operator involved in the second order
correction. In the physical step, we first compute the second order correction
of the states UL,R and evaluate the numerical flux. In the mathematical step,
we solve the first order linear system but the derivatives are computed with
the corrected states UL,R.
In order to have a second order accuracy in time, we use again the Runge-
Kutta 2 procedure. Since the scheme is only semi-implicit, the order 2 in time
is only achieved on the explicit waves and the linearly implicit waves are solved
with order 1 in time.
4 Numerical results
In this Section, we show the numerical results of the semi-implicit relaxation
scheme: we consider a real-life problem in which the solution is driven by the
changes of the boundary conditions.
The details of this test-case are shown in figure (1) and the results are given
in figures (2).
In this experiment[6], the inlet gas flow rate is decreased from 0.114 to 0
kg/so As the mass flowrates are small, the decrease in the inlet gas mass flow
rate gives rise to negative oil velocities in the upper part of the riser. Therefore,
A Time Semi-implicit Relaxation Scheme 109
o
i 0.6
l~ 0, (
Time(s)
Fig. 2. Experiment 4, gas surface fraction. Left: TACITE results, Right: relaxation.
110 M. Baudin et al.
The difference between the two schemes is mainly the CPU cost of the
two simulations. The TACITE scheme requires 22 minutes and the relaxation
scheme requires 7 minutes. See [1] for a discussion on this point.
Conclusion
Acknowledgments
The authors wish to thank C. Berthon, 1. Faille and F. Willien for the numerous
helpful discussions we had.
References
1. M. BAUDIN, Methodes de relaxation pour la simulation des ecoulements
diphasiques dans les conduites petrolie-res, PhD thesis, Universite Pierre et Marie
Curie, 2003.
2. M. BAUDIN, C. BERTHON, F. COQUEL, R. MASSON, AND Q.-H. TRAN, A re-
laxation method for two-phase flow models with hydrodynamic closure law, sub-
mitted, (2002).
3. M. BAUDIN, F. COQUEL, AND Q.-H. TRAN, A semi-implicit relaxation scheme
for modeling two-phase flow in a pipeline, in preparation, (2002).
4. S. BENZONI-GAVAGE, Analyse Numerique des Modeles Hydrodynamiques
d'Ecoulements Diphasiques Instationnaires dans les Reseaux de Production
Peroliere, PhD thesis, Ecole Normale Superieure de Lyon, 1991.
5. C. CHALONS, Bilans d'entropie discrets dans l'approximation numerique des
chocs non classiques. Application aux equations de Navier-Stockes multi-pression
2D et a quelques systemes visco-capillaires, PhD thesis, Universite Pierre et Marie
Curie, Paris VI, novembre 2002.
6. 1. FAILLE AND E. HEINTZE, A rough finite volume scheme for modeling two phase
flow in a pipeline, Computers and Fluids, 28 (1999), pp. 213-241.
A Time Semi-implicit Relaxation Scheme 111
Markus Bause
1 Introduction
Groundwater contamination by biodegradable organic compounds has be-
come a serious and widespread environmental problem in industrialized coun-
tries. Major organic contaminants include petroleum fuels (gasoline, diesel),
petroleum bypro ducts (coal tar, coal-tar creosote), and chlorinated solvents. In
many cases, groundwater contains a mixture of organic contaminants, either
due to the complex mixture in many non-aqueous phase liquids (NAPLs; e.g.,
gasoline) or due to co-disposal/co-spillage (e.g., landfillleachates). The degra-
dation of these contaminants is controlled to a large extent by the biological
and geochemical conditions in the groundwater. Fortunately, biodegradation
tends to attenuate at least some organics during groundwater transport.
The question of whether active remediation is required, or whether natural
processes of attenuation (passive remediation) will be sufficient is a critical
issue in "real world" situations. Passive or intrinsic remediation is generally
preferred, if feasible, due to the potential to, firstly, eliminate permanently
contaminants through biogeochemical transformation or mineralization and,
secondly, avoid expensive biological, chemical and physical treatments. How-
ever, the possible attenuation of organic compounds and the impact of that
Computational Study of BTEX Transport and Biodegradation 113
briefly introduced. The simulated results for the expansion and movement of
the contaminant plume are presented in Sec. 4.
2 Governing equations
CD KID cA KIA
M= 6>Mmax Cx K . (2)
D + CD KID + CD KA + cA KIA + cA
Thus, we have to consider a coupled system of partial and ordinary differential
equations. In (1), 6> H denotes the volumetric water content, q [LT-l] the
Darcy velocity vector (volumetric flux) and D i , i = D, A, [L2 T- 1 ] with
(3)
a slower microbial growth and, therefore, a slower effective electron donor uti-
lization rate at higher concentrations; cf. [15]. In the numerical model, unre-
alistically high biomass concentrations are avoided by introducing the term
1 - CX/CXmax in the second of the equations (1). If microbial growth is not
restricted in the model, simulated microbial concentrations may become very
large, especially in source areas with continuous electron donor and acceptor
supply. In real aquifers, the size of the biomass is limited, for example, due
to a lack of available pore space, production of inhibitory metabolites, lack
of nutrients and viral attack. The constant cXmax represents the maximum
microbial concentration at which the biomass reaches a quasi-steady state.
We consider solving (1), (2) over (O,T) x D where D c IR d , d = 2,3, is a
two- or three-dimensional bounded domain and the system (1)-(2) is supplied
with initial conditions
CD(O,') = CD,O, CA(O,') = CA,O, cx(O,.) = cx,o in D at t = 0, (4)
and nonhomogeneous Dirichlet and Robin boundary conditions for Ci, 2 =
D,A,
Ci = gi on (0, T) x rD ,
Here, 1/ denotes the outer unit normal to the boundary 8D = rDurR of D. The
existence of a global unique non-negative solution CD, CA E Wi,2((0, T) x D),
with p > 2, and Cx E C 1 ([0, T]; C(D)) to (1)-(5) for any given T E (0,00)
was recently proved; cf. [12]. In particular, the non-negativeness of CD, CA, Cx
can be ensured. The proof can be carried over to d = 3. For the definition of
W~/2,l((0, T) x D) we refer to [11].
In our computational experiment, the velocity vector q [LT- 1 ] in (1), (3)
and (5) is prescribed analytically. This is done due to a lack of information and
measurements of the flow field for the considered site; cf. Sec. 4. For numerical
simulations of contaminant transport and biodegradation scenarios where the
flow field is additionally computed numerically by solving the parabolic-elliptic
degenerate Richards equation we refer to [3].
The established regularity of solutions to problem (1)-(5) is, by far, too
weak to justify the use of higher order approximation schemes. However, one
may expect that a higher regularity of the solution still holds in some sense
locally. This might be sufficient to get a significant advantage of higher order
approximation schemes over lower order ones. Such superiority of the higher
order methods was recently confirmed by numerical computations; cf. [3]. Fur-
ther, higher order regularity results for solutions to equations (1)~(5) are es-
tablished in a forthcoming paper; cf. [4].
forming finite element methods. Here, for simplicity, we assume that fl c ]R2
is a polygonal bounded domain. If either the whole boundary of fl or at least
some part of it is curved, we adapt the mesh to the boundary by using the
isoparametric counterparts of the finite elements introduced below; cf. [3]. In
our computations we will consider non-vanishing Dirichlet boundary values gi,
i = D, A, in (5). However, for simplicity, the variational formulation of (1)-(5)
is given for homogeneous Dirichlet boundary conditions only. Nonhomogeneous
boundary values are incorporated by standard techniques.
Let It, = {K} be a finite decomposition of mesh size h of fl into triangles.
The decompositions are assumed to be regular, i.e., "face to face". We use
standard conforming P2 elements for the Monod model (1)-(5). The approxi-
mation spaces Vh for the electron donor and acceptor C i , i = D, A, and X h for
the biomass C x are thus defined as Vh = {Ci E C(fl) I C ilK E P 2(K) for K E
12 -
Th } n Wo,'rD and Xh = {Cx E C(fl) I CXIK E P 2(K) for K E It,}. By Pj(K),
j EN, we denote the space of all continuous polynomials of maximum degree
j. Further, W~'; , D
= {c E W 1,2(fl) I C = 0 on rD}. Hence, the spatial dis-
cretization of CD, CA and Cx converges formally of third order with respect
to the norm in L 2 (fl). Advection-dominated transport of the mobile species
(electron donor and acceptor) introducing local numerical instabilities in the
solution can efficiently be captured by the streamline upwind Petrov-Galerkin
method (SUPG); cf. [3, 9].
For the temporal discretization of problems (1)-(5) we consider a mesh
{tn}, n = 0, ... ,N with to = 0 and tN = T, for the time variable t and define
Tn = tn+ 1 - tn. Due to the generally high stiffness of semidiscretizations to flow
and transport problems, implicit schemes should be preferred in the choice
of time-stepping methods for solving these problems. The backward Euler
method is robust and has excellent stability properties, but it is inaccurate
due to its first convergence order only and also strongly damping. So, it should
only be used for nonstationary calculations which aim to iterate towards the
steady limit. A scheme having similar stability properties as the backward
Euler method but being of second order accuracy is the two step backward
differentiation formula BDF2 which we use in our computations. Further, to
increase the efficiency of the calculations, an adaptive time stepping procedure
was developed and tested in [3] for the proposed discretization of the transport
and biodegradation model (1)-(5).
Now, we suppose that sequences {q(t n ),8(tn )} E Wl,OO(fl)x LOO(fl) are
explicitly prescribed. Let P Zh denote the L2-projection onto the finite element
space Zh. The discretization of the Monod model (1)-(5) by the Galerkin
method and the two step backward differentiation formula B D F2 then reads
as follows:
Set Cp = PVhCi,O and C!k = Pxhcx,o. For all time steps n = 0, ... ,N - 2
compute approximations C~+2 E Vh , i = D, A, and C~+2 E X h by solving the
equations
Computational Study of BTEX Transport and Biodegradation 117
n
C n+2
1 +:n:, e(~:::'( t- fl:) ~n+~n+'
C n+1 + cn + k C n+2
, X ~ (7)
for all nodes (Xj )j=l, ... ,M associated with degrees of freedom of C X+2 , where
By (.,.) and (., ')rR we denote the standard £2 inner product in £2([2)
and £2(rR), respectively. Further, in (6) and (7) we use the notation /'n+2 =
1 + Tn+d(Tn+1 + Tn), /'n+1 = 1 + Tn+dTn and /'n = T~+1/((Tn+1 + Tn)Tn).
The time step sizes Tn+l can be chosen adaptively; cf. [3]. Clearly, identity
(7) formulates a pointwise condition for all nodes associated with degrees of
freedom of C X+2 . Using instead of (7) a variational equation, analogously to
(6), leads to stability problems and severe oscillations.
As usual, for the test functions Vi we choose the basis functions of Vh .
Further, let C~ E X h , i = n, n + 1, n + 2, be represented in terms of the finite
element basis functions {Wj }~l of X h , i.e., X h = span{Wj 11 ::; j ::; M} and
x
C = L~l~;Wj, for i = n,n+l,n+2, where the vector ~i = (~L ... ,~k)
x'
denotes the degrees of freedom of C Then, (7) amounts to solving the system
of equations in the unknown vector ~n+2,
Fig. 1. Computational domain (left) for contaminated site Geretsried and initial
concentration of electron donor xylene (middle) and electron acceptor oxygen (right)
4 Computational results
We shall now present our computational results obtained for a "real world"
residual waste and thereby provide valuable insights into the complex interac-
tions of biological, chemical and physical processes that are involved in natural
Computational Study of BTEX Transport and Biodegradation 119
Fig. 2. Concentration of electron donor xylene at T = 20 days, 1.5 years and 4.5
years
References
1. J.F. Andrews. A mathematical model for the continuous culture of microorgan-
isms utilizing inhibitory substrates, Biotechnol. Bioeng., 10:707-723, 1968.
Computational Study of BTEX Transport and Biodegradation 121
Fig. 3. Concentration of electron acceptor oxygen at T = 20 days, 1.5 years and 4.5
years
12. W. Merz. Global existence result of the monod model, Adv. Math. Sci., accepted,
2003.
13. M. Ohlberger, C. Rohde. Adaptive finite volume approximations of weakly cou-
pled convection dominated parabolic systems, IMA J. Numer. Anal., 22:253-
280, 2002.
14. A.E. Scheidegger. General theory of dispersion in porous media, J. Geo-
phys. Res., 66(10):3273-3278, 1961.
15. M. Schirmer, J.W. Molson, E.O. Frind, J.F. Barker. Biodegradation modelling
of a dissolved gasoline plume applying independent laboratory and field param-
eters, J. Contam. Hydrol., 46:339-374, 2000.
A Two-Level Stabilization Scheme for the
N avier-Stokes Equations
1 Introduction
(v·\l)v-vi1v+\lp=f m D, (1)
divv = 0 in D, (2)
supplied with homogeneous Dirichlet boundary conditions:
v =0 on oD, (3)
and the normalization of the pressure
LPdX = o. (4)
In (1), f E L2(D)2 represents given data. For the weak formulation of (1)-(4)
we introduce the following notations:
for test functions ¢ = (?j;,~) EX. Now the weak formulation of (1)-(4) reads
in compact notation:
of freedom are present on those irregular nodes. The space W2h consists of
constants on each cell of a coarser mesh ~h obtained by one global coarsening
of Th : Each quadrilateral K E ~h is cut into four new quadrilaterals (dividing
all lengths of edges of K by 2) in order to obtain the fine partition Tt.,. Note,
that the functions in W2h are discontinuous across edges of elements of~h. We
indicate the subspaces of discrete functions respecting the Dirichlet condition
by an additional subscript Vh,o C Vh . The restriction to a mean value of zero,
cf. (4), is denoted by V,? The discrete solution Uh = (Vh,Ph) is searched in the
discrete space X h := V~ x V';o.
Furthermore, we use the L2-projections on the piecewise constants Pw
L2([2) ---+ W2h and the fluctuation operator 7rh : Vh ---+ Vh :
7rh:=I-Pw, (7)
Here, a and 0 denote piecewise constant functions which depend among other
things on the local cell size h K. The precise definition is:
OIK := mm
. (h'k hK )
-;;' Ilvlloo,K . (9)
(10)
One remarkable feature of (10) is, that the stabilization terms only act
on the diagonal of the coupled system. The structure of the stabilization is
unchanged, if additional lower order terms are added to the equations.
Our numerical experience shows that the resulting scheme has very similar
properties to SUPG concerning stability and accuracy. In the following we
present some aspects of the analysis of (10).
As the first step, we consider the proposed stabilization in the case of the Stokes
equations. It can be easily seen that the stabilization term (7rh \lPh, a7rh \l~) in
(8) leads to a larger stencil for the pressure then the original one coming from
the Galerkin part. The discrete problem for the Stokes equations reads: Find
Uh E X h such that
(11)
holds, where now
126 R. Becker, M. Braack
A similar error estimate holds true for the classical Taylor-Hood element
with biquadratic velocities and bilinear pressure (which does not require any
stabilization), see [7]. One might therefore wonder what the additional pressure
degrees of freedom in our scheme produce. The answer to this question is
provided in the following.
We denote by Vh E 1R2n the vector of coefficients of the function Vh with
respect to the canonical finite element nodal basis {1jJD, i.e., Vh = 2:i(Vh)i1jJ~.
Now, we split the pressure Ph into a coarse grid part Ph and small-scale fluc-
tuations p~:
(15)
i!
The coefficient vectors Ph and hare defined analogously by the nodal basis.
Then, the matrix representation of the linear system (11) reads
[ ~ -~)* -~')*l'~~l
B' S2 ~ S3
[~l 0,
where lh
has the obvious meaning. The matrix A stands for the Laplacian, B
and B' for the divergence, and S for the stabiliza~on.
Block elimination of the pressure component p~ leads to:
where A=A+D,
S = Sl - S~S:;lS2'
13 = B - S~S:;l B'.
We provide the following result as an interpretation of our findings.
We denote the l2 scalar product by (-, .). On quasi-regular meshes where all
cells are parallelograms, we have a mesh-size independent constant c > 0 so
that
! L Iidivvhllk::; (DVh' Vh) ::; c L
Iidivvhllk· (16)
C KETh KETh
Since S3 scales like h 2 , we get (DVh,Vh)::; c2:KETh Iidivvhlik. The proof for
the opposite direction
(19)
The stabilization term is defined similar to one part used for the Navier-Stokes
equations (8):
From the following stability result we obtain existence and uniqueness of the
discrete solution:
Lemma 1. We have
(20)
with
(21)
In the following we give an error estimate. The proof is very similar to the
classical one in [11] for the same equation supplied with SUPG stabilization.
However, notice that in contrast to the proof for SUPG, we only have control
over the streamline derivative of the fluctuations 1l'h[(!3' \7)Uh] in (21).
(22)
The estimate is similar to the standard estimate for SUPG or the discontinuous
Galerkin method, [11]. With respect to the interpolation error we loose a power
of 1/2.
Proof. By jh : V -+ Vh we denote the modified Scott-Zhang interpolation
operator introduced in [2] which has the following orthogonality property:
(23)
and allows for optimal interpolation in L2(D) and H1(D). That is, there exist
a constant C such that
We split the error as U - Uh = 'I] -~, ~ := Uh - jhu, 'I] := U - jhU. Due to this
interpolation result, it is sufficient to show
(24)
with (another) constant C for proving the assertion. We have:
The only critical terms are (((3·V)'I],~) and S(jhU, ~). We use partial integration
to obtain
(({3·V)'I],~) = + (({3·n)'I],~)
-('I],({3·V)~)
= -('I],7rh({3·V)~) + (({3. n)'I],~).
In the last line, we have used the orthogonality property (23) of the inter-
polation operator jh. Furthermore, the stabilization term can be bounded as
follows:
:::; h3/21IuIIH2(n)III~111
Here, the last line is obtained by stability of the L2 projection Pw and the
interpolation property of jh:
Now we get:
References
1 Introduction
We consider parameter identification problems involving a finite number of
unknown parameters in the following form: The state variable u in an appro-
priate Hilbert space V is determined by a partial differential equation (state
equation) in weak form:
a(u, q)(¢) = !(¢) Y¢ E V, (1)
where q E Q = lRnp denotes the unknown parameters. The form a is defined
on the Hilbert space V x Q x V and the linear functional! E V' represents
the right hand side of the state equation, where V' denotes the dual space of
V. Further, we are given an observation operator C : V ---+ Z, which maps
the state variable u to the space of measurements Z = lR nm , where we assume
nm ~ np. We denote by (., ·)z the scalar product of Z and by I . liz the cor-
responding norm. Similar notations are used for the scalar product and norm
in the space Q.
The values of the parameters are estimated from a given set of measure-
ments C E Z using a least squares approach such that we obtain a constrained
optimization problem with the cost functional J : V ---+ lR:
1 - 2
Minimize J(u) := 21IC(u) - CIIz (2)
132 R. Becker, B. Vexler
under the constraint (1). Here, the cost functional is the squared norm of the
residual R LS defined by
RLS(U) := C - C(u). (3)
The state equation is discretized by conforming finite elements on a regular
mesh Th, resulting in a finite element space Vh C V, see e.g. Ciarlet [8] for the
standard construction. In order to ease mesh refinement, the cells are allowed
to have nodes, which lie on midpoints of faces of neighboring cells. But at most
one such hanging node is permitted for each face, see Carey & Oden [7] for
implementation details.
The discrete state Uh E Vh and parameter qh E Q are determined by:
(5)
Due to the finite dimension of Q, we suppose the parameter qh in (4) to be
sought in the space Q.
2 Optimization algorithm
In this section we discuss an optimization algorithm for solving the prob-
lem (4, 5) on a fixed mesh T,..
Under the assumption ofregularity of the partial derivative a~, the implicit
function theorem in Banach spaces implies the existence of an open set Qo C Q,
containing the optimal parameter q, and a continuously differentiable solution
operator S : Qo ---+ V, q ---+ S(q), so that (1) is fulfilled for U = S(q). This
allows us to reformulate the problem (1, 2) as an unconstrained optimization
problem:
A Posteriori Error Estimates for Parameter Identification 133
G*(c(q) - C) = 0, (7)
Then, one simply proves that the entries of the matrix G are given by G ij =
CI(u)(Wj).
Similarly to the continuous case, we introduce a discrete solution operator
Sh : Q----7 Vh for the discretized state equation (5) and obtain an unconstrained
(9)
(10)
where G h = Ch(q~). The most widely used choice of the matrix Hk = G'h G h
leads to the Gauss-Newton algorithm, see e.g. Nocedal & Wright [10].
For one step of the Gauss-Newton algorithm the state equation and np
tangent problems (8) have to be solved which originate from the same linear
operator but with different right-hand sides. Due to the small dimension np of
the parameter space Q the solution of (10) is uncritical.
In this section we present an error estimator for the error with respect to a
given error functional E : Q ----7 JR. The precise error representation is given in
the following theorem. Here, we use an interpolation operator ih : V ----7 Vh ,
see e.g. Clement [9].
134 R. Becker, B. Vexler
if the least squares residual R LS (u) does not vanish; otherwise we set z = O.
The constant C does not depend on the mesh parameter h nor on the measure-
ments C.
Proof. For proof we refer to [5].
For evaluation of the error estimator, denoted by 'f/h, the local interpolation
errors y-ihY and U-ihU have to be approximated. In our numerical examples,
we use interpolation of the computed bilinear finite element solutions Yh and
Uh on the space of biquadratic finite elements on patches of cells. The main
computational cost for the a posteriori error estimator described above is the
solution of one auxiliary equation (12). This is cheap, even in comparison with
only one Gauss-Newton step, which includes solution of the state (nonlinear)
and of the several (linear) tangent equations.
In order to illustrate the typical use of the error estimator 'f/h, we sketch
a generic adaptive mesh refinement algorithm. Such an algorithm generates
a sequence of locally refined meshes and corresponding finite element spaces
until the estimated error with respect to E is below a given tolerance TOL.
A Posteriori Error Estimates for Parameter Identification 135
7. Increment k and go to 2.
4 Numerical result
In this section we compare our general approach to mesh refinement for param-
eter identification problems with some heuristic methods. We consider three
types of heuristic approaches for mesh refinement: a strategy based only on
the information obtained from the computed state variable, a strategy based
only on the a priori knowledge of the structure of the observation operator and
a strategy, which combines both types of information.
-qi1u + su = 2 in D,
(17)
u=o on aD,
o 0 0
o
under the constraint (17), where Ci denote the components of the measurement
vector C E Z = ]R9 and are given by the values of the state variable u for the
exact parameter q, i.e. Ci = U(~i).
First, we compare the quality of meshes generated by our a posteriori error
estimator for this problem with a typical strategy based on a posteriori infor-
mation obtained by the state variable, i.e with the mesh refinement guided by
one of the well-known "energy" type error estimators for uncontrolled equa-
tion, see e.g. Bank & Weiser [2] and Babuska & Miller [1]. This estimator aims
to control the error U - Uh in HI-norm, but they do not take care of the struc-
ture of the parameter identification problem. As seen from Figure 2, adaptive
refinement based on the "energy" estimator leads to a similar reduction of the
error as global refinement. However, the strategy based on our error estimator
leads to an obvious saving in the number of unknowns necessary to achieve a
prescribed accuracy level.
global - + -
ene~~~ ~~.~~~'-
0.01
0.001
0.0001
Fig. 2. Errors in q for different refinement strategies vs. number of nodes (global
refinement, "energy"-based refinement and refinement resulting from our a posteriori
error estimator)
A Posteriori Error Estimates for Parameter Identification 137
Next, we compare our strategy for mesh refinement with the following
heuristic approach: In each iteration of the mesh refinement we refine the
cells, which lie close to one of the measurement points, i.e. in the set
global_
r=O.l ---)(---
r=zO.04 ....... .
our ... -8-.,-
0.01
0.001
0.0001
Fig. 3. Errors in q for different refinement strategies vs. number of nodes (global
refinement, refinement across measurement points and refinement resulting from our
a posteriori error estimator)
After several steps one does not observe any error reduction despite in-
creasing the number of nodes for both choices of r in the described strategy,
as could be expected. Typical meshes resulting from application of our a pos-
teriori error estimator, "energy" estimator and the last strategy are shown in
Figure 4.
Fig. 4. Typical meshes produced by our a posteriori error estimator (left), energy
error estimator (right) and the refinement across the measurement points
5 Conclusions
We presented an a posteriori error estimator for finite element discretization
of parameter identification problems. This error estimator is cheap to evaluate
global ----+--
r=O.l ---)(---
r=O.04 ... * ...
0.0001
Fig. 5. Errors in q for different refinement strategies vs. number of nodes (global re-
finement, refinement produced by combing the refinement across measurement points
and "energy" -based refinement, and refinement resulting from our a posteriori error
estimator)
A Posteriori Error Estimates for Parameter Identification 139
and assess the error we are interested in, i.e. the error in parameters. We com-
pared our approach with some heuristic methods with respect to the quality
of the generated meshes. The presented error estimator is successfully applied
to parameter identification in CFD problems, see Becker & Vexler [6J and to
estimation of chemical models in multidimensional reactive flows, see Becker,
Vexler & Braack [3J.
References
1. Babuska, I., Miller, A.D. (1987): A feedback finite element method with a pos-
teriori error estimation. Comput. Methods Appl. Mech . Engrg, 61:1-40
2. Bank, R.E., Weiser, A . (1985): Some a posteriori error estimators for elliptic
partial differential equations. Math. Comp., 44:283-301
3. Becker, R., Braack, M., Vexler, B. (2003) : Numerical parameter estimation for
chemical models in multidimensional reactive flows. Combustion Theory and
Modelling, submitted
4. Becker, R., Rannacher, R. (2001): An optimal control approach to a posteriori
error estimation in finite element methods. In Acta Numerica 2001 (A. Iserles,
ed.), Cambridge University Press, Cambridge
5. Becker, R., Vexler, B. (2003): A posteriori error estimation for finite element dis-
cretization of parameter identification problems. Numerische Mathematik, pub-
lished online
6. Becker, R., Vexler, B. (2003): Calibration of PDE Models and Sensitivity Anal-
ysis with Adaptive Finite Elements: Application to CFD Problems. Journal of
Computational Physics, submitted
7. Carey, C.F., Oden J .T . (1984): Finite Elements, Computational Aspects. Vol. III.
Prentice-Hall
8. Ciarlet, P.G. (1978): The Finite Element Method for Elliptic Problems. North-
Holland Publishing Company, Amsterdam
140 R. Becker, B. Vexler
9. Clement, Ph. (1975): Approximation by finite element functions using local reg-
ularization. [J] Revue Franc. Automat. Inform. Rech. Operat. 9(R-2), 77-84
10. Nocedal, J., Wright, S.J. (1999): Numerical Optimization. Springer Series in
Operations Research, Springer New York
On a Phase-Field Model with Advection
Michal Benes
(2)
au I au
-;;;--- I = L(vr - V· nr) on r(t),
anr s unr I
For details of physical context, we refer the reader to [8, 10]. Obviously,
the above given physical problem simplifies the problem of density driven flow
around the solidifying structure in a real material. In this general case, the
law of momentum conservation would be needed, and the boundary condi-
tions should also correspond to the conservation of quantities in question. Our
purpose is to study the problem (2) by a diffuse-interface approach which al-
lows us to design a suitable numerical algorithm and to perform qualitative
numerical studies.
(3)
with the initial conditions U It=o= Uini, P It=o= Pini, and with the homoge-
neous Dirichlet boundary conditions (set for the sake of simplicity). Addition-
ally, we assume that F(u) is a bounded Lipschitz-continuous function.
The enthalpy of the system 1i( u) = u - LX(p) is expressed by means of
a focusing function X, which is monotone with bounded, Lipschitz-continuous
derivative:
where w~ = -fa, IF(u)1 ::; C F , V = IIVII=. They allow us to use the com-
pactness method in the same manner as it is presented in [2]. Namely, the
theorem assumptions together with the above estimates processed by the
Gronwall lemma give that, independently of m, \7u ffi , \7pffi are bounded in
L=(O, T; L 2(D)), and pffi are bounded in L=(O, T; Ls(D)) for each finite time
8 8
T > 0, and for any s ::::: 1. Repeated integration says that ~t' ~t are
~ ~
1 d 2 2
2 dt II u I2 - LPI211 + IIV(UI2 - Lp12) II + L(Vp12 , V(UI2 - LpI2))
+(V . V(UI2 - LpI2)), UI2 - Lp12) = ° in (0, T),
1 2d 2 ~2 12 ( )
2a~ dt Ilp1211 + 211VPI21 ::; 8
3 2 2 7a 2 2 2 2 2 2 2 2 I 112
2(CF~ + 6""" + a V ~ + 2L ~ L F C 4 1I V pIilL 4 (!2)) IPI2
+3e L}clIIVpIilL(!2) IIUI2 - LpIZII2.
Combining these inequalities, we have in (0, T):
146 M. Benes
(9)
with
where we denoted
We therefore can state that the function p = p(t, Xi~) tends to a stepwise
constant function as in the Theorem 2.2 of [4]. As in [2], the matching procedure
recovers the Gibbs-Thompson law a(vr - V . nr) = -K,r + F(u~) + O(e), as
well as the Stefan condition at r.
Remark. As indicated in [1], the model (4) can be modified by incorporating
anisotropy into coefficients of (1). The above given analysis is applicable again,
namely for weak anisotropies.
On a Phase-Field Model with Advection 147
and
The set of grid functions is denoted by 'Hh. The semi-discrete scheme has the
following form
d - h h h
(dt +V . "h)(u - LX(p )) = Llhu ,
u h IT'h = 0, ph IT'h = 0,
uh(O) = PhUini, ph(O) = PhPini,
where its solution is a map uh,ph :< 0, T >---> 'Hh and Ph : C(Q) ---> 'Hh is
a restriction operator. The stability and convergence of the scheme can be
investigated in a way very similar to the proof of the Theorem 1. The scheme
is designed to meet real conditions, where the diffusion and growth process
dominates the advection.
'......\ph854caruw -
0.2
0.15
0.15
0.1 0.1
0.05
References
1. M. Benes, Anisotropic phase-field model with focused latent-heat release, FREE
BOUNDARY PROBLEMS: Theory and Applications II (Chiba, Japan), 2000,
GAKUTO International Series Mathematical Sciences and Applications, Vo1.14,
pp.18-30.
2. ___ , Mathematical analysis of phase-field equations with numerically efficient
coupling terms, Interfaces and Free Boundaries 3 (2001), 201-221.
3. ___ , Mathematical and computational aspects of solidification of pure sub-
stances, Acta Mathematica Universitatis Comenianae 70, No.1 (2001), 123-
152.
4. M. Benes and K. Mikula, Simulation of anisotropic motion by mean curvature-
comparison of phase-field and sharp-interface approaches, Acta Math. Univ.
Comenianae 67, No.1 (1998), 17-42.
5. Y.-G. Chen, Y. Giga, and S. Goto, Uniqueness and existence of viscosity solutions
of generalized mean curvature flow equations, J. Diff. Geom. 33 (1991), 749-786.
6. L.C. Evans, H.M. Soner, and P.E. Souganidis, Phase transitions and generalized
motion by mean curvature, Comm. Pure Appl. Math. 45 (1992), 1097-1123.
7. L.C. Evans and J. Spruck, Motion of level sets by mean curvature I, J. Diff.
Geom. 33 (1991), 635-681.
8. M. Gurtin, On the two-phase Stefan problem with interfacial energy and entropy,
Arch. Rational Mech. Anal. 96 (1986), 200-240.
9. T. Ohta, M. Mimura, and R. Kobayashi, Higher-dimensional localized patterns
in excitable media, Physica D 34 (1989), 115-144.
10. A. Visintin, Models of phase transitions, Birkhiiuser, Boston, 1996.
150 M. Benes
Pattern
... solid
-
r
j
liquid
Fig. 4. Solidification of three falling dendrites - parameters are u* = 1.0, Uini = 0.0,
L = 2.0, (3 = 300, a = 4.0, ex = 3, Ll = L2 = 3.0, initial radius = 0.025, Nl = N2 =
e
300, = 0.015
Fast Evaluation of Eddy Current Integral
Operators
Steffen Barm
Summary. Boundary element formulations for eddy current problems are based on
non-local operators. Discretizing these operators by standard Galerkin techniques
leads to large dense matrices. In order to treat the discretized system efficiently, we
cannot store these dense matrices directly, but use data-sparse approximations.
We present an approach based on piecewise polynomial interpolation of the un-
derlying kernel functions. The resulting rf? -matrix approximation can be stored using
only O(nm 3 ) units of storage, where n is the number of degrees of freedom and m
is the order of the interpolation. Construction and evaluation of the approximated
matrix requires only O(nm 3 ) operations.
This paper presents joint work with Jiirg Ostrowski.
1 Introduction
1.1 Problem
for the unknown vector field u E V and the unknown potential ¢ E W, where
the bilinear forms have the form
+ sparse.
152 S. Barrn
Here, <I> denotes the singularity function for the Laplace equation, i.e.,
1
<I>(x, y) = 471"IIX _ yll ' (2)
1.2 Compression
2 Approximation
2.1 Discretization
( A-B)
_Q = rhs
_BT
defined by
2.2 Simplification
For b E Vh and 'ljJ E Wh, the functions curlr band curlr'ljJ are piecewise
constant. Therefore we introduce the auxiliary space X h spanned by piecewise
constant basis functions Xt for t E T and the auxiliary matrix G defined by
hold for sparse matrices L1 and L2 . This representation is more efficient than
the original one, but it is still based on a densely populated matrix.
3 Approximation
3.1 Interpolation
where (X~)~=1 and (X~)~=1 are interpolation points and (£~)~=1 and (£~)~=1
are the corresponding Lagrange polynomials.
This results in an approximation of the local matrix
G;t := 11 11
Xt(X)<p(x, Y)Xs (y) dy dx ~ Xt(X)$T,O' (x, Y)Xs (y) dy dx
=
1/=11'=1 "---v--" T v " 0' v '
=:V~J.t
The approximation requires only 2nk + k 2 units of storage. For typical interpo-
lation schemes, k will be much smaller than n, so the factorized representation
will by much more efficient (d. Figure 1).
Typical interpolation schemes, e.g., tensor-product Chebyshev interpola-
tion, work only for smooth functions. Since the function <P is not globally
154 S. Borm
smooth, we cannot hope to find a factorized representation for the entire ma-
trix G. Instead, we make use of the fact that {P is asymptotically smooth, i.e.,
that there are constants Casymp, co, dE lR>o such that
(5)
has to hold for a constant 'TJ E lR>o (cf. Figure 2).
into sub-domains that either fulfil this condition or are so small that we can
store the corresponding local matrix block GT,U in the standard format.
We construct the splitting of rh x rh using a hierarchical splitting of the
domain r h that is called a cluster tree:
Definition 1. A tree C is called a cluster tree for a set rh if
- the set rh is the root ofC, i.e., root(C) = rh, and
- if a node T E C is not a leaf, then it is the (up to sets of measure zero)
disjoint union of its sons, i. e.,
Since Xt and L~ are polynomials, the entries of the matrix can be computed
by standard quadrature.
Since the coefficient matrix ST,U is defined by
ST,U '= <l?(x T XU)
VI" . v' I"
(6)
TXUEPfar TXO"EPnear
The approximation (6) requires O(nk log n) = O(nm 3 log n) units of storage,
and the approximation error decreases exponentially in m.
156 S. Barm
Let T be a cluster and T' one of its sons. Since we use the same order of
interpolation for all clusters, we have
k
which implies (7)
r/Esons(r)
so we need to store the n x k-matrix Vr only for clusters without sons, since
we can reconstruct it for all other clusters by using the k x k-matrices Tr',r
(cf. Figure 3). This reduces the storage complexity to O(nk) = O(nm 3 ).
The equation (7) can also be used to perform the matrix-vector multiplication
efficiently: We introduce the auxiliary variables
uO' := VO' T u,
a,TXaEPfar
For typical domain splittings, the set {(J : T x (J E Prar} contains only a small
number of elements, so the computation ofyT can be done in complexity O(nk)
for all T E C. This leaves us with the task of evaluating uO" = VO" T U and VTyT
efficiently. Due to (7), we have
, T ,T
uO" = VO"T U = TO" 'O" VO" u =
0"' Esons( 0") 0"' Esons( 0")
so we can use a recursive procedure to compute the vectors uO" for all (J E C
in O(nk) operations. A similar recursion can be used to construct LTEC VTyT
in O(nk) operations.
Now that we know how to store and multiply by the matrices A and Q m
complexity O(nk), we only have to handle the matrix B efficiently.
We replace if> by the local approximations ;PT,O" and grad x if> by grad x if>T,O"
and recall the sparse lifting matrix L2 from (3) in order to find that
B=
6 Numerical experiment
We approximate the auxiliary matrix G on the unit sphere using a local inter-
polation operator of m = 2. We use the admissibility condition (5) for T) = 2
and find the results given in Table 1. We can see that the approximation error
is stable, while the memory and time requirements grow linearly.
158 S. Borm
References
Summary. An adaptive method for reactive flows involving locally refined meshes
and different types of diffusion models is proposed. Starting with a less exact diffusion
model, the model is changed locally throughout the computational domain to a more
accurate and much more expensive model. An a posteriori error estimator provides
reliable information on where to refine the mesh and where to adapt the model.
Discretization and modeling errors are equilibrated.
1 Introduction
The underlying equations describing reactive flows are well-known but may
involve models of different complexity, scales and accuracy. In various cases, the
most accurate and validated model cannot be chosen in numerical simulations
because of the large amount of computational costs. Simpler models usually
need less computing time and involve less couplings between variables. For
instance, the choice of diffusion models in gas mixtures is not straightforward.
Although multicomponent diffusion models are accepted to be accurate [8],
simpler and less accurate models, e.g. Fick's law, are widely used in practice
for two- and three-dimensional simulations. While simple diffusion models, as
for instance Fick's law, may involve only diagonal diffusion, multicomponent
diffusion models leads to couplings between all chemical variables. For implicit
solvers, these couplings lead to a huge fill-in in the (sparse) Jacobians. Due to
the resulting high numerical cost of complex models, it is desirable to apply
the complex diffusion model only in those regions of the computational domain
where necessary; for instance in the flame front where a complex balance of
reaction, convection and diffusion phenomena takes place. However, it is not
a priori known, where an accurate diffusion model is necessary. In this work,
we present an adaptive method which automatically detects the regions where
an accurate diffusion model is important.
In the previous work [6]' the mathematical background for a posteriori
control of modeling errors and discretization errors is given. Other work ad-
dressing the estimation of modeling error includes [9, 13, 14]. For measuring
the modeling error the variational formulation of the partial differential equa-
tion together with a duality argument is used. The a posteriori error estimator
160 M. Braack, A. Ern
conductivity>.. For each species k, we have the molecular weight mk, the
specific enthalpy hk' and the molecular production rate Wk. The viscous tensor
7r is given as usual for compressible Newtonian fluids. For ease of presentation,
in the form a3(u)(¢) the effect of temperature flux due to diffusion fluxes of
species with different specific heat capacities is neglected.
The form a3(u)(¢) does not yet include species diffusion. We consider the
following two models for the mass diffusion fluxes Fk:
Fick's law: diagonal diffusion driven by the gradient of mole fractions Xk =
Ykm/mk,
(2)
The diffusion coefficients Dr, = Dr, (y) are given by an empirical law which
is about 10% accurate, see [12].
Multicomponent diffusion: a full diffusion matrix driven by the gradients
of species mole fractions:
The diffusion coefficients Dkl are given only implicit as solutions of linear
systems. Therefore, the computational costs are higher as for the previous
diffusion model. However, this form of diffusion flux can be derived by the
theory of gases, see [8, 10].
Recent DNS computations [7] investigate the differences of these two models
and show a legibly impact of model (ii) especially for lean and rich hydrogen
flames.
Partial integration of the multicomponent model (3) leads to the semi-linear
form
3 Discretization
The discrete counterpart of the equations is the basis for the a posteriori error
estimator we need later for the adaptive procedure.
162 M. Braack, A. Ern
In order to switch locally between the diffusion models (2) and (3), we
introduce a (symbolic) parameter m and a subdomain am of n. In am, we
use multicomponent diffusion, and in a f := a \ am the simpler Fick's law.
We define the diffusion part
s
dm(u)(¢) := L -{(F/:, \lTk)n~ + (F(, \lTk)nf} '
k=O
and formulate a perturbed solution Um E V solving
a(u m)(¢) + dm(u m )(¢) = (j, ¢)
V¢ E V.
The way to determine the subdomain am will be explained later on.
The discretization is done by conforming finite elements on a triangulation
~ of a. We denote the corresponding space by Vh C V. In order to handle
the convective terms and the stiff pressure-velocity coupling one should add
stabilization terms Sh(·,·) to the discrete systems. Such stabilization can be
done in different ways. We use the stabilization concept for the v, p-coupling
proposed in [1] and for the convective terms in [11] and [2]. To this purpose, we
need certain restrictions on the meshes used. We assume that the triangulation
Th is organized patch-wise: ~ results from a global refinement of a mesh ~h.
Note that ~ contains in two dimensions twice as much hanging nodes as ~h.
The same construction is possible in three dimensions.
By i~h : Vh --* V2h we denote the nodal interpolation to the coarse grid, and
by 7rh := i - i~h : Vh --* Vh the projection operator which filters the small-scale
fluctuations The stabilization form reads now:
s(u)(¢) = (\l7rhP, op \l7rh~) + ((f3v . \l)7rhV, ov(f3v . \l)7rh'l/J)
s
+(f3r . \l7r hT , orf3r . V 7rhCJ ) + L(f3v . \l7rhYk, Okf3v . \l7rhT) ,
k=1
with f3v := pv, f3r := (pcpv + 0:) and piece-wise constant functions
op, ov, or, 01, ... , Os, depending on the mesh-size h. For further details, we refer
to [2].
This leads us to the definition of the discrete residual
ehm(U)(¢) := (j, ¢) - a(u)(¢) - dm(u)(¢) - Sh(U)(¢).
The reduced discrete system to be solved reads
Uhm E Vh : ehm(Uhm)(¢) = 0 V¢ E Vh .
The difference between the continuous and the reduced discrete residual is
ghm(U)(¢) := e(u)(¢) - ehm(U)(¢)
-d(u)(¢) + dm(u)(¢) + Sh(U)(¢)
L(F/: - F(, \lTk)nf + Sh(U)(¢).
k=O
This is the difference of the two diffusion models in the domain af' where
Fick's law is used, and the additional stabilization terms, which are usually
small. Since, in this work we do not focus on adaptively chosen stabilization
terms, we neglect the contribution Sh(U)(¢) in the expression ghm(U)(¢).
Adaptive Computation of Reactive Flows 163
4 A posteriori control
For the local refinement of the mesh and the adaptivity respect to the diffusion
model, we need an a-posteriori error estimator which gives us information
of the two error contributions. We aim to measure the error respect to an
arbitrary output functional j : V ~ JR. We formulate the following dual
residuals
i?*(U)(Z, cjJ) := j(cjJ) - a'(u)(cjJ, z) - d'(u)(cjJ, z),
i?hm(u)(z, cjJ) := j(cjJ) - a'(u)(cjJ, z) - d'r,,(u)(cjJ, z) - s~(u)(cjJ, z).
We use the dual solution Z E V to capture the influence of the error to the
functional:
Z E V: i?*(u)(z,cjJ):= 0 'VcjJ E V.
The corresponding reduced discrete version reads:
Zhm E Vh : i?hm(Uhm)(Zhm, cjJ) := 0 'VcjJ E Vh
The errors will be denoted by eu = u - Uhm and e z = Z - Zhm. We recall the
error representation in [6] wherein a proof is also given.
Theorem 1. If the semi-linear forms a(u)(.), d(u)(.), Sh(U)(cjJ) and the func-
tional j (u) are sufficiently differentiable with respect to u, then it holds
1
j(u) - j(Uhm) = ghm(Uhm)(Zhm) + 2"{ghm(uhm)(eZ) + g~m(Uhm)(eu, Zhm)}
1
+ 2"{i?hm(Uhm)(z - ihz) + i?hm(Uhm, Zhm)(U - ihu)} + R,
where ih : V ~ Vh is an arbitrary interpolation operator and a remainder R
which is cubic in the error e = {e u , e z}.
For the specific form of the remainder R and a proof of this Theorem, we refer
to [6].
The error representation of the Theorem stated above cannot be directly
used numerically, because it involves the unknown primal and dual solutions u
and z, respectively. However, the first term ghm(Uhm)(Zhm) can be easily com-
puted, because it depends only on the reduced discrete solutions Uhm and Zhm.
Furthermore, the terms ghm(Uhm)(e z ) and g~m(Uhm)(eu, Zhm) are quadratic in
the modeling error since they involve both, e as an argument and the difference
of the two diffusion models in the expression ghm' We neglect these contribu-
tions in the estimator. However, if more accuracy of the estimator is required,
these terms can also be approximated. Evaluation of the remainder R is not
worthwhile, because it is cubic in the error.
The terms i?hm(Uhm)(Z - ihz) and i?hm(Uhm, Zhm)(U - ihu) describe the
discretization error and have to be approximated by a numerical evaluation
of the interpolation errors Z - ihz and U - ihu. An efficient possibility to
do this, is the recovery process of the computed quantities by higher-order
polynomials, see [4]. For instance, in the case of quadrilaterals and piecewise
164 M. Braack, A. Ern
Taking into account that the residuals ehm(Uhm)(¢) and ehm(Uhm, Zhm)(¢)
with respect to a discrete test function ¢ E Vh vanish, leads to the following
estimator T) consisting of two parts
j(U) - j(Uhm) ::::; T) := T)h + T)m, (4)
._ 2"1{ ehm (Uhm )(.(2)
T)h . -
) * ( )(.(2))} , (5)
22h Zhm + ehm Uhm, Zhm 22h Uhm
;=1
A proof is given in [6]. Further details, especially concerning the adaptation
strategy for equilibrating both error contributions are given in [5].
Fjg. 1. O-atom mass fraction for the ozone flame. The maximum value is 1.24.10- 3 .
166 M. Braack, A. Ern
Table 1. Results for the ozone flame: number of nodes (#nodes); fraction of cells
flagged for multicomponent diffusion (% of multi.); estimator of the discretization
error 'f/h; estimator of the model error 'f/m; their sum 71; the true error j(u - Uhm);
the effectivity index Jeff
10
0.1
0.01
0.001
500 1000 2000 4000 8000 16000 32000 64000
# nodes
Fig. 2. Ozone flame: estimator 'f/m for the modeling error; estimator 'f/h for the
discretization error; their sum 71; true error j(u - Uh) as a function of the number of
nodes (mesh points)
However, the error estimator is reliable since it provides an upper bound for
the actual error.
The adaptive algorithm balances both types of errors by adapting the mesh-
size and the model. Figure 2 illustrates how the two sources of error are equi-
librated. We plot the estimators 'TJh and 'TJm, their sum 'TJ, and the true error
j (u - Uhm) as a function of the number of mesh nodes. The estimator and the
true error clearly show the same asymptotic behavior.
The sequence of locally refined meshes with 1047, 2085, and 4871 nodes
is shown in Figure 3. The darker areas indicate the part of the computational
domain where the multicomponent diffusion model is used. In the remaining
(light) part, the simple Fick law is used. We observe that the estimator de-
Adaptive Computation of Reactive Flows 167
Fig. 3. On the upper half of each picture, the areas where multicomponent diffusion
is used (dark/red areas) and where Fick's law is used (light areas) are indicated; the
lower half shows the corresponding locally refined mesh
tects quite well the reaction area where a difference in both diffusion models
influences the accuracy of the output functional j(u).
References
1. R. Becker and M. Braack. A finite element pressure gradient stabilization for
the Stokes equations based on local projections. Calcolo, 38(4):173-199,2001.
2. R. Becker and M. Braack. A two-level stabilization scheme for the Navier-Stokes
equations. In Feistauer, editor, Enumath Proceedings, Berlin, submitted, 2003.
Springer.
3. R. Becker, M. Braack, and R. Rannacher. Numerical simulation oflaminar flames
at low Mach number with adaptive finite elements. Combust. Theory Modelling,
3:503-534, 1999.
4. R. Becker and R. Rannacher. An optimal control approach to a posteriori error
estimation in finite element methods. In A. Iserles, editor, Acta Numerica 2001.
Cambridge University Press, 2001.
5. M. Braack and A. Ern. Coupling multimodeling with local mesh refinement for
the numerical solution of laminar flames. in preparation, 2003.
6. M. Braack and A. Ern. A posteriori control of modeling errors and discretization
errors. Multiscale Model. Simul., 1(2):221-238, 2003.
7. J. de Charentenay and A. Ern. Multicomponent transport impact on turbulent
premixed H2/02 flames. Combust. Theory Modelling, 6:439-462, 2002.
8. A. Ern and V. Giovangigli. Multicomponent Transport Algorithms. Lecture Notes
in Phys\cs, m24, Springer, 1994.
9. L. Fatone, P. Gervasio, and A. Quarteroni. Multimodels for incompressible flows:
iterative solutions for the navier-stokes/oseen coupling. M2AN, Math. Model.
Numer. Anal., 35(3):549-574, 2001.
168 M. Braack, A. Ern
Denoting the L2 inner product and norm by (', ')0 and I . 10, the following
quadratic functional J : HJ (D) x H( div ; D) -> lR
where the bilinear form B : [HJ(D) x H(div; D)] x [HJ(D) x H(div; D)] ~ lR.
is defined by
B(w, r; v, q) = (div r, div q)o + (r + AV'w, A -1 (q + AV'v))o. (5)
It was proved in [4] that B is continuous and coercive. The proof for coercivity
was in our opinion somewhat tedious and may also be derived as a simple
corollary of the following lemma. This lemma, which may be called a strength-
ened Cauchy-Schwarz inequality, may also serve useful in a different context.
Firstly, define norms on H(div; D) and HJ(D) and on their product space by
Ilqll~iv,A = (A-lq, q)o + Idivql5, IvitA = (AV'v, V'v)o, (6)
Multiply (11) by d/,6 and square the result, and add it to the square of (12).
This gives
d2 v ll,A
d2 ) I( q, V'v)o I2 :::; ,62I
( 1 + ,62 2 ( (A -1 q, q)o + Idlv
. q 12)
0 = ,62
2
d Iv 121,A II q 112div,A-
(13)
This proves the statement. <>
Remark 1. Instead of using Ivlo :::; dlvll :::; d/,6lvll,A as in (12), we might also
have chosen to define just one single constant 1] giving rise to a possibly smaller
constant, in (9) as follows,
o< ,= ~+ 1]2 1
with 0 < 1] = sup
O#VEH.j (st)
~
IVIr,A
(14)
Alternative Least-Squares Mixed Elements 171
(15)
This seems a more symmetric and in a sense even stronger result. However,
the norm [[ . [[ 1,A is too strong for the applications to be discussed below.
Corollary 1. With"( as in (10), for all non-zero (v, q) E HJ([2) x H(div; [2)
we have
B(v, q[v, q) ~ (1- "()[[(v, q)[[ixdiv,A > O. (16)
Proof. By definition of B we find
B(v,q[v,q) = [[(v,q)[[IxdivA+2(q,
, \7v)o ~ [[(v,q)[[ixdivA-2[(q,
, \7v)o[. (17)
The continuity and coercivity of B give that the Lax-Milgram lemma assures
the existence of a unique pair (Uh,Ph) E Vh x rh of approximations of (u,p).
Cea's lemma now implies quasi-optimality in the sense that for all (Vh, qh) E
HJ([2) x H(div; [2),
From this perspective, it may seem as if it is not wise to employ spaces Vh and
rh which have different approximation qualities, because the approximation
quality of the product space is not better than that of the worst of the two.
However, it can be shown that under rather weak conditions, there is still a
certain amount of independence present between the two approximations. In
terms of the linear algebraic problem that arises from (4) this means that the
two diagonal blocks of the system matrix, which correspond to the interactions
of Vh with itself and of rh with itself, are only weakly coupled by the two off-
diagonal blocks, which represent the interactions between Vh and rho In fact,
172 J.H. Brandts, Y.P. Chen
this coupling is so weak that solving just two well-chosen linear systems with
the diagonal blocks only, leads to approximations u~ and p~ for which
Let (Vj)~l be a basis for Vh and (qj)f=l a basis for rho Define matrices
S = (Sij), C = (Cij), D = (d ij ) and G = (gij) by means of their entries
Write u~ and p~ for the finite element functions corresponding to the vectors
u'Jvt
and p~, respectively. Then the counterpart of (23) in terms of the finite
element spaces is to write out (4) and substitute u~ for Uh, resulting in
Notice that
Alternative Least-Squares Mixed Elements 173
(D + G)p~ = fN - Cu~,
given u~, iterate { (26)
Su{jl = -C*p~.
If we write u{ and p{ for the finite element functions that correspond to the
vectors u~ and p~, respectively, this iteration reads as
(29)
and
Iuh - u Hl12
h I,A =- h , d'IV (Ph - Phj)) .
(uh - u HI (30)
Applying Green's formula and Lemma 1 gives the statement. I)
This proves convergence of the Block Gauss-Seidel iteration (21) with a con-
vergence factor that is independent of the dimension of the subspace. In each
step of the Block Gauss-Seidel iteration, a system with S and one with D + G
should be solved. Even though for many finite element spaces this can be
done in optimal computational complexity with MultiGrid solvers [1], there is
no need to do so more than once if the right start vector is chosen. Indeed,
choosing u~ := U h, the standard finite element approximation of the problem
defined via the linear system SUM = fM, and computing p~ is already suffi-
cient in most situations. This procedure costs only one solve with S and one
with D+G.
- (rh' div rh) is a Babuska Brezzi stable mixed finite element pair,
- For all f E L2(D), the solution u of (1) satisfies II u l12 ::; Clflo,
- \Iv E HJ(D) n H2(D), 3Vh E Vh, Iv - vhll ::; Chlvl2,
- \lq E [Hl(D)j2, 3qh E rh, Iq - qhlo ::; Chlqll,
- \lq E [H2(D)j2, 3qh E rh, Idivq - divqhlo ::; Chlql2,
we have that
Proof. The assumptions above imply that IUh - Ui,ll ::; Chll(u - Uh,P-
Ph)IIIxdiv, which is a super closeness result proved in [2]. By equivalence of the
norms defined in terms of A with their counterparts defined by taking A = I,
we may switch to A norms. Corollary 2 then shows that Ilph - p~lldiv,A shares
the same upper bound. Hence, the statement follows. <>
3 Conclusions
The above shows that under rather weak conditions, ui" p~ are higher order
perturbations of the least-squares mixed finite element solutions Uh, Ph. Since
the computation of ui" p~ requires only one solve with S and one with D +
G, they are much cheaper to compute than Uh, Ph, whereas they cannot be
distinguished from one another. Hence, it does not make sense to apply the
least-squares mixed method under these conditions.
In fact, instead of putting energy into solving for p~, it is also possible 1 to
construct yet another approximation for Ph in rh by means of a simple local
post-processing ui,. It is however not clear when or if such a postprocessed
approximation will also be a higher order perturbation of Ph, as p~ is.
Acknowledgments
References
1. Arnold, D.N., Falk, R.S, Winther, R. (1997). Preconditioning in H(div; Q) and
applications. Math. Comp., 66, 957-984.
1 We thank Prof. R. Rannacher for pointing this out.
Alternative Least-Squares Mixed Elements 175
2. Brandts, J.H., Chen, Y., Yang, J. (2003). Analysis of least-squares mixed finite
elements in terms of standard and mixed elements. UvA Numerica Preprint 08,
University of Amsterdam, Netherlands. Submitted.
3. Pehlivanov, A.I., Carey, G.F., Lazarov, R.D. (1994). Least-squares Mixed Finite
Elements for Second Order Elliptic Problems. SIAM J. Numer. Anal., 31, 1368-
1377.
4. Pehlivanov, A.I., Carey, G.F., Vassilevski, P.S. (1996): Least-squares mixed finite
element methods for non-selfadjoint elliptic problems. I. Error estimates. Numer.
Math., 72, 501-522.
Limit Analysis Method in Electrostatics
Igor A. Brigadnov
Summary. The limit analysis problem (LAP) for estimation of electric durability
for a dielectric in a powerful electric field is examined. The appropriate dual problem
is formulated. After the standard piecewise linear continuous finite-element approxi-
mation the dual LAP is transformed into the problem of mathematical programming
with linear equality constraints. This finite dimension problem is effectively solved
by the standard method of gradient projection.
1 Introduction
Investigation of the electrostatic boundary-value problems (BVPs) for di-
electrics in a powerful electric field is of particular interest in both theory
and practice. The current research is motivated by significance and practical
interests in Electrical Engineering and Microelectronics.
The electric state of a medium in a given domain fl C ]R3 is characterized by
the bulk and surface density of charges and by vectors of electric field intensity
E = {Ed E ]R3, electric induction D = {Dd E ]R3 and electric current density
J = {Ji } E ]R3. Vector D is introduced by the relation D = coE + P, where
co ~ 8.85.10- 12 is the electric permittivity of a vacuum and P E ]R3 is the
vector of polarization density [11, 14, 16]. For the electric field intensity the
electrostatic potential u is introduced such that E(x) = - Vu(x) for almost
every x E fl, where V = a/ax is the differential vector-operator.
In weak electric fields the conductivity current in a dielectrical medium
is practically absent and the simplest linear constitutive relation E f---+ D is
used [11, 14, 16]. As a result, for the solution of the appropriate linear BVPs
various effective analytical and numerical methods have been worked out, for
example, in [15].
The classic method of estimation of puncture conditions is based on the
point criteria. Namely, it is assumed that the electric puncture sets in when
max{IE(x) I : x E fl} :2: Eo, where E is the solution of the linear electrostatic
BVP and Eo > 0 is the critical value, which is measured in physical experi-
ments on thin plates in a homogeneous electric field [15, 16]. Unfortunately,
for dielectrics with a complex shape in nonhomogeneous electric fields this
method introduces a large error.
In powerful electric fields the essentially nonlinear phenomena of polariza-
tion saturation (IPI :::; p* < +(0) and powerful growth of the electric current
Limit Analysis Method in Electrostatics 177
(aIJI/aIEI » Eo) must be taken into account [9, 11]. As a result, the com-
plimentary physical parameter of the dielectrical medium A > 0 always exists
such that ID - JI :S A < 00.
Within the framework of the variational method, the existence of the limit
electrostatic load (such external charges with no solution of the electrostatic
BVP) was pointed out in [6, 7]. From the physical point of view this effect is
treated as a loss of electrostatic balance, i.e. as the beginning of the electric
puncture of dielectric. For calculation of the limit electrostatic load the original
variational limit analysis problem (LAP) was formulated. From the mathemat-
ical point of view this problem needs a relaxation because its solution belongs
to the space BV(D) of scalar functions with bounded variations having the
generalized gradient as the bounded Radon's measure [1, 13, 17].
Unfortunately no clear physical interpretation can be provided for the fully
relaxed LAP. Therefore, the original partial relaxation of LAP was proposed
in [6, 7]. This relaxation is based on the special discontinuous finite-element
approximation (FEA) , which was proposed earlier by the author for LAP in
non-linear elasticity [4, 5]. But after relaxation the appropriate finite dimen-
sional problem becomes ill-conditioned and thus needs special preconditioned
numerical methods as, for example, presented by the author in [3].
In this paper the dual LAP in electrostatics is formulated. It has a clear
physical interpretation and after the standard piecewise linear continuous FEA
is transformed into a problem of mathematical programming with linear equal-
ity constraints. This finite dimensional problem is effectively solved by the
standard method of gradient projection, which is easily adapted for parallel
computations.
The numerical results show that the proposed limit analysis method has
a qualitative advantage over the classic technique of estimation of puncture
conditions. This method can be used in Electrical Engineering and Microelec-
tronics.
2 LAP in Electrostatics
Let a dielectrical medium occupy a domain D C ]R3. The polarization and
ionization properties of the dielectrical medium are described by two constitu-
tive relations D = :O(x, E) : D x ]R3 -7 ]R3 and J = j (x, E) : D x ]R3 -7 ]R3, as
shown, for example in [11, 14, 16]. In practice the relations Di = Eij(X, E)E j
and J i = G"ij (x, E)Ej are used, where {Eij} and {G"ij} are the symmetric
second-order tensors of dielectric permittivity and conductivity, respectively.
Here and in what follows over repeated indices the summation rule applies.
For an isotropic medium Eij = EOij and G"ij = G"Oij, where 10 = E(X, lEI) and
G" = G"(x, lEI) are scalar functions and Oij is the Kronecker symbol. For a ho-
mogeneous medium {Eij} ,{ G"ij} = const(x).
Let the following quasi-static electric influences act on the dielectric: a bulk
charge with density (! in D, a surface charge with density g on a portion r 2
178 LA. Brigadnov
II(u) = J
n
<p(x, V'u(x)) dn, A(u) = J {2udn+ J gud"(,
n r2
where V = {u: n -+ lR; u(x) = 0, x E rl} is the set of admissible electro-
static potentials, <P is the specific and II (u) is the full potential energy of the
electric field, A( u) is the work of an external source on a transference of charges
from infinity to D-
In compliance with the Thompson and Joule-Lenz laws the function
<p(x, E) is calculated as
J
I
<p(x, E) = Ei [l\(x,pE) - Ji(x,pE)] dp.
o
In Fig. 1 experimental constitutive relations lEI f--+ IDI (line 1) and lEI f--+
IJI (line 2) for real isotropic dielectrical media are presented [9, 11, 16]. The
appropriate function of effective induction is shown by the line 3. It is easily
seen that for every dielectrical medium the scalar ..\ > 0 always exists such
that for every E E lR 3 and almost every x E n the following estimation is true:
where
..\(x) = max { Ii>(x, E) - J(x, E) I : E E lR3 } .
From the physical point of view ..\ is the electric saturation. In what follows,
we consider a homogeneous dielectrical medium for which ..\ = const(x).
From the estimation (2) it follows [12, 17] that the set of admissible elec-
trostatic potentials is defined as the following subspace
From the mathematical point of view the variational problem (1) can have
no solution because the functionalII(u) -A(u) can be unbounded from below
on the set V. In particular, after the point Eo = IE*I (see Fig. 1) the full
potential energy of the electric field II( u) has growth in Ilulll,l less than linear.
But the work of the electric field on the external charges A(u) always has linear
growth in Ilulll.l. As a result, for an admissible minimizing sequence U m E V
Limit Analysis Method in Electrostatics 179
(4)
From the Definition 1 it follows that for t* < 1 the electrostatic variational
problem (1) has no solution. This phenomenon corresponds to the beginning of
the electric puncture of the dielectrical medium. Therefore, the limit analysis
problem (4) is the main problem for the estimation of puncture conditions for
dielectrics of complex shape in powerful nonhomogeneous electric fields that
closes one of the modern fundamental problems [11, 14, 16].
It was pointed out in [6, 7] that the solution of LAP (4) belongs to the
space of scalar functions with bounded variations BV(D) :J W1,1(D). This
space consists of functions u E Ll(D) having the bounded total variation
where au is the vector Radon's measure denoting the gradient of the function u
J
in the sense of distribution theory [1, 13, 17]. In this case laul = lV'u(x) I dDJ
n n
for every u E W1,1(D). Here and in what follows the point between vectors
denotes the scalar product of these vectors in 1R3.
The Banach space BV(D) is weak* sequentially compact with the norm
IlullBV = IIul11 + J laul, therefore, the mathematically correct and fully relaxed
n
LAP has the following form:
3 Dual LAP
We construct here the dual LAP for a homogeneous dielectric using methods
from duality theory [10]. Thus, we introduce the set of admissible fields of the
effective induction compensating for the electric field of external charges in the
weak sense [12],
Proof. It is easily verified that dual LAP (6) is equivalent to the problem
For every value v > 0 and field D E G the following equality is true:
(8)
which is the convex set with piecewise linear boundaries, i.e. it is a simplex in
the space of global variables ]R3m.
As a result, the dual LAP (6) is approximated by the problem of mathe-
matical programming with linear equality constraints
5 Numerical results
0
1 [ 2 2]1/2
rdrdz,
V = {
II au
u E W ' ((0, l)x(O, 1)) : u(r,O) = 0, o)r,O) = 0, or (O,z) =
au °} ,
where A is the electric saturation from (2) (see Fig. 1), ~ = lla is the geometric
parameter and r E [0,1]' r.p E [0,21T), z E [0,1] are the reduced cylindrical co-
ordinates [4, 6]. Here fl = (0, l)x(O, 1) and r 2 = {r E [0,1]' z = I}. It is easily
verified that puncturing charge Q* = 1Talt •.
From Theorem 2 we have t* = AIT* ,where the parameter T* is the solution
of the appropriate dual LAP (6) on the following set of admissible effective
inductions:
The above analytical and numerical results are new. They are of practi-
cal interest, but more theoretical and experimental research is desirable. For
example, the presented limit analysis method can be used for the design of
electric isolators.
References
1. Ambrosio, L., Fusco, N., Pallera, D. (2000): Functions of Bounded Variations
and Free Discontinuity Problems. Oxford Uni. press, New York
2. Brigadnov, LA. (1996): Existence theorems for boundary value problems of hy-
perelasticity. Sbornik: Mathematics, 187, 1-14
3. Brigadnov, LA. (1996): Numerical methods in non-linear elasticity. In: Desideri,
J.-A., Le Tallec, P., Onate, E., Periaux, J., Stein, E. (eds.) Numerical methods
in engineering. Wiley, Chichester, 158-163
4. Brigadnov, LA. (1998): Discontinuous solutions and their finite element ap-
proximation in non-linear elasticity. In: Van Keer, R., Verhegghe, B., Hogge,
M., Noldus, E. (eds.) Advanced computational methods in engineering. Shaker
Publishing B.V., Maastricht, 141-148
5. Brigadnov, LA. (1999): The limited static load in finite elasticity. In: Dorfmann,
AI, Muhr, A. (eds.) Constitutive models for rubber. A.A.Balkema, Rotterdam,
37-43
6. Brigadnov, LA. (2001): Numerical analysis of dielectrics in powerful electrical
fields. Computer Assisted Mech. Eng. Sci., 8, 227-234
7. Brigadnov, LA. (2002): Variational-difference method for estimation of electrical
durability of dielectrics. Mathematical Modeling. 14(4), 57-66 (in Russian)
8. Ciarlet, Ph.G. (1980): The Finite Element Method for Elliptic Problems, North-
Holland Pub!. Co., Amsterdam
9. Duvaut, G., Lions, J.-L. (1972): Les Inequations en Mecanique et en Physique.
Dunod, Paris
10. Ekeland, 1., Temam, R. (1976): Convex Analysis and Variational Problems.
North-Holland Pub!. Co., Amsterdam.
11. Eringen, A.C., Maugin, G. (1989): Electrodynamics of Continua, VoU and II.
Springer, New York.
12. FuCik, S., Kufner, A. (1980): Nonlinear Differential Equations, Elsevier Sci. Pub!.
Co., Amsterdam-Oxford-New York
Limit Analysis Method in Electrostatics 185
1 Introduction
that using the standard finite element method on triangles with polynomials
of degree p = 1,2,3 we have the a priori error estimate
(1)
(2)
(3)
corners are the points P, Q. The lower edge of the domain coincides with the
axis of symmetry.
~_I_____6______~
Fig. 1. The solution domain fl
Investigating the equation (4) and using the technique of Kondratiev [11],
we showed in [4J that near the corners such as P or Q, the solution 1/J possesses
the expansion
1/J(x, y) = (5)
j s=o
where p, fJ are polar coordinates, w is smooth, and Aj are the poles of multiplic-
ity Pj of the corresponding resolvent R(A). More specifically, for the internal
angle a = ~1T', we proved that the leading term of the expansion for the velocity
components is as follows
Ul(p, fJ) = pO.54448374cpl(fJ) + ... , 1= 1,2, (6)
Similar results have been proved for the Navier-Stokes equations.
cf. [7]. Velocity values are given in corner nodes and midside nodes of the
quadrilateral or the triangle, and pressure values only in corner nodes, in order
to satisfy Babuska-Brezzi condition [3]. Then the velocity components and the
pressure are approximated as continuous functions of spatial variables.
(9)
where k = 2. Taking into account the expansion (6), and following Johnson's
idea [9], we derived in [4] the estimate
1 ~C
U 12Hk+l(T)~ JrT
"" C r T2h - k ),
p2 h -k-1) P dp ,- (10)
rT-hT
h T2k r T2h-k) ~
~
h2k . (11)
This lead us in [4] to an algorithm for generating the mesh near the corner:
Algorithm. Let r1 be the distance of the large element from the corner. For
given auxiliary stepsize h we compute recursively:
for i = 1,2, ... ,N:
5 Model Problem
Consider two-dimensional flow of a viscous, incompressible fluid described by
the Navier-Stokes equations in a domain with corner singularity, cf. Fig. 2.
190 P. Burda et al.
In Figs. 6-7 we present the whole computational mesh and its detail, using
Mesh 3.
I I I I I I I I I I
Fig. 6. The whole computational mesh
- i""/ ""/
::::
- :::::
~
------
-====
'\I
"-
--
V\V\ /\V\
Fig. 7. The mesh - detail
To evaluate the error we use an a posteriori error estimate derived for the
Stokes problem e.g. in [6]. Here {u, p} = (u, v, p) is the vector of the exact
solution, {Uh,Ph} = ('il, 'iJ,p) is the approximate solution computed by the
192 P. Burda et al.
(13)
where I[21 is the area of the whole domain lind I[2zl is the mean area of elements
obtained as I[2zl = ~. Here n denotes the number of all elements in the
domain.
8 Numerical results
9 Conclusions
Pros:
- distribution of the error on elements is quite uniform (esp. for Mesh 2);
- strength of singularity (both for the velocity and the pressure) is well cap-
tured;
- the algorithm of adjusted mesh refinement has been confirmed;
- the efficiency of the algorithm: to achieve the desired precision one needs to
carry out only one computation (compared with an adaptive approach, the
same precision would require approximately 10 successive refinements).
Cons:
- suitable only for sigularities due to "geometry";
- adaptive approach is much more robust.
Nevertheless, efficient refinement of the mesh near the corners still remains a
challenge.
References
1. Ainsworth, M., Oden, J., T. (1997): A posteriori error estimators for the Stokes
and Oseen problems. SIAM J. Numer. Anal., 34, 228 - 245
2. Babuska, 1., Rheinboldt, W., C. (1978): A posteriori error estimates for the finite
element method. Internat. J. Numer. Meth. Engrg., 12, 1597 - 1615
3. Brezzi, F., Fortin, M. (1991): Mixed and Hybrid Finite Element Methods.
Springer, Berlin
4. Burda, P. (1998): On the F.E.M. for the Navier-Stokes equations in domains
with corner singularities. In: Krizek, M., et al. (eds) Finite Element Methods,
Supeconvergence, Post-Processing and A Posteriori Estimates, Marcel Dekker,
New York, 41-52
194 P. Burda et a!.
0.B42
1.680 0.741
12. Ladeveze J., Peyret, R. (1974): Calcul numerique d'une solution avec singularit
des equations de Navier-Stokes: ecoulement dans un canal avec variation brusque
de section, Journal de Mecanique, 13, 367-396
13. Sistek, J. (2003): Solution of the fluid flow in a tube with singularity using
suitable mesh refinement (in Czech), Student Conference STe, eVUT Praha
Acknowledgements
This research has been supported partly by the GACR Grant No. GA 101/02/0391.
and partly by the State Research Project No. J04/98/210000010.
The Edge Stabilization Method for Finite
Elements in CFD
Summary. We give a brief overview of our recent work on the edge stabilization
method for flow problems. The application examples are convection-diffusion, with
small diffusion parameter, and a generalized Stokes model.
1 Introduction
In order for the standard Galerkin finite element method to be stable for
problems in CFD, some care must be taken. For convection-dominated flow
problems, stabilization must be introduced, while for mixed methods the ap-
proximations of velocities and pressure must either be carefully balanced or,
again, stabilized. Examples of stabilization methods are the SUPG lSD-method
[8]' the discontinuous Galerkin method [9]' the residual free bubbles [2], sub-
viscosity models [7], and pressure projection methods for the Stokes problem
[5]. The relation between the different approaches is also well understood in
most cases. However for complex flow problems, most of these methods have
drawbacks. The SUPG stabilization becomes non-symmetric and the formula-
tion does not permit lumped mass; the residual free bubbles and discontinuous
Galerkin method add additional degrees of freedom; the projection methods in-
troduce the need of hierarchical meshes for the projection or the sub-viscosity
model. In this paper we give a brief review of our recent work, [3, 4], on
an alternative method originally proposed by Douglas and Dupont [6]. This
method stabilizes convection-diffusion-reaction problems, as well as equal or-
der interpolation methods for the generalized Stokes problem, by adding a
least-squares term based on the jump in the gradient of the discrete solu-
tion over element boundaries. With this simple concept we obtain stability for
convection-reaction-diffusion problems also in the vanishing viscosity limit as
well as for the generalized Stokes problem with equal order interpolation.
The advantage of this method, in comparison with the others mentioned,
is that no additional degrees of freedom are added, no hierarchical meshes are
needed, the formulation remains symmetric, and the mass can be lumped for
efficient time marching and treatment of stiff source terms. The drawback is
an increased number of non-zero elements in the stiffness matrix due to the
The Edge Stabilization Method for Finite Elements in CFD 197
fact that the gradient jump term couple neighboring elements. The implemen-
tation also requires an element neighbor data structure that is not necessarily
available in standard finite element codes.
2 Model problems
As a first model problem, we consider, in D C lR d , d = 2,3, the problem of
solving
cy u + f3 . 'V u - 'V . (s 'V u) = f in D (1)
with, for simplicity, u = 0 on aD. Here, f is a given source term, f3 is a given
smooth velocity field, satisfying 'V . f3 = 0, and cy and s are bounded positive
functions.
The weak form of this problem is to find u E HJ
(D) such that
where
r LCTuivi+vVui,Vvidx,
d
a(u,v) = b(P,v)=-tpV.VdX,
Jn i=1 "
in in
and
L(v,q) = f· vdx - gqdx.
In the following, we shall denote the L 2 -scalar product by (.,.) and the
corresponding norm by I . II·
(8)
and
3(u, v) := r 'Yh~l [V' . u][V' . v] ds,
LK ~218K (9)
°
stability comes from, exemplified using model problem I. More precisely, we
shall sow that there exists some ( ~ (0 > such that
of M is bounded uniformly in i. Let F K be the set of all test functions 'Pi such
that K C SUPP'Pi and Di = UNi Ki. We will consider a function wE Po(K),
and its representation in the finite element basis w defined by
The operator 7rh : Po(K) -7 Vh, which denotes the lowest order Clement
operator is constructed as follows.
7rhW = '"""'
L... Wi'Pi, where Wi = m(Di) 7t
1 '"""' wIKim(K').
. (12)
In the following we will also write WIKi - WIK = I:~i [w], with [w] denoting
the jump across element boundaries and the sum is taken over the shortest
"path" from element Ki to element K.
It is now straightforward to show that the projection error is controlled by
the operator Js(w, w)
(13)
4 Numerical examples
4.1 Convection-diffusion
We consider the domain (0, 1) x (0, 1), with (J = 0, E = 10- 6 , and f3 = (1-y, x).
° ° °
°
The boundary condition on the inflow boundary is u = at y = and u =
at x = except for 0.3 < y < 0.5, where u = 1. We compare the the numerical
solutions using the present method and the streamline diffusion method in
Figure 1. The results are of comparable quality.
Fig. 1. Streamline diffusion (left) and gradient jump stabilization (right) for
convection-diffusion
To illustrate the absence of boundary layers in the pressure using the gradient
jump method, we consider the Poiseulle flow problem on (0,4) x (0,1) on an
202 E. Burman, P. Hansbo
References
6. Douglas, J., Dupont, T. (1976): Interior penalty procedures for elliptic and
parabolic Galerkin methods, in: R. Glowinski, R., Lions, J. L. (eds) Comput-
ing Methods in Applied Sciences. Springer Berlin
7. Guermond, J.L. (1999): Stabilization of Galerkin approximations of transport
equations by subgrid modeling. M2AN Math. Model. Numer. Anal., 33, 1293-
1316
8. Johnson, C., Navert, U., Pitkaranta, J. (1984): Finite element methods for linear
hyperbolic equations. Comput. Methods Appl. Mech. Engrg., 45, 285-312
9. Johnson, C., Pitkiiranta, J. (1986): An analysis of the discontinuous Galerkin
method for a scalar hyperbolic equation, Math. Comp., 46, 427-444.
Analysis and Computation of Dendritic Growth
in Binary Alloys Using a Phase-Field Model
1 Introduction
Thickness
of the solid-liquid
interface
000 • • • • • • • • • • • • • • • • / Xl
¢=o
Liquid
Fig. 2. Typical profiles of the phase field (left) and concentration field (right). The
phase-field has values zero or one, except in the phase change region. The concentra-
tion field changes rapidly across the phase change region, but may also vary outside
the phase change region.
where B(~) denotes the angle between the vector ~ and a preferential direction,
a is the real-valued function defined by
where el is the unit vector in the horizontal direction. From the physical point
of view, (1) is nothing but a generalization of a mean curvature - or Allen-Cahn
206 E. Burman et al.
- equation, see for instance [9, 21] for a general presentation. This equation
is obtained by taking the derivative of a free energy functional accounting
for phase transformation, double well barriers and an interfacial anisotropic
energy contribution
which is the weak form corresponding to the second term of (1). Due to the
double well barriers, the function S(·,·) in (1) contains small parameters that
force the phase-field ¢ to take values 0 or 1, except in a small region, see again
Fig. 1. Finally, equation (2) corresponds to solute conservation. We refer to
[22, 20] for details about the physical derivation of the model.
2 Existence
(8)
such that
S(c, 0) = S(c, 1) = 0,
D 2(0,¢) = D 2(1,¢) = 0,
they can be extended by zero outside the interval (0,1), and a maximum
principle holds for ¢ and c, that is, if 0 ::::: ¢o, Co ::::: 1, then 0 ::::: ¢(t), c(t) ::::: 1,
for all t.
In order to prove the existence result, the implicit Euler scheme is consid-
ered. Given an integer N, we set T = TIN the time step, t n = nT, n = 0, ... , N,
¢o = ¢o and CO = Co. For n = 1,2, ... , N, ¢n-l and cn - 1 being known, let ¢n
and cn be approximations of ¢(tn) and c(t n ) such that
in fl, (9)
in fl. (10)
where the function f is such that ~~ (c, ¢) = -S(c, ¢). The existence of a unique
minimizer is a consequence of the fact that the Ginzburg-Landau potential EO
1a2(B(~))1~12dx
defined by
E(~) = ~
2 J!
For any 0 < h < 1, let T;" be a conforming triangulation of fl into triangles
K with diameter hK less than h. Let Vh be the usual finite element space of
continuous, piecewise linear functions on the triangles of T;". The finite element
scheme corresponding to (9) (10) is considered. For each n = 1..., N, we are
first looking for ¢f; in Vh such that
for all Wh E Vh .
A priori error estimates in the L2(0, T; H1(f?)) norm have been proved in
[5] in the case when Ii < /\/-1.
The convergence proof strongly relies on the
strong monotonicity on the operator A. More precisely, the following result is
used to prove convergence.
Lemma 1. Let A(·) be the operator defined by (6) and let the convexity con-
dition Ii < /\/-1
hold. Then there exists p, (depending on Ii) such that, for all
¢, 'lj; E H1(f?), we have
Then, assuming that the solution (¢, c) is smooth the following result is
proved in [5].
Theorem 1. Let ¢, c be the weak solution of (1)-(5), let ¢I;" cl;, be defined by
(11)-(12). Assume that
N N
LTII\7(¢h - ¢(tn))llI2(.rt) + LTII\7(Ch - c(tn))ll 12 (.rt)
n=l n=l
3.2 A posteriori error estimates for meshes with high aspect ratio
As in [15, 16], our goal is to perform adaptive finite elements with high aspect
ratio for solving numerically (1)-(5) in the general case when Ii i= 0. The
general framework of [10, 11] is considered.
For any triangle K of the mesh, let TK : k -+ K be the affine transforma-
tion which maps the reference triangle k onto K. Let MK be the Jacobian of
TK that is
x = TK(x) = MKx + tK.
Since MK is invertible, it admits a singular value decomposition MK =
RlilKPK, where RK and PK are orthogonal and where ilK is diagonal with
positive entries. In the following we set
and RK = (~~K),
2,K
with the choice .A1,K ;::: .A2,K. A simple example of such a transformation is
Xl = HX1, X2 = hX2' with H ;::: h, thus
Analysis and Computation of Dendritic Growth in Binary Alloys 209
see Fig. 3.
In the framework of meshes with high aspect ratio, the classical minimum
angle condition must be avoided. However, it is required that, for each vertex,
the number of neighbouring vertices is bounded above, uniformly with respect
to the mesh size h. Also, for any patch LlK (the set of triangles having a vertex
common with K), the diameter of the corresponding reference patch Ll j{, that
is Llj{ = Tj(I(LlK)' must be uniformly bounded independently of h. This latter
hypothesis excludes some distorted patches, see Fig. 4. Let h : HI (fl) -+ Vh
be a Clement or Scott-Zhang like interpolation operator. We now recall some
interpolation results due to [10, 11].
1 H
1 h
1 H
Fig. 4. Example of an acceptable patch (top): the size of the reference patch Ll R does
not depend on the aspect ratio H/h. Example of a nonacceptable patch (bottom):
the size of the reference patch Ll R now depends on the aspect ratio H / h.
T
fa (111) - (hlli2(st) + Ilc - chlli2(st))
Let us comment this assumption in the frame of meshes with small aspect ratio,
that is to say when Al,K and A2,K are of order h, for all K E T",. According to
the results of section 3.1, the error in the L2(0, T; Hl(Q)) norm is shown to be
O(h) whenever the time step is small. On the other hand, O(h2) convergence
in the LOO(O, T; L2(Q)) norm is proved for in [13]' but only in the case when
a = 0. Therefore, we expect the following assumption
Analysis and Computation of Dendritic Growth in Binary Alloys 211
For each interior edge of Th , let us choose an arbitrary normal direction il,
let [~] denote the jump of ~ across the edge. For each edge of Th lying on the
boundary aD, we set [~] to twice the inner side value of ~. The following result
is proved in [6].
Theorem 2. Let <p, c be the weak solution of (1)-(5), let <Ph, ch be the
semi-discrete approximation corresponding to (11) (12). Assume that <p, c E
LOO(O, T; H2(D)) and that (15) holds. Then, there is a constant C depending
only on the interpolation constants of Lemma 2 such that, for all mesh Th such
that maXKETh ).,2,K is sufficiently small, we have
+--k II
2).,2,K
[Dl(<Ph) ~Chn +D 2(Ch, <Ph) ath]
n
I
L2(8K)
) x WK(C - Ch)' (16)
Estimate (16) is not a usual a posteriori error estimate since ¢> and C are still
involved in the right hand side. We then proceed as in [15, 16] and introduce
an estimator based on superconvergent recovery, namely a Zienkiewicz-Zhu
(Z-Z) like estimator [23, 3, 24]. More precisely, we consider the simplest Z-Z
error estimator as defined in [19, 1]. The Z-Z error estimator corresponding
to \7(c - Ch) is defined by the difference between \7ch and an approximate L2
projection of \7 Ch onto V,;,
namely :
(17)
1 rh((Ihg)vh) = 1 gVh
where rh denotes the usual Lagrange interpolant. In other words, from constant
values of \7ch on triangles, we build values at vertices P using the formula
From [2,19] we know that for a certain class of meshes (namely parallel meshes)
and for smooth solutions, Z-Z like error estimators are asymptotically exact
(i.e. the Z-Z error estimator converges to the true error when h goes to zero).
Our error indicator corresponding to C - Ch is then obtained by replacing the
matrix GK(C-Ch) present in the definition of WK(C-Ch) in (16) by the matrix
GK (Ch) defined by
(18)
In this section, the quality of our error indicator (19) is investigated numeri-
cally. For details we refer to [6]. Let us consider ¢r;; and cr;; E Vh the solutions
of (11) and (12), respectively. In practice, ¢r;; is obtained by performing only
one Newton iteration at each time step. Proceeding as in [16] we introduce ChT
the continuous, piecewise linear approximation in time defined by
t - t n- 1 tn - t
Chr(X, t) = cr;;(x) + _-C~-l(X) x E n. (20)
T T
and the simplified error indicator for each time interval [t n- 1, tn] and triangle
Kby
( 7]n,K(Chr) ) 2 = ltn
tn-I
1 -2--
-1
2,\ /
2,K
II [oChT] II
an £2(8K)
(21)
1
S(C'¢)=-V¢(1-¢)(1-2¢)+ ,\r¢ (1-¢)
5ml 2 2 (c
1-¢+k¢ -Cl
)
if 0 .:; ¢ .:; 1,
whereas S( c, ¢) = 0 if ¢ < 0 or ¢ > 1. Here ,\ is the thickness of the solid-liquid
interface, ml is the liquid slope in the phase diagram, r is the Gibbs-Thomson
isotropic coefficient, q is the liquid concentration in the phase diagram, and
k is the phase diagram partition coefficient (thus Cs = kcl, where Cs is the
solid concentration in the phase diagram). The coefficient a in (1) equals ~[" /-'k
where /-lk is the interface kinetic coefficient. The first term in the definition of
S(·,·) is nothing but the derivative of a double well potential that forces ¢ to
values zero or one, except in the phase change region. An asymptotic expansion
of the phase-field equation at first order in ,\ links the normal velocity of the
solid-liquid interface, some anisotropic measure of the interface curvature, and
the concentration field. The function D1 in (2) is given by
if 0 .:; ¢ .:; 1,
whereas D 2 (c, ¢) = 0 if C < 0 or c > 1. All the physical parameters are given
below in the international MKSA unit system.
Our first goal is to validate numerically assumption (15) in the context
of meshes with high aspect ratio. For this purpose, we set the computational
domain to D = [-0.0002,0.0002]2, we add source terms in (1) (2) so that ¢
and c are given by
1 _ tanh ( Xl ~ vt)
¢(Xl,X2,t)=C(Xl,X2,t) = 2 '
Table 1. Test case with exact solution: parameters used for the computations.
A ml r Cs Cl Ds Dl ILk
We now present the adaptive algorithm of [6]. Given a time step T, the goal
is to build triangulations Thn, n = 1, ... , N having high aspect ratio such that
the relative estimated error in the L2(0, T; Hl(D)) norm is close to a preset
tolerance TOL. Our adaptive algorithm aims at building triangulations r"n,
n = 1, ... , N such that
Analysis and Computation of Dendritic Growth in Binary Alloys 215
N 2
L L (1]n,K(Chr))
0.75 TaL:::; n=lK;!h :::; 1.25 TaL, (22)
fa In IV hrlC 2
for all triangle K E T"n, where Nvhn is the number of vertices of the mesh T"n.
We then proceed as in [15, 16] to build such an anisotropic mesh, using the
BL2D mesh generator [4].
We now consider the following physical situation. At initial time, the computa-
tional domain is liquid, with homogeneous concentration 0.02. Then, a circular
216 E. Burman et al.
solid seed of diameter 2.5 10- 6 and concentration 0.015 is placed at the center
of D. The physical parameters are now given in Tab. 3 and are taken from
[12], table 1, column B, except Cs and cz.
We now choose the anisotropy parameter a > ,}-1 c:::: 0.0667, namely
a = 0.1. In this case there are no known existence results for the system
in L2(0, T; H 1 (D)).
In Fig. 7, the concentration fields corresponding to an adaptive computa-
tion with tolerance TOL = 0.0625 are reported. A zoom of the results at final
time, Fig. 8, shows that the gradient is discontinuous in some regions. This
phenomenon is explained in details in [6].
1
D.'
D.'
D.7
D.6
D.5
0.'
0.3
D.2
D.1
D
Fig. 5. Computations with small anisotropy, a = 0.04. Adapted meshes (left col-
umn), concentration isovalues (middle column) and phase isovalues (right column),
from t = 0 to t = 1 s, with TOL = 0.0625 (6.25% estimated relative error). Row 1:
t = 0.05 s, 6874 vertices. Row 2: t = 0.5 s, 17170 vertices. Row 3: t = 1 s, 24441
vertices.
Fig. 6. Computations with small anisotropy, a= 0.04. Zooms of the mesh at final
time.
Fig. 8. Computations with large anisotropy, a= 0.1. Zooms of the mesh at final
time.
References
1. M. Ainsworth and J. T. Oden. A posteriori error estimation in finite element
analysis. Comput. Methods Appl. Mech. Engrg., 142(1-2):1-88, 1997.
2. M. Ainsworth and J.T. Oden. A unified approach to a posteriori error estimation
using finite element residual methods. Numer. Math., 65:23-50, 1993.
3. M. Ainsworth, J. Z. Zhu, A. W. Craig, and O. C. Zienkiewicz. Analysis of
the Zienkiewicz-Zhu a posteriori error estimator in the finite element method.
Internat. J. Numer. Methods Engrg., 28(9):2161-2174, 1989.
4. H. Borouchaki and P. Laug. The b12d mesh generator: Beginner's guide, user's
and programmer's manual. Technical Report RT-0194, Institut National de
Recherche en Informatique et Automatique (INRIA), Rocquencourt, 78153 Le
Chesnay, France, 1996.
5. E. Burman, D. Kessler, and J. Rappaz. Convergence of the finite element
method for an anisotropic phase-field model. Technical report, Departement
de Mathematiques, Ecole Poly technique Federale de Lausanne, 1015 Lausanne,
Switzerland, 2002.
6. E. Burman and M. Picasso. Anisotropic, adaptive finite elements for the com-
putation of a solutal dendrite. Interfaces Free Bound., 5(2): 103-127, 2003.
7. E. Burman, M. Picasso, and A. Jacot. Adaptive finite elements with high aspect
ratio for the computation of coalescence using a phase-field model. J. Comput.
Phys., accepted, 2003.
8. E. Burman and J. Rappaz. Existence of solutions to an anisotropic phase-field
model. Math. Methods Appl. Sci., 26(13):1137-1160, 2003.
9. C. M. Elliott. Approximation of curvature dependent interface motion. In The
state of the art in numerical analysis (York, 1996), pages 407-440. Oxford Univ.
Press, New York, 1997.
10. L. Formaggia and S. Perotto. New anisotropic a priori error estimates. Numer.
Math., 89:641-667, 2001.
11. L. Formaggia and S. Perotto. Anisotropic error estimates for elliptic problems.
Numer. Math., 94(1):67-92, 2003.
12. A. Jacot and M. Rappaz. A pseudo front tracking technique for the modelling of
solidification microstrucures in multicomponent alloys. Acta Materialia, 50:1909-
1926,2002.
13. D. Kessler and J.-F. Scheid. A priori error estimates for a phase-field model for
the solidification process of a binary alloy. IMA J. Numer. Anal., 22:281-305,
2002.
14. O. Kruger, M. Picasso, and J.-F. Scheid. A posteriori error estimates and adap-
tive finite elements for a nonlinear parabolic problem arising from solidification.
Comput. Methods Appl. Mech. Engrg., 192:535-558, 2001.
15. M. Picasso. Numerical study of the effectivity index for an anisotropic error
indicator based on zienkiewicz-zhu error estimator. Comm. Numer. Methods
Engnrg., 19:13-23, 2002.
16. M. Picasso. An anisotropic error indicator based on zienkiewicz-zhu error es-
timator : application to elliptic and parabolic problems. SIAM J. Sci. Comp.,
24:1328-1355, 2003.
17. J. Rappaz and J.-F. Scheid. Existence of solutions to a phase-field model for
the isothermal solidification process of a binary alloy. Math. Meth. Appl. Sci.,
23:491-513, 2000.
220 E. Burman et al.
18. M. Rappaz, A. Jacot, and W.J. Boettinger. Last stage solidification of alloys
: a theoretical study of dendrite arm and grain coalescence. Met. Trans. A,
34:467-479, 2003.
19. R. Rodriguez. Some remarks on Zienkiewicz-Zhu estimator. Numer. Methods
Partial Differential Equations, 10(5):625-635, 1994.
20. J. Tiaden, B. Nestler, H. J. Diepers, and 1. Steinbach. The multiphase-field model
with an integrated concept for modelling solute diffusion. Physica D: Nonlinear
Phenomena, 115(1-2):73-86, 1998.
21. A. Visintin. Models of phase transitions. Birkhiiuser Boston Inc., Boston, MA,
1996.
22. J. A. Warren and W. J. Boettinger. Prediction of dendritic growth and microseg-
regation patterns in a binary alloy using the phase-field model. Acta metall.
mater., 43(2):689-703, 1995.
23. O. Zienkiewicz and J. Zhu. A simple error estimator and adaptive procedure for
practical engineering analysis. Internat. J. Numer. Methods Engrg., 24(2):337-
357, 1987.
24. O. C. Zienkiewicz and J. Z. Zhu. The superconvergent patch recovery and a pos-
teriori error estimates. I. The recovery technique. Internat. J. Numer. Methods
Engrg., 33(7):1331-1364, 1992.
Discontinuous Galer kin Methods for
Timoshenko Beams
1 Introduction
where x E n = (0, L). Here, the unknowns are the transverse displacement
w, the rotation of the transverse cross-section of the beam e, the bending
moment M, and the shear force T. The material and geometrical properties of
the beam are characterized by the shear modulus G, the cross-section area A,
the Young modulus E, and the moment of inertia I. The transverse load, q, is
part of data of the problem. To complete the model and ensure the existence
and uniqueness of its solution, we must impose suitable boundary conditions;
we take, for example,
gJ: := {Xl, X2, ... , XN-I}, then we take [<pn](e) = <p(e+)nt + <p(e-)n;, where
<p(e±) := limdo <pee - En;) and n; = =flo For the boundary nodes, we take
[<pn](O) = -<p(O+), [<pn](L) = <p(L-). These jumps are well defined for <p in
HI([h), where [h = Uj=I, ... ,NJj .
It is now easy to see that if we assume that (T, M, e, w) E [HI(D)]4, we
have
d I I I
-(W'dxvl) + (w,[vln]) =(e,v )-(CAT,v), (3a)
d 1
- (e, dx v)
2
+ (e, [v 2 n]) = (EJM,v 2
), (3b)
d
- (M, dx v ) + (M, [v n]) = (T, v ),
3 3 3
(3c)
d
- (T, dx v )
4
+ (T, [v 4 n]) = (q, v 4 ). (3d)
for all v l ,v2 ,v3 ,v4 E HI(D h ). This is the weak formulation we use to define
the DC methods.
3 The DG Methods
The approximate solution (Th, Mh, eh, Wh) given by the DC method will be
sought in the finite dimensional space V/:' x V/:2 x V/:3 X V/:4; here,
where pk(K) is the set of all polynomials on K of degree not exceeding k. The
approximate solution is determined by requiring that
hold for all Vi EV/:i for i = 1,2,3,4. To complete the definition of the method,
we have to define the numerical traces (Th' M h , Bh, Wh) at the nodes. It is
through them that the interaction between the degrees of freedom of differ-
ent intervals is introduced and the boundary conditions are actually imposed.
Moreover, their choice is crucial as it affects both the stability and the accu-
racy of the method; see [3] for a detailed discussion of this issue for some other
problems.
224 F. Celiker et al.
Extending to our framework what have been already done for fluid flow
problems, we assume that the form of these traces is as follows. For an interior
node e E rffl:, we take
and at x = L,
wh(L) = WL, (7a)
Bh(L) = 8h(L-) + C21 (L)(Wh(L-) - WL) + C23 (L)(Mh(L-) - M L), (7b)
Mh(L) = M L , (7c)
Th(L) = Th(L-) + C41 (L)(Wh(L-) - WL) + C43 (L)(Mh(L-) - ML). (7d)
Note how the boundary conditions are incorporated into the DG method
through the definition of the numerical traces at the border. Note also that
the parameters C ij defining the numerical traces can have different values at
different nodes. In the next section, we investigate the role of these parame-
ters. In particular, we show that out of these sixteen parameters, six can be
expressed in terms of the remaining ten and that only four of them have an
impact on the "energy" of the discretization.
1 1
(EI M,M) + (CAT,T) = (q,w) + bc(T,M,8,w),
where
DC methods for Timoshenko Beams 225
Since this identity captures an essential feature of the problem under consid-
eration, we would like to obtain a similar energy identity for the DG method.
Such an identity is obtained in the following result.
C21 = C43 , -C22 = C33 , C24 = C13 , C31 = C42 , C34 = C12 , -Cll = C44 .
(8)
Then, we have
(10)
From the first case, we see that the stabilization parameters associated
to Bh and Wh, namely, C 32 and C 41 , respectively, have a stronger influence
on the existence and uniqueness of the solution of the method than the ones
associated to Th and M h , namely, C l4 and C 23 , respectively.
From the second and third cases, we see that when the stabilization effect
of a jump is turned off (by setting the corresponding stabilization parameter
equal to zero), the existence and uniqueness of the approximate solution can
still be guaranteed by a suitable definition of the other parameters and/or
by modifying the polynomial degree of the approximate solutions. Roughly
speaking, the more stabilization parameters are equal to zero, the more the
spaces for Bh and Wh have to be in relation to the spaces of Mh and Th,
respectively.
Proof. Due to the linearity of the problem, it is enough to show that the only
solution to (4) with q = 0, Wo = WL = Mo = ML = 0 is Wh = Bh = Mh =
Th = O. In this case, (9) takes the form
(11)
interval since Wh E and kl ?: k4. Finally, since G41 (L) > 0, by the discrete
energy identity (11), we have that [Wh n](L) = 0 and so wh(L-) = o. This
implies that Wh = o.
Case 3. Finally, consider the third case. Since k2 ?: 1, taking v 2 = x/ L
in (4b), we get that Bh(L) = (Bh' 1) = o. Then, taking v 2 = 1 on (Xj,Xj+1)
and v 2 = 0 for the rest of the domain, the equation (4b) yields Bh(xj) =
Bh(xj+r). This implies that Bh = 0 on all the nodes. By (4b), this implies that
(Bh' d~ v 2) = 0 for all v 2 E V/:2 , and since Bh E V/:3 and k2 ?: k3 + 1, this
implies that Bh = o. A similar argument shows that Wh = 0 if kl ?: k4 + 1.
This completes the proof. D
6 Numerical Results
Fig. 2. Piecewise constant (top) and linear (bottom) approximations for test prob-
lem 1
DG methods for Timoshenko Beams 229
Fig. 3. Piecewise constant (top) and linear (bottom) approximations for test prob-
lem 2
230 F. Celiker et al.
degree mesh Ile T II L2(D) Ile M II L2(D) Ilee IIL2(D) Ile w II L2(D)
k number error order error order error order error order
2 9.7e-2 0.89 1.1e-1 1.04 5.7e-1 1.14 4.7e-1 1.32
0 3 5.1e-2 0.94 5.6e-2 1.04 2.7e-1 1.09 2.1e-1 1.19
4 2.6e-2 0.98 2.7e-2 1.03 1.3e-1 1.05 9.6e-2 1.10
2 1.3e-2 1.25 2.7e-3 1.94 1.1e-2 1.99 4.2e-3 2.04
1 3 3.4e-3 1.89 6.7e-4 1.99 2.8e-3 2.00 1.1e-3 1.98
4 8.6e-4 1.98 1. 7e-4 1.99 6.ge-4 2.00 2.7e-4 1.98
2 1.8e-3 3.00 2.6e-4 2.61 2.8e-4 2.92 2.1e-4 2.95
2 3 2.2e-4 3.00 3.4e-5 2.93 3.5e-5 2.97 2.7e-5 2.98
4 2.8e-5 3.00 4.3e-6 2.99 4.5e-6 2.99 3.4e-6 2.99
2 1.8e-5 4.01 2.7e-5 3.99 1.4e-5 3.96 4.5e-6 4.06
3 3 1.1e-6 4.01 1.7e-6 4.00 8.6e-7 3.99 2.8e-7 4.02
4 7.1e-8 4.00 1.1e-7 4.00 5.4e-8 4.00 1.7e-8 4.01
2 6.ge-7 4.94 2.3e-7 5.05 5.3e-7 4.96 1. 7e-7 4.89
4 3 2.2e-8 4.98 7.1e-9 5.02 1. 7e-8 4.99 5.3e-9 4.97
4 6.ge-10 4.99 2.2e-1O 5.01 5.3e-10 5.00 1. 7e-10 4.99
2 2.3e-8 6.01 7.1e-9 5.91 1.7e-8 5.97 5.7e-9 6.04
5 3 3.6e-10 6.00 1.1e-1O 5.98 2.7e-10 5.99 8.ge-ll 6.01
4 5.6e-12 6.00 1.8e-12 5.99 4.3e-12 6.00 1.4e-12 6.01
7 Conclusion
References
1. S.P.Timoshenko (1921) On the correction for shear of the differential equation for
transverse vibrations of prismatic bars, Philosophical Magazine,41, 744-746.
2. S.P.Timoshenko (1922) On the transverse vibrations of bars of uniform cross sec-
tion, Philosophical Magazine,43,125-131.
3. B. Cockburn(2003) Discontinuous Galerkin Methods, ZAMM Z. Angew. Math.
Mech.,83,731-754.
4. D.N.Arnold(1981) Discretization by Finite Elements of a model Parameter De-
pendent Problem, Numer.Math.37, 405-421, (1981).
DC methods for Timoshenko Beams 231
5. Malkus, D.S., Hughes, T.J.R (1978) Mixed finite element methods-reduced inte-
gration and selective integration techniques: a unification of concepts. Comput.
Methods Appl. Mech. Engrg. 15, 63-81
6. Likang Li(1990) Discretization of the Timoshenko Beam Problem by the p and the
h - p Versions of the Finite Element Method, Numer.Math.57, 413-420.
7. D. N. Arnold, F.Brezzi and D. Marini A family of discontinuous Galerkin finite
elements for the Reissner-Mindlin plate, submitted to Journal of Scientific Com-
puting.
8. D.N.Arnold, R.Falk (1990) The Boundary Layer for the Reissner-Mindlin Plate
Model, SIAM J. Math. Anal., Vol 21, No.2, pp. 281-312.
9. D.N.Arnold, R.Falk (1989) A Uniformly Accurate Finite Element Method for the
Reissner-Mindlin Plate, SIAM J. Numer. Anal., Vol 26, No.6, pp. 1276-1290.
10. D.N.Arnold, F.Brezzi (1997) Locking-free Finite Element Methods for Shells
Math.Comp., Vol.66, Number 217, pp. 1-14.
Numerical Algorithms for Solving
Elliptic-Parabolic Problems
Raimondas Ciegis
Summary. This paper deals with numerical algorithms for solving elliptic-parabolic
problems. An example of such problem is given by the Richards equation for modeling
the saturated-unsaturated water flow in porous media. We consider a linear model
problem and investigate the convergence of two finite-volume schemes. The first
one uses the implicit approximation in the whole domain, and the second scheme is
constructed using the splitting method. Results of numerical experiments are also
given.
1 Introduction
OSI
clPI7ft = V'. (AI(SI)KIV'(PI(SI) - xI)),
PI(X, t) = qQ, (x, t) E Of?D X [0, T],
V'(PI - Xl) = 0, (x, t) E of?\of?D X [0, T],
SI(X,O) = Sinit(X),
where Sl is the water saturation in the l-th layer, PI is the pressure, AI(Sz)KI ,
Cl, PI are the permeability, porosity of the porous media, and density of the
°
fluid, respectively.
The equation becomes elliptic in the region of saturation, where PI > and
Sl = sf:
V'. (AI(sl)KIV'(PI - xI)) = 0,
here the pressure PI is the primary unknown, and Sf denotes the water content
of a water-saturated medium.
The actual determination of the discrete solution may require large com-
putational resources due to the strong nonlinearity of the problem, the domi-
nance of the convective process, discontinuity of the porous medium properties.
The formulated problem looks like a system of parabolic partial differential
Numerical Algorithms for Solving Elliptic-Parabolic Problems 233
equations, but its type can become either nonlinear hyperbolic or degenerate
parabolic, depending on the influence of capillary pressure (see, e.g. Helmig
[Hel97]' Eymard et al. [EGHOOJ).
An additional difficulty arises due to degeneracy of the parabolic problem
in the unsaturated region to the elliptic problem in the saturation region of
the porous medium. Fully implicit discrete finite volume and finite difference
schemes are usually used for the approximation of the flow equations in the
whole region (see the papers of Alt and Luckhaus [ALu83j, Chen and Ewing
[ChE97]' Eymard et al. [EGHOO]' Jager and Kacur [JaK91]).
It is well-known that most effective numerical algorithms for solving multi-
dimensional parabolic problems are based on splitting methods. New time
splitting schemes for the time integration of the problems describing the two-
phase flow in the porous medium are proposed by Ciegis et al. [CPZOOj. The
parallel version of this algorithm is considered in [CCZ99j.
In Section 2 we formulate the model linear 3D elliptic-parabolic initial-
boundary value problem and define its approximation in space by the finite-
volume method. In Section 3 we formulate and investigate the fully implicit
scheme for integration in time. The additive integration scheme, which is based
on the Douglas algorithm, is presented and investigated in Section 4. Finally,
the results of numerical experiments are presented in Section 5.
2 Problem Formulation
au =
at
t~
j=l aXj
(k j aXj
au ) - q(t)u + f(X, t), (X, t) E Qparo
-t aa.
)=1 x)
(k j aaU)
x)
+ q(t)u = f(x, t), (X, t) E Qell, (1)
We assume that:
We use the notation un = U(X ijk , tn). Using the finite-difference method we
approximate a part of the differential operator (1) by the following discrete
operators
un = un (x j + h) - un
Xj h '
The selection of a time~stepping procedure is a non trivial task, since the
stability and robustness of the algorithm on the one hand must be balances
with the computational efficiency on the other hand. In the following section
we investigate two different numerical integration schemes.
(2)
or
(3)
We note that even if the problem is parabolic in the whole region of its
definition (i.e., the flow in the porous media is unsaturated) the 3D parabolic
problem is approximated by the backward Euler scheme.
The system of linear equations (3) is solved by some iterative method,
e.g. the Conjugate Gradient method. In the case of the nonlinear two~phase
flow problem the matrix of the obtained system is non~symmetric, thus some
special methods such as G M RE S should be used to solve a system of linearized
equations.
The convergence rate of iterative methods depends essentially on the stiff-
ness of the matrix A and on the distribution of its eigenvalues. It is easy to
prove that the following spectral estimates are valid:
Numerical Algorithms for Solving Elliptic-Parabolic Problems 235
_
( m_ea---,s(:.. .:. Q!.. . .pa--'.).r \ .) I
+ AA,mm ::;
A
::;
(meas (Q par) \
+ AA,max )
I,
T T
here AA,min, AA,max are the smallest and largest eigenvalues of the matrix A,
respectively, and meas(Q) denotes the volume of Q.
Now we can investigate the stiffness number of the matrix A:
Taking into account that AA,min = 0(1) and AA,max = 0(h- 2 ), and assuming
that meas(Qpar) > 0, we obtain the following asymptotic estimates:
c
if T = O(Vh) ,
meas( Qpar) h3/ 2
C
I\;(A) = if T = O(h), (4)
meas(Qpar) h
c
if T = 0(h2) .
meas(Qpar)
Thus for sufficiently large time steps T the number of iterations of the CG
method will be approximately the same as for solving the pure elliptic problem.
(5)
T
un+! _ U n +2 / 3
s s = A3 U;'+! - A3 U;'.
T
Here s denotes the elliptic iteration number, and the initial condition at time
level t n is recalculated after each iteration as
un if X E fh pan
un - { '
s+l - n+1 .
Us If X E [h,ell ,
In this section we will investigate the stability of the proposed iterative algo-
rithm (5). Let AI, A2, A3 be eigenvalues of the discrete operators AI, A 2, A 3,
respectively:
Aj < 0, j = 1,2,3.
Let us denote the error of the iterative solution by
Z~+l = U~+l _ U n +1,
zn _ {O, if x E [h,pan
8 - Ur; - U n +1, if x E [h,ell.
Theorem 1. The stability-correction scheme is unconditionally stable for the
three dimensional linear elliptic-parabolic problem and the following conver-
gence estimate is valid:
Zn
8
_
-
{O,zn+1 E
if x f?h,par,
'f E Jth,ell·
n
8-1' 1 X
5 Numerical Experiments
II ~
L A-U
J
n +1
S
11
[h,e!!
::; c.
j=l
N= 20 N=40 N=80
T
Q;ll Q~ll Q;ll Q~ll Q!ll Q~ll
0.1 21.8 7.8 48.8 20.8 102.8 53.8
0.05 19.8 7.0 44.1 18.9 96.1 49.9
0.025 17.6 6.15 38.3 16.9 83.5 43.7
0.0125 14.95 5.02 32.32 14.65 68.45 37.0
6 Conclusions
In this paper we have discussed two numerical algorithms for solving a three-
dimensional elliptic-parabolic problem. The main difference between these
238 R. Ciegis
T N = 20 TO N = 40 TO
methods is that in the first one the differential problem is treated as an elliptic
problem in the whole region of the definition and it is integrated in time by
the backward Euler scheme, whereas the second method treats the problem as
parabolic and the integration is done by the splitting-type method. The ad-
vantage of the latter approach is that the linear algebra algorithm is reduced
to simple one-dimensional subproblems. The advantage of the first method is
that the fully implicit approximation leads to a very robust algorithm.
References
°
solutions is enormous. In fact it is dense in the sense of Baire category, [8].
Namely, for any E > there exists u, E W1,oo(0, 1) such that
def r1 1 1
I(u) = Jo "4 lu (x)1 -1
I 2 12 + "2lu(x)
1 2
- f(x)1 dx. (3)
Our conclusion is motivated by the observation that we can achieve very good
numerical results by building minimizing sequences which become asymp-
totically (weak) white noise in their derivatives, c.f. [5], [7]. Note that such
Stochastic Relaxation of Variational Integrals with Non-attainable Infima 241
We consider two models: one with a fixed potential and one with a time varying
potential. Our objective is to find appropriate local minima of the potential
where
I if x E A,
where v = (Vl, ... , v n ), cPA(X) = { 0 if x <t- A, is the characteristic function
of A and A = [(i - 1)/n, i/n). For a given vector v of values of u' on the
intervals Ai, we will recover u via the formula
i(v) = I(Hu).
242 D.D. Cox et at.
o:(t) (15)
Stochastic Relaxation of Variational Integrals with Non-attainable Infima 243
where tl is chosen large enough that v(t) has converged to a steady state
solution of the Langevin equation similar to (13) but containing only Wmacro.
Then, a(t) is increased linearly to its maximum value of 1
The Probability Density Function (PDF) 9 : lR+ X lR dim V f-+ lR can be used
to obtain any statistical information contained in the microscale Langevin
systems for v at a fixed time point t. Namely,
og(t,v) . (}2
at = -dlVV [DWdensity(V)g(t, v)] + 26Vg(t, v), v E lR\ t > 0, (19)
244 D.D. Cox et al.
where (}2 is the white noise standard deviation. Hence, for our specific form of
the energy density given by (7), the Fokker-Planck Equation (19) becomes
ag~~ v) = (1 1
j(x)j3(x) dX) .V g(t, v) - (1 j3(x)
1
Q9 j3(x) dX) v . V g(t, v)
\, v J
{Bi] }~j=l
where
Ij n - n-i+1/2
n2
= i-1/2
n 2 , x E ('j
~ n,
1) .
where {v7H21 represent a1l2n states with v7j E {±1}. Note that
1. t --+ +00 corresponds to finding the equilibrium distribution of the states,
which is given by the Gibbs distribution [9],
2. A --+ +00 represents imposing the ±1 constraint,
3. (}2 --+ 0+ corresponds to cooling,
4. n --+ 00 gives the continuum case.
The Laplace transform of g( v) has the form
2n
(Lg)(y) = L qje yTv ;, (22)
j=l
Stochastic Relaxation of Variational Integrals with Non-attainable Infima 245
where v; denotes one of the 2n possible distributions of ±1, having the prob-
ability qj. Our goal now is to compute the coefficients qj in (22). In principle,
we have two options:
1. Take the Laplace transform of the Fokker-Planck equation (20) with the
aim to obtain a dense linear 2n x 2n-system for the unknown coordinates
qj. The system can be obtained by inserting 2n various vectors y into
(20) together with the representation (22). One may then use a reduction
technique, [2], [1], to obtain a sparse system and solve it.
2. Enumerate all the 2n states v;,
determine their probabilities qj, and pick
the state with highest probability as the most likely one to which the
system will relax.
Here, we show only the second approach which is based on the following con-
jecture.
Conjecture 3.1
q
= ~
2
(1 _f (j In) - lin
f ((j - 1) In)) .
This conjecture has the important application that it can be used to compute
directly the volume fraction. We believe that it is probably not exact, but in
fact provides an excellent approximation. It is in part based on our previous
work [5] where we found that the computed approximate solutions of (1.1) had
a white noise property. Since
clef '"
aj = L qi, (25)
v:j=-l
for this simulation is f(x) = x(l - x) with Jo1 f(x) dx = 1/16. The volume
fraction is computed by averaging over the replications. Namely,
(27)
Here, m is the number of replications and Vi is the vector of the derivatives for
the i-th replication. The index j refers to the interval A j , i.e., [(j -l)/n,j/nj,
where n is the number of intervals in (0,1). The result shown in this section
is based on twenty independent simulations for the mesh with h = 1/200.
To deduce the macroscopic shape, we average over all twenty replications, c.f.
Figure 1.
Variable Value
Sim .. I ' I~ d ~ Finiu. EI ~ ment Appro.imatiom EX'(ISI\. p ", .nd Av "'r.g",dM. rco,cClpicSh . ~
Fig. 1. Macroscopic approximations. The left picture shows all twenty replications
(each replication has different color) and the averaged state (thick smoother line).
The right picture shows the difference between the target function f = x(l- x) and
its computed shape. The spatial resolution is h = 1/200.
Stochastic Relaxation of Variational Integrals with Non-attainable Infima 247
-{I,20 0,1 0.2 0.3 0.4 0.5 0.6 0.7 O.S O,g 1
Fig. 2. Volume fractions. The volume fraction on the left is computed with ()' = 0.02.
The volume fraction on the right is computed with the data given by Table 1 but
with much smaller deviation. Namely, with ()' = 0.005. This shows that particles need
a higher values of the deviation to discover states with lower energy.
We use in this section the formulae and convergence results to investigate the
approximation properties based on the formulae (23) and (25). We chose the
spatial resolution to be h = 1/16. With 16 elements on the (0,1) segment
we have 2 16 = 65536 coefficients in the formula (23) and each state vector
has 16 components. We evaluate the values of qi and we select the state for
which qi is the biggest. In other words we select the state with the highest
probability to exhibit the microscopic structure of the solution. The volume
fraction is computed using the formula (25). It seems that the discrete solution
corresponding to the maximum probability state minimizes the L2-distance
to the target function. We chose f(x) = 113 sin(13x) for the calculations in this
section. The results are plotted in Figure 3.
6 Acknowledgment
Dennis Cox was supported in part by the grant NSF DMS-0204723. Petr
Kloucek was supported in part by the grant NSF DMS-0107539, by the grant
from TRW Foundation and by the grant NASA SECTP. The work of Daniel
Reynolds was performed in part under the auspices of the U.S. Department of
248 D.D. Cox et al.
Volume Fraction
Coefficient s of the Probability Density Function
0.025
0.02
0.015
0.01
0.005
Fig. 3. The picture on the left shows the distributions of the coefficients of the
Probability Density Function given by (23). It is clear that the more the target
function deviates from a constant the more isolated is the maximum among the rest
of the coordinates. The right picture shows the reconstruction of the volume fraction
using the formula (25). The darker line corresponds to the computed volume fraction,
the lighter line correspond to the P1-projection of the exact function representing
the volume fraction.
References
1. A.C. Antoulas and D.C. Sorensen. Approximation of large-scale dynamical sys-
tems: An overview. Special Issue in Numerical Analysis and System Theory,
Edited by S.L. Campbell, International J. of Applied Mathematics and Computa-
tional Science, 11:1093-1121, 2001.
2. A.C. Antoulas and D.C. Sorensen. The sylvester equation and approximate bal-
anced reduction. Special Issue in Numerical Analysis and System Theory; Edited
by V. Blondel, D. Hinrichsen, J. Rosenthal, and P.M. van Dooren, Linear Algebra
and It's Applications, 2002. to appear.
3. J. M. Ball. Singularities and computation of minimizers for variational problems.
Oxford FoCM, Lecture Notes, 1999.
4. C. Carstensen. Numerical analysis of non-convex minimization problems all-
lowing microstructures. Zeitschrift fur Angewandte mathematik und Mechanik,
76(S2):437-438, 1996.
5. D. Cox, P. Kloucek, and D. R. Reynolds. The non-local relaxation of nonattain-
able differential inclusions using a subgrid projection method: One dimensional
theory and computations. Technical Report 10, Ecole Poly technique Federale de
Lausanne, July 2001. to appear in: SIAM J. Sci. Comp., (2003).
Stochastic Relaxation of Variational Integrals with Non-attainable Infima 249
Summary. Today the finite element method is known as a powerful tool capa-
ble of solving complex flow in complex geometries. Additionally, the unstructured
grid topology is a complementary tool which effectively increases computational ef-
ficiencies. On the other hand, the finite element volume methods incorporate the
advantages of conserving the conservative quantities within elements. However, the
accurate conservation statements need utilizing suitable approximation at cell faces.
In convection dominated flows, upwind-based schemes are strongly utilized. How-
ever, these schemes do not suffice to incorporate the details of pressure field in the
approximation. Therefore, the pressure-weighted upwind scheme is a better choice
for a flow field with high pressure gradients. In this work, a pressure-weighted up-
wind scheme is suitably extended for solving incompressible flow on unstructured
grids. Subsequently, a remedy is given for the problem associated with using equal-
order pressure and velocity interpolations. Eventually, the extended formulations are
validated against suitable benchmark problems involving small and large scale re-
circulation zones. Comparing with the benchmark solutions, the current results are
excellent.
1 Introduction
2 Governing Equations
In the present study, we are concerned with the two-dimensional incompressible
steady flow. The governing equations consist of the conservation statements
for mass and momentums. The non-dimensional vector form of the governing
equations is given by
\7* . V* = 0 (1)
Re [\7* . (V*V*) + \7*p*] = \7*2V* (2)
where the lengths x & y, velocity V, and pressure p variables are nondimension-
alized with respect to a characteristic length Loo (e.g., x*=x/Loo, y*=y/L oo ),
a reference velocity Voo (e.g., V*=V /Voo ), and a reference density Poo (e.g.,
p*=p/(Poo V~)).
iii Cell
[lElement
Fig. 1. A part of unstructured grid representing the elements and a constructed cell
of the unknown variables. There are three main neighbors around each trian-
gle, see element ABC in Fig. 1. There are no limits in the number of elements
intersect at a node. For example, the shaded area in Fig. 1 shows six triangles
which encompass node P. Therefore, to utilize the benefits of cell-centered
schemes, each element is divided into three quadrilaterals by the help of its
three medians. The medians are demonstrated by dashlines in Fig. 1. The cells
are then constructed from the proper assemblage of these sub-quadrilaterals.
As is seen, irrespective of the shape and distribution of the elements, each
node is surrounded by a number of sub-quadrilaterals. The proper assemblage
of neighboring sub-quadrilaterals around any non-boundary node creates a
polygon cell. In case of unstructured grid, it is possible to have a hybrid mesh
composed of polygons having different number of sides. It is because the num-
ber of elements which visit an specific node is not fixed.
3 Computational Modelling
i V·dA=O (3)
ns ns
~[pu (u dAx + v dAY)]i = - ~(p dAx)i + ~
ns [
p,
( au au) ]
ax dAx + oy dAy
i
(7)
A Pressure-Weighted Upwind Scheme 253
ns
8[Pii (u dAx + v dAY)]i = -
ns
8(P dAY)i + 8
ns [
/-L
( ov ov) ]
ox dAx + oy dAy i
(8)
where i counts the number of cell faces from 1 to ns. The number of cell faces
around node P in Fig. 1 is 12. The bar over u and ii indicates that the variables
are approximated from the known magnitudes of the preceding iteration. These
estimations are necessary in order to linearize the nonlinear convection terms.
The rest of procedure is to relate the cell face magnitudes (identified by lower
case letters such as u, v, and P variables) directly to the nodal magnitudes
(identified by upper case letters such as U, V, and P variables) where the
unknown variables are located. A simple idea for treating the right-hand-side
terms is the use of finite element shape functions N j =1...3, i.e.,
3
Pi = LNijPj (9)
j=l
o¢ I
oz .t
= t
j=l
oNij <p.
oz J
(10)
where Pi identifies the magnitude of P at the mid-point of ith cell face. The j
notation counts the node numbers of an element where the ith cell face is lo-
cated inside it. Additionally, the variable z represents either x or y coordinates
and ¢ represents either u or v velocity components. In the above expressions,
lower and upper case letters represent cell face and nodal magnitudes, respec-
tively.
The above treatments end the pressure and diffusion term calculations at
cell face i. However, more sophisticated expressions are required to treat the
convection terms. In fact, the treatment should not disregard the convection-
diffusion physics and concept. To mimic the correct physics of the convection,
254 M. Darbandi et al.
the convection term in the left-hand-side is upwinded. Considering the ith cell
face in Fig. 2, one inclusive suggestion is given by
which has been written in the streamwise direction at mid-point of the ith cell
face. The length 118 is a geometry sensitive parameter shown in Fig. 2 as s.
Then, we need to determine the gradient of <P along the streamline. We try
to approximate this gradient using the original governing PDE's. In another
words, one meaningful approximation can be obtained by writing the revised
momentum equations in the streamwise direction, i.e.,
(12)
where I;j =vu 2 + v 2 is the total velocity at the cell mid-point and the source
term 8¢ represents either op/ox in treating x-momentum or op/oy in treating
y-momentum. The substitution of Eq.(12) in Eq.(11) results in
(13)
(14)
where Li is an appropriate diffusion length scale [6]. This length can be esti-
mated in an specified triangle by discretizing the diffusion terms using central
differencing.
Equation (14) shows that <Pi appears in both sides of equation. As is seen,
considering a lagged role for ((; in the diffusion term results in a passive role
of diffusion term in the formulations. To switch it to an active role, it is not
lagged and the impact of this term is taken to the left-hand-side of Eq.(14). A
suitable rearrangement of the new equation in terms of our major dependent
variables, i.e., cjjj and Pj, yields
3 3
<Pi = I:>l!ijcjjj + L{3ij Pj +,i (15)
j=l j=l
where a, {3, and, represent matrix, matrix, and vector coefficients, respec-
tively. The above statement indicates that <P (== u, v) at cell face can be ap-
proximated by the proper assemblage of cjj and P influences. In fact, this
approximation can be regarded as a pressure-weighted upwind scheme.
A Pressure-Weighted Upwind Scheme 255
As is known, one major disaster with the continuity equation is the lack
of having any explicit pressure term despite representing the pressure field.
The past investigation has shown that the ignorance of this important point
can result in non-physical wavy solution [9J. The most important reason for
this non-physical wavy solution is the employment of equal order pressure and
velocity interpolations in Eqs. (6)-(8). The use of unequal interpolations is
known as a general remedy to suppress the non-physical solution. However,
the current innovate idea suggests the use of more sophisticated interpolations
such as Eq.(15) which enforces the direct role of pressure field in the continuity
equation. Therefore, using Eq.(15) in the continuity equation may eliminate the
need for unequal order interpolations. In another words, Eq.(3)should no longer
permit the occurrence of a non-physical solution in the domain. Although this
strategy theoretically seems to work well, some deficits has been practically
encountered. For example, defining m = p(u dAx + v dAy), Eqs.(6)-(8) for
Euler flow can be re-written as
ns
Lmi=O (16)
i=l
ns ns
(17)
i=l i=l
ns ns
(18)
i=l i=l
-N
p\/ as = v . J-l V ¢ + s¢ - ¢( ~ ~
ax + a) (19)
where the new additional gradient terms in the parentheses provide new state-
ments for the velocity components. Considering the new definitions, Eq.(13)
is revised to
The new additional gradient terms are approximated using the approach em-
ployed in Eq.(lO). The rest of equation is treated similar to Eq.(14). The tilde
over ¢ indicates that these velocity statements are used in the continuity equa-
tion.
256 M. Darbandi et al.
The derived formulations are tested using standard squared cavity [10] and
triangular cavity [11] benchmark problems which involve many complex re-
circulation zones. The first problem is the flow in a squared cavity driven by
its upper lid. The problem is tested in Re=3200. Figure 3(left) demonstrates
a typical non-uniform unstructured grid distribution in the cavity. The grid
generator has been developed by the current authors. It is capable of generat-
ing different types of unstructured grid topology. As is observed, the grid has
been properly refined in regions with high flow field gradients. Figure 3(right)
depicts the streamlines in the cavity. The recirculation zones in bottom corners
and top-left corner resemble the complexity of this flow field. They have been
detected successfully and accurately.
To investigate the advantages of using the extended formulations on an
unstructured grid, the cavity is tested on both uniform and non-uniform un-
structured grids. The grid resolution is 71 x 71 for uniform grid. The total
number of nodes in non-uniform grid is 5508 which is close to that of uniform
grid, i.e., 5684. The number of cell for them is 10734 and 11086, respectively.
Fig. 3. A typical unstructured grid and the obtained streamlines in the cavity
10".,.------------,
- - NonunifOlmGrid7lx71
- - URcsidual
•••••••••••••• Uniform Grid 7lx71
••••..•.•.•.•. PResidual
Benchmark
10" Uniform Grid 51x51
£~10'2 .!.~~wtlfNNNiNvW¥~
~WN¥i1NNtNfN'i~\vN¥
0.5
~10~1
§
;:0
.........
Nonuniform Grid 51x51 ..........................
Fig. 4. The centerline velocities in the cavity and the convergence histories
A Pressure-Weighted Upwind Scheme 257
Figure 4(left) demonstrates the centerline u and v velocities for both uniform
and non-uniform grid types. The current results are compared with each other
and those of benchmark [10]. The figure shows that the results of non-uniform
grid distribution are similar to that of uniform grid. Additionally, they are
in good agreement with the benchmark solution. Figure 4(right) shows the
residual histories for both U velocity component and P fields. The V velocity
component history is very similar to that of U. The histories are presented for
both uniform and non-uniform grids. Despite performing equal accuracies in
Fig. 4(left), the performances are different. As is seen, the convergence insta-
bilities are dominant in uniform grid. However, the non-uniform unstructured
grid resembles a smooth and stable residual reduction.
The second test problem is the steady recirculating viscous flow in an equi-
lateral triangular cavity. A primary eddy and several secondary eddies at dif-
ferent regions indicate the complexity of the flow field. Figure 5 shows the
geometry of the cavity with two typical unstructured grid distributions. The
top horizontal wall is sliding with a constant velocity. The non-uniform topol-
0.7
0 .•
0.5
>- 0.4
0.2
0.1
ogy suitably clusters the grid around the corners. Of course, the refinement
helps to achieve more accurate results for an insufficient number of mesh nodes.
Figure 6 shows the streamline patterns (left) and isobar lines using non-
uniform grid distribution. The Reynolds number is 500. The total number of
nodes is 1135. As is observed, there are one primary eddy on top and two
secondary eddies under it. The eddies shrink in lower levels. Additionally,
there is one irregular recirculation on the left edge which has been successfully
detected. Unfortunately, there is no quantitative report of velocity magnitudes
for this benchmark case. However, a qualitative comparison has been done
with other references such as Refs. [11, 12]. Table 1 shows that the x and
y magnitudes of the eddy center locations are in good agreement with those
obtained from the benchmark solutions.
References
1. Mavriplis, D.J. (1997): Unstructured Grid Techniques, Annu. Rev. Fluid Mech.,
29, 473-514.
2. Tezduyar, T.E. (1992): Finite Element Computation of Unsteady Incompress-
ible Flows Involving Moving Boundaries and Interfaces and Iterative Solution
Strategies, AGARD-R-787, France, Chap. 3.
3. Hughes, T.J.R., Franca, L.P., and Balestra, M. (1986): A New Finite Element
Formulation for Computational Fluid Dynamics: V, Compo Methods Appl. Mech.
Eng., 5985-99.
4. Donea, J. (1984): A Taylor-Galerkin Method for Convective Transport Problems,
Int. J. Numer. Methods Eng., 20, 101-119.
5. Hansbo, P., and Szepessy (1990): A Velocity-Pressure Streamline Diffusion Fi-
nite Element Method for the Incompressible Navier-Stokes Equations, Compo
Methods Appl. Mech. Eng., 84, pp.175-192.
6. Darbandi, M., and Schneider, G.E. (1999): Application of an All-Speed Flow
Algorithm to Heat Transfer Problems, Numer. Heat Trans. A, 35, pp.695-715.
7. Baliga, B.R., and Patankar, S.V. (1987): Elliptic Systems: Finite-Element Method
II, Handbook of Numerical Heat Transfer, Edited by W.J. Minkowycz, E.M. Spar-
row, G.E. Schneider, R.H. Pletcher, John Wiley, New York, pp.421-461.
8. Prakash, C. (1986): An Improved Control Volume Finite-Element Method for
Heat and Mass Transfer, and for Fluid Flow Using Equal-Order Velocity-Pressure
Interpolation, Numer. Heat Trans., 9, 253-276.
A Pressure-Weighted Upwind Scheme 259
Vft Dolejsf
1 Introduction
We deal with the numerical solution of the compressible N avier-Stokes equa-
tions with the aid of the discontinuous Galerkin finite element method
(DGFEM). This method has become quite popular and it is discussed in a num-
ber of papers. For a review of DG methods, see [4] or [5]. Let us mention the
papers [1] and [2] dealing with the numerical simulation of compressible flows,
where the mixed formulation is applied to the treatment of the viscous terms.
We develop the so-called DGFEM with nonsymmetric interior penalty
terms. This method was applied to the solution of a scalar nonlinear
convection-diffusion equation in [8] where a complete numerical analysis is
presented. The extension of DGFEM to the system of the Navier-Stokes equa-
tions is straightforward (see preliminary results in [6]) but some suitable lin-
earization of diffusive terms has to be performed. That is the subject of this
paper.
In Section 2 the continuous problem describing compressible flow is formu-
lated. DGFE discretization is introduced in Section 3, where also the lineariza-
tion of diffusive terms is discussed. Several numerical examples demonstrating
the efficiency of the method are presented in Section 4.
2 Continuous problem
Let n c JR2 be a bounded plain domain and T > O. We set QT = n x (0, T)
and by an we denote the boundary of n
which consists of several disjoint
parts. We distinguish inlet n,outlet To and impermeable walls Tw on an.
We want to find a vector-valued function w : QT ----> JR4, such that
DGM for the Navier-Stokes equations 261
2 2
OW + '"
L
ofs(w) ~ ' " oRs(w, V'w)
- L inQT, (1)
m s=l
o~
s=l
oXs
where
1 (Ovo OV 2 )
Tij = Re ox: + ox: - "3 divvbij ,i, j = 1,2.
(3)
c) ~
2 (2~Tijni ) Vj + RePr
'Y
on = ° on n x (O,T);
oB
oB
a) Vlrwx(O,T) = 0, b) on Irwx(O,T) = 0; (6)
°
2
3 DGFE discretization
3.1 Triangulation
UU Tij=nUTW' (9)
iEI jE,D(i)
Moreover, we set
'N(i) = ,(i) \ 'D(i), (10)
where a Neumann type of boundary condition is prescribed for all components
ofw.
Furthermore, we use the following notation: nij = (( nij ) l ' (nij ) 2) = unit
outer normal to OKi on the edge Tij (nij is a constant vector on Tij) and
lTij I = length of the edge Tij . Over the triangulation Yr. we define the broken
Sobolev space
Hk(D, Yr.) = {V;VIK E Hk(K) VK E Th }. (11)
For v E Hl(D, Yr.) we set
denoting the average and jump of the traces of v on Tij = Tji , respectively.
Obviously, (V)I'ij = (V)I'jil [V]rij = -[v]I'j., and [V]I'ijnij = [v]I'jinji.
DGM for the Navier-Stokes equations 263
We derive the discretization of problems (CFP) with the aid of the DGFEM.
The approximate solution Wh as well as test functions 'Ph are elements of the
finite dimensional space of vector-valued functions
(13)
where
(14)
P E Z+ and PP(K) denotes the space of all polynomials on K of degree:::; p.
Assuming that W is a classical sufficiently regular solution of problem
(CFP) and 'P E [H2(D, lhW, we multiply equation (1) by 'P, integrate over
Ki E lh, apply Green's theorem, sum over all Ki E Th and with the aid of (8)
we arrive at the identity
(15)
j
2
- L L LRs(w, "ilw)(nij)s' 'P dS = O.
iEI jE'"Y(i) r" 8=1
3'2 ReWl
1 [2 (awaX23 - aX2 - (~
Wl £.!!!.J..)
!£l!.
aXl - ~
Wl aWl)]
aXl
Re Wl
1 [(~
aXl - ~ aXl + aX2 - '£l.
Wl £.!!!.J..) Wl aWl)]
aX2(0£2
W2 D(2)
Wl 1
+ !!'.!l.
Wl
D(3) +
1
"( [£<.e.i _ .'£±
RePrwl aXl
£.!!!.J.. _
Wl aXl
-..L
Wl
(w 2
012
aXl
+ w3 ~)
aXl
Re Wl
1 [(~
aXl - ~ aWl)
Wl aXl
+ (0£2
aX2 - '£l.
Wl
aWl)]
aX2
3'2 ReWl
1 [2 (~
aX2 - Wl aWl)
'£l.
aX2 - (0£2
aXl - Wl £.!!!.J..)]
'£l.
aXl
W2 D(2)
Wl 2
+ W3Wl
D(3)
2
+ RePrWl
'Y [£<.e.i
aX2
_ .'£± aWl
Wl aX2
_ -..L
Wl
(w 2
012
aX2
+ w2 ~)
aX2
The definition of Ds, 8 = 1,2 can be given in other forms. We only require
that they are linear with respect eph and satisfy (18). The natural way, how to
perform the linearization of the diffusion terms, follows from (16) where the
DGM for the Navier-Stokes equations 265
space derivatives of ware simply replaced by the derivatives of cp. This lin-
earization gives also the dependence on the gradient of the first component of
cp. However, numerical experiments carried out with the aid of this lineariza-
tion do not yield satisfactory results. Therefore we use (17) which gives terms
DB'S = 1,2 independent of \i'CP1. For more detail see [7].
We add the following terms to the left-hand side of (15):
L L 1 2
L(Ds(w, \i'w, cp, \i'cp)) (nij)s . [w] dS
iEI jEa(i) r'J 8=1
(19)
j<i
+L L
iEI jEr(i)
1 F;j
2
L D8(W, \i'w, cp, \i'cp) (nij)8 . w dS.
8=1
In the second term we use the zero natural Neumann boundary conditions
(7), a)-b) and the Dirichlet conditions are taken into account with the aid of
additional terms on the right-hand side of (15).
Moreover, to the left-hand side of (15) we add the vanishing interior penalty
1
terms
L L CI[W]' [cp] dS (20)
iEI jEs(i) r'J
j<i
with CIIF;j = (Re Irij l)-l and boundary penalty terms balanced by additional
right-hand side terms containing the Dirichlet boundary data.
We arrive at the definition of the following form
+L
iEI
L
jEs(i)
1 2
L(D8(w, \i'w, cp, \i'cp)) (nij)8 . [w] dS
r," 8=1
j<i
- L L 1 2
LR8(w, \i'w) (nij)s' cpdS
iEI jErD (i) r'J 8=1
+L
iEI
L
jErD (i)
1 2
L D8(W, \i'w, cp, \i'cp) (nij)8 (w - WB) dS
r'J 8=1
+L
iEI
L
jEs(i)
J<1.
1 r,"
CI[W]' [cp]dS + L
iEI
L
jErD(i)
1
r'J
CI (w - WB)' cpdS.
266 V. DolejSi
The boundary state WB will be defined later. The convective terms are repre-
sented by the form
Bh(Wh' Cf'h) = - ~ Ii ~ N
+2.: 2.:
iEI jES(i)
JF;j
H(whlF;j,Whlrjilnij)'Cf'h dS, Wh,Cf'h EH1 (D,1h)4,
(23)
which means that we use the "extrapolation" of Wr onto r ij from Ki E Th . In
particular, we have
WB = (pij, 0, 0, Pij 8ij ) on rw, (24)
where PD and v D = (v D1, VD2) are the given density and velocity from the
boundary conditions (5) - (7) and Pij, 8ij are the values of the density and
absolute temperature extrapolated from Ki onto r ij .
Now the discrete DGFE Navier-Stokes problems read:
Definition 1. An approximate DGFE solution of the compressible Navier-
Stokes problem (eFP) is defined as a vector-valued function Wh such that
4 Numerical examples
We implemented the numerical scheme (25), a)-c) with the aid of a piecewise
linear approximation on regular triangular grids. The system of ODE was
solved by the explicit Euler method.
Now we consider four cases of viscous flow around the profile NACA0012
with the following data, see [3]:
case Min a Re case Min a Re
C1 0.80 10° 500 (3 0.85 0° 500
(2 2.00 10° 106 (4 0.850° 2000
where Min is the far field Mach number, a the angle of attack and Re the
Reynolds number. We compare our results with the numerical results pre-
sented in [3], where ten methods were applied. The following table contains
our computed lift CL and drag CR coefficients in comparison with [3] (#Ih
denotes the number of elements of the mesh Ih)
The computed values of drag and lift correspond to the reference values
from [3]. Figures 1 shows the employed triangulation and the computed isolines
of the Mach number for the case (2.
References
1. Bassi, F., Rebay, S.(1997): A high-order accurate discontinuous finite element
method for the numerical solution of the compressible N avier-Stokes equations.
J. Comput. Phys, 131, 267-279
2. Bassi, F., Rebay, S.(2000): A high order discontinuous Galerkin method for com-
pressible turbulent flow. In Cockburn, B., Karniadakis, G. E., Shu, C.-W. (eds)
Discontinuous Galerkin Method: Theory, Computations and Applications, Lecture
Notes in Computational Science and Engineering 11, pages 113-123. Springer-
Verlag
3. Bristeau, M. 0., Glowinski, R., Periaux, J., Viviand, H. eds (1987): Numerical
Simulation of Compressible Navier-Stokes Flows, volume 18 of Notes on Numerical
Fluid Mechanics. Vieweg, Braunschwig
4. Cockburn B. (1999): Discontinuous Galerkin methods for convection dominated
problems. In Barth, T.J., Deconinck, H., (eds) , High-Order Methods for Com-
putational Physics, Lecture Notes in Computational Science and Engineering 9,
pages 69-224. Springer, Berlin
268 V. Dolejsi
-1 o 2 3 4
-1 o 2 3 4
Fig. 1. Viscous flow along NACA 0012, case (2, triangulation (top), isolines of the
Mach number (bottom)
5. Cockburn, B., Karniadakis, G. E., Shu, C.-W. eds (2000) Discontinuous Galerkin
methods. Lecture Notes in Computational Science and Engineering 11., Springer,
Berlin
6. DolejSf, V. (2002): A higher order scheme based on the finite volume approach.
In R. Herbin and D. Kroner, (eds), Finite Volumes for Complex Applications III
(Problems and Perspectives), pages 333~340. Hermes
7. Dolejsi, V.: On the discontinuous Galerkin method for the numerical solution of
the Navier~Stokes equations. Int. J. Numer. Methods Fluids, (submitted)
8. Dolejsi, V., Feistauer, M., Sobotikova, V.: A discontinuous Galerkin method for
nonlinear convection~diffusion problems. Comput. Methods Appl. Mech. Eng.
(submitted)
9. Feistauer, M., Felcman, J., StraSkraba, I. (2003): Mathematical and Computa-
tional Methods for Compressible Flow. Oxford University Press, Oxford
10. Toro, E.F. (1997): Riemann Solvers and Numerical Methods for Fluid Dynamics.
Springer-Verlag
A Finite Volume Scheme on General Meshes
for the Steady N avier-Stokes Equations in Two
Space Dimensions
Summary. We introduce a new finite volume scheme for the discretization of the
incompressible Navier-Stokes equations on general meshes, for which we prove con-
vergence without any condition on the regularity of the solution. Numerical results
are presented.
Numerical schemes for the Navier-Stokes equations (1) have been extensively
studied: see [7, 11, 12, 13, 8, 6, 15] and references therein. An advantage of
the finite volume schemes is that the unknowns are approximated by piece-
wise constant functions: this makes it easy to take into account additional
nonlinear phenomena or the coupling with algebraic or differential equations,
for instance in the case ofreactive flows. In [11] is presented the classical finite
volume scheme on rectangular meshes, which is the basis of many industrial
applications. A convergence proof of the so-called MAC scheme is given in
[10] in the case of a uniform rectangular grid. However, the use of rectangular
grids limits the type of domain which can be gridded, and more recently, finite
volume schemes for the Navier-Stokes equations on triangular grids have been
presented: see for example [9] where the vorticity formulation is used, and [2]
where primal variables are used with a Chorin type projection method (but
no proof of convergence is known). Here, we propose a new method using the
primitive variables and enforcing the divergence condition directly, using quite
general meshes such as mixed rectangular-triangular or Voronol meshes, and
for which we are able to prove convergence under general conditions (in par-
ticular, no regularity of the exact solution is required). An error estimate in
the case of the linear Stokes equations was presented in [4].
We seek an approximation of u = (u(l), u(2))t E HJ(Sl) x HJ(Sl) and
P E L2(Sl) , weak solution to the incompressible generalized Navier-Stokes
equations:
where TJ 2:: 0, u (1 ) and u (2 ) are the two components of the velocity, p denotes
the pressure, 1/ the viscosity of the fluid, under the following assumptions:
°
the unsteady Stokes or Navier-Stokes equations (with TJ as the inverse of the
time step, TJ = yields in the steady-state.
We prescribe for both problems a homogeneous Dirichlet boundary condi-
tion on the velocity (u(1),u C2 »). Let us denote by x = (x(1),x C2 ») any point of
D and by dx the 2-dimensional Lebesgue measure dx = dx (1 )dx C2 ).
(5)
where the trilinear form b is defined for all u, v, wE (HJ(D))2 by
b(u,v,w) = L L
k=1,2 i=1,2 n
1 u(i)(x)aiVCk)(x)wCk)(x)dx, (6)
b(u,v,w) = L L
k=1,2 <=1,2
l ai(u(i)VCk»)(X)WCk) (x)dx.
...... point XK
.... ·point XL
The dual cell around x s , denoted by S, is then defined as (also see Figure
1):
S = UKEMsKs.
Since there is a one-to-one mapping between the set {I, ... , N v } c Nand
the set V, we shall replace all subscripts s by S when dealing with the dual
mesh. Let VK denote the set of vertices of a given control volume K. Note
that:
where Ixyzl designates the absolute value of the measure of the angle xyz (note
- - - = "27f - ZIIXSXK·
t hat ZIIXKXS ---)
.
For all K E M and 0' E EK, we denote by nK,1I the unit vector normal to
0' outward to K. We denote by dK,1I the Euclidean distance between XK and
0'. We then define
m ii
TK,II = -d-'
K,II
The set of interior (resp. boundary) edges is denoted by E jnt (resp. Eext) ,
that is E jnt = {o' E E; 0' ct
aD} (resp. Eext = {o' E E; 0' caD}). For any
0' E Ejnt,O' = KIL (resp. Eext' 0' E EK), let XII be the center point of the line
segment [XKxL] (resp. [XKZII]) , and X~l) and x~2) its coordinates.
For all K E M and all S E VK, let 0'1 and 0'2 E EK n Es numbered such
that (x~~) - x~))(x~;) - x~l)) - (x~21- x~))(x~;) - x~l)) > 0.
Let A (1) = x(2) _ x(2) and A (2) = x(l) _ x(l)
K,S II} 1I2 K ,S 1I2 II} .
and
A Finite Volume Scheme on General Meshes 273
Let Lv(D) be the space of functions which are piecewise constant on the
domains S, for all S E V. Let divv : (Hv(D))2 -+ Lv(D) be defined by:
[v,Wlv =
where Krr denotes the control volume to which IJ is an edge. We define a norm
in Hv (D) (thanks to the discrete Poincare inequality (11) given below) by
and
[v, w]v = L [v (i) ,w(i)]v.
i=1,2
We only present here a centered finite volume scheme, and refer to [5l for the
upstream version. Under hypotheses (2)-(4), let D be an admissible discretiza-
tion of D. Let A E (0, +00). The finite volume scheme for the approximation
of the solution (1) writes: find u such that
u E Ev(D),
TJ In u(x) . v(x)dx + v[u, vJv + bv(u, u, v) = In f(x) . v(x)dx, \:Iv E Ev(D),
(12)
where, for u, v and w E Hv(D),
274 R. Eymard, R. Herbin
(14)
As in the case of the linear problem (see [4]), we use the following penalized
approximation of (12):
-1 1
(u,p) E (Hv(D))2 X Lv(D),
v ([u, v]v) p(x)divv(v)(x)dx + bv(u, u, v) = f(x) . v(x)dx,
(15)
Vv E (Hv (D))2,
divv(u) = -A size(V) p,
1 ('"
v2 ~211f (i) IIL2(!2) ) < C 2 ._ 1
.- 4diam(D)C 1
(16)
A Finite Volume Scheme on General Meshes 275
is fulfilled. Then there exists one and only one function u E (Hv(D))2 such
that
4 Numerical results
potential q)(J"> located at the edges (J E E of the mesh (see [5]). We present
in Figure 2 the streamlines in three different cases: starting form the top, the
first figure is obtained with the centered scheme, using a 25200 rectangular grid
blocks mesh, the second one with the centered scheme using a 2800 rectangular
grid blocks mesh, the third one with the upstream scheme using a 2800 rect-
angular grid blocks mesh, and the two last ones with respectively the centered
and the upstream scheme for 847 cells. It is clear from these figures that the
centered scheme is, as one could expect, more precise, but that it becomes un-
stable for coarser meshes. In fact, for a mesh of 700 cells, the Newton iterations
do not converge, even when using an under-relaxation procedure.
~-~-=------,,--- ~
~ - -- --
-- - -
~.~~.~~~- -~~- -
The numerical solution obtained with the centered scheme, using a 25200
rectangular grid blocks mesh seems to be precise enough (comparing the sep-
aration and reattachment lengths with those of the literature, see [5]) to be
used as a reference solution for experiments carried out on coarser meshes.
This allows to compute a rate of convergence of h 2 .
We conclude from these numerical tests that the upstream scheme is too
diffusive and cannot be used for accurate results, although it has the advantage
of remaining stable even on coarse meshes. The centered scheme yields accurate
results for a reasonable number of Newton iterations (typically between 5
and 15).
A Finite Volume Scheme on General Meshes 277
References
1. B.F. Armaly, F. Durst, J.C.F. Pereira and B. Schonung, "Experimental and The-
oretical investigation of backward-facing step flow" J. Fluid Mech. (1983) vol. 127,
pp 473-496.
2. S. Boivin, F. Cayre, J.M. Herard, A finite volume method to solve the Navier-
Stokes equations for incompressible flows on unstructured meshes, Int. J. Therm.
Sci., 38, 806-825, 2000.
3. R. Eymard, T. Gallouet and R. Herbin, Finite Volume Methods, Handbook of
Numerical Analysis, Vol. VII, pp. 713-1020. Edited by P.G. Ciarlet and J.L. Lions
(North Holland).
4. R. Eymard and R. Herbin, A cell-centered finite volume scheme on general meshes
for the Stokes equations in two dimensions, 125-128, t.337, 2, 2003. CRAS,
Mathematiques.
5. R. Eymard and R. Herbin, A finite volume scheme on general meshes for the
steady Navier-Stokes problem in two space dimensions, LAPT Report nO , sub-
mitted
6. J.H. Ferziger, M. Peric, Computational Methods for Fluid Dynamics. Springer,
Berlin, 1996.
7. V. Girault, P.-A. Raviart, Finite element methods for the Navier-Stokes equa-
tions: Theory and algorithms, Springer, Berlin, 1986.
8. M.D. Gunzburger, Finite element methods for viscous incompressible flows, A
guide to thoery, practice, and algorithms, Computer Science qnd Scientific Com-
puting, Academic Press 1989.
9. M.D. Gunzburger and R.A Nicolaides Incompressible computational fluid dy-
namics, Cambridge University Press, 1993.
10. R.A Nicolaides and X. Wu, Analysis and convergence of the MAC scheme II,
Navier-Stokes equations, Math. Compo 65 (1996), 29-44.
11. S.V. Patankar, (1980), Numerical Heat Transfer and Fluid Flow, Series in Com-
putational Methods in Mechanics and Thermal Sciences, Minkowycz and Sparrow
Eds. (Mc Graw Hill).
12. R. Peyret and T. Taylor, Computational methods for for fluid flow, Springer,
New-York,1893.
13. O. Pironneau, Finite element methods for fluids, John Wiley and sons, 1989.
14. R. Temam, Navier-Stokes Equations, Studies in mathematics and its applica-
tions, J.L. Lions, G. Papanicolaou, R.T. Rockafellar Editors, North-Holland,
1977.
15. P. Wesseling, Principles of Computational Fluid Dynamics, Springer, Berlin,
2001.
Existence and Uniqueness of a Weak Solution
to a Stratigraphic Model
1 Introduction
2 Mathematical Model
We consider in this paper the model defined by Eymard et al. in [2] in a sim-
plified case for which the diffusion coefficients of the lithologies are taken equal
to one. Furthermore, the sea level variations and the ground distortions are
not taken into account.
Let us denote by h(x, t) the sediment thickness variable, function of time
t > 0 and of x E fl C IR d , d = 1 or 2, the horizontal extension of the
basin. The sediments are modeled as a mixture of L immiscible lithologies,
such as sand or shale, characterized by their grain size population, and con-
sidered as incompressible materials of constant grain density and null poros-
ity. Inside the basin, the mixture is described by its composition given by
the L concentrations Ci(X, z, t) ;::: 0 in lithology i defined on the domain
B = {(x, z, t) I x E fl, t > 0, z < h(x, t)}, and satisfying 2:~=1 Ci = 1. The
sediments transported by the surficial fluxes, i.e. deposited at the surface in
case of sedimentation and passing through it in case of erosion, are character-
280 R. Eymard et al.
ized by their concentrations cHx, t) ::::: 0, defined on [l x 1R,+, and also satisfying
2:~=1 cf = l.
Since the sediment fluxes are non zero only at the surface of the basin, no
change of the sediment composition occurs inside the basin: OtCi = 0 on B.
The evolution of Ci is then only governed by the boundary condition at the
top of the basin stating that cilz=h = ci in case of sedimentation (Oth > 0).
An initial condition to the basin concentrations is also prescribed: Ci It=o = c?
on {(x, z) Ix E [l, z < hO(x)}.
Let us now consider these equations in the new coordinate system (x,~, t) =
(x', h(x', t') - z, t') in which the vertical position of a point is measured down-
ward from the top of the basin, and let us define Ui(X,~, t) = Ci(X, h(x, t) -~, t)
on [l x 1R,+ X 1R,+, u?(x,~) = c?(x, hO(x) -~, t) on [l x 1R,+. Then, we get the
new problem:
lithology a first order linear equation for the surface concentration ef with a
linear advection equation for the concentration Ui, for which ef appears as an
input boundary condition at the top of the basin in case of sedimentation:
r r r [8tcp(x,~,t)+8th(x,t)8t;,cp(x,~,t)lui(x,~,t)dtd~dx
In lIR+ lIR+
+ r r u?(x,~)cp(x,~,O)d~dx+ r r 8t h(x,t) ef(x, t)cp(x, O,t) dtdx =0,
lnlIR+ lnlIR+
°
(5)
(ii) for all 'lj; E {q'J E Cr;:'(JR d+2) Iq'J(., 0,.) = on 8n x JR+ \ E+},
282 R. Eymard et al.
(6)
Then, the main result of this paper is the following Theorem. Its proof will be
developed in the next section.
Theorem 1. Assuming that Hypothesis 1 holds, there exists a weak solution
(u;, enE LOO(D x 1R+ x 1R+) x LOO(D x 1R+) for all i E {I, ... , L} to problem
(4) in the sense of Definition 1, and u; is unique.
The aim of this section is to prove Theorem 1. The proofs of Lemmae 1 and 2
used in the sequel are technical and will be detailed in a forthcoming paper.
en
The existence of a weak solution (u;, E LOO(DxIR+ xIR+) xLOO(DxIR+)
to (4) in the sense of Definition 1 is obtained by convergence of an implicit
finite volume scheme for the model, and has already been proved in [1]. Let us
just recall the main stages of this proof.
In the sequel, we shall consider admissible finite volume meshes defined as
follows:
Definition 2. Let D be a bounded domain of IR d , d = 1 or 2. An admissible
finite volume mesh of D for the discretization of problem (3)-(4) is given by a
family of "control volumes", denoted by K, which are open disjoint subsets of
D, and a family of points of D, denoted by P, satisfying the following proper-
ties,'
1. The closure of the union of all the control volumes of K is n.
2. For any K, K' E K with K -I- K', either the (d - I)-dimensional measure of
R n R', denoted by m( R n R'), is null, or it is strictly positive and R n R' is
included in an hyperplane of IR d . In the following, we will denote by E;nt the
family of subsets 0' of D contained in hyperplanes of IR d with strictly positive
measures, and such that there exist K, K' E K with m(RnR') > 0 and ij = RnR'.
We shall also denote by KIK' E E;nt the edge between the cells K and K'.
3. The family P = (X")"EK is such that x" E R (for any K E K), and, if
0' = K K', it is assumed that x" -I- x"' and that the straight line going through
1
x" and x"' is orthogonal to the edge 0'. We shall denote by d(K, K') the distance
between the points x" and x""
4. For any K E K, there exists a subset E" of E;nt such that OK \ aD =
Existence and Uniqueness of a Weak Solution to a Stratigraphic Model 283
U aD) = UO"EE"i7.
Pi, \ (I\:
We shall denote by (K, E int , P) this mesh, and by 8K = sup {diam(l\:) , I\: E K}
its size.
°
Let (K, E int , P) be an admissible mesh of D. The time discretization is denoted
by tn, n E IN, such that to = and Llt n +1 = t n + 1 - t n > 0. The superscript
n, n E IN, will be used to denote that the unknowns are considered at time
tn. For each control volume I\: E K and each time tn, n ~ 0, h~ shall denote
the approximation of the sediment thickness on I\: at time tn, c7 ",(z), z E
(-00, h~), the approximation of the basin concentration Ci in lithology i in
the column {(x,z)lx E I\:,Z < h(x,tn )}, and c:,;+1 the approximation of
the surface concentration in lithology i at time t;"+1 on 1\:. Then, (3)-(4) is
discretized by a fully implicit time integration and a finite volume method
with cell centered variables. For the computation of the fluxes at the edges of
the control volumes, the discretization uses an upstream weighted evaluation of
the surface concentrations. The approximate concentration c~~l is the solution
at time t n +1 of the conservation equation '
and u7 "'(~) = c7 ",(h~ - ~) for all ~ > 0. One can refer to [2] or [1] for the
compl~te numeri~al scheme.
Then, the approximate sediment thickness (h~)"'EK satisfies an implicit
finite volume numerical scheme for the parabolic problem (3) for which exis-
tence, uniqueness and error estimates have already been proved in [3]. Con-
cerning the concentration variables, we show the existence of solutions to the
discrete problem bounded in the interval [0, 1], which are unique except for the
surface concentration variables c;,;+1. For any admissible mesh (K, E int , P) of
D, any time step Llt > 0, and i =' 1, ... , L, let us define the piecewise constant
functions hK,LJ.t, cf,K,LJ.t on D x 1R'f., and Ui,K,LJ.t on D x 1R'f. x 1R'f. by
for all x E 1\:, I\: E K, t E (tn, tn+l], n ~ 0, ~ E 1R'f., where h~, and u?,,,,<:;+1
are any given solution of the discrete problem with c:,;+1 bounded in the in-
°
terval [0,1]. For all mE IN, let (Km, E: t , Pm) be an admissible mesh of D and
Lltm > 0, and let us assume that Lltm ---+ 0, 8K m / VLltm ---+ as m ---+ 00 and
°
that there exists a > such that, for all m E IN, maX~E E~ d 15(K m, ) < a. Let
~nt
u=K:I,.,.,'
Y\,,/'i; -
Let us now show the uniqueness of Ui. It is achieved using mainly the
linearity of (4) in the concentration variables and the adjoint problem on x n
(0, T), T > 0.
For any given surface concentration cf E Loo(n x 1R*+-), we have first stud-
ied the weak formulation (5) of the linear advection equation LUi = with
input boundary condition ci on D+ and initial condition u~. Using the charac-
°
teristic solution of this problem (see [5]), we have proved the following Lemma,
in which an integration by part formula for the solutions of the advection equa-
tion and its adjoint problem is stated:
Lemma 1. Hypothesis 1 is assumed to hold. Then, for any time T > 0, any
functions f E Loo(n x 1R*+- x (0, T)), lS E Loo(Dt), and VO E Loo(n x 1R*+-),
the equation
°
has a unique weak solution in Loo(n x 1R*+- x (0, T)) in the sense that for
all rp E {¢> E C'g"(IRd+2) I ¢>(., 0,.) = on n x (0, T) \ Dt and ¢>(.,., T) =
° on n x 1R*+-}, one has
r r r
T
((Lrp)(x,~, t) v(x,~, t) + f(x,~, t) rp(x,~, t)) dt d~ dx
JnJIR+Jo T
+ r r vO(x,~)rp(x,~,O)d~dx+ r r Oth(x,t) lS(x,t) rp(x,O,t) dtdx =0.
In JIR+ In Jo
(9)
r r rT((Lrp)V+frp)(x,~,t)dtd~dX+ r r (v(x,~,O)rp(x,~,O)
Jn J1R+ Jo Jn J1R+
L
-v(x,~, T) rp(x,~, T)) d~ dx + foT Oth(x, t) v(x, 0, t) rp(x, 0, t) dt dx = 0.
Let T >
equation
°and w be the weak solution in Loo(n
(10)
x 1R*+- x (0, T)) of the adjoint
Existence and Uniqueness of a Weak Solution to a Stratigraphic Model 285
defined in a similar way as above with r E LOO(O x lR+. x (0, T)) a compactly
supported function on fj x lR+ x [0, TJ, w T E LOO(O x lR+.) a compactly sup-
ported function on fj x lR+, and qS E L 00 (0 x (0, T)). Then, one has
r r rT(v(£w)+(£v)w)(x,~,t)dtd~dX- r r (v(x,~,T)w(x,~,T)
} n } 1R+ n } 1R+
11
}0 }
T
-v(x,~, 0) w(x,~, 0)) d~ dx + 8t h(x, t) v(x, 0, t) w(x, 0, t) dt dx = 0.
(12)
Let us denote by (Vi, d't) the difference between any two weak solutions of
(4). From the linearity of the set of equations (4) in the concentration variables,
the functions (Vi, d't) satisfy the weak formulation (5)-(6) with homogeneous
°
boundary and initial conditions.
Let T > 0. From Lemma 1, the function Vi 8t h has a trace at ~ = in
LOO(O x (0, T)) denoted by vilE=o 8t h. Then, from the integration by part
formula (10) of Lemma 1 and the weak formulation (6), it results that for all
°
cp E {¢ E C~(lRd+l) I ¢(x, t) = on 80 x (0, T) \E:J;, and ¢(x, T) = on O}, °
one has
11 T
(Vi(X, 0, t) 8t h(x, t) cp(x, t)+df(x, t) Vh(x, t)· Vcp(x, t)) dt dx = 0. (13)
!
Let us now consider the adjoint system
-WtIE=o 8t h + div(q:sVh) = °
°
on 0 _x (0, T),
qtlE-T = on ET ,
-£w, = v, on 0 x lR+. x (0, T), (16)
wtlE=o = q: on Dr,
wtlt=T = Vtlt=T on 0 x lR+..
The direct and adjoint problems are very close, apart from the non vanishing
right hand side Vi E LOO(Ox lR+. x lR+.) in the advection equation of (16). Then,
the existence of a weak solution (Wi, d't) E LOO(O x lR+. x lR+.) x LOO(O x lR+.)
to the adjoint problem, defined similarly as in Definition 1, can be obtained
in a very close way as in [1] by the convergence of a finite volume numerical
scheme, adapted to the non vanishing right hand side in Loo in the advection
equation.
286 R. Eymard et al.
In IT div(qI d: V'h) dt dx = 0.
According to the definition of the characteristic solution of (11) (see [5]) and
since the velocity 8t h is uniformly bounded on n
x [0, T] for any time T > 0,
the function Vi (resp. its trace Vi It=T) is compactly supported in x 1R+ x [0, T] n
n
°
(resp. in x 1R+). Applying the integration by part formula (12) of Lemma 1
to V = Vi and W = Wi, we get that for any time T >
r r rTlviI2(x,~,t)dtd~dx+ r r IViI2(x,~,T)d~dx
JnJlR+Jo JnJlR+ (18)
= In IT 8 t h(x, t) Vi(X, 0, t) Wi(X, 0, t) dt dx.
From Lemma 2 and the integration over D x (0, T) of (17) multiplied by di,
we obtain
In 1 [di(x, t) Wi(X,
T
0, t) 8t h(x, t) + qt (x, t) V'di(x, t) . V'h(x, t)] dt dx = 0.
(19)
Also, multiplying (15) by qt and integrating over D x (0, T), we get
Summing (19) and (20) and taking into account the boundary conditions
wil~=o = qt on Di, vil~=o = di on D:f, and that 8t h =
(D:f U Di), we obtain
on D x (O,T) \ °
11 T
(Wi(x,O,t)di(x,t) +Vi(X,O,t)qt(x,t) -di(x,t)qt(x,t)) 8th(x,t)dtdx
In 1 Vi(X,
nOT
This theorem, together with the convergence result on the solutions of the
implicit finite volume numerical scheme seen previoulsy, also give the conver-
gence of the full sequence of approximate solutions (Ui,lCrn,Llt rn )mEIN towards
the weak solution Ui of (4).
The proof will be detailed in a forthcoming paper, and particularly Lemmae
1, 2, and the existence of a weak solution to the adjoint problem.
References
[1] Eymard, R., Gallouet, T., Gervais, V., Masson, R.(submitted 2003) : Con-
vergence of a numerical scheme for stratigraphic modeling. SIAM J. Num.
Anal.
[2] Eymard, R., Gallouet, T., Granjeon, D., Masson, R., Tran, Q.(2003) : Multi-
lithology stratigraphic model under maximum erosion rate constraint. Int.
J. of Num. Methods in Eng., (to appear)
[3] Eymard, R., Gallouet, T., Herbin, R.(2000) : The Finite Volume Method. In:
Ciarlet P. and Lions J.(eds.) Handbook of Numerical Analysis, 7. Elsevier
[4] Flemings, P.B., Jordan, T.E.(1989) : A Synthetic Stratigraphic Model of
Foreland Basin Development. J. of Geophysical Research, Vol 94, B4, 3851-
3866
[5] Godlewski, E., Raviart, P.(1996) : Numerical Approximation of Hyperbolic
Systems of Conservation Laws. Springer
[6] Granjeon, D.,(1997) : Modelisation stratigraphique deterministe; conception
et applications d'un modele diffusif 3D multilithologique. Ph. D. disserta-
tion, Gosciences Rennes, Rennes, France
[7] Rivenaes, J.C.(1992) : Application of a dual lithology, depth-dependent dif-
fusion equation in stratigraphic simulation. Basin Research, 4, 133-146
[8] Tucker, G.E., Slingerland, R.L.(1994) : Erosional dynamics, flexural isostasy,
and long-lived escarpments: A numerical modeling study. J. of Geophysical
Research, Vol 99, B6, 12,229-12,243
Combined Nonconforming/Mixed-hybrid Finite
Element-Finite Volume Scheme for Degenerate
Parabolic Problems
1 Introduction
8(3(c)
----at - \7 . (D\7c) + \7 . (cv) + F(c) = q, (1)
The convergence of a finite volume scheme for the equation (1) with D = I d
and F = 0 has been shown in [7]. Finite volumes with upstream weighting tech-
niques are unconditionally stable; however, there are geometrical restrictions
on the mesh for the discretization of the diffusion term and there is no gen-
eral prescription how to discretize full tensors. The finite element method for
degenerate parabolic problems has been studied e.g. in [2]. One can discretize
full tensors and there are no restrictions on the mesh. However, spurious os-
cillations may appear in the velocity dominated case or in the presence of
a reaction term. Hence a quite intuitive idea is to combine finite volume and
finite element methods, trying to use the "best of both worlds". In [1], the
authors introduce a combined scheme for a convection-diffusion equation with
a nonlinear convection term in two space dimensions. In the presented pa-
per, we prove the convergence of this scheme for the equation (1) in two or
three space dimensions. We extend the techniques used in [7] for a scheme
with negative transmissibilities, general meshes satisfying only the regularity
assumption, and cases when the discrete maximum principle is not satisfied.
Assumption (A)
(A1) (3 E C(IR), (3(0) = 0 is a strictly increasing function such that
(A4) v E L2(0, T; H(div, D)) n LOO(QT) satisfies V' . v = qs ::::: 0 a.e. in QT,
Iv, nl :S: Cy , Cy > 0 a.e. on i! x (0, T) Jor each hyperplane i! c D with
normal vector n;
(A5) F(O) = 0, F is a non decreasing, Lipschitz continuous Junction with
a constant L F
or
(A6) F(O) = 0, F is a Lipschitz continuous Junction with a constant LF and
xF(x) ::::: 0 Jor x < 0 and x> M, M> 0;
(A7) q E L 2(QT), where q = qscs with Cs E LOO(QT), 0 :S: Cs :S: M a.e. in
QT;
(A8) Co E LOO(D), 0 :S: Co :S: M a.e. in D.
We now give the definition of a weak solution of the problem (1) - (3).
-1
T
1 cv· V'cpdxdt + 1T 1 F(c)cpdxdt = 1T 1 qcpdxdt (4)
Assumption (B)
(Bi) There exists a positive constant CT such that
diam(K) C
max :S: T Vh > 0,
KETh PK
the set of all boundary dual volumes, and by N(D) the set of all adjacent
volumes to the volume D. For E E N(D), we finally set IJD,E = aD n aE.
We suppose the partition of the time interval (0, T) such that 0 = to <
... < tn < ... < tN = T and define 6tn == tn - tn-l, 6t == max 6tn. When
l::;n::;N
Assumption (AS) is satisfied, we do not impose any restriction on 6t. When
only (A 6) holds, we suppose:
Assumption (C)
(Cl) The maximum time step condition 6t < Z is satisfied.
qD = D
1
tn
IDI ltn],
tn-l D
q(x, t) dxdt.
Finally,
l' fn
vD,E >
_ 0 -n- - cD
cD,E = n + O'.D,E
n (n CE - n )
CD
n <0
if vD,E -n- - n + O'.D,E
n (n n) . (9)
cD,E = cE cD - cE
VD,E # O. (10)
Remark 1. (Numerical flux) We can easily see that 0 ::; O'.D E ::; 1/2, i.e. the
numerical flux defined by (9) ranges from the centered s~heme to the full
upstream weighting.
where
at each discrete time tn, with 9 E L 2 (fl). Then using the hybridization of the
lowest order Raviart-Thomas mixed finite element method, one ends up with a
linear system Mn A = G for the Lagrange multipliers A located in barycentres
of sides, see [4, Section V.1.2]. Using the analytic form ofMn, we define
where
In the sequel, we shall consider apart the following special case, satisfied
e.g. when D = I d and when there is a maximal angle condition:
Assumption (D)
(Dl) All non-diagonal terms of the diffusion matrix are nonnegative, i.e.
One can easily verify that ]]J)'D,D =- L ]]J)'D,E for all D E "Dh and n E
EEN(D)
{I, 2, ... ,N}. Adding the finite volume discretization of the other terms, the
assertion follows.
The assertion follows immediately from Assumption (A3) and the subsequent
uniform positive definiteness of the diffusion tensors (11) or (12).
The proof is similar to that in [6] for pure finite volume schemes.
o 1 r
CD = P5T JD co(x) dx D E "D~nt, (13)
Then L (c'D)2IDI < Ces for all n E {I, 2, ... , N} with Ces > o.
DEDh
294 R. Eymard et al.
PROOF:
considering u ;::: 0, Lemmas 2 and 3, the fact that for cD < 0 or CD > M,
F(cD)cD ;::: 0 follows from Assumption (A5) or (A6), and that when 0 ~ cD ~
1
M, -F(cD)cD ~ IF(cD)llcDI ~ LpM2. Let us now introduce a function B,
B(8) == ,8(8)8 - ,8(T) dT, 8 E lit One then can derive
8
Using that ,8 is non decreasing, one can easily show that J:~l [,8(T) -
CD
which yields
k
Using the growth condition on,8 from Assumption (Ai), one can derive B(8) ;:::
c; 82 for all 8 E lit Thus, using in addition Assumption (A 8),
k
c; L (c1J)2IDI- M,8(M)IDI ~ L L [,8(cD) - ,8(c~-l)lcD IDI·
DED~nt n=l DED~nt
Using the Cauchy-Schwarz inequality, extending the summation over all n E
{I, 2, ... , N} and D E 'Dh in the first right term of (16), and using the Young
inequality, we have
k N
L 6tn L (L 6tn L
1
~ "2€~ '"" n2 1
~ 6tn ~ (CD) IDI + 2€
2
Ilqllo,QT·
n=l DEDh
Combined Nonconforming/Mixed-hybrid Finite Element 295
L + CD L
k
u c{3 max (c'D)2IDI 6t n llchllt :::; uM;3(M)IDI + (18)
2 nE{1,2, ... ,N} D D
E h
-1
n-
considering also (14) and the fact that k was arbitrarily chosen. We now choose
c = ~~. When u of- 0, this already leads the assertion of the lemma. When
u = 0, it follows from (18) that c'D
= 0 for all DE V h and all n E {I, 2, ... , N},
since II . Ilxh is a norm on X~. Thus the assertion of the lemma is trivially
satisfied in this case. 0
Theorem 1. (Existence of the solution to the discrete problem) The
problem (6)-(8) has at least one solution.
The proof makes use of an induction argument. At each time level, Lemma 4
is employed. Consequently, on can use the (Brouwer) topological degree argu-
ment (see [5]).
Theorem 2. (Uniqueness of the solution to the discrete problem) The
solution to the problem (6)-(8) is unique.
The assertion follows from Assumption (Al) and (AS) or (A 6).
Theorem 3. (Discrete maximum principle) Under Assumption (Dl), the
solution of the problem (6)-(8) satisfies, for all DE V h and n E {I, 2, ... , N},
o:::;C'D:::; M. (19)
One sets a transmissibility lED E == lDJ'D E - IVD ElnD E' E E N(D). In view
of Assumption (Dl) and (10), ~ne has ED
E 2:: 0' for all D E V"nt, E E N(D),
and hence one can prove the assertion as i'n [6].
5 A priori estimates
Theorem 4. (A priori estimates) The solution of the scheme (6)-(8) sat-
isfies
C{3 max
nE{1,2, ... ,N} DEDh
L (cD)2IDI :::; Cae, (20)
max
nE{1,2, ... ,N} DEDh
L [;3(c'D)]2IDI :::; Cae, (21)
N
CD L 6t n llchllt :::; Cae (22)
n=1
296 R. Eymard et al.
The a priori estimates follow from (18) with E = ;~ and u = 1 and using
Assumption (Al) or Assumption (A 2).
(23)
Hence, to show the convergence, we can work with Ch,6t as in finite volume
methods.
The proof is an adaptation of the technique used in [8] for degenerate parabolic
equations discretized by a finite volume scheme. It uses the equation (8) and
the priori estimate (22).
VI;. E IRd.
6 Convergence
From Lemmas 5 and 6 and (20), the sequence Ch,6t verifies the assumptions of
Kolmogorov's theorem [3, Theorem IV.25 ], and thus Ch,6t converges strongly
in L2(QT) to some function u E L2(QT)' Moreover, due to Lemma 6, [6,
Theorem 3.10] gives that this u E L2(0, T; HJ(D». Finally, considering (23),
Ch,6t converges to the same u. D
The strong convergence of Theorem 5 permits to pass to the limit in the nonlin-
ear terms. For slightly more restrictive assumptions than (A), the uniqueness
follows from [9].
References
1. Angot, A., Dolejsl, v., Feistauer, M., Felcman, J. (1998): Analysis of a com-
bined barycentric finite volume-nonconforming finite element method for nonlin-
ear convection-diffusion problems. Appl. Math. Praha, 43, 263-310
2. Barrett, J. W., Knabner, P. (1997): Finite element approximation of transport
of reactive solutes in porous media. Part II: Error estimates for the equilibrium
adsorption processes. SIAM J. Numer. AnaL, 34, 455-479
3. Brezis, H. (1983): Analyse fonctionnelle, theorie et applications. Masson, Paris
4. Brezzi, F., Fortin, M. (1991): Mixed and Hybrid Finite Element Methods.
Springer-Verlag, New York
5. Deimling, K. (1985): Nonlinear Functional Analysis. Springer-Verlag, Berlin-
Heidelberg
6. Eymard, R., Gallouet, T., Herbin, R. (2000): The finite volume method. In: Cia-
rlet P.G., Lions J.L. (eds) Handbook of numerical analysis, vol. 7, 715-1022,
Elsevier Science Publishers B.V. (North-Holland), Amsterdam
7. Eymard, R., Gallouet, T., Herbin, R., Michel, M. (2002): Convergence of a finite
volume scheme for nonlinear degenerate parabolic equations. Numer. Math., 92,
41-82
8. Eymard, R., Gallouet, T., Hilhorst, D., Na"it Slimane, Y. (1998): Finite volumes
and nonlinear diffusion equations. Model. Math. Anal. Numer., 32 , 747-761
9. Knabner, P., Otto, F. (2000): Solute transport in porous media with equilib-
rium and nonequilibrium multiple-site adsorption: uniqueness of weak solutions.
Nonlinear Anal., 42 , 381-403
Discrete Maximum Principle for Galerkin
Finite Element Solutions to Parabolic Problems
on Rectangular Meshes
1 Introduction
au
at
-0: t a 2u
k=l ax~
=f in (0, T) x n, (1)
min{O; min u}
rt
+ tmin{O; min!}
Qt
::; u(t,x) ::; max{O; maxu}
rt
+ tmax{O; max!}
Qt
(3)
is valid for the solution u (see [7, Theorem 2.1] and [9, p. 79]). Here Qt with t E
[0, T] stands for the cylinder (0, t) x Q and Tt is the union of its lateral surface
and its bottom. Formula (3) is called the continuous maximum principle.
To solve problem (1)-(2) numerically, we use certain discretizations, both
in spatial and in time coordinates. It is obvious that the validity of the discrete
analogue of the maximum principle, the so-called discrete maximum principle
(DMP), is a natural requirement for having an adequate numerical solution.
The topic of validity of various discrete maximum principles arose already
30 years ago. Thus, in the works [3] and [4], DMP was formulated and proved
for the finite difference and finite element approximations, respectively, for the
second order linear elliptic equations. In particular, in [4], DMP was proved
in 2D case for the continuous piecewise linear finite element approximations
under the following geometrical conditions: the angles of triangles in the used
triangular meshes are not greater than 7r /2 (nonobtuse type condition), or less
than 7r /2 (acute type condition). More results on DMP for the elliptic problems
can be found in [6] and [8].
In [5], the validity of the DMP for the linear parabolic problem is analyzed:
the finite element discretization was performed with linear elements on trian-
gular (simplicial) meshes and the so-called B-method was applied to the time
discretization. The discrete analogue of (3) and sufficient conditions guarantee-
ing its validity were obtained, where one of such conditions was the acuteness
of the triangulations used.
To the authors' knowledge, there is no similar result on the validity of
DMP for parabolic problems solved with the help of bilinear finite elements
in space. As far as the validity of DMP on rectangular meshes is concerned,
we mention the only work [2] in this respect, where the authors considered
the simplest elliptic problem and showed that the corresponding DMP may
not hold if the rectangular elements are chosen arbitrarily. They also derived
sufficient conditions for the validity of DMP.
In our paper, we give sufficient conditions for the validity of the discrete
maximum principle for the Galerkin finite element solutions based on the bi-
linear elements on rectangular meshes for parabolic problems. Similarly to
the case of triangular meshes, we obtain certain geometrical condition on the
shape of elements. Namely, we introduce the notion of non-narrow rectangular
element, which represents an analogue of nonobtuse triangular element for the
case of triangular meshes.
300 1. Farago et al.
2 Discretization
We denote the space of all possible linear combinations of the basis func-
tions by Vh, and define its subspace Vrf = {v E Vh I vlan = O}. Based on
the usual weak formulation of the original problem, the semi discrete form for
(1)-(2) reads: Find a function Uh = Uh(t, x) such that
J at
OUh Vh dx + B(Uh, Vh) = J fVh dx, \lvh E VOh, t E (0, T), (7)
n n
where B(Uh, Vh) = a In grad Uh . grad Vh dx. In the above, u~(x) and gh(t, x)
(for any fixed t) are linear interpolants in vh, i.e.,
N Na
u~(x) = L uO(Pi)<Pi(X) and gh(t,X) = Lg(t,PN+i)<PN+i(X).
i=1 i=1
We notice that from the consistency of the initial and the boundary conditions
(g(O, s) = uo(s), s E aD), we observe that g(O, PN+i) = UO(PN+i), i =
l, ... ,Na.
We search for the semi discrete solution in the form
N
Uh(t,X) = L U?(t)<Pi(X) + gh(t,X),
i=1
and notice that it is sufficient that Uh satisfies (7) for Vh = <Pi, i = 1, ... ,N,
only.
Discrete Maximum Principle for Parabolic Problems 301
we arrive at the Cauchy problem for the system of ordinary differential equa-
tions
dv h
Mdt +Kvh = f, (8)
vh(O) = [uo(P1 ), ... , UO(PN), g(O, PN+l),"" g(O, PN+Na)]T
for the solution of the semi discrete problem, where
Mij =
n
J cPjcPi dx, fi = J
n
fcPi dx. (10)
The above defined matrices M and K are called mass and stiffness matrices,
respectively.
In order to get a fully discrete numerical scheme, we choose a time-step
Llt and denote the approximation to v h (nLlt) and f (nLlt) by v n and fn, n =
0,1, ... , nT (nTLlt = T), respectively. To discretize (9), we apply the B-method
(B E [0, 1] is a given parameter) and obtain the system of linear algebraic
equations
(12)
To calculate the entries of the mass matrix M, we first calculate the integral of
the product cPjcPi only on a single rectangle R (denoted by MijIR)' Obviously,
302 I. Farago et al.
(14)
where aR and bR denote the length of the edges of the rectangle R. A rectan-
gular mesh Rh is called non-narrow if p, ::::; O. It is called strictly non-narrow
if p, < O. Hence, the non-narrowness of a mesh means that the longest edge
of each rectangle is not greater than v'2 times the shortest one. The non-
narrowness of the mesh will imply the non positivity of the off-diagonal ele-
ments of the stiffness matrix (see [1], page 254).
Discrete Maximum Principle for Parabolic Problems 303
for n = 0, ... , nT - l.
The discrete analogue, the so-called discrete maximum principle (DMP),
for the continuous maximum principle (3) can be written in the form (cf. [5,
p. 100])
. {O , gmin
mIn n+1 , Vmin
n } + Atf(n,n+l)
LI min _< Un+1
i
<
_ max
{O , gmax'
n+1 Vn
max } + Atf(n,n+1)
LI max ,
(15)
i = 1, ... ,N; n = 0, ... , nT - l.
Let us introduce the denotations
(n,n+l)
f max _ f(n,n+l)
- max e E]RN , f(n,n+1)
0
- f(n,n+1)
- max eo E ]RN ,
f a(n,n+l) -_ f(n,n+l) E ]RNa
max ea ,
n n E ]RN
V max == Vrnax e ,
For simplicity, we denote zero matrices and zero vectors by the symbol 0,
whose size is always chosen according to the context. The ordering relation is
meant elementwise.
Before proving the sufficient condition of the DMP, in the next auxiliary
lemmas, some important properties of the matrices M, K, A and B are sum-
marized.
Lemma 1. Let the rectangular mesh Rh for [2 be of non-narrow type (fL ::::; 0).
Then Kij ::::; 0 (i f j! i = 1, ... , N! j = 1, ... , N).
PROOF. We denote SUPPcPi n sUPPcPj by Sij and calculate Kij (i f j):
Lemma 2. Let the rectangular mesh Rh for fl be of non-narrow type (f-L ::; 0)
and condition
Bii = Mii - (1 - B)LJ.tKii ~ 0, i = 1, ... , N, (16)
be satisfied. Then B ~ O.
PROOF. The matrix M is nonnegative, because the basis functions are
nonnegative. Moreover, the previous lemma guarantees the nonpositivity of
Kij (i i- j) that implies that the off-diagonal entries of B are nonnegative.
The nonnegativity of the diagonal entries of B follows from the condition (16) .
•
Lemma 3. The relations
(PI) Ke = 0,
are valid.
PROOF. (PI) For the i-th coordinate of the vector Ke, we have
N
= f(n,n+1l
max
" M··
'L.-t = (Mf(n,n+1l)
max.
= ((M + BLJ.tK)f(n,n+1l)
max.
= (Af(n,n+1l)
max ..
j=1
~J
, "
In the above, we used the facts that the basis functions are nonnegative, their
sum equals to the constant one function, and property (PI) .•
Lemma 4. If
Aij=Mij+BLJ.tKij::;O, ii-j, i=I, ... ,N, j=I, ... ,N, (17)
then AD1 ~ O. Furthermore, the relations
-AD1 Aa ~ 0, -AD1 Aa ea ::; eo (18)
are valid.
Discrete Maximum Principle for Parabolic Problems 305
PROOF. Using (13), property (P2), the relation BV;:'ax = AV;:'ax (it follows
from (PI)) and Lemma 2, we have
Obviously,
Writing (21) for the i-th component, and expressing u~+1, we obtain the
right-hand side inequality in (15). The left-hand side inequality in (15) can be
proved in a similar manner. •
The previous theorem does not say anything about the choice of the rect-
angulation and the choice of the parameters Band L1t in order to guarantee
the DMP. The validity of the DMP can be checked only after the direct calcu-
lation of the elements of the matrices A and B by testing the two inequalities
in the above theorem. The next theorem can guarantee the DMP a priori.
306 I. Farag6 et al.
Theorem 2. The finite element solution of (1)-(2), using bilinear basis func-
tions on a strictly non-narrow rectangular mesh Rh of a rectangular domain
D, satisfies the discrete maximum principle (15) if the conditions
(22)
and
2
Llt< 'Y (23)
- 3 (1- e)a'
are fulfilled, where
4 Final comments
In this paper a priori sufficient conditions for the validity of the discrete max-
imum principle have been given for the Galerkin finite element methods based
on bilinear finite elements in space. We close the paper with some remarks
regarding our results.
h2
Llt> - (24)
- 3ea
and
h2
Llt< (25)
- 6(1- e)a
This shows that the time step can be chosen only for the values e 2:: 2/3,
i.e., the Crank-Nicolson scheme is not included.
- The results of Theorem 1 are similar to Theorem 1 in [5J. The only difference
is the application of the condition of the strict non-narrowness instead of
the acute type condition.
Discrete Maximum Principle for Parabolic Problems 307
References
Alexander I. Fedotov
Summary. In the papers [1] - [4] the quadrature-differences methods for the vari-
ous classes of the I-dimensional periodic singular integro-differential equations with
Hilbert kernels were justified. The convergence of the methods was proved and er-
ror estimates were obtained. Here we propose and justify the cubature-differences
method for 2-dimensional 1 linear periodic singular integra-differential equations.
Such equations appear in the theory of elastity (see [5]) and in some problems of
diffraction of electromagnetic waves (see e.g. [6]) The convergence of the method is
proved and error estimate is obtained.
where
u(k) = (27r)-2 i u(T)ek(T)dT
For the following we'll asume that s > 1 providing (see e.g. [8]) the embedding
of HS into the space of continuous functions.
Consider the linear singular integro-differential equation
ABu+Tu= J, (1)
where A is a 2-dimensional singular integral operator
2 Calculation scheme
~n
( tk )-_ II sin((2nj + l)h - tkJ/2)
T,
j=1,2 ( 2nj + 1) sin (( Tj - tkj )/ 2)'
T= (T1,T2) E a, tk = (tk"tk 2) E an.
Then the integrals will take the form
3 Preliminaries
IE In
are Fourier-Lagrange coefficients ofthe function un(t) belonging to the grid d n .
The sets HS and H~ will be mapped onto each other by the operators
IITnPnu - PnTul18,n = 'T)nlluI1 8+2m with 'T)n ---t °for n ---t 00,
5) equation (1) has a unique solution u* E H8+2m for any right-hand side
f E H8.
Then for all n, beginning from some no, the system of equations (4) is
uniquely solvable and approximate solutions u~ converge to exact solution u*
of equation (1)
Ilu~ - Pnu*lls+2m,n ---t 0, n ---t 00.
If, in addition, u* E Hs+ 2m+2, then the error estimate
is valid.
Proof. Let's take an arbitrary constant r E R which is not an eigenvalue
of problem Bu + ru = 0, u E Hs+2m and substitute into equation (1)
v = Bu +ru, (5)
The existence of such a constant follows from the properties of the spectrum
of elliptic operators (see e.g. [7]). Then
where G is the inverse to Bu + ru and equation (1) will take the form
(8)
An = PnAPn, fn = Pnf,
(BnUn)(tk) = L ba f3(tk)(D!?+f3 un )(tk) ,
lal=If3I=m
and make the substitution
(9)
As it is shown in [10] equation (9) is uniquely solvable for all ll, beginning from
some lll, and for Vn = PnV solutions Un = Gnv n = GnPnv converge to the
solution u = Gv of equation (5). Here G n is inverse to operator Bnun + rUn
and
(10)
By substitution (9) we'll get equation
(11)
which is equivalent to equation (8). As before, the equivalence here means,
that solvability of one of equations yields solvability of the another and their
solutions are related by the relationships (9), (10).
The invertibility of the operators Kn : H~ ~ H~ we'll prove following [9].
To do this we have to establish the following:
a) IlPnfn - fils ~ 0 for II ~ 00;
b) the sequence of operators (Kn) approximates the operator K compactly;
c) K is invertible.
The validity of a) follows immediately from Lemma 1 2
(12)
2 Here and further C denotes generic real positive constants, independent fram n.
314 A.I. Fedotov
References
1. Fedotov, A. 1., On convergence of quadrature-differences method for linear sin-
gular integrodifferential equations, Zh. Vychisl. Mat. i. Mat. Fiz. 29 (1989), N9,
1301-1307 (in Russian).
2. Fedotov, A. 1., On convergence of quadrature-differences method for one class of
singular integro-differential equations, Izv. Vyssh. Uchebn. Zaved. Mat. (1989),
N8, 64-68 (in Russian).
3. Fedotov, A. 1., On convergence of quadrature-differences method for linear sin-
gular integrodifferential equations with discontinuous coefficients, Zh. Vychisl.
Mat. i. Mat. Fiz. 31 (1991), N2, 261-271 (in Russian).
4. Fedotov, A. 1., On convergence of quadrature-differences method for nonlinear
singular integrodifferential equations, Zh. Vychisl. Mat. i. Mat. Fiz. 31 (1991),
N5, 781-787 (in Russian).
5. Parton, V. Z., Perlin P. 1., Mathematical methods of the theory of elastity. Vol
1., "MIR", Moscow 1984.
6. Kas'anov V. 1., Suchov R. N. Galerkin's methods for a solution of a problem of
diffraction of electromagnetic waves on a dielectric wedge, Progress in Electro-
magnetic Research Symposium, July 5-14, 2000, Cambridge, MA, USA.
7. John, F., Partial differential equations, 4th ed., Springer-Verlag, New York 1981.
8. Taylor, M. E., Pseudodifferential operators, Princeton University Press, Prince-
ton 1981.
9. Vainikko, G. M., The compact approximation of the operators and the approxi-
mate solution of the equations, Tartu University Press, Tartu 1970 (in Russian).
10. Vainikko, G. M., Tamme, E. E., Convergence of the differences method in a
problem of periodic solutions of elliptical type equations, coefficients, Zh. Vychisl.
Mat. i. Mat. Fiz. 16 (1976), N3, 261-271 (in Russian).
Nonconforming Discretization Techniques for
Overlapping Domain Decompositions
lnst. for Appl. Analysis and Num. Simulation, Pfaffenwaldring 57, 70569 Stuttgart,
Germany, {fiemisch, mair, wohlmuth}@ians.uni-stuttgart.de. Supported in part by
DFG, SFB 404, C12.
Summary. For the numerical solution of coupled problems on two nested domains,
two meshes are used which are completely independent to each other. Especially in
the case of a moving subdomain, this leads to a great flexibility for employing dif-
ferent meshsizes, discretizations or model equations on the two domains. We present
a general setting for these problems in terms of saddle point formulations, and in-
vestigate one- and bi-directionally coupled applications.
1 Introduction
o r
discretizations and model equations on the two domains. Our approach is use-
ful especially for a moving subdomain, i.e., when w changes its position inside
the global domain. In this case, no remeshing will be necessary and only the
matrices responsible for the coupling have to be reassembled. In Section 2, we
start with the general variational setting in terms of a saddle point formula-
tion. A one-directionally coupled model problem is investigated in Section 3. In
Section 4, we consider bi-directionally coupled formulations on the examples
of a linear elasticity problem and an eddy current simulation.
Nonconforming Discretization Techniques 317
2 Variational Setting
The coupling between the two solution components is realized via the Lagrange
multiplier space M in terms of two continuous bilinear forms b1 (·,·) and b2 (·,·)
acting on V x M. For the applications in Section 3 and Subsection 4.2, M is the
dual of the trace space Hl/2(r), i.e., M = H- 1 / 2(r), whereas in Subsection
4.1 M = H-l/2(r)2, with r = ow indicating the subdomain boundary.
Solving additionally for the Lagrange multiplier p E M, the following gen-
eralized saddle point problem is derived: find (u,p) E V x M such that
where (-, ')v'xv and (-, ')M'xM denote the usual duality pairings. We point
out that for b1 (·,·) = b2 (·, .), problem (1) has the usual symmetric structure,
which is encountered for example in the framework of mixed [1] and mortar
[2] finite element methods. Moreover, if b1("q) acts only either on V,n or Vw,
one-directionally coupled problems are derived.
The bilinear forms bi (·,·) define coupling operators Bi : V ----7 M' and B; :
M ----7 V' by (Biv,q)M'xM = (v,B;q)vxv' = bi(v,q) for v E V and q E M.
The validation of the following coercivity- and inf-sup-conditions guarantees
the unique solvability of problem (1) in V x M / K er BT,
[5]:
a(wo, vo)
::lao> 0 : sup > ao, Wo E Ker B 2 , (2)
voEKerB, Ilwoliv Ilvoliv -
a(wo,vo)
sup > ao, Vo E Ker B 1. (3)
woEKerB 2 Ilwoliv Ilvoliv -
. f sup
::Iko > 0 : m bi (v, q) 2: k
0,
'~ = 1 , 2 . (4)
qEM vEV IlvllvllqIIM/KerB;
We note that the above conditions can be more relaxed [3, 11].
318 B. Flemisch et al.
The most delicate step for the quality of the discretization and the compu-
tational complexity is the information transfer between the two grids via the
discrete Lagrange multiplier space M h . In all our considered applications, we
essentially couple between the global grid on n
and the sub domain boundary
r. On r, dual Lagrange multipliers [13] are used to approximate M, which
have optimal stability and approximation properties. Moreover, they have lo-
cal support and satisfy a biorthogonality relation with the basis functions of
the trace space Vhjr . Therefore, the implementation of the corresponding op-
erators B1,h and B 2 ,h can be performed with low computational costs.
(7)
with the obvious meanings for a w(-'·) and (., ')w, and where M = H- 1 / 2 (r).
Introducing the Lagrange multiplier p = aa': ' we find that
b1(v,q) = (vw,q)M'xM for v = (vn,v w) E V = HJ([2) X H1(w), q E M. We
realize the continuity requirement along the boundary r in (7) by the bilinear
form b2(v, q) = (vn - Vw, q)M'xM, V E V, q E M and obtain the saddle point
formulation of (6), (7) given in (1) with 9 = 0.
It is obvious that problem (1) has a unique solution, since the problem
on the global domain [2 is not influenced by the problem on the sub domain
W, and its solution un yields the boundary data for a well posed sub domain
problem. Nevertheless, we provide a complete proof within the saddle point
setting.
Theorem 1. With the above definitions, problem (1) is uniquely solvable.
Proof. We first show the unique solvability of problem (1) by validating the
conditions (2)-(4). Our main tool is the harmonic extension operator 'H
M' ---> Vw , defined by
We observe that the trace of Vw onto r is the space M, and that h((O, v), q) =
b2((0, -v),q). Taking v = (0, ±'Hw) , w EM', condition (4) is a consequence
of the definition of the H-1/2-norm and of the fact that IIHwlh,w :s; CllwIIM"
Let us focus on condition (2). The kernels of the coupling operators are
Ker B1 = Vn x V~, and Ker B2 = {v E V : trvn = vwlr}, where tr : H1([2) --->
H1/2(r) denotes the trace operator. We uniquely decompose Vw E Vw into
VB+VI such that VB = 'H(vwlr) and VI E V~. For an arbitrary Wo = (wn, WB+
WI) E Ker B 2, we consider Vo = (w n, WI) E Ker B 1. By using the properties of
the harmonic extension, we get
{
UH in we = f? \ w,
UFE:= .
Uh mw.
Lemma 1 only provides a global estimate, which is not sufficient here, since we
like to disregard the approximate solution component UH on the sub domain
w. The necessary tools for a more local analysis can be found in [12], resulting
in the following estimate which is proved in [7].
Theorem 2. Let B :) we such that d = dist (aB \ of?, awe \ of?) > o. Then
for H small enough and U regular enough, there exists a constant C depending
on d such that
Numerical test. Consider the model problem (6) on f? := (0,1)2 with source
term f derived from the exact solution u(x, y) := exp( -100((x-0.6)2 /a 2 +(y_
0.5)2/b 2)). An elliptic patch with radii 0.25 and 0.15 is placed in the domain
f? with its center at (0.6,0.5), as illustrated in Figure 1. Since the solution
goes to zero with an exponential decay, we may have a coarser triangulation
far enough away from (0.6,0.5). Therefore, we choose an initial triangulation
Nonconforming Discretization Techniques 321
-.
7 10- 1 .......
---.....
M
~ ..............
.S ........
.... 10-2
........0 .....
Q) ......•...
10-3
103 104 105
degrees of freedom degrees of freedom
Fig. 2. Error decay in the HI-norm of PI-PI and P1-P2 coupling
a priori estimates. Choosing the same number of unknowns for the standard
and the overlapping method, the solution obtained by the PI-PI coupling is
significantly better than the solution obtained by the standard method. For
the P2-Pl coupling, the error decay is almost optimal with respect to the
piecewise quadratic finite elements used on Th . In agreement with (11), the
error behaves like c 1 h 2 + C2H, and, numerically, C2 « Cl.
-Divcr(u c ) = fc
cr(u c) nc = t
with U c = 0 on To and cr( u c) nc = g on Tl where To c an has a positive
measure and an = To u n, To n n = 0, with body forces fc E (£2(WC))2 and
322 B. Flemisch et al.
-DivO"(un) = in in [2,
-DivO"(uw) = iw in w,
un = U w onr,
[O"(un) nel = -O"(uw) ne +t on r,
with un = 0 on ro and O"(un) ne = 9 on r l , and where [wn] := wnlwc -
wnl w' iw = inlw and in E (L2([2))2 is an extension of ie to [2. It can be
written in its weak formulation as saddle point problem with the bilinear forms
an(un, vn) = In O"(un) : E(Vn), aw(uw, vw) = L O"(uw) : E(Vw), bl(v, q) = (v w+
vn, q)MxMI, b2(v,q) = (v w - vn, q)MxMI, and V = (HI([2))2 X (HI(w))2,
M = (H~! (r) ) 2. For the discretization, we use linear and quadratic finite
elements on quasi-uniform and shape regular triangulations. The conditions
(2) - (4) hold for h / H small enough in the discrete setting as well as in the
continuous setting, yielding unique solvability. For details, we refer to [9].
The realization of this approach allows for an easy shift of the hole without
having to remesh and can be used in shape optimization algorithms to deter-
mine an optimal hole position. Note that the quantity we pass back from the
hole to the background is the jump in the fluxes, i.e., in general the solution
un is only H2- E -regular, s > O. Another application of this complementary
3
coupling technique are time dependent problems where the hole is an object
moving through the domain [2 emanating some flux into we.
Numerical examples
Beam with one hole. We consider the problem domain we = (-5,5) x (0,1) \
{(x,Y) ER2 1
Fig. 3. Top: Problem setup and start grids. Bottom: 17 xx on nX w using comple-
mentary coupling; we show only the values on we.
Nonconforming Discretization Techniques 323
Rotating hole. The hole domain is a smooth star with five points described by
(x, y) = (mx, my) + (ri + L1r) cos(10n A) (cos( a(A, t)), sin( a(A, t))) with A, t E
[0,1] and a(A, t) = 2n(A - t - cos(10n(A + ~)/51)), and a center (mx, my) =
(1.1,0.65)), the medium radius ri = 0.2 and the radius change amplitude
L1r = 0.05. We solve the Poisson problem. The boundary segment described
by A E (0,0.1) carries non-zero natural data. The background region is [2 =
(0,3) x (0,1). The situation and the solution at t = 0,0.15,0.3,0.45,0.65,0.85
can be found in Figure 4. For each new position of the star, no remeshing
procedure has to be carried out, simply the existent grid has to be rotated.
Moreover, only the coupling matrix has to be reassembled, all other involved
matrices stay the same throughout the whole computation.
Fig. 4. Problem setup (top left): Zero natural b.c. in hatched areas, 0 and 1 Dirichlet
b.c. at the top left and bottom right side, respectively, influx of 5 (natural b.c.)
at the back side of the first wing. Initial background grid plus rotated star grid
at t = 0,0.15,0.3,0.45 (top right). Bottom: Solutions at different times t: t = 0
(complete), t = 0.15,0.3,0.45,0.65,0.85 (partly displayed).
with positive material parameters J-l and 0", and J-l constant in w. We assume
knowing a source vector potential Ts such that curlTs = J s in we, with
324 B. Flemisch et al.
aa(cP,v) -
ir
r (Tn)v = 1 j3Ts gradv,
we
v E Va = HJ(D), (14)
where a = .dt/(fLU), and fw contains the information from the preceding time
step.
This suggests the introduction of the Lagrange multiplier p = Tn E M =
H-l/2(r), and of the coupling bilinear form b((v,).., W), q) = ().. - v, q)M'xM
for (v,).., W) E V = Va x Vw and q E M. Setting g = 0 and b1 (·,·) = b2 (·,·) =
b(·,·), problem (1) is obtained. By choosing ().., W) = (0, gradv), v E HJ(w),
in (15), it is easy to see that the solenoidality of T is guaranteed provided
that fw is divergence free. The unique solvability of the statically condensated
formulation of problem (1) is proved in [8].
gradient as a linear combination of the basis functions for the edge element
space [4]. An optimal a priori estimate based on Lemma 1 for the finite element
solution (¢H, Th) of the statically condensated form of (5) is obtained in [8],
provided that the ratio hi H is small enough.
References
1. Babuska, I. (1973): The finite element method with Lagrangian multipliers. Nu-
mer. Math., 20, 179-192.
2. Ben Belgacem, F. (1999): The mortar finite element method with Lagrange mul-
tipliers. Numer. Math., 84, 173-197.
3. Bernardi, C., Canuto, C., Maday, Y. (1988): Generalized inf-sup conditions for
Chebyshev spectral approximation of the Stokes problem. SIAM J. Numer. Anal.,
25, 1237-1271.
4. Bossavit, A. (1998): Computational electromagnetism: variational formulations,
complementarity, edge elements. Academic Press, New York.
5. Brezzi, F., Fortin, M. (1991): Mixed and hybrid finite element methods. Springer,
New York.
6. Flemisch, B., Maday, Y., Rapetti, F., Wohlmuth, B.I. (2003): Coupling scalar and
vector potentials on nonmatching grids for eddy currents in a moving conductor.
To appear in J. Comput. Appl. Math.
7. Flemisch, B., Wohlmuth, B.I. (2003): A domain decomposition method on nested
domains and nonmatching grids. To appear in Numer. Methods Partial Differ.
Equations.
8. Maday, Y., Rapetti, F., Wohlmuth, B.I. (2003): Mortar element coupling be-
tween global scalar and local vector potentials to solve eddy current problems.
In: Brezzi, F. et al (eds) , Numerical Mathematics and Advanced Applications,
Proceedings of ENUMATH 2001, Springer, Berlin, 847-865.
9. Mair, M., Wohlmuth, B.I. (2003): A domain decomposition method for domains
with holes using a complementary decomposition. Report SFB 404 2003/38.
10. Nedelec, J.-C. (1980): Mixed finite elements in JR3. Numer. Math., 35, 315-341.
11. Nicolaides, R.A. (1982): Existence, uniqueness and approximation for generalized
saddle point problems. SIAM J. Numer. Anal., 19, 349-357.
12. Wahlbin, L.B. (1991): Local behavior in finite element methods. In: Ciarlet, P.G.,
Lions, J.L. (eds.), Handbook of Numerical Analysis, Vol. II, Elsevier Science
Publishers B.V., 1991.
13. Wohlmuth, B.I. (2000): A mortar finite element method using dual spaces for
the Lagrange multiplier. SIAM J. Numer. Anal., 38, 989-1012.
On the Use of Implicit Updates in Minimum
Curvature Multi-step Quasi-Newton Methods
1 Introduction
We consider two-step quasi-Newton methods for the unconstrained optimiza-
tion problem
(In equations (1) and (3), the step-vectors §'j and 'JLj are defined by
clef
§'j = ±j+1 - ±j , (4)
'JLj ~f Q(±j+1) - Q(±j) = QH1 - Qj , say, (5)
Implicit Updates in Minimum Curvature Methods 327
where {;r.j} are the successive iterates produced by the method.) A matrix
satisfying (1)/(2) can be constructed by appropriately modifying, for example,
the BFGS update formula, as follows:
B i+l -- B i -
Bir.ir.; Bi
T + 1J1.(Yd.;
-T- (6)
r.i Bir.i 1J1.i r.i
clef
= BFGS ( Bi, r.i' 1J1.i ) , say. (7)
The derivation of the condition (1) is described by Ford and Moghrabi [5,4].
In short, quadratic curves ;r.( 7) and g( 7) in Rn (where 7 E R) are constructed
which interpolate respectively, for the same set of values of 7, the three most
recent iterates ;r.i-1, ;r.i and ;r.i+l' and the three associated gradient evaluations
(assumed to be known). The derivatives of these two curves (at 7 = 72, where
72 is the value of 7 corresponding to ;r.i+l and :2.(;r.i+1) on the respective curves)
are then substituted into the relation
(8)
derived by applying the Chain Rule to the function g(;r.(7)). (In (8), primes
denote differentiation with respect to 7.) Of course, -
clef, '( )
1J1.i = :2. 72 (9)
will, in general, only be an approximation to the vector g' (;r.( 72)) required in
(8), whereas -
r.i clef
= ;r.'( 72 ) (10)
may be computed exactly. Nevertheless, on making these substitutions into
(8) and removing a common scaling factor, a relation of the form (1) for
Bi+1 ;:::: G(;r.i+1) is obtained.
The remainder of this paper is organised as follows: in Section 2, we review
the "minimum curvature" approach to determining a suitable set of parame-
ters {7j }J=o, while Section 3 describes the concept of implicit updates. Section
4 then develops the use of implicit updates within "minimum curvature" meth-
ods. Finally, we present the results of numerical experiments in Section 5, and
Section 6 draws conclusions on the basis of the results.
they then discussed choosing the parameter J to minimize the norm II ;]2"(7) 11M,
where
(12)
and where M is a given symmetric positive-definite matrix, thus producing
an interpolating curve ;]2(7) which is the "smoothest" in the sense of the norm
II . 11M. It was shown in [6] that fulfilling this criterion leads to the requirement
of solving the cubic polynomial
(13)
at each iteration, where
(14)
(15)
In [6], the properties ofthe polynomial1jJ were analyzed and it was shown that
its zeroes may be determined efficiently. In addition, the circumstances under
which 1jJ has three real zeroes (rather than only one) were identified and it was
shown (in that case) which zero should be selected to yield the lowest curvature.
It was observed that, in general, this approach was capable (depending on the
relative dispositions of the three iterates ;]2i-l, ;]2i and ;]2i+l) of producing all
three essentially different orderings of the iterates on the interpolating curve.
In their original paper [6] on these "minimum curvature" methods, the
authors reported the results of numerical experiments conducted with the
version of the algorithm (called A) obtained by making the straightforward
choice M = I, and showed that this yielded a substantial improvement in per-
formance, when compared with the standard BFGS method. In a subsequent
paper [7], they investigated the performance ofthe algorithm, called B, arising
from choosing M = Bi (this choice being motivated by the previous success
of other multi-step methods employing the same matrix), and showed that
a further improvement in performance was thereby obtained. In this paper, we
will pursue this avenue of investigation further by considering related choices
for the matrix M. In particular, we will consider the use of implicit updates -
that is, updated forms of the matrix Bi which are not calculated explicitly.
3 Implicit updates
Since the choice M = Bi produces an algorithm with good numerical perfor-
mance, a natural question to consider is whether related matrices might yield
further gains. An obvious line of enquiry to pursue in answering this question
is the use of updated versions of B i , where the update employs data from
the most recent iteration(s). Because this updated matrix (call it Hi for the
present) will be used to compute {7j }J=o and hence "ti, [ i and :lli.i' it cannot
Implicit Updates in Minimum Curvature Methods 329
be the matrix Bi+l which will be produced via equation (6). Thus, it appears
that use of such a matrix would necessitate carrying out a second update (re-
quiring O(n 2 ) operations) at each iteration. However, we observe that explicit
knowledge of the updated matrix is not our real goal- rather, we only need this
matrix to enable us to calculate ai, ai-l and /-li and hence (by solving the cubic
polynomial) 6, Ii, [i and YJ.i. Therefore, if it can be shown that the expressions
required in equations (14) and (15) may be computed at low cost without
explicit calculation of Hi, then we will have gained our objective of using an
updated matrix, while avoiding most of the computational expense. Because
explicit computation of Hi is avoided, methods which use this technique were
termed implicit methods in [3].
(16)
(17)
In order to avoid the explicit computation of these updated matrices (for the
reasons explained above), it is necessary to show (in the context of "minimum
curvature" methods) how the quantities ai, ai-l and /-li can be calculated with-
out explicit knowledge of the matrix. We consider this issue for each update,
in turn.
4.1 Update C
(19)
330 J.A. Ford, I.A. Moghrabi
(In deriving (19), we have assumed that J2i+1 is obtained by some form of
search along the 'quasi-Newton' direction
(20)
which implies that
Bi§.i = -tifl..i, (21)
for some positive scalar ti.) Thus we are able to derive the following expressions
for the quantities (Ji, (Ji-l and fJi:
(22)
(23)
(24)
4.2 Update D
In a similar manner, we can derive the following expressions for (Ji, (Ji-l and
fJi in the case when the implicit update (17) is applied:
(Ji-l -_ §.i-1Bi§.i-l
T
+ ti {(§.L19Y}
T + {(§.L1YY}
T
.
' (25)
§.i fl..i §.i '!f..i
(Ji = §.r'!f..i; (26)
4.3 Alternation
Alternation [8] involves the repeated application of a cycle consisting (in its
basic form) of two iterations, the first of which is a standard single-step BFGS
iteration and the second of which is the required two-step method. The conse-
quence of this arrangement is that each two-step iteration can be implemented
in the knowledge that the relation
(28)
holds (because of the preceding single-step iteration). This implies that
§.L1Bi§.i-l = §.Ll'!f..i_l' (29)
which means that §.L 1Bi§.i_l can be computed in O( n) operations.
Implicit Updates in Minimum Curvature Methods 331
4.4 Recurrence
In [9], the authors showed how the following recurrence for the efficient calcu-
lation of the quantity Ai ~f ,?L1 B i'?'i-1 could be derived:
where
5 Numerical experiments
The algorithms Calt, Crecur, Dalt and Drecur derived from the new implicit
updates were compared with each other, in our first set of experiments. All the
multi-step algorithms tested in these and the following experiments employed
the BFGS formula to update the inverse Hessian approximations Hi = B;l,
but with the usual vectors '?'i and 'JL i replaced by [ i and :lli.i:
dimension, with the original "minimum curvature" method A.) Although this
result may be somewhat surprising (it might have been expected, instead,
that an update employing the 'latest' data would be more successful still than
the method using M = B i , let alone the method A), we point out that it is
consistent with the results obtained by Ford and Tharmlikit [10] when using
a similar implicit update. On the basis of the results reported in Table 1, the
methods Dalt and Drecur will not be considered further here.
A second set of experiments (using the same test functions) was then con-
ducted, in order to compare the more successful new methods C alt and Crecur
with the existing "minimum curvature" methods A and Balt, and the new
version Brecur of B. These results are reported in Table 2. For comparison,
we have also included the results returned by the standard single-step BFGS
method. In this Table, the entries in each 'Ratios' row give the proportions
of evaluations and (in brackets) iterations for that method, expressed as per-
centages of the corresponding figures for the BFGS method. Finally, scores
(indicating 'best performances') are recorded again for this set of experiments,
but we point out that the results returned by the BFGS method were not
included in assessing these best performances, since our primary purpose is to
compare the "minimum curvature" methods. (The notation :I placed against
two of the results for the method A indicate that there was one failure for this
method [for a test problem in the Medium category], where it was unable to
converge to an acceptable minimum within the permitted limit of 5000 iter-
334 J.A. Ford, LA. Moghrabi
ations. The evaluations and iterations incurred on this failure have not been
included in the relevant totals.)
References
1. Conn, A.R., Gould, N.LM., Toint, Ph.L. (1986): Testing a class of methods for
solving minimization problems with simple bounds on the variables. Research
Report CS-86-45, University of Waterloo
2. Fletcher, R. (1987): Practical Methods of Optimization (2nd ed.). Wiley, New
York
3. Ford, J.A. (2001): Implicit updates in multistep quasi-Newton methods. Comput.
Math. App!., 42, 1083-1091
4. Ford, J.A., Moghrabi, LA. (1993): Alternative parameter choices for multi-step
quasi-Newton methods. Optimization Methods and Software, 2, 357-370
5. Ford, J.A., Moghrabi, LA. (1994): Multi-step quasi-Newton methods for opti-
mization. J. Comput. App!. Math., 50, 305-323
6. Ford, J.A., Moghrabi, LA. (1996): Minimum curvature multistep quasi-Newton
methods. Comput. Math. App!., 31,179-186
7. Ford, J.A., Moghrabi, LA. (1997): Further development of minimum curvature
multi-step quasi-Newton methods. In: Bainov, D. (ed) Proceedings of the Seventh
International Colloquium on Differential Equations. VSP Utrecht Tokyo
8. Ford, J.A., Moghrabi, LA. (1997): Alternating multi-step quasi-Newton methods
for unconstrained optimization. J. Comput. App!. Math., 82, 105-116.
Implicit Updates in Minimum Curvature Methods 335
9. Ford, J.A., Moghrabi, LA. (2002): On the Use of Alternation and Recurrences
in Two-Step Quasi-Newton Methods. Submitted to Comput. Math. Appl.
10. Ford, J.A., Tharmlikit, S. (2003): New implicit methods in multi-step quasi-
Newton methods for unconstrained optimisation. J. Comput. Appl. Math., 152,
133-146
11. More, J.J., Garbow, B.S., Hillstrom, K.E. (1981): Testing unconstrained opti-
mization software. ACM Trans. Math. Software, 7, 17-41
12. Toint, Ph.L. (1987): On large scale nonlinear least squares calculations. SIAM
J. Sci. Stat. Comput., 8,416-435
A Boundary Movement Identification Method
for a Parabolic Partial Differential Equation
Tom P. Fredman l
1 Introduction
A common feature of inverse problems is their objective of determining the
"cause from the effects". A consequence of this is the ill-posed mathemat-
ical nature [1] of such formulations. This means that they do not satisfy
Hadamard's definition of well-posedness: (i) For all admissible data, a solu-
tion exists. (ii) For all admissible data, the solution is unique. (iii) The solu-
tion depends continuously on the data. Practical consequences of ill-posedness
are strong error growth in the computation and the fact that approximation
errors as well as measurement noise cause blow up of results. Hence, special
regularization techniques are necessary for stabilizing the computations.
Here, estimation of boundary position from real process data is considered
based on a regularized, slowly divergent space marching [6] method. Possible
approaches for this have been adopted for the sideways heat equation [2], in-
cluding sequential and entire time-domain computation by output-norm mini-
mization as well as direct methods. An entire time-domain direct computation
method will be demonstrated by application to simulated input data contam-
inated by random noise. If necessary, nonlinearities through variation (tem-
perature, time or geometry) of material properties and mixed (Robin type)
boundary conditions are readily included. The integrated regularization and
straightforward space marching algorithm makes the method especially suited
to industrial applications. To our knowledge, the proposed approach has not
previously been used to tackle dynamical boundary identification problems,
although the main component in the calculation - the sideways diffusion equa-
tion - has been extensively investigated.
Boundary identification for a parabolic PDE 337
2 Hyperbolic regularization
(Kf)(t) =
8
(:;;.
2y1r
it
0
(
f(T) [ _8 2 ]
t-T )3/2 exp 4t-T
( ) dT. (5)
3 Boundary identification
1 EJ2v(y, t) av(y, t)
s2(t) ay2 = at ,(y,t) E]O,l[x]O,tf[· (6)
3.2 Preliminaries
Consider now the situation depicted in the (x, t)-plane in Fig. 1 for (1) having
a moving boundary s(t) > 0 and a specified solution function on this boundary,
i.e., u(s(t), t) = us(t). It is presumed that this function is such that there exists
an intermediate value solution s(t), i.e.,
t = tf - - - - - -,
continuation
u",(t)
S:
I
s(t)
q", (t)
o x
Fig. 1. The relation between domains for the classical sideways diffusion problem
and the boundary identification problem for a finite time horizon
4 Computational method
4.1 Cauchy Problem
[u ]
Ux x
= [at +0I
8 28 2
8t'X 0
I] [u ]
Ux
• (10)
(11)
Here, the initial and endpoint conditions (3) will be incorporated into the
differential operator in the matrix of (10).
8
( at + I 2 8t'X
8 2 )
u ( x, t) >:::: L1 t u(x, t), (12)
(13)
can be used for integration of (10) to obtain the discrete solution profile
U(x) = [u(x,O) u(x, L1t) ... u(x, N L1t)]T , 0:::; x :::; s(t), (14)
Direct numerical integration yields solution trajectories (functions of time in
vector form) at positions 0 < x :::; s(t). When an element of the solution vec-
tor reaches the boundary condition u(s(t), t) = us(t), this element is excluded
and the time interval split into two subintervals for which the computation
is repeated. To obtain the boundary position at the element, record the in-
tegration distance. Depending on the shape of the initial-value vector (13),
a small number of such interval divisions are necessary to satisfy the bound-
ary condition for most of the nodes. For each subinterval, starting point and
endpoint conditions are not known (except for the first interval starting point,
specified by the initial condition at t = 0). Therefore, it is reasonable to as-
sume that the solution extends smoothly across these points and to introduce
a corresponding linear extrapolation [10] into the difference scheme (12).
105r;=======,-.----,---.,.......--'""l
- forward
forward hyperbolic
. - .- central
104 - - analytic
'.
Fig. 2. Amplification factors for the analytical case, two-point central (S3 in [6])
and four-point forward difference approximations ((17) for I = 0) and the forward
difference hyperbolic regularization in (17)
Introducing the Fourier image G(.1x, B), in terms of the normalized frequency
B, of C(.1x, .1t), one has the relation
(16)
where II· I denotes the L2 norm in the time variable, I· IZ2 the Euclidean norm
and P is the maximum number of spatial steps required to reach the desired
boundary. The matrix G(B) is 2 x 2 and, e.g., a four-step forward difference
approximation to ft
combined with a central difference approximation to 22 tt
in (12) yields
1 .1x]
G( B) = [ Llx
4Llt
(e4ill _ 1) + ",2, (Llt)2
Llx (e ill _ 2 + e- ill ) 0 . (17)
For comparison, the corresponding analytical solution operator for one space
step .1x is a convolution kernel with the Fourier image eLlxVili/ Llt. The mag-
nitude of this is g(.1x, B) = e Llx VIIiI/2Llt to which IG(.1x, B)112 is a discrete
approximation. Thus, marching the input data P spatial steps to reach the
desired boundary, amplifies the noise by a factor g(.1x, B)P. Alternative dis-
cretizations (12) are treated extensively by Carasso [6]' including the cases
with "y = 0 and central difference as well as the forward difference approxima-
tion of tt in (17). For the sake of illustration, an example computation with
Boundary identification for a parabolic PDE 343
0.4,---~--,---~--~----,
vf"-'
0.35
j
OJ
0.25 1:.;'."'\·" ,/
.,
t ,
" • I
J:'"
.,
0.15 ./ .. .-
0.1 , ,
0.05 \ "_,'
00':---0::'::.2,------:'0.':-4-,----,0:'-:.6---:'0.::-8- - - - '
Fig. 3. (Left) Diffusion profile and flux measured at the origin (solid) together with
their theoretical trajectories (dashed), for increasing and oscillating thickness tra-
jectory. (Right) Corresponding identified (solid) and theoretical (dashed) thickness
trajectories
5 Example simulation
A simulation was carried out in order to test the ability of the outlined
computation method to identify different types of smooth thickness varia-
tion. To generate corresponding simulated measurements at x = 0, the di-
rect parabolic problem ((1) with I = 0) was solved with the theoretical tra-
jectories for s(t) and us(t) as boundary conditions. After superposition with
a noise (normally distributed, relative amplitude 0.1 %) signal the full curves
um(t) and qm(t), depicted in Fig. 3, were obtained. Subsequently, the sys-
tem (10, 13) was solved with the "measurement signals" as initial conditions
using a simple explicit Euler-method for space marching with Llx = 10- 3 ,
Llt = 7.9.10- 3 and I = 0.01. Hence, the applicable Courant-Friedrichs-Lewy
condition I Llx / Llt ::::; 1 is fulfilled. Identification of the boundary trajectory
from the direct integration as described above, yielded the results on the right
hand of Fig. 3.
The results confirm the feasibility of the used method for situations with
noisy measurement data. As seen in the figures, the solutions are stable and
consistent with the theoretical (dashed) curves. It should be noted that there
344 T.P. Fredman
is a natural time lag involved before the variation induced by boundary move-
ment reaches the sensor location at x = O. In the direct integration method, the
entire time domain is treated simultaneously without account of this causality
aspect. Causality of the problem would require measurement data for the in-
terval (1,1.1] for the solution of the final time interval (0.9,1.0] to be entirely
trustworthy, despite the extrapolation made for the initial and final solution
nodes (cf. the discussion in Elden [8]). Thus, if heads and tails of the function
trajectories are disregarded the results are encouraging.
6 Conclusions
References
l. Alifanov, O. M. (1994): Inverse Heat Transfer Problems. International Series in
Heat and Mass Transfer, Springer-Verlag, Berlin
2. Berntsson, F. (2001): Numerical methods for solving a non-characteristic Cauchy
problem for a parabolic equation. Technical Report LiTH-MAT-R-2001-17, De-
partment of Mathematics, Linkoping University, Linkoping, Sweden
3. Berntsson, F., Elden, L., Loyd, D., Garcia-Padron, R. (1997): A comparison of
three numerical methods for an inverse heat conduction problem and an indus-
trial application. In: Proc. Tenth Int. Conf. for Numerical Methdods in Thermal
Problems, volume X, Institute for Numerical Methods in Engineering, University
of Wales Swansea, Pineridge Press, Swansea, UK
4. Cannon, J. R., Rundell, W. (1991): Recovering a time dependent coefficient in a
parabolic differential equation. Journal of Mathematical Analysis and Applica-
tions, 572-582
5. Carasso, A. (1982): Determining surface temperatures from interior observations.
SIAM J. App!. Math., 42, 558-574
6. Carasso, A. S. (1992): Space marching difference schemes in the nonlinear inverse
heat conduction problem. Inverse Problems, 8, 25-43
7. Elden, L. (1988): Hyperbolic approximations for a Cauchy problem for the heat
equation. Inverse Problems, 4, 59-70
8. Elden, L. (1995): Numerical solution of the sideways heat equation. In: Engl,
H. W., Rundell, W. (eds.) Proc. GAMM-SIAM Symp., Gesellschaft fijr Ange-
wandte Mathematik und Mechanik, GAMM-SIAM, Regensburg, Germany
9. Elden, L. (1995): Numerical solution of the sideways heat equation by difference
approximation in time. Inverse Problems, 11, 913-923
10. Elden, L. (1997): Solving an inverse heat conduction problem by a "method of
lines". Transactions of the ASME, 119, 406-412
Boundary identification for a parabolic PDE 345
11. Elden, L., Berntsson, F., Reginska, T. (2000): Wavelet and Fourier methods for
solving the sideways heat equation. SIAM J. Sci. Comput., 21, 2187-2205
12. Jones, Jr, B. F. (1963): Various methods for finding unknown coefficients in
parabolic differential equations. Communications on Pure and Applied Mathe-
matics, XVI, 33-44
13. Kato, T. (1984): Perturbation Theory for Linear Operators, 2nd edition. A Series
of Comprehensive Studies in Mathematics, Springer-Verlag, Berlin
14. Mejia, C. E., Murio, D. A. (1993): Mollified hyperbolic method for coefficient
identification problems. Computers Math. Aplic., 26, 1-12
15. Mejia, C. E., Murio, D. A. (1996): Numerical solution of generalized IHCP by
discrete mollification. Computers Math. Aplic., 32, 33-50
16. Morse, P. M., Feshbach, H. (1953): Methods of Theoretical Physics, volume I of
International Series in Pure and Applied Physics. McGraw-Hill Book Company,
Inc., New York
17. Richtmyer, R. D., Morton, K. W. (1967): Difference Methods for Initial-Value
Problems, 2nd edition. Tracts in Pure and Applied Mathematics, Wiley Inter-
science, New York
18. Seidman, T. I., Elden, L. (1990): An 'optimal filtering' method for the sideways
heat equation. Inverse Problems, 6, 681-696
19. Sobolev, S. L. (1989): Partial Differential Equations of Mathematical Physics.
Dover Publications Inc., New York
20. Taler, J., Duda, P. (2001): Solution of non-linear inverse heat conduction prob-
lems using the method of lines. Heat and Mass Transfer, 37, 147-155
21. Tikhonov, A. N., Samarskii, A. A. (1990): Equations of Mathematical Physics.
Dover Publications Inc., New York
22. Weber, C. F. (1981): Analysis and solution of the ill-posed inverse heat conduc-
tion problem. Int. J. Heat Mass Transfer, 24,1783-1792
On Computational Properties of a Posteriori
Error Estimates Based upon the Method of
Duality Error Majorants
1 Introduction
For several decades the attention of a number of authors has been focused
on questions of reliability and efficiency of calculations in computational en-
gineering. These questions are closely related to the progress in the theory of
a posteriori error control. On the one hand, it is necessary to have guaran-
teed upper bounds on the errors computed in a suitable norm. On the other
hand, it is very desirable to also have qualitative indication of their local be-
havior. These efforts are aimed to decrease computational costs while ensuring
accurate and reliable modelling of physical phenomena.
Nowadays, in the framework of Finite Element Methods several approaches
to error control are used widely. The first of these was formulated at the end of
70th in the works of Babuska and Rheinboldt (see [2,3]). Further investigations
of this subject were pursued by a number of authors and the amount of the
corresponding literature is very large. The most complete description of the
methods and the associated literature are given, for instance, in [1, 4, 11].
The main idea of the method which we investigate in this paper was intro-
duced in [8, 9]. An important property of this method (which makes it different
Computational Properties of the Duality Error Majorants 347
from other approaches) is that the Duality Error Majorants (DEM) allow us to
estimate the accuracy of any conforming approximate solution independently
of the type of approximation (i.e., majorants are also suitable for methods other
than FEM). The main advantage of this technique is calculation to guaran-
teed upper bounds of the energy norm of the error. In principle, bounds can
be computed as accurately as required (subject to the implied computational
effort). This fact has been mathematically justified in [6] and [10]. As has been
shown in [10]' the method also provides local indication of the error. Both
results were confirmed by a considerable amount of numerical testing.
In view of the above-mentioned properties, it is natural to expect that
a combination of the DEM error estimator with standard packages will lead
to highly effective numerical procedures. This expectation is confirmed by the
tests performed. In this paper, we use the MATLAB PDE Toolbox for mesh
generation and adaptive refinements. Besides, we compare the majorant with
the standard error indicator of the toolbox.
In the very last example, we also present the results of numerical experi-
ments for the biharmonic problem.
In this section, we take the classical diffusion problem with a Dirichlet type
boundary condition as a basis of our investigations (we state it in the varia-
tional form and call it the primal problem).
Problem P. Find u E V such that
a
where V := {v EHl(D) I v = W + Ua, wE Va} and Va :=H1(D).
It is assumed that D is a bounded connected domain in ]Rn with a Lipschitz
continuous boundary aD, f E ][}(D), A E M~xn, and there exist positive
constants aI, a2 such that
I I e111 2:= J
n
AVe· Vedx::; J
n
(AVv - q*) . (Vv - A-lq*)dx ,
I I e1112::; M(v, (3, y*) := MD(v, (3, y*) + M R ({3, y*) , (3)
as k -HXJ.
The proof of this theorem is based on the relations (2) and the above-mentioned
property.
It is also important to emphasize the following result concerning the conver-
gence of the optimal sequence {yk}t~ to p* in Q~ (see [6]).
Theorem 2. If the sequence of pairs ({3k,Y'k) that minimizes M(v,{3,y*) on
lR+ x Q~k is such that {3k ---+ 0, then
and
The theorems state properties of the duality error majorant as a global esti-
mator.
Let us denote by c(x) and JLk(X) the integrands of the error and majo-
rant M( v, {3k, y'k), respectively:
o
where Wo :=H2(fl). As in Section 2, we assume that I E L2(fl), the tensor
B = {bijsz} possess the symmetry property b(ij)(sl) = b(sl)(ij) for i, j, s, I =
1, ... , n, and there exist positive constants aI, a2 such that
a 1 IxI 2 <Bx:x<a
_ _ 2 Ix l2 \;/xEMnxn
s , (4)
Problem P*. Find m* E N; such that
I I e111 2 := 1
n
BVV e: VVedx ::; M(v, (3, x*) := MD(v, (3, x*) + M R ({3, x*) , (6)
and al is as in (4).
In general terms, the techniques used for deriving estimates (3) and (6)
are rather close. As in Sect. 2, we consider a sequence {NWk}t~ of finite
w
dimensional subspaces of N and introduce the corresponding functionals
as k -+ 00 .
Proof. By the limit density property, for the exact solution m* of the dual
problem and any given 8 > 0 we can find k8 such that, for k ;:::: k8, there exists
an element m k E NWk satisfying the inequality
Given the relations (5), we consider parts of the majorant M(v, 8, mk):
n n
Taking into account the multipliers (1 + 8) and (1 + 1/8) c&nlal, we arrive
at the estimate I I e111 2::; Mk(v) : ;111 eIII2 +8 C. Therefore,
Theorem 5. If the sequence of pairs ((3k, xk) that minimizes M(v, (3, x*) on
IR+ x NWk is such that (3k ~ 0, then
and
4 Numerical Results
(TJ~)2:= !
T
A(Vu - Vv) . (Vu - Vv)dx .
(TJ~)2 := (1 + (3) !
T
(AVv - Yk) . (Vv - A-1Yk)dx +
+(I+I/(3)~n/al !(V'Yk+j)2 dx ,
T
In general, the numerical tests performed can be classified into two groups.
For those in the first group, it was observed that the standard error indicator
provides adaptively refined meshes of suitable quality. For this group, the DEM
is also preferable but its advantage is not very considerable. However, for the
second group of examples, the distinction is rather more obvious. Below, we
present such a case.
Example. Let us consider the classical problem
=f in n,
{ -V·(AVu)
u=o on 8n ,
where the values of A and f are given in Table 1 (the sub domains of n are
depicted in Fig. 1). From Fig. 2, we conclude that the local errors computed
by the DEM reproduce the actual distribution on the initial mesh with high
accuracy. We also observe a disadvantage of the standard approach - overes-
Fig. 1. Domain n
A [~O~] [~ ~] [~O ~]
flO 1
it is natural to predict that the DEM will lead to a more effective adaptation
than the one obtained by the indicator 'rJ~.
,
1,/ f".-/ f".-/ 1,,/
1/ 1/ 1/
"
, l"'- I"'- I"'-
1/ 1/ 1/"-
l"'- I"'- l"'-
V, V, v, v,
<
, l"'- I"'- I"'-
, V, v, v, v,
. ,
l"'-
V
l"'-
I"'- l"'- I"'-
1/ 1/ V
I"'- l"'-
V
I"'- l"'- I"'- 1",/1,/
1/
V 1/ 1/ V 1/ V
< l"'- I"'- f".-/ 1,,/ f".-/ f".-/
1/ 1/ 1/ 1/ V 1/
,
1'/1'/1,,/ f".-/ 1,,/ f".-/ 1,,/ f".-/
, 1/ 1/ 1/ 1/ 1/ 1/
Fig. 2. Indication on the initial mesh: (E) exact error distribution; (D) error indi-
cation by the DEM; (S) error indication by the standard indicator
(E13) (015)
(016) (511)
In the very last example, we show that the DEM that stems from (6) also
leads to error estimates of high quality.
Example. Let us consider the biharmonic problem with homogeneous boundary
conditions:
in 0,
on 80,
where 0 is the unit square. In this example, we aim to show that the DEM
provides effective error control not only in the framework of Finite Element
Methods but more widely. For this purpose, we choose the exact solution of
the following form: u = ¢(Xl) ¢(X2), where ¢(x) = (1 - x)2 x 2. An approxi-
mate solution is taken in the form v = u + CW, where W = 'Ij;(Xl' kt) 'Ij;(X2' k 2)
and 'Ij;(x, k) = e10xk ¢(x). We select such a value of the constant C that the
accuracy of v is about of 5%. An approximation of the dual variable )0(* is also
constructed as a combination of global basis functions (the total number of
which is denoted by Nb). The effectivity of the corresponding error bounds for
the various cases is presented in Table 3. From these results, we conclude that
a combination of 196 basis functions is quite enough to provide high-quality
estimates for the various values of the parameters kl and k 2 .
5 Conclusions
Acknowledgement
The research was partially supported by grant N201728 of the Academy of
Sciences of Finland.
Computational Properties of the Duality Error Majorants 357
References
1. Ainsworth, M., aden, J.T. (2000): A posteriori error estimation in finite element
analysis. Wiley, New York
2. Babuska, I., Rheinboldt, W.C. (1978): A-posteriori error estimates for the finite
element method. Int. J. Numer. Methods Eng., 12, 1597-1615
3. Babuska, I., Rheinboldt, W.C. (1978): Error estimates for adaptive finite ele-
ment computations. SIAM J. Numer. Anal., 15, no. 4, 736-754
4. Babuska, I., Strouboulis, T. (2001): The finite element method and its reliabil-
ity. Numerical Mathematics and Scientific Computation, The Clarendon Press,
Oxford University Press, New York
5. Ekeland, I., Temam, R. (1976): Convex analysis and variational problems. Stud-
ies in Mathematics and its Applications, Vol. 1., North-Holland Publishing Co.,
Amsterdam-Oxford, American Elsevier Publishing Co., Inc., New York
6. Frolov, M., Neittaanmiiki, P., Repin, S. (2003): On the reliability, effectivity
and robustness of a posteriori error estimation methods. In: Kuznetsov, Y.,
N eittaanmiiki, P., Pironneau, O. (eds.) Numerical Methods for Scientific Com-
puting. Variational problems and applications. CIMNE Barcelona (in press)
7. Neittaanmiiki, P., Repin, S.l. (2001): A posteriori error estimates for boundary-
value problems related to the biharmonic operator. East-West J. Numer. Math.,
9, no. 2, 157-178.
8. Repin, S.1. (1997): A posteriori error estimation for nonlinear variational prob-
lems by duality theory. Zap. Nauchn. Semin. POMI, 243, 201-214 ((2000):
english translation in J. Math. Sci., New York, 99, no. 1, 927-935)
9. Repin, S.l. (2000): A posteriori error estimation for variational problems with
uniformly convex functionals. Math. Comp., 69, no. 230, 481-500
10. Repin, S.I, Sauter, S., Smolianski, A. (2003): A posteriori error estimation for the
Dirichlet problem with account of the error in the approximation of boundary
conditions. Computing, 70, 205-233
11. Verfiirth, R. (1996): A review of a posteriori error estimation and adaptive mesh-
refinement techniques. Wiley-Teubner Series Advances in Numerical Mathemat-
ics
Efficient Algorithm for Local-Bound-Preserving
Remapping in ALE Methods
1 Los Alamos National Laboratory, T-7, MS-284, Los Alamos, NM 87545, USA
[email protected]
2 Czech Technical University in Prague, Bfehova 7, 115 19 Prague 1, Czech
Republic [email protected]
3 Los Alamos National Laboratory, T-7, MS-284, Los Alamos, NM 87545, USA
[email protected]
1 Introduction
2 Algorithm Description
Suppose, we have two grids: Lagrangian C = {c} and rezoned 6 = {c}. The
grids have the same topology. The rezoned grid is created from the original
one just by small movement of the grid nodes. There exists some underlying
Efficient Algorithm for Local-Bound-Preserving Remapping 359
gc =
I g(r) dV
.::..c--::-:,..,..-:--
V(c)
m(c)
V(c) , m(c) == J g(r) dV, V(c) == J 1dV. (1)
c c
J J
puted as
M == g(r)dV = L g(r)dV = Lm(c). (2)
n '1c c '1c
m*(c) = m(c) = J
e
g(r) dV for g(r) = a + bx + cy + dz.
are coordinates of the cell center and V(c) is the volume of the cell defined in
(1 ).
For computation of slopes we use the limited form
s~ = Pc s~ unlim sY =
'c
P C syc unlim
'C
SZ =P C
SZC unlim
, (7)
Unlimited Slopes In 1D we can use just the central difference as the un-
limited slope. To compute unlimited slopes in 2D we construct a contour sur-
rounding the cell and use Green's Theorem. In 3D this would require comput-
ing intersections of this neighborhood with the original grid, which would be
too slow. So we must use another method.
Let's construct the functional
(8)
for each cell, which measures the sum of differences between the mean values in
the neighboring cells and average values of the reconstructed function from the
original cell in the same neighboring cell. We want to minimize this functional,
so we want the reconstructed function to be as close to the mean values in the
neighboring cells as possible.
We easily compute derivative of this functional with respect to all three
variables and let them be equal to zero. This gives us a linear system
which can be easily solved and gives us our unlimited slopes six,y,z} unlim.
Efficient Algorithm for Local-Bound-Preserving Remapping 361
Limited Slopes For computation of the slopes six,y,z} we use the Barth-
Jasperson limiter at each cell vertex n and than the minimum of them as
a cell limiter
J Pc(x,y,z)dxdydz. (11)
We also need to compute the integrals in the definition of cell centers Xc, Yc,
zc (6) and cell volumes VC n (1). So we need a method for integration of the
linear function over an arbitrary polyhedron. We note, that the boundary of
the polyhedron is uniquely defined, we know just the vertices of each face. If
the face vertices do not lie in one plane, the face is curved and the boundary
is not uniquely defined.
We demonstrate our integration procedure for the example of the cell vol-
ume, the integration of an arbitrary linear function is similar. The cell volume
can be written in the form
V(c) = J
c
IdV = ~ J c
div(x,y,z)dV (12)
and using the Divergence Theorem we can rewrite it as an integral over the
J y,
boundary 8c
v (c) = ~ (x, z) T • S dA . (13)
Bc
Here the superscript T means the transposition of a vector and S is the vector
normal to the boundary. The boundary integral can be split into the sum over
all faces II of the face integrals
v (c) = ~ L
IIEBV II
J (x, y, z) T . S dA (14)
Now just by averaging the coordinates of vertices of each face we compute its
center, connect it with all face vertices and split these face integrals to the
362 R. Garimella et al.
integrals over such defined triangles .<1. On each this triangle the face normal
S is constant so it can go in front of the integral
V(c) = ~ L L
IIE8V LlEII
(SX J
Ll
xdA+SY J
Ll
ydA+Sz J
Ll
ZdA) (15)
Now we project all triangles to the coordinate planes. For each triangle we
select the coordinate plane in which the triangle has the biggest area. This
ensures us that we do not get into trouble due to numerical problems. Using
Green's Theorem we reduce these integrals over triangles to 1D edge integrals,
which can be computed directly from vertex coordinates. This algorithm gives
us a method for computing the integral of the arbitrary linear function over
an arbitrary polyhedron. More details can be seen in [3].
ORIGINAL CELL - - -
SWEPT REGION
outward from the original cell and the middle part is the swept region. In fact,
all faces can move in different ways and swept regions can be tangled. If most
of the swept region goes outward from the original cell, the swept volume and
swept mass are positive, otherwise they are negative. The mass of the swept
Efficient Algorithm for Local-Bound-Preserving Remapping 363
Here f means a swept region from the set F(c) of all swept regions of the cell
c. The new mean value can be than computed as
;;; m*(c)
Pc = V(c) (17)
and as noticed before, it can violate the local bounds due to the approxima-
tion of the integration. So the third stage is necessary to enforce local-bound
preservation.
(18)
We show the repair for the example of violation of the lower bound
(19)
upper bound is done similarly. At first we compute mass, which is needed in
the cell to bring the mean value back to the local minimum
(21)
which can safely be taken from the cell without violating the local bound also.
The total available mass in the neighborhood is
5m avail =
C(c)
'\'
L om~vail
Cn '
(22)
cnEC(c)
364 R. Garimella et al.
If the available mass is too small (om~(~il < omileeded), we extend the sten-
cil and look for the available mass in a targer area. If there is enough mass
available, we perform the repair. We bring the wrong value back to the local
mllllmum
(23)
and we take the mass from the neighborhood proportionally to the mass avail-
able
om~vail
m'(cn ) = m(cn ) - ~
>: avaIl
om':.'eeded
C
. (24)
umC(c)
In [1] we proved that this algorithm succeeds in a finite number of steps and
the repair stage corrects all local-bound violations.
3 Numerical Tests
In the first example the underlying function is equal to zero everywhere, only
in a spherical region around the center of the computation domain (0,1)3 it is
equal to 1
g(x,y, z) =
I
{0
for r ::; 0.25
else
r= J(x _~) + (y _~) + (z _~)
2 2 2
(25)
We define the uniform orthogonal grid in the computational domain, the initial
function is shown on the Fig. 2. We move the grid as the tensor product
movement
Efficient Algorithm for Local-Bound-Preserving Remapping 365
a) b)
remapped using only unlimited slopes. This causes more errors, so the effect
of the repair stage is more obvious. In the a) part of the figure we can see the
function without the repair stage. The light gray cells show areas where the
extrema are violated. In the b) part we see the same remapping with repair,
no values violate the bounds.
The second numerical example shows the same cubical computational domain
with tetrahedral mesh inside. It includes about 9000 tetrahedrons. We use
the same spherical function as before, we can see it on the Fig. 4. Now, we
shake the grid randomly 10 times and remap between these grids. In the last
time step we remap back to the original grid. On the Fig. 5 we can see the
situation with the usage of the Barth-Jasperson limiter with and without the
repair stage. Again, we can see several white cells in the a) part, where the
bounds are violated. In the b) part, the repair stage corrects everything and
no problem with bound preservation is observed.
366 R. Garimella et al.
a) b)
Fig. 5. lOtimes remapped spherical function using Barth-Jasperson limiter a) with-
out repair b) with repair
4 Conclusion
5 Acknowledgments
This work was performed under the auspices of the US Department of En-
ergy at Los Alamos National Laboratory, under contract W-7405-ENG-36. The
authors acknowledge the partial support of the DOE/ ASCR Program in the
Efficient Algorithm for Local-Bound-Preserving Remapping 367
Applied Mathematical Sciences and the Laboratory Directed Research and De-
velopment program (LDRD). R. Garimella and M. Shashkov acknowledge the
partial support of DOE's Accelerated Strategic Computing Initiative (ASCI).
M. Kuchafik also acknowledges the partial support of the Czech Technical
University grant CTU0310614. The authors thank L. Margolin, B. Wendroff,
B. Swartz, R. Liska, M. Berndt and V. Dyadechko for fruitful discussions and
constructive comments.
References
1 Introduction
Ut - = j on D
div K grad U (1)
au + (3(K grad u, n) = 'IjJ on aD
D
w =
{diVW on D
-(w, n) on aD (3)
r divwdD- fan
)n
J (w,n)dS=O (4)
k
r udivw d D - J
hn
u(w,n) d S + r (w,K-1Kgradu)d D
k
= 0 (5)
can be written as
(Dw, U)H = (w, GU)H' (6)
where the inner product of vector functions (., .)H on the space of vector func-
tions H is defined by (A, B)H = In(K- 1 A, B)d D.
Our operators G, D are acting between spaces Hand H as G : H --+ H, D :
H --+ H. The Gauss theorem (6) implies that the operator G is the adjoint
operator of the operator D in the sense of defined inner products G = D*.
370 V. Ganzha et al.
The heat equation problem (1) is discretized in time by fully implicit scheme
u n +1 _ un
--,---- - div K grad u n +1 =f on D, (7)
L1t
au n + 1 + (3(K grad u n +!, n) = 'IjJ on aD,
which can be written in operator form as Aun + 1 = Fun where the operators
A and F are given by
2 Spatial discretization
We first describe the approximation of scalar and vector functions on the
given unstructured triangulation of the region D. Triangles of the grid are
numbered by index i, vertexes by index j and edges by index k with boundary
edges ordered first. The scalar function u is approximated by the piecewise
constant discrete function with constant values Ui inside each triangle i and
with constant values Uk on each boundary edge k. The vector heat flux function
w is discretized by point values at the center of each edge by the projection
W k of w to the normal to the edge as shown in Fig. 1 The normal flux is
continuous across the edges. We define space HC of discrete scalar functions
(piecewise constant functions inside each triangle and on each boundary edge)
with natural inner product
Nt Neb
(U, V)HC = LUi Vi VCi + L Uk Vk Sk,
i=l k=l
Fig. 1. Projection of vector function w to the edges normals at centers of the edges
(9)
Any triangle can be transformed into triangle with vertexes (0,0), (1,0),
(x, y) by moving, rotation and scaling. We continue the explanation here on
this triangle. The Gauss theorem (5) is applied to our triangle with linear scalar
function u and arbitrary vector function w. We require the inner product to
372 V. Ganzha et al.
define the exact gradient which results in a system of 6 linear equations for 6
variables, elements mkl of the matrix M. Only 5 equations from the system
are independent and their solution is
where 81,82 are lengths of the triangle edges 81 = J(x - 1)2 + y2, 82 =
Jx 2 + y2 and m12 remains as a free parameter.
The Sylvester criterion for positive definiteness of matrix M results in a set
of 3 inequalities which reduces to the constraint on m12 > m'ld n = 8182(1 +
(x + 1)(x - 2)/y2)/9 after simplification by using quantifier elimination. We
have found a family of inner products depending on the parameter m12 which
produce exact gradient for linear functions. The traditional scalar product
defined by (9)-(10) belongs to this family. The free parameter m12 can be used
to improve some properties of our numerical algorithm as its accuracy and
condition number of the local matrix M [6].
where we have introduced the operators M, L which connect the natural and
formal inner products. Note that the formal inner products are plain sums of
discrete values products while the natural inner products approximate inner
products on the spaces of continuum functions.
Mimetic Finite Difference Methods for Diffusion Equations 373
(DW)i = v~.
t
L Wkf Skfsign(J/+l -
J=l
J/)
inside the triangle i (sign(jf+l - j!) distinguishes the unique direction on the
edge k! connecting vertexes jf+l and jf [4]) and (DWh = -Wknk on the
boundary edge k.
The Gauss theorem (6) can be rewritten in the discrete case as (DW, U)HC =
(W, GU)HL so that the discrete gradient is the adjoint of discrete divergence
G = D*. When we transform this inner products equality into formal inner
products using the operators Land M we get
Now the formal adjoint DO can be constructed [4] and to get the gradient
W = GU (which is the natural adjoint D* U) of the scalar grid function U the
system
LW=D0MU (11)
has to be solved. The gradient constructed by this way as adjoint to divergence
has global stencil.
The discrete approximation of the global operator (8) is symmetric and
positive definite and for its inversion, which is needed in each time step of the
implicit method, we employ the conjugate gradient method. The numerical
gradient evaluated on every iteration of the conjugate gradient method by
solving (11) for E is computed by the standard Gauss-Seidel method. Our
method is exact on piecewise constant or piecewise linear solutions, otherwise
it is second order accurate.
The situation is different for the Robin boundary conditions when both
value of the scalar function u and the boundary flux (gradient of u) are un-
known on the boundary. The discrete form of the boundary conditions in (7)
is included in the global discrete system and the boundaries with Robin con-
ditions are included in scalar products used for computing conjugate gradient
coefficients and residuals. More details on numerical treatment of boundary
conditions by mimetic methods can be found in [S].
3 Numerical tests
In this section we provide several tests of the developed mimetic method. For
all tests we use initial conditions u = 0 everywhere and compute till time
sufficiently large for the solutions to reach the steady state. Exact discrete
solutions used in error evaluation are given by the point values of exact con-
tinuous solution at the median (average of its vertexes) of each triangle.
Both piecewise linear and quadratic tests are solved on the region (x, y) E
(0,1) x (0,1) with a discontinuous piecewise constant diffusion coefficient
The maximal numerical errors for this test are shown in Table 1 (a) and are
close to machine precision, showing that our method is exact for piecewise
linear solutions.
The stationary exact solution of the piecewise quadratic test, coming e.g.
from [8, 3], is a piecewise quadratic function
Mimetic Finite Difference Methods for Diffusion Equations 375
One of our aims was to develop a method working well also for bad qual-
ity, rather distorted triangular grid which appears quite often in Lagrangian
meshes moving with the fluid. To show how our mimetic method works on bad
quality grids including triangles with big angles we choose the initial grid on
the region (x, y) E (-1,1) x (0, 1) as shown in Fig. 2 (a) and stretch this grid by
a parameter a producing bad quality grids on the region (x, y) E (-a, a) x (0, 1).
The initial grid stretched by parameter a = 5 is shown in Fig. 2 (b). We solve
one problem on series of grids obtained by stretching by increasing parameter
a.
(a)
(b)
Fig. 2. Grid used for stretching the triangulation; (a) for parameter a =1, i.e.
x E (-1,1), (b) for parameter a = 5, i.e. x E (-5,5)
The problem with anisotropic triangulation is heat equation (1) with diffu-
sion coefficient K = 1, right hand side f = -2/a 2 and zero Dirichlet boundary
conditions on left and right and zero Neumann boundary condition on top and,
down. The stationary solution of this problem is u = x 2 /a 2 -1. This problem
is solved till time t = 10a 2 when the solution reaches the stationary state. We
compare the results of our mimetic support operator method with the results
of standard linear finite element method for which we use its implementation
in Partial Differential Equation Toolbox in Matlab. This comparison is pre-
sented by maximal errors in Table 1 (b) for stretching parameter a growing
from 1 to 10 000. Our mimetic method keeps the accuracy well for all values
ofa while finite element method is loosing accuracy already for a = 100. The
376 V. Ganzha et al.
Table 1. (a) Convergence tables of maximum errors Emax for piecewise linear and
piecewise quadratic tests, for quadratic test also numerical order of convergence q is
shown; (b) Maximum errors and minima of numerical solution of problem with sta-
tionary solution u = x 2 /a 2 -Ion grids stretched by parameter a by mimetic support
operator method (MSOM) and standard linear finite element method (FEM).
References
1 Guenter. [email protected]
2 University Dortmund [email protected]
Summary. The main focus of this paper is on stable FE-discretisations for treat-
ing systems of partial differential equations arising in glaciology. The systems are
coupled ones, consisting of a flow problem determining stress, pressure and velocity
and evolution problems for temperature and mean orientation densities, describing
anisotropic material behaviour. The proposed strategies are applied to a standard
model for describing ice sheet dynamics and an enhanced one, taking into account
the developement of certain fabrics in the structure of the ice.
1 Introduction
Climate, climate history and climate forecast have become more and more
important. If one wants to conceive changes of climate in the past and to
make precdictions for the future, the climate of an ice-age has to be studied.
In the context of climate simulations the flow of polar ice masses represents
an essential part. In so far there emerges a need for appropriate climate bound-
ary conditions, Greve e.a. [11], Huybrechts [13]' Fabre e.a. [3]. Furthermore
there is a need of an appropriate description of thermo-mechanical material
behaviour. For instance simulations considering future climate need reliable
constitutive relations to generate reliable predictions about the global hydro-
balance. In addition problems emerged from cold region structural engineering,
e.g. Calove.a. [1], are often become more reliable through such flow simula-
tions. On the other hand, because it may record the past history of ice and
climatic changes and because it is sensible to ice sheets deformation history,
the microstructure of polar ice is worth studying.
The growth and retreat of inland ice masses is governed by the snowfall
onto the surface, the melting and calving of the ice close to and at the outer
ice boundaries. Owing to its own weight, the ice deforms with velocities of
typically 100 meter per year causing a transport of ice towards the ice sheet
boundaries where the ice melts and calves. This process, in turn, is influenced
by the temperature distribution within the ice, implying a delicate balance be-
tween the thermal and mechanical regimes that are established by the climate
input and the geothermal conditions of the substrate. The thermomechanically
coupled ice dynamics together with the mass flux due to snowfall and mass
loss in the vicinity of ice boundaries determine the thickness distribution of
a particular ice sheet.
On Computational Glaciology: FE-Simulation of Ice Sheet Dynamics 379
The deformation of an ice sheet and the variation of its temperature dis-
tribution depends to a large extent on its thermomechanical constitutive mod-
elling. Here, we treat ice as a rheologically nonlinear, thermally coupled, vis-
cous fluid, i.e., we asume its fluidity (inverse viscosity) to be temperature-
dependent, the latter according to a power law with exponent 3, the former
essentially following an Arrhenius-type relationship.
Ice sheets are cold, i.e. below their melting point, except at parts of their
base, where the temperature may reach the melting point. For instance, this
can be concluded from radio-echo sounding of sub-Antarctic lakes (c.f. Oswald
and Robin [16]). The usual fluid no-slip boundary conditions only apply when
basal ice is cold, but at the melting temperature, ice can slide over its base
(see Paterson [17]). In Fowler [4] it is mentioned, that several models are based
on the assumption of non-zero sliding velocity, and in some cases this is even
required in order to obtain a solution (see Morland and Johnson [15] and
Hutter e.a. [12]). On the other hand, basal topographie in Antarctica is so
rough, that for the sliding law we should expect v ;;:::: 0 (c.f. Paterson [17],
Richardson [18]).
The underlying mathematical model is a coupled system of partial differ-
ential equations for describing the distributions of velocities, temperature and
evolution of the geometry of the ice sheets. This system is solved numerically
by employing the Finite Element (FE) method. The choice of an appropriate
discretisation is involved to some extent:
1) assuming the temperature to be known, we have to choose stable elements
for the saddle-point problem determining stress, pressure and velocity simul-
taneously.
2) assuming the velocity field to be known, we have to choose stable elements
for the convection-diffusion problem determining the temperature.
The standard fluid model mentioned above is necessarily isotrop and thus
cannot describe stress-induced anisotropies evident in specimens from bore-
holes. The extreme anisotropy of the ice single crystal leads to heterogeneous
intra-granular deformation modes within the polycrystal and hence to the de-
velopment of a certain fabric. The expectation is, that the climate becomes to
some extent reconstruct able from analyzing ice-core textures, e.g., Thorsteins-
son [19], in combination with the numerical solution of the ice-sheet flow
problem. Therefore considerations for an enhanced model are based on two
geometric scales.
2 Mathematical model
The ice in large ice masses is generally polytherm, i.e., the ice mass consists
of disjoint regions in which the ice is either cold (i.e., its temperature is below
the melting point) or temperate (i.e., it is at the pressure melting point),
but except for a few recent cases theoretical formulations are restricted to
cold ice. For such a case the continuum mechanical postulate ice is a slow,
380 G. Giidert, F.T. Suttmeier
c:(v) = A(T)G(rY)rY,
where A(T) denotes the Arrhenius law and G(rY) = IrYI2 according to Glenn.
The basis for applying the Finite Element (FE) method to the classical
system is the formulation in the variational setting
(CrY, T) - (c:(v), T) = 0
(divv, q) = 0
(rY,C:('P)) - (p,div'P) = (t,'P)
and
(1)
B(U;tJ.5) =0 (2)
with
On Computational Glaciology: FE-Simulation of Ice Sheet Dynamics 381
3 Discretisation
The full discretisation for the system (2) is derived in two steps. First, we
perform a discretisation with respect to the time variable, yielding a sequence
of problems continuous with respect to the space variable. In the second step,
these problems are approximated by the finite element method.
3.1 Semi-discretisation
For discretization, the time interval [0, tMJ is decomposed like 0 = to < ti <
... < tM into subintervals 1m := (tm-i' tmJ of length k m := tm - t m - i .
382 C. Ciidert, F.T. Suttmeier
Fig. 1. Sketch of computational domains for the test examples including structure
of the FE-meshes for the benchmark problem
JI~
w(t)dt=km{awm+(l-a)wm-l}, (5)
with some a E (0,1]' yields the time dicrete schemes. The choice of a = 1
corresponds to the backward Euler scheme, while for a = ~ , we obtain the
Crank-Nicolson scheme. Here, we only consider the simple Euler scheme which
reads
0= B(U m ; tjj) := (Tm - T m - 1, w) (6)
+ (CUm,T) - (S(Vm),T) - (divvm,q)
+ (u m , s(ip)) - (pm, divip) - urn, ip)
+ km(((v m . \1)Tm,w) + (\1Tm, \1w) - (ums(vm),w)).
In this section, we describe the algorithm (see e.g. Geiger and Kanzow [7]) we
employ to solve the problems arising in (6) having the general structure
(8)
(9)
where a j is choosen s.t.
(10)
with a constant /j ;:::: O.
3. Set j = j + 1 and goto 1.
Remark: The Newton scheme is defined by the choice
(11)
V<l> E V h, (12)
384 C. Codert, F.T. Suttmeier
(
A 0
o 0 CT
-BT)
-BC 0
A= (~ ~), B = (- B, C) ,
4 Enhanced Model
An important by-product of the simulation of the dynamics of large ice masses
is the determination of the age of the ice as a function of its position where it
is located today. Hereby, the age of an ice particle is defined as the time that
elapsed since it fell as a snow flake on the free surface. The problem of the ice
On Computational Glaciology: FE-Simulation of Ice Sheet Dynamics 385
age - depth correlation has indeed become a central question in the reconstruc-
tion of the past climate from many ice cores in Greenland and Antarctica. The
thermomechanically coupled ice-sheet models yield this information on the ice
motion through spatial and temporal integration, however, the resulting mo-
tion depends on the underlying constitutive behaviour usually assumed to obey
an isotropic, non-linearly viscous flow law, as described above.
More precisely, such laws should be based upon the microscopic structure
of the material, which is for ice, as for metals, a polycrystalline aggregate
consisting of hexagonal single crystal grains. The ice in glaciers is not isotropic;
at closer examination it is seen to be built by a large number of differently
oriented, strongly anisotropic ice crystals. As structural elements one may
distinguish two building stones defining the crystal: first the basal plane along
which the crystals may relatively easily slide, and second, the unit normal
vector perpendicular to it, defining the so-called c-axis.
The anisotropy of ice crystallites is given by these axis, the fine-scale is given
by the crystallites orientation defined on the sphere 8 2 , whereas the large-scale
is to be identified with the space of daily experience say ]Rd. Following Godert
[10] the fine-scale-structure is actually considered via the second-order struc-
ture tensor A denoting mean orientation densities and yielding an anisotropic
material behaviour.
Therefore our classical system is enhanced by
A=E[v,A] (13)
d 1
0:1:=O:d_1(h 2 - d) (14)
denotes a measure of alignment, 0:1 ----) 0 for randomly distributed c-axes ori-
entations, whereas 0:1 ----) 0: if all c-axes are parallel. Furthermore 0: = 1.2 is
determined via field data (c.f. Godert [9]) and h2 denotes the inner product,
IA2 := (I d, A 2). The macro-space dimension is given by d, AO controls the
diffusion due to recrystallisation, e.g. Godert [10].
The operator ((0:1 - 1)Idev - 20: 1'Pa ) is realised as a 3 x 3-matrix, applied
to the vector
386 G. G6dert, F.T. Suttmeier
(15)
WA - AW =
(OyVI - OxV2) (
2 2aI2 - 2a12 (a22 - all)
)T .
5 Numerical results
The numerical results presented throughout this work are obtained by FE-
implementations based on the DEAL-library [2].
0.35 ,-....,.--.-----,-....,--,--....,.--.-----,-....,----,
u--
f(x) ---------.
0.3
0.25
0.2
0.15
0.1
0.05
0_1'__---'----'---'----'-----'----'---'----'-----'---'
o 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
The computations are performed using about 23365 degrees of freedom. The
time step is choosen adaptively via
with c:5 = 0.001. Here U1 and U2 denote the solution at time step m - 1
obtained by employing Euler and Crank-Nicolson scheme respectively. The
typical developement of the local step size is depicted in Figure 3 showing
k m to increase in the stationary limit. The considerations are restricted to
isothermal flow.
Now, we compare the horizontal velocity along the vertical line x = 1
for the isotropic ((3 = 1) and the enhanced model ((3 = 0.25). In Figure 4
it is shown, that the enhanced model becomes faster, after the stationary
limes was reached. This is in agreement with results found in Gagliardini and
Meyssonnier [6].
Eventually, we investigate the influence of .AD, which controls the effect of
recrystallisation, on the orientation of the c-axis described by the parameter
0:1 in (14). The results are shown in Figure 5. As predicted from theory the
maximum value of 0:1 is controlled by .AD, and we observe 0:1 --> 0: as .AD tends
to zero.
388 G. G6dert, F.T. Suttmeier
time-steps ---+-
0.1
0.01
0.01 0.1 10
Fig. 3. Typical developement of the local step size during the simulation, showing
krn to increase in the stationary limit
0.6 ,--r----,----,---r--,-----,--.....,---,
u, 1 ---+-
u,0.25
0.5
0.4
0.3
0.2
0.1
0_--'---'---'----'----'----'--'-----'
o 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
0.8
0.6
0.4
Fig. 5. Evaluation of C¥l along vertical line for different values of the parameter AO
6 ConcI usion
We have presented a stable FE-discretisation for treating systems of partial
differential equations arising in glaciology. The system is a coupled one, con-
sisting of a flow problem determining stress, pressure and velocity and evo-
lution problems for temperature and mean orientation densities, describing
anisotropic material behaviour. The time discretisation is done by the stable
backward Euler scheme. Stable discretisations with respect to space are estab-
lished by a cell-wise constant pressure approximation and discontinuous and
continuous Ql-elements for stress and velocity respectively. Temperature and
orientation densities are approximated by continuous bilinear functions as well,
where the corresponding transport dominated equations require further stabil-
isation, which is provided by the streamline diffusion method. The proposed
strategies are applied to a standard model for describing ice sheet dynamics
and an enhanced one, taking into account the developement of certain fabrics
in the structure of the ice.
References
1. R. Calov, A.A. Savvin, R. Greve, L Hansen, and K. Hutter. Simulation of the
antarctic ice sheet with a three-dimensional poly thermal ice sheet modeL Annals
of Glaciology, 27:201-206, 1998.
2. DEAL. differential equations analysis library. available via http://www-
lsx. mathematik. uni-dortmund. de/user/lsx/suttmeier/deal.html, 1995.
390 G. GCidert, F.T. Suttmeier
1 Introduction
lim
r--+CX)
Vr (f)f)r - ik) u = o. (3)
The wave number k and the source term f can vary in space, while f may be
nonlinear. We have chosen the Dirichlet-type condition (2) for simplicity. The
Sommerfeld radiation condition (3) ensures that the scattered field corresponds
to a purely outgoing wave at infinity.
We assume that both scatterers have compact support and that they are
well separated. In this case, they can be surrounded by two non-overlapping
disks centered at Cl, C2 with radii R l , R 2 , respectively, such that in the un-
bounded domain D outside these disks, the scattered field satisfies the homo-
geneous Helmholtz equation with a constant wave number k > 0, together
with the Sommerfeld radiation condition:
lim
r--+(X)
Vr (f)f)r - ik) u = 0 (5)
region D into two purely outgoing wave fields Ul, U2 which solve the following
problems:
Nonreflecting Boundary Conditions in Multiple Scattering 393
lim
r~CX)
Vi (~
ur - ik) Ul = 0, (7)
lim
r-+oo
Vi (~
uT
- ik) U2 = o. (9)
lim
r-+CX)
Vi (~
ur - ik) U =0 (12)
JIVi
BR
(:r - ik) uI2 ds = R
BR
J1~~ 12 +k 2 1u1 2 ds-ikR
BR
J(u ~: - ~~u) ds.
(13)
From (12) we observe that the left side of (13) tends to zero as R --t 00. Next,
we use Green's formula and (10) to conclude that
BR
J(u ~: - ~~ U) ds J(u ~~ - ~~ U) ds.
=
oK
(14)
Here fJ/fJn is the derivative in the direction of the normal vector on fJK point-
ing away from K. The right hand side of (14) vanishes because of (11). Hence
Jlul
(13) implies that
lim
R---+oo
2 ds = o. (15)
BR
By Rellich's theorem (see for example [7], Lemma 2.11), we conclude
u == 0 in]R2 \ K, (16)
which completes the proof of Proposition 1. D
394 M.J. Grote et al.
for j = 1,2. The prime after the sum indicates that the term for n = 0 is
multiplied by 1/2, while H~I) denotes the n-th order Hankel function of the
first kind. Now let u E C 2 (000) be a given function which satisfies (4), (5).
Let Ul and U2 be the solutions to (6), (7) and (8), (9), respectively, together
with the following matching condition on B:
(18)
Both u and Ul +U2 satisfy (4), (5) in D = D 1 nD 2. Since u and Ul +U2 coincide
on B, they coincide everywhere in the exterior region D. We summarize this
result in the following proposition.
Proposition 2. Let u E C 2(000) be a function which satisfies (4), (5). Then
(19)
where Ul and U2 are solutions to the problems (6), (7) and (8), (9), respectively,
together with the matching condition (18). The decomposition ofu into the two
purely outgoing wave fields Ul and U2 is unique.
(20)
Then we introduce the propagation operators
(23)
Note that the operator equation (23) with vanishing right-hand side admits
only the trivial solution, by the uniqueness proof above. Therefore, if K is a
compact operator, existence and uniqueness of a solution to (23) for an arbi-
trary right hand side follows from Fredholm's alternative. When the solution
to (23) is found, it can be extended by (17) to a solution Ul of (6), (7) and a
solution U2 of (8), (9), respectively. In their common domain D = D1 nD 2 , the
superposition U1 + U2 then satisfies (4), (5), and (18) on B. By Proposition 1,
U == U1 + U2 in D.
It remains to show that the operator K is compact. The propagation op-
erator P1 is defined by
In (25), r1 = r1(&2) and &1 = &1(&2) denote the polar coordinates of the points
on B2 relative to the center of B 1. The truncated version of P1, denoted by
P{" N E N, is defined as in (25), with the infinite sum truncated at N.
Lemma 1. The propagation operators have the following properties:
1. p{' : CO(B 1) -+ CO(B 2) is a bounded linear operator of finite rank
2. P1 : CO(B 1) -+ CO(B 2) is a bounded linear operator
3. IIP{' - P1 11 -+ 0, N -+ 00
From Lemma 1, compactness of P1 follows (see for example [9]' Corollary
11.3.3).
Proof. (Lemma 1)
1. This property is obvious from the definition of p{'.
2. The linearity of P1 is obvious from the definition. We shall now show the
boundedness of P1 .
Because the scatterers are well separated, there exists an rmin > R 1 , such
that r1(&2) ;:::: rmin for all &2 E [0,271']. Because the function xIH~1\x)12 is
monotone decreasing for x > 0 and n > 1/2 [10]' we therefore have
(27)
We will now show the convergence of the series 2:~=1 an. From the asymp-
totic behavior of the modulus of the Hankel functions for large orders [11],
396 M.J. Grote et al.
IH~l)(ZW rv _1_
21m
((eZ)2n
2n
+4 (eZ)-2n),
2n
n ~ 00, ZE JR, (28)
n~ 00, (29)
with
ekrmin
"(n:=~' (30)
Because "(n ~ 0, n ~ 00 and rn ~ 0, n ~ 00, we find that the expression
on the right hand side of (29) converges, and therefore
.
hmsup I- - I = hm
an+l . I- -I= -
an+l Rl- < 1. (31)
n--+oo an n--+oo an r min
The ratio test now ensures convergence of the series L::~=l an. We define
(32)
and estimate
(33)
(34)
Lan.
00
Lan.
00
The right hand side tends to zero as N ~ 00. We conclude IIPf -Ptil ~ 0,
N~oo.
This completes the proof of Lemma 1, which implies the compactness of the
operator Pl. Compactness of P2 is shown similarly and compactness of K then
follows. This completes the proof of Proposition 2. D
Nonreflecting Boundary Conditions in Multiple Scattering 397
The expressions on the right hand sides of (37), (38) and on the left hand
sides of (39), (40) are evaluated explicitly by using (41)-(43) and the exact
Fourier representation (17), which involves some straightforward but technical
coordinate transformations.
The matching condition (39), (40) cannot be inverted explicitly, and
thereby Ul and U2 eliminated in the DtN map (37)-(40). Instead, one also
needs to compute the values of Ul on Bl and U2 on B 2.
With the DtN condition (37)-(40), we are now able to solve a boundary
value problem in the finite domain D:
Llu + k 2 u = f in D, (44)
U = g on r, (45)
onu = M[Ull + T[u2l on B l , (46)
onu = M[U2l + T[Ull on B2, (47)
Ul + P[u2l = U on B l , (48)
P[ull + U2 = U on B2. (49)
In [8l we prove the following theorem, which ensures existence and uniqueness
of a solution to the DtN problem (44)-(49).
Theorem 1. Assume that the free space problem (1)-(3) has a unique classi-
cal solution U E C 2(Doo) which satisfies (4), (5). Then the double scattering
boundary value problem (44) -( 49) has a unique solution in D, which coincides
with the restriction of u to D.
398 M.J. Grote et al.
3 Numerical example
We consider scattering of an incident plane wave impinging on three sound-soft
obstacles with kidney-shaped boundaries. The wave number is k = 8IT. For the
numerical solution with a second-order finite difference scheme we generalize
the DtN condition presented above, from two to three scatterers. This gener-
alization is straightforward and explicitly described in [8]. The contour lines
of the real part of the total field are shown in Fig. 1. In [8] we compared our
multi-DtN condition with the standard DtN condition applied to one single
very large domain and showed that the two solutions coincide.
4 Conclusion
We have generalized the Dirichlet-to-Neumann boundary conditions for single
scattering to multiple scattering problems. To do so we have used an expan-
sion of the scattered field into multiple purely outgoing wave fields. We have
shown that the multi-DtN condition is exact, which implies that no spurious
reflections appear at the artificial boundary. When used in a numerical scheme,
the multi-DtN condition allows for much smaller computational domains than
the single-DtN condition, especially when the scatterers are far away from
each other. Moreover, the amount of work does not increase with increasing
distance between the obstacles. We have illustrated the use of the multi-DtN
condition via a numerical example.
References
1. Keller, J.B., Givoli, D. (1989): Exact non-reflecting boundary conditions. J.
Compo Phys. 82, 172-192
Nonrefiecting Boundary Conditions in Multiple Scattering 399
-3 -2 -1
Fig. 1. Contour lines of the real part of the total field for plane wave scattering on
three kidney-shaped obstacles with sound-soft boundaries. The plane wave is incident
from the right and the wave number is k = 87r. The multi-DtN condition is used at
the artificial boundary components, combined with a second-order finite difference
scheme in the interior.
1 Introduction
Au = f, f E R(A) , (1)
* This work was supported by the Estonian Science Foundation, Grant No. 5785.
Choice of the Regularization Parameter 401
2 Regularization methods
We consider the regularization methods in the general form (see [1, 2])
(2)
Here Po, 'Y and 'Yp are positive constants, a 2: IIAII, 'Yo ::; 1 and the greatest
value of Po, for which the inequality (4) holds is called the qualification of
method.
The following regularization methods are special cases of the general
method (2).
M5 The method of the Cauchy problem. We take the solution of the Cauchy
problem
u'(r) + Au(r) = j, u(O) = Uo
for the approximation Ur to the solution of problem (1). Here gr(A)
±(1- erA), Po = 00.
402 U. Hiimarik, T. Raus
B _ {I for Po = 00 ,
r - (I - Agr (A))l/ p o for Po = 00
The MD rule has some advantages over the discrepancy principle since
(i) the MD rule can also be used for the Lavrentiev method;
Choice of the Regularization Parameter 403
1/y'2e for methods M1-M5 respectively. If rp(1) ::; b20 then choose r(o) = l.
In the contrary case we find at first r2 (0) > 1 such that
(6)
(7)
For the regularization parameter r(O) we choose the parameter r, for which
the function t(r) = r S IIBr(Au r - j) I has the global minimum on the interval
[1, r2(0)].
In iterative methods the following rule P' can be used for the choice of the
stopping index n(O) as the parameter r. For the rule P' the analogous results
hold as for the rule P.
Rule P'. Let 0 ::; s ::; 1 and b be the constant such that b > Cm. Find
n2(0) as the first n = 1,2, ... , for which rp(n) ::; M. For the regularization
parameter n(O) we choose n E N, for which the function t(n) = nSllAu n - jll
has the global minimum on the interval [1, n2(0)].
In [13] the rule for the choice of the parameter was considered, in which for
the regularization parameter the parameter r2(0) was taken. We can consider
the rule P as the generalization of this rule, since in case s = 0 these rules
coincide due to the fact that the function IIBr(Au r - j) I is monotonically
decreasing with respect to r. On the other hand, in case s = 1 the rule P
is similar to the non-selfadjoint analogue of the parameter choice rule by the
quasioptimality criterion, since we choose the minimum point of the function
rllBr(Au r - j) I for the regularization parameter. The only difference between
the rule P and the analogue of the quasioptimality criterion is the interval, on
which the function rllBr(Au r - jll is minimized: the intervals are [1, r2(0)] and
[1, 00) respectively.
In [13] the following results are proven for methods M1-M5:
(i) for each j E H we have lim rp(r) = 0;
r->oo
(ii) if Ilj - III ::; 0, IluD - u*1I ::;
M, b 2: Cm, then for each r, r 2: RM,li =
CmM/(b - Cm)o we have rp(r) ::; M; here C m = 12V15/125, C m =
(3/2)(3/2)mm/(m + 3/2)m+3/ 2, em = (3/(2;.te))3/2, C m = (30:/2)(3/2),
C m = (3/(2e))3/2 for methods M1-M5 respectively;
(iii) if II!~ ill::; const for 0 - t 0 then Ilur2 (li) - u* I - t 0 for 0 - t O.
Due to the continuity of the function rp(r) from the property (i) follows
that the choice of finite parameters r2 (0) and r( 0) ::; r2 (0) according to Rule
P is possible. The property (ii) says that if we know a constant M > 0 such
that IluD - u*1I ::; M, then it is sufficient to search the parameter r2(0) in
the finite interval [1, RM,li]. Note that the function rp(r) is non-monotone and
therefore in Rule P we must use the conditions (6)-(7) instead of inequalities
ho ::; rp(r) ::; b20.
Note that the analogues of the results of the paper [13] for non-selfadjoint
problems are presented in [14].
Choice of the Regularization Parameter 405
(8)
To prove the theorem, it suffices to show the convergence of the right-hand
side of (8). In [13] it is proven that
(9)
From inequality r(5) :::; r2(5) and from (9) follows the convergence of the
second term of (8). To show the convergence of the first term of (8), we prove
at first that
(10)
We have
Br(Aur - i) = ABrGr(uo - u*) - BrGrU - 1), (11)
from which with regard to the inequality IIBrGrU - 1)11 :::; Iii - fll :::; C5
follows that
we consider the cases a) r2(5) ----., 00, b) r2(5) :::; r = const separately. If
r2 (5) ----., 00 in the process 5 ----., 0 then using the Banach-Steinhaus theorem we
can prove similarly as in [2] (p. 43) that rPIIABrGr(uO - u*)11 ----., 0 if r ----., 00
(0:::; p :::; 1). Now we consider the case r2(5) :::; r = const. Using (11), (4) we
get
IIUr(8) -u*11 : : ; cl(b 1 , b*) 1~ s ~~~ {II(I -Agr(A))(uo -u*)11 +,rlli - fll};
(16)
Choice of the Regularization Parameter 407
5 Numerical experiments
Discretization of ill-posed problems often leads to the linear systems of alge-
braic equations with large condition numbers. In our numerical experiments
we solved many linear systems of equations Au = 1, where A was the 100 x 100
diagonal matrix with the diagonal elements AI, ... , A100. For generating eigen-
values Ak of A, solution u = (U1' ... , U100 f, noise in 1 and supposed noise level
J different schemes were used and the concrete scheme was chosen randomly
by computer. Schemes for eigenvalues Ak of A and for solution u were
_ _ [n ] 1/2
h = lk + 2(rand (0,1) - 0.5) I 1 - 111 / ~ i~ (2(rand (0,1) - 0.5))2 ,
k = 1,2, ... ,n
or all the noise was concentrated on one eigenvalue, chosen randomly:
For the supposable noise level J = III - 111/d was taken, where values of d were
1, 3, 5, 10, 20, 50, 100. By the Lavrentiev method 6000 different variants of
the problem Au = 1 were solved, parameter 'r was chosen by the rule P. The
ratios
the same problems also by MD rule using the actual noise level and then the
ratio V was almost 1 (average of V was 0.95, maximum of V was 1.24).
The results of numerical experiments are given in Table 1. The results show
that in the case of the exactly estimated noise level (d == Iii - fll/8 = 1) rule
P works nearly as well as the MD rule (averages of ratio V were 0.991 and
0.95 respectively). As expected, in the case of the underestimated noise level
(d > 1) the error of the approximate solution is larger than for the MD rule
which uses the exact noise level. But the ratio of errors of these approximate
solutions is relatively small in comparison with the error, made by estimating
the noise level. For example, if the noise level was d = 100 times smaller than
the real value, the error of the approximate solution for rule P was only 7%
larger than for the MD rule which uses the exact noise level (the corresponding
averages of V were 1.022 and 0.95), and for 96% of problems the ratio V for
rule P was smaller than 1.5. Numerical experiments also showed that if the
actual noise level is larger than the noise level used in the MD rule, then this
rule is not good. For example, if the actual noise level was 3 times larger than
the noise level used in the MD rule, the ratio V was in most cases larger than
10.
Table 1. The Lavrentiev method, Rule P, s = 0.75, bl = b2 = 1.5Cm
References
1 Introduction
In recent years there has been tremendous interest in the design of discon-
tinuous Galerkin finite element methods (DGFEM) for the discretization of
compressible fluid flow problems; see, for example, [2, 3, 6, 7] and the refer-
ences cited therein. The key advantages of these schemes are that DGFEM pro-
vide robust and high-order accurate approximations, particularly in transport-
dominated regimes, and that they are considerably flexible in the choice of
mesh design. Indeed, DGFEM can easily handle non-matching grids and non-
uniform, even anisotropic, polynomial approximation degrees.
In this paper we introduce the symmetric version of the interior penalty
DGFEM for the numerical approximation of the compressible Navier-Stokes
equations. We then consider the a posteriori error analysis and adaptive mesh
design for the underlying discretization method. In particular, here we focus
on so-called 'goal-oriented' a posteriori error estimation which bounds the er-
ror measured in terms of certain target functionals of real or physical interest.
Typical examples that we shall consider here include the drag and lift coeffi-
cients of a body immersed in a viscous fluid. By employing a duality argument
we derive a weighted (Type I) a posteriori error bound which reflects the error
creation and error propagation mechanisms inherent in viscous compressible
fluid flows. On the basis of this a posteriori estimate, we design and implement
the corresponding adaptive algorithm to ensure both the reliable and efficient
control of the error in the prescribed target functional. The superiority of the
proposed approach over standard mesh refinement algorithms which employ
Adaptive DGFEM for the Compressible Navier-Stokes Equations 411
to [2, 3, 5, 6] and the references cited therein for further details concerning the
imposition of suitable boundary conditions.
+
Inr P(Uh, VhUh) : VhVh dx - lEx
r {{P(Uh, VhUh)}} : --
[Vh] ds
Nr(Uh, Vh) = r
lr
1i(ut, ur(ut), n) . r 17 (ut -
vt ds + l~u~ ur(ut)) . vt ds,
-r
} rDurW,iso
.P(Uh, VhUh) : vt 0 nds - r } rW,adia
j"V(Uh' VhUh) : vt 0 nds
- 1r
r\rN,sup
(G~(ut)8hVtl8x;,Gi;(ut)8hVtl8x;): (ut -ur(ut)) 0nds,
where the boundary function Ur (u) is given according to the type of boundary
condition imposed. We set ur(u) = UD on r D , ur(u) = U on rN,sup, and
ur(u) = (p, PV1, PV2, ~':.'~ + !pv
2 f on rN,sub' Furthermore, we set ur(u) =
(p, 0, 0, PCvTwanf on r W , iso, and ur(u) = (p, 0, 0, ~)T = (p, 0, 0, pE - !pv 2)T
on rW,adia' Finally, we note that on the adiabatic boundary rW,adia, we define
j"V(u, Vu) such that
P(u, Vu) . n = (0, Tl1nl
A
In this section, we shall be concerned with controlling the error in the nu-
merical solution measured in terms of a given target functional J(-); for a
detailed discussion, we refer to the review articles [4, 13]. Assuming that J(.)
is differentiable, we write
11 N~[eu +
-
for all v in V. Here, N~[w](·, v) denotes the Frechet derivative of U f-+ N(u, v),
for v E V fixed, at some w in V. We remark that the linearization defined in
(5) is only a formal calculation, in the sense that N~[w](·, .) may not in general
exist. Instead, a suitable approximation to N~[w](·,·) must be determined, for
example, by computing appropriate finite difference quotients of N(·, .), cf.
[9,10]. Given a suitable linearization, we introduce the following dual problem:
find Z E V such that
We assume that (6) possesses a unique solution. Clearly, the validity of this
assumption depends on both the definition of M(u, Uh;"') and the choice of
the target functional under consideration, cf. [10]. For the proceeding error
analysis, we must therefore assume that the dual problem (6) is well-posed.
Proposition 1. Let U and Uh denote the solutions of (2) and (3), respectively,
and suppose that the dual problem (6) is well-posed. Then,
where
Ti", = 1'"
Rh . Wh dx + r
Jo",\r
(FC(Uh) . u'" -1i(ut, Uh, U"')) . wt ds
+ r
Jo",nr
(FC(Uh) . u'" -1i(ut, ur(ut), U"')) . wt ds
- Jo",\r
r 17[uh]--
: wt 0 u'" ds
- JrOl<n(rN,suburN,,,,p) (P(ut,V'hut).u",).wt ds
-ia",nrw,adia (FV(ut, V'hut) - PV(ut, V'hut)) : wt 0 U ds
+ r
J o",n(r\rN,sup)
(G~(ut)8hwtl8xi,G~(ut)8hwtl8xi) :
5 Numerical example
In this section we present a numerical example to highlight the advantages
of designing an adaptive finite element algorithm based on the weighted er-
416 R. Hartmann, P. Houston
0.01
(a) (b)
Fig. 1. (a) Mach isolines, Ma = fa, i = 1, ... ,10, of the flow around the NACA00l2
airfoil; (b) Convergence of the error IJ(u) - J(uh)1 using each mesh refinement strat-
egy
ror indicator 17],,1 in comparison with both uniform mesh refinement, as well
as an adaptive algorithm based on an empirical refinement indicator which
does not require the solution of an auxiliary (dual) problem; for simplicity, we
employ a Type II residual indicator of the form derived in [10]. Throughout
this section, we employ the Vijayasundaram flux for the discretization of the
convective terms and set p = 1 (bilinear elements). Finally, for both adaptive
refinement strategies, we use the fixed fraction refinement algorithm with re-
finement and derefinement fractions set to 20% and 10%, respectively; we also
note that for the computation of 11],,1, the dual solution is approximated using
piecewise biquadratic polynomials.
We consider a Mach 0.8 flow at an angle of attack 0: = 10° with Reynolds
number Re = 73 and constant temperature on the profile, cf. [3, 12]. The so-
lution to this problem consists of a flow that is mainly subsonic with a small
supersonic region above the airfoil, see Fig. l(a). Here, we consider the evalu-
Adaptive DGFEM for the Compressible Navier-Stokes Equations 417
(a) (b)
Fig. 2. Convergence of the nonlinear residual with the number of Newton steps
employed: (a) Uniform mesh refinement; (b) Adaptive mesh refinement using the
empirical error indicator
ation of the inviscid drag coefficient (Cdp) on the surface of the airfoil. On the
basis of a fine grid computation, the reference value of the functional is given
by J(u) ~ 0.224.
In Fig. 1 (b) we compare the true error in the computed target functional
J(.) using all three mesh refinement strategies. Here, we clearly observe the
superiority of the weighted a posteriori error indicator; at all refinement steps
the error in the computed functional is less than the corresponding quantity
when either uniform refinement or adaptive refinement based on an empirical
residual indicator is employed. Indeed, on the final mesh the true error in J(.)
is over an order of magnitude smaller than IJ(u) - J(uh)1 computed on the
sequence of meshes generated by the empirical indicator.
In Table 1 we collect the data of the adaptive algorithm when employ-
ing the weighted indicators. Here, we show the number of elements and de-
grees of freedom (DOF) in V h, the true error in the functional J(u) - J(Uh),
the computed error representation formula, the approximate a posteriori error
bound and their respective effectivity indices e1 = LI< TJI</(J(u) - J(Uh)) and
418 R. Hartmann, P. Houston
82 = 2:1< irJI<I/IJ(u) - J(uh)l· First we note that on all refinement steps the
correct sign of the error is predicted by the computed error representation
formula. Furthermore, whereas on very coarse meshes the quality of 2:1< 7]1< is
rather poor, in the sense that 81 is noticeable smaller than one, we see that
the effectivity indices 81 slowly tend towards unity as the mesh is refined. On
the other hand, even just the application of the triangle inequality leads to sig-
nificant over-estimation of the true error in the computed functional; indeed,
here we see that 82 slowly increases.
As a final remark, we note that in all of our computations, we employed
a (damped) Newton iteration to solve the system of nonlinear equations aris-
ing from the discontinuous Galerkin discretization of the compressible Navier-
Stokes equations. Within each Newton step, GMRES with Block-Gauss-Seidel
preconditioning is exploited to solve the resulting linearized problem involving
the Jacobian matrix, cf. [9]' for further details. For each of the three mesh re-
finement algorithms employed above, the nonlinear solver proceeds as follows:
starting on the coarsest mesh with free-flow conditions the nonlinear prob-
lem is solved by the Newton iteration described above. When the nonlinear
residual converges below 10- 8 , the mesh is refined once; then the discrete so-
lution is interpolated onto the new mesh and is thereby taken as the starting
solution for the Newton iteration on the newly refined mesh. In Table 2 we
present the history of this solution process on a sequence of uniformly refined
computational meshes; these results are also summarized in Fig. 2(a). On the
coarsest mesh, with free-flow values, the Newton iteration requires a few steps
until the iterative solution reaches the range of quadratic convergence. Indeed,
after only 6 Newton steps the nonlinear residual is below the given tolerance
of 10- 8 and the mesh is refined; on subsequent meshes, the Newton iteration
requires only 4 steps to reduce the residual below the given tolerance.
The convergence behaviour of the Newton iteration is analogous even when
locally refined meshes are employed. Indeed, in Fig. 2(b) we plot the nonlinear
residuals against the number of Newton steps employed for each of the meshes
generated using the empirical error indicator. Here, we again see that on the
first mesh 6 Newton steps are required to satisfy the convergence criterion,
while only 4 are necessary on subsequent meshes. Analogous behaviour is also
observed on the adaptive meshes generated using the weighted error indicator
irJl<l; for brevity, these results have been omitted.
References
1. Arnold, D, Brezzi, F., Cockburn, B., Marini, D. (2002): Unified analysis of discon-
tinuous Galerkin methods for elliptic problems. SIAM J. Numer. Anal., 39(5),
1749-1779
2. Baumann, C.E., Oden, J.T. (1999): A discontinuous hp finite element method
for the solution of the Euler and Navier-Stokes equations. Int. J. Numer. Meth.
Fluids, 31, 79-95
Adaptive DGFEM for the Compressible Navier-Stokes Equations 419
1 Motivation
Nowadays, computational mathematics becomes part of new challenging inter-
disciplinary computer-assisted technologies in medicine, natural sciences, in-
dustry and elsewhere 1 . Specifically for mesh generation this means that sim-
plification and automation of geometrical inputs becomes a crucial issue. Tra-
ditional algorithms, among which the most popular ones are based on Solid
Modeling Techniques (SMT) and CAD modellers, have not been designed to
process machine-generated information such as outputs of MRI scans, image
recognition devices, physical measurements, geographic databases etc. It is our
aim to design a fully automatic parallel generator of unstructured tetrahedral
meshes with this capability.
with
1 The authors acknowledge the financial support of the Grant Agency of the Czech
Republic under Grant No. GP102/01/D114.
On a Novel Technique for Parallel Unstructured Mesh Generation in 3D 421
R(¢) = 1- :L,
8~
r(¢) = 0.3 (1 - :L) .
8~
XD = {
1, x E n (1)
0, x¢ n.
In this paper we introduce a first working version of a C++ mesh generator
XGEN3D that has been designed to process this form of geometrical input.
The algorithm consists of four steps,
and defining k triangles (PI, P2 , Pc), (P2 , P3 , Pc), ... , (Pk - l , Pk , Pc).
Summarized, the recursive algorithm of approximating the boundary an
can be written as follows:
With a suitable choice of the repulsing force Fij of particles Pi, Pj, steady
state of the system is usually achieved after a few iterations. An example of
the repulsing force F that respects the variable mesh density h is
Pi+Pj ) Pi-Pj
Fij = F(Pi , Pj ) = h ( 2 IPi _ Pi 13·
The total force working on each particle Pi is computed from contributions
of particles that lie in a limited distance from Pi. In this way we use the
oct-tree structure available from Step 1 to avoid quadratic complexity of the
algorithm. One further uses the magnitude of forces working at each particle
together with the mesh density function in order to calculate a suitable time
step. The time step is defined in such a way that any particle Pi displaces at
most h(Pi)/D, D > o. The default value of the parameter D in the code is
D = 5. If D is chosen too large, the system will evolve very slowly and one
will have to perform a large number of iterations before the steady state is
reached. On the other hand, a too small value of D would cause instabilities
and prevent the system from converging at all.
In addition to that one has to withdraw the kinetic part of the total energy
from the system after each time step by resetting the velocities of all particles to
zero. This action is analogous to cooling of a physical system that is necessary
in order to reach the bottom of the potential well.
When moving a particle, we have to check whether it crosses the boundary
an on its way. If so, we compute the intersection of the boundary with its
trajectory and position the particle to the point of intersection. In this mo-
ment the particle becomes a boundary particle and it is not allowed to leave
the boundary anymore. From now on the particle is being moved along the
boundary - we compute the force acting on it but only consider the projec-
tion onto the plane tangential to an.The outline of the algorithm is as follows:
Let us remark that it is desired that the mesh density function h(x) does
not vary too fast, otherwise tetrahedra with obtuse angles would arise in the
correspondent areas of the domain, which would lead to low quality of the
resulting mesh. It is difficult to state any quantitative conditions, but we can
roughly say that the function h(x) should be Lipschitz-continous with the
Lipschitz constant not greater than 1/3. However, this value depends on the
domain geometry. Generally speaking, the function h(x) should be chosen as
smooth as possible.
4 Surface meshing
The set of boundary grid points is used to generate a surface mesh - this is
a standard task that virtually all mesh generators have in common. We apply
an Advancing Front (AF) algorithm [4] for this purpose. See, e.g., [2] for a
more general description of AF algorithms.
Because of some temporary technical difficulties that we hope to overcome
soon we request the user to partition the surface of the domain into M sub-
regions aD I , aD2 , ... , aDM of simpler shapes. Function Numbering(x,y,z)
then returns the index k of a subregion aDk the point [x, y, z] belongs to. Or-
dered chains of abscissas that separate a subregion aDk from the rest of aD
are called frontiers.
We generate the surface triangulation for each subregion aDk separately
using an AF technique. The k-th frontier is the starting point. We copy all its
abscissas to the list of abscissas A. Then, for the first abscissa (P, Q) in A, we
search a boundary grid point Z = [x, y, z] such that Numbering(x,y ,z)=k and
the angle (P, Z, Q) of abscissas (P, Z) and (Q, Z) is maximal. In other words,
we look for the maximizer Z of the angle (P, Z, Q) in the set of all bound-
ary grid points such that Numbering=n. The fact that the angle (P, Z, Q) is
maximal ensures that there is no boundary grid point lying inside the triangle
(P, Q, Z). If the mesh density function h(x) is Lipschitz-continous with rea-
sonably small Lipschitz constant and we have reached an equilibrium steady
state in the process of iterative grid points distribution, we can always find
a maximizer Z such that the triangle (P, Q, Z) obeys the minimal-angle-rule
and this results in a high quality mesh.
When the point Z is found, we add the newly created triangle into the list
of surface triangles Ts and remove the abscissa (P, Q) from the list A. Then
we check if the abscissas (P, Z) and (Q, Z) are already contained in A; if so,
we remove them from the list. In the opposite case we add them to A. We
repeat the search of a maximizer for the next abscissa in the list, until the list
becomes empty - this means that the subregion aD k is covered with a surface
triangulation. We can sketch the algorithm as follows:
The result of this algorithm is the surface triangulation list Ts. Examples of
two different surface meshes on the domain D from Fig. 1 are shown in Fig. 3.
Obviously the quality of the mesh strongly depends on the distribution
of grid points. In general, it is extremely difficult to give a proof of stabil-
ity of the meshing algorithm or statements about the quality of the mesh.
According to our experience, if the disribution of grid points is reasonable,
the meshing algorithm is stable and produces meshes of very good quality,
containing prevalently almost-equilateral triangles.
5 Volume meshing
We use a generalization of a two-dimensional algorithm [4] for the construction
of the volume mesh Tv. The input parameters are the list Ts of surface triangles
and the set of inner grid points. The idea of the volume meshing algorithm is
similar as in the case of the surface mesh, again based on the AF technique.
We start with a first triangle (P, Q, R) in the list Ts. For this triangle we find
a grid point Z such that the sum of the angles (P, Z, Q) + (Q, Z, R) + (R, Z, P)
is maximal. According to our experience, this criterion turned out to be op-
timal for the meshing algorithm. Next we add the newly created tetrahedron
(P, Q, R, Z) into the list Tv of tetrahedra and delete the triangle (P, Q, R) from
the list Ts. We check if the triangles (P, Q, Z), (P, R, Z) and (Q, R, Z) are con-
tained in Ts. If so, we remove them from the list. In the opposite case we add
them to the list. Then we proceed to the next triangle in Ts and so on, until
Ts becomes empty. The algorithm is written as follows:
Fig. 3. Meshes on the example geometry from Fig. 1. The first one is gradually
refined towards the thinner end of the spiral and has 1460 surface elements. The
other consists of 1872 uniformly-sized elements. In both cases geometry analysis
parameters TOLn = 0.15 and TOLan = 0.03 were used.
428 J. Ha.skovec, P. Solin
6 Parallelisation
All algorithms utilized for the adaptive oct-tree analysis of the boundary an,
distribution of grid points and generation of the surface and volume meshes
are extremely well suited for running in parallel. Since we are in the middle of
parallelization of the code, let us only give a brief description of the techniques
we use.
When distributing the grid points, the most time-consuming part of the algo-
rithm is the computation of the total force acting on the particles. With Ncpu
processors at our disposal, one divides n into Ncpu subdomains. No domain
decomposition method is needed since splitting the initial cube C does the
job. An independent instance of the algorithm is run for each of the resulting
subdomains.
The surface of the domain is again split into subregions of simpler shapes in
the same spirit as in Section 4. The construction of the surface triangulation
On a Novel Technique for Parallel Unstructured Mesh Generation in 3D 429
References
1. Cheung, Y.K., Lo, S.H., Leung, A.Y.T. (1996): Finite Element Implementation,
Blackwell Science, Oxford
2. George, P.L. (1991): Automatic Mesh Generation: Application to Finite Element
Methods, Wiley, New York
3. Schneiders, R.: http://www-users . informatik. rwth-aachen. derroberts/me-
shgeneration.html
4. Solin, P., (2000): On a Mesh Generation Technique Based on a Special Smoothing
Procedure for Uniform Inner Point Distribution. Acta Technica ASCR, 45, 397-
417.
Adaptive Finite Element Methods for
Turbulent Flow
Summary. We present recent results using adaptive finite element methods, based
on a posteriori error estimates, to compute various output functionals for incom-
pressible flow problems in 3d, for both laminar and turbulent flows. The a posteriori
error estimates are based on the solution of an associated dual problem with data
connected to the output functional we want to compute.
1 Introduction
We present recent results from [15, 14], extending earlier results in [17, 13]'
where we use adaptive finite element methods, based on a posteriori error esti-
mates, to compute various output functionals in incompressible flow problems
in 3d, for both laminar and turbulent flows. The a posteriori error estimates are
based on the solution of an associated linearized dual problem that contains
information about error propagation in space-time.
The idea of using duality arguments in a posteriori error estimation goes
back to Babuska and Miller [2] in the context of postprocessing 'quantities of
physial interest' in elliptic model problems. A framework for more general sit-
uations has since then been systematically developed by in particular Eriksson
& Johnson and Becker & Rannacher, with coworkers, see e.g. [6, 4, 19, 20].
Applications to incompressible flow have been increasingly advanced with com-
putation of functionals such as the drag coefficient for 2d stationary benchmark
problems in [3,9], and drag and lift coefficients and pressure differences for 3d
stationary benchmark problems in [15]. In [17] time dependent problems in 3d
are considered, and the extension to Large Eddy Simulation LES of turbulent
flow is investigated in [13]. In [14] a temporal mean of the drag coefficient of
a surface mounted cube in a turbulent channel flow is computed using aLES.
If we use a subgrid model in a LES, the subgrid modeling error is included
in the a posteriori error estimates, which opens the possibility of comparing
the error using different subgrid models. Altogether, the a posteriori error esti-
mates open the possibility of adaptively choosing both an optimal mesh and an
optimal subgrid model. This approach to a posteriori error estimation with re-
spect to the averaged solution, using duality teqniques, in terms of a modeling
Adaptive Finite Element Methods for Turbulent Flow 431
where u(x, t) = (Ui(X, t)) is the velocity vector and p(x, t) the pressure of the
fluid at (x, t), j is a given driving force, v is the kinematic viscosity, and
I = (0, T) is a time interval. We assume that (1) is normalized so that the
reference velocity and typical length scale are both equal to one. The Reynolds
number Re is then equal to V-I.
For low Re we may have time independent solutions that satisfies the sta-
tionary Navier-Stokes equations, where we simply drop the time derivative in
(1). For higher Re we have time dependent solutions, and for sufficiently high
Re we get turbulent solutions.
In a turbulent flow we are typically not able to resolve all scales of motion
computationally. We may instead aim at computing a running average u h of
u on a scale h, defined by
u h (x, t) = h13 1
Qh
u(x + y, t) dy, (2)
11 R7· w71
where
EJ( = Li K
(5)
4 Numerical examples
(6)
where II·IIK is a norm on the element K, based only on the size of the residuals,
coupling to energy estimates (see e.g. [1]).
_ 2FD(U,P)
CD = U2DH ' (7)
where FD (u, p) is the drag force on the cylinder. Based the results in [25] we
choose CD = 7.6 as our reference value.
In Figure 1 we compare the convergence rates of the two error indicators
(5) and (6) with respect to the reference value CD = 7.6. It is obvious that the
refinement criterion (5), based on both the residual and the solution to the
dual problem, does a better job than the refinement criterion (6), solely based
434 J. Hoffman, C. Johnson
Fig.!. Convergence rates for the computation of the drag coefficient CD (left), and
the pressure difference .dp (right), for duality based refinement ('0') and residual
based refinement ('* '), as a log-log plot of number of unknowns versus relative errors.
on the residuals without any information from the dual problem relating the
residual to the error in the drag coefficient CD.
We then evaluate the a posteriori error estimates as a stopping criterion
for the adaptive algorithm by introducing the notion of an effectivity index
Ieff' defined by
lei I = estimated error/true error, (8)
and in Table 1 we present Ieff as a function of the number of unknowns. The
a posteriori error estimates in this case are quite sharp. After a few initial
refinements the error estimates is off by less than a factor 2, and may thus be
useful as a stopping criterion.
Table 1. Effectivity indices Ief f = estimated error/true error for computing the
drag coefficient CD (left), and the pressure difference .dp (right), as functions the
number of unknowns.
We now use Algorithm 1 to compute the temporal mean of the drag coefficient
CD over a time interval I = [To, T], with To = 10 and T = 20, defined by
1 (
CD = IT _ Tal lTo CD(t) dt, (9)
where CD(t) is the drag coefficient at time t for a surface mounted cube in
a turbulent channel flow, using a cG(1 )cG(I) method (continuous piecewise
linears in space-time) for both the primal and the dual problem, on tetrahedral
meshes '4 that we choose to be constant in time for each iteration k. In the
definition of the LES for the adaptive step k we let h = h(x) be defined to be
the piecewise constant function that equals the diameters of the finite elements
in the computational mesh '4.
In our computational model we use the Navier-Stokes equations to model
the incompressible fluid around a cubic body of dimension H x H x H that sits
on the floor of a rectangular channel oflength 15H, height 2H, and width 7H,
centered at (3.5H, 0.5H, 3.5H). At the inlet we use a velocity profile interpo-
lated from experiments, we use no slip boundary conditions on the body and
the vertical boundaries, slip boundary conditions on the lateral boundaries,
and a transparent outflow boundary condition. The viscosity v is chosen to
give a Reynolds number Re = UbH/v = 40.000, where we have used Ub = 1.0.
We use no subgrid model in the computations, but we use the following
scale similarity subgrid model
(10)
from [22] to estimate the modeling residual, see [14], measuring the small scale
influence on the resolved scales.
In Figure 2 we plot the mean drag coefficient as a function of number of
degrees of freedom. We find that even though we do not reach full convergence
using the avaliable number of degrees of freedom, the value for the mean drag
coefficient seems to asymptotically approach a value between 1.45-1.5. We
know of no experimental reference values of CD, but in [21] CD is approximated
436 J. Hoffman, C. Johnson
:. J-------~
r
'V
Fig. 2. Mean drag coefficient CD over the time interval [10, 20] as a function of the
number of degrees of freedom (left), discretization error eD ('0') and modeling error
eM (' *') after 13 adaptive mesh refinements as functions of the length of the time
interval [To, T], with T fix and To varying, assuming Uh(To) = uh(To) (middle),
and a posteriori error estimates of the discretization error eD ('0') and the modeling
error eM (' *') for the time interval [10,20], as functions of the number of degrees of
freedom in a l091O-l091O plot (right).
Remark 1. The use of a stabilized Galerkin finite element method in the com-
putations may be viewed as a type of subgrid model in itself, since we then
in fact solve a modified set of equations using a standard Galerkin method.
We will further investigate this relation between numerical stabilization and
subgrid modeling in a continuation of this work. In this paper we only con-
sider the stabilization to be part of the numerical method and not an explicit
sub grid model.
5 Summary
In this paper we have presented results from [15, 14], extending earlier results
in [17, 13], where we use adaptive finite element methods based on a posteriori
error estimates to compute approximations of output functionals in incom-
pressible fluids, for both laminar and turbulent flow. The a posteriori error
estimates are based on the solution of an associated linearized dual problem,
and are used as error indicators for the adaptive mesh refinement algorithm.
In the problem of computing the mean drag coefficient in a turbulent chan-
nel flow, we emphasize the local nature of turbulence that makes adaptive
methods ideal for efficient and accurate computations. Due to the computa-
tional goal of approximating the mean drag coefficient we refine the mesh
according to the corresponding a posteriori error estimate, resolving scales of
motion corresponding to local Reynolds numbers of about 1000, and in the-
ory we would be able to resolve local scales of motion corresponding to local
Reynolds numbers of the order 105 to a similar computational cost.
In continuations of this study we will address methods for sharp estimation
of the modeling residual based on scale extrapolation, as well as adaptive
strategies to combine numerical stabilization with subgrid modeling for LES.
Acknowledgments
The first author would like to aknowledge the support by DOE grant DE-
FG02-88ER25053.
438 J. Hoffman, C. Johnson
References
1. M. AINSWORTH AND J. T. ODEN, A posteriori error estimation infinite element
analysis, Computat. Meth. Appl. Mech. Eng., 142 (1997), pp. 1-88.
2. 1. BABUSKA AND A. D. MILLER, The post-processing approach in the finite ele-
ment method, iii: A posteriori error estimation and adaptive mesh selection, Int.
J. Numer. Meth. Eng., 20 (1984), pp. 2311-2324.
3. R. BECKER AND R. RANNACHER, A feed-back approach to error control in adap-
tive finite element methods: Basic analysis and examples, East-West J. Numer.
Math., 4 (1996), pp. 237-264.
4. - - , A posteriori error estimation in finite element methods, Acta Numerica,
10 (2001), pp. 1-103.
5. M. BRAACK AND A. ERN, A posteriori control of modeling errors and discretiza-
tion errors, SIAM J. Multiscale Modeling and Simulation, (2003).
6. K. ERIKSSON, D. ESTEP, P. HANSBO, AND C. JOHNSON, Introduction to adaptive
method for differential equations, Acta Numerica, 4 (1995), pp. 105-158.
7. C. FOIAS, O. MANLEY, R. ROSA, AND R. TEMAM, Navier-Stokes Equations and
Turbulence, Cambridge University Press, 200l.
8. T. GATSKI, M. Y. HUSSAINI, AND J. LUMLEY, Simulation and Modeling of
Turbulent Flow, Oxford University Press, 1996.
9. M. GILES, M. LARSON, M. LEVENSTAM, AND E. SULI, Adaptive error control
for finite element approximations of the lift and drag coefficients in viscous flow,
Technical Report NA-76/06, Oxford University Computing Laboratory, (1997).
10. J. HOFFMAN, On dynamic computational subgrid modeling, Numerical Analysis
Group Preprint, Oxford University, (1999).
11. - - , Dynamic Subgrid Modeling for Scalar Convection-Diffusion-Reaction
Equations with Fractal Coefficients, Multiscale and Multiresolution Methods:
Theory and Application (Ed. T. J. Barth, T. Chan and R. Haimes), Lecture
Notes in Computational Science and Engineering, Springer-Verlag Publishing,
Heidelberg, 200l.
12. - - , Dynamic subgrid modeling for time dependent convection-diffusion-
reaction equations with fractal solutions, International Journal for Numerical
Methods in Fluids, Vol 40, (2001), pp. 583-592.
13. - - , Adaptive finite element methods for les: Duality based a posteriori error
estimation in various norms and linear functionals, submitted to SIAM Journal
of Scientific Computing, (2002).
14. - - , Adaptive finite element methods for les: Computation of the drag coef-
ficient in a turbulent flow around a surface mounted cube, submitted to SIAM
Journal of Scientific Computing, (2003).
15. - - , Computation of functionals in 3d incompressible flow for stationary bench-
mark problems using adaptive finite element methods, submitted to Mathematical
Models and Methods in Applied Sciences (M3AS), (2003).
16. - - , Subgrid modeling for convection-diffusion-reaction in 2 space dimensions
using a haar multiresolution analysis, to appear in Mathematical Models and
Methods in Applied Sciences, (2003).
17. J. HOFFMAN AND C. JOHNSON, Adaptive finite element methods for incompress-
ible fluid flow, Error Estimation and Solution Adaptive Discretization in Com-
putational Fluid Dynamics (Ed. T. J. Barth and H. Deconinck), Lecture Notes
in Computational Science and Engineering, Springer-Verlag Publishing, Heidel-
berg, 2002.
Adaptive Finite Element Methods for Turbulent Flow 439
1 Introduction
We consider the deposition of thin glassy films on the surface of substrates
such as silicon by molecular beam epitaxy. Such processes play an important
role in materials science with regard to the coating of surfaces in order to
obtain specific surface properties (cf., e.g., [11]).
In particular, we assume that the particle beam is impinging perpendicularly
to the surface of the substrate (cf. Fig. 1).
Denoting by D := [0, L]2 C IR? the surface of the substrate, the deposition
process can be described by the temporal and spatial distribution of the height
profile u(x, t), xED, t ;:::: 0, as given by
where the right-hand side 9 describes the surface growth. There have been
many attempts to establish appropriate models for the morphology of deposi-
tion processes featuring amorphous surface growth (cf., e.g., [1, 4, 13]). Here,
Numerical solution of a nonlinear evolution equation 441
Particle Beam
jjjjjjjjjj
Film H(x, t)
following [5, 10] and [11], we take three major growth mechanisms into ac-
count:
The first one describes surface growth due to particle interaction
(3)
where S stands for the influence of interatomic and van der Waals forces.
The second one is curvature induced surface relaxation
where ai < 0 , 1 ::; i ::; 3. The evolution equation takes the form
with an initial condition u(x, 0) = uo(x) , xED, and either periodic boundary
conditions or homogeneous Neumann boundary conditions on r = aD.
442 R.H.W. Hoppe, E. Nash
The constants aI, a2 and a3 in the evolution equation are usually determined
by parameter identification with respect to experimentally obtained measure-
ments by using Auger spectroscopy and scanning electron microscopy.
However, if the particle beam does not impinge perpendicularly, there are
overhangs in the profile and even topological changes due the formation
of inclusions. In this case, we must use the original form of the functions
gi = gi(U) , 1:::; i :::; 3, as given by (3), (4), (5), and resort to other techniques
as, for instance, level set methods (cf., e.g., [8, 9]; see also [3]).
We note that the nonlinear 4th order evolution equation (6) resembles the
well-known Cahn-Hilliard equation which describes spinodal decomposition,
i.e., phase separation in binary alloys (cf., e.g., [6]).
The paper is organized as follows: In section 2, we will briefly address the
dynamics of the growth process which features some unexpected initial linear
behavior. This motivates the use of a combined spectral element/finite ele-
ment approach for the numerical solution of the nonlinear evolution equation
(6) that is described in sections 3 and 4. Finally, in section 5 we will give some
simulation results in terms of visualizations of the height profile for different
film thicknesses.
Then, the direct sum of X+ and X++ can be shown to be a dominant sub-
space which determines the dynamical behavior of solutions to the nonlinear
evolution equation in the following sense:
Numerical solution of a nonlinear evolution equation 443
U y=x+ + x++
flO := L -2 J Uo(x) dx
n
Then there exists t* > 0 such that the solution u = u(x, t), xED, t E [0, t*),
of (6) stays with probability 1 in a vicinity of the dominant subspace
Figure 2 illustrates the initial linear behavior of the solution in case flo = O.
We note that a related result for the Cahn-Hilliard equation has been estab-
lished in [12].
m v XN d X = j UN
j uN' m-I XN d X + Tm j[ aIu m
N 'v + (10)
n n n
+a2.1u7J'v + f(u7J,v-I)jLlXN dx
where U;;.,-I := u7J- I. In this case, we can explicitly solve for the components
of the new iterate
(11)
where
We only have to evaluate the nonlinear terms r::"v-I which can be efficiently
done by the Fast Fourier Transform in case of periodic boundary conditions and
by the Fast Cosine Transform for homogeneous Neumann boundary conditions.
In both cases, this is done with respect to an equidistant grid consisting of M2
grid points where M has to be chosen larger than the dimension of the trial
space VN in order to avoid aliasing effects.
Numerical solution of a nonlinear evolution equation 445
J
n
umXdx = J
n
um - 1Xdx + T; J
n
[a1(u m + um - 1) + (12)
(13)
The spectral Galerkin method becomes inefficient when the nonlinear dynam-
ics sets in, i.e., when the solution leaves the dominant subspace Y as given by
(7). Although a theoretical bound for the exit time t* is known (cf., e.g., [2]),
446 R.H.W. Hoppe, E. Nash
ou/Ot Llw
} in Q := D x [0,(0) (15)
w a1u + a2Llu + a31\7u1 2
We discretize in time by the implicit Euler method and in space by continuous,
piecewise linear finite elements with respect to a simplicial triangulation Th of
n. Denoting by 8 1 (D; Th) the associated finite element space, for each time-
step we have to solve the nonlinear system of equations:
Find (uh',wh') E 8 1(D;Th) x 8 1(D;Th) such that
1 ~mur;:-l + 1
n
uh' Xh dx
n
\7wh" \7Xh dx = 0, Xh E 8 1(D;Th) , (16)
1
a1
n
1
Uh'1/Jh dx - a2
n
\7uh' . \71/Jh dx + a3 1
n
l\7uh'1 21/Jh dx - (17)
-1 n
Wh'1/Jh dx = 0, 1/Jh E 8 1(D;Th) .
where the Newton increment (0:,1/,0:';",) is the solution of the linear system
1
n
o:'Vxedx + Tm 1
n
\7o:t . \7xe dx = 1
n
u,!,-l xedx - (18)
1 n
u;"Vxedx Tm 1
n
\7w;',I/. \7xedx Xe E 8 1(D;Th, ,
1
n
0m,Vol,
Wi 'f/e
dx a1 1
n
0:,V1/Je dx + a2 1
n
\7o:'V . \71/Je dx - (19)
1 n
f'(u'!',V)0':e,V1/Je dx - 1
n
W'!"V1/Je dx + a1 1
n
u;"v 1/J~dx
1
- a2
n
\7u;"V . \71/Jedx + 1
n
f(u;"V)1/Je dx
Numerical solution of a nonlinear evolution equation 447
5 Simulation results
We have used the combined spectral element/finite element approach for the
numerical simulation of the deposition of the metallic glassy film Zr AlCu on
silicon substrates.
The computational domain fl has been chosen as a square of length L = 200nm
in each direction. Periodic boundary conditions have been imposed and the ini-
tial height profile uo(x) , x E fl has been determined randomly with ilo = O.
In the spectral element approach we have used 125, 200, and 250 modes per di-
mension, whereas for the computation of the nonlinearities by the Fast Fourier
Transform a uniform grid with M = 400 grid points in each direction has been
employed which is sufficiently large to avoid aliasing effects.
In the finite element method we used a hierarchy (1()~=o of five simplicial
triangulations with ho = 1/25 and h4 = 1/400.
,
I
1~1
I:
[...,.,)
, I:
Fig. 3. Computed height profile for different film thicknisses [100 nm (left), 360 nm
(middle), and 480 nm (right)]
Figure 3 displays the computed height profiles for different film thicknesses in·
a grey scale ranging from black (0 nm) to white (4 nm). One clearly observes
the effect of structure coarsening: a surface pattern with a mesa-like structure
evolves featuring hills with fiat plateaus that are separated by narrow deep val-
leys. We note that already for 125 modes per dimension the computed profiles
are both qualitatively and quantitatively in good agreement with experimen-
tally obtained data (cf., e.g., [11]).
448 R.H.W. Hoppe, E. Nash
References
1. A.L. Barabasi, and H.E. Stanley (1995): Fractal Concepts in SurfaceGrowth. Cam-
bridge Univ. Press Cambridge
2. D. Blamker (2000): Stochastic Partial Differential Equations and Surface Growth.
Wissner Augsburg
3. R.H.W. Hoppe, W.G. Litvinov, S.J. Linz (2003): On solutions of certain classes
of evolution equations for surface morphologies. Journal of Nonlinear Phenomena
in Complex Systems 6, 582-591
4. M. Kardar, G. Parisi, Y.-C. Zhang (1986): Dynamic scaling of growing interfaces.
Phys. Rev. Lett. 56, 889-892
5. S.J. Linz, M. Raible, and P. Hiinggi (2000): Stochastic field equation for amor-
phous surface growth. Lecture Notes in Physics 557, 473-483
6. S. Maier-Paape, T. Wanner (2000): Spinodal decomposition for the Cahn-Hilliard
equation in higher dimensions: Nonlinear dynamics. Arch. Rat. Mech. Anal. 151,
187-219
7. E.M. Nash (2001): Finite-Elemente und Spektral-Galerkin Verfahren zur nu-
merischen Lasung der Cahn-Hilliard Gleichung und verwandter nichtlinearer Evo-
lutionsgleichungen. Shaker Aachen
8. Ch.-D. Nguyen (2003): Level set methods for nonlinear deposition equations. Dis-
sertation. Institute for Mathematics, University of Augsburg
9. Ch.-D. Nguyen, R.H.W. Hoppe (2003): Amorphous surface growth via a level set
approach. submitted to Nonlinear Analysis: Theory, Methods, and Applications
10. M. Raible, S.J. Linz, P. Hiinggi (2000): Amorphous thin film growth: Minimal
deposition equation. Phys. Rev. E 62,1691-1705
11. M. Raible, S.G. Mayr, S.J. Linz, M. Moske, P. Hiinggi, K. Samwer (2000): Amor-
phous thin film growth: theory compared with experiment. Europhys. Lett. 50,
61-67
12. E. Sander, T. Wanner (2000): Unexpectedly linear behavior for the Cahn-Hilliard
equation. SIAM J. Appl. Math. 60, 2182-2202
13. D.E. Wolf, J. Villain (1990): Growth with surface diffusion. Europhys. Lett. 13,
389-394
Constrained Mountain Pass Algorithm for the
Numerical Solution of Semilinear Elliptic
Problems
Jifl Horak
The idea of the current approach is to "reduce" the number of the "direc-
tions of decrease" of the functional by introducing constraints on admissible
functions. Roughly speaking, we could expect to reduce mountain pass points
to constrained local minima, or more complicated saddle type points to, for
example, constrained mountain pass points.
Define a constraint given by a new functional J. Let 5 = {u E HJ (Q) \
{O} I J(u) := In [IV'uI2 - u 4 ] dx = O}. By testing (1) with u we find that all
nontrivial weak solution of (1) belong to 5. Instead of looking for critical points
of I we will look for critical points of I with respect to 5, i.e., we will solve
Constrained Mountain Pass Algorithm 451
1 Theoretical Background
The mountain pass algorithm of [4] is based on the classical mountain pass
theorem of [1]. The constrained mountain pass algorithm presented in this
contribution is based on the constrained mountain pass theorem. We review
both theorems in this section.
Let B be a real Banach space and I E C1(B,JR) a continuously Frechet
differentiable functional.
Definition 1. I satisfies the Palais-Smale condition at the level a E JR if any
sequence {un} C B such thatI(u n ) ----+ a andI'(u n ) ----+ 0 possesses a convergent
subsequence.
d
dt((t) = -P((t)"VI(((t)) , ((0) = (0 E S , (4)
where "V I (u) is defined as the Riesz representation of the Fn§chet derivative
J'(u) and Pu is the orthogonal projection on the tangent space of S at u E S
and is given by
Constrained Mountain Pass Algorithm 453
k
PuV = V - I>lOj'VJj(U) VEH,
j=1
k
'L)''V Ji(u), 'V Jj(u))Oj = J:(u)v for i E {I, ... , k}.
j=1
Step 2 - Projection Since un+! lies in the tangent space of S at Un but not
necessarily on S we need to approximate it by some U n +1 E S. This is usually
accomplished by scaling (cf. examples of Sec. 3).
Main Loop First, find the maximum of I on the path, i.e., find jm with
I(zjm) ::::: I(zj) Vj. Use interpolation to improve the maximum by moving Zjm
closer to Zjm+ 1 or Zj",-l'
Second, update Zjm by moving it in the direction of the steepest descent of
I projected to the tangent space of Sat Zj", to decrease the value of I(zj",)' In
fact, this amounts to the application of the two steps of the constrained steepest
descent method 2.1 (with Zj", instead of Un, Un+l is then the updated Zjm)'
Repeat these two steps until one of the following occurs:
1. the value of I(zj",) cannot be decreased any further,
2. in several recent consecutive repetitions of the loop the index jm has always
been the same - infinite loop.
In the first situation IfPzj ", 'VI(Zj",) I = IWlszj ", I is too small, i.e., Zjm is an
approximation of the desired critical point, the algorithm stops. In the second
situation the path needs to be refined.
It is sometimes useful and more efficient to design a Newton scheme for problem
(3) and to use the solution obtained by CMPA as an initial guess for this
scheme. The approximate constrained mountain pass solution does not need
to be very precise. This means we can stop the CMPA early and do not need
to refine the path that many times.
3 Examples
Fig. 2. Solutions of (5) found by CSDM. Approximate values of (fL, v) and u(n):
(a) (52.6,47.8), [-2.50,1.66], (b) (65.4,34.3), [-2.19,1.91]
These local minimizers are used as endpoints of the path in CMPA. The
algorithm converges to the solution in Fig. 3(b). During the run of CMPA the
size of 111'lszj = I = IlPzj = \7 1(zj=) II is checked and if it is small for a number of
iterations but later grows again, we may use the point Zjm as an initial guess
in Newton's method. If this method converges, we obtain a numerical solution
that is most likely different from the one to which CMPA eventually converges.
The solution in Fig. 3(a) was obtained this way.
A finite element method with piecewise linear functions on a triangular
grid with 11,097 nodes and 21,760 triangles was used, the path in CMPA had
P = 50 points.
{)2U
b. 2u+c 2 ""2+u+g(u)=0 xE]R2, (6)
uX 1
Fig. 3. Solutions of (5) found by CMPA. Approximate values of (p" 1/) and u(D):
(a) (103.2,45.8), [-2.55,1.90]' (b) (105.7,43.4), [-2.41,1.84]
Although the domain is the whole plane ]R2, for the numerical purposes
we can work on a large enough (but bounded) rectangle because our solutions
decay to zero as Ixi ----> 00 . Further, any translation of a solution of (6) is also a
solution with the same values of I and J. These translations can be prevented
by assuming symmetries of solutions (as in [3, 7]): U(Xl,X2) = U(-Xl,X2)
and U(Xl,X2) = U(Xl, -X2) \f(Xl,X2) E ]R2. Hence we can work on a rectangle
[-Kl' 0] x [-K2' 0]. On the boundaries Xl = -Kl and X2 = -K2 the conditions
U = 0 and ~u = 0 are implemented, on Xl = 0 and X2 = 0 the symmetry
conditions.
" "
CSDM with C = 150 yields numerical solutions shown in Fig. 4 (the profiles
of the waves have been highlighted, only a part of the computational domain is
shown). These types of solutions were found in [7] by the unconstrained moun-
tain pass algorithm. The projection step (Sec. 2.1) in the numerical methods is
again accomplished by scaling: for u E H2\ {O} find t > 0 such that U = til E S.
The two numerical local minima can be then used as end points of the
path in the constrained mountain pass algorithm. The algorithm converges to
the solution shown in Fig. 5(b). As noted in Sec. 3.1 already, by stopping the
algorithm early, if IWls, . II stays small for a number of iterations, and by
J~
References
1. Ambrosetti, A., Rabinowitz, P .R. (1973): Dual variational methods in critical
point theory and applications. J. Functional Analysis, 14, 349-381
458 J. Horak
Summary. Intake port shape affects the quality of an engine significantly. In this
paper, we present a method for improving an existing geometry with evolutionary
algorithm type optimization. Characteristic parameters of different port shapes are
calculated with a self-developed CFD program. However only small deformations al-
lowed on original design, significant improvement achieved. Proposed robust parallel
evolutionary algorithm seems to be a suitable for other optimization problems on
heterogeneous non-reliable cluster of workstations.
1 Introduction
cylinder
outlet
Fig. 1. An opaque view of surface and a cross section of intake port shape
Fig. 2. Cut of tetrahedral mesh and stationary velocity field near the valves
C _ m/Pa (1)
f - A Va '
where Po is the density of air at the inlet, A is the approximate area of
smallest intake cross section (two times the valve inner seat area), Vo =
/2(Pin - Pout)/ Po is the characteristic velocity based on pressure drop and
B is the cylinder bore.
462 A. Horvath, Z. Horvath
m= r
}sout
pvdS, T = r
) Bout
p(v x (r - ro))dS
The main steps of finding a shape optimization strategy in our problem were
the followings:
We used local, smooth deformations on the original grid. This way we did
not have to generate tetrahedral mesh for each deformed shape. Instead of
this very time consuming step we modified only the coordinates of vertices
and recalculated the geometrical parameters (volume, face normals, etc.) of
tetrahedra.
The overall deformation of the shape was put together from elementary
deformations. Each elementary deformation shifted the vertices only inside a
specified cylinder with a parallel axis with surface normal. (See Fig. 3.) The
shift of vertices is a fourth order polynomial of the distance from the axis of
cylinder and the depth relative to deformation center.
Optimal Shape Design of Diesel Intake Ports 463
This choice fits to the engineering requirements also: we could assure that
the deformations keep small and keep some places (e.g. the valves, the top of
the cylinder) unchanged.
We used combinations of such elementary deformations. This way parame-
trization of deformations consists from a list of parameters of elementary defor-
mation, i.e. a list of deformation centers, radii and size. (Height of deformation
cylinder is not a free parameter: it does not affect the shape but the tetrahe-
dral mesh. We have to choose its value carefully depending on deformation
parameters. )
Our goal is to maximize Cf while keeping C s close to the original value. There-
fore we calculated the parameters with no deformations and used them as
reference values C,/f and c;e f .
Our object function was:
Ob·= C _D(1_~)2
J
f
C,/f C;ef
(2)
optimal value rather an optimal interval, that is a few percent change is not
significant.) For example: Obj = 1.02 means at least 2% improvement in mass
flux. (Depending on swirl number change.)
During the CFD calculations we observed significant "noise" in C f and Cs.
For example we ported our C code to two different hardware architectures and
found a difference in the order of 10- 5 in C f and 10- 3 in Cs. Similar differences
appeared on the same architecture in the case of very small deformations.
This noise derives from the rounding errors and the nature of calculations: the
convergence to the stationary flow is slow in Cs.
It means that object function has a lot of false local maxima as deformation
parameters vary. For this reason we decided to use a genetic type algorithm
for optimization. It is a common decision in shape optimization problems. (See
e.g. [6])
Genetic (GA) or evolutionary (EA) type algorithm have a lot of variants. (See
[2], [8]) Depending on the problem and the hardware possibilities different
versions of GAlEA should be used.
The main peculiarities of our problem and possibilities are the following:
- The workstations we can use have different CPU-speed.
- Calculation time of object function (a complete CFD simulation) depends
on geometry and takes 3-4 ours on our fastest workstations.
- We can use approximately 20-30 workstations for a few weeks therefore
2000-4000 object function evaluations is possible.
- There is significant probability of hardware errors during the calculations.
It is obvious that a classical GAlEA with master-worker type paralleliza-
tion is not suitable in our circumstances. One reason is the significant proba-
bility of hardware errors other is the different evaluation time. Both can lead
to a significant loss off efficiency due to the synchronization stages.
Furthermore, an island type parallelization with one island per workstation
is not a good choice either, because each workstation can calculate only 2-4
full generation with reasonable population size. Since new chromosomes from
other islands can be included only at the end of a complete generation cycle,
frequency of communication between different subpopulations is too small for
efficient parallelization.
evaluated chromosomes and object function values to the local hard disk and
checks it for evaluated chromosomes arrived from other machines. "Good"
chromosomes are moved between EA-units by "agent" programs which are
running independently from EA-processes.
This way we achieved stability against hardware failures and strong con-
nection between the EA-units with zero synchronization loss. It is clearly seen
that if an EA-unit fails, the other can work further, if the connection between
units is lost (network error), the units can continue on evaluating object func-
tions and if the workstation executing the agent program fails the agent can be
restarted on an arbitrary workstation. Even after a global failure the EA-units
could read back the evaluated chromosomes and continued working. However
each failure decreases the efficiency, the whole calculation will not stop and
after reparation the calculation can continue at full capacity.
Certeanly we need a special EA algorithm to guarantee that the new results
coming from other EA-units are used as soon as possible. In a classical GAlEA
cycle it is not possible. Therefore we were searching for a more flexible approach
and found the concept of "Flexible Evolution Agents" (FEA-s, see [9]).
FEA-s has not a strictly prescribed sequence of different genetic operators
but a central "decision engine" decides about which genetic operator (muta-
tion, crossover, etc.) will be executed in the next step. This kind of flexibility
is used to get adaptivity property of EA: a learning engine collects statistics
about success of operators and decision engine uses this information to choose
the operators to be execute next time. (See [9])
We did not implemented the adaptivity of FEA because each EA-unit
executes only 100~200 genetic operators in our case and it is too small for
a reliable statistics. However the non-deterministic order of genetic operators
allowed us to read and use the results of object function evaluation of other
units provided by communication agent program.
The skeleton of a robust and parallel EA-unit:
1. Population=empty
2. sort the population with niching
3. if ( there are new chromosomes on disk)
read new values
4. if ( Population. size < Size_min)
generate and evaluate a random chromosome
--> step 2
5. if ( Population. size > Size_max)
truncate Population
6. if ( the best of Population changed
try one line search step between
old and new best value
--> step 2
7. find a new chromosome by a random elementary step
8. evaluate the new chromosome
466 A. Horvath, Z. Horvath
N=l - N..16ref.Hftrla!
N~4 •• -_._- N=15 w~h sifl"iUated errOf!
0_8 N=16 -•.•••
o~o~~~~-=~=-~~~~~=-~rom~~,=~~,= °o~--~~~~~=-~~~~~~-rom~~mm~~'=
ObjooC fundion 8\111;l ....Hons Obj!!dltn:lionllvalulltions
multilevel strategy has a close relationship with island model; in our case
a group of EA-units matches with an island.
In the real CFD shape optimization problem we could use at most 32
workstations. Following the multilevel agent strategy, we divided them into
3 groups (A, B and C) with approximately equal members and implemented
a non-symmetric data flow at the top level agent program: the best results
of Band C groups was sent to group A periodically. This way group Band
C evolved separately from other groups which is a good strategy to maintain
diversity, while group A could combine all the best chromosomes.
Both CFD and EA code was written in standard C. The agents were Bourne
shell scripts. The calculations were performed on Linux workstations at
Szechenyi Istvan University. The maximum number of workstations was 32.
The CPU-speeds were between 1.5 and 2.4 GHz.
A typical evaluation took 3-4 hours (on 2.4 GHz Pentium4 machines)
There were 6 significant hardware-problems during calculations. (Power-
outs, hard disk problems, network problems, etc.) This fact proves that the
robustness of EA method had critical importance in our case.
4.2 Preparation
4.3 Calculations
,-
B ••• _-
C···
+~
0_995 ~
1.01,"--~-~_-:-_~_,:--~:-----'
The best chromosome we found has object function value 1.024. It means
a 2.5% improvement in C f and 3% decrease of Cs. In practice it may result in
e.g. aproximately 2.5% extra power with similar quality of burning.
Optimal Shape Design of Diesel Intake Ports 469
I
i
i
Fig. 6. The resulted "optimal" shape. Black color: modified, optimal shape, light
gray: original shape, mid gray: unchanged parts
5 Conclusions
References
au
at + div f(u) =0 Vt E [0, t max ], X E D(t) (1)
u(O, x) = uo(x) Vx E D(O) (2)
+ Be (boundary conditions) (3)
+ EOS (equations of states) (4)
where
- u : UtE ~,tmaxl {t} x D( t) -7 IRK +4 is the state variable with u = (Pl, ... , PK,
pvT,e) where Pk = Pk(t,X) (k = 1, ... ,K), P := L-Pm, v = v(t,x) =
(Vl, V2, V3)T, e = e(t, x) are respectively the density of the fluid components,
the density of the fluid, the fluid velocity and the total energy density (i.e.
the total energy per unit volume of the fluid);
- f = (ft, fz, hf is the flux vector with fi(U)=(P1Vi, ... , PKVi, (pviv+peif,
vi(e + p)T (i = 1,2,3) where ei is the ith coordinate unit vector; div f(u) =
it a~i~~);
- EOS denotes the set of the equations of states of a non-ideal gas mixture:
K K
p=RT'L ~:, e = 'L Pm1m(T) (5)
m=l m=l
Compressible fluids with moving boundaries and applications 473
where T = T(t, x) denotes the temperature of the fluid; Im(T) is the specific
internal energy given a priori by interpolation formulas based on tabulated
values, R is the universal gas constant and Wm is the molecule weight of
the mth fluid component;
- Be: linearly consistent boundary conditions formulated by the help of ghost
cells (for more details see e.g. [3] pp. 457-460, [11] pp. 222-224 with an
emphasis on moving boundaries; see also [2]);
- D(t) is the time dependent flow domain defined in the following way: D(O)
is a given initial domain, which is deforming according to the mapping
<p : [0, t max ] X D(O) --; ]R3, i.e. D(t) := {x = <p(t,~) E ]R31 ~ E D(O) }; we
suppose that <Pt := <p(t,.) is one-to-one for all t; then the velocity of the
points of D is given by K(t,X) := %f(t, <p;l(X)) \It E [0, t max ] and \Ix E D(t).
Note that <p is not unique if we prescribe only the deformation of the boundary
of D, which is the case in the situations we are focussing on in this paper.
The following lemma of the calculus called Reynolds' transport theorem is
the basic tool to obtain an integral formulation for the fluid flow on a moving
domain.
Lemma 1. Let V be a moving subdomain of D, i.e. V(O) C D(O) and V(t) :=
<Pt(V(O)). Then for any 'Ij; : ]R4 --;]R differentiable function we have
Vet)
J ~~ (t, x)dx = !J
Vet)
'Ij;(t, x)dx - J
8V(t)
'Ij;(t, S)K(t, s) . n(t, s) ds.
:t J
Vet)
udx+ J
8V(t)
f(u)·n-K·n u ds = 0 \IV c D moving sub domain of D
(6)
or, integrating in time and using the notation
u = uv(t) := jV~t)1 J
Vet)
u(t, x) dx,
JJ
tb
IT.ttn )1 r
J JTj(t n )
u(tn,x)dx values
JJ
tn
fixed (if K == 0), shifting (if K(t, x) == ,8(t)b) or deforming. We call B of the
latter type if B is translation invariant in the sense that there exists a surface
So C IR3 such that Bo := UtB(t), the frame of B, equals the volume swept by
shifts of So, i.e. Bo = UOE[O,lJ(SO + Bb), and, moreover, there exists a moving
part of the boundary M C 8B such that K(t, x) = ,8(t)b whenever x E M(t)
and K . n = 0 otherwise.
The discretization of the blocks of the first or the second type can be an
arbitrary tetrahedral mesh fitting the geometry at time t = 0 and this is left
unchanged in time (first type) or simply shifted by ,8(t)b (second type). Let
us now assume that B is of deforming type and for the ease of presentation
suppose that So C 8B(O) and M(O) C So (e.g. the block of a valve in its
bottom dead center). The task we have to solve is the discretization of B
at time points t n given by the flow calculation (the "hydrocode") such that
B(tn) and its mesh is derived by the mapping <p from the given B(t n- 1 ) and
its mesh (n = 1,2, ... ); this step is called mesh modification. But first we need
the discretization of the frame Bo = B(O).
The algorithm of the mesh modification Suppose that we are given the
mesh of B (tn-I), <p and Tn. We shall call the layer in the direction of b from
M (tn-I) the moving layer.
1. Compute first the new position of the moving part, i.e. M(t n- 1 + Tn).
2. If the height of the moving layer is smaller than half of the original height
(i.e. 1/£lbl) or even worse: M(t n- 1 + Tn) does not belong to the moving
layer, reduce Tn SO that in the new position of the moving part the height
of the layer is exactly the half of the original height and take M(tn) =
M(t n- 1 + Tn).
3. Inherit the topological structure of the mesh of B(t n- 1 ) to the mesh of B(tn);
only update the coordinates of nodes of M(t n ) according to the prescribed
476 Z. Horvath, A. Horvath
mapping 'P, re-calculate the geometrical data (area of sides, etc.). In this
case we assign the node velocities as mean velocities, for example for the
node A on the moving part we take i'£(t,XA) = (XA(t n ) - XA(tn-1)/Tn .
4. If the height of the moving layer is exactly half of the original, we shall call
the neighbouring layer (in the direction of b) the new moving layer, and
its nodes back in direction b are snapped to the corresponding nodes on
M(t n - 1 ). The nodes of M(t n - 1 ) are snapped back to the initial position
of the former active layer and are deactivated (Le. signed that these points
do not belong from now to the flow domain). Of course, the geometrical
data have to be updated and since the tetrahedra in the new moving layer
corresponding to the moving part are derived by joining two layers the state
variables have to be interpolated (discussed below).
Step 4. in the algorithm above is called the snapping step and the whole
algorithm is the snapper (cf. [2]). For an illustration of this algorithm in 2D
case see Figure 2. A whole 3D tetrahedral mesh has a too complex structure
to illustrate the snapping on it, however the idea can be understood in 2D
and on Figure 3 we present the basic blocks of a 3D mesh, before and after
a snapping.
deform
---...
snap
---... ---...
deform
Due to the mesh generation and the mesh modification algorithms given
above, tetrahedral meshes before and after snapping have good properties
which enables us to do the interpolation of state variables in an effective and
conservative way. One can observe that grids before and after snapping can
be divided into "snapping blocks" of 2 or 3 prisms such way that these blocks
before and after snapping are coincident.
It is obvious that snap pings must be handled in a different way if the base
triangle of a prism of the moving layer is in the interior of M(t n - 1 ) (type 1
snap, 2 prism snapping blocks) or if there are only one or two vertices of the
base triangle on M(t n - 1 ) (type 2 snap, 3 prism snapping blocks). Type 1 and
2 snapping blocks in 2D are marked in Figure 2. In Figure 3 we present the
structure of type 1 and type 2 snapping blocks in 3D. We have to remark
that depending on which vertices of base triangle are on M(t n - 1 ), there are 6
different cases of type 2 snappings.
Fig. 3. Corresponding snapping blocks in type 1 (left) and type 2 (right) cases
The interpolation of the state variables can be done locally, inside the
snapping blocks. This means interpolation between 3 new and 6 old tetrahedra
in type 1, between 9 new and 9 old tetrahedra in type 2 snap. Since our method
is of first order and state variables are of density nature assigned to tetrahedra,
478 Z. Horvath, A. Horvath
1/32/94/278/27 0 0)
( o 1/94/278/274/9 0
o 0 1/272/272/92/3
Notice that this snapping algorithm, including the elements of the interpola-
tory matrices, is independent of the triangulation of the moving surface.
In order to obtain appropriate geometrical parameters, lljl' K,jl and Djl, the
discrete version of the geometric conservation law (GeL) gives us a guideline.
For the concept and importance of the discrete GeL condition consult e.g. [4].
The GeL condition is derived from the fact that the constant flow u(t, x) ==
u' = canst. is a solution of (1) and also its weak form (7), whenever a suitable
Be is prescribed. Hence we have for all moving sub domains V and [tn-I, tn]
(c.f. (7))
(lV(tn)I-IV(tn-1)i) u*+
tn
JJ
- 1
ndsdt f(u*)-
&V(t)
(J J
tn - 1 &V(t)
K· ndsdt) u* =0
JJ
tn
Proof. Substituting uj-l = uj = u* into (8) the statement follows from the
consistency and conservatitity of the method. 0
Lemma 3. The method (8) with a conservative and consistent numerical flux
g and the "snapper" mesh deformation respects the discrete GeL whenever
_ t n- l +tn
njl = njl( 2 ),
_ 1 (12)
Kjl:= "3
A: node of Sjl
Proof. The first relation of (11) follows from the fact that the left hand side
of the equation equals the integral of the outer normal vector over the surface
of Tj(t n - l + t n )/2), which is the zero vector. To prove the second relation of
(11) it is enough to check by (10) that Tn Ll VjlKjl'lljl = JOV(t) K·ndsdt; but
this follows now from the actual choice of parameters and the definition of K.
o
In our actual algorithms applied to problems reported in this paper we used
(12).
We remark that besides considering the discrete GeL condition is natural
to hold true it is proven to guarantee first order accuracy of the scheme (8)
provided the method is in addition accurate on fixed meshes, see [4].
right: ( v ±yyen )
H±yev·n
op Rpm/Wm op RT
W - Pe Im(T).
oe LPmCv,m(T)'
m
;:;-- =
uPm m
Proof. The results follow from tedious computations and can be validated by
checking the definitions. 0
In (9) we have f(u)· n- K,' nu = 6(u, n, K,)u with 6(u, n, K,) := O(u, n) - K,·nI
(I is the (K +4)-by-(K +4) identity matrix). Then we chose for 9 the function
. -u+v + -u+v _
g(u, v, n, K,) = gVijaya .= 0(-2-' n, K,) u + 0(-2-' n, K,) v. (13)
4 Applications
We implemented the numerical algorithm, certainly including the snapper
mesh generation, in ANSI C programming language and applied to several
problems. Here we display some results of two problems. Based on our expe-
riences we may say that the code performs well for the considered test and
real-life problems.
We investigated the gas flow in a high voltage circuit breaker. In these investi-
gations the gas flow was induced by mechanical constraints of a configuration
of pistons and valves only, i.e. the current was taken identically zero. The fluid
flow domain is slightly not axisymmetric but employing its rotational symme-
try of 90 degrees and plane symmetry it is sufficient to compute the flow in its
eighth part. Hence we did not assume a priori the usual axisymmetric formula-
tions (c.f. [8]). The gas was a mixture of two components that were originally
separated. For an illustration of our results see Figure 4. Our code performed
very well: the computed and measured pressure and density values were com-
pared at two control points and there were at most 5% relative errors in the
measured and computed quantities. However, at certain small parts of the fluid
domain we experienced spurious pressure oscillations but these occured only
for a short period during the simulation time. This was somehow expected
(see e.g. [1], [9]), and clipping negative values from the energy approximations
cured the code. Finally we remark that the flow was proven significantly not
axisymmetric (c.f. Figure 4), which justifies our model.
Fig. 4. Graph of YI := PI/pat two points of time in two perpendicular plane sections
(the plane of the cross section and the direction of movement of parts is marked)
References
1. Abgrall, R. (1996): How to prevent pressure oscillations in multicomponent flow
calculations: A quasi conservative approach. J. Comput. Phys., 125, 150-160.
2. Amsden, A.A., O'Rourke, P.J., Butler, T.D. (1989): KIVA-II: A Computer Pro-
gram for Chemically Reactive Flows with Sprays. Report LA-11560-MS, Los
Alamos National Laboratory.
3. Feistauer, M. (1993): Mathematical Methods in Fluid Dynamics. Longman, Lon-
don
4. Guillard, H., Farhat, C. (2000): On the significance of the geometric conservation
law for computations on moving meshes. Comput. Methods Appl. Engrg., 190,
1467-1482.
5. Horvath, A., Horvath, Z. (2003): Application of CFD numerical simulation for
intake port shape design of a diesel engine. Journal of Computational and Applied
Mechanics, 4, 129~ 146
6. Horvath, A., Horvath, Z., Krizek, M.: Tetrahedralization of partitions formed by
prisms. (in preparation)
482 Z. Horvath, A. Horvath
7. Kroner, D. (1997): Numerical Schemes for Conservation Laws. Wiley and Teub-
ner, Chichester Stuttgart
8. Yan, J.D., Fang, M.T.C. and Hall, W. (1999): The development of PC based CAD
tools for auto-expansion circuit-breaker design. IEEE Trans. on Power Delivery,
14, 176-181.
9. Liu, X.D., Fedkiw, R.P., Osher, S., (1998): A quasi-conservative approach to the
multiphase Euler equations without spurious pressure oscillations. CAM Reports,
98-11, UCLA
10. Schenk, K., Bader, G., Berti, G. (1998): Analysis and Approximation of Multi-
component Gas Mixtures. Technical Report, TU-Cottbus
11. Toro, E.F. (1999): Riemann Solvers and Numerical Methods for Fluid Dynamics:
a Practical Introduction. Springer, Berlin Heidelberg
12. Vijayasundaram, G. (1986): Transonic flow simulations using an upstream cen-
tered scheme of Godunov in finite elements. J. Comput. Phys., 63, 416-433
Discontinuous Galerkin Methods for the
Time-Harmonic Maxwell Equations
1 Introduction
n x u = 0 on r. (4)
In this section, we introduce interior penalty DC methods for the two model
problems in (3) and (4), and review their theoretical properties.
DC for the Time-Harmonic Maxwell Equations 485
We consider the case of insulating materials, i.e., [Jo = [J since all the key
difficulties in the numerical treatment of (3) are already present in this par-
ticular case. The DG method for the discretization of (3) with [Jo = [J and
divergence-free source term j is based on the following mixed formulation of
the problem:
Y' X (J-l-1Y' x u) -cY'p=j in [J,
Y'. (cu) = 0 in [J, (6)
n x u = 0, p = 0 on r.
486 P. Houston et al.
- JFh
r [v]r. {{p,-l\1h x u}}ds+ JFr h
ap,-l[u]r. [v]rds,
Here, and in the following, we denote by (.,.) the standard inner product
in L2 (D)d, d 2': 1, and use \1 h to denote the elementwise application of the
operator \1. Further, we use the notation I:h rpds := L.fEFh If rpds. The
form ah (', .) corresponds to the interior penalty discretization of the cur I-cur I
operator, the form bh (·,·) discretizes the divergence operator in a DG fashion,
and the form Ch (', .) is the interior penalty form that weakly enforces the
continuity of Ph.
The parameters a and C in LOO(Fh ) are the usual interior penalty stabiliza-
tion functions defined by
(8)
where D and 'Yare positive parameters independent of the mesh size.
The results contained in the following theorem have been proven and nu-
merically validated in [8].
Theorem 3. Assume that the analytical solution U of (4) satisfies the regular-
ity assumptions eU E HS (Q)3 and /-l-l \l x U E HS (Q)3, for s > ~, and let Uh
°
be the DC approximation on conforming meshes defined by (9). Then there is
a parameter O:min > only depending on the shape regularity of the mesh and
on the polynomial approximation degree £, and a mesh size ho > such that, °
for 0: ~ O:min and 0 < h ::; ho, we have the optimal a priori error bound
The analysis of [4] is based on duality arguments; thus, the result of Theo-
rem 3 can easily be extended to smooth material coefficients /-l and e. However,
the extension to piecewise smooth coefficients requires alternative mathemat-
ical tools; this is the subject of ongoing research.
3 Numerical Example
\l x \l x U - k2u = j in Q, n x u = 0 on r, (10)
with a rescaled right-hand side (again denoted by j) and the wave number
k = Wv/-loeo.
For simplicity, we restrict ourselves to the two-dimensional analogue of (10).
To this end, we let Q be the L-shaped domain (-1,1)2 \ [0,1) x (-1,0] and
DC for the Time-Harmonic Maxwell Equations 489
Here, the boundary conditions are enforced in the usual DC manner by adding
boundary terms in the formulation (9); see [5,7] for details. The analytical
solution given by (11) then contains a singularity at the re-entrant corner
located at the origin of D; in particular, we note that u lies in the Sobolev
space H 2 / 3 -e (D)2, E > O. This example represents a slight modification of the
numerical experiment presented in [4]; cf., also, [1].
We investigate the asymptotic convergence of the DC method on a sequence
of successively finer (quasi-uniform) unstructured triangular meshes for £ =
1,2,3 as the wave number k increases. To this end, in Tables 1, 2, 3 and 4 we
present numerical experiments for k = 1,2,4,6, respectively. In each case we
show the number of elements in the computational mesh, the corresponding
DC-norm of the error Ilu- Uhllv(h) and the numerical rate of convergence r. In
view of the scaling we introduced, we have taken Ilu-Uhllv(h) as (1Iu-UhI16,n+
!
IIV x (u - uh)116,n + Ilh- 2 [u - UhhI16,FJ We observe that (asymptotically)
1
2.
Ilu - Uhllv(h) converges to zero at the optimal rate O(h 2 / 3 - e ), for each fixed £
and each k, as h tends to zero, as predicted by Theorem 3. In particular, we
make two key observations: firstly, we note that for a given fixed mesh and
fixed polynomial degree, an increase in the wave number k leads to an increase
in the DC-norm of the error in the approximation to u As pointed out in [1],
where curl-conforming finite element methods were employed for the numerical
approximation of (10), the pre-asymptotic region increases as k increases; this
is particularly evident when k = 6, cf. Table 4. Secondly, we observe that
the DC-norm of the error decreases when either the mesh is refined, or the
polynomial degree is increased as we would expect; this is also the case when
the DC-norm of the error is compared with the total number of degrees of
freedom employed in the underlying finite element space, for each fixed k; for
brevity these results have been omitted.
Finally, we end this section by considering the rate of convergence of the
error in the approximation to u measured in the L2-norm. While for smooth
solutions the optimal L2-order has been confirmed numerically (see [4]), the
additional regularity assumptions for the L2 estimate in Theorem 3 do not
hold in the example considered here. Notwithstanding this, in Figure 1 we
plot the L 2-norm of the error in the approximation to u, with the square root
of the number of degrees of freedom in the finite element space V h, for k = 1
and k = 6. We observe that (asymptotically) Ilu - uhllo,n converges to zero at
the rate O(h2/3), for each fixed £ and k, as in the case of the DC-norm of the
error.
490 P. Houston et al.
£=1 £=2
Elements Ilu - uhllv(h) r Ilu - uhllv(h) r Ilu - uhllv(h) r
24 1.525e-1 8.881e-2 6.078e-2
96 8.875e-2 0.78 5.374e-2 0.73 3. 744e-2 0.70
384 5.393e-2 0.72 3.331e-2 0.69 2.337e-2 0.68
1536 3.348e-2 0.69 2.085e-2 0.68 1.467e-2 0.67
6144 2.096e-2 0.68 1. 31Oe-2 0.67 9.227e-3 0.67
4 ConcI usions
:::::::==J 0.67
k=l
yfDegrees of Freedom
Fig. 1. Convergence of Ilu - uhllo,n for k = 1 and k = 6
References
1. M. Ainsworth and J. Coyle. Hierarchic hp-edge element families for Maxwell's
equations on hybrid quadrilateral/triangular meshes. Comput. Methods Appl.
Meeh. Engrg., 190:6709-6733, 2001.
2. C. Amrouche, C. Bernardi, M. Dauge, and V. Girault. Vector potentials in
three-dimensional non-smooth domains. Math. Models Appl. Sci., 21:823-864,
1998.
3. L. Demkowicz and L. Vardapetyan. Modeling of electromagnetic absorp-
tion/scattering problems using hp-adaptive finite elements. Comput. Methods
Appl. Meeh. Engrg., 152:103-124, 1998.
4. P. Houston, I. Perugia, A. Schneebeli, and D. Schotzau. Interior penalty method
for the indefinite time-harmonic maxwell equations. Technical Report PIMS-
03-15, Pacific Institute for the Mathematical Sciences, Universify of British
Columbia, 2003.
492 P. Houston et al.
1 Introduction
Over the last few years, several mixed discontinuous Galerkin finite element
methods (DGFEM) have been proposed for the discretization of incompress-
ible fluid flow problems; see, e.g., [4, 6, 7, 9, 12, 16] and the references therein.
The main motivations that led to these schemes are that mixed DGFEM pro-
vide robust and high-order accurate approximations, particularly in transport-
dominated regimes, and that they are considerably flexible in the choice of
velocity-pressure combinations, without excessive numerical stabilization. For
example, no extra stabilization is required to use optimally matched combi-
nations where the approximation degree for the pressure is of one order lower
than that of the velocity; this result was first established in [11] in the context
of linear elasticity of nearly incompressible materials.
The work in [13] presented a unifying framework for the analysis of mixed
hp-DGFEM for Stokes flow. For discontinuous QP - QP-l elements on hexa-
hedral meshes, the results there ensure (slightly suboptimal) error bounds for
the p-version of the DGFEM, where convergence is obtained by increasing the
polynomial approximation order p on a fixed mesh. However, these bounds
result in algebraic rates of convergence and are restricted to piecewise smooth
494 P. Houston et al.
-Llu + \1p = f in D,
\1 . u = 0 in D, (1)
u = g on 8D.
This system is uniquely solvable; see, e.g., [5, 8] for details.
In [10] the regularity of the solution (u, p) to the Stokes equations with piece-
wise analytic data f and g was described in terms of the countably normed
Sobolev spaces introduced by Babuska and Guo for closely related diffusion
and elasticity problems (see [2, 3, 15]' and the references cited therein). To
°
define these spaces, let {A i };'!1 denote the vertices of the domain D. To each
vertex A we assign a weight f3i :::: and store these numbers in the M-tuple
f3 = (f31,"" f3M)' We define f3 ± j := (f31 ± j, ... , f3M ± j) and use the short-
hand notation C 1 > f3 > C2 to mean C 1 > f3i > C2 for i = 1, ... , M. Writing
ri(x) = min{l, lx-Ail}, we define the weight function iP{3(x) := I1~1 ri(x){3i,
and introduce the semi-norms
k
We denote by H~,l (0) the completion of Coo (0) with respect to the norm
Ilull~~,o(st) := L IlcPf3+laIDaulli2(st),
lal=O
Here, we denote by II· IIL2(st) the usual L2-norm. Similarly, II· IiH'(st) is the
norm on the standard Sobolev space HI(O).
lal = k ;:: l,
for constants C > 0, d ;:: 1 independent of k. Moreover, for l ;:: 1, the space
B~-! (80) is the space of traces of functions in B~(O).
Functions in B~(O) (or their traces) are referred to as piecewise analytic
functions. Indeed, they are analytic in any interior domain Oint C 0 with
Oint C 0\ {A}i'!l' but develop singularities at the corners {Adi'!l' Globally,
we have B~(O) C H I- 1 (0), but B~(O) ct
HI(O). The following regularity
result was proved in [10].
Theorem 1. There exists a weight vector 0 ::; f3min < 1 depending on the
opening angles of 0 at the vertices {Ai }i'!l such that for weight vectors 13 with
3
f3min < 13 < 1 and piecewise analytic data (f, g) E Bg(0)2 X BJ (80)2, the
solution (u,p) of the Stokes system (1) satisfies (u,p) E B$(0)2 x B~(O).
3
In the rest of the paper, we assume that (f, g) E Bg(0)2 X BJ (80)2 for a
weight vector 13 with f3min < 13 < 1, in order to ensure the piecewise analyticity
of the solution (u,p), as stated in Theorem 1.
3.1 Meshes
Next, we define average and jump operators. To this end, let K+ and K-
be two adjacent elements of '4, and x be an arbitrary point on the interior
edge e = EJK+ n EJK- C [I. Moreover, let q, v, and r. be scalar-, vector-, and
matrix-valued functions, respectively, that are smooth inside each element K±.
By (q±, v± , r.±) we denote the traces of (q, v, r.) on e taken from within the
interior of K±, respectively. Then, we define the averages at x E e by:
for all (v, q) E V h X Qh. The forms Ah and Bh are discontinuous Galerkin forms
that discretize the Laplacian and the incompressibility constraint, respectively,
with corresponding right-hand sides Fh and G h . These forms are given by
Fh(v) =
Jn
r f·vdx- Jevr (g0n):'Vh vds + Jevr cg.vds,
G h (q) = - rq
Jev
g . n ds.
Here, 'V h and 'V h' denote the discrete gradient and divergence operator, re-
spectively, taken element-wise. The function c E LOO(E) is the so-called dis-
continuity stabilization function that is chosen as follows. Define the functions
hE LOO(E) and k E LOO(E) by
x Ee = 8K n 8K' c EI ,
k(x) := {
x Ee = 8K n 8n c Ev.
Then we set
c = ,h- 1k 2 ,
with a parameter, > 0 that is independent of hand k.
Remark 1. The form Ah corresponds to the so-called symmetric interior penalty
(IP) discretization of the Laplace operator; see [1] and [13] where the presen-
tation and analysis of several different DG methods were unified for diffusion
problems and the Stokes system, respectively. All the results presented in this
paper hold true verbatim for all the mixed DG methods investigated in [13].
3.4 Well-posedness
Fig. 1. The basic geometric mesh Lln,O" Fig. 2. Local geometric refinement to-
with n = 3 and (]" = 0.5 wards the vertices of n. In all corners,
n = 3 and (]" = 0.5
5 Numerical experiment
The goal of this section is to numerically confirm the exponential convergence
result stated in Theorem 2. To this end, let [J be the L-shaped domain shown
500 P. Houston et al.
in Figure 3. As in [17, p. 113]' we select the right-hand side f = 0 and take the
Dirichlet boundary datum g in such a way that, in the polar coordinates (r, <p),
the exact solution (u,p) to the Stokes problem (1) is given by
u(r
,<p
) = rA ((1 +<p)W'(<p) -
sin(
>.) sin(<p)W(<p) + cos(<p)W'(<p))
(1 + >.) cos(<p)W(<p) ,
p = _r A- 1 [(1 + >.)2W'(<p) + W"'(<p)]/(l - >'),
where
10-',L-----,~-~-~15o__-~-~,------,J
(degrees 01 freedom) 12
Figure 4 shows the performance of the mixed hp-DGFEM for the above
problem, on meshes that are geometrically refined towards the origin and for
polynomial degree distributions that are linearly increasing away from the
origin. In our computations we used the grading factor (J' = 0.5 and the linear
slope J.l = 1. The interior penalty parameter 'Y was chosen as 'Y = 10. The
exponential convergence rate, according to Theorem 2, is clearly visible. In
addition, we observe that the asymptotic regime is already achieved even with
a small number of degrees of freedom.
Mixed hp-DGFEM for the Stokes problem in polygons 501
References
1 Introduction
T is the period of the actual motion and w == ~ is its frequency. The quoted
packages use orthogonal collocations, see [9]' to discretize the problem (2) &
(3).
The continuation is initialized by a cycle with a small amplitude h, see e.g.
[1], function initJLLC:
(4)
It reflects the well known fact that the spectrum of the differential Fu (UH, AH)
contains a purely imaginary eigenpair. The tuple (uO(t), T == ~~, A == AH) is
considered to be an approximate solution to (2) & (3).
As the predictor for the continuation, it is natural to take
(6)
namely, the differential of function UO with respect to h. Following this strategy,
differentials of the components T and A with respect to h are zeroes. Therefore,
the predictor for the pathfollowing for the first cycle reads as (vO (t), 0, 0) .
The aim of this contribution is to suggest a more sophisticated cycle ini-
tialization then just (4) and (6). The technique is based on Lyapunov-8chmidt
reduction at Hopf bifurcation point (UH, AH), see e.g. [11], [5]. We consider the
version, which is characterized by making use of bordering operators, see [8]
and [13].
In Section 2, we review the technique. In Section 3, we give the 2nd-order
formula to initiate the branch. Finally, in Section 4, we will report on a nu-
merical experiment.
2 Preliminaries
We review the main points of [8]. Instead of solving (2)&(3) we seek for 27r-
periodic solutions of (1); the reasons are just historical. The problem is formu-
lated as functional equation <I> (u, A, w) = 0 on proper function spaces:
(9)
(x = 0, A - AH = 0, w - WH = 0)
of (9) is related to the root (u H, AH, wH) of <I>. The nontrivial solutions x i= 0
of (9) are linked with cycles, the trivial solutions x = 0 are linked with steady
states.
It is natural to develop the periodic solutions u = UH + v E U to Fourier
series
+00
u = UH + [v]o + L
([V]keiks + [V]_k e- ikS ) , (10)
k=1
504 D. J anovska, V. Janovsky
(12)
where eH is the unit eigenvector, see (5), and r/H is an adjoint eigenvector,
(13)
(14)
(15)
61 cPxxx x
2
+ cPXA (A - AH) + cPxw (w - WH) + h.o.t. = O. (16)
It consists of low-order terms of the expansion (16), see [5], p.86. The corre-
sponding cycles u E U are approximated by the accordingly truncated series
(10). In (16), it is understood that the differentials of cP are evaluated at the
origin, i.e. cPxxx = cPxxx (0,0,0), etc. An algorithm for computing of the chain
of relevant differentials is supplied in [8]' and in a larger extend in [13].
In next Section 3, we resume the asymptotic formula for a cycle (it is
already done in [8]) and provide a formula for differential of the cycle with
respect to x. The latter formula is the cycle 'velocity' and serves as predictor
step of the cycle continuation.
We rescale time back to period one, see (3).
A Postprocessing of Hopf Bifurcation Points 505
3 The algorithm
for small parameter h. The increments 8A and 8w are the solution of a linear
system
B (8A)
8w
= _ h2
6
(~¢xxx)
'0¢xxx
, (20)
The objects [vxh, [v>.lo, [vxxlo, [vxxh are vectors in en and ¢xxx, ¢x>., ¢xw are
complex constants, see [8l, p. 1167. They are computed at a particular Hopf
bifurcation point UH, AH, WHo
Truncating the higher order terms in (17), (18) and (19) we get the 2nd-
oder cycle approximation.
Let us differentiate the cycle (u(t), T, A) with respect to h. The resulting
differentials (lh lh
u(t), d~ A, T) can be considered as cycle velocity. For the
components of cycle velocity we obtain the following asymptotic formulae:
d .
dh u(t) = 2 ~(exp(2mt) [vxh) + 8A [v>.lo +
+2h (~[vxxlo + ~(exp(47fit) [vxxh )) + O(h2), (21)
d
dh T = -27f w2
lh w '
d
dh W = 8w + O(h
3
), (22)
d
dh A = 8A + O(h ),
3
(23)
B (8A) =
8w
_!:3 (~¢xxx)
'0¢xxx
(24)
Truncating the higher order terms in (21), (22) and (23) we obtain the
Ist-oder cycle velocity approximation.
Remark 1. Assuming the bordering (12) or (14), it comes out that [vxh = ~H.
Actually, for a generic bordering, [vxh is a positive multiple of ~H.
506 D. Janovska, V. Janovsky
Truncating in the expansions (17), (18) and (19) the 2nd-order, and in the
expansions (21), (22) and (23) the 1st-order terms, we get
The claim is that choosing L properly, we get (4) and (6). In fact, due to
Remark 1, the choice e.g. L == 2~H or L == 2T}H would do.
4 Numerical tests
We consider
a
j; = XJL+(fJ +x - k) - JL-,),XYZ,
if = y( EX - ')'Z - p) ,
i = Z ((J')'y - 5) ,
which depends on ten parameters. This dynamical system models corruption
in democratic societies, see [12]. In [6]' p. 317, the system was investigated for
the parameter setting
0.8
0.7
0.6
0.5
>.
0.4
0.3 1
0.9
0.2 0.8
0.7
0.1 0.6
0.717 0.5
x
0.361
0.3605
0.36
>-0.3595
0
0.359
0.3585 0.766
0.765
0.358
0.764
0.3575 0.763
0.7176 0.762 X
0.761
delta
Fig. 2. Zoom - the initial cycle (solid), the next cycle (dotted)
not initialized either due to the round-off errors. On the other hand, ap-
plying ini t ..ILLenew with bordering (14), the continuation is successful for
h ::; 0.002.
508 D. J anovska, V. Janovsky
0.16
0.14
0
0.12
,.,
f)
0.1
0.08
0.08
0.04
0.7
0.02
0.6
-0.02
0.4
X
0.3
0.2
0.73
delta
0.08
0.Q7
,.,
0.06
0.05
0.04
0.42
0.03
0.36
0.02
0.7176
X
delta
Fig. 4. The lst-oder (dotted) vs the 2nd-oder (solid) predictors; h=0.08, h=0.09,
h=O.l
A Postprocessing of Hopf Bifurcation Points 509
References
1. Dhooge, A., Govaerts, W., Kuzetsov, Yu.A., Mestrom, W., Riet, A.M.:
CL.-MATCONT: A continuation toolbox III Matlab. Gent University,
http://allserv.rug.ac.be/ ajdhooge/
2. Dhooge, A., Govaerts, W., Kuzetsov, Yu.A. (2003): MATCONT: A Matlab pack-
age for numerical bifurcation analysis of ODEs. ACM Transactions on Mathe-
matical Software, 31, 141-164
3. Doedel, E.J., Govaerts, W., Kuzetsov, Yu.A.: Computation of periodic solution
bifurcations in ODEs using bordered systems. SIAM J. Numer. Anal., to appear
4. Doedel, E.J., Kernevez, J. (1986): AUTO: Software for continuation problems in
ordinary differential equations with applications. California Institute of Technol-
ogy, Applied Mathematics
5. Golubitsky, M., Schaeffer, D.G. (1985): Singularities and Groups in Bifurcation
Theory. Vol. 1. Springer-Verlag, New York
6. Govaerts, W. (2000): Numerical methods for bifurcation of dynamical equilibria.
SIAM, Philadalphia
7. Govaerts, W., Kuzetsov, Yu.A., Sijnave, B. (2000): Numerical methods for the
generalized Hopf bifurcation. SIAM J. Numer. Anal., 38, 329-346
8. Janovsky, V., Plechac, P. (1996): Local numerical analysis of Hopf bifurcation.
SIAM J. Numer. Anal., 33, 1150-1168
9. Kuzetsov, Yu.A. (1995): Elements of Applied Bifurcation Theory. Springer Ver-
lag, New York
10. Kuzetsov, Yu.A., Levitin, V.V. (1977): CONTENT: Integrated en-
vironment for analysis of dynamical systems. CWI, Amsterdam,
ftp://ftp.cwi.nl/pub/CONTENT
11. Marsden, J.E., McCracken, M. (1976): The Hopf Bifurcation and Its Applica-
tions. Springer-Verlag, New York
12. Rinaldi, S., Feichtlinger, G., Wirl, F. (1998): Corruption dynamics in democratic
societies. Complexity 3, 53-64
13. Xu, H., Janovsky, V., Werner, B. (1998): Numerical computation of degenerate
Hopf points. ZAMM Z. Angew. Mech., 78,807-821
Givens. Reduction of Quaternion-Valued
Matrices to Upper Hessenberg Form
1 Introduction
Since 1HI and illi are isomorphic, see [4], we can replace 2 X 2 complex matrices
with corresponding quaternions and use quaternion arithmetic in calculations.
The arithmetic of quaternions is, of course, more complicated, but it allows us
to economize the storage and to increase the accuracy of results, see [1].
In this paper, our aim is to develop an algorithm for a reduction of a
given quaternion-valued matrix into its upper Hessenberg form. At first, we
shortly review the algebra of quaternions, see [6]. We also briefly mention the
algorithm of Given's reduction of a quaternion-valued vector x E 1HI 2. It can be
found in [3], where also all necessary facts are proved in detail. Then, Givens'
transformation matrices for a vector x E lHI n are defined. We discuss some
possibilities of reducing the number of necessary real operations. Finally, the
number of real operations for the reduction of quaternion-valued matrices into
a similar matrix in upper Hessenberg form will be calculated. Our aim is to
show in particular that it is necessary to develop an algorithm, which reduces
the large number of operations. A numerical example of Givens' reduction of
a quaternion-valued matrix into a similar upper Hessenberg form is presented.
Let 1HI = JR.4 be equipped with the ordinary vector space structure and with
an additional multiplicative operation 1HI x 1HI ---+ 1HI which most easily can be
defined by a multiplication of the four basis elements
(2)
where XI,X2,X3,X4 E JR., ~x = Xl is the real part of x. We will identify
the quaternion x = (XI'O,O,O) with the real number Xl, the quaternion X =
(Xl, X2, 0, 0) will be identified with the complex number Xl + iX2. For X
(XI,X2,X3,X4) E 1HI, Y = (YI,Y2,Y3,Y4) E 1HI it follows from (1) that
XY = (XIYI - X2Y2 - X3Y3 - X4Y4) 1 + (XIY2 + X2YI + X3Y4 - X4Y3) i
(3)
Let us note that conjugation obeys the rules
XY = Y x, x=X.
512 D. Janovska, G. Opfer
-1 X-
X = Ix1 2 ' (6)
(7)
an equation which will be used later.
Two quaternions x and yare called equivalent, denoted by x '" y, if y =
a-lxa, for some a E 1HI\ {O}. For fixed x E 1HI the set
is the only complex element in [xl with non negative imaginary part.
Let x = (Xl, X2, ... , xn)T E lHI n and define the norm Ilxll of x by
n
The space lHIn becomes a normed vector space over 1HI with the norm defined
in (10). For x E lHIn, we denote by x* the transpose of the entrywise conjugate
ofx.
Givens' Reduction of Quaternion-Valued Matrices 513
Definition 1. Let B E lHInxn. If there exist a vector x E lHI n \ {O} and a quater-
nion A E 1HI such that
Bx = XA, (11)
we call A an eigenvalue of B and x an eigenvector corresponding to A.
Theorem 2. Let A E lHI nxn be Hermitean. Then A has only real eigenvalues
and their number is n.
G = ( -s~ s) E 1HI2x2 ,
C
514 D. Janovska., C. Opfer
X2
s=-(J'~, I(J'I = 1,
aXl + f3x2
E = { (J' E 1HI : (J' = I
aXI
f3 I' a, f3
+ X2
E JR, lal + 1f31 > 0} .
G*x = U = (J'llxll(l,O?
and there are no other unitary quaternion-valued matrices satisfying this con-
dition.
Let x = (Xl, ... ,xn)T E lHIn\{o}. For 1:<:; i < j :<:; n, we define the Givens'
rotation matrix G; E lHInxn as follows:
1 o
. 1
Ci . . . . . . . . . Si
1 (13)
-Si ......... Ci ~ j
1
o .1
T T
j
where
(14)
aXi + f3Xj
Ei = {(J' E 1HI : (J' = I
aXi + f3 Xj I' a, f3 E JR, lal + 1f31 > O}. (15)
Givens' Reduction of Quaternion-Valued Matrices 515
Theorem 5. For a given vector x = (Xl, ... ,Xn)T E lHI n , Xn =1= 0, let
(16)
°
n
Let us set (!i = L IXj 12 , i = 1, ... ,n - 1. Since Xn =1= 0, also (!i =1= for
j=i
all i.
Let us start from the first reduction. We construct G~-l to reduce the last
component of x E lHI n :
where
Un-l = CYn-l VIXn_112 + IXnI 2 ,
Cl:n-1Xn-l + f3n-l x n
CYn-l = I
Cl:n-1Xn-l + f3n-l x n I' Cl:n-l,f3n-l E ffi., lCl:n-ll + lf3n-ll > o.
In the next step we continue:
where
516 D. Janovska, G. Opfer
a n -2 X n-2 + f3n-2(Jn-1(2n-l
lan -2 X n-2 + f3n-2(Jn-li?n-ll
a n -2, f3n-2 E JR., la n -21 + lf3n-21 > o.
Now, we form the product (G~=i)*(G~-l)* and continue by reducing U n -2.
By induction we obtain the following explicit formulae for the matrix G*
(gkj) E lHI nxn :
gl,j j = 1, .. . ,n;
gi+l,i i = 1, ... ,n - 2;
_ _ Xk-l(Jk-l(JkXj k_ _
gk,j - , - 2, ... ,n 1, j = k, ... ,n;
i?k-li?k
Xn~
gn,n-l =
i?n-l
Xn-l~
gn,n
i?n-l
gi,j 0, i = 3, ... , n, j = 1, ... , i - 2;
where
an-l Xn-l + f3n-l Xn
(In-l =
lan-l Xn-l + f3n-l xnl '
ai Xi + f3i(Ji+1i?i+l f3
>0
= Iai Xi + f3i(Ji+1i?i+1 I'
1T1l
(Ji ai, i E IN.., lail+lf3il for all i = n-2, ... , 1,
For a given x = (Xl, ... ,xn)T E lHIn, Xn # 0, let us choose (Ji-l = sgnxi at
each step of the reduction. Such (Ji always belongs to the set E i , see Remark
1. If we set (J = (In-l we obtain
and G*x = ((Jllxll, 0, ... , O)T = (sgnxn Ilxll, 0, ... , O)T, where
Givens' Reduction of Quaternion-Valued Matrices 517
(!3 x2 x 3 X2 X 4 x2 x 5 x2 x n
0
G*= (!2 (!2 (!3 (!2(!3 (!2(!3 (!2 (!3
the resulting G* is not an upper Hessenberg matrix, but it has "nearly trian-
gular form":
* * * * *
* * 0 0 0
* * * 0 0
o
* * * ... *
the diagonal elements except the first one are real.
Corresponding Givens transformations G;, see (13), are then applied to k-th
column ofY, k = 1, ... ,n - 2:
Let 1 :::; io < ]0 :::; n be fixed. Let k-th column of the matrix Y plays the
role of x in (14), (15).
The corresponding Givens' matrix for the reduction of the element Yjo,k
has the form
Let us count the number of real operations necessary for the reduction of
an arbitrary matrix Y E JH[nxn to upper Hessenberg form. Moreover, let our
resulting upper Hessenberg matrix be similar to the original one, so that we
can use the reduced form to compute the eigenvalues of the matrix Y. Let us
remark that 1 addition of two quaternions needs 4 real flops, 1 multiplication
of two quaternions needs 28 real flops. For the reduction of one element of Y
we have to perform two matrix multiplications:
4n multiplications
V = (G~~)*Y of quaternions
2n additions
V Gio 4n multiplications
of quaternions
Jo 2n additions
All together, the reduction of one element of Y needs 240 n real flops. We can
substantially reduce the number of operation (to 144 n real flops) if s or cis
real.
To obtain the upper Hessenberg form we have to reduce (n - 2) + (n - 3) +
(n-2)(n-1)
.. +2+1 = 2 elements. Let r be the product of the corresponding
Givens' matrices:
( 5 4 3 -3 -2 0 a 1 4 -5 3 3 -4 -2 2 4 1 -1 3 3 -5 -2 5 -2 -5
o 2 3 4 0 0 0 -4 -1 0 -3 -3 1 0 -4 -5 4 -1 4 -1 -1 -2 -2 -4 a .
-4 5 4 2 -4 -3 3 -4 2 4 2 -3 -3 3 3 -4 0 -1 3 -1 3 0 3 -2 1
-4 3 -4 -1 0 4 2 1 -4 3 -3 0 4 -2 0 2 0 4 4 -3 -2 -3 4 2 2
Givens' Reduction of Quaternion-Valued Matrices 519
Then W := r*yr =
Wll W12 WI3 W14 w15
* * * *
* * * *
(
o * * *
00* *
000 *
just an upper Hessenberg matrix.
6 Conclusions
We have developed an algorithm for reducing a quaternion-valued matrix into
a similar upper Hessenberg matrix by making use of quaternion-valued Givens'
transformation matrices. The algorithm was written and tested in MATLAB.
It is a generalization of algorithm given in [3]. The number of real operations
necessary to perform this reduction is still too large. We have suggested to
reduce the number per each column in Section 4.
We are convinced that a good way to reduce the number of real operations
is to try to apply Fast Givens' algorithm, which is used in the real case.
A complex version of Fast Givens' algorithm can be found in [5] (written
in Chinese). This is the only paper dealing with Fast Givens' transformations
applied to complex vectors, which we were able to find.
Acknowledgement
The authors acknowledge with pleasure the support of the Grant Agency of
the Czech Republic (grant # 201/02/0844, grant # 201/02/0595) and support
of the project MSM 223400007 of the Czech Ministry of Education.
References
1. Dongarra, J.J., Gabriel, J.R., Koelling, D.D., Wilkinson, J. H. (1984): Solving
the Secular Equation Including Spin Orbit Coupling for Systems with Inversion
and Time Reversal Symmetry. J. Comput. Phys., 54, 278-288
520 D. Janovska, G. Opfer
1 Introduction
The final aim of our modelling is to predict production of nitrogen oxides from
an internal combustion engine. For this purpose, a precise chemical reaction
model should be developed. Its main inputs would be flow, temperature, and
pressure fields and their development in time. This contribution is devoted to
a model of relevant physical processes, which is being developed at Technical
University of Liberec and whose outputs would form such inputs to the precise
chemical reaction model. It is being built as a model of compressible flow,
transport of mass and energy, and production of energy in a time-dependent
domain.
A short overview of the model is done in the next section. The rest of the
paper a bit more precisely discusses the finite volume mass transport model,
a method of reduction of numerical diffusion, and its testing.
The volume and shape of the cylinder of engine changes in time. To simplify the
problem, we discretize it in time and in each time step we split the solution into
two stages, isochoric and adiabatic one. All modelled processes are computed
in the isochoric stage, supposed to take place in a fixed domain:
* This work was supported with the subvention from Ministry of Education
of the Czech Republic, project code 242200001.
522 P. Jininek et al.
All chemical processes are described by the set of stechiometric equations where
each one can be written as
dCi
-=-Rm·a· dT = R-.!L
dt ' " dt Cv'
where mi is the molar mass of reagent A and C v is the heat capacity of the
gas mixture. There are an application of the simplifying assumption that the
reaction rate R depends only on the mass fraction of a chosen gas compo-
nent and results of its calibration presented in [4]. One calibration result is
illustrated on Fig. 2.
Model of Compressible Flow and Transport in a Time-Dependent Domain 523
150 250 350 450 550 150 250 350 450 550
Rotation of crankshaft (deg] Rotation of crankshaft [deg]
Fig. 2. Results of calibration of simplified reaction model. Bold lines are computa-
tional results, thin lines are measured data.
The model of flow of compressible gas mixture is governed by the set of N avier-
Stokes equations, the continuity equation, and the state equation of perfect
gas:
~~ + (v . \7)v = vL1v + ii\7(\7 . v) - 1. \7p,
a e
a~ + \7 . (ev ) = 0,
p = RTe.
Here, v is the velocity vector, p pressure of gas, e its density, and v and ii are
viscosity coefficients, R is the molar gas constant, and T is temperature.
Nonlinear system is discretized by mixed hybrid finite element method and
linearized. Formulation of the model is set in [5], global behaviour tests were
succesfully performed, further testing is being in process.
Let [l C R3 represents the domain of the interior of the engine cylinder (since
all processes are solved as isochoric processes in the fixed domain, we also omit
the time dependence of [l in our notation). The boundary of [l is divided into
two disjoint parts r in and rex. The part r in represents the inlet part of the
boundary, the remaining part is denoted by rex. The problem is solved in the
time interval (0, t)
The mass transport is governed by the set of mass balance equations for
each component of a gas mixture [1]:
8(e i) .
----at
C
+ \7 . (fCi + Ji) + Ci/- = cVi+ in [l, i = 1, ... , N (1)
in a given flow field f(x, t) for unknown functions e(x, t), Ci(X, t) (i = 1, ... , N).
N is the number of components of gas mixture, 1'+ denotes the density of
sources (with defined mass fractions Ci,*) and 1'- is the (positive) density of
sinks. The diffusion flux ji is given using the effective diffusivity Vi by the
Fick's-law-like relationji = -eVi\7ci slightly corrected as proposed by Sutton
and Gnoffo in [6].
524 P. Jininek et al.
The energy transport is derived from the balance of the internal energy,
which can be written with respect to the temperature as [1]
Here, T* is the temperature of inflowing fluid. The terms n rev and nirr express
reversible and irreversible rates of conversion between mechanical and internal
energy and can be written as
where /-l is viscosity coefficient. In these formulas, the ideal behaviour of gas
and the Newtonian fluid are supposed.
The boundary conditions are set of Dirichlet type at inlet part of the bound-
ary:
In this section, we focus on the numerical model of mass transport. The energy
transport model is very similar.
Before starting with the numerical scheme, let us make a note about the
space decomposition. The presented model use the meshes consisting of tri-
lateral prisms with parallel bases. The base of the cylider is decomposed into
the set of triangles and then is the triangulation extruded along the z axis up
to the height of the cylinder. It should be noted that the 2-D mesh must sat-
isfy some conditions needed for a consistency of the numerical scheme. These
conditions can be found e.g. in [2] as the definition of admissible mesh.
Using the notation from [2], let us introduce the set of control volumes T,
the set of their faces £ and the set of points P such that each point XK E P can
Model of Compressible Flow and Transport in a Time-Dependent Domain 525
°
The time discretization is realized by the ascending sequence of time values
(tn)nENo, to = with the time step Lltn = tn+l - tn.
Explicit finite volume scheme of the problem (1)-(4) can be written as
follows:
+ L.-t
'"' pni,K,a + '"' inK,a cni,a,+
~
(5)
a-EEK a-EEK
= m(Khxc?'K,+, i = 1, ... , N, K E T,
where I2f K = I2K cf K means partial density of ith specie in the finite volume K.
We use ~uch a notation that e.g. I2x approximates I2(XK, t n ). Mass fractions
in the advective and source term are expressed following the upwind scheme
as
if i'K,a- :::: 0,
if i'K a- < 0, if IX :::: 0, (6)
cr ~ KIL ~ u[l, if IX < 0.
if i'K,a- < 0, cr C lin,
Final decomposition into the density and mass fraction is computed as follows:
N
I2 nK+ 1 -- 'L..t n +1
" ' I2 i,K'
i=l
is applied instead of V.
Additionally, the estimate (8) of V num should be extended to more-
dimensional case. It is made by using the same formula (8) and estimating
the 1-D parameters v and h in the following way: On each face (J = KIL, v is
set equal to the magnitude of the dominant velocity in small neighbourhood
and h is estimated as the length of the projection of the vector (XL - XK) to
the direction of the dominant velocity.
8c 8c 82c
8t (x, t) + v 8x (x, t) - V 8x 2 (x, t) = 0, (x, t) E (-00, +00) x (0, l) (10)
c(x, 0) = M 8(x - xo), (11)
where 8(x) denotes the Dirac function. The analytic solution of the problem
is
M [(X-xo -vt)2]
can (x, t) = c;r:;u exp - V ' (12)
2y7rVt 4 t
where M is the initial mass of coloured fluid in the domain.
Model of Compressible Flow and Transport in a Time-Dependent Domain 527
8c(x,O,t)
8y
= 8c(x,w,t)
8x
°
= ,x E (0 "d) t E (0 ,"J
f\ c(x
0) _ {I ,x,
( y) E K 0
c(O, y, t) = 8c(~~,t) = 0, Y E (0, w), t E (0, t) ,y, - 0, (x, y) tf- Ko '
(14)
where Ko is the triangle including point (xo, w/2). The problems are compa-
rable, if parameter M in (11) has meaning of area of the finite volume Ko (see
Fig. 3).
The tests were performed using the following parameters: d = 20, Xo =
d/4 = 5, h = diN (where N is the number of control volumes). The mesh
consisted of equilateral triangles, therefore w = h cos ~.
In Tab. 1, there are results of several computations for comparison summa-
rized. The error E is computed as the L1 norm of the difference of analytical
solution (12) and the numerical solution of (13)-(14): E = Ilcnum(x, y, t) -
can (x, t)IIL1((O,d)x(O,w)), where Cnum was computed numerically either without
[E(without)] or with [E(with)] the proposed reduction of numerical diffusion
(9).
The other presented test problem is a 2-D extension of the previous one. It is
given as
oc
at (x, t) + \l. (c(x, t)v) - V\l2c(x, t) = 0, (x, t) E R2 X (0, t) (15)
c(x,O) = M 8(lx - xol) (16)
Table 1. Comparison of numerical results of 1-D tests for various input parameters
N (number of elements), v (velocity) and V (diffusivity) performed without and with
the proposed reduction of numerical diffusion
E(without)
N v V E(without) E(with) E(with)
where Ko is the finite volume including point Xo and the boundary is split into
inlet and rest part due to direction of velocity.
Table 2. Numerical results of 2-D tests for various magnitudes and directions of
velocity v and diffusion coefficients 1)
E(without)
Ivl direction 1) E(without) E(with) E(with)
last two tests, the numerical diffusion coefficients respective to 34% or 63%
mesh sides were smaller than the physical diffusion coefficient and could be
fully substracted, the resting diffusion coefficients were set to 0 due to (9).
5 Conclusions
References
1. Bird, R.B., Stewart, W.E., Lightfoot, E.N. (2002): Transport phenomena. Wiley,
New York
2. Eymard, R., Gallouet, T., Herbin R. (1994): Finite volume methods. In: Ciar-
let P.G., Lions, J.L.(eds) Handbook of Numerical Analysis, Vol. VII, 713-1020.
North Holland, Amsterdam
3. Odman, M.T. (1997): A Quantitative Analysis of Numerical Diffusion Introduced
by Advection Algorithms in Air Quality Models. Atmospheric Environment, 31,
No. 13, 1933-1940
530 P. Jiranek et al.
4. Sembera, J., Maryska, J. (2002): On the Local Model of Energy Production Inside
a Combustion Engine. In: Prihoda, J., Kozel, K. (eds) Proceedings of Seminat
Topical Problems of Fluid Mechanics 2002, 71-74. Institute of Thermomechanics
AS CR, Prague
5. Sembera, J., Maryska, J., Novak, J. (2003): FEM/FVM Modelling of Processes
in Combustion Engine. In: Chen, Z., Glowinski, R., Li, K. (eds) Current Trends
in Scientific Computing. Contemporary Mathematics, 329.
6. Sutton, K., Gnoffo, P.A. (1998): Multi-Component Diffusion with Application To
Computational Aerothermodynamics. 7th AIAA/ ASME Joint Thermophysics
and Heat Transfer Conference, Albuquerque, NM. AIAA 98-2575.
Numerical Study of Convection of
Multi-Component Fluid in Porous Medium
1 Introduction
Convective flows in a porous medium is a subject of many works [1]. For the
convective filtration of viscous fluid in porous medium (Darcy model) it was
observed an appearance of one-parameter family of steady states with the
spectrum, which varies along the family [2]. V. Yudovich shown that these
families cannot be the orbits of any symmetry group and derived the theory of
cosymmetry [3, 4]. Investigation of continuous families of steady states may be
performed only experimentally or numerically. Computer modelling of Darcy
convection was done by spectral method [5], finite-difference method [6]' and
combined spectral and finite-difference approach [7, 8]. It was found that keep-
ing a cosymmetry in finite-dimensional approximation of differential equation
is extremely important to correct computation of continuous families of steady
states [6, 7]. In the present work we give an extension of approach [7] for the
convection of multi-component fluid obeying Darcy law.
L
S
r=l
1D
S
(pr1jJ - GL"'r()r)dxdy = 0.
r=l
(4)
3 Method of solution
Spectral and spectral-difference methods are the powerful tools for solving
problems in mathematical physics [10]. We apply here the spectral-finite-
difference method in the form derived in [7]. Solution to the problem (1)-(3)
is seeking in the form:
m .
{()\ ... ,()s,1jJ} = L{()j(x,t), ... ,()J(x,t),1jJj(x,tn sin 7r~y. (5)
j=l
+Ar 1jJj,k+l 2h
-1jJj,k-l
-
fJ Jr -+.r
r j,k =. 'l'ljk,
Numerical Study of Convection of Multi-Component Fluid 533
(10)
BT _ BT
+ J,k+l 2h J,k-l =_ 'f'2jk,
A-
r = 1, .. . ,S.
where
4 Numerical Results
"2=-5 ;\=5
120 80
c 2/-)..~·'\ C 3 ....... · . . . . 4
/. . ....
1 1 /0\ 4 \
2/ ~.;;
d b .l \
\ 1 1
I l
\ I I '\
'0
I I \
I I \ I \ '\
Nu
v 0 e I
~ E Nuv 0 I \ \
I
I I
I
I /
\ \
\ \ 1
\ \
1
\ '\ ~,
\ "- .1
'\. .... i . . . ·_· "-
-120 -80
-40 100 NU h 240 380 -20 70 NU h 160 250
branched off from the state of rest (zero equilibrium) as result of monotonic
instability. With increasing the Rayleigh parameter Al the family deforms, and
then on it the arches of unstable equilibria occur. In Fig. 1 the families for the
case of two-component fluid and the container with b = 2 are given as the
projections on a plane NUh and Nu v :
At the critical value Au two unstable points appear on the family. For
instance, for A2 = -5 we found this instability at Au = 138.1 (circles A and
E on Fig. 1 at the left), and for A2 = 5 - respectively at Au = 95.5 (asterisk
on Fig. 1 on the right). We present in Figs. 2 and 3 the streamlines, isotherms
and isolines for concentration corresponding the letters on Fig. 1.
At large negative gradient of concentration and small diffusive coefficients
the state of rest loses stability by oscillatory manner. We have observed here
a new scenario of convective regimes development. Convective transitions are
Numerical Study of Convection of Multi-Component Fluid 535
a b c d e
II
III
o '--------'
o
Fig. 2. Stream functions (I), temperature (II), concentration (III) for some regimes,
S = 2, b = 2, a = I, )'1 = 72, A2 = -5
organized using nonstationary regime branched off from zero equilibrium (state
of rest), and also two families of the stationary solutions were born 'from
air'. The given scenario is realized for the cases of two-component and three-
component fluids. We draw the development of convective regimes for two-
component fluid in Fig. 4; the parameters are the following A2 = -10, "'2 = 0.3
and b = 2.
The state of rest is stable up to A1 ~ 77 and simultaneously there exist
two families of steady states originated via 'out of thin air' bifurcation [12].
Each of these families consists of stable and unstable arches (here we mean
transversal stability or instability with respect to a family). State of rest loses
its stability via Poincare-Andronov-Hopf bifurcation (oscillatory instability)
and the stable limit cycle is formed (curve 3 in Fig. 4). Increasing A1 both
families become more complicated and at A1 ~ 79.4 collide one with another.
Then two new families are appeared: wholly unstable (curve 4) and partially
stable (curve 5). Further increasing of A1 leads to the reduction of unstable
arches and their disappearance. After that, on the interval 80 < A1 < 156.5
given family is wholly stable. Limit cycle on small interval of A1 undergoes
536 O. Kantur, V. Tsybulin
A B c D E
2,-------,
II
III
o
o
Fig. 3. Stream functions (I), temperature (II), concentration (III) for some regimes,
S = 2, b = 2, a = 1, Al = 138.1, A2 = -5
a sequence of bifurcations leading to the chaotic regime. The last collides with
the unstable family (curve 4) and disappears. This family reduces and vanishes
at ),1 = 85 in the result of collision with the state of rest. It corresponds to the
transition of two eigenvalues from left half plane to right one along real axis.
Then we observe the growing of main family and an appearance of four arches
of unstable equilibria as a result of oscillatory instability at ),1 = 156.6.
5 Conclusion
The combined spectral and finite-difference approach gives us an opportunity
to compute continuous families steady states in the planar problem of con-
vective filtration of incompressible multi-component fluid saturated a porous
medium. We study the scenario of transformation of convective regimes and
paricularly the onset of instability on the family of steady states with variative
spectrum.
Numerical Study of Convection of Multi-Component Fluid 537
Acknowledgements
References
1. Nield, D.A., Bejan, A. (1999): Convection in Porous Media. Springer, New York
2. Lyubimov, D. V. (1975): On the convective flows in the porous medium heated
from below. J. Appl. Mech. Techn. Phys., 16, 257-261.
3. Yudovich, V. I. (1991): Cosymmetry, degeneration of solutions of operator equa-
tions, and the onset of filtration convection. Mat. Zametki, 49, 142-148
4. Yudovich, V. I. (1995): Secondary cycle of equilibria in a system with cosymmetry,
its creation by bifurcation and impossibility of symmetric treatment of it. Chaos,
5, 402-441
5. Govorukhin V. N. (1998): Numerical simulation of the loss of stability for sec-
ondary steady regimes in the Darcy plane-convection problem. Doklady Akademii
N auk, 363, 806-808
6. Karasozen B., Tsybulin V.G. (1999): Finite-difference approximation and cosym-
metry conservation in filtration convection problem. Physics Letters A, 262, 321-
329
7. Kantur O. Yu., Tsybulin V.G. (2002): Spectral-difference method to the compu-
tation of convective motion of the fluid in porous medium and preservation of
cosymmetry. J. Comput. Mathematics and Math. Physics, 42, 878-888
8. Kantur O. Yu., Tsybulin V.G. (2002): Filtration-convection problem: spectral-
difference method and preservation of cosymmetry. ICCS, LNCS 2330, Springer-
Verlag Berlin Heidelberg, 432-441
9. Yudovich V. I. (2001): Cosymmetry and convection of multi-component fluid in
porous medium. Izvestiya Vuzov, SKNTs Vsh., Special Issue, Mat. model., 174-
178
10. Canuto C., Hussaini M.Y., Quarteroni A., Zang T.A. (1988): Spectral methods
in fluid dynamics. Springer-Verlag
11. Kantur O. Yu., Tsybulin V.G. (2003): Computation of steady states family of
filtration convection in tall enclosure. Applied. Mechanics and Technical Physics,
44,228-235
12. Kurakin L.G., Yudovich V.I. (2000): Bifurcations accompanying monotonic in-
stability of an aquilibrium of a cosymmetric dynamical system. Chaos, 10, 311-
330
538 O. Kantur, V. Tsybulin
A1=76.5 A1=77.5
120 120
~1 ~1
Nu v 0 + Nu v 0 (3
~2 ~2
-120 -120
-6 8 NU h 22 36 -6 8 NU h 22 36
A1=79 A1=79.5
120 120
Nu v Nu v 0
" \
14 5
I
I
\I
-120 -120
-6 8 NU h 22 36 -6 8 NU h 22 36
A1=80 A1=83
120 120
r\
'\ I
Nu v 0
I
I :4
Nu v 0
II
5
5 114
I f f
1/
-120 L
-6
__ -=====::::::::;====-_J
8 22 36
_120L---~":::::=====-_.-.J
-6 14 34 54
Fig. 4. The convective flows scenario: >-2 = -10, S = 2, b = 2, "'2 = 0.3; cross -
state of rest, curves 1 and 2 - families with stable and unstable equilibria, curve 3 -
limit cycle, dashed line (curve 4) - unstable family, curve 5 - joint family with stable
states (solid line) and unstable ones (stars)
Multi-yield Elastoplastic Continuum-Modeling
and Computations
1 Introduction
Fig. 1. Prandtl-Ishlinskii model of play type (left) and its 0' - E hysteresis type
behaviour for a periodical stress O'(t) = Asin(t), t E (0,27f) (right)
Equation (1) represents the additive decomposition of the strain S into its
elastic part e and its plastic part p as well as of the stress cr into the backstresses
cr~ and the plastic stress crf, where rEI = {I, ... , M}. The plastic strain pis
additively decomposed to internal plastic strains Pro The equation (2) denotes
a linear elastic law, in the isotropic case one has
where HI(il) and L2(il) are the usual Sobolev and Lebesgue spaces. The con-
dition q E dev JR~;~ in the definition of Q implies that tr q = 0, i.e., q is a trace
free matrix. It is shown by BROKATE, CARSTENSEN, VALDMAN [BCV03] that
the combination of the system (1)-(4) describing the Prandtl-Ishlinskii model
of the play type together with (quasi-static) equilibrium between external (de-
noted as f) and internal forces, i.e.,
a(w(t), z - w(t)) + 1j;(z) -1j;(w(t)) :::: (£(t), z - w(t)) , for all z E 'H. (8)
+L J
rEI n
IHIpr: qr dx ,
(9)
4 Numerical Algorithm
The starting point for the finite element method is the time-discretised form of
the variational problem. Problem 1 is solved by an implicit time discretisation,
we use the implicit Euler scheme with equidistant time intervals.
It was shown by ALBERTY, CARSTENSEN, AND ZARRABI [ACZ99] that
in the single-yield case, i.e., M = 1, the time-discretised dual formulation in
each time step is equivalent to an optimisation problem depending only on
the displacement u and the plastic strain p. This result was obtained by us-
ing functional analytic arguments, as the variational inequality is regarded as
a sub-differential, for which the dual sub-differential exists and can be refor-
mulated. The resulting objective depends on the chosen hardening law (linear
kinematic hardening or isotropic hardening), though the structure remains the
same. In KIENESBERGER [Kie03] an algorithm solving the single-yield prob-
lems was developed using the results and notation of ALBERTY, CARSTENSEN,
AND ZARRABI [ACZ99]. Since the multi-yield hardening model structurally
generalises the linear kinematic hardening model, authors managed to extend
the original code using templates in C++ effectively in the way, that the
multi-yield hardening model becomes a new hardening model. For computa-
tional reasons new parameters a r are introduced, which are internal harden-
ing parameters of the the same dimension as the plastic strains Pr and are
defined by a r = lHIrPr.
The notation is as follows: For given variables with index 0 of an initial time
step to, the upgrades of the variables at the time step t 1 = to + Llt have to be
determined. The already time-discretised generalised optimisation problem for
the multi-yield case in each time step, subject to the modifications for fitting
to the single-yield algorithm, reads as:
f(U,Pl,'" ,PM) := ~ J
n
C(c(u) - I>r): (c(u) - LPr)dx
rEI rEI
J
+~ L
n rEI
ia~i2 dx + ~ J L
n rEI
iPr - p~i2 dx + J L
n rEI
a~ : (Pr - p~) dx
-11 -12
Pr Pr
( -12 -22) ===}
-11)
(p;2
Pr
-22
Pr ,
Pr Pr
such that the objective and other equations can be written in a matrix and
vector notation.
For the derivation of the algorithm and numerical experiments we will
consider only the two-yield case, i.e., M = 2, as it shows the characteristics of
the multi-yield problem and can be extended easily. Now, the objective reads
as
!(U,P1,P2) = ~2 (~)
P2
T (B:~: ~!~ -~C) (Pp:21)
-CB C C+ 2 '0
+ pg) + Qag
qp~ P2
+ ~Cpo
2 1 .. pO1 + ~Cpo . pO + ~laol2 + ~laOl2
2 2. 2 2 1 22 -+ min ,
where Bu denotes the discretised strain E(U), and Q is the result of regarding
Pr as vectors, i.e., the matrix norm is defined by Ipi = (pTQp)!.
'0 1 = Q(l + 12_O"il ) is the non-linear iteration matrix of ! with respect
PI€
to P1, and analogous for '0 2 and P2. These matrices are computed in every
iteration step using the current Pr, but apart from that the dependencies on
IPr IE will be neglected. This is not an exact method for determining the change
of the plastic strain, but its error will be corrected later on as the Pr will be
calculated separately and iteratively with the alternating direction method.
Multi-yield Elastoplastic Continuum-Modeling and Computations 545
y r y
>r - - - - - - - - ,
>
>
>
>
> F
>
>
>
>
>
>A~------~-----
X x
Fig. 2. Geometry of a beam (left) and the quarter of a ring (right) problems
The matrix in (14) is positive definite, thus the minimiser (u,ih,P2) has to
fulfill the necessary condition of the derivative being equal to zero:
(15)
Extracting the vector (PI, P2 f from the two lower lines in (15) and inserting
it into the first one yields the Schur-Complement system in u:
5 Numerical Experiments
The algorithm was implemented in NGSolve - the finite element solver exten-
sion package of the mesh generator tool NETGEN developed in our group.
Finite element basis functions were chosen as piecewise linear for the displace-
ment u and piecewise constant for the plastic strains PI and P2. Furthermore,
546 J. Kienesberger , J. Valdman
Fig. 3. Plasticity domains in the single-yield (left) and two-yield case (right) of the
beam
the full multigrid method was used, i.e., we started with a coarse grid, solved
the problem, refined the grid, solved the problem on the finer grid et cetera.
The algorithm was tested on two- as well as on three-dimensional domains,
for both the single-yield and multi-yield case, see Figure 2 for the geometries.
The first testing geometry is the 2D beam of Figure 2 with the left edge
fixed and the right edge charged with a force acting in the direction of the
external normal vector. The second geometry tested is the 3D quarter of a
ring from Figure 2 with constant thickness in the z-axis which is the same as
the thickness of the ring in the 2D sketch. The quarter ring is fixed on the lower
face and a force is acting upwards on the right face. The finest uniform mesh
consists of 131 072 triangles (which corresponds to 658428 degrees of freedom
DOF in the calculation of u) for the 2D examples and 25 088 tetrahedra (122
334 DOF) for the 3D example. Figures 3 and 4 show the plasticity domains
in the single-yield and in the multi-yield case. The elastic zones are colored·
light grey, the first plastic zones are middle-grey, and the second plastic zone
is dark-grey.
Multi-yield Elastoplastic Continuum-Modeling and Computations 547
Fig. 4. Plasticity domains in the single-yield (left) and two-yield case (right) of the
quarter of the ring
7 Acknowledgment
The authors are pleased to acknowledge support by the Austrian Science Fund
'Fonds zur Forderung der wissenschaftlichen Forschung (FWF) , under grant
SFB F013/F1306 in Linz, Austria.
548 J. Kienesberger , J. Valdman
References
David R. Kincaid
Summary. It has been over fifty years since David M. Young's original work on the
successive overrelaxation (SOR) methods. This fundamental method now appears
in all textbooks containing an introductory discussion of iterative solution meth-
ods. (Most often the SOR method appears after a presentation of Jacobi iteration
and Gauss-Seidel iteration and before the conjugate gradient iterative method.) We
present a brief survey of some of the research of Professor David M. Young, together
with his students and collaborators, on iterative methods for solving large sparse lin-
ear algebraic equations. This is not a complete survey but just a sampling of various
papers with a focus on some of these publications.
Dr. David M. Young's doctoral thesis [27] was accepted in 1950 by his supervis-
ing Professor Garrett Birkhoff of Harvard University and his paper [28] based this
work appeared in 1954. This is one of the landmark contributions in modern numer-
ical analysis. The red-black ordering for matrices is of great importance in parallel
computing. Gene Golub has said: "It's almost as if David could see into the future!"
David Young celebrated his 80th birthday on October 20, 2003
(http://www.ma.utexas.edu/CNA/photos.html).
1 Introduction
2 Successive Overrelaxation
From research first done at Harvard University, Young presented in his Ph.D.
thesis [27], and in a subsequent paper [28], an analysis of the successive overre-
laxation (SOR) method for the case where the coefficient matrix of the linear
algebraic system Au = b is consistently ordered [30]. An elliptic partial dif-
ferential equations over a region with grid points numbered in the natural
ordering (left-to-right and up) and using the standard five-point discretization
550 D.R. Kincaid
stencil results in such a matrix system. In fact, any matrix system derived in
this way has Young's Property A [30]. Moreover, a consistently ordered system
can be obtained from one with Property A after a suitable permutation.
For a matrix with Property A, one can permute the rows and corresponding
columns to obtain a red-black system. The red-black ordering corresponds to a
red and black checkerboard ordering of the grid points. When A is a red-black
matrix, it is consistently ordered and Young's equation [27] (A + W - 1)2 =
w2 p,2 A gives a relation between the eigenvalues A of the iteration matrix Lw
for the SOR method and the eigenvalues p, of the iteration matrix B for the
Jacobi method. If A is symmetric positive definite, then the eigenvalues of B
are real and less than 1 in absolute value and the optimum or best value of
the acceleration factor W is given by Wb = 2/ (1 + \11 - S(B)2 ). Here S(B)
is the spectral radius of the Jacobi matrix B, which is the magnitude of the
eigenvalue of largest absolute value of the matrix B. Moreover, the spectral
radius of the SOR matrix with the optimum relaxation parameter W = Wb is
given by S(LWb) = Wb - 1 = r.
For model problems involving the Poisson equation over a region with mesh
points of grid size h, it can be shown that the number of iterations required
for convergence of the SOR method is n = O(h- 1 ) whereas the number of
iterations is n = O(h- 2 ) using either the Jacobi or Gauss-Seidel methods. In
this situation, the SOR method is faster by an order of magnitude.
Work has been done on the choice of the optimum W for the case where A is
consistently ordered but not symmetric positive definite and where some of the
eigenvalues of the Jacobi iteration matrix B are complex eigenvalues. Several
programs are available for choosing the optimum W if all of the eigenvalues of
B are known or if one knows a convex region containing them. See Young and
Eidson [35] and Young and Huang [39].
An extension of the SOR method is the modified SOR (MSOR) method
for a linear system with a red-black coefficient matrix. The MSOR method
involves the use of relaxation factors Wl'W~, W2,W&, ... , where Wi is used for
the red components, and w~ is used for the black points, for each i. In Young,
Wheeler, and Downing [34]' it is shown that there are suitable values of Wi
and w~ that are as good, though not better than, the choice Wi = w~ = Wb for
all i. On the other hand, other choices are more effective if one measures the
effectiveness in terms of certain norms as shown in Young and Kincaid [42].
Chapters 8 and 10 of Young [30] cover the modified SOR method with fixed
and variable parameters, respectively.
A number of other modifications and extensions have been made to the
SOR theory. For instance in group or block methods, the unknowns are
grouped into blocks and all values within a block are updated simultaneously.
Usually, each inner iteration of a block method is done by a direct method
since the matrices for the blocks are assumed to be easily solvable. For exam-
ple, these matrices are tridiagonal in the case of the line SOR method when the
five-point finite difference stencil is used. Also, faster convergence is obtained
Young's Research 551
for line SOR methods than for point SOR methods, in general. Moreover, the
general SOR theory has been applied to group iterative methods in Chapter
14 of Young [30].
Research on norms associated with the SOR method for the red-black
system has resulted in new formulas. It has been shown that graph of the
DLnorm function for the SOR matrix C':/;, is not monotonically decreasing
(it increases and then decreases), but the ALnorm is indeed a monotonically
decreasing function of m; however, it is still considerably larger than the spec-
tral radius function S(C':/;,) = rm. See Young and Kincaid [42] and Chapter 7
in Young [30].
Corresponding to the SOR method is the Symmetric Successive Overre-
laxation (SSOR) method in which an iteration consists of one iteration of
the (forward) SOR method followed by one iteration of the (backward) SOR
method. In the Unsymmetric Successive Overrelaxation (USSOR) method, dif-
ferent parameters may be used in the red and black equations, respectively.
Young [29] presents convergence properties of the symmetric and unsymmetric
successive overrelaxation methods and related methods.
3 Chebyshev Acceleration
the procedure in terms of only the red points or only the black points for
the Jacobi method. Golub and Varga [6] refer to this as the cyclic Chebyshev
semi-iterative method. The cyclic acceleration method is the original Cheby-
shev acceleration method with half of the calculations bypassed. The result-
ing method is equivalent to a special case of the modified SOR method with
Wi = w~ = Pn+l.
With an adaptive Chebyshev acceleration procedure, one continuously re-
vises the iteration parameters as the iterative method proceeds. The algorithm
fixes the smallest eigenvalue estimate mE :S m(G) and adaptively modifies the
largest eigenvalue estimate ME but keep ME :S M(G). The iterative procedure
continues using these values of mE and ME until the observed convergence is
much slower than expected in a certain sense. By solving a Chebyshev equation,
the algorithm increases ME but keeps ME :S M(G). This adaptive Chebyshev
acceleration procedure is repeated until convergence is achieved according to
the stopping test being utilized. Chebyshev polynomials are used in the algo-
rithm for choosing these maximum and minimum eigenvalue estimates. Such
a procedure was developed by Hageman and Young [7] and it was incorporated
into the algorithms used in the ITPACK software packages [18].
It has been shown that, in some cases, one can obtain almost as good
a convergence as with Chebyshev acceleration by the use of a stationary second
degree method given by u(n+l) = pb(Gu(n) + k) + (1-,)u(n)} + (1- p)u(n-I).
Here let P = 1 when n = O. See some of the papers by Young and/or Kincaid
on second-degree methods [14, 15, 20, 31].
where
5 N onsymmetric Systems
A difficult problem is solving the linear system when the coefficient matrix
A is not necessarily symmetric positive definite or even symmetric. Three
generalized conjugate gradient acceleration metbods called ORTHODIR, OR-
THOMIN, and ORTHORES were considered by Young and Jea [40]. It was
shown that under fairly general conditions these methods converge, in exact
arithmetic, in at most N iterations, where N is the order of the matrix. Also
in Jea and Young [9]' the biconjugate gradient (BCG) metbod as well as other
forms of Lanczos metbods were considered as generalized conjugate gradient
acceleration methods corresponding to certain double linear systems involv-
ing A and AT.
The generalized minimum residual (GMRES) metbod [25] is a widely used
method for solving nonsymmetric linear systems. The method is generally very
reliable although stagnation may occur in some cases. Moreover, for nonsym-
metric systems, the amount of work required per iteration usually increases
as the number of iterations increases. 'Recently, Young working with Chen
and Kincaid developed various generalizations of the GMRES method and
combined them with the Lanczos procedure. New iterative methods called
554 D.R. Kincaid
numbers p and pi and, for a given u(n), we determine u(n+~) and u(n+l) by
(H +pE)u(n+~) = b-(V -pE)u(n) and (V +p' E)u(n+l) = b-(H _pi E)u(n+~).
Thus, we have u(n+l) = Tp,p,u(n) + kp,p' where Tp,p' = (V + pi E)-l(H -
pi E)(H + pE)-l(V - pE) = 1 - (p + p')(V + pi E)-l E(H + pE)-l(H + V)
and kp,p' = (p+p')(V +p'E)-lE(H +pE)-lb = (1 -Tp,pl)A-lb. Examples of
alternating-type methods are the alternating-direction implicit (ADI) method,
the symmetric successive overrelaxation (SSOR) method, and the unsymmet-
ric successive overrelaxation (USSOR) method. With the ADI method, Hand
V are either tridiagonal or are permutationally similar to tridiagonal matrices
and E = 1. With the SSOR and USSOR methods, H and V are lower trian-
gular and upper triangular matrices, respectively, and E is a diagonal matrix
with positive diagonal elements.
In certain cases, the ADI method converges rapidly. For example, with
problems involving Poisson's equation in the rectangle with a grid of mesh
size h, the ADI method convergences in n = O(1og h- l ) iterations using the
optimum number of parameters and in n = O( h -11m) iterations using the best
m parameters. Recall that n = O(h-l) for the SOR method. The commutative
case is when HV = V H, HE = EH, V E = EV. It holds for certain separable
self-adjoint elliptic partial differential equations defined over rectangles. Given
the commutativity condition and also bounds on the eigenvalues of Hand
V, necessary and sufficient convergence conditions related to choosing ADI
parameters can be found in Birkhoff, Varga, and Young [1] and in Chapter 17
of Young [30]. Also see Young and Wheeler [48].
With a nonstationary alternating-type iterative method, the parameters p
and pi may vary from iteration to iteration. We seek to determine the parame-
ters {Pi} and {pa so that u(m) is as close to the true solution u = A -lb as pos-
sible. In practice, we seek to make the spectral radius S (II: TPi ,P:)
l as small
as possible. As an alternative to the (sequential) non-stationary method, we
consider the parallel alternating-type iterative method. See papers by Young
and Kincaid [43, 44].
8 Books
References
1. Birkhoff, G., Varga, R S., Young, D. M.: Alternating direction implicit methods,
in F. Alt, M. Rubinoff, editors, Advances in Computers, Academic Press, New
York, 189-273, 1962.
2. Chen, J-Y. Iterative solutions of large sparse nonsymmetric linear systems, Re-
port CNA-285, Center for Numerical Analysis, University of Texas at Austin,
January 1997.
3. Chen, J-Y., Kincaid, D. R, Young, D. M.: MGMRES iterative method, in J.
Wang, M. B. Allen III, B. M. Chen, T. Mathew, editors, Iterative Methods in
Scientific Computation, IMACS, New Brunswick, 15-20, 1998.
4. Chen, J-Y., Kincaid, D. R, Young, D. M.: GGMRES iterative method, in J.
Wang, M. B. Allen III, B. M. Chen, T. Mathew, editors, Iterative Methods in
Scientific Computation, IMACS, New Brunswick, 21-26, 1998.
5. Chen, J-Y., Kincaid, D. R, Young, D. M.: Generalization and modifications of
the GMRES iterative method, Numerical Algorithms, 21 (1999) 119-146.
6. Golub, G. H., Varga, R. S.: Chebyshev semi-iterative methods, successive over-
relaxation iterative methods, and second-degree Richardson iterative methods,
Parts I & II, Numer. Math., 3 147-168, 1961.
7. Hageman, L. A., Young, D. M.: Applied Iterative Methods, Academic Press,
New York, 1981. (Reprinted by Dover, New York, 2004.)
8. Jea, K. C.: Generalized conjugate gradient acceleration of iterative methods,
Report CNA-176, Center for Numerical Analysis, University of Texas at Austin,
February 1982.
9. Jea, K. C., Young, D. M.: On the simplification of generalized conjugate-gradient
methods for nonsymmetrizable linear systems, Linear Algebra and Its Applica-
tions, 52/53:399-417, 1983.
10. Joubert, W. D.: PCG examples manual: A package for the iterative solution of
large sparse linear systems on parallel computers, Report CNA-284, Center for
Numerical Analysis, University of Texas at Austin, July 1996.
11. Kincaid, D. R: An analysis of a class of norms of iterative methods for systems
of linear equations, Ph.D. thesis, University of Texas at Austin, 1971. Also,
Report CNA-18, Center for Numerical Analysis, University of Texas at Austin,
May 1971.
12. Kincaid, D. R.: Norms of the successive overrelaxation method, Math. Comp.,
26(118):345-357, 1972.
13. Kincaid, D. R: A class of norms of iterative methods for solving systems of
linear equations, Numer. Math., 20:392-408, 1973.
14. Kincaid, D. R: On complex second-degree iterative methods, SIAM J. Numer.
Anal., 11(2):211-218, 1974.
15. Kincaid, D. R: Stationary second-degree iterative methods, Applied Numerical
Mathematics, 16:227-237, 1994.
16. Kincaid, D. R, Hayes, L. J.: editors, Iterative Methods for Large Linear Systems,
Academic Press, New York, 1990.
17. Kincaid, D. R., Oppe, T. C., Young, D. M.: ITPACKV 2D User's Guide, Report
CNA-232, Center for Numerical Analysis, University of Texas at Austin, May
1986.
18. Kincaid, D. R, Respess, J. R, Young, D. M., Grimes, R G.: ITPACK 2C:
A FORTRAN Package for Solving Large Sparse Linear Systems by Adaptive
Accelerated Iterative Methods, ACM Trans. Math. Software 8, 1982.
Young's Research 557
19. Kincaid, D. R., Young, D. M.: A brief review of the ITPACK project, Journal
Computer 8 Applied Mathematic, 24:121-127, 1988.
20. Kincaid, D. R, Young, D. M.: Linear stationary second-degree methods for solu-
tion oflarge linear systems, in Th. M. Rassias, H. M. Srivasiava, A. Yanushauska,
editors, Topics in Polynomials of One and Several Variables and Their Appli-
cations, World Scientific Publishing Co., River Edge, NJ, 609-629, 1993.
21. Kincaid, D. R., Young, D. M.: Note on parallel alternating-type iterative meth-
ods, in S. D. Margenov P. S. Vassilevski, editors, Iterative Methods in Linear
Algebra II, IMACS, New Brunswick, 131-139, 1996.
22. Mai, T-Z., Young, D. M.: ITPACK 3B user's guide (preliminary version), Re-
port CNA-201, Center for Numerical Analysis, University of Texas at Austin,
January 1986.
23. Oppe, T. C. Joubert, W. D., Kincaid, D. R: NSPCG user's guide, version 1.0:
A package for solving large sparse linear systems by various iterative methods,
Report CNA-216, University of Texas at Austin, Center for Numerical Analysis,
April 1988.
24. Rice, J. R., Boisvert, R.: Solving Elliptic Problems Using ELLPACK, Spring-
Verlag, New York, 1985.
25. Saad, Y., Schultz, M. H.: GMRES: A generalized minimal residual algorithm for
solving nonsymmetric linear systems, SIAM J. Scientific and Statistical Com-
puting, 7:856-869, 1986.
26. Varga, R S.: Matrix Iterative Analysis, Prentice-Hall, Englewood Cliff, New
Jersey, 1962.
27. Young, D. M.: Iterative Methods for Solving Partial Difference Equations of El-
liptic Type, Ph. D. thesis, Harvard University, Mathematics Department, Cam-
bridge, MA, May 1950.
28. Young, D. M.: Iterative methods for solving partial difference equations of el-
liptic type, Trans. Amer. Math. Soc., 76:92-111, 1954.
29. Young, D. M.: Convergence properties of the symmetric and unsymmetric suc-
cessive overrelaxation methods and related methods, Math. Comp., 24(112):793-
807, 1970.
30. Young, D. M.: Iterative Solution of Large Linear Systems, Academic Press, New
York, 1971. (Reprinted by Dover, New York, 2003.)
31. Young, D. M.: Second-degree iterative methods for the solution of large linear
systems, J. Approximation Theory, 5:137-148, 1972.
32. Young, D. M.: On the accelerated SSOR method for solving large linear systems,
Advances in Mathematics, 23(3):215-271, 1977.
33. Young, D. M.: A historical review of iterative methods, in A History of Scientific
Computation S. G. Nash, editor, Addison-Wesley, Reading, MA, 180-194, 1989.
Also in Report CNA-206, Center for Numerical Analysis, University of Texas
at Austin, February 1987.
34. Young, D. M., Downing, J. A., Wheeler, M. F.: On the use of the modified suc-
cessive overrelaxation method with several relaxation factors, in W. A. Kalenich,
editor, Proceedings of IFIP 65, Spartan Books, Washington, D.C., 1965.
35. Young, D. M., Eidson, H. D.: On the determination of the optimum relaxation
factor for the SOR method when the eigenvalues of the Jacobi method are
complex, Report CNA-l, Center for Numerical Analysis, University of Texas at
Austin, September 1970.
36. Young, D. M., Gregory, R T.: A Survey of Numerical Mathematics, Volume 1,
Addison-Wesley, New York, 1972. (Reprinted by Dover, New York, 1988.)
558 D.R. Kincaid
1 Introduction
The numerical algorithms have become quite complex and require dynamic
data structures. The general practice is to use a mixed data structure for this
purpose [5]. In this paper a simplified version of the well-known relational
database model is presented, which is developed for parallel numerical compu-
tation. This database model was introduced by E. F. Codd [3, 6]. The great
advantage of this model is the flexible, simple and homogenous data structure.
Moreover, the application of special geometric search trees (oct-tree, ADT,
etc.) [4, 6, 13] can be avoided in such way. The simplified database model is
implemented in object oriented manner by using the programming language
C++.
The non-parallel version of this simplified relational database model for
non-structural mesh generation is investigated in [11]. This paper demon-
strates the efficiency of the parallel version in the case of an optimally paral-
lelized H-matrix (hierarchical-matrix) multiplier. The applied H-matrix tech-
nique is originated from W. Hackbush and B. N. Koromskij [8, 9, 2]. A great
advantage of the presented parallel H-matrix multiplier is that its implemen-
tation is independent of the geometrical complexity of the domains.
Our paper is organized as follows. Section 2 contains the description of
the simplified relational database model. The parallelized H -matrix multiplier
is discussed in Section 3. Section 4 is devoted to its application as Laplace
preconditioning matrix on a very complex 3-dimensional domain.
* This work was partly supported by the Grants T043258 and T042826 of the Hun-
garian National Research Found.
560 B. Kiss, A. Krebsz
In the case of the classical relational database model the data files are con-
sidered as tables, where the number of columns is fixed and the number of
rows may vary. Each column contains elementary data from the same data
type and can be referred to by a symbolic name. These symbolic names are
called field identifiers. Each row (record) is identified by its row (record) num-
ber and a part of a row (record) by its field identifier.
The numerical problems usually require only one ordering on one data
table. Hence it is worth to use combined data and index tables here to store the
records in the adequate key order. This idea is implemented as a RelDataBase
class with the following member functions.
For the sake of flexibility the records are called by reference, and the record
pointers are casted to character type. The third parameter KeyDesc of the
member function Create is a key descriptor structure for key segment identi-
fication (data type, beginning position).
struct KeyDesc
{
char KeyFieldType [KEY .sEGMENT .MAX] ;
char KeyFieldBegPos [KEY .sEGMENT .MAX] ;
};
The integrated data and index tables are realized by the well-known
Red-Black and B balanced tree structures [4, 6]. Both tree structures are
asymptotically optimal and widely used. The cost of the tree operations In-
sert, Delete, Update and Seek require less than or equal to O(log(n)) arith-
On the Relational Database Style Parallel Numerical Programming 561
metic operations, where n denotes the number of tree-nodes. However, the cost
of the sequential read in index order is only 0(1) in the practice.
We have applied the distributed database technique, using local simpli-
fied relational database on each process together with the message-passing
standard MPI [7].
//--------------------------------------------------------------------
void HMtxGen (int nLevelSubDom, int nLevelMesh,
double dRectMin[3] , double dRectMax[3])
{ int nLV, nLevel, nRoyBlock[3] , nColBlock[3];
nLevel = 0;
//--------------------------------------------------------------------
void HSubMtxGen(int nLevelSubDom, int nLevelMesh,
double dRectMin[3] , double dRectMax[3] ,
int nLevel, int nRoyBlock[3] , int nColBlock[3])
{int nLVi,nLVj,nLVk,nNextLevel,nRowSubBlocks[8] [3],nColSubBlocks[8] [3];
11----------------------------------------------------------------------
II Table: THMtx
II Content:Geometric search table for H-submatrix blocks.
II Key: nCSubDomld+nRSubDomld+nSubMtxType+nSubMtxld
II +nLevel+nColBlockCoords+nRowBlockCoords
struct RHMtx
};
11----------------------------------------------------------------------
II Table: TSimp
II Content:Geometric search table for terahedra.
II Key: nSubDomId+nLevel+nBlockCoords+nSimpId
struct RSimp
{
int nSimpId; II Tetrahedron identifier.
int nSubDomId; II Subdomain identifier.
int nLevel; II Fine mesh level.
int nBlockCoords[3]; II Search block coordinates.
int nLocation; II Location (LOC_INSIDE, LOCJBOUNDARY).
int nNodeIds[4]; II Node identifiers.
};
M M M (M )
'JiL(i),R(j) = L Cz . QI = LQI . CI = LQI' L !!.T,m· J:.L(i),C(k) ,
1=1 1=1 1=1 m=1
where the constant M depends only on the order of the used approximation.
The distribution of the vectors among the processes is determined by the
sub domain level H-matrix partitioning. The values CI = "L;;:;'=l !!.T,m .J:.L(i),C(k)
are computed on that subdomain which is identified by the H -submatrix col-
umn block. Three types of 'JiL(i),R(j) = HL(i),R(j),C(k) . J:.L(i),C(k) , i.e. of local
matrix-vector product computations by H-submatrix blocks are introduced.
The field nNeighbType of the table THMtx serves this purpose. The pos-
sible values of this field are the following.
The compact data exchange denotes the exchange of single and grouped
data together with the values Cz. In the grouped case we send only the param-
eters level L(i) and block coordinate R(j) of the row block cluster. The costs
of the local and global variants of the compact data exchange are O(log(~))
and O(log(p)· p), respectively. Here n is the number of unknowns and p is the
number of processes.
The computation of the global y = H . J:. product needs an additional step
to correct the local 'JiL(i),R(j) value~ at the boundaries of neighbouring sub do-
mains. For this purpose a simple local data exchange is applied by using the
MPI functions MPLSend and MPLRecv too. The cost of this correction
step is O((~)~).
564 B. Kiss, A. Krebsz
Since the computational cost of the values Cz and L;~l Cz 'Qz is O(log( ~). ~)
on each subdomain, the total computational cost of the global matrix-vector
product '}L = H . ~ is 0 (log (~) . ~ + p)
per subdomain.
The record structures of the table THCommNear and the array Ar-
rHCommFar are the next ones.
11----------------------------------------------------------------------
II Table: THCommNear
II Content:Data exchange among neighbouring subdomains.
II Key: nSubDomld+nDataType+nVarld+nLevel+nBlockCoords
struct RHCommNear
{
int nSubDomld; II Subdomain identifier.
int nDataType; I I Data type (DLSINGLE,DLGRDUPED).
int nVarld; II Variable id. (single case).
int nLevel; II Division level(grouped case).
int nBlockCoords[3]; II Block coordinates(grouped case).
double dValues[M]; II Values.
};
11----------------------------------------------------------------------
II Array: ArrHCommFar
II Content:The array for data exchange among non-neighbouring
II subdomains.
pArrCommFarArr = new double [nProcNum*M] ;
II The number of processes multiplied by M.
4 Numerical Results
Our parallel H-matrix multiplier implementation has been tested as Laplace
preconditioning matrix. The enclosed test results regard to quasi-uniform reg-
ular tetrahedral triangulations of the very complex water domain fl of the
RABA Euro 2 diesel engine. The shape of this domain can be seen in Figure
1.
We have tested the parallel solution of the three-dimensional Poisson equa-
tion
-Llu = f in fl, u = 0 on afl
by using the preconditioned conjugate gradient (peG) [5, 10, 12] algorithm.
The Poisson equation has been discretized by the finite element method,
applying piecewise linear finite elements. The optimal preconditioning matrix
566 B. Kiss, A. Krebsz
P was given in the form P = V2, where the matrix V is the H -matrix ap-
proximation of the bilinear form
V (u, v) = 11
n n
u (;r.) . v
I~ -It
(Y)
2
I
which is an H- 1 j2 norm representation. It is discretized by piecewise linear
finite elements on the level of the given tetrahedral mesh, and by piecewise
bilinear elements on the clusters, which are determined by the H-submatrix
blocks.
and the error bound is c = 10- 4 . The preconditioning matrix D denotes the
inverse of the main diagonal of the matrix of the discretized Laplace operator.
From our results we can conclude that the matrix P is an efficient precon-
ditioning matrix, but its computation is fast enough in parallel environment
only.
References
12. Samarskij, A. A., Nikolajev, E. S. (1989): Numerical Methods for Grid Equations,
vol. 1 and 2, Basel, Bikhiiuser.
13. Samet, H. (1990): Application of Spatial Data Structures, Addison-Wesley, Read-
ing, MA.
A Dynamical System Describing Evolution of
the Implicit Surfaces in Incompressible Viscous
Liquids
The explicit dependence on the unknown, implicit, surface S(V'xL t)) can
be circumvented by introduction of the so-called level set function G, [11].
Namely, let G(x, t) be a smooth function measuring the distance to the in-
terface, being positive inside of a gas domain, and let B denote the Heaviside
function. Then X = BoG. Substituting this representation of the Indicatrix
into the transport equation (4) yields
Assuming that the velocity field is known for any xED, we obtain from (5)
the following level set formulation, [11],
The Lagrangian of our system includes the kinetic, potential, and surface en-
ergies. We assume that the density of the liquid and the density of the gas are
constant. Thus
p(X(x, t), t) = PGX(X(x, t), t) + pL(l - X(X(x, t), t)), xED, t > 0, (8)
Selfcontained Dynamics of Implicit Surfaces in Liquids 571
where Pa, PL are the density of the gas and liquid, respectively. We consider
a gas-liquid system with total energy of a Mumford-Shah type, [9]' given by
C:(X, t) ~f ~ J (!
n
po(x) X(x, t)) 2 dx + E(X, t),
where
J
(9)
E(X, t) ~f po(x) g. X(x, t) dx + a II\7x(·, t)11 (0), and
n
X (X(x, t), t) = X(x, 0).
The surface energy, a II\7x(·, t)11 (0), i.e., the total variation of \7x, represents
the perimeters of the sub domains occupied by the gas multiplied by a sur-
face tension coefficient a which is a positive measured quantity. The vector g
represents the gravitation force field.
We apply a generalization of the Principle of Stationary Action, [8]' to
the Lagrangian (9). The equlibrium dynamics obtained from this principle
are given by the Euler equations with an added dissipative term. Moreover,
the pressure drop across the gas-liquid interfaces satisfies the Laplace-Young
equation. To account for the dissipative effects, we define the action as
A(X , t 1, t)
2 ~f 1 t2
t1
e).,(t-to)c:(X , t) dt , (10)
where A > 0 represents the Rayleigh's friction dissipation coefficient, and to :::::
o is a given initial time. Then
(11)
1 The configuration space is not a linear function space. Consequently, the differ-
entiation has to be considered in the tangent space to M where summation is
allowed.
572 P. Kloucek et al.
+ lT J
S?
1: ~e.\(t-to) J
2
po(x) (:tX(X,t)r dxdt
1: ~ J
n (19)
=
2
Po (x) (d~X(X'T)r dX(AT+1)2 dT.
n
Similarly,
(20)
- clef-
Let Tk = k t:n, and let X k (.) = X(., k b. T). In accordance with the PSA,
we assume that Xk+I, Xk - I are given states. We are looking for Xk which is
a solution of the minimization problem
inf {ex" + 1)' ~! Po (x) F (it) dx + ("7)' E (it) Iit E:M}, whece,
Since
e.\(tk+ 1 -to) - 1
b.T=------
A
we have for A « 1
(22)
Hence, going back to the original time coordinate and using the above approx-
imation, we obtain
xk ~f Arginf i
XEM
J po(x)F (X) dx + (6tk)2 E (X) . (23)
n
574 P. Kloucek et al.
Remark 3.1 (Weak form (17) and the variational principle (23). The
variational formulation (23) is obtained from the PSA applied to the time
interval (tk-l, tk+l) by using the trapezoidal rule to provide an integration rule
in time. Computing the Gateaux derivative of the functional appearing in (23)
and using the discrete integration by parts in time, we recover the implicitly in
time discretized weak formulation (17). D
Remark 3.2 (Time step 6tk). It follows from the above calculations that
if A « 1 then, from (18) and (22),
D
We first state the following result before we proceed to make the link be-
tween the minimizer of the variational problem (24) below and the flow maps.
Theorem 3.3. Let us assume that n is bounded with piece-wise smooth bound-
ary, and that Xk+l, Xk E X are given. Then there exists a unique minimizer
Xk of the following variational problem:
Proof. The proof follows from the strict convexity of the Wasser stein distance
on the relaxed space Xrelaxed which is defined as X but with BV(n, [0, 1]) as the
base space. The existence of the unique minimizer of (24) is then obtained by
the density argument, which follows from the compatibility of the Wasser stein
metric with a convergence in the LP -spaces. The proof can be found in [7]. D
where
(
} M-l(W)
XO(x) dx = 1
W
xl(x) dx, for all Borel subsets w of Q },
(27)
where q'>k-l,k is the unique solution of the Monge-Ampere problem (25). More-
over, the link mentioned above is provided by
xE Q. (28)
4 Dynamical System
(29)
(30)
. d
dlV -XT(Xk,k+l(X)) I = 0 for a. a. x E fl. (31)
dT T=O
Hence, we consider all possible variations of the form
It follows from [7] that for any f E W 1 ,2 (fl, JR 1 ) there exists h E Mk such
that the divergence free requirement in (32) is satisfied. Thus, we require Xk
to solve
(33)
d 1 . k+l 2 d 1 . k-l 2
-d - dlstW(XT' X ) I + -d - dlstW(XT' X ) I
J
T 2 T=O T 2 T=O
~ (Xk+l(x) - 2Xk(x) + Xk-l(X)) f(x) dx
n
-J J
(34)
(x - h(x)). \7f(h(x))Xk(X) dx - (f(h(x)) - f(x)) Xk(x) dx
n n
for all f E W 1 ,2 (fl,JR 1 ), hE Mk.
The last two integrals vanish due to the transport equation. After some calcu-
lations, suitable approximations, and the limit procedures E -7 0+, .6t -7 0+,
(again see [7] for details), we find that the dynamical system is given by a de-
generate equation
Pc (OttX(x, t) - >'OtX(x, t)) = div (X(x, t) ((pc - PL) g + 0:\7 H(x, t))),
(35)
x E S(\7X), t > O.
xE D, (36)
Since PLagrange (x, k 6. t) = PEuler (\1 tjJk,k+l (x), k 6. t), it follows from (36) that
the appropriate form of the density passed to the Euler equations written in
the fixed frame of reference, in the time-semidiscrete form, is the following
In other words, the tracking has to be computed one 6.t ahead of the solution
of the Euler equations. D
The Wasser stein distance, denoted distw(., .), is defined by, [13], [2],
infimum in (39) is attainable and, c. f. [Proposition 3.1, [2]], that the unique
optimal measure f-l* has the form
It can be shown, [Proposition 3.1, [2]], that the representating map is almost
everywhere equal to the gradient of a convex Lipschitz continuous scalar func-
tion which has an, almost everywhere, invertible gradient. Substituting the
measure f-l* back into (39), we obtain an alternative formulation of the Wasser-
stein distance more closely related to the Monge-Kantorovich problem, ([10]'
[4], [1], [5], [3], [6]' [14]). Namely, if the marginals have the same mass, i. e. ,
In so(x) dx = InSl (x) dx, it follows from the above discussion, that there exist
¢*, convex and Lipschitz continuous, such that '\l¢* E .G(so, Sl) and invertible,
solving
inf {{.,(X) I
1M (x) - xl' dx M E L(.", .<d, M inv"tibl, }, Whff'
:1(so, Sl) ~f {M: {so> O} <:::: 0 f--' {Sl > O} <:::: 0 I M measurable},
1
JM-l(w)
J
n
Sl(X)rp(X) dx = J
n
so(x)rp(M(x)) dx, for all rp E CO(O), (44)
for M E M. o
6 Acknowledgement
The authors would like to thank Professor Olivier Besson for his hospitality
and support while we worked on the presented material in the Institut de
Selfcontained Dynamics of Implicit Surfaces in Liquids 579
References
1. J.-D. Benamou and Y. Brenier. A numerical method for the optimal time-
continuous mass transport problem and related problems, pages 1-11. Contempo-
rary Mathematics: Monge-Ampere Equation:Application to Geometry and Op-
timization. American Mathematical Society, first edition, 1999. L. A. Caffarelli
and M. Milman, eds.
2. Y. Brenier. Polar factorization and monotone rearrangement of vector-valued
functions. Comm. Pure Appl. Math., 64:375-417, 1991.
3. L. A. Caffarelli. Boundary regularity of maps with convex potentials. Ann. of
Math., 2(3):453-496, 1996.
4. B. Dacorogna and J. Moser. On a partial differential equation involving the
jacobian determinant. Ann. Inst. H. Poincare Anal. Non Lineare, 7(1):1-26,
1990.
5. L. C. Evans. Partial differential equations and monge-kantorovich mass transfer.
Lecture Notes, pages 286-294, 1998.
6. W. Gangbo and R. J. McCann. The geometry of optimal transport. Acta Math.,
(2):113-161, 1996.
7. P. Kloucek, M. V. Romerio, and J. L. Wightman. The variational formulation
of tracking free surfaces in liquids. SIAM, J. Appl. Math., 2003. submitted.
8. C. Marchioro and M. Pulvirenti. Mathematical Theory of Incompressible Non-
viscous Fluids. Applied Mathematical Sciences. Springer-Verlag, 1994.
9. J.-M. Morel and S. Solomini. Variational Methods in Image Segmentation, vol-
ume 14 of Progress in Nonlinear Differential Equations and Their Applications.
Birkhiiuser, Boston, 1st edition, 1995.
10. J. Moser. On the volume elements on a manifold. Trans. Amer. Math. Soc.,
120:286-294, 1965.
11. S. Osher and R. Fedkiw. Level Set Methods and Dynamic Implicit Surfaces,
volume 153 of Applied mathematical Sciences. Springer Verlag, New York, 1
edition, 2003.
12. F. Otto. Evolution of microstructure in unstable porous media flow: a relaxation
approach. Comm. Pure Appl. Math., 2002. to appear.
13. S. T. Rachev. Probability metrics and the stability of stochastical models. Wiley,
New York, 1 edition, 1991.
14. S. T. Rachev and L. Ruschendorf. Mass transportation problem, volume I, II of
Probability and its Application. Springer-Verlag, New York, 1998.
Discrete Maximum Principles in Finite
Element Modelling
1 Introduction
n . n __ meas2ACDmeas2BCD
vp vq - 9 (meas3ABCD)2 coseY, (1.1)
Discrete Maximum Principles in Finite Element Modelling 581
where a is the angle between the faces ACD and BCD (see Fig. 1) and the
symbol measd stands for d-dimensional measure. Scalar product (1.1) is, there-
fore, independent of all the other 5 dihedral angles of the tetrahedron ABCD.
If a > 7r/2 then the scalar product (1.1) is obviously positive. Hence, each
obtuse dihedral angle of the tetrahedron ABCD gives a positive contribution
to the corresponding off-diagonal entry of the element stiffness matrix, when
solving a boundary value problem with the Laplace operator by the finite
element method.
ex:
A
c
Fig. 1. A tetrahedron whose two faces include the angle a
Note that the same is also true (see [11]) for a wider class of nonlinear
elliptic problems of the form
u=o an,
on (1.3)
where ,\ is a positive smooth function and n is a bounded polyhedral domain
with Lipschitz boundary an. Equation (1.2) describes, for instance, a station-
ary nonlinear heat conduction or magnetic potential in ferromagnetic media
(cf. [13]).
According to [5, p. 206]' problem (1.2)~(1.3) satisfies the maximum princi-
ple, i.e., if f ::::; 0 then the maximum of u over n is attained on the boundary
an (see also [14]). It is thus natural to look for a class of finite elements such
that the same implication is satisfied. Note that, e.g., for bilinear rectangular
elements the discrete maximum principle can be violated (see [1, p. 254]). The
same is true also for trilinear block elements (a simple example is given in [12,
p. 562]). However, if we decompose these elements into nonobtuse triangles or
tetrahedra keeping the number of degrees of freedom, the discrete maximum
principle will hold.
582 S. Korotov, M. Kflzek
2 N onobtuse tetrahedralizations
A tetrahedron is said to be nonobtuse if all six dihedral angles between its
faces are less than or equal to 7r /2. By a tetrahedralization we shall mean
a face-to-face partition of [2 into tetrahedra in the standard sense (see [3]).
A tetrahedralization is said to be nonobtuse if it contains only nonobtuse tetra-
hedra.
By [3, p. 150], linear triangular nonobtuse elements guarantee the validity
of the discrete maximum principle in the plane. This result is generalized into
three-dimensional space in [11], namely, linear tetrahedral elements applied to
problem (1.2)-(1.3) on nonobtuse partitions yield irreducibly diagonally dom-
inant stiffness matrices (whose all off-diagonal entries are non positive). It is
well known (see [15, p. 85]) that such matrices are monotone. Nonobtuse tetra-
hedral partitions thus guarantee the validity of the discrete maximum principle
for problem (1.2)-(1.3) solved by linear elements, i.e., we have Uh ::; 0 provided
f ::; 0, where Uh is the continuous and piecewise linear Galerkin approxima-
tion of the solution of (1.2)-(1.3). In other words, Uh attains its maximum
on the boundary 8[2, if the homogeneous Dirichlet boundary conditions are
considered. Note that nonobtuse tetrahedralizations enable us to prove the
Loo-convergence of finite element approximations to the weak solution (see,
e.g., [4]).
According to [7], an arbitrary polyhedron can be decomposed into no nob-
tuse tetrahedra. In order to improve the discretization error a given partition is
refined locally or globally. That is why, the issue of preserving the nonobtusity
appears during the refinement process.
In [6] we give a global refinement procedure yielding nonobtuse tetrahe-
dra over the whole domain. This procedure is briefly described in Section 3.
However, such a technique requires a large amount of computer memory to
store the associated stiffness matrix. Therefore, in Section 4 we introduce a lo-
cal refinement procedure yielding nonobtuse partitions that refine only near
a given vertex, where a singularity of the exact solution may appear (see, e.g.,
[2], [8], [9]).
D
Fig. 2. A path tetrahedron
The advantage of the above approach is that a common face F of any two
adjacent tetrahedra (satisfying the assumptions of Theorem 3.3) in a given
tetrahedralization of a polyhedral domain is divided in a unique way. This
enables us to develop global refinement techniques yielding only nonobtuse
subtetrahedra (see [6] for details).
The main idea of generating local refinement technique producing only nonob-
tuse partitions is exposed in the following theorem.
584 S. Korotov, M. Krizek
Theorem 4.1. Let ABCD be a path tetrahedron whose edges AB, BC,
and CD are mutually perpendicular. Then there exists an infinite family of
nonobtuse tetrahedralizations of ABC D into path tetrahedra that locally re-
fine ABC D in a neighbourhood of the vertex A.
For a detailed constructive proof see [2] or [9]. Its main idea is sketched on
Fig. 4. Using several sophisticated orthogonal projections, we first subdivide
the tetrahedron ABCD into five nonobtuse tetrahedra. Then we show that
the path tetrahedron AT SQ from Fig. 4 is similar to the original tetrahedron
ABCD. The subtetrahedron ATSQ can be now decomposed into 5 subtetra-
hedra in a similar way as ABCD, and thus we can get further refinement near
the point A. In this manner, we obtain recursively the required infinite family
of nonobtuse tetrahedralizations.
An algorithm for a local refinement of a cube producing only nonobtuse
tetrahedra is presented in [8]. If several cubes meet at one point, then we can
apply this algorithm to each of them so that the whole partition remains face-
to-face. For instance, in Fig. 5 we see such a local refinement a polyhedral
domain D = (-1,1)3 \ [0,1)3 which presents the union of 7 cubes. Each cube
is first divided in a standard way into 6 path tetrahedra having a common
vertex in the reentrant corner of D. For each of the 7 x 6 = 42 tetrahedra
we apply the algorithm given by Theorem 4.1 such that the partition of each
path tetrahedron is just the mirror image of the partition of any adjacent
tetrahedron having a common face. Note that the concave (reentrant) corner
in Fig. 5 is called the Fichera corner or the Fichera vertex.
Discrete Maximum Principles in Finite Element Modelling 585
D
Fig. 4. Partition of a path tetrahedron into 5 path subtetrahedra
Fig. 5. Local refinement technique producing nonobtuse tetrahedra near the Fichera
corner
5 Concluding remarks
Acknowledgement
This paper was supported by Grant InBCT of TEKES, Finland and Grant
No. A 1019201 of the Academy of Sciences of the Czech Republic.
References
1. Axelsson, 0., Barker, V.A. (1984): Finite Element Solution of Boundary Value
Problems. Theory and Computation. Academic Press, New York
2. Beilina, L., Korotov, S., Kflzek, M. (2003): Application of the local nonobtuse
tetrahedral refinement techniques near Fichera-like corners. Preprint 2003-02,
Chalmers Finite Element Center, Goteborg, 1-16
3. Ciarlet, P.G. (1991): Basic error estimates for elliptic problems. In: Ciarlet, P.G.,
Lions, J.L. (eds) Handbook of Numer. AnaL, vol. II. North-Holland, Amsterdam
4. Feistauer, M., Felcman, J., Rokyta, M., Vlasek, Z. (1992): Finite-element solution
of flow problems with trailing conditions. J. Comput. Appl. Math., 44, 131-165
5. Gilbarg, D., Trudinger, N.S. (1977): Elliptic Partial Differential Equations of
Second Order. Springer-Verlag, Berlin
6. Korotov, S., Kflzek, M. (2001): Acute type refinements of tetrahedral partitions
of polyhedral domains. SIAM J. Numer. Anal. 39, 724-733
7. Korotov, S., Kflzek, M. (2002): Dissection of an arbitrary polyhedron into nonob-
tuse tetrahedra. Preprint nr. B 3/2002 of Dept. Math. Inform. Technol., Univ. of
Jyviiskylii, 1-6
8. Korotov, S., Kflzek, M. (2003): Local nonobtuse tetrahedral refinements of
a cube. Appl. Math. Lett., 16, 1101-1104
9. Korotov, S., Kflzek, M. (2003): Global and local refinement techniques yielding
nonobtuse tetrahedral partitions (accepted by Comput. Math. Appl.), 1-10
10. Korotov, S., Kflzek, M., Neittaanmiiki, P. (2001): Weakened acute type condition
for tetrahedral triangulations and the discrete maximum principle. Math. Compo
70, 107-119
11. Kflzek, M., Lin Qun (1995): On diagonal dominance of stiffness matrices in 3D.
East-West J. Numer. Math. 3, 59-69
12. Kflzek, M., Liu, L. (2003): On the maximum and comparison principles for a
steady-state nonlinear heat conduction problem. Z. Angew. Math. Mech. 83,
559-563
13. Kflzek, M., Neittaanmiiki, P. (1990): Finite Element Approximation of Varia-
tional Problems and Applications. Pitman Monographs and Surveys in Pure and
Applied Mathematics vol. 50, Longman Scientific & Technical
14. Protter, M.H., Weinberger, H.F. (1967): Maximum Principles in Differential
Equations. Prentice-Hall, New Jersey
15. Varga, R.S. (1962): Matrix Iterative Analysis. Prentice-Hall, New Jersey
A Posteriori Error Estimation in Terms of
Linear Functionals for Boundary Value
Problems of Elliptic Type
Summary. The paper deals with a posteriori error estimation in terms of special
problem-oriented quantities, represented as a linear functionals that control the be-
havior of a solution in certain sub domains , along some lines, or at especially inter-
esting points. The method of estimating such quantities is based on the analysis of
the adjoint boundary-value problems, whose right-hand sides are formed by the con-
sidered linear functionals. On this way, we propose a new effective approach based
on two principles: (a) the original and adjoint problems are solved on non-coinciding
meshes, and (b) the term presenting the product of gradients of errors of the pri-
mal and adjoint problems is estimated by using the "gradient averaging" technique.
The model problem of elliptic type is analysed and the results of numerical tests are
presented.
1 Introduction
Known methods (see, e.g., [1], [6]' [11]) find estimates of < C, U - uh > for
a Galerkin approximation Uh by employing an additional (adjoint) problem,
whose right-hand side is formed by the functional C . If the Galerkin approx-
imation of the adjoint problem is computed on the same mesh as Uh, then
the functional < C, U - Uh > is expressed via a certain integral functional,
which can be estimated by using, e.g., "equilibrated residual method" (see,
e.g., [1], [11]).
In the present work, we propose a new way of estimation of "quantities of
interest". It is based on two principles: (a) the original and adjoint problems
are solved on non-coinciding meshes, and (b) the term presenting the product
of errors arising in the primal and adjoint problems is estimated by the gradient
recovery technique widely used in various applied problems (see [7], [10]' [12],
[13]). This differs our approach from others, where it is usually assumed that
the Galerkin approximations of the primal and adjoint problems are computed
in the same finite dimensional subspaces.
The effectivity of the method suggested in this paper, strongly increases
when one is interested not in a single solution of the primal problem for a con-
crete data, but analyzes a series of approximate solutions for a certain set of
boundary conditions and various right-hand sides (such a situation is typical
in the engineering design when it is necessary to model the behavior of a con-
struction for various working regimes). In this case, the adjoint problem must
be solved only once for each "quantity of interest", and its solution can be
further used in testing the accuracy of approximate solutions of various primal
problems.
2 General scheme
and the Banach space V, which is continuously embedded into H, with the
norms denoted by I ·llv. Let A E £(V, Y), A E £(Y, Y), and
cIilYII~ : : : (Ay, y)y ::::: c211yll~ 'Vy E Y, c31lwllv::::: IIAwlly 'Vw E Vo,
(2.1)
where Vo is a subspace of V and Cl, C2, C3 are positive constants. Given j E Vo*,
consider the following problem:
Primal Problem (P): Find U E Vo, such that
(AAu, Aw)y = < j, w > 'Vw E Vo, (2.2)
where < .,. > denotes the duality pairing of the spaces Vo and Vo*.
Let C be another element of Vo*, which forms the "quantity of interest"
< C, U - il >, for an arbitrary element il E Va viewed as an approximation
A Posteriori Error Estimation in Terms of Linear Functionals 5S9
where A* is the operator adjoint to A, i.e., (Ay, z)y = (y, A*z)y Vy, z E Y.
In the friequently encountered cases of the selfadjoint operators, the left-
hand sides of (2.2) and (2.3) coincide and both the problems are associated
with a functional of the type
1
J(w) = '2 (AAw, Aw)y + < /-L, w >,
which is known to have a unique minimizer on Vo for any /-L E Vo*.
J
f2
A \7u· \7w dx = J
f2
fw dx \lw E Va, (3.5)
A Posteriori Error Estimation in Terms of Linear Functionals 591
o
where Vo = W HD). Let U E Vo be an approximation of u. Assume that we are
interested in the value of
J
n
A'Vv· 'Vw dx =< £, W > Vw E Vo. (3.7)
Dh = U T~, DT = U T~ (3.11)
T~ETh T~ETT
where the respective triangulations are denoted by T", and '4, and their el-
ements are denoted by T~ and T4, respectively. Consider the corresponding
finite dimensional problems (we assume for a moment that Dh = DT = D):
Problem (ph): Find Uh E Vh such that
J
n
A'VUh' 'VWh dx = J
n
fWh dx VWh E Vh, (3.12)
J
n
A 'VvT . 'VwT dx =< £, WT > VWT E Vr- (3.13)
(3.14)
(3.15)
where
where luI2,2,n denotes the L 2 -norm of the second derivatives of u, which exist
due to the conditions imposed on Problem (P), see below.
The estimate (3.17) is sharp, in the sense that it gives a correct asymp-
totical order of the term Eo, whereas the terms E1 (Uh' vT) and E1 (Uh' v T) are
asymptotically smaller ones. Really,
(3.18)
and
A Posteriori Error Estimation in Terms of Linear F'unctionals 593
where m is any positive integer greater than 2, p,(h,7) contains higher order
terms, and the constant C does not depend on hand 7.
In this subsection, we present numerical results for the model problem (more
numerical tests on the subject can be found in [8]).
U = 0 on 8n,
where the exact solution is the infinitely smooth "hat-function":
< £, U - Uh >= J
w
(u - Uh) dx, where w = (0.125,0.250) x (0.125,0.250).
4 Final comments
1) The computations made in the item a) can be further used for estimation of
errors of an approximate solution obtained on another mesh and for different
righ-hand side functions f.
2) The approach is valid for another boundary conditions [8]' is suitable for
estimating local integral norms [8]' and can be applied to problems in linear
elasticity theory [9].
References
1. Ainsworth, M, Oden, J. T. (2000): A posteriori error estimation in finite element
analysis. John Wiley & Sons, Inc.
2. Babuska, 1., Rheinbold, W. C. (1978): Error estimates for adaptive finite element
computations. SIAM J. Numer. Anal. 15, 736-754.
3. Babuska, 1., Rheinbold, W. C. (1978): A posteriori error estimates for the finite
element method. Internat. J. Numer. Methods Engrg. 12, 1597-1615.
4. Babuska, 1., Strouboulis, T. (2001): The Finite Element Method and its Relia-
bility. Oxford University Press Inc., New York.
5. Bangerth, W., Rannacher, R. (2003): Adaptive finite element methods for dif-
ferential equations. Lectures in Mathematics ETH Zurich. Birkhauser Verlag,
Basel.
6. Becker, R., Rannacher, R. (1996): A feed-back approach to error control in finite
element methods: Basic approach and examples. East-West J. Numer. Math. 4,
237-264.
7. Hlavacek, 1., Krizek, M. (1987): On a superconvergent finite element scheme for
elliptic systems. ApI. Mat. 32, 131-154.
8. Korotov, S., Neittaanmiiki, P., Repin, S. (2003): A posteriori error estimation
of goal-oriented quantities by the superconvergence patch recovery. J. Numer.
Math. 11,33-59.
9. Korotov, S., Neittaanmaki, P., Repin, S. (2004): A posteriori error estimation
in terms of linear functionals for problems in the elasticity theory. Russian J.
Numer. Anal. Math. Modelling (to appear).
10. Krizek, M., Neittaanmaki, P. (1984): Superconvergence phenomenon in the finite
element method arising from averaging gradients. Numer. Math. 45, 105-116.
11. Oden, J. T., Prudhomme, S. (2001): Goal-oriented error estimation and adap-
tivity for the finite element method. Comput. Math. Appl. 41, 735-756.
12. Verfiirth, R. (1996): A review of a posteriori error estimation and adaptive mesh-
refinement techniques. Wiley-Teubner.
13. Zienkeiewicz, O. C., Zhu, J. Z. (1987): A simple error estimator and adaptive
procedure for practical engineering analysis. Internat. J. Numer. Methods Engrg.
24, 337-357.
Numerical Solution of Flow In Backward Facing
Step
Summary. The work deals with numerical testing of two different numerical meth-
ods based on finite volumes (FV) and finite elements (FE) for different Reynolds
numbers. The finite volume method is based on upwind scheme (third order) for con-
vective terms and central second order for dissipative terms. Finite element method
consists of stabilization of weak formulation for higher Reynolds numbers with the
help of streamline-upwind (Petrov-Galerkin) modification.
Authors compare both numerical results with experiment for laminar Re E
(100,700), where steady solution exists, using the length of separation domain on
lower wall as well as on upper wall for Re 2: 400.
1 Introduction
The work deals with the numerical solution of the 2D and 3D flow through
backward facing step. The mathematical model is the system of Navier-Stokes
or Reynolds averaged Navier-Stokes (RANS) equations with two-equation tur-
bulence models. Concerning BFS flow in 2D, we focus ourselves on the transi-
tional regimes from laminar to turbulent flow. The start of transition is investi-
gated using the model of laminar flow, which allows us to make even the small
differences between different numerical schemes visible. The end of transition
is investigated using the RANS model. A comparison with measurement data
is made. The laminar model is extended to 3D.
We are also interested in comparison of results obtained with finite volumes
and finite elements schemes. We discuss the stabilization of finite elements
with the help of advanced SUPG method in order to obtain robust solver for
unsteady incompressible laminar flows.
2 Mathematical model
The equations used to solve laminar flow of an incompressible viscous fluid are
the N avier-Stokes ones, which in 2D and Cartesian coordinates (x, y) have the
form
Numerical Solution of Flow in Backward Facing Step 597
where
W = col lip, u, vii, R = it = diag 110, 1, 111,
F = uf + col 110,p, all, G = vf + col 110, O,pll, f = col 111, u, vii (2)
Re is Reynolds number, u, v components of velocity vector, p static pressure
divided by density, and colli, I denotes column vector.
In the case of turbulent flow, we solve the Reynolds averaged Navier-Stokes
(RANS) equations to obtaind mean flow-field. The RANS equations have the
form of (1) with (u, v), p being mean values of velocity and pressure and right
hand side replaced by
(3)
with
(4)
where Tij is Reynolds stress tensor. It is approximated using low-Re two-
equation turbulence models mentioned in Section 5.
The formulation of laminar and turbulent 2D backward-facing step flow was
completed by the boundary conditions in the inlet: fully developed channel
flow, on the walls: zero velocity, and in the outlet: zero streamwise derivative
of velocity.
i -i
a numerical approximation of
ow,·· IDi,jl
R--'-'} + Fdy - Gdx = R1- Wxdy - Wydx, (6)
ot aDi,j Re aDi,j
where IDi,j I is area of the cell. The finite volumes compose structured (multi-
block) grid consisting of quadrilaterals. The grid is orthogonal and for turbu-
lent cases refined along walls.
598 K. Kozel et al.
The integrals in (6) are approximated using mid-point rule. Cell face values of
P, G, Wx,
Wy thus need to be defined. The face between cells Di,j and DH1,j is denoted
by i+ 1/2, j, analogically the face between cells Di,j and D i ,j+1 will be denoted
by i,j + 1/2.
The discretization of convective and pressure terms consists of defining cell
face velocities ui+l/2,j, Vi,j+1/2 and pressure in the central way,
1 1
1, 2 2,), + u'+l ,)
u'+1/2 ,J' = -(u' 'l, ,), Pi+1/2,j = '2 (pi,j + PH1,j),
1 1
Vi,j+1/2 = '2 (Vi,j + Vi,j+1), Pi,Hl/2 = "2(Pi,j + Pi,j+1),
and an upwind biased "Monotone Upstream-centered Schemes for Conserva-
tion Laws" (MUSCL) [12] interpolation in the direction of grid lines, here the
line j = canst, for cell face momentum in (2):
1 1
fHl/2,j = fi + 4(1 + K,)(fi+1 - fi) + 4(1 - K,)(fi - fi-l), uHl/2 > 0,
1 1
fHl/2,j = fHl + 4(1 + K, )(fHl - fi) + 4(1 - K,) (fH2 - fHd, ui+1/2 :s: 0,(7)
where the constant index j has been ommited in the Lh.s. and AH1/2 is
obtained similarly in j-direction depending on vi,Hl/2' We have used K, = 1/3,
i.e. up to third order accurate upwind. The same form is used for turbulent
flows as well, although the formal accuracy is degraded by non-regularity of the
grid. On the other hand, the grid refinement is needed inside of shear layers,
where the diffusive term is dominant. There was no need to use a limiter
in Eq. (7) for momentum equations. In case of turbulence model equations
a limiter is sometimes necessary to achieve better convergence in the non-
turbulent regions. We have used a minmod one.
The approximation of cell face velocity derivatives RWx , RWy needed in diffu-
sive terms is central. It uses quadrilateral dual finite volumes constructed over
each face of primary volume - the vertices are located at end of primary face
and in centres of adjacent primary volumes. The mid-point rule quadrature
formula is again used, with face value of velocity defined as average of values
in vertices of dual cell [11].
The finite volume is common for all unknowns. In order to avoid pressure-
velocity decoupling in this collocated arrangement, a pressure diffusion term
Numerical Solution of Flow in Backward Facing Step 599
The system (1) with R given in (5) is integrated in time by means of explicit
methods - multi-stage Runge-Kutta methods and MacCormack method - or
by implicit backward Euler method. In the implicit method, the system of
algebraic equations is linearized by Newton method, which results in an block
nine-diagonal system. The system is solved iteratively using a line Gauss-Seidel
method, applying direct block tridiagonal system inversion on the lines. For
turbulent flows, solely implicit method was used.
au 3un +1 _ 4u n + un - 1
at'" 27
This leads to a solution of one nonlinear system in each time step. We use the
second order linearization of the nonlinear convective term, which leads to the
semiimplicit scheme
600 K. Kozel et al.
where
- (p, \7 . V) + (\7 . u, q) ,
1
f(V) = 27 (4u n -un -\v) (11)
and by (.,.) we denote the scalar product in the space L2(0). Moreover, we
require that u satisfies the Dirichlet boundary conditions. The couple (u, p)
represents a solution on time level n + 1, i.e. u n + 1 := u and pn+1 := p.
Further, the use of the Galerkin FEM restrict the weak formulation from
couple of spaces (X, M) to approximate spaces (Xh' M h), i.e. find Uh E
(Xh' Mh) such that
(12)
The couple (Xh' Mh) of finite element spaces should satisfy the BB condi-
tion, which guarantees the stability of a scheme: there exists a constant c > 0
such that
(p, \7. w) >
sup
wEX h
I W I 1,2,n - cllpll, (13)
Lh,n(U, V) = ~)K (2: u - vLu + (w· \7) u + \7p, (w. \7)V) K'
where (15)
a(Uh' Vh) + Lh,n(Uh, Vh) + L Tk (\7 . Uh, \7 . Vh) = !(Vh) + Fh,n(Vh). (16)
KETh
where we used the additional grad - div-stabilization, for details see, e.g., [2].
The space-time discretization leads to the solution of the following system
of equations repeatedly for each time step
(17)
where 11: E Rnh and p E Rmh are vectors whose components represent degrees
of freedom defining the velocity u and the pressure p, respectively, S is a non-
singular nh x nh matrix and B is an nh x mh matrix. In practical computations
we solve this system with the help of iterative methods or - for a smaller sys-
tem - with the help of advanced direct methods, see, e.g., [6]. UMFPACK is
a set of routines for solving unsymmetric sparse linear systems, Ax = b, using
the Unsymmetric MultiFrontal method written in ANSI/ISO C.
5 Results
The Reynolds number is defined using 2/3 of maximum velocity in the inlet
(i.e. bulk velocity in the laminar case), and step height.
The figure 1 shows the distance of reattachment point on the lower wall
from the step. The grid has regular spacing in both directions equal to 0.05.
The result of finite volume method on coarse grid with double spacing is also
shown.
At higher Reynolds numbers, the laminar flow separates on the upper wall
as well. The results are shown in Fig. 2. This separation is for higher Reynolds
numbers predicted in better agreement with measurement than the primary
one in Fig. 1.
602 K. Kozel et al.
For Re > 6600, the flow is fully turbulent and two dimensional in the mean.
The primary separation length becomes independent of Re and according to [7]
equal 8 step heights. Our numerical results give well higher values 8.83 using
SST (Shear Stress Transport) turbulence model [9] and 9.05 using sst mC k-
E (modified Chien k-E with shear stress transport) one [10]. Both turbulence
models are low-Re two-equation ones, first one uses k-w formulation, second
one k-E. Several profiles of streamwise component of velocity for these models
are compared in the upper part of Fig. 3. The lower part of same figure shows
the k-variable of turbulence models, which could be interpreted as kinetic
energy of velocity fluctuations. Also the structure of primary separation highly
depends on turbulence model, as shown in Fig. 4. However, high dependence
on numerical method (and grid) should be mentioned as well - e.g. the small
corner vortex for SST model disappears if the limiter in w-equation is used.
Acknowledgements
The work has been partially supported by the Research Plan of MSMT No.
210000003 and by grants No. 101/03/0018 and No. 101/02/0684 of GA CR.
14,-----,-----,------,-----,------,-----,------,
•
12
4
/'. " . measurement
/.
~ .-0- .. FEM fine
~/'
0 AUSM coarse
- "*- AUSM fine
2
100 200 300 400 500 600 700 800
Re
18
16
6 measurement x2
measurement x3
4 - ",,- AUSM x2 fine
- ",,- AUSM x3 fine
- <)-- FEMx2
- <)-- FEMx3
:Prf2Jrrl??????VPv 12 12 [7 II
o 2 4 6 8 10 12 14 16 18
1
20
o 2 4 6 8 10 12 14 16 18 20
Fig. 3. BFS flow with SST and sst me k-€ model, Re = 6667
604 K. Kozel et al.
Fig. 4. Detail of streamlines for BFS flow, SST (left) and sst mC k-E model (right),
Re = 6667
References
1. Feistauer M. (1993): Mathematical Methods in Fluid Dynamics. Longman Sci-
entific & Technical, Harlow.
2. Gelhard T., Lube G., Olshanskii M. A.: Stabilized Finite Element Schemes with
LBB-stable elements for incompressible flows (preprint)
3. Svacek P., Feistauer M.(2003): Numerical Simulations of Laminar Viscous In-
compressible Flow Over a Profile, Proc. Topical Problems of Fluid Mechanics,
IT CAS, Prague.
4. Dolejsi V. (2001): Anisotropic Mesh Adaptation Technique for Viscous Flow Sim-
ulation, East-West Journal of Numerical Mathematics, 9(1):1-24.
5. Turek S. (1998): Efficient Solvers for Incompressible Flow Problems, Springer.
6. Davis T. A.,Duff I. S. (1999): A combined unifrontal/multifrontal method for
unsymmetric sparse matrices, ACM Transactions on Mathematical Software, 25,
no. 1, pp. 1-19.
7. Armaly B.F. et al. (1983): Experimental and Theoretical Investigation of
Backward-facing Step Flow, J. Fluid Mech., 127, pp. 473-496.
8. Kozel K., Louda P., Prihoda J. (2001): Numerical Solution of 2D and 3D Incom-
pressible Viscous Flow Problems, Internal Flows Vol. 2, (eds) P. Doerffer, 5th
ISAIF Symposium, pp. 847-854.
9. Menter F. R. (1994): Two-Equations Eddy-Viscosity Turbulence Models for En-
gineering Applications, AIAA Journal, 32, No.8, pp. 1598-1605.
10. Kozel K., Louda P., Prihoda J.(2003): Numerical Solution of Turbulent Backward
Facing Step and Impinging Jet Flows, Modelling Fluid Flow '03, vol. I, (eds) T.
Lajos, J. Vad, Budapest, pp. 694-701.
11. Louda P., (2002): Numerical Solution of 2D and 3D Turbulent Impinging Jet
Flow, Ph.D. thesis, CTU Prague (in Czech).
12. Vee H. C. (1989): A Class of High-Resolution Explicit and Implicit Shock-
capturing Methods, NASA Technical Memorandum 10 1088.
Periodicity Properties of Solutions
to a Hysteresis Model in Micromagnetics
Martin Kruzfk
1 Introduction
v q: J(t, q) + D(q(t), q)
it
J(t, q(t)) ~ and (1)
where J is Gibbs' stored energy of the system, "Var" stands for the total
variation, D is a dissipation functional ensuring a rate-independent response
and s ~ t E [0, TJ, where [0, T] is the process time interval. The formulation of
rate-independent evolutionary processes in continuum mechanics by means of
(1) and (2) appeared in [10]. Its application to problems of rate-independent
606 M. Kruzik
2 Model
2.1 Stored energy and its relaxation
where Xn : JRn --) {O, 1} is the characteristic function of fl. The demagnetizing-
field energy thus penalizes non-divergence free magnetization vectors. Stan-
dardly, we will understand (3) in the weak sense, i.e. U m E Hl(JRn) will be
called a weak solution to (3) if the integral identity JJF.n (mxn - Vum(x)) .
Vv(x) dx = 0 holds for all v E Hl(JRn), where Hl(JRn) :::::: W 1 ,2(JRn) denotes
the Sobolev space of functions from £2 (JRn) with all first derivatives (in the
Periodicity Properties of Solutions to a Hysteresis Model 607
E(m) = 1 n
cp(m(x)) dx + ~ r
2 i~n
lY'u m (x)12 dx . (4)
inr cp(m(x)) - r
{
= h(x) . m(x) dx + ~ lY'um(xW dx ,
2hn
subject to (3), (m,u m ) E A x HI(jRn) ,
(5)
where the introduced notation G stands for Gibbs' energy and A is the set of
admissible magnetizations
E(v)=lcpevdx+~
n
r
2 i~n
IY'U(idev)(xWdx, (6)
where [vev](x) := IrRn v(s)vx(ds) and id : jRn ----> jRn is the identity. The
set of Young measures Y(D; S) c L;:(D; rca(S)) 3:' LI(D; O(S))* is the
set of all weakly measurable essentially bounded mappings x f-+ Vx : D ---->
rca(S) 3:' O(S)* such that Vx is a probability Radon measure supported on the
sphere S for a.a. xED; the adjective "weakly measurable" means that Ve V
is Lebesgue measurable for any v E O(S). A natural embedding of a magne-
tization m E LOO(D; jRn), Im(x)1 = Ms, to Y(D; S) is v = i(m) defined by
Vx = 5m (x) with 58 denoting the Dirac measure at s E S. We say that a se-
quence {vkhEN C Y(D; S) converges weakly* to v if limk--->oo(vk, f) = (v, f)
for any JELI(D;O(S)) or, equally, for any J = g0v with g E LI(D) and
v E O(S), where the tensorial notation means naturally [g0v](x, s) = g(x)v(s).
From the last fact, we can also say that v k ----> v weakly* if and only if
608 M. Kruzik
The model (7) represents a so-called mesoscopic level model, because a mini-
mizing Young measure v records some, but not full information about spatial
oscillations of a minimizing sequence of (5) around each "macroscopic" point
x through volume fractions described as the probability distribution v x . This
information makes possible to describe the effective magnetic properties by
means of the first moment, the "macroscopic" magnetization m = id e v, and
moreover seems sufficient for designing a dissipative mechanism in a good
agreement with experiments, which will be just exploited further.
For usual loading regimes and magnetically hard materials, one must consider
certain dissipation. Our simplified standpoint is that the amount of dissipated
energy within the phase transformation from one pole to the other can be
described by a single, phenomenologically given number (of the dimension
J/m 3 =Pa) depending on the coercive force He. Hence, we need to identify the
particular poles according to the magnetization vector. Inspired by [8, 10] and
considering L poles (L = 2 for uniaxial magnets or 6 or 8 for cubic magnets),
we define a continuous mapping .£ : S ---> 6.L where 6.L := {~ E JR.L; ~i 2:
0, i = 1, ... ,L, 2:~=l~i = I}. In other words, {.£l"",.£L} forms a partition
of unity on S such that .£i(S) is equal 1 if s is in i-th pole, i.e. s E S is in
a neighborhood of i-th easy-magnetization direction. Of course, .£(m) in the
(relative) interior of 6.L indicates m in the region where no definite pole is
specified. Hence .£ plays the role of what is often called an order parameter.
In terms of the mesoscopic microstructure described by the Young measure
v, the "mesoscopic" order parameter is naturally defined as
where [.£ e v](x) :=Is .£(S)Vx ( ds). Thus A is just a continuous extension of the
mapping m f-+ .£( m), i.e. if {mk} converges to 1I weakly* in L~ ([2; rca( S)),
then '£(mk) ---'- Av weakly* in LOO([2; JR.L).
To described phenomenologically the dissipative energetics, one must pre-
scribe a (pseudo) potential of dissipative forces (} as a function of the rate of
A. For rate-independent processes, this potential must be convex and homo-
geneous of degree-I. Considering a (not necessarily Euclidean) norm I . \L
Periodicity Properties of Solutions to a Hysteresis Model 609
on jRL, one can postulate [1(>-) = Hel>-IL, a constant He > is the so-
called coercive force. The energy needed to transform i-th pole to j-pole is
°
then Helei - ej IL with ei the unit vector with 1 at the i-th position. The
state of the specimen D (at a given time t) will be described by the couple
q = q(t) == (v, A) = ({vx,t}xE!?, A(" t)). Let us denote by Q the convex set of
admissible configurations:
For the analysis below, we will need to consider rather a certain regulariza-
tion of the stored energy E which would control spatial smoothness of A. For
this, we will augment E by a higher-order term
From now on, we will work with this regularized relaxed stored energy Ep
rather than E.
Let us abbreviate the Gibbs energy by
where
Let us agree to identify quite naturally the mapping t f-+ v(t) = {[v(t)]x}xEn
with a Young measure (x, t) f-+ Vx,t.
Definition 2. We say that the process q = q(t) satisfies the energy inequality
if for all s, t E [0, T], s :S t,
9(t, q(t))
'-..---'
+Var(D,q;s,t) :S9(s,q(s)) it (~~,q(B))dB
" v .f
, (16)
effective Gibbs' - -dissipated
- - - - - - Gibbs'
'--v -'
ener- reduced work of
energy at time t energy gy at time 0 external field
where the total variation over the time interval [s, t] is defined standardly,
without using explicitly any time derivative, as
I
Var(D,q;s,t):= sup LD(q(ti-d,q(ti)) (17)
i=l
where the supremum is taken over all J E N and over all partitions of [s, t] in
the form s = to < tl < ... < tJ-l < tJ = t.
In what follows "BV" stands for a space of maps with bounded variations.
where Q is from (9), 9 is from (13), and D from (10). If a solution (i.e. a global
minimizer) to (18) is not unique, we just take an arbitrary one for q~. Then
we define the piecewise constant interpolation qr E LOO(O, T; L':}'(D; rca(S)) x
LOO(D;lRL)) so that qrl«k-l)r,kTj = q~ for k = 1, ... ,TIT while for t = Owe
put qr(O) = qo. Besides, assuming
Periodicity Properties of Solutions to a Hysteresis Model 611
we have certainly 'H E C(O, T; £1([2; C(S)) and we can define the piecewise
constant approximation of 'H, denoted by 'Hn by 'Hy(t) = 'H(kT) for t E
((k - I)T, kT] and by 'Hy(t) = 'H(O) for t = O. Besides, we will still need
the piecewise affine interpolation, denoted by 'H~ff, i.e. 'H~ff is affine in time if
restricted on the interval [(k -1)T, kT] for k = 1, ... , T /T and 'H~ff (kT) = H(kT)
for k = 0, ... , T /T. Also, we will naturally assume that the initial condition qo
is admissible and even stable:
WjE Q; (20)
note that it implies, in particular, that Ep(qo) < +00. The scheme (18) together
with a suitable spatial discretization is also suitable for a numerical solution
to our problem.
Let us define a sufficiently large set P where the values of all the processes
qr (-) will certainly live; here it is natural to put
(21)
the constant C 1 can be now considered arbitrary but sufficiently large, and
will be fixed later, see (22). We will endow P by the (weak*xweak)-topology
of £':)([2; rca(S)) x H"'([2; ~L). Clearly, the set P is compact.
The following can be found in [6] or in a slightly different form in [12].
Proposition 1. Let (12), (19) and (20) hold. Let qr = (Vr,Ar) be a solution
constructed recursively from solutions to (18) at the prescribed time incre-
ments. The following a-priori estimates hold:
where Q)y(t) := Ep(qy(t)) - ('Hy(t), qr(t)) denote Gibbs' energy of the approxi-
mate trajectory.
where
612 M. Kruzik
v:= 1 Q
meas
rr
} n }S
Avx(dA) dx = 1 Q
me as }n
m(x) dx , r (26)
Lemma 1. Let (12) hold. Let q1 = (V1' A1), q2 = (V2' A2) be two solutions of
(18) in Q for some fixed 1:::::: k:::::: T/r. Then 8(q1) = 8(q2)'
Proof. The assertion A1 = A2 follows from the convexity of I from (18) in q
and the strict convexity of I in A. Note also that I is strictly convex in 'VU(id. v),
i.e., in the magnetostatic field. Denoting m1 and m2 the macroscopic magne-
tization vectors corresponding to q1 and q2, respectively, the weak formulation
of (3) gives fn~.n(m1(x) - m2(x))Xn(x) . 'Vv(x) dx = 0 for any v E H1(]Rn). As
for any w E ]Rn one can find v E H1(]Rn) such that 'Vv = w in Q we have
In m1(X) dx = In m2(X) dx and the result follows. 0
Theorem 1. Let (12) and (19) be valid. Then there are a process
q = (v, A) E Y(Q x [0, T]; S) x BV([O, T]; Ll(Q; ]RL)), 8(q(0)) = 8(q(T)) and
a net {qT< hES, 8( qT< (0)) = 8( qT< (T)) such that:
(i) limEEsAT«t) = A(t) strongly in L2(Q;]RL) for all t E [O,T],
(ii) limEEs VT< (t) = v(t) weakly* in L,:/(Q; rca(S)) for all t E [0, T],
(iii) limEEsQT«t,qT«t)) = Q(t,q) for all t E [O,T].
Moreover, A = Av a.e. on Q for every t E [0, T] and q thus obtained is a solu-
tion process according to Definition 3 and 8(q(0)) = 8(q(T)).
Sketch of proof. The proof is similar to the one of [7, Th. 3.4] or
[6, Prop. 3.13]. The point (i) relies on the weak* compactness of the set
L,:/(Q; rca(S))[O,T] due to the Tychonoff's theorem on compactness of product
topologies. 0
Theorem 2. Let (12) and (19) hold. Then there is a solution process q =
(v, A) E Y(Q x [0, iT]; S) x BV([O, iT]; Ll(Q; ]RL)) such that 8(q(t + jT)) =
8(q(t)) for any t E [0, T] and any 1 :S j :S i -1. In particular, In m(x, t) dx =
In m(x, t + jT) dx.
Proof. The process q can be constructed by the T-periodic extension of the
process whose existence was established in Theorem 1. Clearly, the extended
definitions 1 and 2 hold for such process. 0
References
1. Brown, W.F. Jr. (1966): Magnetostatic principles in ferromagnetism. Springer,
New York
2. Dal Maso, C. (1993): An introduction to r-convergence. Birkhiiuser, Boston
3. Deimling, K. (1985): Nonlinear Functional Analysis. Springer, Berlin
4. DeSimone, A. (1993): Energy minimizers for large ferromagnetic bodies. Arch.
Rat. Mech. Anal., 125,99-143
5. Hubert, A., Schiifer, R. (1998): Magnetic Domains. Springer, Berlin
6. Kruzik, M. (2003): Periodic solutions to a hysteresis model in micromagnetics.
In preparation
7. Mielke, A., Roubicek, T. (2003): A rate-independent model for inelastic behavior
of shape-memory alloys. Multiscale modeling and simulations, to appear. +
614 M. Kruzik
Summary. A new mixed finite element method for the diffusion equations on gen-
eral polygonal and polyhedral meshes is presented. The basis vector functions in
macro cells are designed by solving the local mixed finite element problems with the
lowest order Raviart-Thomas elements. Numerical results for the Poisson equation
on distorted prismatic meshes are given.
1 Introduction
This paper is a natural generalization of the authors' recent results [3] to the
general diffusion equation with mixed type boundary conditions. The paper is
organized as follows. In Section 2 we give the formulation of the problem and
describe the meshes to be used. Section 3 is the most important in the paper.
We describe an approach to designing macroelements in the space of fluxes by
solving mixed finite element problems with the lowest order Raviart-Thomas
elements [1], [4] on macrocells. A convergence result is given in Section 4.
Finally, in Section 5 we present and discuss numerical results for a particular
3D test problem.
2 Problem formulation
We consider the diffusion problem in the form of the system of the first order
differential equations
K-1ii.+ gradp= 0
(1)
divii. + cp = f
in a bounded connected polygonal (polyhedral) domain n in ]R.d, d = 2 (d = 3),
with homogeneous boundary conditions
p=O on rD,
(2)
ii.·n=O on rN.
616 Yu. Kuznetsov, S. Repin
Here rD and rN are the Dirichlet and the Neumann parts of the boundary
aD, n is the outward unit normal to aD, K = K(x) is the diffusion tensor,
K = KT > 0, C = c(x) is a nonnegative function, and f E L2(D). We assume
that r D is a closed subset of aD consisting of a finite number of segments
(polygons) in the case d = 2 (d = 3).
The weak formulation of (1), (2) reads as follows: find
U E V == {v : v E Hdiv(D), V· n = 0 on rN}, P E Q == L 2 (D)
such that
j (K-1u) . v dx - j p(\7 . v) dx = 0
n n (3)
j (\7. u)qdx + j cpqdx = j fqdx
n n n
V x Q.
for all (v, q) E
Let Dh be a partitioning of D into m nonoverlapping polygonal (polyhedral)
macro cells E k :
m
Dh = UE k, (4)
k=l
and Vh and Qh be finite element subspaces of V and Q, respectively. In this
paper we assume that the interface nl = aEk n aEI between macro cells Ek
and El is either a point, or a segment, or a simply connected polygon in the
case d = 3 (see, for instance, Fig. 1). We also assume that for any function
Ph E Qh its trace on Ek is a constant, and for any vector-function v E Vh its
trace on Ek is a piecewise affine vector-function. Moreover, we assume that for
any vector-function Vh E Vh its normal component v . nkl on rkl is a constant,
where nkl is the unit normal vector to nl directed from Ek to E l , k > t. An
example of a polygonal mesh Dh is given in Fig. 1. The arrows on the interfaces
show the normal component of the fluxes.
In the case of a triangular (d = 2) or a tetrahedral (d = 3) partitioning
of D, Vh is the lowest order Raviart-Thomas space RTo(Dh) subject to the
boundary condition on rN.
The mixed finite element approximation to (1), (2) reads as follows: find
(Uh, Ph) E Vh x Qh such that
j(K-1Uh).VdX- jPh(\7.V)dX=O
n n (5)
j (\7 . uh)qdx + j cPhqdx = j fqdx
n n n
for all (v, q) EVh X Qh. The finite element problem results in the system of
linear algebraic equations
Mixed Finite Element Method on Polygonal and Polyhedral Meshes 617
(6)
A= ( M
B -E
BT) (7)
which gives
(9)
where
Inl = J
r1
dl and lEI = J
E
dx.
Under condition (9) problem (8) has the unique solution Wi = UE,h.
Repeating the same procedure on the same mesh Eh with the boundary
conditions UE,h . fiE = Ji,j on T j , 1 :::; j :::; s, i = 2, ... , s, we get the vector-
functions Wi, W2, ... , Ws which satisfy the conditions
W = L O!iWi (10)
i=l
Mixed Finite Element Method on Polygonal and Polyhedral Meshes 619
constE (12)
where
Ifil = J dl.
r
i
w·nE ai on ri , i = T,"S,
(13)
W·fiE o on BE n rN.
We denote the designed space of finite element vector-functions w(x) for
a macroelement Ek in fh by Vk,h. The global space Vh C Hdiv(D) for the
mesh is defined by
(14)
620 Yu. KuznetsQv, S. Repin
4 Convergence
To prove the convergence result for the mixed finite element problem (5) with
the designed space Vh we assume that
Then, the convergence result follows from [3] after some minor modifica-
tions.
Theorem 1. Under the assumptions made, the finite element solution (ih, Ph)
of problem (5) converges to the solution (u, p) of problem (3) in Hdiv(D) x
L2(D), i.e.
h max diamEk.
l::;k::;m
5 Numerical results
To illustrate the proposed method by numerical results we consider the Dirich-
let boundary value problem for the Laplace equation:
6p = 0 in D,
(17)
P = g on aD
where D is a prism in ]R3 with the rectangular faces orthogonal to the (x, y)-
plane and the triangular faces parallel to the (x, y)-plane, and a function g is
the trace on aD of the test harmonic function p = (x 2 - z2) + x(y2 - z2).
Let Dh be a distorted prismatic mesh which is obtained from the reference
rectangular prismatic mesh by perturbation of vertices along the vertical mesh
lines. An example of such a distorted macro cell is given in Fig. 3. The choice of
this mesh is relevant to geophysical applications (basin modeling and reservoir
simulation) where distorted prismatic meshes are used for discretizations of
diffusion type equations in layered formations.
Mixed Finite Element Method on Polygonal and Polyhedral Meshes 621
Fig. 3. Reference (dashed lines) and distorted (solid lines) prismatic macro cell
10-"
-"
?
§
!
/
/
/
/
10-3 /
10-;O':;_,---------w1O':;-_'----~--~--.J,0'
mesh slep size h
10' r---------_----------.--,
Normal oomponenl of fluxes: po(x, y, z):: {"} - 22) + x(i - z~, 0.=0.3
10-2L-_ _ _ _ _ _ _~~_--------~
10 2 10-' IOQ
mesh step size h
Four discretization methods have been utilized. The numerical results are
shown in Fig. 4 and Fig. 5. The first one is the nonconforming mixed finite
element method with the lowest order Raviart-Thomas elements. In fact, the
finite elements for the space of fluxes are nonconforming only on triangular
interfaces between distorted prisms where the continuity conditions for the
normal component of fluxes are imposed in the center of mass of the trian-
gles. On the quadrilateral interfaces which are orthogonal to (x, y)-coordinate
plane, the normal component of the fluxes are constant.
The second method which is called "Cubature" in the figures is a mod-
ification of the mimetic discretization [5] adopted to the prismatic meshes.
The third method is called " Interpolation" . This is a conforming finite ele-
ment method based on a modification of the Piola transformation adopted to
prismatic meshes.
Finally, the method proposed in this paper is called "Const Div". The
numerical results are given for the error functions in the discrete maximum
norms. For the solution function p (called "pressure") we measure the errors
for the mean values over the macrocells, and for the fluxes we measure the
mean values of the normal components over the interfaces.
The numerical results show that all the methods provide the same order
of accuracy of approximation to the solution of the differential problem (17).
We recall that the method proposed in this paper can be used on arbitrary
polygonal and polyhedral meshes.
6 Acknowledgments
The work was partly supported by a grant from Los Alamos Computational
Sciences Institute (LACSI). The authors are grateful to O. Boyarkin for pro-
viding us with the results of numerical experiments.
References
1. Brezzi, F., Fortin, M. (1991): Mixed and hybrid finite element methods. Springer,
Berlin
2. Kuznetsov, Yu. (2003): Spectrally equivalent preconditioners for mixed hybrid
discretizations of diffusion equations on distorted meshes. J. Numer. Math., v. 11,
No. 1,61-74
3. Kuznetsov, Yu., Repin, S. (2003): New mixed finite element method on polygonal
and polyhedral meshes. Russian J. Numer. Anal. and Math. Modelling, v. 18,
No.3, 261-278
4. Roberts, J.E., Thomas, J.-M. (1991): Mixed and hybrid methods. In: Ciarlet, P.,
Lions, J.-L. (eds) Handbook of Numerical Analysis II. Finite element methods,
523-639. Elsevier/North Holland, Amsterdam
5. Shashkov, M., Steinberg, S. (1996): Solving diffusion equations with rough coef-
ficients in rough grids. J. Compo Phys., 129, 383-405
Semi-discrete Schemes for Hamilton-J aco hi
Equations on Unstructured Grids
1 Introduction
Abgrall [1]. The numerical fluxes of Abgrall were combined with WENO recon-
structions on triangular meshes by Hu and Shu in [20]. Another finite-volume
scheme on unstructured grids was proposed by Kossioris et al. in [9].
While upwind schemes require solving Riemann problems (or at least ap-
proximating their solutions) on the interfaces between two cells, Godunov-type
central schemes utilize evolution points that are located away from the discon-
tinuous interfaces in order to avoid Riemann solvers altogether. Fully-discrete
central schemes for HJ equations were first introduced by Lin and Tadmor
in [12, 13], and further improved by Bryson and Levy in [2]. Fully-discrete
schemes of order greater than two were derived in [3] using central-WENO
reconstructions. Semi-discrete formulations of central schemes enjoy reduced
numerical dissipation while keeping track over the local speeds of propagation
of information that is propagating from the discontinuous interfaces. Second-
order semi-discrete central schemes for HJ equations were derived by Kurganov
and Tadmor in [11]. A more accurate estimate of the local speed of propagation
was then utilized to reduce the numerical dissipation in [10]. The numerical flux
of Kurganov, Noelle and Petrova, was combined with the WENO reconstruc-
tions of Jiang and Peng to obtain fifth-order, semi-discrete central schemes for
multi-dimensional HJ equations in [4].
In this paper we extend our previous works on semi-discrete central schemes
for multi-dimensional HJ equations to unstructured grids. Our derivation re-
sults with a new numerical flux that takes into account information regarding
the local speeds of propagation of information from the discontinuous inter-
faces. Similarly to any other central scheme, the evolution points are taken
away from the interfaces in order to avoid Riemann solvers. This numerical
Hamiltonian should be viewed as a central version of the numerical Hamilto-
nian of Abgrall [1]. It can also be viewed as a generalization of the numerical
Hamiltonians of Kurganov et al. [10, 11] that were derived for Cartesian grids.
When we assume that the "unstructured" grid is Cartesian (which still is an
admissible grid in our formulation) the scheme we obtain is different of the
schemes obtained in [11, 10]. This difference results from a different averaging
procedure we use with the values obtained at the different evolution points,
and is reflected by a different coefficient in front of the dissipative term.
¢t + H(\l¢) = 0, (1)
augmented with initial values ¢(x, t = 0) = ¢o(x). We assume that T is a
given triangulation of Q. The grid points are denoted by Xa. For every grid
point there are ma angular sectors T{' that are ordered counterclockwise. For
simplicity we will drop the a index from the triangles, and use the notation
Semi-discrete Schemes for Hamilton-Jacobi Equations 625
Fig. 1. Grid point Xu and its angular sectors. Each angular sector 1 has an associated
angle (h
Fig. 2. Evolution point, x~, derived from the maximal local speeds of propagation
into Tl, at and at
We then define d1 to be d1/ ,dt, a quantity that does not explicitly depend on
,dt.
The interpolant cp(x, tn) is evolved to the next time step t n+1 at the points
x~, which are located away from the propagating discontinuities (assuming
that the time step ,dt is sufficiently small). From (1), this is given to first
order in time by the Taylor expansion
'P(X~, t n+1) = cp(x~, tn) - ,dtH("Vcp(x~, tn)) + O(Llt2), (4)
where the approximation of the gradient "V 'P at x~, "V cp( x~, t n ), is obtained
from the reconstruction.
The next step is to combine the values of 'P at the different evolution
points around X a , x~, into one value 'P~+1. This is done by writing a convex
combination with weights 81 ::::: 0 that are yet to be determined:
n+1 _ 2:~181'P(x~, t n+ 1)
'Pa - ",rna 8 (5)
61=1 1
Using (4), we may express (5) as
n+1 _ 2:~181 [cp(x~, tn) - ,dtH("Vcp(x~, tn)]
'Po; - ",rna 8 (6)
61=1 1
If we now define PI to be the unit vector in the direction of x~ from Xo;, we
can use a Taylor expansion in space
cp(x~, tn) = CP(Xa, tn) + d1Pl . "Vcp(x~, tn) + O(Llt 2).
Here by "Vcp(x~, tn) we refer to the value of the gradient at Xa that is associated
with the reconstruction in sector Tl at x~. We may therefore rewrite (6) as the
fully discrete scheme
,d rna
'Pan+1 -n +
= 'Pa " 1 [d'lPl·
2:~1t 81 'tl'8 D
Vip
-( 1 n)
xa,t - H(D -( 1 n))] .
vip xa,t
(7)
where for each l, 'Vcp~(t) denotes limdt--+O 'Vcp(x~,tn). All that remains is to
determine the coefficients Sl in (8). These coefficients will be obtained through
a consistency condition. The consistency of the scheme implies that if the
value of the gradient is identical in every sector that surrounds Xa then the
numerical Hamiltonian should reduce to the differential Hamiltonian. Hence
we are seeking for coefficients Sl such that
ma
Ls1d1P1 =0. (9)
1=1
Such coefficients indeed exist, and we can use, e.g., the results of Abgrall in [1]
to find them. The observation that was made there was that if /-l1+1/2 denoted
a unit vector in a direction that is aligned with the interface between the
sectors Tz and TI +1, and if 01 < 7r (which is the case with a triangulation, e.g.),
then ma
L Il+!/-ll+! = 0, (10)
1=1
provided that
Il+! = E [tan (~) + tan (0 1;1) ], (11)
for any E > O. In order to incorporate (10)-(11) into our framework, we split
each angle 01 into two parts ot
that are defined as
± . a1±
01 = arCSIn -,-, (12)
d1
(see figure 3).
The consistency condition (9) is then satisfied if the weights Sl are defined
as
628 D. Levy, S. Nayak
where 8t are given by (12). In our case, the coefficient E will anyhow cancel
out in the semi-discrete formulation (8), so it can be omitted from (13).
To summarize, if we define
(14)
Remarks.
1. A simple version of the scheme can be obtained by assuming that all the
speeds of propagation are identical. In this case, the local speeds are re-
placed by their maximum, i.e., a = maxI {at, al }. In this case
, a
d1 = --:-:--..,..--,-
sin(8 1/2) ,
and the semi-discrete scheme (14) can be written in the simpler form
dd 'Pa(t) = _1
t
~
[-!!--oPI'
ma 1=1 sm '2
\l<p~(t) - H(\l<P~(t))]. (16)
!!.-
dt 'Pa
(t) = ~ ~f3
1r ~ 1+"2 Pl +"2
1(\l<P~(t) + \l<p~+1(t))
l'
2
_ H (l:;:;21r81\l<P~).
1=1
(17)
Here Pl+1/2 is the unit vector in the direction of the interface between the
sectors Tl and Tl+1, and f31+1/2 = tan(8z/2)+tan(81+1/2). The derivation of
(17) involved evolution points that were located on the interfaces between
the sectors. This resulted with the form of the dissipative term in (17) that
contains averages of gradients in adjacent sectors. Also, the scheme (17)
involves a Hamiltonian that is evaluated at the average of the derivatives
that are computed in different sectors (with weights that are proportional
Semi-discrete Schemes for Hamilton-Jacobi Equations 629
to the angles). This term was postulated to be in this form, and could have
taken different forms. In our case (14), this term is replaced by an average
over the Hamiltonian that is evaluated in different sectors. In our case, the
form of this term is dictated by the derivation of the scheme.
3. We would like to emphasize that the scheme (14) does not require Riemann
solvers. This is due to the scheme's derivation, which places the evolution
points away from the boundaries of the angular sectors around every point
Xa·
4. If the grid is a Cartesian grid with equal spacing in the x- and y-directions,
the number of angular sectors at each point is ma = 4, and sin( /2) = e
sinCrr / 4) = v'2/2. In the simple case where all velocities are taken to be
identical in both direction, the scheme (16) becomes
_ a (+
d
dt ipa (t) -"2 ipx - ipx_ + ipy+ - ipy_) (18)
1
-"4 [H(ip;,ipt) +H(ip;;;,ipt) +H(ip;,ip;) +H(ip;;;,ip;)] ,
with the obvious notations, e.g. H( ip;, ipt) is the Hamiltonian evaluated
at the gradient at Xa that is taken from the first quadrant. The scheme
(18) is identical to the consistent and monotone semi-discrete scheme that
was derived for Cartesian grids in [4, 11].
5. The order of accuracy of the scheme (14) is determined by the order of ac-
curacy of the reconstruction and the order of the ODE solver. Other than
that, the formulation (14) is independent of the order of accuracy of the
scheme. The scheme is a Godunov-type scheme with a global reconstruc-
tion that is evolved in time in evolution points that are located away from
the interfaces. In practice, the final semi-discrete scheme (14) requires only
the values of the gradient at the grid points Xa that are computed in the
different angular sectors around Xa. Hence, all that one needs from the
reconstruction is the values of these gradients. It is therefore possible to
use the same reconstructions that were developed for upwind schemes for
HJ equations on triangular meshes. For examples, a high-order (third- or
fourth-order) weighted essentially non-oscillatory (WENO) reconstruction
for HJ equations on triangular grids was derived by Zhang and Shu [20].
It can be incorporated as it is into the present framework.
3 Conclusion
References
1. Abgrall, R. (1996): Numerical discretization of the first-order Hamilton-Jacobi
equation on triangular meshes. Commun. Pure Appl. Math., 49, 1339-1373.
2. Bryson, S., Levy, D.: Central schemes for multi-dimensional Hamilton-Jacobi
equations. SIAM J. Sci. Comput. (to appear).
3. Bryson, S., Levy, D. (2003): High-Order Central WENO Schemes for Multi-
dimensional Hamilton-Jacobi Equations. SIAM J. Numer. Anal., 41, 1339-1369.
4. Bryson, S., Levy, D. (2003): High-order semi-discrete central-upwind schemes for
multi-dimensional Hamilton-Jacobi equations. J. Compo Phys., 189, 63-87.
5. Crandall, M.G., Ishii, H., Lions, P.-L. (1992): User's guide to viscosity solutions
of second order partial differential equations. Bull. Amer. Math. Soc., 27, 1-67.
6. Crandall, M.G., Lions, P.-L. (1983): Viscosity solutions of Hamilton-Jacobi equa-
tions. Trans. Amer. Math. Soc., 277, 1-42.
7. Jiang, G.-S., Peng, D. (2000): Weighted ENO schemes for Hamilton-Jacobi equa-
tions. SIAM J. Sci. Comp., 21, 2126-2143.
8. Jiang G.-S., Shu C.-W. (1996): Efficient implementation of weighted ENO
schemes. J. Compo Phys., 126, 202-228.
9. Kossioris G., Makridakis Ch., Souganidis P.E. (1999): Finite volume schemes for
Hamilton-Jacobi equations. Numer. Math., 83,427-442.
10. Kurganov, A., Noelle S., Petrova G. (2001): Semi-discrete central-upwind
schemes for hyperbolic conservation laws and Hamilton-Jacobi equations. SIAM
J. Sci. Comp., 23, 707-740.
11. Kurganov, A., Tadmor, E. (2000): New high-resolution semi-discrete central
schemes for Hamilton-Jacobi equations. J. Compo Phys., 160, 720-742.
12. Lin, C.-T., Tadmor, E. (2000): L1-stability and error estimates for approximate
Hamilton-Jacobi solutions. Numer. Math., 87, 701-735.
13. Lin, C.-T., Tadmor, E. (2000): High-resolution non-oscillatory central schemes
for approximate Hamilton-Jacobi equations. SIAM J. Sci. Comp., 21, 2163-2186.
14. Lions, P.L. (1982): Generalized solutions of Hamilton-Jacobi equations. Pitman,
London
15. Lions, P.L., Souganidis, P.E. (1995): Convergence ofMUSCL and filtered schemes
for scalar conservation laws and Hamilton-Jacobi equations. Numer. Math., 69,
441-470.
16. Liu, X.-D., Osher, S., Chan, T. (1994): Weighted essentially non-oscillatory
schemes. J. Compo Phys., 115, 200-212.
17. Osher, S., Sethian, J. (1988): Fronts propagating with curvature dependent
speed: algorithms based on Hamilton-Jacobi formulations. J. Compo Phys., 79,
12-49.
18. Osher, S., Shu, C.-W. (1991): High-order essentially nonoscillatory schemes for
Hamilton-Jacobi equations. SIAM J. Numer. Anal., 28, 907-922.
19. Souganidis, P.E. (1985): Approximation schemes for viscosity solutions of
Hamilton-Jacobi equations. J. Diff. Equations, 59, 1-43.
20. Zhang, Y.-T., Shu, C.-W. (2002): High-order WENO schemes for Hamilton-
Jacobi equations on triangular meshes. Siam J. Sci. Comput., 24, 1005-1030.
Numerical Simulation of Dislocation Dynamics
Summary. The aim of this contribution is to present the current state of our re-
search in the field of numerical simulation of dislocations moving in crystalline materi-
als. The simulation is based on recent theory treating dislocation curves and dipolar
loops interacting by means of forces of elastic nature and hindered by the lattice
friction. The motion and interaction of one dislocation curve and one dipolar loop
placed in 3D space is considered. Equations of motion for a parametrically described
dislocation curve are discretized by the flowing finite volume method in space. The
interaction force is computed for each dipolar loop and along the discretized curve.
The resulting system of ordinary differential equations is solved by a higher order
time solver.
mation, the rest is stored in the crystal. The deformed crystals supersaturated
with dislocations tend to decrease the internal energy by mutual screening
of their elastic fields. If dislocations possess a sufficient maneuverability pro-
vided by easy cross-slip (solids with wavy slip) the leading mechanism is an
individual screening. The dislocations are stored in the form of dipoles which
are transformed to prismatic dislocation dipolar loops of the prevailing edge
character or such loops are directly formed (the experimental evidence is sum-
marized in [9]).
The glide dislocations and the dislocation loops have much different char-
acteristic length scales and mobile properties:
- While the segments of glide dislocations extend over distances of microme-
ters, the size of the prismatic dipolar loops is of the order of 10 nm.
- The glide dislocations are moved by the shear stress resolved in the slip
plane, while the loops are drifted by stress gradients and/or swept by the
glide dislocations. The loops being prismatic they can move along the direc-
tion parallel to the direction of their Burgers vector only.
- During deformation, the glide dislocations become curved. The local curva-
ture of the glide dislocations seems to be one of the leading factors control-
ling the pattering [11, 12]. The loops can be approximately treated as rigid
objects.
Due to the above mentioned complexity the formation of dislocation struc-
tures as a consequence of the interactions among dislocations is still an open
problem. In this paper we will be concerned with a particular case: a disloca-
tion curve interacting with a dipolar loop.
-F
(a) Force interaction
Q
G::.'
• x
: 'if. "
(b) DL geometry
Fig. 1. Dislocation dynamics problem geometry: (a) Dislocation curve and dipolar
loop interaction; (b) Dipolar loop geometry
We denote the dipolar loop types and stable configurations by Vi, V2 for
the vacancy loop, and h, 12 for the interstitial loop, respectively (see Fig. 2).
In the mathematical model we represent the dipolar loop as a small rectangle
(t /)
y y
. .
. ..'. x
. .. "
'.
~
x
Fig. 2. Types and stable configurations of a dipolar loop. Longer sides of dipolar
loop are parallel to the z-axis and lie in different layers of the atomic lattice.
with two longer sides parallel to the z-axis and two shorter sides parallel either
to [1,1,0] or [1, -1,0] depending on the type ofloop. The dimensions of dipolar
loop are 2l and 2y2h, respectively, see Fig. lb.
Stress Field of Dipolar Loop Each type of dipolar loop produces a stress
field. The formula for this field will be needed in the numerical simulation of
the dislocation dynamics. In this work we use the stress field (Jij presented by
Kroupa et al [6, 7] (using Einstein's symbolic rule for sums):
634 V. Minarik et al.
(2)
where we denote:
Ii i-th component of the interaction force per the unit length of
the dislocation line
Cijk Levi-Civita symbol
ajm components of the stress field tensor at the dislocation position
bm components of the Burgers vector
Sk components of unit vector s which has the direction of the dis-
location line
Bv = r;,+F (3)
Numerical Simulation of Dislocation Dynamics 635
(5)
where O"xy = 0"12, and Sx, Sz are the components of the dislocation curve's
tangential vector, which can be also written as
(6)
Thus, the term F = O"extbcurve + O"xybcurve covers the stress of the dipolar
loop exerted on the dislocation curve, as well as any external stresses which
may the material be exposed to. To be more precise, in order to obtain force
vector acting at given position of the dislocation curve, one needs to multiply
F and the dislocation curve's normal vector X; at that position. We also
r
explicitly write the dependence of F on the dislocation curve t because the
curve position is required for the evaluation of its relative position to the
dipolar loop. The obtained relative position is then used in the evaluation
of O"xy.
The stress O"xy is given by (1), but we can simplify this formula for our
specific situation - Burgers vector has only one non-zero component and the
dipolar loop is a rectangle which has one of the two possible configurations.
Under the assumption that the dipolar loop parameter h is very small, we can
use Taylor expansion in (1) and integrate it to obtain an algebraic formula for
the stress:
636 V. Minarik et al.
where
With the upper sign in (7) we get the stress formula for the dipolar loops VI
and II, while with the lower sign we get the formula for V2 and h dipolar
loops.
In order to obtain the equation governing the motion of the dipolar loop we
have to sum all the stress contributions of the dislocation curve. It is enough
to consider only the contributions in the x-axis direction because it is the only
direction the dipolar loop is allowed to glide in.
The stress contribution of the dislocation curve can be obtained according
to the action-reaction principle by simple reversing the sign of Ix in (5) and
integrating along the dislocation curve:
F~ = J
r,
O'xybcurvenxds , (9)
where x(t) is the x-axis position of the dipolar loop, P = 4(l + V2h) is the
perimeter of the dipolar loop, and
F~ - Fo if F~ > Fo
Fx,total(rt,X(t)) =
{
0 c ~f IFf I < Fo (11)
Fx + Fo If Fx < - Fo .
The complete dislocation dynamics problem for one dislocation curve and one
dipolar loop then follows when we put (4) and (10) together with initial and
boundary conditions.
Numerical Simulation of Dislocation Dynamics 637
l Vi
aX
B-;:;-ds
ut
= 1 aX + 1
2
~ds
Vi uS Vi
F (aX)1-
-;:;-
uS
ds. (12)
(13)
where
(14)
and Fi is a constant approximation of F in dual volume Vi, Fi = CTxy(Ri)bcurve,
where Ri = Xi - [x(t), y, z] is the relative positional vector of Xi and the
dipolar loop center. If we replace the terms on the right-hand side by finite
differences and averaged values, respectively, we end up with the system of
ordinary differential equations
In discretization of the governing equation (9) for the dipolar loop we sum
contributions of every curve segment to obtain
M-l
F~ = L CTxy (x;+~ - x(t), -y, x:+~ - z) bcurve (Xt+l - Xn, (16)
i=O
where [x(t), y, z] is the center ofthe dipolar loop at time t. Next, we use formula
(11) applied to discrete F~ defined above and get
638 V. Minarik et al.
(17)
The complete discrete problem consists of (15) and (17) with accompanying
initial and boundary conditions.
References
1. Angenent, S.B., Curtin, M.E. (1989): Multiphase Thermomechanics with an In-
terfacial Structure 2. Evolution of an Isothermal Interface. Arch. Rat. Mech.
Anal., 108, 323-391
2. Benes, M., Mikula, K. (1998): Simulation of Anisotropic Motion by Mean Curva-
ture - Comparison of Phase Field and Sharp Interface Approaches. Acta Math.
Univ. Comenianae, Vol. LXVII, 1, 17-42
3. Dziuk, C. (1994): Convergence of a Semi Discrete Scheme for the Curve Short-
ening Flow. Mathematical Models and Methods in Applied Sciences, 4, 589-606
4. Cage, M., Hamilton, R.S. (1986): The Heat Equation Shrinking Convex Plane
Curves. J. Diff. Ceom., 23, 69-96
Numerical Simulation of Dislocation Dynamics 639
50 50 50
0 0 0
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
~ ~ ~
'i' C)I C\J
'" 'i' C)I C\J
'" 'i' C)I C\J
'"
T =45.0480 T =55.0680 T =63.0840
200 200 200
50 50 50
0 0 0
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
~ ~
'i' C)I
Fig. 4. Dipolar loop
C\J
'" 'i'
oscillations. Dipolar
loop
C)I C\J
'"
of type VI
starts to glide to the
'i' C)I C\J
'"
left
of the dislocation curve (timelevels T = 15.02, T = 25.022, T = 35.028). Then it
reverses as the attractive force of the dislocation curve gains the control over the
system for some time (T = 45.048, T = 55.068). Second reversing occurs before
T = 63.084.
60 60 60
40 40 40
20 20 20
0 0 0
60 60 60
40 40 40
20 20 20 "-----.,
"--, '----..., ~ ~
0 0 0
Fig. 5. Dipolar loop swept by the curve; on the other hand, the curve is distorted
by the stress field of the loop. In this test there were used: J.L = 80 CPa, 1/ = 0.33,
B = 10- 4 Pa s, b = 0.707 nm, I = 35 nm, h = v'2 nm, Fa = 4 MPa m, applied stress
aa = -1.155 MPa. Initial position of dipolar loop was [0,40, -30]. The subsequent
stages are shown for increasing time T.
1 Introduction
dUi
m·-
t dt = L k· ·(u· - u·) + oU·
tJ J t t" Oi = L kij (1)
#i j
Pi = Lfij, (4)
#i
is to be constructed in a nonlinear way so as to remove the excessive artificial
diffusion.
After an implicit time discretization we obtain a nonlinear algebraic system
Here, A is a 'suitable' pre conditioner and r(m) = b(m+l) - Au(m) is the defect
vector for the m-th iteration cycle. The latter incorporates the constant right-
hand side stemming from the low-order scheme plus compensating anti diffusion
Varying the correction factors O!ij between zero and unity, one can blend the
high-order method with the concomitant low-order one. The latter should be
used in the vicinity of steep gradients where spurious oscillations are likely
to arise. The construction of the solution-dependent correction factors and
of the fully discretized antidiffusive fluxes fi~m) is elucidated in [5]. As an
alternative to (7), an iterative limiting strategy was proposed for implicit FEM-
FCT schemes operated at large time steps. Roughly speaking, the amount
of previously accepted anti diffusion is taken into account so that only the
rejected portion of the antidiffusive flux needs to be limited at subsequent
defect correction steps.
3 Euler equations
Compressible flows are governed by the Euler equations which represent a sys-
tem of conservation laws for the mass, momentum and energy of an inviscid
fluid. These hyperbolic PDEs are typically written in divergence form
aU 3 apd
at + V· F = 0, where V·F= La'
d=l Xd
(8)
The vector of conservative variables U and the triple of fluxes F = (PI, p2, p3)
for each direction of the Cartesian coordinate system are defined as follows
F = [pV&t:+PIj.
pHv
(9)
aU
at + A . VU = 0, where (10)
644 M. Moller et al.
(11)
Here, Me denotes the block-diagonal consistent mass matrix for the coupled
system and K is a discrete counterpart of the operator -A· \7 for the quasi-
linear formulation (10). Let the entries of the mass matrix and the vector of
coefficients coming from the discretization of space derivatives be defined as
follows
(12)
As long as the mesh is fixed, the coefficients Cij remain constant and thus the
operator K can be assembled efficiently without resorting to a costly numerical
integration.
Recall that basis functions sum to unity, so that the sum of their derivatives
vanishes. Hence, the coefficients Cij satisfy Cii = - L:#i Cij and the right-hand
side of the five coupled equations for node i is given by
(13)
In his pioneering work on approximate Riemann solvers [9], Roe showed that
the differences between the components of F and U are related by F j - Fi =
, . . ' '1 '2 '3
Aij(Uj - Ui)' where the tnple of matnces Aij = (Aij,Aij,Aij) corresponds
to the Jacobian tensor A evaluated for the special set of density-averaged
variables
This enables us to express the nodal value (KU)i in terms of the conservative
variables
The dot product can be interpreted as a 'projection' of the triple Aij onto the
numerical edge ij. For our purposes, it is expedient to introduce the splitting
Cij . Aij = -(Aij + Bij), where the two components of the cumulative Roe
matrix are defined by [5]
Implicit FEM-FCT algorithm for compressible flows 645
(16)
b 'J.. -__ Cij + Cji (17)
2
A similar decomposition can be performed for the contribution of the edge ij
to (Ku)j
Integration by parts reveals that in the interior of the domain aij = -Cij while
b ij = 0 [5], so that only the skew-symmetric part Aij needs to be evaluated
for interior edges. The symmetric part Bij only applies to the cumulative Roe
matrices for boundary edges. According to (15)-(18), the contribution of the
edge ij to the term Ku reads
Together with the fact that the coefficients Cij remain constant and thus can
be assembled and stored once and for all during the initialization process,
(19)-(20) suggest an efficient edge-based algorithm for the matrix assembly.
The underlying data structure can be generated from the sparsity pattern of
the finite element matrix and contains entries for all pairs of nodes whose
basis functions have overlapping supports [5]. In contrast to the scalar case
this connectivity exists not only between basis functions for different nodes
but also between those for different variables. Hence, each coefficient of the
discrete operator is given by a square matrix of dimension equal to the number
of variables.
It can be readily inferred from (19)-(20) that the contribution of the nu-
merical edge ij to the global matrix K E jR5Nx5N is given by
(21)
These local Jacobians are evaluated edge-by-edge and their entries K~j, k, l =
1, ... ,5 are scattered to the corresponding positions in the 25 blocks K kl E
jRNxN [5].
5 Artificial viscosity
where (22)
F··
'1 = - '1 dt +D'
(M'~ '1 -B") (u· -U)
'J J ' ,
(23)
(24)
Let us 'project' the density-averaged velocity Vij onto the numerical edge ij
and define the local speed of sound as follows
Here, laij I denotes the Euclidean norm of the coefficient vector aij. As a con-
sequence, the diagonal matrix of eigenvalues can be readily computed as
(26)
In [5] we gave a detailed description of how to derive a generalization of Roe's
approximate Riemann solver from (24) by elimination of negative eigenvalues.
A much cheaper alternative is to add scalar dissipation proportional to
the spectral radius of the Roe matrix d ij = laij I(IVij I + Cij) [5]. The result-
ing artificial viscosity operator Dij = dijl, which in fact is the same for all
components, needs to be applied only to the five diagonal blocks of the finite
element matrix. Numerical examples demonstrate, that in the framework of
flux correction the final solution even benefits from this slightly over diffusive
low-order scheme because of an improvement in the phase accuracy [4], so that
the application of a costly Riemann solver does not payoff.
Implicit FEM-FCT algorithm for compressible flows 647
6 Defect correction
After an implicit time discretization we obtain a nonlinear algebraic system
similar to (5) which can also be solved by the defect correction scheme
(27)
In a practical implementation the 'inversion' of A is performed by applying
some inner iteration to solve the linear subproblem for the solution increment
(an improvement of the residual by 1-2 digits suffices) and update the last
iterate thereafter
u(m+l) = u(m) + L1u(m) , (28)
The matrix in the left-hand side of this linear system can be replaced by
a (block-diagonal) pre conditioner c(m), so as to decouple the discretized Euler
equations [5]. As an alternative we can apply the (preconditioned) BiCGSTAB
method directly to the coupled system (28).
Let us split the low-order operator into its diagonal, sub diagonal and su-
perdiagonal parts A (m) = w(m) + j(m) + E(m). In [5], a block-Jacobi precon-
ditioner c(m) = J(m) was suggested for the defect correction scheme, so that
only the five diagonal blocks need to be assembled and stored
and (29)
As a consequence, the linear system (28) resolves into a sequence of scalar
subproblems which can be solved separately or at best in parallel. For this
purpose an iterative method, e.g. (preconditioned) BiCGSTAB or geometric
multigrid, is applied to
j (m) (m) (m)
k
A
£..lU k = rk , k = 1, ... ,5 (30)
Uk
(m+l)
= (m)
Uk
+ £..lU k(m) ,
A U(O) _ Un
k - k·
However, this segregated solution approach disqualifies for larger time steps
since severe convergence problems of the outer iteration can be observed.
Longing for the full potential of the iterative limiter and the unconditional
positivity of fully implicit time-stepping we have been testing some coupled
solution strategies for (28) by means of a preconditioned BiCGSTAB method.
In this case, additional blocks of the low-order operator may need to be as-
sembled and stored or there should be another (direct) way to carry out the
matrix-vector multiplications for updating the residual.
Let the pre conditioner for the BiCGSTAB solver be given by w(m) + j(m),
where
7 Numerical examples
Let us illustrate the potential of the implicit FEM-FCT algorithm by con-
sidering a steady two-dimensional supersonic flow over a wedge. Here, the
free-stream Mach number is M = 2.5 and the deflection angle is () = 15°. The
results presented at the top of Figure 1 are computed on a mesh of 128 x 128
bilinear elements by the low-order scheme (left) and the implicit FEM-FCT
algorithm (right), respectively. It can be readily seen that the shock wave is
unacceptably smeared by the low-order method. Nevertheless, both the up-
stream and downstream Mach numbers are predicted correctly. The iterative
flux limiter resolves the shock very precisely within as few as 3-4 elements.
the grid points clustered in the vicinity of the shock wave, see Figure 1 (bottom
left). After two steps of global refinement this gives the computational mesh
of approximately 10,000 vertices. The resulting numerical solution (bottom
right) exhibits superb accuracy and remains absolutely free of oscillations.
Numerical results for a variety of standard gas dynamic test cases encom-
passing both transient and stationary flows are presented in [5]. Moreover, an
in-depth investigation of scalar problems can be found in the same publication.
8 Conclusions
To our knowledge, most of the finite element schemes for solving the Eu-
ler equations on unstructured grids are explicit and, consequently, subject to
an restrictive CFL condition. In this paper, an implicit high-resolution finite
element scheme for hyperbolic systems was presented making use of the flux-
corrected transport paradigm. The underlying low-order operator was con-
structed by applying scalar artificial viscosity proportional to the spectral ra-
dius of the cumulative Roe matrix for each edge of the sparsity graph. An
efficient edge-based approach to matrix assembly was proposed. The design
of suitable preconditioners for both segregated and coupled solution proce-
dures was addressed. The performance of the new FEM-FCT algorithm was
illustrated for a steady supersonic flow without and with adaptive mesh re-
finement. The development of robust and efficient iterative solvers for implicit
FEM-FCT schemes including FAS-FMG multigrid [6]'[8] and an analog of the
local MPSC smoother for the incompressible Navier-Stokes equations [10] will
be addressed in forthcoming publications.
References
1. J. P. Boris and D. L. Book, Flux-corrected transport. I. SHASTA, A fluid trans-
port algorithm that works. J. Comput. Phys. 11 (1973) 38-69.
2. C. A. J. Fletcher, The group finite element formulation. Comput. Methods Appl.
Mech. Engrg. 37 (1983) 225-243.
3. D. Kuzmin and S. Turek, Flux correction tools for finite elements. J. Comput.
Phys. 175 (2002) 525-558.
4. D. Kuzmin, M. Moller and S. Turek, Multidimensional FEM-FCT schemes for
arbitrary time-stepping. Int. J. Numer. Meth. Fluids 42 (2003) 265-295.
5. D. Kuzmin, M. Moller and S. Turek, High-resolution FEM-FCT schemes for mul-
tidimensional conservation laws. Technical report No. 231, University of Dort-
mund, 2003, Submitted to: Comput. Methods Appl. Mech. Engrg.
6. P. W. Hemker and B. Koren, Defect correction and nonlinear multigrid for steady
Euler equations. In: W.G. Habashi and M.M. Hafez (ed.). Computational fluid
dynamics techniques. London: Gordon and Breach Publishers, 1995, 699-718.
7. R. Lohner, K. Morgan, J. Peraire and M. Vahdati, Finite element flux-corrected
transport (FEM-FCT) for the Euler and Navier-Stokes equations. Int. J. Numer.
Meth. Fluids 7 (1987) 1093-1109.
650 M. Moller et al.
8. J.F. Lynn, Multigrid Solution of the Euler Equations with Local Preconditioning.
PhD thesis, University of Michigan, 1995.
9. P. L. Roe, Approximate Riemann solvers, parameter vectors and difference
schemes. J. Comput. Phys. 43 (1981) 357-372.
10. S. Turek, Efficient Solvers for Incompressible Flow Problems: An Algorithmic
and Comp utational Approach, LNCSE 2, Springer, 1999.
11. S. T. Zalesak, Fully multidimensional flux-corrected transport algorithms for flu-
ids. J. Comput. Phys. 31 (1979) 335-362.
A Singular Limit Method for the Stefan
Problems
1 Introduction
Wt = Ll(w+) in QT,
(SP) { w+ = A on an x (0, T),
w(x,O) = wo(x) for x En,
2 Our scheme
and 1im (t)z denotes the solution of the following heat equation
Ut = Llu in D x (t m , tm+l],
{ u=A on aD x (tm, t m+1], (3)
u(x, t m ) = z(x) for x E D.
Most of computational cost of Approximation Scheme 1 is spent on the
heat equation (3), and the first advantage '1. Our numerical method has low
computational cost' shown in Section 1 follows. If we use a finite element
method or finite volume method to solve (3), one can say '2. Our numerical
method may be used on arbitrary geometries on D'. Furthermore, we have '3.
There is no artificial parameters' as shown the above.
In the following we explain the basic idea of our scheme (see also [11]).
!
Eymard et al. [4] introduce the following reaction-diffusion system.
Proposition 1 (Eymard, Hilhorst, van der Hout and Peletier [4]). Let
us assume
A E Wi,l(QT) n C(QT),
{ uo(x) = A(x, 0) for xED,
o ::; Vo ::; K in D,
for some positive constant K. Then (RDk) has a unique weak solution
(U(k) , V(k)) E Wi,l(QT) x C O,l([O, T]; Loo(D)) for every k > 0, and
as k ----+ 00,
in D x (tm, t m+1],
in D x (tm' t m+l],
(4)
for xED,
for xED.
{
um+l(x) = qm(x, t m+1),
V m+1(x) = vm(x, tm+d.
We are now in the position to construct an approximation scheme to Prob-
lem (SP). Let e = k(t - t m ). Then (4) becomes
{
(Tf!' = - Um vm in D x (0, kT],
Vr = _umvm in D x (0, kT].
Here we use the facts that (U m - vm)o = 0, tJm(x, tm+d 2':: and Vm(x) 2':: 0.
From the above formal calculations, we obtain Approximation Scheme 1.
°
3 Numerical results
In this section, some numerical results obtained by our scheme are shown. We
deal with a one-dimensional case with a known exact solution and a three-
dimensional case. In the former, we show accuracy of our numerical interfaces
by comparing with the exact ones. In the latter, it is demonstrated that the
moving boundaries can be track even in high-dimensional problems.
°
Let s(t) be the position of the interface at time t. We deal with a simple one-
dimensional problem with s(o) = and u(O, t) = C for t 2':: 0, where C is
a positive constant (see [14]). In this case, it is already known that
2ae,,21" e- Z2 dz = ~. (6)
Let us compare our numerical interfaces with (5) when a = >.. = 1, [l = (0,2)
and T = 1. We compute a numerical approximation to the integral (6) using
Simpson's rule, then we employ 4.0601569 as an approximation to C. Fixed
uniform grids in space and time are adopted, that is, the spatial mesh size
is Ox = 21M and the time mesh size is Ot = liN, where M is a positive
integer. Let ui and vi (0 :s: i :s: M, and 0 :s: n :s: N) be the approximations
to u(iox, nOt) and v(iox, nOt), respectively, which implies that wi := ui - vf
is the approximation to the weak solution w(iox, nOt) of Problem (SF). The
initial data are given by
ug = C, vg = 0,
u? = 0, v? = 1 (0 < i :s: M).
We employ the implicit finite difference technique to obtain the numerical
solution of heat equation. Then our numerical method deduces iterating the
following two steps.
Step 1: Define iii by
iii+l - 2iii + iii_l
ox 2
(0 < i :s: M),
0.9 r-.--...---.---r---,--r---r--:l'7--;;w.-,
0:'·8::6.'#~
Exact
Ilx=10·2, at=10-3
0.8 ruc=10- 2/2, AI=10· 3/4
6x:::10· 2/4, 61=10. 3/16 ~""H'~"H'
ruc=10· 2/8, At=10· 3/64 •.•.•.•.
0.6
0.4
Fig. 1. Numerical interfaces for some Fig. 2. Close-up of the numerical in-
meshes terfaces in Fig. 1
Fig. 2 is a close-up of Fig. 1. One can say that the numerical interface converges
to the exact one.
References
1. Beckett, G., Mackenzie, J. A., Robertson, M. L. (2001): A moving mesh finite
element method for the solution of two-dimensional Stefan Problems. J. Compo
Phys., 168, 500-518
2. Bertsch, M., de Mottoni, P., Peletier, L.A. (1984): Degenerate diffusion and the
Stefan problem. Nonlinear Analysis TMA, 8, No.11, 1311-1336
3. Evans, L.C. (1993): Convergence of an algorithm for mean curvature motion.
Indiana Univ. Math. J., 42, No.2, 533-550
4. Eymard, R., Hilhorst, D., van der Hout, R., Peletier, L.A. (2000): A reaction-
diffusion system approximation of a one-phase Stefan problem. In: Menaldi, J.L.,
Rofman, E., Sulem, A. (ed) Optimal control and partial differential equations.
156-170
5. Grossmann, C., Noack, A. (2002): Smoothing and Rothe's method for Stefan
problems in enthalpy form. J. Compo Appl. Math., 138, 347-366
6. Hilhorst, D., van der Hout, R., Peletier, L.A. (1996): The fast reaction limit for
a reaction-diffusion system. J. Math. Anal. Appl., 199, 349-373
7. Ikota, R., Mimura, M., Nakaki, T.: A methodology for numerical simulations to
a singular limit. preprint
8. Kawarada, H. (1989): Free boundary problem: Theory and numerical method
(in Japanese). University of Tokyo Press
9. Merriman, B., Bence, J.K., Osher, S.J. (1994): Motion of multiple junctions: A
level set approach. J. Compo Phys., 112, 334-363
10. Murakawa, H. (2004): A numerical method for Stefan type moving boundary
problems (in Japanese). MA Thesis, Kyushu University, Japan
A Singular Limit Method for the Stefan Problems 657
Fig. 3. Initial data and numerical solutions in a three-dimensional case. Time runs
left to right and up to down.
11. Murakawa, H., Nakaki, T. (2003): A singular limit approach to moving boundary
problems and its applications. Theoretical and Applied Mechanics Japan, 52,
255-260
12. Nogi, T. (1974): A difference scheme for solving the Stefan problem. Publ. Res.
Inst. Math. Sci. Kyoto Univ. series A, 9, 543-505
13. Verdi, C., (1994): Numerical aspects of parabolic free boundary and hysteresis
problems. Lecture Notes in Mathematics, 1584, 213-284
14. Yamaguchi, M., Nogi, T. (1977): Stefan problem (in Japanese). Sangyotosho
Higher-Order Split-Step Schemes for the
Generalized Nonlinear Schrodinger Equation
1 Introduction
The generalized nonlinear Schrodinger (GNLS) equation is a nonlinear partial
differential equation given by
(2)
Higher-Order Split-Step Schemes 659
for the GNLS equation. If, for the moment, Land N are assumed to be t
independent, a formally exact solution of equation (5) is given by
(11)
where
(13)
The interval [0,271'] is divided into N equal subintervals with grid spacing
LlX = 271'1N where the integer N is even. The spatial grid points are given
by Xj = 271'j1N, j=0,1,2,oo.,N. The approximate solution to w(Xj,t) is
denoted by Wj (t). The discrete Fourier transform of the sequence {Wj} is
defined as
1
L
N-1
A
N N
Wk = .1'k[Wj ] = N Wjexp(-ikXj), --<k<--1.
2 - - 2 (14)
j=O
~-1
Wj =.1'j- 1[Wk]= L Wkexp(ikXj ), j=0,1,2,oo.,N-1. (15)
k=-~
Here .1' denotes the discrete Fourier transform and .1'-1 its inverse. These
transforms can be realized efficiently via a fast Fourier transform (FFT) algo-
rithm. For the FFT algorithm used here, the integer N must have only prime
factors 2 and 3. In both linear and nonlinear subproblems we approximate spa-
tial derivatives in both linear and nonlinear subproblems using discrete Fourier
transforms.
662 G.M. Muslu, H.A. Erbay
We consider a split-step method for the GNLS equation, in which the linear
equation
Wt - ipwxx = 0 (16)
and the nonlinear equation
are solved in a given sequential order corresponding to one of the splitting for-
mulas (9)-(11). The linear equation (16) can be solved by means of the discrete
Fourier transform and the advancements in time are performed according to
(18)
Here L1t is time step and Wr denotes the approximation to w(Xj ,mL1t). The
spatial discretization of the nonlinear equation (17) by a Fourier pseudospectral
method can be written as
dW
d/ = i (q1 IWj l2 Wj + q2 IWj l4 Wj ) - .1'j-1[ik'hFdI WjI2llWj
-.1'j-1[ikq4.1'k[Wj llIWj 12, j = 0,1,2, ... , N - 1. (19)
For the time integration of this equation, instead of using an approximate ana-
lytical technique [2] we adopt rather a different approach and employ a fourth-
order Runge-Kutta method. Now the total error involved in integrating from
time t to time t + L1t will be the sum of the splitting error and the temporal
discretization error of the nonlinear equation (17).
The first-order split-step Fourier method for the GNLS equation can be
summarized as follows: Given the data Wj at any time step t = t m , first
advance the solution according to the nonlinear part, namely solve equation
(19) using the fourth-order Runge-Kutta method for time integration. This
becomes the initial data for the linear problem which is solved by the discrete
Fourier transform as indicated by equation (18). The extension of the first-
order split-step scheme based on equation (9) to the second-order and fourth-
order split-step schemes based on equations (10) and (11), respectively, is
straightforward.
3 Numerical Experiments
To gain insight into the performance of the suggested split-step schemes we per-
form the following three numerical experiments. The conservation properties
of the split-step schemes are examined by calculating discrete analogues of the
conserved quantities 11 , hand h. The relative errors in discrete approxima-
tions to the conservation integrals (2), (3) and (4) are denoted by J1 , J2 and J3 ,
Higher-Order Split-Step Schemes 663
_ 1
r/J(x, t) = 2 tanh 1['2 tanh (x - 2t -15)] + x -15 (21)
-2
-1
-4
-2
-4 -10 SS4
-5 -12
1 10 (x 10') 1 10 (xl0')
Fig. 1. The Loo-erroN> at t = 3 as a function of the number of spatial grid points
for the first-order (SSl), second-order (SS2) and fourth-order (SS4) split-step Fourier
schemes. (a) The space interval: 5:S:; x :s:; 35, (b) The space interval: - 20 :s:; x :s:; 60
Table 1. Comparison of the convergence rates in time for the first-order, second-
order and fourth-order split-step Fourier schemes in the case of a single solitary wave
-20:S:; x :s:; 60.
the integrals. The results are represented in Table 2. Note that the computing
times in Table 2 are normalized so that the computing time of the first-order
split-step scheme is one unit. The results show that each of the conserved quan-
tities is very well preserved by the split-step schemes. Furthermore, we observe
that the computing time increases with the increasing order of the split-step
method. We conclude that the fourth-order split-step scheme is computation-
ally more efficient than the first-order and second-order schemes.
Table 2. Comparison of the Loo-error, the L2-error, the conservation errors 81,
82 and 83 and the computing times for the first-order, second- and fourth-order
split-step Fourier schemes (N = 512, - 20 :s:; x :s:; 60, .:1t = 0.61038 x 10- 3 ).
1 1 1 1
w(x,O) = Insech[ -(x - 15)] exp i{ -(x - 15) + tanh[ -(x - 15)]}
v2 2 4 2
1 1 . 1 1 1
+-sech[-(x - 35)] expt{ --(x - 35) + -tanh[-(x - 35)]} .
2v2 4 2 2 4
3.3 Blow-up
For certain values of the coefficients and certain initial conditions, solutions
to the GNLS equation experience finite time blow-up [1]. We now apply the
fourth-order split-step scheme to a case of the GNLS equation in which the
exact solution blows up in finite time. The initial condition is the Gaussian
function w(x,O) = exp( _x 2 ) and the coefficients are q1 = -2, q2 = 20,
q3 = 0, q4 = 0 . The exact values of the conserved quantities h, 12 and 13 are
h = J7r /2, h = ft(9v2 + 9 - 20v'6)/IS and h = 0 for this problem.
In [1], it has been shown analytically that the exact solution w(x, t) for this
problem will blow up in finite time and furthermore, an upper bound on the
blow-up time is t ::::::: 1.7.
In the present study, the above problem is solved on the interval -7.5 ::::: x :::::
7.5 for times up to t = O.OS. We present the numerical results obtained using
the fourth-order scheme on Table 3. Although a formal proof of the existence
of the blow-up is not presented here, the numerical results strongly indicate
that a blow-up is well underway by time t = O.OS. This is consistent with
the numerical results presented in [1] and [5]. The fact that the three results
about the predicted time of blow-up, which were obtained by totally different
methods, are in complete agreement makes one believe in their correctness. As
in [1] we conclude that the upper bound given in [1] is not sharp.
666 C.M. Muslu, H.A. Erbay
t It h h I W(O,t) I
0.00 1.253314 -2.684467 -9.793286E-17 1.000000
0.01 1.253314 -2.684467 -6.651917E-13 1.007348
0.06 1.253314 -2.684467 -4.673614E-12 1.526254
0.07 1.253314 -2.684448 -4.395637E-12 2.376429
0.08 1.253352 -2.829258 -6.085108E-1O 3.430374
4 Conclusions
References
1. Pathria, D., Morris, J. L1. (1989): Exact solutions for a generalized nonlinear
Schrodinger equation, Physica Scripta 39, 673-679.
2. Pathria, D., Morris, J. L1. (1990): Pseudo-spectral solution of nonlinear
Schrodinger equations, 1. Comput. Phys. 87, 108-125.
Higher-Order Split-Step Schemes 667
Summary. In this note we present some of our recent results concerning flows with
pressure and shear dependent viscosity. From the numerical point of view several
problems arise, first from the difficulty of approximating incompressible velocity fields
and, second, from poor conditioning and possible lack of differentiability of the in-
volved nonlinear functions due to the material laws. The lack of differentiability can
be treated by regularisation. Then, Newton-like methods as linearization technique
can be applied; however the presence of the pressure in the viscosity function leads to
an additional term introducing a new non-classical linear saddle point problem. The
difficulty related to the approximation of incompressible velocity fields is treated by
applying the nonconforming Rannacher-Turek Stokes element. However, then we are
facing another problem related to the nonconforming approximation for problems
involving the symmetric part of gradient: the classical discrete 'Korn's Inequality'
is not satisfied. A new and more general approach which involves the jump across
the inter-element boundaries should be used, which requires a small modification of
the discrete bilinear form by adding an interface term, penalizing the jump of the
velocity over edges. This is achieved via a modified procedure in the derivation of
a Discontinuous Galerkin formulation. As a solver for the discrete nonlinear systems,
a Newton variant is discussed while a 'Vanka-like' smoother as defect correction in-
side of a direct multigrid approach is presented. The results of some computational
experiments for realistic flow configurations are provided, which contain a pressure
dependent viscosity, too.
1 Introduction
nian rheology cannot describe granular flow accurately. It is assumed that the
material is incompressible, dry, cohesionless, and perfectly rigid-plastic. Such
properties are relevant for modelling the granular flows via special models for
continuum mechanics, as for instance the Schaeffer model [9].
. D
T =h p sm4>j[Dj[ + pI.
2 Problem formulation
Let us consider the flow of the stationary (!) generalized Navier-Stokes problem
in (1.3) in a bounded domain D C ]R2. If we restrict the set V of test functions
to be divergence-free and if we take the constitutive laws into account, the
above equations from (1.3) lead to:
In qdivudx = 0, \lqEM,
In this approach, the nonlinearity is first handled on the continuous level. Let
u 1 being the initial state, the (continuous) Newton method consists of finding
u E V such that
In
+ 202v(Dn(u 1),pl)[D(u l ) : D(v)]pdx
= In In
fv - 2v(Dn(u l ),pl)D(u 1) : D(v)dx, \Iv E V, (3)
where OiV(·, .); i = 1,2 is the partial derivative of v related to the first and
second variables, respectively. To see this, set X = D(u 1), X = D(u), Y =
pi, Y = p, F(x, y) = v(~lxI2, y)x and f(t) = F(X + tx, Y + ty), so that
\7 .[ 2v(Drr(u 1),pl)D(u)
+ 281 v(D rr (u 1),pl)(D(u1) : D(u))D(u1) (6)
+ 282 v(Drr(u 1),pl)pD(u1)]
where R u (-'·) and R p ("') denote the corresponding nonlinear residual terms
for the momentum and continuity equations, and the operators A<u1,pl), B,
A*(u1,pl) and B*(u1,pl) are defined as follows:
3 Discretization
The degrees offreedom are determined by the nodal functionals {Fj.,a,b) (.), r c
8Th },
for all basis functions ¢i and ¢j of W;,b. Taking into account an additional
relaxation parameter s = s(v), the corresponding stiffness matrices are defined
via:
(Su, v) = s L I~I L[U][V]dO" (17)
EEErUED
u+ . n+ + U- . n- on internal edges EJ ,
[u] = { U· n on Dirichlet boundary edges ED, (18)
o on Neumann boundary edges EN,
where n is the outward normal to the edge and (-)+ and (-)- indicate the value
of the generic quantity (.) on the two elements sharing the same edge.
4 Linear solver
This section is devoted to give a brief description of the involved solution tech-
niques for the resulting linear systems. For the nonconforming Stokes element
Flow with Shear and Pressure Dependent Viscosity 673
with matrix F = A + JdA * and A, B, A* and B* are the discrete matrices cor-
responding to the operators in (8), (9), (10) and (11). For the preconditioning
step only a part of the matrix, i.e. F + S*, is involved. All other components in
the multigrid approach, that means inter grid transfer, coarse grid correction
and coarse grid solver, are the standard ones and are based on the underly-
ing hierarchical mesh hierarchy and the properties of the nonconforming finite
elements (see [7] and [8] for the details).
5 Numerical tests
In this case, the gradient and tensor formulations are equivalent; the accuracy
and efficiency of the stabilized tensor discretization is checked by comparisons
with the gradient formulation (see Table 1); the tests have been performed for
the 'flow around cylinder' benchmark configuration [10]. For all three formu-
lations the lift and drag forces are very similar.
Table 1. Efficiency of the stabilized nonconforming FEM: Lift and Drag forces
Level 5
1/1/ grad tensor stab. tensor
1 Drag 31252 x 10 -1 31221 x 10 31231 x 10
Lift 30898 x 10- 3 30924 X 10- 3 30936 X 10- 3
NL/MG 3/3 7/200 3/3
1000 Drag 55657 x 10 -4 55531 x 10 -'i 55535 x 10 -'i
(Re = 20) Lift 10180 x 10- 6 10259 X 10- 6 10277 X 10- 6
NL/MG 11/4 11/12 11/3
674 A. Ouazzi, S. Turek
in contrast, the convection dominates with the increase of the Reynolds num-
ber, the average number of multigrid sweeps per nonlinear sweep decreases,
as the influence of the kernel function is getting irrelevant. This may explain
why many people from the CFD community did not pay much attention to
this problem before.
Table 3. Comparison of the aproximation results for lift, drag and pressure differ-
ence for two FEM approaches, the stabilized nonconforming {JI/Qo and the classical
conforming Q2/Pl (see [3]).
ILevel IElements I Drag I Lift I 6. p INN/NLII Drag I Lift I 6. p INN/NLI
Power r = 1.5 r = 1.1
4 Ql/QO 1594.20 14.25 24.56 9/2 916.02 3.7381 15.74 12/2
Q2/H 1635.80 14.39 25.09 8/140 953.94 3.9217 15.82 19/294
5 Ql/QO 1615.60 14.43 24.81 8/2 935.13 3.9954 15.82 15/3
Q2/P1 1637.60 14.44 25.07 9/723 957.64 4.0587 15.87 18/1162
6 QI/Qo 1626.20 14.46 24.94 8/2 946.22 4.0592 15.85 13/5
Table 4. Corresponding results for the number of nonlinear iterations and the av-
eraged number of linear sweeps per nonlinear cycle
v(z,p) = exp(,6p) Fixpoint Newton
Level (3 0.1 0.3 0.5 0.1 0.3 0.5
5 stab. tensor 6/2 12/2 33/2 3/3 4/2 4/3
gradient 6/2 11/2 34/2 3/3 4/2 4/3
6 stab. tensor 5/3 11/3 65/2 3/3 3/3 3/3
gradient 5/3 9/3 76/2 3/3 3/3 5/3
- Continuity equation
~ + V· (pu) = 0, and
- Normality condition
676 A. Ouazzi, S. Turek
References
Arvet Pedas
1 Introduction
Let b E R = (-00,00), b > 0 and set.:1b = {(t,s) E R2: O:s; t:s; b,O:S; s < t},
.:1b = {(t,s)
E R2: O:S; s:S; t:s; b}. We consider an initial-value problem for a
linear integro-differential equation of the form
J J
t t
with a constant c = c(K) for all (t, s) E .:1b and all non-negative integers i and
j such that i + j :s; m.
It follows from (3) (with i = j = 0, 0 :s; v < 1) that the kernels K1(t, s)
and K 2 (t, s) of (1) may possess a weak singularity as s -+ t. In case v < 0 the
kernels Kl and K2 are bounded on .:1b but their derivatives may be singular
as s -+ t. In particular, Kl and K2 may have the form
K",,{3(t,s) = K(t,S)(t - s)-allog(t - s)I{3, O:s; a < 1,,6 2:: 0,
where K : .:1b -+ R is a m times continuously differentiable function. Clearly,
Ka,o E wm,a(.:1 b), 0 < a < 1,. KO,l E wm,O(.:1 b) and K a,{3 E wm,a+,,(.:1 b)
for 0 :s; a < 1, ,6 > 0, with a small c > 0 (c < 1 - a). Especially, if Kl = 0
and K2 = Ka,o, 0 < a < 1, then equation (1) is of type which is often
referred to as the Basset equation; the last one is playing important role in
the mathematical modelling of the diffusion of discrete particle in a turbulent
fluid (see, for example, [10, 11]).
The set Cm,V[O, b], mEN, v < 1, consists of functions 1 y E C[O, b] which
are m times continuously differentiable in (0, b] and such that
m
L sup (Wj_(l_v)(t)ly(j)(t)l) :S;c. Here
j=lO<t~b
I if A < 0,
w)..(t) = { (1 + Ilogtl)-l if A = 0,
e· if A > 0,
m
Equipped with the norm Ilyllm,LI = max ly(t)l+ E sup (Wj_(l_Ll)(t)ly(j)(t)I) ,
O~t~b j=lO<t~b
em,LI[O, b] is a Banach space. Thus, if a function y belongs to em,LI[O, b], mEN,
l/ < 1, then its derivatives can be estimated by
1 if j < 1 - l/ ,
ly(j)(t)1 :::; c { 1 + Ilogtl if j = 1 - l/, (4)
t 1 - Ll - J if j > 1 - l/ ,
°
where < t :::; band j = 0,1, ... ,m. Note that em[O, b], the set of m times
continuously differentiable functions y : [a, b] ----+ R, belongs to em,LI[O, b] for
arbitrary l/ < 1.
Introducing a new unknown function
z =y', (5)
and using (2), equation (1) may be rewritten as a linear Volterra integral
equation of the second kind with respect to z,
J J J
t s t
J
t
J
where t
J
t
J
t
Proof. We present T (see (9) and (10)) in the form T = TOlTo2 + Tl + T2,
where the linear operators T Ol , T02 , Tl and T2 are defined by settings
J
t
J J
t t
J
t
For given N EN, r E R, r 2': 1, let 1T'N = {to, ... , t N : 0 = to < ... < t N = b}
be a partition (a grid) of the interval [0, b] given by the grid points
tj = b(j / Nr, j = 0, ... , N . (11)
Here r (also called the grading exponent) characterizes the non-uniformity of
the grid IIN: if r > 1 then the gridpoints (11) are more densely clustered near
the left endpoint of the interval [0, b]. Let
(12)
For given integers m 2': 0 and -1 ::; d ::; m - 1, let s5::.) (IIN) be the spline
space of piecewise polynomial functions on the grid IIN:
where 1T'm denotes the set of polynomials of degree not exceeding m and ul uj
is the restriction of u to the subinterval OJ, j = 1, ... , N. Note that elements
of S$;:l) (IIN) = {u: ul uj E 1T'm,j = 1, ... ,N} may have jump discontinuities
at the interior points t 1, ... , t N -1 of the grid IIN.
In every subinterval OJ (j = 1, ... ,N) we introduce mEN interpolation
points
l = 1, ... ,m (j = 1, ... ,N) (13)
where Til, ... , Tim do not depend on j and N and satisfy
o ::; Til < . . . < Tim ::; 1 . (14)
To a given continuous function z : [0, b] ~ R we assign a piecewise poly-
nomial interpolation function PN z E s~-!i (IIN) which interpolates z at the
points (13): (PNZ)(tjl) = Z(tjl), l = 1, ... , m; j 1, ... , N. Thus, PNZ is
independently defined in every subinterval aj (j = 1, ... ,N) and (PNZ)(t)
682 A. Pedas
1
N-m
N-m(1 +logN) for m = 1- v,r = 1;
sr;:,v,r) = N-m for m = 1- v,r > 1; (15)
N-r(l-v) for m > 1 - v, 1 :s; r < m/(1 - v);
N-m form>l-v,r;::::m/(I-v).
Lemma 3. Let T : LOO(O, b) ----) C[O, bj be a linear compact operator. Then
°
liT - PNTII,C(LOO(o,b),LOO(O,b» ----) as N ----) 00,
Proof. An easy observation shows that
IlpNII,C(C[O,bJ,LOO(O,b» :s; c, N EN, (16)
°
where c is a constant not depending on N. On the base of (16) and Lemma 2 we
obtain that liz - PNZIILOO(o,b) ----) as N ----) 00 for every z E C[O, b]. Together
with the compactness of T : LOO(O, b) ----) C[O, b] this yields the assertion of
Lemma.
4 Collocation method
We look for an approximation v to the solution z of equation (6) in the space
s;;;~i (lIN) determing v = v(N,m,r) E s;;;~i (lIN)' m ;:::: 1, from the following
conditions:
tjl S
Note that the choice of the collocation points (13) with 'T]l = 0, 'T]m = 1 in (14)
actually implies that the resulting collocation approximation v belongs to the
smoother polynomial spline space S~~l (1I~).
Theorem 2. Let Yo E R, K 1 , K2 E Wm,V(Ll b), p, q E Cm,V[O, b], mEN,
v E R, v < 1, and assume that the collocation points (13), with the grid
points (11) and parameters (14), are used.
°
Then, for all sufficiently large N, say N ~ No, and for every choice of
parameters (14) with'T]l > or'T]m < 1, the equalities (18) and (17) determine
unique approximations u E S~)(lI'N) and v E S;:~i(lI'N) (with vllTj = (ullTJ',
j = 1, ... , N) to the solution y of problem {(I), (2)} and its derivative y',
respectively. If'T]l = 0, 'T]m = 1, then u E S;;;) (lI'N) and v = u' E S~~l (lI'N).
For all N ~ No the following error estimates holds.'
Ile(i) 1100 :::; CEC;;,v,r) , i E {O, I}. (19)
J JI
t b
)=l tj _
J tj
1
N
I(PNZ)(S) - z(s)lds::; crN- r(2-v) Lf(2-l/)-m-1,
)=1
(23)
with c and C1 not depending on N. Furthermore, for a number 0 E R we have
N {NO:+ 1 ifo>-I,
LjO:::;c 1+llogNI~fo=-I, (24)
j=l 1 1fo<-I,
Piecewise polynomial approximations 685
°
In the latter form the collocation method in more particular case (K2 = 0,
Kl E W m,V(L1 b), < v < 1, p, q E Cm[O, b], mEN) has been examined in
[3,4,5,6].
3) The convergence results established by Theorems 2 and 3 are derived
under the assumptions that all needed integrals in (17) can be evaluated ana-
lytically. Since this is rarely possible in concrete applications, there arises the
question how to approximate these integrals so that the resulting fully dis-
cretized collocation method converges under the same conditions and with the
same rate as it is proved for the "exact" collocation method in Theorems 2
and 3. This question will be discussed elsewhere.
References
1. Brunner H. (1982): A survey of recent advances in the numerical treatment of
Volterra integral and integro-differential equations. J. Comput. Appl. Math., 213-
229.
2. Baker C. T. H. (2000): A perspective on the numerical treatment of Volterra
equations. J. Comput. Appl. Math., 125, 217-249.
3. Brunner H., van der Houwen P. J. (1986): The Numerical Solution of Volterra
Equations, CWI Monographs 3, North-Holland, Amsterdam.
4. Makroglou A. (1981): A block-by-block method for Volterra integro-differential
equations with weakly-sin gular kernels. Math. Comp., 37, 95-99.
5. Brunner H. (1986): Polynomial spline collocation methods for Volterra integra-
differential equations with weakly singular kernelss. IMA J. Numer. Anal., 6,
221-239.
6. Tang T. (1993): A note on collocation methods for Volterra integro-differential
equations with weakly singular kernels. IMA J. Numer. Anal., 13, 93-99.
7. Brunner H., Pedas A., Vainikko G. (2001): Piecewise polynomial collocation
methods for linear Volterra integro-differential equations with weakly singular
kernels. SIAM J. Numer. Anal., 39, 957-982.
686 A. Pedas
8. Brunner H., Pedas A., Vainikko C. (2001): Spline collocation method for linear
Volterra integra-differential equations with weakly singular kernels. BIT, 41, 5,
891-900.
9. Parts 1., Pedas A. (2003): Spline collocation methods for weakly singular Volterra
integra-differential equations. In: Brezzi F., Buffa A., Corsara S., Murli A. (eds)
Numerical Mathematics and Advanced Applications, Enumath 2001, Springer-
Verlag, Italia, Milano, 919-928.
10. McKee S., Stokes A. (1983): Praduct integration methods for the nonlinear Bas-
set equation. SIAM J. Numer. Anal., 20, 1, 143-160.
11. Brunner H., Tang T. (1989): Polynomial spline collocation methods for the non-
linear Basset equation. Computers Math. Applic., 18, 5, 449-457.
12. Vainikko C. (1993): Multidimensional Weakly Singular Integral Equations. Lec-
ture Notes in Mathematics, 1549, Spronger-Verlag, Berlin, Heidelberg, New-York.
On a Discontinuous Galerkin Method for
Radiation-Diffusion Problems
Summary. In this paper, we show that the discontinuous finite element method
recently developed by Warsa, Wareing and Morel for radiation-diffusion problems
belongs to a class of generalized local discontinuous Galerkin methods. We then
derive a priori error bounds for this method and numerically confirm them to be
sharp.
1 Introduction
In the recent work [7] and [6]' Warsa, Wareing and Morel introduced a discon-
tinuous finite element method for the discretization of a radiation-diffusion
problem that is represented by a system of two coupled first order equations
for the zeroth and first angular moments of the particle distribution (the scalar
flux and current, respectively). These so-called PI equations arise out of an
angular Galerkin approximation to the Boltzmann transport equation based
upon a spherical-harmonic trial space of first order; see [3] or [4] for more
details. In the absence of time dependence (the case considered here) the re-
sulting problem finds the current J = J(x) and the scalar flux iP = iP(x)
satisfying
\7 . J + (J"a(x)iP = Qo, (1)
subject to so-called vacuum and reflecting boundary conditions, respectively,
1 1
-iP--J·n=O
4 2
onrv, J. n = 0 on rR. (2)
LOCJ(Jl) and satisfy O"t(x) ::::: 0"* > 0 and O"a(x) ::::: 0 in Jl (O"a = 0 in purely
scattering subregions). For simplicity, we further assume that irv ds > o.
The method of Warsa, Wareing and Morel approximates both the un-
knowns J and tP by piecewise linear functions. It is designed in such a way that
the radiation energy as well as the radiation momentum are conserved over
each cell, as in standard, upwind Godunov schemes. In combination with effi-
cient preconditioning techniques, the results in [5-7] indicate that the method
of Warsa, Wareing and Morel can be applied to a wide range of problems.
In this note, we show that the discrete formulation of the PI equations
of Warsa, Wareing and Morel belongs to the general class of mixed discon-
tinuous Galerkin (DG) methods analyzed by Castillo, Cockburn, Perugia and
Schotzau in [1]. This class extends and generalizes the local discontinuous
Galerkin (LDG) method proposed by Cockburn and Shu [2]. In the original
LDG approach the vector unknown can be eliminated from the equations in
a local and element-wise manner. In contrast, the method of Warsa, Wareing
and Morel belongs to the" truly" mixed variants of the LDG method described
in [1] for which such a local elimination is no longer possible. While this can be
seen as a shortcoming of truly mixed DG methods it in fact leads to better and
nearly optimal convergence rates for the approximation of the vector variable;
see [1]. Furthermore, as described in [6]' the PI equations are used to accel-
erate the iterative convergence of the Boltzmann transport equation solution.
The transport equation is discretized with a spatial DG method and in many
cases effective acceleration strongly depends on how well the vector unknown
of the acceleration equations is approximated. This application is what makes
the use of the truly mixed LDG discretization for the PI equations necessary
and motivates the study presented in this paper.
We apply the theoretical results in [1] and conclude that the method of
Warsa, Wareing and Morel is well-posed. Moreover, the following a priori error
bounds hold: for an approximation order k ::::: 0, the method exhibits conver-
gence rates in the mesh size of order k + ~ in a suitable energy norm, and
of order k + 1 in the L2-norm of the scalar flux. We present a set of numer-
ical convergence tests for a three dimensional model problem on tetrahedral
meshes that verify the theoretical predictions. We must point out that these
tests complete the tests in [1] where no numerical results were shown for truly
mixed DG methods.
the same notation as in [1], we let Ey be the union of all interior faces of Th,
Ev and ER the union of all boundary faces of Th on Tv and TR, respectively,
and set E = Ey U Ev U ER . For piecewise smooth vector- and scalar-valued
functions wand u, we introduce the following trace operators. Let e c Ey be
an interior face shared by two elements K+ and K-, and write n± for the
outward normal unit vectors to the boundaries 8K±, respectively. Denoting
by w± and u± the traces on 8K± taken from K±, respectively, we define the
jumps across e by [w] = w+ . n+ + w- . n- and [u] = u+n+ + u-n-, and the
averages {{wJt = (w+ +w-)/2 and {{u]} = (u+ +u-)/2. On a boundary face
°
e c Ev U ER , we set [w] = w· n, [u] = un, {{w]} = wand {{u]} = u.
Let Th be a triangulation Th of [l and k ::::: an approximation order. We
wish to approximate (J,<P) by a piecewise polynomial function (Jh' <Ph); that
is, (Jh' <Ph)IK E Pk(K)d x Pk(K) for all K E Th. Here, Pk(K) denotes the
set of polynomials of degree at most k on K. This approximation is defined
by imposing that, for all elements K E Th and all test functions (w,u) E
Pk(K)d x Pk(K),
Here, Ih,K and $h are the so-called numerical fluxes, that are approximations
to the traces of J. nK and <P on the element interfaces and are chosen as follows
(see [6]).
First, for an element K+ and an interior face e shared by K+ and a neigh-
boring element K-, we define the following inwardly and outwardly directed
discrete flows (partial currents) by
1 A;- 1 - 1 A;+
j in
e,K+ = 4'¥h - 2J h . nK+, j out
e
,K+
_
- -'¥h
4 + -21 J h+ . nK+·
If the face e of K+ is contained in Ev U ER, the outwardly directed flow jouKt
e,
+
is defined as j~,lh = ;j<Pt + Pt . nK+· Further, if the face e of K+ belongs
to Ev , we set jin
e, K
+ = 0, whereas jin
€, K
+ is not needed for e c ER .
The numerical fluxes are then taken as
obviously the choice is not unique, but consistency with the intended applica-
tion in which solution of the PI equations is embedded requires that we make
this choice; see [6] for more details. Note that the fluxes can further be easily
adapted to take into account inhomogeneous boundary data.
We now cast the fluxes in (4) in the setting of [1]. Denote by Jh a vector
field such that Jhle . nK = lh,K, for all e c 8K (the definition of Jhle no
longer depends on which side it is taken from). Then we have
~ 1
Jhl e = {{J h}} + 4[<h], $hle = {{<Ph}} + Ph], if e C [T,
~ 1 1 ~ 1
Jhle = 2Jh + 4<Ph n, <Phle = 2<Ph + Jh . n, if e C [v,
3 Error analysis
In this section, we discuss the a priori error bounds that are obtained for
the formulation (3). For a piecewise smooth function (w, u), we define the
seminorm
2 12 2 2 1212
I(w, u)lh = 3110't wllo,D+11 [w] Ilo,EIUEV +211 [w] Ilo,ER +IIO'J UIl O,D+ 41 [u] IIEIUEV'
2
Whenever O'a > 0, 1(-, ')Ih actually defines a norm. Our main result is given
in Theorem 1, which derives directly from the analysis in [1, Theorem 2.2].
Strictly speaking, that analysis was carried out for O'a == 0 and simpler bound-
ary conditions but extension to the current situation poses no difficulties.
4 Numerical results
In practice the coefficient (Jt may have strong discontinuities at the interfaces
Notice that tf> is piecewise smooth, but only belongs to H~-c(D), c > 0,
whereas 3~tVtf> = (cp(y,z),(x - ~)8ycp(y,z),(x - ~)8zcp(y,z)) is a smooth
function. We use the same sequence of tetrahedral meshes as the numerical
experiments presented in Section 4.1.
We consider the two cases a = 1, b = 0.01 and a = 100, b = 1. Notice
that the jump in the normal derivative of tf> at the surface x = 0.5 is equal to
3(a - b)y2 z2. This jump is almost two orders of magnitude larger in the case
a = 100, b = 1 than in the case a = 1, b = 0.01. For this reason deterioration
of the convergence rates due to the lack of smoothness of the exact solution
might be more serious in the second case than in the first one.
On a Discontinuous Galerkin Method for Radiation-Diffusion Problems 693
k:::O
10- 1 r----,____---,____---,____-----,
_ 1[[<I\ll Dur
......... W[[<1\11 Io.v
¢"'%'" a[[Jkll lo.r
............ . [[Jl ll lo.v
- ~----1i R [[J.ll ~_,
10\'-----'---------,'-------:'':-6------c':
IO-Ic---~--,____---,____---,____--______:J
_11[[(I>~JlIl0.l
..... 11[[<I'b11Ilo,v
¢'''V¢ II [[J~llll o"
........ 1I[[J)Jll o.v
+~"'<i II [[JhllIlo,J(
'6 32
Table 2 shows the errors and the numerical convergence rates Ti in the l'lh-
seminorm and in the L2-norm of iP, as well as in the L2-norm of J, for the case
a = 1, b = 0.01, Order k+ ~ in the l'lh-seminorm, and order k+ 1 convergence
in the L2-norm for iP are obtained. These results agree with the theoretical
estimates in Theorem 1, even though the elliptic regularity assumptions of
Theorem 1 for the estimate of the L2-error in iP are not satisfied. Furthermore,
the L2-error in J also converges with the optimal order k + 1. In Figure 2,
the errors in the jumps of iP and J are shown. Again, we see that the interior
jumps converge at the rates O(hk+~) while the boundary jumps show the full
convergence rate of O(hk).
694 I. Perugia et al.
5 Conclusions
10"2~----:-----!;------,]!-::6-------:!32
Fig. 2. Two- material problem, a = 1, b = 0.01: L 2 -errors of the jumps in J and <P
References
1. P. Castillo, B. Cockburn, I. Perugia, and D. Schotzau. An a priori error analysis.
of the local discontinuous Galerkin method for elliptic problems. SIAM J. Numer.
Anal., 38:1676-1706, 2000.
2. B. Cockburn and C .-W. Shu. The local discontinuous Galerkin method for time-
dependent convection-diffusion systems. SIAM J. Num er. Anal., 35:2440-2463 ,
1998.
3. B. Davison. Neutron Transport Theory. Clarendon Press, 1957.
4. E. E. Le~is and Jr. W. F. Miller. Computational Methods of Neutron Transport.
American Nuclear Society, 1993.
696 I. Perugia et al.
1O- 12';-------------;----------!8,--------::'6,---------;3-2
n
k:::l, a",lOO. 1>=1
10
. . . . 1I[[<I>k11 1lu!
. . . . . . 11 [[<lIhll Uo,v
10-2 "~,,,,+ II [[Jb]] "0.[
........ 11 [[Jl )] nos
II [[Jhllllll,R
1O"'z';-------------:-------------;,--------!16,----------=32
1 Introduction
Various methods have been put forward to treat multi-phase flows. A classifi-
cation is given in [1]. The two methods that are most suitable for the current
research are the Volume-oj-Fluid (VOF) method and the Level-Set method.
For both methods a marker function is used to define the interface. In case of
the Volume-of-Fluid method, a marker function, say lJt, indicates the fractional
volume of a certain fluid, say fluid '1', in a computational cell. It can be seen
as the concentration of the marker particles of the MAC-method, when the
number of particles goes to infinity.
An alternative to the Volume-of-Fluid method is the Level-Set method
([3, 4]). The interface is now defined by the zero level-set of a marker function,
say t!>: t!> = 0 at the interface, t!> > 0 inside fluid '1' and t!> < 0 elsewhere.
The function t!> is chosen such that it is smooth near the interface. This eases
the computation of interface derivatives. Also, methods available from hyper-
bolic conservation laws can be used to advect the interface. The interface is
(implicitly) advected by advecting t!> as if it was a material constant:
at!>
at + u . \It!> = o. (1)
It makes the advection of the Volume-of-Fluid function lJi easy (i.e. without
interface reconstruction) and finding iP from lJi a straightforward task. This is
carried out by well-known numerical tools, like Picard and Newton iterations.
The PLIC method is not adopted (unlike CLSVOF), yet mass is conserved
in the same manner. Note that the CLSVOF method might not be easily
extendible to 3D space. Yet the extension of MCLS to three-dimensional space
can be done in a straightforward way. Note also that with this approach, it is
not necessary to smooth (or regularize) lJi, which is common for other methods.
2 Governing Equations
Consider two fluids '0', and '1' in domain [l E IR? which are separated by an
interface S. Both fluids are assumed to be incompressible, i.e.:
V· u = 0, (3)
where u = (u, v)t is the velocity vector and u and v are the velocities in
x- and y-direction respectively. The flow is governed by the incompressible
Navier-Stokes equations:
700 S.P. van cler Pijl et al.
au l I t
-m + u· Vu = --p Vp + -p V 'IL (Vu + Vu ) + g, (4)
where p, p, IL and g are the density, pressure, viscosity and gravity vector
respectively. The density and viscosity are constant within each fluid. Using
the Level-Set function iP these can be expressed as
(5)
and similar for p, where the subscript indicates the corresponding fluid and H
is the Heaviside step function.
(7)
x <-0;
Ixl::::; 0; (8)
x > 0;
3 Computational Approach
The strategy of modeling two-phase flows is to compute the flow with a given
interface position and subsequently evolve the interface in the given flow field.
In the foregoing, it has been described how the flow is computed with a given
interface position. Next we consider the evolution of the interface.
The interface is evolved by advecting the Level-Set function in the flow field
as if it were a material constant (Eqn. (1)):
8if.)
at +u, Vif.}= O. (10)
MCLS The difficulty with the Level-Set method is, that although if.} might
be conserved, this does not imply that mass is conserved. On the other hand,
with the Volume-of-Fluid method, mass is conserved when tj/ is conserved. In
order to conserve mass with the Level-Set method, corrections to the Level-
Set function are made by considering the fractional volume tj/ of a certain fluid
within a computational cell. First the usual Level-Set advection is performed:
first-order advection and unmodified re-initialization. Low order advection and
re-initialization will ensure numerical smoothness of if.}. Furthermore, when the
702 S.P. van der Pijl et al.
flow-field is computed, higher order accuracy might not be expected when the
CSF method is applied and viscosity is regularized. In that respect, higher
order discretization of Eqn. (10) will only lead to improved mass conservation
for the pure Level-Set methods. Since the obtained Level-Set function <pn+l,*
will certainly not conserve mass, corrections to <pn+l,* are made such that
mass is conserved. This requires three steps:
1. the relative volume of a certain fluid in a computational cell (called
'volume-of-fluid' function l}f) is to be computed from the Level-Set function
<p n : l}f = J(<p, V<p);
2. the volume-of-fluid function has to be advected conservatively during
a time step towards l}fn+l;
3. with this new volume-of-fluid function l}fn+l, corrections to <pn+l,* are
sought such that J(<pn+l, V<pn+l) = l}fn+l holds.
These three steps will be explained subsequently.
(11)
where H is the Heaviside step function. The Level-Set function <p is linearized
around <Pk, which leads to
(12)
Note that in contrast with other approaches, the Heaviside step function is
not regularized. After some mathematical manipulations, the function J is
evaluated as
<Pk '5-<Pmaxk
(13)
where
Modeling of Multi-Phase Flows with a Level-Set Method 703
,hn+l _ ,T,n 1 ( F F
'1"+1 l + (15)
2: ,J'+12' -'1"+1 '+1- Xi+lJ'+ xiJ'+
A A l -
't 't 2 ,] 2 .LJX£...ly '2 '2
FY 't'+ l2 ,)'+1 - FY 't'+ l2 ,).),
The fluxes are again computed by linearizing tP (just like Eqn. (13)). In fact,
the fluxes are computed by the straightforward application of J.
It is possible that fluid is fluxed more than once through different faces,
which would cause unphysical values of Jjf. As reported in e.g. [1], this can be
solved by employing either a multidimensional scheme or flux-splitting. For
reasons of simplicity we have chosen for the second approach. The order of
fluxing is: first in x-direction, then in y-direction. Currently the flux-splitting
of [5] is adopted. As reported in [5], undershoots and/or overshoots can still
occur, which leads to unphysical values of Jjf, namely < 0 and> 1. If these
values are replaced by 0 and 1 respectively, mass errors arise which are of
order 10- 4 . This is also experienced in the current research. Mass errors are
completely avoided by redistributing Jjf ([2]).
4 Applications
10·'
L
i
-10'" -e- l~O(d ~ r
-+- l!11ord~r. re_mit
--0-- 2 n oordeT
10'" .. ~ .. 2nd order. re-init.
H· 3rtlorder
~- ~ordar.re-init
10.1 +- MClS
Fig. 1. Relative mass for the linear advection test; E = 10- 8 (every 10th iteration
marked)
simulations (with and without re-initialization) are compared with the MCLS
method. ENO discretization is adopted for the pure Level-Set method (see
aforementioned references). The order of re-initialization is in agreement with
the order of advection. The tolerance in the VOF advection is taken to be:
E = 10- 8 . Globally speaking it can be said that mass is always lost for the
pure Level-Set advection. Mass losses are smaller for higher accuracy and re-
initialization causes much higher mass losses. The MCLS method conserves
mass up to the specified tolerance.
Modeling of Multi-Phase Flows with a Level-Set Method 705
In Fig. 2(b) results are shown for a falling droplet. The conditions are
the same as for the rising bubble, except for the sign of if> at t = 0 and
Yo = Lx. Mass conservation properties are the same as before. The result are
the same until the droplet hits the bottom. Thereafter differences occur. This
is thought to be due to limited number of grid cells available to capture the
flow-phenomena near the wall. The results compare well with [8]. Note that
the results in [8] span t ::::: 0.05; no results after collision are presented.
5 Conclusion
The mass Conserving Level-Set (MCLS) has been presented. The method is
based on the Level-Set methodology, where mass is conserved by considering
706 S.P. van der Pijl et aI.
References
1. S.P. van der PijI. Free-boundary methods for multi-phase flows. AMA report
02-13, Delft University of Technology, 2002. http://ta.twLtudelft.nI.
2. S.P. van der Pijl, A. Segal, and C. Vuik. A mass-conserving level-set method for
modeling of multi-phase flows. AMA report 03-03, Delft University of Technol-
ogy, 2003. http://ta.twi.tudelft.nI.
3. W. Mulder, S. Osher, and J.A. Sethian. Computing interface motion in com-
pressible gas dynamics. Journal of Computational Physics, 100:209-228, 1992.
4. S. Osher and R.P. Fedkiw. Level set methods: An overview and some recent
results. Journal of Computational Physics, 169:463-502, 2001.
5. M. Sussman and E.G. Puckett. A coupled level set and volume-of-fluid method
for computing 3D and axisymmetric incompressible two-phase flows. Journal of
Computational Physics, 162:301-337, 2000.
6. M. Sussman. A second order coupled level set and volume-of-fluid method for
computing growth and collapse of vapor bubbles. Journal of Computational
Physics, 187: 110-136, 2003.
7. D. Enright, R. Fedkiw, J. Ferziger, and I. Mitchell. A hybrid particle level set
method for improved interface capturing. Journal of Computational Physics,
183:83-116, 2002.
8. M. Kang, R.P. Fedkiw, and X.-D. Liu. A boundary condition capturing method
for multiphase incompressible flow. Journal of Scientific Computing, pages 323-
360, 2000.
9. Z. Li and M.-C. Lai. The immersed interface method for the navier-stokes equa-
tions with singular forces. Journal of Computational Physics, 171:822-842, 2001.
10. M. Sussman, P. Smereka, and S. Osher. A level set approach for computing
solutions to incompressible two-phase flow. Journal of Computational Physics,
114:146-159, 1994.
11. Y.C. Chang, T.Y. Hou, B. Merriman, and S. Osher. A level set formulation of
eulerian interface capturing methods for incompressible fluid flows. Journal of
Computational Physics, 124:449-464, 1996.
12. J.U. Brackbill, D.B. Kothe, and C. Zemach. A continuum method for modeling
surface tension. Journal of Computational Physics, 100:335-354, 1992.
Modeling of Multi-Phase Flows with a Level-Set Method 707
13. J.J.I.M. van Kan. A second-order accurate pressure correction method for viscous
incompressible flow. SIAM J. Sci. Stat. Comp., 7:870-891, 1986.
14. F.H. Harlow and J.E. Welch. Numerical calculation of time-dependent viscous
incompressible flow of fluid with free surfaces. Physics of Fluids, 8:2182-2189,
1965.
15. M. Sussman, E. Fatemi, P. Smereka, and S. Osher. An improved level set method
for incompressible two-phase flows. Computers and Fluids, 27:663-680, 1998.
16. M. Sussman and E. Fatemi. An efficient, interface-preserving level set redistanc-
ing algorithm and its application to interfacial incompressible fluid flow. SIAM
Journal on Scientific Computing, 20(4) :1165-1191, 1999.
Numerical Modeling of Bypass Flow
Summary. This paper deals with problem of numerical solution of laminar viscous
incompressible stationary and unstationary flows through vessel with bypass. One
could describe these problems using model of N avier-Stokes equations and find steady
solution of unsteady system by using multistage Runge-Kutta method together with
time dependent artificial compressibility method. Non-stationary solution is achieved
from initial stationary solution by prescribing of nonstationary outlet conditions.
Some results of numerical solution of cardiovascular problems are presented: station-
ary and unstationary 2D flows in a vessel and a bypass.
1 Mathematical model
In the cardiovascular system we could find many different types of vessels like
large arteries, vessels of medium size and capillaries. They differ in diameter
and in thickness and composition of the wall. In larger vessels the blood flow
can be assumed to behave as an incompressible continuum. One can describe
this type of flow using system of momentum and continuity equation written
in conservation form:
Dw
p - - \7. T = pf (1)
Dt
\7·w=O (2)
where T is stress tensor of the fluid, w is velocity vector and f is vector of
external forces, which is later not taken into account. Density of the fluid p
is supposed to be constant in physiological conditions, although it depends on
the red cells concentration. The functional dependence of the stress tensor T
on velocity vector wand the blood pressure p is descented by the following
relations:
(3)
(4)
Equations (3) and (4) describe Newtonian fluid. Important feature of blood
flow is pulsatility caused by the periodic motion of the heart. It is also known
[2] that there is scarcely any turbulence in vessels except some special cases.
Numerical Modeling of Bypass Flow 709
The walls of a tube which is the model of a vessel are supposed to be riggid
and the velocity vector w is null on them. Blood flow can be assumed to be
laminar [2]. Indeed, in physiological conditions, the values of speed involved
are low enough. Morover, generally, the periodicity of the flow, together with
short length of vascular districts, do not give rise to fully developed turbu-
lence. Reynolds number Re = d':;* is important feature of the flow behaviour.
Quantity w* is characteristic velocity, v = /J/ p is kinematic viscosity and d
is a lenght scale. In large and medium human vessels, the Reynolds number
ranges from 400 up to 10000. In stationary case pulsating nature of blood flow
is not considered. Elasticity of vessel tubes is not considered in both cases. The
flow could be then described as viscous, incompressible, laminar and stationary
(unstationary) in 2D by the system of Navier-Stokes equations without influ-
ence of exterior forces and heat exchange. The system is written in conservative
non-dimensional vector form rewritten from (1), (2)
- - 1
RWt + Fx + G y = R Re (Wxx + W yy ), (5)
2 Numerical model
Solution of the system (5) is obtained using method of artificial compressibility,
then equation of continuity is completed with term ;&Pt, where a 2 E R+.
Rewritten in vector form the improved system (5) is following
- 1
Wt + Fx + G y = R Re (Wxx + W yy ) , (6)
where W = (:2' u, v)T. Finding steady solution one could solve unsteady sys-
tem (6) by finite volume method together with time dependent method. System
of equations (6) could be solved using three stage Runge-Kutta method using
given steady boundary conditions. At the inlet extrapolation of pressure is
used. At the outlet the value of pressure is set constant for stationary case and
for unstationary case the pressure is prescribed by sinus function in form:
w n . = W(O)
1",) 1,,) (8)
Wi~~) = Wi~~) - arL1tRWi~~-l), (r = 1, ... , m) (9)
n +!
W 2,J = W(m)
t,) ,
m = 3, (10)
where
RW(r-1) = RW(r-1) _ DW(r-1) (11)
1"J 1,) 1,,)
(18)
where J.1,ij = fJD .. dxdy. The convergence of iterative process is followed using
'J
the behavior of residual in space L2
(19)
Acknowledgement: This wok was partly supported by grant GACR No. 201/00/0684
and Research Plan MSM 210000010.
=---_=..-
Re=1500, 5=40
Re=2000, 5=30
Re=2000, 5=30
Re=2000, s=40
Re=500, 8=40,1=100.2
per=4
Re=500,s=40, 1=103.2
per=4
Fig. 15. Bypass unst. flows no.7, Re=500, vector field of velocity
References
1. R. J. LeVeque: Numerical Methods for Conservation Laws, Birkhauser Verlag,
1990
2. A. Quarteroni, M. Tuveri, A. Veneziani: Computational Vascular Fluid Dynam-
ics: Problems, Models and Methods
3. R. Dvorak, K. Kozel: Mathematical modeling in aerodynamics,CVUT, 1996 (in
Czech)
4. S. Canic, D. Mirkovic: A Hyperbolic System of Conservation Laws in Modeling
Endovascular Treatment of Abdominal Aortic Aneurysm, International Series of
Numerical Mathematics Vo1.140, 2001
5. K. Kozel, V. Prokop: Numerical solution of incompresible viscous flows through
a channel with cavitas and a channel with bypass, Proceeding of "The 2nd Inter-
national Conference Of Applied Mathematics And Informatics At Universities
2002", Trnava, September 2002, p. 59 - 63
A Posteriori Estimation of Dimension
Reduction Errors
1 V.A. Steklov Institute of Mathematics, Fontanka 27, 191 011 St. Petersburg,
Russia [email protected]
2 Institute of Mathematics, Zurich University, CH-8057, Zurich, Switzerland
stas, [email protected]
1 Introduction
on the right-hand side of the given equation and considering a general geome-
try of the given domain. We show that, for the zero-order dimension reduction
method considered here, the estimator of Babuska and Schwab (see [1], [2])
can be obtained as a particular case of our estimator when the right-hand side
of the equation is zero and the original domain is a plate with plane parallel
faces. We demonstrate the optimal convergence of the estimator as the plate
thickness tends to zero (although, it is worth noting that the proposed estima-
tor preserves its reliability for any positive thickness). Finally, we observe how
accurately the estimator indicates the local error distribution, thus, allowing
for a local improvement of the model.
2 Problem setting
where D c ]R2 is its projection on the (Xl, x2)-plane (D has the Lipschitz
boundary F) and de and dtB are Lipschitz continuous functions: D- t R The
lower and upper faces of D are denoted by
and
3
I (Xl, X2)
~
rEB := {X E]R ED, X3 = dtB(Xl, X2)},
the lateral boundary by
where d = dtB - de is the domain thickness, d (Xl, X2) 2: d* > 0 \I(Xl' X2) E D.
Although the assumption is of purely qualitative nature, it serves as a basis
for the derivation of the corresponding two-dimensional reduced model. We
also have to notice that Figure 1 depicts a simplified case; in the geometrical
definitions we do not assume the domain thickness d (Xl, X2) to be a constant.
718 S. Repin et al.
where J E L 2(0), Fe ,FfB E L2(D), Ve and VfB are outward normal vectors
at re and r fB respectively. The matrix A = (aij(x))i,j=1,3 with the compo-
nents from Loo(O) is symmetric and uniformly positive definite, i.e. there exist
constants 0 < c < C < CX) such that
From now on we will frequently use the notation x = (Xl, X2), X = (x, X3), and
all functions depending only on (X1,X2) will be marked by ~; in addition, we
will distinguish between 3- and 2-dimensional divergence operator:
1n
A Vu . V w dx = rf
k
w dx + r Fe
J~
w ds + rF
J~
$ w ds Vw EVa. (6)
J
dEll (x)
g(x) := d tx) g(x, X3) dX3 for a.e. xED,
de(x)
we can deduce from (6) the reduced problem (see [7]) that reads
u
Problem (P): Find E Vo such that
h
were f~ -- f~+ Feyl+lV'deI2 +FEIly1+IV'dEl
d
l I2
an d A~ p (~)
x -- (~
a,).. (~)) - l·S the
x i,j=1,2
averaged "plane" part (Ap(x) = (aij(x))i,j=l,2) of the matrix A.
It is clear that problem (8) is a two-dimensional elliptic problem with the
homogeneous Dirichlet boundary condition:
+ (1 + ~) (1 + ~ ) Cf (1 + CA) Ml ,
where 111·111 is the energy norm, Illvlll := UnA(x)Vv, Vv dx) 1/2 Vv E Vo,
n
Cn is the constant from Friedrichs' inequality (C 2 = inf 1IIIIIr I1l2 ), Cr
wEVo\{O} w L2(fl)
Mi := IIDivy* + fIIL(n) ,
* *
M3
2.
.=
~ 2
lIFe - y veIIL2(re) + IIFEB
~ 2
- y vEBIIL 2(rEj))'
We emphasize that the estimate is valid for any positive numbers "Y and 0 and
for any vector-function y* from the space H*(D, Div) defined as
H*(D, Div) :=
{y* E L 2 (D,lR 3 ) I Divy* E L 2 (D) , y •. ve E L 2 (Te ) , y*. vEB E L 2 (TEB )}·
While the best possible option would be to take as y* the exact flux A Vu
(then M2 and M3 would vanish and Ml would give us the energy norm of
the exact error e), we have to restrict ourselves to choosing some computable
quantity, i.e. not containing the unknown exact solution u. We approximate
the flux by
y* = Ap Vii: + r* , (12)
where r* = {O, 0, 1/J(x)}T, 1/J is the auxiliary function from L2(D) such that
1/J,3 E L 2 (D) , 1/J E L 2 (Te ) and 1/J E L 2 (TEB ). Using (9), it is easy to verify that
y* from (12) belongs to H*(D, Div). A discussion about other choices of y*
can be found in [7J.
Substituting (12) into the functionals M'f, M?, Ml, we obtain (see the
details in [7J)
A Posteriori Estimation of Dimension Reduction Errors 721
(13)
(14)
(15)
(16)
where M1 and M2 are defined by (13) and (14). The error majorant M has
been derived for quite general geometry of n and coefficient matrix A(x).
However, to make the estimate more transparent, we consider two particular
cases.
We assume that
dEB = de + do (do = canst> 0) (17)
and, in addition, that
With these assumptions the terms M1 and M2 in estimate (16) become simpler:
(20)
722 S. Repin et al.
One may notice that the integral in the first term M1 of the error majorant
M can be rewritten as
which means that the term M1 is of order O(d~/2) when the plate thickness do
tends to zero. If f E Loo(O), the second term M2 is obviously of the same order
O(d~/2), i.e. the whole estimator M converges to zero with the rate O(d~/2)
as do -+ o. This is the optimal convergence rate for the modelling error e in
the energy norm, as was shown in [9] for the simpler case of a plate with plane
parallel faces and f = o. It is worth noting that, if f E C1(0), the second
term in M is of higher order O(d~/2) as compared to the first term.
do do
dffJ = 2 ' de = - 2 (do = canst > 0) , (21)
the auxiliary function 'ljJ will take the simple form 'ljJ = F$;ioFe X3 + F$;Fe
and the error estimate (16) will read
(23)
which is exactly the estimator of Babuska and Schwab (see [1]) for the zero-
order reduced model. Thus, the latter estimator can be obtained as a particular
case of the error majorant (16) if one makes the assumptions (18), (19), (21)
and sets f = O. This is a particularly interesting fact, since we advocate the
estimation approach that is completely different from the one utilized in [1]
(see the details in [7] and [6]).
5 Numerical example
. do
dffi,e(x) = sm(k7rx) ± "2' k = 1,2, ... ,
-L\u = f in n,
u =0 at x = 0 and x = 1,
"ilu· vffi,e = Fffi,e at y = dffi,e,
and the right-hand sides of the equation and of the boundary condition are
computed using the exact solution
References
1. Babuska, I., Lee, I., Schwab, C. (1994): On the a posteriori estimation of the
modeling error for the heat conduction in a plate and its use for adaptive hier-
archical modeling. Appl. Numer. Math., 14, 5-21
2. Babuska, I., Schwab, C. (1996): A posteriori error estimation for hierarchic mod-
els of elliptic boundary value problems on thin domains. SIAM J. Numer. Anal.,
33, 221-246
3. Oden, J.T., Cho, J.R. (1996): Adaptive hpq-finite element methods of hierarchical
models for plate- and shell-like structures. Comput. Meth. Appl. Mech. Engrg.,
136, 317-345
724 S. Repin et al.
O.5~
Fig. 2. (left) The domain geometry, (right) Convergence rate of the exact error
and of the error majorant, k = 2, m = 2, the majorant is indicated by "0"
4. Repin, S.L (2000): A posteriori error estimation for variational problems with
uniformly convex functionals. Math. Comp., 69, 481~600
5. Repin, S.L, Sauter, S.A., Smolianski, A.A. (2003): A posteriori error estimation
for the Dirichlet problem with account of the error in the approximation of
boundary conditions. Computing, 70, 205~233
6. Repin, S.L, Sauter, S.A., Smolianski, A.A. (2003): A posteriori error es-
timation for the Poisson equation with mixed Dirichlet/Neumann bound-
ary conditions. Preprint 02~2003, Institut of Mathematics, Zurich University,
http://www.math.unizh.ch/index.php?preprint
7. Repin, S.L, Sauter, S.A., Smolianski, A.A. (2003): A posteriori estima-
tion of dimension reduction errors for elliptic problems on thin do-
mains. Preprint 18~2003, Institut of Mathematics, Zurich University,
http://www.math.unizh.ch/index.php?preprint
8. Stein, E., Ohnimus, S. (1997): Coupled model- and solution-adaptivity in the
finite-element method. Comput. Meth. Appl. Mech. Engrg., 150, 327~350
9. Vogelius, M., Babuska, 1. (1981): On a dimensional reduction method 1. The
optimal selection of basis functions. Math. Comp., 37, 31~46
A Posteriori Estimation of Dimension Reduction Errors 725
O.04,----~---,__--~--~---__,
0.035
0.03
0.025
0.03,----~--~---~--~--_____,
Fig. 3. Local error distribution provided by the exact error (solid line) and by the
M1-term of the majorant (dash-dot line), k = 4, m = 4: (left) do = 0.1, (right)
do = 0.05
Beatrice Riviere 1
Summary. The coupled Stokes and Darcy flows problem is solved by locally con-
servative numerical methods. Discontinuous Galerkin methods are used in the Stokes
region and discontinuous Galerkin methods coupled with mixed finite element meth-
ods are employed in the Darcy region. Optimal a priori error estimates are derived.
In this work, a numerical method for solving the coupled problem of Stokes
and Darcy equations is formulated and analyzed. The coupled system arises
from the study of the interaction between surface and subsurface flow. Discon-
tinuous finite elements and mixed finite elements (MFE) are used in subregions
of the subsurface domain while only discontinuous finite elements are used in
the surface domain. While mixed elements are popular and efficient on regular
grids, discontinuous Galerkin (DG) methods are accurate and easily imp le-
mentable on highly unstructured meshes. The proposed method enables to
take advantage of one of these locally conservative methods in a particular
subregion of the subsurface. This work is an extension of the coupling of DG
for Stokes and MFE for Darcy [8], and DG for Stokes and Darcy [7J. Similar
couplings are studied in the work of Layton, Schieweck and Yotov [5J and Dis-
cacciati and Quarteroni [2, 3J. Let n be a domain in IR 2 , subdivided into three
Q,
r13
"2 I n l2 j n13
~,~
Q 2 Q3
2 Numerical Method
The discontinuous Galerkin method is used in the region Dl U D2 , while the
mixed finite element method is used in the region D3 . For i = 1,2,3, let £~ be
a non-degenerate quasi-uniform subdivision of Di , let r~ be the set of interior
edges and let h denote the maximum diameter of elements. We assume that
the meshes match at the interface n3
match, but they may not match at the
interfaces r 12 and r 23 . Given a fixed normal vector n e , on each interior edge,
pointing from E; E;,
to the average {w} and jump [w] of function w is defined:
728 B. Riviere
On a boundary edge, the jump and average coincide with the trace of the
function. For any integer k ::::: 0, the Sobolev space on a domain 0 is denoted
by Hk(O) = {v E L2(O) : Dmv E L2(O), \llml::::; k}, with norm 11·lko. We
denote by L6(O) the space of square-integrable functions with zero average,
with L2 inner-product (', ')0.
Let kl' k2 and k3 be positive integers. The DG discrete spaces are
The MFE discrete spaces are the classical ones, such as the Raviart-Thomas
spaces [6]. We assume that the mixed velocity spaces X~ C H(div; D 3 ) con-
tains polynomials of degree k3 and the pressure spaces M~ C L2 (D3) polyno-
mials of degree k3 - 1. We associate to these spaces the following norms:
(9)
(10)
We recall a result proved in [4] that generalize a Sobolev imbedding. There
exists a constant C independent of h such that
-2p, L
eEr~uTr
1 e
{VUln e } . [VI] + 2WI L
eEr~uTr
1 e
{VVln e } . lUll
1 1
eE r 12 e eEr13 e
bl(VI,PI) =- L PI V . VI + L {PI}[VI]' n e ,
EEEh E eEr~url e
a2(P2, q2) = L
EEE~
1E
KVP2 . Vq2 + L
eEr~
(J1:'le 1 e
[P2][q2]
By introducing the parameters EI, E2 that take the value 1 or -1, we allow
for non-symmetric or symmetric bilinear forms al and a2. We assume that
in the non-symmetric case (EI = 1), the parameter (JI,e is equal to 1 and
in the symmetric case (EI = -1), the parameter (JI,e is bounded below by
a sufficiently large positive value. The same assumptions hold true for E2 and
(J2,e' Combining the subdomains bilinear forms, we define
With these forms, the numerical method is: find (U, P) E X h X Mh such that
= (f1,VI)S?, + (h,q2)S?2 - r
ir'3
P3(VI - V3)' n13,
L
E iE
r(2J1,D(UI) : D(VI) - PI \1 . VI)
-L
eEr~
1
e
{-PIJ + 2fLD(UI)}n e . [VI] + ELl eEr~ e
{2fLD(vr)}n e . lUll
- L 1(-PIJ + 2J1D(ur))n .
eEr'2 ur'3 e
a2(P2, q2) - L
eEr'2
1 e
UI . nl2q2 + L
eEr23
1 e
U3 . n23q2 = (12, q2)S?2'
and consider PE L2(SlI U Sl3) such that P = P + P. Then the equation (15)
becomes
732 B. Riviere
(P2 - P)(
Jr'
r 2
VI' n12 - r
Jr
23
V3' n23) = 0, Vv E Xh.
3 Error estimates
Theorem 1. Let (u,p) be the solution of the coupled problem (1)-(8) such
that uln i E (Hki+l(0;))2 for i = 1,3, plni E Hki (0;) for i = 1,3, and
pln2 E Hk2+l(02)' Then the discrete solution (U, P) of (15), (16) satisfies the
following estimate
+b3(U3-U3,6) +
Jrn2(P2-P2)XI' n12 - Jrn26(Ul-Ul)' n12
+
JrG36(U3- U3)' n23 - (Prp2)X3 . n23 +
Jrn3P3(XI-X3)' n13.
Analysis of a Multi-Numerics/Multi-Physics Problem 733
The pressure term b3(X3,P3 - P3) vanishes because of (24). The terms bl (UI -
uI,6) and b3(U3 - U3, 6) vanish because of property (19) of the approxima-
tion u. We now bound the remaining terms. By using Cauchy-Schwarz, trace
inequalities, and the approximation results (23) and (25), we can bound the
first three terms
1
al (UI - UI, Xl) + a2(P2 - P2, 6) + a3(u3 - U3, X3) :S 81lxllli~
1
+Ch2kllult+I,.rtl + 811611~~ + Ch2k2Ipl~2+1,.rt2
+~llx31Ii~ + Ch2k3+2Iul~dl,.rt3·
Because of the L2 projection, the first pressure term is reduced and bounded
as follows:
bl(XI,PI - PI) = L
eEr~ur,
1e {PI - pI}[xIl· ne
:S ~ L ~11I[XllI16,e+Ch2kllplt,.rt,.
eEr~ur,
The remaining terms are the interface terms. Using the approximation result
(25), the trace inequality (27) and Cauchy-Schwarz's inequality, we obtain
Similarly,
With the bound (11) and the approximation result (23), we have
Assume that each edge e of Fr3 is shared by the elements E; E [~ and E~ E [~.
Define the constant Ce = lel- 1 X3 . n13· Ie
L
eE['13
1 e
(P3 - P~)X1 . n13 = L
eE['13
1e
(P3 - P~)(X1 . n13 - ce )
Define ~ = (15711+ 15731)-1 IJ.l UJ.l3~ 1 and the function € E L6(57 1 U 573) by
€ = ~ -~.
B(v,O=B(v,€)+B(v,~)=B(v,€)-~(r V1·n12- r V3· n 23).
1 ['12 1['23
Let v E(HJ(571 U 573))2 such that -\7 . v = €
and Ilvllx~ + Ilvllx~ :::;
CII€llo,J.lI UJ.l3. Choose in (29) v = v the approximation of v, defined by (19)-
h
(23):
-- -1
e - IDII + ID31
1( _-)
D2 P2 P2 -
< ID21 I - - I
IDII + ID31 P 2 P2 O,D2
::; ClIP2- p211x~ + Chk2Ip2Ik2+I,D2'
Combining the results of Theorem 1 with the bounds above, give the optimal
error bound.
As a concluding remark, one can introduce Lagrange multipliers on the inter-
faces, and obtain an equivalent method. This allows the decoupling of each
subdomain, which may be advantageous for a parallel implementation.
References
1. G.S. Beavers and D.D. Joseph. Boundary conditions at a naturally impermeable
wall. J. Fluid. Mech, 30:197-207, 1967.
2. M. Discacciati and A. Quarteroni. Analysis of a domain decomposition method
for the coupling of stokes and darcy equations. In Brezzi et al., editor, Numerical
Analysis and Advanced Applications - Enumath 2001, pages 3-20. Springer, Milan,
2003.
3. M. Discacciati and A. Quarteroni. Convergence analysis of a sub domain iterative
method for the finite element approximation of the coupling of stokes and darcy
equations. Computing and Visualization in Science, 2004. to appear.
4. V. Girault, B. Riviere, and M. Wheeler. A discontinuous Galerkin method with
non-overlapping domain decomposition for the Stokes and Navier-Stokes prob-
lems. Mathematics of Computation, 2003, to appear.
5. W.J. Layton, F. Schieweck, and 1. Yotov. Coupling fluid flow with porous media
flow. SIAM J. Numer. Anal., 40(6):2195-2218, 2003.
6. R. A. Raviart and J. M. Thomas. A mixed finite element method for 2nd order
elliptic problems. In Mathematical Aspects of the Finite Element Method, Lecture
Notes in Mathematics, volume 606, pages 292-315. Springer-Verlag, New York,
1977.
7. B. Riviere. Analysis of a discontinuous finite element method for the coupled
Stokes and Darcy problems. Technical Report TR-MATH 03-11, University of
Pittsburgh, 2003.
8. B. Riviere and 1. Yotov. Locally conservative coupling of Stokes and Darcy flows.
SIAM J. Numer. Anal. revised.
9. P. Saffman. On the boundary condition at the surface of a porous media. Stud.
Appl. Math., 50:292-315, 1971.
The Discontinuous Galerkin Method
for Singularly Perturbed Problems
1 Introduction
-c:6u + b1 U x + cu = f
° on
{ (1)
u = f=on.
Here °
< c: « 1 represents a perturbation parameter and b1 , c and fare
real-valued functions defined on IT. Assuming
1 ObI
br(x) 2: (JI > 0, c(x) 2: 0, c(x) - "2ax (x) 2: I > 0, x E IT, (2)
8 jS 8 i+jE <
8xi+
_
'" . '" . x, y )
__ 1(
_ C E - i e -(I-x)/" ,
I
i 8yj (x, y) S; C,
I
1 1
ux'uyJ
from [10] for the Galerkin FEM with linear or bilinear elements on a Shishkin
mesh (see also the survey paper [6]). Surprisingly, up to now there is no result
in the literature for the streamline-diffusion finite element method (SDFEM).
Furthermore, for the SDFEM the optimal choice of the stabilization parameter
near parabolic layers is an open task, see discussion in [4] and [5]. This was
also a reason for us to investigate some alternative discretization technique.
Here, for numerical solving of (1)-(2) we use the h-version of a nonsymmet-
ric discontinuous Galerkin finite element method with interior penalties (the
NIPG method), [2], [7], [8]' [9]. The technique from [2] is applied, but with
a bilinear interpolant in error splitting instead of an L 2 -projection onto a fi-
nite element space. This allows us to use the well-known interpolation error
estimates for the problem (1)-(2). We finally show that on a specially chosen
shape-irregular mesh (Shishkin mesh), this method yields error bound that
is uniform in the perturbation parameter. Since our discretization involves
the layer-adapted mesh whose construction directly uses information from the
solution decomposition, the technique for proving E-uniform error estimates
from this paper cannot be applied on more general (nonrectangular) domains
n, when such a decomposition is not available.
Following [2] and the notation therein, let T be a general partitioning of the
domain n = (0,1)2 consisting of disjoint open axiparallel rectangles K, such
738 H.-G. Roos, H. Zarin
+l avw ds + lin~[v][w] ds
+L
I<ET
(- } rB_l<nr (b· f.1.)v+w+ ds - } r I<\r (b. f.1.K)lVJW+ dS)
fL
+s 1r
(v(Vw· f.1.) - (Vv· f.1.)w) ds + s 1~~
([v](Vw· v) - (Vv· v)[w]) ds,
The Discontinuous Galerkin Method for Singularly Perturbed Problems 739
(4)
Discretization mesh. For the discretization of the boundary value problem (1),
here we use an anisotropic tensor-product Shishkin with (N + 1) x (N + 1)
mesh nodes, that is adapted to the layers at x = 1, y = a and y = 1. Let N
be an integer divisible by 4 and let Ax and Ay be mesh transition parameters
defined by
where (31 is the lower bound for the function b1 and ;Y is a constant from
the solution decomposition (Theorem 1). The domain D is split into D =
D11 U D12 U D21 U D 22 , with
D11 = [1 - Ax, 1] x ([0, Ay] U [1 - Ay, 1]) , D12 = [1 - Ax, 1] x [Ay, 1 - Ay] ,
D21 = [0,1 - Ax] x ([0, Ay] U [1 - Ay, 1]) , D22 = [0,1 - Ax] x [Ay, 1 - Ay].
Then the intervals [0,1- Ax] and [1- Ax, 1] are uniformly dissected into Nj2
subintervals to give the mesh D;: in x-direction, while for D;: we dissect [0, Ay]
and [1 - Ay, 1] into Nj4 subintervals and [Ay, 1 - Ay] into Nj2 subintervals.
740 H.-G. Roos, H. Zarin
Taking the tensor of 0,r: and 0,lj, we obtain our final rectangular Shishkin
mesh.
On such a constructed partitioning we need to introduce several types of
edges (depending on the type, we later determine the discontinuity-penalization
parameter a e for each edge e E £). The edges of type I belong to the set
(1 - Ax, 1] x ([0, Ay) U (1 - Ay, 1D. They are part of the layer region (more pre-
cisely, they lie in the corner layers) and their length is either hx = 2A x N-l or
hy = 4AyN- 1. Assuming Ax = 2E/f31lnN, Ay = 2vfifYlnN and E::::: CN- 1,
we have hx « Hx := 2(1 - Ax )N- 1 and hy « Hy := 2(1 - 2Ay)N- 1.
Therefore we describe these edges as short; other edges are called long.
Edges of type I I are also short and belong to the rest of the layer region
0,12 U 0,21. Since this region contains also long edges, we refer to them as
being of the third type. Type I I I also contains long edges near the layers
from the set 0,~2 = 0,22 \ 0,:h. Finally, edges of type IV are long and lie in
0,22 = [0,1 - Ax - Hx] x [Ay + Hy, 1 - Ay - Hy].
while [14] and [15] contain results for \lTJ. All these results hold under the
assumption y'cln 2 N ::::: C and they are proved using the decomposition from
Theorem 1 and the technique from [1].
In the following section we present some of the key points of the error analy-
sis for the NIPG method on the Shishkin mesh when applied to the problem (1).
More details can be found in [14] and in the forthcoming paper [15].
4 Error analysis
1/2 1/2
h == ( lO'T)2 ds ) ( / 0'[T)]2 ds )
Jrmt
Is == ILl
I<ET I<
(b· \l~)T) dxl '
hn, hm and hlv denote the contributions of edges of the type I, II, I II
and IV to the terms I j , j = 3,4, ... ,7, respectively. Assuming that the values
of the discontinuity-penalization parameters are the same for all edges of the
same type, then from LOO-interpolation error estimates for rJ and "'\lrJ we have
Thus,
The Discontinuous Galerkin Method for Singularly Perturbed Problems 743
Is :::; CN-111~IIDG.
At the end, it can be proved, [14],
5 Numerical experiments
Table 2. The NIPG method on the Shishkin mesh for the test problem
it can be expected that this method, or a more general DGFEM, will exceed
the streamline-diffusion FEM for the problems with more complicated layer
structures where the flexibility of the DGFEM with respect to the mesh is
useful.
References
1. Dobrowolski, M., Roos, H.-G. (1997): A priori estimates for the solution of
convection-diffusion problems and interpolation on Shishkin meshes. J. Anal.
Appl., 16, 1001-1012
2. Houston, P., Schwab, C., Siili, E. (2002): Discontinuous hp-Finite Element
Methods for Advection-Diffusion-Reaction Problems. SIAM J. Numer. Anal.,
39, 2133-2163
3. Houston, P., Schwab, C., Siili, E. (2000): Discontinuous hp-finite element meth-
ods for advection-diffusion problems. Technical Report NA-00/15, Oxford Uni-
versity Computing Laboratory, Oxford, UK
4. Kopteva, N.V. (2001): How accurate is the streamline-diffusion FEM inside
parabolic layers? Lecture presented at the 19th Biennial Dundee Conference on
Numerical Analysis
5. LinE, T. (2002): Anisotropic meshes and streamline-diffusion stabilization for
convection-diffusion problem. Preprint MATH-NM-11-2002, Institut fUr Nu-
merische Mathematik, Technische Universitiit Dresden, Germany
6. LinE, T. (2003): Layer-adapted meshes for convection-diffusion problems.
Compo Meth. Appl. Mech. Eng., 192, 1061-1105
7. Oden, J.T., Babuska, 1., Baumann, C.E. (1998): A discontinuous hp finite ele-
ment method for diffusion problems. J. Compo Phys., 146, 491-519
8. Prudhomme, S., Pascal, F., Oden, J.T., Romkes, A. (2000): Review of a priori
error estimation for discontinuous Galerkin methods. TICAM Report 00-27,
Texas Institute for Computational and Applied Mathematics, Austin, USA
9. Riviere, B., Wheeler, M.F., Girault, V. (2001): A priori error estimates for fi-
nite element methods based on discontinuous approximation spaces for elliptic
problems. SIAM J. Numer. Anal., 39, 902-931
10. Roos, H.-G. (2002): Optimal convergence of basic schemes for elliptic boundary
value problems with strong parabolic layers. J. Math. Anal. Appl., 267, 194-208
11. Roos, H.-G., Skalicky, T. (1997): A comparison of the finite element method
on Shishkin and Gartland-type meshes for convection-diffusion problems. CWI
Quarterly, 10, 277-300
The Discontinuous Galerkin Method for Singularly Perturbed Problems 745
12. Roos, H.-G., Stynes, M., Tobiska, L. (1996): Numerical Methods for Singularly
Perturbed Differential Equations. Springer-Verlag, Berlin
13. Shishkin, G.l. (1992): Discrete Approximation of Singularly Perturbed elliptic
and Parabolic Equations. Russian Academy of Sciences, Ural Section, Ekater-
inburg, (in Russian)
14. Zarin, H. (2003): Finite element methods for singularly perturbed problems with
special emphasis on discontinuities. PhD Thesis, University of Novi Sad, Serbia
and Montenegro
15. Zarin, H., Roos, H.~G. (2003): Interior penalty discontinuous approximations
of convection~diffusion problems with parabolic layers. submitted to Numer.
Math.
A Finite-Volume Mass- and
Vorticity-Conserving Shallow-Water Model
using Penta- /Hexagonal Grids
1 Introduction
We present a mass- and vorticity-conserving finite-volume algorithm to solve
the shallow-water equations on the sphere (SWES), where the sphere is
spanned by a grid of pentagonal and hexagonal cells (hereafter referred to
as a PH grid).
The vector-invariant formulation of the SWES is
oh
-at + \7 . (vh) = 0 (1)
ov
at + ( + f)k x v + \7(1'i: + <1» = 0, (2)
The use of penta-/hexagonal grids - which cover the sphere with twelve spheri-
cal pentagons and an arbitrary number of hexagons - for atmospheric modeling
is not new. A summary of finite-difference methods on such grids was given by
Williamson in [13]. A milestone in the use of PH grids was achieved by Heikes
and Randall [4] who built much of the foundation for a genuine General Cir-
culation Model (GCM). Baumgartner, Majewski, et al. [6] built a production
model based on these grids which now provides numerical weather forecasts
at the German Weather Service (DWD).
Fig. 1. The standard lat-lon grid (left) illustrates the Mercator projection of surface
coordinates to the rectangular domain [-1f/2,pi/2] x [-1f,1f]. The PH grid (right)·
instead decomposes the sphere into 12 spherical pentagonal cells and the rest into
hexagonal cells.
(3)
(4)
The source terms X and Y in equations (6) and (7) are the time-averaged
fluxes for the finite-volume discretization of equation (2). Ignoring higher order
terms, these can be approximated for some scalar quantity qn at time step n
as:
1
X(u*, Ll;qn) = Llt I
t
t
+Llt u* q dt 1
Y(v*,Ll;qn) = Llt I
t
t
+Llt v*qdt. (8)
(9)
Crucial to the algorithm is the use of staggered grids (C- and D-grids,
described in [7]), on which the velocities are defined in the same point as the
flux. Thus there is a one-to-one dependency between fluxes and velocities. If the
fluxes depended on more than one velocity value, it would be necessary to solve
for velocity from a system of equations. But in this case it is straightforward
to calculate predictor values, u* and v* (on a C-grid) which are then inserted
into the corrector step, on a D-grid, in (5), (6) and (7). As this technique is
discussed at length in [10]' we will pass over this topic.
There is some additional complexity at the pole, where a pole cap cell (a reg-
ular polygon with m = 27r / Ll.:\ vertices) needs to be considered separately. The
poles are also treated in a way which conserves mass and vorticity.
After some analysis of the algorithm in [10]' several key features present
themselves. The global conservation of discrete mass follows directly from equa-
tion (1). With some calculation (see [11]), it can be shown that the global dis-
crete vorticity is conserved as well. Furthermore, orthogonality is inherently as-
sumed in the algorithm, particularly in its use of a flux-form Semi-Lagrangian
(FFSL) advection scheme presented in [9]. FFSL is a finite-volume scheme
which splits the time-averaged fluxes of the generic scalar quantity q into an
east-west X and north-south y. This splitting can only realistically be applied
on an orthogonal grid.
Our goal is to obtain a similar algorithm to solve the SWES which is not
limited to orthogonal grids, nor even to grids which are orthogonal to their dual
grid (i.e. possess the Voronoi-Delaunay property). To this end it is necessary
to introduce nomenclature which is sufficiently general for PH grids.
A Finite-Volume Mass- and Vorticity-Conserving Shallow-Water Model 751
(10)
w lli: k1 = WOi,k - Llt {:Fi';k('rJi , w~+1/2lli,k' Llt) + Qlli,k (K: + <I»} (12)
w1-:~ = w'li,k - Llt {:Fi';k('rJi , -w~+1/2lli,k' Llt) + QJ..i,k (K: + <I»} . (13)
Inserting the above updates into the definitions for discrete vorticity, we find:
752 W. Sawyer, R. Jeltsch
'T7~+1 = I~'I L (W~i.k - Llt {F~k('T7i' w1. i.kll i,k, Llt) + 911i.k (l-£+dJ)}) li,k + fi
t kENi
Vi Vi
Vi kENi
v
o
Similarly, we arrive at the anticipated form for the updated divergence:
5~+1 = 5f - I~tl L :F[';k ('T7i , -WlIlli,k, Llt)li,k - I~tl L 9J.. i,k (1-£ + dJ)li,k.
t kENi t kENi
A clear hurdle in the above PH grid updates are the calculations of the
gradients 9. Not only should these be as accurate as possible, they should also
satisfy the constraint that the discrete curl of G i , namely CG i , is identically
O. Given a generic scalar cell mean quantities qi, it is possible to find a higher
order expression for the edge gradients [11] at the flux point. For example, one
can define a spherical radial basis function (SRBF, see [2]) defined by a local
set of qi clustered around the flux point:
N
q(x) = LAkY(rj) +AN+1 + (AN+2, ... ,AN+d+1fx with rj = Ilx-xjll,
j=l
(14)
where Y(r) is a radial basis function, e.g. Y(r) = r 2 logr/161T for thin-plate
splines, and the A are found such that q(Xj) = qj. If one considers the subset
qj from cells centered on (and including) cell Di and uses the resulting SRBF
to approximate all the edge-parallel gradients of the central cell, the discrete
curl-gradient CG i disappears:
But the use of this gradient approximation will not produce consistent gra-
dients between neighboring cells, which is a requirement. On the other hand,
it is possible to use different SRBFs q(k) (x) - each built from a subset q?)
clustered around a flux point {i, k} - to approximate the gradients. In this
case there is only one consistent gradient value per flux point. However, CG i
does not necessarily disappear.
-=I- 0 in general.
9 Vi,k,s(i,k) - Vi,k,p(i,k)
Ikdq) = li,k
(16)
where p( i, k) and s( i, k) are the indices of the preceding and successive neighbor
cells of i and k around i in counter-clockwise fashion. The final formulation of
the predictor is
(18)
n+1/2
W 1. i ,k = W 1.n i ,k -
Llt
2 {'L'n ( n
.ri,k r]i, -WII ni,k,2 + 91.'i,k (K, + /F.)}
Llt) '¥ , (20)
(21)
The corrector step is
754 W. Sawyer, R. Jeltsch
w~i~kl = wrri,k - L1t {:F~k(TJi' w~+1/2ni,k' L1t) + Qlli,k (/'£* + iP*)} (23)
w~"t.! = w1 i,k - L1t {:F~k(TJi' -wt 1/ 2ni,k, L1t) + Q~i,k(/'£* + iP*)}. (24)
Thus for PH grids, the SWES problem, as in [10], can be completely de-
termined from gradient approximations and advective fluxes Fi,k across cell
boundaries. Fluxes between vertex neighbors are avoided since these do not
exist on PH grids. In this paper the flux determination is considered a "black
box": the advection problem on PH grids is far from simple, and finding a sec-
ond order monotone method for solving it is a matter of our current research.
References
1. L. Bonaventura. Development of the ICON dynamical core modelling strategies
and preliminary results. Unpublished manuscript, 2003.
2. G. Fasshauer and L. Schumaker. Scattered Data Fitting on the Sphere, pages
117-166. Vanderbilt Univ. Press, 1998.
3. R. H. Hardin, N. J. A. Sloane, and W. D. Smith. Spherical Codes. Book in
preparation; see http://www . research. att. comrnj as/ electrons/, 2003.
4. R. Heikes and D. A. Randall. Numerical Integration of the Shallow-water Equa-
tions of a Twisted Icosahedral Grid. Part I: Basic Design and Results of Tests.
Monthly Weather Review, 123:1862-1880, 1995.
5. R. Heikes and D. A. Randall. Numerical Integration of the Shallow-water Equa-
tions of a Twisted Icosahedral Grid. Part I: Basic Design and Results of Tests.
Monthly Weather Review, 123:1881-1887, 1995.
6. D, Majewski, D. Liermann, P. Prahl, B. Ritter, M. Buchhold, T. Hanisch,
G. Paul, and W. Wergen. The Operational Global Icosahedral-Hexagonal Grid-
point Model GME: Description and High-Resolution Tests. Mon. Wea. Rev.,
130:319-338, Feb. 2002.
A Finite-Volume Mass- and Vorticity-Conserving Shallow-Water Model 755
X 104
1.65
Hours Ratio Ratio
of peak of min
1.6
0 1.00 1.00
0.' 1.55 60 0.75 0.96
0.6
0.4 1.5
120 0.63 0.90
0.2 180 0.55 0.88
1.45
240 0.51 0.85
-0.2
-0.4
1.4 300 0.47 0.92
-0.6
1.35 360 0.45 0.90
-0.8
420 0.44 0.89
1.3
480 0.41 0.95
0.5
1.25
540 0.39 0.96
-0.5
1.2 600 0.38 0.99
-0.5
1 Mean-curvature flow
We study the following motion law for closed planar curves denoted as r:
VI' = -g(8)Kr + F, (1)
\1P
nr = -1\1PI' Kr = div(nr).
°
Here, ~ > is a parameter related to the thickness of the interface layer (it is
usually set to a value « 1). The polynomial fo(p) = ap(l - p)(p - ~) with
°
a > is derived from the double-well potential Wo as wb = - fo. The function
F = F(x, y) is bounded. The function Pini is an initial condition. We refer the
reader to [1], for details concerning the equation and physical background of
it.
Numerical schemes. We treat the PDE problems (2) and (3), both closely
related to (1), by several numerical schemes implemented by means of par-
allelization tools for the systems with distributed memory. The problems are
solved in a spatial domain D = (0, L 1 ) x (0, L 2 ), which is discretized by a rect-
angular uniform grid with mesh sizes hI, h2 in directions x and y.
We introduce the following notations for a given function u:
X,'J 2hl
Uij - Ui,j-l Ui,j+l - Uij Ui,j+l - Ui,j-l
Uy,ij = h2 Uy,ij = h2 UO = ---'''--'------,---'''--
y,ij 2h2
1 1
Uxx,ij = h 2 (UHl,j - 2Uij + Ui-l,j) , Uyy,ij = h 2 (Ui,j+l - 2Uij + Ui,j-d,
1 2
dph
e dt = eg(B) (L1 hph + fo(ph)) + elVhphW on Wh,
ph l'Yh= 0, ph(O) = Pini.
2 Parallelization techniques
The above described algorithms are parallelized by means of the Message Pass-
ing Library MPI both using Fortran 77/90 and C programming languages.
Computations using MPI, version 1.1 were performed on the supercomput-
ing systems IBM SP3 and Cray T3E at CINECA 1 , IBM SP, IBM SP2 at the
Czech Technical University in Prague, and computations using LAM MPI li-
brary 2 were performed on a local network of Linux PC workstations at the
Czech Technical University in Prague. In both approaches described below,
the computational task is performed by one or more processes, each of them
running either on a separate processor (a hardware unit, or virtual unit in the
emulated mode).
Cartesian domain splitting is an approach where a rectangular domain
fl is decomposed into rectangular subdomains, each of them treated by one
process. Boundaries of sub domains overlap by one grid line, on which they
exchange data. The amount of communication between processes depends on
the blocking strategy. We tested the row-wise blocking strategy, where the
domain is decomposed row-wise. Each block interacts with neighbouring blocks
during a timestep. The other tested strategy was the chequerboard blocking.
In this case, each block communicates with maximum eight neighbours during
a timestep.
Narrow-band technique introduced in [6] explores the fact that we are in-
terested only in the evolution of the curve r(t). It is therefore enough to follow
the evolution of P = P(t, x) in the vicinity of the levelset r(t). The presented
approach provides a significant speedup. On the other hand, it is less accurate
and more difficult to implement, because it requires a reconstruction of the
narrow band when r approaches its edge (the operation is called reinitializa-
tion). In our implementation, we cover the curve by overlapping squares of
a constant width which are assigned to processes in an intuitive way (Fig. 1).
For example, in case of 64 covering squares and 16 processes, the first four
squares are computed by the first process, the second four squares by the sec-
ond process etc. Consequently, the narrow band created by such squares is
not of constant width. The processes exchange data for all nodes, where the
squares overlap. The approach is easy to implement including processing of the
grid by parts small enough to fit them in the fast cache memory of processors.
For the purpose of algorithm evaluation, we define the following quantities:
Speed up =
run time in a single process
.. , Eff. = Speedup
run tIme m n processes number of processes
3.5'---~-~--~--"'"
2.5
1.5
Fig. 2. (a) A circle in (0,4) x (0,4) shrinking from the initial radius Ro = 1.35 to the
radius RT = 0.15 according to the isotropy law Vr = -Kr. (b) An initial circle of the
radius Ro = 3.0 deforming itself according to the 5-folded anisotropy law described
by Eq.(3), where N jo1d = 5 and ( = 0.025.
Study 2 (IBM SP). The above given problem (see Figure 2a) was recom-
puted using several choices of the mesh size and the time step. As it can be
seen from Table 2, efficiency of parallelization depends on the size of data ex-
changed between processes (e.g., it is faster to send 200kB of data than twice
100kB, due to an initiation).
Study 3 (IBM SP and Linux network). In this case, the initial condition
(a circle with the initial radius Ro = 1.35) evolves according to (1) with F = 0,
N fo1d = 5 and ( = 0.025 as indicated in Figure 2b). With numerical parameters
Parallel Computing Techniques 761
Table 2. Efficiency depends on the mesh size. Computation performed on the IBM
SP system (CPU time per process and efficiency).
Mesh size 200 x 200 267 x 267 400 x 400 667 x 667
Time step 4.0 . 10 -b 2.3.10 -b 1.0 . 10 -b 3.6.10- 0
Iterations 22500 40000 90000 250000
Speedup and efficiency of paralIe liz at ion for the Allen-Cahn equation
- Study 5 (IBM SP3). In this computation, we studied the isotropic curve
evolution starting at a four-folded pattern in a spatial domain (0,2) x (0, 2). The
curve approaches the circle ofradius R = 0.6 according to the law Vr = -I\;r+
F(x) where the forcing F is a suitable radially symmetric and linear function.
Other parameters are ~ = 0.01, hi = h2 = 0.00995. The curve evolution is in
Figure 4(a). The domain was divided into 1, 4, and 16 rectangular subdomains,
Parallel Computing Techniques 763
Q
0.75
0.5
0.25
0
0 0.25 0.5 0.75
Fig. 3. Evolution of the cardioida curve is driven by the regularized levelset equation,
solved in the unit square with € = 10- 8 , grid 600 x 600, T = 10- 6 and 20000 time
levels
1.6
1.4
1.2
0.8
0.6
0.4
(a) 0·~.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8
Fig. 4. (a) 4-folded initial curves in (0,2) x (0,2) approaches circle of radius R = 0.6
according to Vr = -Kr + F(x) for radially symmetric linear F; = 0.01, hi = e
h2 = 0.00995. (b) 4-folded initial curve in (0,2) x (0,2) shrinks inside and expands
outside of the circle ofradius R = 0.6 according to Vr = -g( O)Kr + F(x) for radially
e
symmetric linear F, g(O) = 1.0 - 0.8cos(40 - 7r/4); = 0.02, hi = h2 = 0.00995.
Acknowledgment. The first author was partly supported by the project No.
159 of the Czech-Slovak Science Programme, the second author was partly
supported by the project CTU 0309914 of the Czech Technical University
in Prague, and the third author was partly supported by the project No.
201/01/0676 of the Grant Agency of Czech Republic. The authors gratefully
acknowledge technical support of the Center for High Performance Computing
at the Czech Technical University in Prague (Project No. 145), and technical
support of the CINECA - High Performance Computing Centre, Bologna (Pro-
gramme Minos).
References
1. M. Benes, Mathematical and computational aspects of solidification of pure sub-
stances, Acta Mathematica Universitatis Comenianae 70, No.1 (2001), 123-152.
2. C.M. Elliott, M. Paolini, and R. Schatzle, Interface estimates for the
fully anisotropic Allen-Cahn equation and anisotropic mean curvature flow,
Math. Models Methods App!. Sci. 6 (1996), 1103-1118.
3. L.C. Evans and J. Spruck, Motion of level sets by mean curvature I, J. Diff. Geom.
33 (1991), 635-681.
Parallel Computing Techniques 765
Veronika Sobotikova
1 Introduction
2 Continuous Problem
a) u E Hl(fl), (3)
b) a(u,v) = L(v) \:Iv E H 1 (fl).
The forms a and L from (3,b) are defined for u,v E Hl(fl) by
LF(v) =
Ja.D
r cpvdS.
It was shown in [2] that the operator A: Hl(fl) ----* (Hl(fl))* corresponding
to the nonlinear form a by
h= max h T .
TETh
lSI> l/h.
Due to these two assumptions there exists a constant oX > 0 such that
The forms a and L defining the weak solution are discretized in two steps.
First we integrate in all integrals over th and ath instead of over D and
aD, respectively. In the second step we apply quadrature formulae to evaluate
these integrals. We suppose the formula used for the integration over triangles
to be exact for all constant functions and the formula used for the integration
over sides to be exact for all linear functions and to be monotone, i.e., all its
coefficients are positive. In such a way we come to the following approximations
of the forms defining the weak solution (Vh' Wh E H h ):
m
dh(Vh, Wh) = K 2.: 18 12.: fI/-'(lvh I"'vhwh) (XS,/-,) ,
SESh /-,=1
m
Here 'Ph is an approximation of the function 'P from the boundary condition,
which will be introduced later.
Let us note that in the approximation of the form b we do not use numerical
integration, as we integrate constant functions.
Now we can define an approximate solution of problem (3) as a function
Uh : Dh ----+ R such that
a) Uh E H h , (4)
b) ah(uh, Vh) = Lh(Vh)
4 Ideal Thiangulation
Since the problem is nonlinear in the boundary condition, we meet some diffi-
culties in the analysis of the FEM. These are caused especially by the fact that
in general the boundaries of Dh and D may not be identical, the union of all
triangles of T,. may not form D, and that we seek the approximate solution in
a space which may be different from the space in which we look for the weak
solution. We handle these difficulties with the aid of Zlamal's ideal elements
(see [12]).
770 V. Sobotikova.
First we introduce the concept of the ideal triangle. Let us have a triangu-
lation Th of the domain [h and let T E Th. If T is not a boundary triangle,
we set Tid = T and we denote its vertices by Pi, P2 , P3 in any way. If T
is a boundary triangle, then we numerate its vertices in such a way that the
vertices Pi and P3 lie on the boundary. Then we replace in the triangle T the
straight side S = P1 P3 c afh by the arc Es =P1 P3 C aD to get the curved
ideal triangle Tid. We denote the set of all ideal triangles by T~d and call it
the ideal triangulation associated with Th . It is obvious that the union of all
ideal triangles Tid from T~d forms D.
As the functions from the space Hh are not defined on the whole domain
D, we need to modify somehow these functions. We proceed in the following
way:
Let us consider the reference triangle K in the (6, 6) - plane with the
vertices Rl = (0,0), R2 = (1,0) and R3 = (0,1). We denote by XO the affine
mapping which maps the triangle K one-to-one on the triangle T in such away
that XO(R i ) = Pi for i = 1,2,3, and let 5° be its inverse.
By Zlamal [12], there exists such a mapping X, which maps one-to-one the
reference triangle K on the ideal triangle Tid, that it as well as its inverse 5 are
of the class C2. With the aid of these mappings we can now define a function
w associated with a function wE Hh by
w(x) = w(XO(5(x)))
It is obvious that we can construct this function w for any function w defined
on Dh and not only for w E Hh.
In a similar way we can introduce a function l' : aD --4 R associated with
a function 'Y defined on aDh . We put
Moreover, we can define for every function 'Y : aD --4 R its approximation
'Yh given on aDh by
In such a way we approximate the function 'P from the boundary condition.
Remark: It is obvious that WiT = WiT for every T E Th n T~d.
It was shown in [4] that
I r
Jan
fJ(x)dS - r
Janh
v(x)dSI S; c5h r
Jan h
Iv(x)ldS
for every v E Ll( aDh ). Moreover, the following relation between the seminorms
w
of wand in the Sobolev spaces Wl,p(S) and Wl,P(E) is valid:
ISI)l-P
( lET
Iwlf,p,s = Iwlf,p,E'
The Finite Element Analysis of an Elliptic Boundary Value Problem 771
where
1 :::; 1ST
1.01
< 1 + ch.
This estimates allows us to derive relations (5, a, b) between norms and
semi norms of wand w from the following Lemma 1. The proof of (5, d) can
be found in the same paper [4], for the proof of (5, c) see [10].
Lemma 1. Let p E [1,00). Then there exist positive constants C l = C l (p),
C2 = C 2(p), C3 = C 3 (p), C4 and hI E (0, ho] such that for every h E (0, hI)
Remark: In what follows, whenever we will refer to any result from [2] or
[3], where the domain D is polygonal, the polygonality of D was not used in
the associated proof.
°
Lemma 2. Let hI and C4 be as in Lemma 1. Then there exist h2 E (0, hI] and
C6 > such that
ah(Vh,Vh) 2: C 6 1I v hllI,2,nh
for every h E (0, h2) and Vh E Hh with Ilvhlll,2,nh 2: C 4 .
Remark: In what follows, we suppose ho to be so small that h2 = hI = ho
in Lemmas 1 and 2.
The coercivity of ah was obtained by the aid of the following estimate of
the error in the approximation of the form d :
Theorem 1. Discrete problem (4) has for every h E (0, h o ) exactly one solu-
tion Uh E Hh.
The monotonicity of the forms ah is sufficient in the proof of the existence and
uniqueness of the approximate solutions. However, for the derivation of the
error estimate of the method we need somewhat stronger result. By [11], the
forms ah are "almost uniformly monotone" in the following sense:
Id(vh,wh)-dh(vh,Wh)1 ::::;
::::; (C8hllvhllr,t,~h + C9h1-~ IVhI1,p,Sh Ilvhll~,oo,Dh) Il whI11,2,Dh
for all h E (0, h o) and Vh, Wh E H h . (We set ~ = 0.)
On the basis of the above inequalities an abstract error estimate was
established. In what follows, we denote by u the weak solution and by
Uh, h E (0, h o ), solutions of the discrete problem.
Lemma 3. Let Q(t) = "~t) for t > 0, Q(O) = 0, Q-1 be the inverse of
Q and let 1 < p ::::; 00. Then there exist positive constants C lO , C n , C 12 and
C 13 = C 13 (P) such that
Finally, we can formulate the main result, the error estimate for the solution
of the discrete problem (see [11], Theorem 4.1):
Theorem 2. Let the solution u of the continuous problem satisfy u E H2(rl)
and let U c E H2 (rl*) be an extension of the function u on R 2 . Then for every
p E (2,00) there exist h E (0, hal and a positive constant C = C(p, Ilu c I12,2,n*)
such that
Acknowledgment
References
1. Bialecki, R., Nowak, A.J. (1981): Boundary value problems in heat conduction
with nonlinear material and nonlinear boundary conditions. Appl. Math. Mod-
elling, 5, 417-421
2. Feistauer, M., Najzar, K. (1988): Finite element approximation of a problem with
a nonlinear Newton boundary condition. Numer. Math., 78,403-425
3. Feistauer, M., Najzar, K., Sobotikova, V. (1999): Error estimates for the finite
element solution of elliptic problems with nonlinear Newton boundary condition.
Numer. Funet. Anal. Optim., 20, 835-851
4. Feistauer, M., Najzar, K., Sobotikova, V. (2001): On the finite element analysis of
problems with nonlinear Newton boundary condition in nonpolygonal domains.
Appl. Math., 46, 353-382
5. Ganesh, M., Graham, I.G., Sivaloganathan, J. (1994): A pseudospectral three-
dimensional boundary integral method applied to a nonlinear model problem
from finite elasticity. SIAM J. Numer. Anal., 31, 1378-1414.
6. Kufner, A., John, 0., Fucik, S. (1977): Function spaces. Academia, Prague
7. Krizek, M., Liu, L., Neittaanmaki, P. (1999): Finite element analysis of a nonlin-
ear elliptic problem with a pure radiation condition. In: Sequeira, A., de Veiga,
H.B., Videman, J.H. (eds) Applied Nonlinear Analysis. Kluwer, Amsterdam, 271-
280
8. Moreau, R., Ewans, J.W. (1984): An analysis ofthe hydrodynamics of alluminium
reduction cells. J. Electrochem. Soc., 31, 2251-2259
774 V. Sobotikova
1 Introduction
Nowadays, the theory of the hp-version of the finite element method is well-
established and founded on solid results mostly due to the efforts of Babuska
and coworkers. However, the practical realization of fully automatic and robust
3D hp-adaptive algorithms still presents many serious difficulties mainly due
to excessive programming complexity.
We would like to introduce a novel fully automatic algorithmic approach to
goal-oriented hp-adaptivity for elliptic problems. The methodology does not
rely on estimates of error or its higher derivatives, and it is capable of achieving
exponential convergence not only in the asymptotic but also in preasymptotic
range of error level. Due to the limited length of this paper, only basic ideas of
the approach can be presented, but many details on both theory and computer
implementation can be found, e.g., in [3, 4, 5, 7, 8].
Error estimation forms an essential part of most h- and p-adaptive finite el-
ement algorithms. Recall that h-adaptivity is based on spatial refinement of
elements with largest contributions to the error and that p-adaptivity achieves
the reduction of the error by increasing the polynomial order in elements. In
both cases, thanks to the low number of options an element can be refined
(in most cases only one), the estimate of magnitude of error in elements is
sufficient to guide the adaptive process.
776 P. Solin, L. Demkowicz
The situation, however, changes with hp-adaptivity that allows both for
pure p-refinements and spatial refinements with suitable redistribution of the
polynomial order to element sons. Typically one has several options to choose
from, and the number of possibilities increases dramatically as the polynomial
order of elements in the mesh gets higher. The situation is illustrated in Fig.
1.
OR OR. ..
2
(3)
and the function Ure! is called reference solution.
There are various ways to calculate reference solutions. For example, highly
accurate approximations based on Babuska's extraction formulae (postprocess-
ing formulae applicable mainly to lower-order elements) were used to guide
hp-adaptivity in [5]. A robust way to obtain reference solutions for elliptic
problems without any limitation on the polynomial order was proposed by
Demkowicz [4]. The function ure! is defined as approximate solution on a uni-
formly hp-refined mesh, i.e. on a mesh where all elements are refined so that
h ~ hj2 and p ~ p + 1. Since Uh,p already contains useful information about
lower frequencies in the solution, the higher frequencies identifying Uh/2,p+l
can be obtained with a reasonable amount of work using a two-grid solver.
4 Projection-based interpolation
This elementwise local technique, that plays an essential role in the presented
automatic adaptive algorithm, generalizes the standard Lagrange (vertex) in-
terpolation to higher-order finite elements by combining it with projection on
spaces generated by hierarchic higher-order shape functions. We can confine
ourselves to a reference domain, since this is where almost all operations in an
hp-adaptive code are performed. Choose, for example, a reference triangle T.
Let pb denote the polynomial order in the interior of T and pl, p2 and p3 the
polynomial orders related to its edges el, e2 and e3, respectively. By Vl, V2, V3
denote the vertices of T. Consider a sufficiently regular function W defined in
T.
The projection-based interpolant Wh,p is constructed in three steps: First
one calculates the vertex interpolant Wh,p as a linear combination of vertex
shape functions ipVl, ipV2 and ipV3, such that
(4)
In the next step one subtracts the vertex interpolant from the original function
wand defines a new function
(6)
778 P. Solin, L. Demkowicz
Each function w~~p' k = 1, ... ,3, is a linear combination of edge shape func-
tions <p~k , ... , <p;~, such that
II W (1) ek I
- W h , P Hl/2( ) (7)
00 €k
is minimal. The minimum is achieved if and only if the trace of the difference
(w(l) - W~k,p ) lek is normal to the traces of all edge functions <p~k, ... , <pe~
p
on
the edge ek in the norm HU 2 (ek). Hence the discrete minimization problem
(7) translates for each edge ek, k = 1, ... ,3, into a system of pk - 1 linear
algebraic equations. The norm H~b2(ek) is defined using harmonic extensions
of functions defined on edges to the element interior, and therefore is difficult to
evaluate exactly. For practical computations One can replace it by a weighted
HJ-norm [4, 8].
In the last step one defines a new function
W (2) .=
.
w(1) _ we
h,p· (8)
Notice that this function generally does not vanish on edges. The bubble inter-
polant wk,p is obtained by projecting w(2) on a space generated by the bubble
functions of orders p :::; pb in the Hl-seminorm. Minimization of the difference
(9)
in the Hl-seminorm translates, analogously as in the previous case, into (pb -
1) (pb - 2) /2 linear algebraic equations.
Finally the projection-based interpolant Wh,p is defined as the sum of the
vertex, edge and bubble interpolants,
(10)
At the beginning of each mesh adaptation step one has at his disposal the
following information: the coarse mesh Th,p, the coarse mesh solution Uh,p, the
uniformly refined mesh Th/2,p+1, the reference solution ure! == Uh/2,p+l and
the approximate error function errh,p = Ure! - Uh,p. The question is how to
use the function errh,p to adapt the mesh Th,p in an optimal way.
Recall from Fig. 1 that there always are several possibilities an element
can be hp-refined. With no further information on the exact solution U it
is not known a-priori which hp-refinement is the optimal One. Therefore a
possible strategy could be to parse through all element refinement options,
always creating a new mesh Th ,computing a new approximate solution u'h ,p
~
Automatic Goal-Oriented hp-Adaptivity Without Error Estimates 779
and selecting the element refinement option that maximizes the value of error
drop 6errh,p,
N
6ERRh,p = L Ilure! - IIh,pure!lle,Ki -Ilure! - IIh,pUre!lle,Ki· (13)
i=1
6 One-dimensional illustration
10000 ,---,---,---r----.---,---,--,--,---,
100
0.01
0.0001
le-06
le-08
le-l0
10-12 L---'-_-'------''-----'-_-'----'_-'-_-'--....::J
5 10 15 20 25 30 35 40 45 50
Fig. 3. Convergence curve. x-axis: number of DOF, y-axis: IU-Uh,pl;"l (0,11") in decimal
logarithmic scale
/-~-----,
/\
'----
·2
350
300
250
200
150
100
50
0
0 0.5 1.5 2.5
Fig. 4. Left: Uh,p (solid line) and UTe! == Uh/2,p+l (dashed line). Right: projection
error decrease rates. Only element K3 exceeds 1/3 of maximum and is selected for
refinement. Largest decrease of interpolation error on K3 is achieved by means of its
p-refinement.
Let us recall the basic ideas leading to the formulation of the dual problem,
since they will be used to incorporate the goal-oriented adaptivity into the
energy-driven hp-adaptive strategy discussed in Section 5.
Consider problem (1) and its discrete version b(Uh,p,Vh,p) = !(Vh,p) for all
Vh,p E Vh,p where Vh,p C V is a polynomial finite element approximation of
space V. Define the error eh,p = U - Uh,p and consider the residual rh,p (Vh,p) =
! (Vh,p) - b(Uh,p, Vh,p)' Relate the residual rh,p to the error in the quantity of
interest, i.e. find G E V" such that G(rh,p) = L(eh,p). By reflexivity, G can be
related to an element v in the original space (influence function),
782 P. SoHn, L. Demkowicz
·2
Fig. 5. Left: Uh,p (solid line) and Uh/2,p+l (dashed line). Right: projection error
decrease rates. This time, all elements are selected for refinement. Automatically
selected best combination is p-refinement of Kl and K2 and hp-refinement with
PL = PR = 2 for K 3 .
-2
3.5
2.5
1.5
0.5
o I
o 0.5 1.5 2.5
Fig. 6. Left: Uh,p (solid line) and Uh/2,p+l (dashed line). Right: projection error
decrease rates. Elements Kl and K4 will be p-refined.
Automatic Goal-Oriented hp--Adaptivity Without Error Estimates 783
·2
0.5
1.8
1.6
1.4
1.2
0.8
0.6
l
0.4
0.2
.1
0
0 0.5 1.5 2.5
Fig. 7. Left: Uh,p (solid line) and Uh/2,p+l (dashed line). Right: projection error
decrease rates. Element K4 will be p-refined. In the following step (not depicted),
element K2 will be selected for hp-refinement with PL = PR = 2. And so on ...
Standard orthogonality property for the error in the solution was used.
The estimate (17) shows that the error IL(u) - L(Uh,p)1 in goal is con-
trolled by errors of both the primal and dual solutions in the energy norm.
Therefore, an hp-adaptive algorithm will minimize the error in goal if instead
of elementwise maximizing (13) it will elementwise maximize the product
LERRgoal(K)
h,p' = LERRprimal(K)LERRdual(K)
h,p' h,p' (18)
where
and
Here vref == Vh/2,p+l is the reference solution to the dual problem, calculated
on the uniformly hp-refined mesh Th/2,p+l.
Acknowledgment
The work of the first author was supported by the Grant Agency of the Czech
Republic under Grant No. GPI02/01/D114. The second author acknowledges
the financial support of Air Force under Contract F49620-98-1-0255.
References
1. Ainsworth M., Senior, B. (1997): Aspects of an hp-adaptive Finite Element
Method: Adaptive Strategy, Conforming Approximation and Efficient Solvers.
Comput. Methods App!. Math. Engrg., 150, 65-87.
2. Ainsworth M., Senior, B. (1997): Aspects of an hp-adaptive Finite Element
Method: Adaptive Strategy, Conforming Approximation and Efficient Solvers.
Comput. Methods App!. Math. Engrg., 150, 65-87.
Automatic Goal-Oriented hp-Adaptivity Without Error Estimates 785
3. Demkowicz, L., Oden, J. T., Rachowicz, W., Hardy, O. (1989): Toward a Uni-
versal hp-Adaptive Finite Element Strategy. Part 1: Constrained Approximation
and Data Structure. Comput. Methods App!. Math. Engrg. 77, 79-112.
4. Demkowicz, L., Rachowicz, W., Devloo, Ph. (2002): A Fully Automatic hp-
Adaptivity. J. Sci. Comput. 17, Nos.I-3, 127-155.
5. Rachowicz, W., Oden, J. T., Demkowicz, L. (1989): Toward a Universal hp-
Adaptive Finite Element Strategy. Part 3: Design of hp Meshes", Comput. Meth-
ods App!. Math. Engrg. 77, No.2, 181-212.
6. Heuveline, V., Rannacher, R. (2003): Duality-based adaptivity and the h-p finite
element method. In: M. Feistauer (ed) Proceedings of ENUMATH (August 18 -
22, 2003, Prague, Czech Republic), Springer Berlin Heidelberg
7. Solin, P., Demkowicz, L. (2003): Goal-oriented hp-adaptivity for elliptic problems.
Comput. Methods App!. Math. Engrg., accepted.
8. Solin, P., Segeth, K., Dolezel, I. (2003): Higher-Order Finite Element Methods.
Chapman & Hall/CRC Press, Boca Raton, London, New York, Washington, D.C.
A Compression Method for the Helmholtz
Equation
Summary. The collocation boundary element method for the Dirichlet boundary
value problem is considered. In order to solve efficiently the resulting linear systems
for several wave numbers, the adaptive cross approximation (ACA) method is applied
to the matrices. In particular, the algorithm is reformulated for complex problems
and the so-called Fourier method is used to compute the necessary entries. Finally,
some numerical examples for the solution are presented.
1 Introduction
Whereas we introduced in [10] and [11] a numerical method, for computing the
associated matrices, which is based on the Fourier transform with respect to
the wave number K" we now discuss an efficient method for solving the result-
ing linear systems. In particular, we apply the Adaptive Cross Approximation
(ACA) method (see e.g. [1], [2]) to the transformed matrices and discuss the
behaviour of the compression factors in dependence on the wave number.
The paper is organised as follows.
In Sect. 2, we consider the boundary integral formulation for the problem and
its discrete form. A review of the Fourier method for computing the matrices
is presented in Sect. 3. In Sect. 4, we reformulate the ACA algorithm for the
complex problem. Finally, we give some numerical results (Sect. 5).
where 1') E JR+, and G(x, y, K,) is the fundamental solution of the Helmholtz
equation defined by
1 eil<lx-yl
G(x,y,K,) = -4 I
1t X - Y
I' X,y E JR3.
Note that the fundamental solution and thus the above potential satisfy the
Sommerfeld radiation condition.
Using this potential to treat the Dirichlet BVP (1), we need to solve the
boundary integral equation (BIE)
(2)
For all wave numbers K" the following uniqueness theorem holds.
Theorem 1. The exterior Dirichlet B VP (1) has a unique solution for all
9 E HS(8D), s E JR,
For more details on the analysis of the Helmholtz equation, we refer the reader
to [3], [5].
In order to solve the equation (2) numerically, the surface f is discretised using
a system of N plane, triangle panels f ~ fh = U~l fj . It is natural to use
the approximate function !h in the following form f ~ fh(X) = I:f=l Vj'Pj(x),
where the associated ansatz functions 'Pj, j = 1, ... , N, are piecewise constant
on f j .
Therefore, the BIE above leads to
(4)
where r := Ix - yJ The vector v E(CN and the right-hand side of the systems
are given by (v)j = Vj and (e)i = g(Yi), where Yi, i = 1, ... , N, denote the
corresponding collocation points. It should be remarked that the matrices in
(4) explicitly depend on the wave number K.
3 Fourier-Method
In order to compute the matrices in Eq.( 4) for several wave numbers, we first
apply the inverse Fourier transformation (K ......, ~) to the matrices.
The elements of the inverse Fourier transformed Matrix A(~) = F:,~ [A] (~) E
lR NxN are given by
bij(~) = ~J
41T r
13 (rddz -1) 5(z)1 z=r-~
(nx,x-Yi)dFx , i,j = 1, ... ,N. (6)
rj
A Compression Method for the Helmholtz Equation 789
e,
h
---p ----'.
e,
and
- l(d d h )
bij(~) = 4n tf?~ 5(~ -Idl) - e-d2 J~Ld2_h2 ll[~=in,~=axl(~) , i -=I- j. (8)
, .J
v
~ij(O
C = B respectively. Using the expressions (7) and (8), we obtain new matrices
A(K),B(K) E(CNxN, where its elements are defined by
(9)
and bii(K) = 0,
( "21 I + B- (K)-i7)A(K)
- ) Y=f2. (11)
Due to the independence of the wave number K, the antiderivative O"ij (~) and
the approximation function aij(~) in (9) and, similarly, Yij(~) and ~ij(~) in
(10) need to be calculated only once. Thus we will treat the respective linear
systems (11) for several K :::; Kmax using always these computed data.
eil<lx-YI I el<lxl
I Ix _ yl - Lp(x, y) :::; C
N P2 y P ,
A Compression Method for the Helmholtz Equation 791
where
and c only depends on Va. In (12), jn denote the spherical Bessel functions, h~l)
the spherical Hankel functions and Pn correspond to the Legendre polynomials.
According to the paper [2], we outline the method for the Helmholtz equation,
i.e. we reformulate the so-called partially pivoted ACA for the collocation
matrices of the Helmholtz equation.
Let C E (Cm'xn' be a given block. The method produces vectors UI E (Cm',
VI E<D n ', l = 1, ... , k, from which the approximant Sk can be formed
7. Sk = Sk-l + Ukvk
8. Find ik+1, so that (Uk)ik+l = maxi~Z !(Uk)i!
until the stopping criterion is fulfilled.
Since the block C will not be generated completely, the Frobenius norm of the
approximant Sk will be used to obtain the stopping criterion. An appropriate
stopping criterion is to terminate the iteration, if for a given c > 0 at step k
it holds that
792 M. Stolper, S. Rjasanow
(12)
5 Numerical Experiments
by a soft-sound domain n. Then the total acoustic wave takes the form u = uI +
us, where uS denotes the scattered wave satisfying the Sommerfeld radiation
conditions. Further, the homogeneous Dirichlet boundary condition for u holds,
g = -uI. In particular, we present numerical experiments for the BIE (4).
Since we chose the surface ofthe unit sphere, see Fig. 2 (I), the solution f(y)
is known,
The solution, which arises from the Fourier method, is denoted by fFT.
We apply the algorithm to a family of surfaces converging to the unit sphere.
The sequence is generated by recursive refinement of the meshes dividing each
of the surface triangle in four and projecting the new knots to the unit sphere,
cf. [2].
A Compression Method for the Helmholtz Equation 793
For the approximation of the blocks we used the above algorithm and E in (12)
is chosen 10- 4 , while the relative accuracy of un-preconditioned GMRES was
10- 6 .
We always consider the compression factors (C F), i.e. the ratios of the amount
of storage needed when using the approximant and the amount of storage for
the original matrix.
In the tables below, the factors are printed for different N, the first one for
a fixed wave number K, = n/2 (Table 1) and the second one for various wave
numbers with K, • h ~ 0.36 (Table 2).
for the exterior Dirichlet BVP for the Helmholtz equation using collocation
with piecewise constant ansatz functions. Since we chose w = G(x, Yo, K,) with
794 M. Stolper, S. Rjasanow
Approximation %
100'-~~-=~~~==============~
80
60
40
20
3
kappa
References
1. Bebendorf, M. (2000): Approximation of boundary element matrices. Numer.
Math., 86, 565-589
2. Bebendorf, M., Rjasanow, S. (2003): Adaptive low-rank approximation of collo-
cation matrices. Computing, 70, 1-24
3. Chen, G., Zhou, J. (1992): Boundary element methods. Academic Press Ltd.,
London
4. Cheng, H., Greengard, L., Rokhlin, V. (1999): A fast adaptive multipole algo-
rithm in three dimensions. J. Comput. Phys., 155, 468-498
5. Colton, D. L., Kress, R.(1992): Integral equation methods in scattering theory.
Krieger Publishing Company, Malabar
6. Goreinov, S. A. (1999): Mosaic-skeleton approximation of matrices generated by
asymptotically smooth and oscillatory kernels. In: Tyrtyshnikov, E. (ed.) Matrix
Methods and Algorithms, INM RAS, Moscow (in Russian)
7. Hackbusch W. (1999): A sparse matrix arithmetic based on H-matrices. 1. Intro-
duction to H-matrices. Computing, 62, 89-108
A Compression Method for the Helmholtz Equation 795
°
cost and operational safety.
We assume that (0, T) with T > is a time interval and by Qt we denote
a computational domain occupied by the fluid at time t. By u = u(x, t) and
p = p(x, t), x E Qt, t E (0, T), we denote the velocity and the kinematic
pressure (i.e, dynamic pressure divided by the density of the fluid), respectively,
and l/ will denote the kinematic viscosity.
Application of a Stabilized FEM to Problems of Aeroelasticity 797
(3)
With the aid of ALE mapping we compute the so-called ALE derivative rt; ,
which is anologous to the material derivative in the Lagrangian approach. Foro
a function f : Dt x (0, T) ----+ R, we set
D.A oj
Dt f(x, t) = at (X, t), (4)
(9)
We denote by Twt the boundary of the airfoil at time t. (See Fig. 1 showing
schematically the difference between the Lagrangian and Arbitrary Lagrangian-
Eulerian mapping.) On Twt we assume that the fluid velocity u equals the
velocity lir of the profile:
The part To of the boundary represents the outlet, where we prescribe the
"do-nothing" boundary condition
-(p - Pre!)n + v -
au = 0 on To, (11)
an
where n is the unit outer normal to ant and Pre! is a prescribed reference
outlet pressure.
2 Discretization
where wn+l ;:::; W(tn+I)' This problem is equipped with the boundary condi-
tions (9) ~ (11) on 8Dtn +1 • Taking into account that Atn+l (A~I (Xi)) E Dtn +1 ,
we can transform equations (14) completely to the domain Dtn +1:
3un + 1 _ 4u n + u n- I
(15)
27
+ ((u n+1 _ wn+l) . \7) u n+1 - vLlun +1 + \7pn+l = 0
· U n+1 = 0
d IV .
In
n
Jttn+l'
where ui = u i 0 Atn+l 0 A~I. This system is again equipped with the boundary
conditions (9) - (11).
In what follows, we shall carry out the space discretization of the problem
to find approximations of functions u := u n +1 and p = pn+1 defined in the
domain Dtn+ll satisfying system (15) and boundary conditions (9) - (11). To
this end, we reformulate this problem in a weak sense. Let us set D = Dtn+l
and define the velocity spaces W = (HI(D))2,X = {v E W;VlrDnrWt = O}
and the pressure space M = L5(D) = {q E L2(D);J.nqdx = O}. Then it is
easy to find that the solution U = (u,p) of problem (15) satisfies
(18)
The couple (Xh, M h ) of the finite element spaces should satisfy the Babuska~
Brezzi (BB) condition, which guarantees the stability of the scheme: there
exists a constant c > 0 such that
Application of a Stabilized FEM to Problems of Aeroelasticity 801
(p, \7. w)
sup
WEXh
iw iHI(st) ~ ciipii£2(st), Vp E M h , h E (0, h o). (19)
Lh(U*, U, V) =
~
L 6K ( ~u - v6u + (w. \7) U
~
+ \7p, (w· \7)v)
K
,
KE'h
a(Uh, Uh, Vh) + Lh(Uh, Uh, Vh) + Ph(Uh, Vh) = f(Vh) + Fh(Vh), (24)
VVh E X h X M h .
802 P. Svacek, M. Feistauer
(25)
where
(26)
is the local Reynolds number and hK is the size of the element K measured
in the direction of w. The factor ~(-) is a monotonically increasing function
of Rew such that for local advection dominance (Re W > 1) ~ ---> 1 and for
local diffusion dominance (Re W < 1) ~ ---> 0. The parameter (j* E (0,1] is an
additional free parameter. We set, e.g.
W
_ =
~(ReW) min (Re
-6-,1 ) . (27)
The airfoil can oscillate in the vertical direction and in the angular direction
around the so-called elastic axis. This vertical and torsional motion is described
by the linearized system of ordinary differential equations (see [8], [3])
F -1 L
=
rWt
2
j=l
T2jn j dS, (33)
M = -1 L rWt
2
i,j=l
Tijnjr?rtdS,
where
r ort
l =- (
X2 - XT2,
)
r2
ort
= Xl - XT1,
n = (nl' n2) is the unit outer normal to ant on rWt (pointing into the airfoil)
and XT = (XTl' XT2) is the given position of the elastic axis (lying in the
interior of the airfoil) and P is the fluid density.
System (32) is tranformed to a first-order ODE system and then solved by
the second-order Runge-Kutta method. We proceed in such a way that the
computed approximate solution Uh of (24) on time levels tn and tn-1 and the
corresponding force F and moment M are extrapolated and used for obtaining
Hand Q at tn+!. This allows us to determine the mapping Atn+ll the domain
ntn +1 and approximate the domain velocity w n + 1 . Then we pass to (24) on
the next time level t n +1.
5 Numerical results
Reynolds number equal to 400000. Fig. 5 shows the position of the moving
profile and velocity isolines at four time instants. We can see conspicuously
von Karman vortices leaving the airfoil. Moreover, the oscillations of vertical
position h of the airfoil and the angle a of rotation around the elastic axis are
shown and values corresponding to the above airfoil positions are marked.
ll@
I'
f \ (I
2 I \ f \ /, I , ~
I ' I \ J \ / \,.../,..' ............. - - - -
, " IJ _
1
,
,
"~I
t ~
o '1:1J
time tima
ering large displacements". This research was also supported under Research
Plans MSM 21000003 and MSM 113200007 of the Ministry of Education of
the Czech Republic.
References
Kenji Tomoeda
Summary. The dynamical behavior of the pulses, which is governed by the interac-
tion between diffusion and absorption, shows the several phenomena. The remarkable
ones are the pulse splitting phenomena which accompany pulse connecting phenom-
ena. In this paper such phenomena are investigated from numerical points of view,
and the mathematical justification is stated.
1 Introduction
We consider the propagation of thermal waves in an one-dimensional absorbing
medium in which there is an interaction between diffusion and absorption. To
describe such a propagation we may use the nonlinear diffusion equation with
absorption, which is well-known as the description of the flow of the liquids
through the homogeneous porous medium and is represented in the form of
the initial value problem:
Vt =
(vm)xx - cv P , t > 0, (1.1 )
v(O,x) = vO(x), (1.2)
Here we have the following assumptions:
(i) m(> 1), p(> 0), and c(2: 0) are constants and m + p 2: 2;
(ii) vO(x) E CO(Rl) is nonnegative and has compact support.
In a heated plasma v denotes the temperature and -cv P describes the losses
caused by radiation. We may take p = 0.5 for bremsstrahlung radiation and
0.5:::; p :::; 2 for synchrotron radiation [14]. The diffusion rate of (1.1) vanishes
at points where v = O. This degeneracy causes the occurrence of the finite
propagation of the support.
From analytical points of view, Aronson [1], Oleinik, Kalashnikov and
Chzou Yui-Lin [13], Kalashnikov [7]'[8]' and Herrero and Vazquez [6] proved
the existence and uniqueness of a weak solution and the property of the finite
propagation of the support under the assumptions stated above. Moreover,
v(t, x) is smooth in the open set P(v) = {(t, x)lv(t, x) > 0 and t > O}, and has
the following properties:
Dynamical Behavior of Initiated Pulses 807
(P-1) For c = 0, or c > 0 and p ;:::: 1 the diffusion is active and supp v(t,·)
monotonously expands as t increases;
(P-2) For c > 0 and 0 < p < 1 the absorption is active and the solution
vanishes identically at some finite time T* > O.
In Case (P-1) supp v(t,·) never splits into any multiple connected components
for t > 0, when supp vO(x) is connected. Thus the pulse splitting phenomena
never appear. In Case (P-2) there is a possibility of the pulse splitting phe-
nomena caused by absorption, when VO(x) has two local maxima. Rosenau
and Kamin [14] suggested this possibility by numerical computation. Chen,
Matano and Mimura [3] constructed the pulse splits into multiple connected
components in a finite time. This motivates us to investigate the more detail
of the behavior of pulses. For this end we continue numerical computation
and find the following phenomena, where the initial pulse vO(x) has two local
maxima and a connected compact support:
(NS-1) The pulse splitting phenomena appear, and thereafter these two pulses
evolve separately until one of them vanishes (see the left hand side in
Fig. 1);
(NS-2) The pulse splitting phenomena never appear for t > 0 (see the right
hand side in Fig. 1);
(NS-3) After the pulse splitting phenomena appear, these pulses become con-
nected, and thereafter the pulse splitting phenomena appear again (see
Fig. 2).
0.16 0.18
o o
x x
-2 2 -2 2
0.24
t
o
X
OL_2~----~------------------~~----~2
_ {ih for~:Z\{L-1,R+1},
Xi- C fort-L-1, (4.4)
r for i = R + l.
L = L(C) == min{i E Z I ih > C}, C = C(Uh), (4.5)
R = R(r) == max{i E Z I ih < r}, r = r(uh). (4.6)
Sh,k is somewhat complicated form and its detail is stated in [10]'[11] and
[12]. We omit the description of Sh,k. The variable time step k = kn+l ==
tn+l - tn (to = 0) determined by
1
k = -max(uL, UL+l) for the approximation to the left interface, (4.7)
c'
or
1
k = -max(uR, UR-l) for the approximation to the right interface. (4.8)
c'
The left and right numerical interfaces are defined by
respectively. When Sh,kuh' == 0 holds for some integer n* > 0, we put the nu-
merical extinction time T:; = tn*+l == t n*+kn*+l, and stop the numerical com-
putation. We define the left(resp. right) numerical interface curves Ch(t)(resp.
rh(t)) by piecewise-linearly interpolating (tn,Cn)(resp.(tn,rn))(O::::; n::::; n*).
We obtain the basic estimates which enable the proofs of the convergence
of the numerical solutions and the convergence of the numerical interface
curves([10]' [11], [12]). The former can be proved by Graveleau and Jamet's ar-
gument used in the proofs of Lemma 6.1 and Theorem 7.1([5]). The latter can
be proved by applying the idea of DiBenedetto and Hoff [4] to our difference
scheme. Moreover, we obtained the interface equation(see Main Theorem in
[11]). We state the basic estimates and the convergence of numerical solutions
without proof.
Theorem 1 (Basic estimates). Under Condition A assume u~ E Vh. Then
u h either becomes extinct or belongs to Vh for each n ;::: 0, and the following
estimates hold for all n ;::: 0:
where
(2.2)
(2.3)
p > 0 and u > 0 are arbitrary numbers and [g]+ = max{g, O}. This solution
satisfies (1.1)~(1.2) with vO(x) = K(O,x,p,u) in the weak sense and becomes
a classical solution in the open set wherein K(t, x, p, u) > o. It is easily seen
that supp K(O, x, p, u) = [-p, p] and the right and left interface curves (+ and
(~ are written as follows, respectively:
When
u< 2p 1m (2.5)
(m-1)2V~
holds, supp K(t,x,p,u) expands for t E [0, T(p,u)], where
Dynamical Behavior of Initiated Pulses 811
(2.6)
For t > T(p, 0-) supp K(t, x, p, 0-) shrinks and K(t, x, p, 0-) identically vanishes
at the extinction time t = T* (p, 0-) given by
(2.7)
Fig. 3. Kersner's Exact Solution and Free Boundary with m = 1.5, p = 0.5, c = 1
O( ) _ {K(t,x+P+"(,P,o-) on (-00,0]'
(3.1)
v x - v O( -x ) on (0, (0).
Let 7) E (0, p)(o < 7) < p) be an arbitrary fixed constant and E: be an arbitrary
positive number such that E: < K(O, p - 7), p, 0-). For vO(x) we introduce an
even function v~ (x) satisfying the following
CONDITION C. i) v~(x) = v~(-x) and VO(x) :::; v~(x) hold on R\
ii)
on (-00, -"( - 7)],
(3.2)
on [-,,(,0]'
812 K. Tomoeda
Let v(t, x) and ve(t, x) be the solutions of (1.1)-(1.2) with v(O, x) = vO(x) and
ve(O, x) = v~(x), respectively. Then we obtain the theorem, which justifies the
appearance of the phenomena (NS-4).
Theorem 3. Under Conditions A and C let" p and a be constants such that
supp v(t,x) becomes connected at t = T(p,a). Then, for some €, there exist
i (0 < i < T(p, a)) and x (-, :::; x :::; ,) such that vg(i, x) = 0 holds, and
supp vg(T(p, a)" ) is connected.
(3.6)
I: I:
where Ue = (ve)m-l. By using these inequalities and Condition C we obtain
ue(t,x)dx = ue(O,x)dx
+
Jo
tj"l-"I {mue(t, X)Uexx(t, x) + a(uex(t, x))2 - c'} dxdt
I:
2,sm-l
-it
=
{2,c'-(m-2)a ue(t,x)uexx(t,x)dx-a[ue(t,x)uex(t'X)["I}dt
2,c'
(3.8)
d1 < a ( (2 - m)TV(u~) + 211u~1100 ) .
Dynamical Behavior of Initiated Pulses 813
Then, by the continuity of the solution ve(t, x) and the comparison theorem
on the initial data([2]) there exist positive constants El and Tl < T(p, 0") such
that
max ue(t, x) < dl for t < Tl and E < El < min (d l , K(O, P - 7], p, 0"))
[O,t] x [--y,-y]
(3.9)
We put
t. ( ) _ 2')'E m - l
(3.10)
2 E - {2')'c'-ad l ((2-m)TV(ug)+21I ugll oo ) } '
1:
and choose S < EI such that T2 (s) < TI . Hence, it follows from (3.7) that
5 Acknowledgments
This work was supported by Japan Society for the Promotion of Science
through Grant-in-Aid (No. 13440038) for Scientific Research (B).
References
l. D.G. Aronson, The porous medium equation, In some Problems in Nonlin-
ear Diffusion(eds. A. Fasano and M. Primicerio), Lecture Notes in Mathematics
1224, Springer-Verlag,1986.
2. M. Bertsch, A class of degenerate diffusion equations with a singular nonlinear
term, Non-linear Anal., 7(1983),117-127.
3. X.-Y. Chen, H. Matano and M. Mimura, Finite-point extinction and continuity
of interfaces in a nonlinear diffusion equation with strong absorption, J. reine
angew. Math., 459(1995),1-36.
4. E. DiBenedetto and D. Hoff, An interface tracking algorithm for the porous
medium equation, Trans. Amer. Math. Soc., 249(1984),463-500.
5. J.L. Graveleau and P. Jamet, A finite difference approach to some degenerate
nonlinear parabolic equations, SIAM J. Appl. Math., 20 (1971), 199-223.
6. M.A. Herrero and Vazquez, The one-dimensional nonlinear heat equation with
absorption: Regularity of solutions and interfaces, SIAM J. Math. Anal., 18
(1987), 149-167.
7. A.S. Kalashnikov, The propagation of disturbances in problems of non-linear
heat conduction with absorption, Zh. Vychisl. Mat. i Mat. Fiz., 14 (1974), 891-
905.
814 K. Tomoeda
1 Introduction
The aim of this paper is to describe the class of HLLE Riemann solvers and
propose a fully two-dimensional extension of the one-dimensional HLLC Rie-
mann solver from [8].
In 1983, Harten, Lax and van Leer suggested [3] to approximate the solu-
tion of a Riemann problem by three constant states, separated by two waves,
propagating with constant speeds. A particular algorithm for computation of
these wave speeds were presented five years later by Einfeldt [2]. Since the
assumption of two waves is correct only for hyperbolic systems of two equa-
tions, Toro, Spruce and Speares [8] added one more wave, creating the 4-state
one-dimensional HLLC solver. In fluid dynamics, this new wave corresponds
to a contact discontinuity. Later, Wendroff presented a series of 9-state solvers,
extending the 3-state HLLE approach to two dimensions [10]' [11]. In this pa-
per, we construct a contact-corrected version of this approach, adding six new
waves.
The outline is as follows: First, we introduce the basic principles of HLLE
and HLLC Riemann solvers in the simple one-dimensional case. Then we pro-
ceed to two dimensions, describe the 9-state solver from [10]' [11] and introduce
its new, contact-corrected version. Finally, we demonstrate the application of
our new solver in two particular numerical methods, namely the Godunov-
type difference scheme and the WAF (Weighted Average Flux) approach, and
present results of selected numerical tests.
816 P. Vachal et al.
t1
Xi+l!2
(a) (b)
2 One Dimension
Methods presented in this paper have been derived for the Euler equations. In
one dimension we have
where p is the density, u fluid velocity, p pressure and E the density of the
total energy. Completing the system (1) by the equation of state for an ideal
polytropic gas p = (,-1) (E - ~pu2), we can write it in the general differential
form Wt + f(w)x = O.
Consider the initial Riemann problem
+ ( ""2
~x - bI ~t ) WI + [1 (WI) - 1 (Wo)] ~t.
Note that data below the plots in Fig. 1 show the absolute position, while data
above the plots are relative distances to the center of the staggered cell. Such
simplified notation will be especially advantageous later in 2D.
Solving (3) for Wl. gives
2
Note that there is also another way to evaluate PI, EI Land EI R only from
2 2' 2 '
Rankine-Hugoniot conditions [6, 7, 1].
pu 2pu+p ) + pv
puv ) _ 0
(6)
( puv ( pv 2 + p -
(E+p)u x (E+p)v y
818 P. Vachal et al.
Y=Yj+ll2 ~ V},u:p- t
bg,lIzl1t
b~,1I2I1t b~,lf2At
"
bl~l2l1t "
bri~2At
Fig. 2. 2D Riemann solvers: (a) 9-state HLLE with highlighted initial condition; (b)
16-state HLLEC
with the equation of state for ideal polytropic gas p = ("(-l)(E - ~p(u2 +V2)),
where v is the fluid velocity in the y-direction.
First, let us summarize the fully two-dimensional 9-state HLLE Riemann
solver, originally presented in [11]. We want to solve the 2D Riemann problem,
formed by two partitions at x = xi+! and y = YH!' subdividing the domain
[Xi, Xi+l] X [Yj, Yj+l] into four regions with constant states. As the system
evolves in time, we approximate the solution as shown in Fig. 2(a). Waves,
propagating from the center with constant speeds, split the domain into 9 re-
gions. For ~t sufficiently small, none of the waves in the y-direction will reach
°
the edge Y = Yj, so that the state Wl is affected only by the one-dimensional
2'
Riemann problem in the x-direction, given by Wo,o and WI,o. Analogous con-
siderations along the other three edges lead us to following method: The corner
states ( Wo,o, WI,o, WO,I, WI,I) stay undisturbed. We take these to compute
the edge states (W:;.1 , 0, W 12 I, Wo '21, WI ' 1),
I 2
using a 3-state solver based on the
one-dimensional HLLE solver from section 2. Then we do not need to solve
the real two-dimensional problem in the central area, since W 1 1 is given by
2'2
a two-dimensional conservation law, applied to the whole staggered cell.
Such 9-state Riemann solver works well for problems containing shocks and
rarefaction waves, but its weakness is poor resolution of contact discontinuities.
Our goal was to modify it to a solver which resolves contact surfaces better'
(stationary contacts even exactly), and which degenerates to the ID 4-state
solver from Sect. 2 for one-dimensional initial problems in coordinate directions
(i.e. if there is only one interface in the 2D initial condition).
The most straightforward way is to follow Toro's one-dimensional approach
and split e?-ch of the intermediate edge regions (i.e. states W 1 0, W 1 I, Wo 1,
2' 2' '2
WI , 1)
2
by a wave corresponding to a contact discontinuity. It is obvious that
to fulfill the degeneracy condition mentioned above we must subdivide also
Fully Two-dimensional HLLEC Riemann Solver 819
the central region (i.e. state WI 1) by two additional waves. This gives us the
2'2
setup for a 16-state solver as shown in Fig. 2(b).
It is not difficult to compute speeds of splitting waves and values of new
intermediate states along the edges. Basically, we use the same process as in
one dimension (5), taking also transversal velocity into account. In [9] it is
proven that we can simply copy these from the nearest corner states, that is
along the edge Y = Yj we simply take VIOL 2' ,
= Vo ' 0 and VI2 0 R = VI 0, along
J , ,
and U1,1, weighted by the lengths of the interfaces of their regions with up-
per right central region, while for VI 1 UR, we taked weighted average of VI 0,
2' 2' '
VI 1 L, VI 1 U and vII. Then we estimate central densities. This time, for each
, 2 ' '2' '
central state, we compute the density from ID Rankine-Hugoniot conditions
across each interface with surrounding regions (i.e. with corner and edge re-
gions), and weigh the results again by lengths of interfaces. Since we know the
total mass in the central states, which must be conserved, we multiply now all
the four density estimates by a suitable factor to satisfy the mass conservation
law. Similarly, we correct the momentum estimates (corrected density multi-
plied by estimated velocity) in both directions by adding a suitable amount to
each state, in order to keep momentum conserved. Finally, we suppose pres-
sure to be the same in all four central regions and compute its value from the
conservation law for total energy.
Let us summarize the complete algorithm of our new 2D 16-state HLLEC
Riemann solver (consult Fig. 2(b»:
1. Compute wave speeds b1 0' bi 0' b1 1 , bI 1 , bg 1, b6 l ' b~ 1 and b~ 1, using
2' 2' 2' 2' '2 '2'2 '2
Einfeldt formulas from [7].
2. Compute the intermediate (light shaded) states for each of four ID prob-
lems along edges.
a) Using the ID 4-state HLLC solver with appropriate 2D fluxes, com-
pute the longitudinal velocities (i.e. Ul 0, Ul l' Vo 1, VII) and both
2' 2' , 2 ' 2 '
densities (i.e. PI2" 0 L, PI2" 0 R, PIlL'
2"
PI 1 R, Po 1 L, Po 1 U, PI 1 Land
2" '2' '2' '2'
Pq,u)·
820 P. Vachal et al.
b) Copy the transversal velocities always from the nearest corner state
(i.e. VIOL
2' ,
= Vo , 0, Uo '2'
1 L = Uo , 0, etc.
c) Compute the intermediate pressures (i.e. PI 0, PI 1, Po 1, PI 1 ) so, that
2' 2' '2 ) 2,'
the total energy in "lD-stripe" along each edge is conserved.
3. Compute the central state WI 1 from the 2D integral conservation law
as if it was not further subdi~ided. These values will be used to assure
conservation and split the central region.
4. Split the central region into four pieces (UL, UR, LL, LR) divided by waves
with speeds based on velocities from the previous step.
5. Estimate eight central velocities (i.e. Ul 1 "" VII"" nE{UL LL UR LR})
2' 2' .f ' 2) '"
from adjacent edge and corner regions, so that the solver degenerates to
ID solver for ID initial conditions in X-, resp. y-direction.
6. Estimate four central densities (i.e. PI2' 12 n' nE{UL LL UR LR}), using Rankine-
1 1 ) '
Hugoniot conditions across the interfaces with surrounding (edge and cor-
ner) states, weighted by lengths of particular interfaces.
7. Correct central densities from step 6 so, that the integral mass conservation
law over the whole staggered cell is satisfied.
8. Correct central velocities from step 5 so, that integral momentum conser-
vation laws in both coordinate directions over the whole staggered cell are
satisfied.
9. Finally, compute central pressure PI2'21 (assuming it is equal for all four
central states) so, that the total energy is conserved.
v (1) ax
-r----------~~------------r t=tn
(7)
where FLl and Fin+l are weighted average fluxes in the staggered cell at left,
2 2
resp. at right. These can be computed at each time level from real physical
fluxes in four states of the solver as
4
FH! = La
k=l
k f (Wi~~) . (8)
k = 1, ... ,4 (9)
One can use the van Leer limiter, minmod, or many others listed for example
in [7].
In two dimensions, the conservative formula is
and we compute each weighted average flux from cell-sized region centered
at the point given by its indices (i.e. flux FH!,j from rectangle [Xi,Xi+1] x
822 P. Vachal et al.
tnt! tn
Yj+l
Yj+ll2
Yj
/ / /
+---''''\7-,'-i-'~'--+ t ll+1
+---------~---+t"
(a) (b)
[y j _! ' y1+!], etc.). Rather than to introduce 2D limiters and derive the WAF
method in two dimensions, we use the 1D WAF approach in each appropriate
direction, with each of the four fluxes taken as an average from a suitable set
of states. This is shown schematically in Fig. 4 and described more precisely
in [9].
5 Numerical Results
In Fig. 5, we can see results for the density, as computed by the Godunov
scheme with 9-state solver, and by both Godunov and WAF schemes with the
new HLLEC solver. The test is a slightly modified version of Test 16 from [4],
originally presented and described in [5]. The domain [0, 1] x [0, 1] is split by
two partitions, located at x = 0.5 and y = 0.5, into four regions with constant
states, forming a 2D Riemann problem. Initial values
PUL = 1.0222 PUL = 1.0 PUR = 0.5313 PUR = 0.4
UUL = -0.7179 VUL = 0.1 UUR =a VUR = 0.1
PLL = 0.8 PLL = 1.0 PLR = 1.0 PLR = 1.0
ULL =0 VLL = 0.1 ULR = a VLR = 0.8276
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
(a) (b)
WAF with HLLEC (16-sl.)
0.2
0.1
(c)
Fig. 5. Contour plots of density at time t = 0.2 as computed by particular difference
schemes
exactly. However, also shock and rarefaction wave are also treated better by
the new solver. The price we have to pay for this improved resolution are
slight but visible artifacts at initial locations of discontinuities, in the middle
of the upper edge in Fig. 5 (b),(c). Comparing both schemes which use the
HLLEC solver, we see that WAF performs better than Godunov, which is not
surprising, since it is a second order accurate scheme,
References
1. P. Batten, N. Clarke, C. Lambert and D. M. Causon (1997): On the Choice of
Wavespeeds for the HLLC Riemann Solver, SIAM J. Sci. Comput., 18, 1553-1570
2. Einfeldt B. (1988): On Godunov-type Methods in Gas Dynamics. SIAM J. Nu-
mer. Anal., 25 (2), 294-318
3. Harten A., Lax P.D., van Leer B. (1983): On Upstream Differencing and
Godunov-type Schemes for Hyperbolic Conservation Laws. SIAM Rev., 25 (1),
35-61
4. Lax P.D., Liu X.-D. (1998): Solution of Two-Dimensional Riemann Problems of
Gas Dynamics by Positive Schemes. SIAM J. Sci. Comp., 19 (2), 319-340
5. Schulz-Rinne C.W., Collins J.P., Glaz H.M. (1993): Numerical Solution of the
Riemann Problem for Two-dimensional Gas Dynamics. SIAM J. Sci. Comput.,
14, 1394-1414
6. Toro E.F. (1989): A Weighted Average Flux Method for Hyperbolic Conservation
Laws. Proc. R. Soc. London A, 423, 401-418
7. Toro E.F. (1997): Riemann Solvers and Numerical Methods for Fluid Dynamics.
Springer Verlag, Berlin, Heidelberg
8. Toro E.F., Spruce M., Speares W. (1994): Restoration of the Contact Surface in
the HLL-Riemann Solver. Shock Waves, 4, 25-34
9. Vachal, P. (2003): Some Aspects of Numerical Resolution of Contact Discontinu-
ities in Conservation Laws. MS Thesis, Czech Technical University, Prague
10. Wendroff B. (1999): A Two-dimensional HLLE Riemann Solver and Associated
Godunov-type Difference Scheme for Gas Dynamics. Computers and Math. with
Applications, 38, 175-185
11. WendroffB. (1999): Approximate Riemann Solvers, Godunov Schemes, and Con-
tact Discontinuities. In: Godunov Methods. Theory and Applications. Oxford,
UK, Oct. 18-22, 1999
Deflation Accelerated Parallel Preconditioned
Conjugate Gradient Method in Finite Element
Problems
1 Introduction
Large linear systems occur in many scientific and engineering applications. Of-
ten these systems result from a discretization of model equations. The systems
tend to become very large for three-dimensional problems. Some models in-
volve time and space as independent parameters and therefore it is necessary
to solve such a linear system efficiently at each time-step.
In this paper we only consider symmetric positive definite (SPD) discretiza-
tion matrices. Since the matrices are sparse in our applications, we use an
iterative method to solve the linear system. In order to get a fast convergence
of the method we use a preconditioned Conjugate Gradient method, where
incomplete Choleski factorization is used as a preconditioner. This method is
very suitable for parallellization.
The present study involves a parallellization of the Conjugate Gradient
method in which the inner products, the matrix-vector multiplication and pre-
conditioning are parallellized. This parallellization is done by the use of domain
decomposition, where the domain of computation is divided into sub domains
and the overall discretization matrix is divided over the subdomains. To each
subdomain we allocate a processor. A well-known problem is that the parallel-
lized method is not scalable: the number of CG-iterations and wall-clock time
increase as the number of sub domains increases. To make the method scal-
able one uses a coarse grid correction (see for an overview and introduction
Smith et al [6]) or the deflation method. In [3] it is shown that deflation gives
a larger acceleration to the parallel preconditioned CG-method. The idea to
use deflation for large linear systems of equations is not new. Among others,
Nicholaides [4] and Vuik et al [8, 1, 10, 9] apply this method to solve large ill-
conditioned linear systems. The result of deflation is that the components of
826 F.J. Vermolen et al.
where u denotes the solution and u represents a given function. The resulting
discretization matrix is symmetric positive definite. The domain is divided
into sub domains and the resulting system of linear equations is solved by the
use of a parallellized Deflated ICCG. In the text the algorithm is given and
the issues of data-structure for the parallellization of the solution method are
described. Subsequently, we describe some numerical experiments. For more
mathematical background we refer to [1, 10,7].
Parallel deflated ICCe method 827
original domain
subdomain 1 subdomain 2
Since PAis singular the solution is not unique. We denote the solution that is
obtained by use of the ICCG method on equation (5) by x. To get the solution
x we use
x = (I _pT)x+pT x . (6)
It is shown in [7J that pTx = pT x, hence the solution of equation (5) can
be used. The second part (I - pT)x = Z(ZT AZ)-l ZTb is relatively cheap to
compute. Hence the solution x is obtained by addition of the two contribu-
tions, i.e. x = pTx+Z(ZT AZ)-l ZTb. For completeness we give the algorithm
of the Deflated ICCG:
Algorithm 1 (DICCG [9]):
- = P E.o'P.. = ~o = L- T L- 1E.o
k = 0 ,E.o -
1
while IIfkl12 > C
-T
k = k + 1, Q;k - E.k-1 ~k-1
- p..rPAP.. k
±k = ±k-1 + Q;P..k' fk = f k- 1 - Q;kPAP..k
-T
Z = L-TL-L;: (3 - E.k ~k
-k -k' k- -T
E.k-1 ~k-1
P..k-1 = ~k + (3kP..k
end while
The inner products, matrix vector multiplication and vector updates in the
above algorithm are easy to parallellize. Parallellization of the incomplete
Choleski preconditioned Conjugate Gradient method has been done before by
Perchat et al [5J. We use a restriction and a prolongation operator and block
preconditioners for the preconditioning step in the above algorithm. Note that
it is necessary to have a symmetric preconditioner. This is obtained by choosing
the restriction and prolongation matrices as transposes of each other. Let E.k be
the residual after k CG-iterations and N be the total number of sub domains,
then overall preconditioning is expressed in matrix form by:
We create the vectors Zj in the subdomain Dj only and send essential parts to
its direct neighbours. We explain the data-structure and communication issues
for a rectangular example. In the explanation we use the global numbering from
the left part of Figure 2. Note that in the implementation the local numbering
is used in the communication and calculation part. The global numbering is
used for post-processing purposes only. The example can be generalized easily
to other configurations. The situation is displayed in Figure 2.
Q4 Q Q
25 Q 3 47
21 22 23 24 8 97 8 9 3
16 17 18 19 20 4 5 6 4 5 6
11 12 13 14 15 1 2 3 1 2 3
7 8 9 7 8 9
Ii !7 IR 19 10
4 5 61 4 I) 6
1 2 3 4 5
1 1 2
Q 1 Q Q 1. j
3~
2 1
Fig. 2. A sketch of division of [2 into sub domains [21, ... , [24. Left figure represents
the global numbering of the unknowns, right figure represents the local numbering.
Processor 1 Processor 2
Make Zl Make Z2
Communication
Make Az l , Az[ Make Az2, Az[
Communication
AZl = AZl + Az[ AZ2 = AZ2 + Az[
Ell = zi AZl E22 = zI AZ2
E12 = zi Az[ E22 = zI Az[
Send E to proc 1
Choleski decomp E
Pv=
v - AZE-1ZTv
Compute ziv Compute zIv
Send zIv
Send y to proc 2
v - ylAz l - Y2Az[ v - Y2Az2 - Y1Az[
Parallel deflated Ieee method 831
4 Numerical experiments
200
~
,g
~150
.. .... ----------------
"
..' .............
50 ........
~L---~--~--~4--~~--~--~
~
°1L---~--~---4~--~--~--~
Fig. 3. Left figure: The number of iterations as a function of the number of layers for
deflated and non-deflated parallellized and sequential Ieee method. Right figure:
The wall clock-time as a function of the number of layers for deflated and non-deflated
parallellized and sequential Ieee method.
832 F.J. Vermolen et al.
roL,--~~--~,----7,----,~--~--~ ro~,--~--~~,---7,--~.~-7--~~
number of subdomains number of subdomains
Fig. 4. The number of iterations as a function of the number of layers for the
parallellized ICCG method for three methods: no projection, coarse grid correction
and deflation. Left graph: layered extension of the domain of computation, Right
graph: blockwise extension of the domain of computation
5 ConcI usions
References
Summary. Checkpointing techniques become more and necessary for the compu-
tation of adjoints. This paper presents the more common multi-level checkpointing
as well as the less known binomial checkpointing. The checkpointing approaches are
compared with respect to the number of time steps the adjoint of which can be calcu-
lated, the run-time needed for the adjoint calculation and the memory requirement.
Some examples illustrate the shown results
1 Introduction
i = 1, ... ,N,
where Yi E R n denotes the state and Ui E Rrn the control at time ti for a given
time grid to, ... , tN with to = 0 and tN = T. The operator Fi : Rn xRrn x R f--+
R n defines the time step to compute the state at time t i . Note that we do not
assume a uniform grid. To optimize a specific criterion or to obtain a desired
state, the cost functional
J(U) = l(y(u), u)
Reverse:
dOP=Cl,O,-1
Evaluate the time steps F i , p. C2 < i < N storing the states i, p. C2 :::;
i < N -1,
perform the adjoint steps Pi, N ?: i > p. C2 to calculate the adjoints,
set N = p' C2, if P > 0 read the contents of checkpoint p.
end do
Fig. 1 sketches the two-level checkpointing for N = 16 time steps and
C = Cl + C2 = 6. Throughout, the time steps are plotted along the vertical
axis and the computing time required for the adjoint calculation is represented
by the horizontal axis. Each solid horizontal line including the horizontal axis
itself represents a checkpoint. The time, when a state is stored in a checkpoint,
is marked with a black circle for the first level and with a black square for the
second level. The slanted black lines represent the evaluation of time steps. The
adjoint steps are drawn as dashed slanted lines. Finally, black arrows depict
the usage of a state Yi for an adjoint step Fi+l without performing the corre-
sponding time step F i . This adjoint calculation is possible due to the assumed
structure (1) of the adjoint steps. Note, that it may be required to evaluate
FN once to initialize the adjoints. This evaluation can be introduced right af-
ter the evaluation of F N - 1 for p = Cl. For illustration purposes, we suppose
throughout that all time steps and all adjoint steps have the same temporal
complexity normalized to 1. However, to apply the presented optimal check-
pointing techniques, only the identical temporal complexity of all time steps
is required. In this example, 24 time steps are performed. Hence, the number
1 10 20 30 40
,t
10
1
• level 1
• level 2
• level 3
N
where N r denotes the number of time steps for which the adjoint can be
calculated using the specific r-level checkpointing. The corresponding memory
requirement equals Mr. The number of time step evaluations required for the
adjoint calculation is given by TTl since at the first level c1Nr / (Cl +1) time steps
have to be evaluated to reach the second level. At the second level, one group
of time steps is divided into C2 + 1 groups. Hence, c2(Nr/Cl + 1)/(c2 + 1) time
steps have to be evaluated in each group to reach the third level. Therefore, we
obtain (Cl + 1)C2(Nr/Cl + 1)/(c2 + 1) = C2Nr/(C2 + 1) at the second level and
so on. It follows that each time step Pi is evaluated at most r times. Hence,
if we apply two-level checkpointing, each time step is evaluated no more than
two times.
The two- as well as the multi-level checkpointing technique have the draw-
back that at each level the checkpoints are not reused. Each checkpoint stores
at each level only one state and becomes idle as soon as the data that is stored
in the checkpoint has been used for the adjoint calculation. A method that
reuses the checkpoints as soon as possible is proposed in the next section.
When one applies the checkpointing technique proposed in [5], the adjoint
values are again generated piece by piece but only one state is employed for
838 A. Walther, A. Griewank
(c
n = N r_ + r) ,
r-1
where r the unique integer satisfying
(2)
1 10 20 30 40 50 60
. ' t
10
The integer r has the same meaning for both checkpointing approaches, namely
the maximal number of times any particular time step Fi is evaluated during
the adjoint calculation. Hence for comparing both approaches, assume at the
beginning that r has the same value and that the same amount of memory is
used, i.e. Mr = Mb = C.
Now, we examine the maximal number of time steps N* for which an
adjoint calculation can be performed using the two approaches. Assuming that
r is a divisor of c and Mr = c, one obtains the identity
C
with Ci=-, i=l, ... ,r,
r
N*
_b
N;
~ (c+r) r )-r
r (C-+1 =-1-
v27fr
(C-+1 )C ~--exp(r).
r 1
v27fr
Hence, the ratio of Ni; and N; grows exponentionally in r without any de-
pendence on the number of available checkpoints. Fig. 4 shows N; and Ni; for
the most important values 2 ::::: r ::::: 5. Since r denotes the maximal number
z
50000 50000 ./
*Z 40000 40000 .
30000 30000
20000 20000
10000 10000
0 O'--~""-~=...-~~~~~----'
0 5 10 15 20 25 30 35 40 45 50 o 5 10 15 20 25 30 35 40 45 50
#checkpoints #checkpoints
of times each time step is evaluated, we have the following upper bounds for
the number of time steps evaluated during the adjoint calculation using r-level
checkpointing and binomial checkpointing, respectively:
Advantages of Binomial Checkpointing 841
Tr
c
= C ( :;: + 1
)r-l < r N; and .Lb
rr> = r N*b - (c + r) <
r-l
r Nb* .
For example, it is possible to compute the adjoint for N = 23000 time steps
with only 50 checkpoints, less than 3N time step evaluations, and N adjoint
steps using binary checkpointing instead of three-level checkpointing, where
N; ::; 5515. If we allow 4N time step evaluations then 35 checkpoints suffice
to compute the adjoint for 80000 time steps using binomial checkpointing,
where Nt ::; 9040. These number are only two possible combinations taken
from Fig. 4 to illustrate the really drastic decrease in memory requirement
that can be achieved if binomial checkpointing is applied.
However, usually the situation is the other way round, i.e. one knows N
andlor c and wants to compute the adjoint as cheap as possible in terms of
computing time. Here, the first observation is that r-level checkpointing intro-
duces an upper bound on the number of time steps the adjoint of which can
be computed, because the inequality N ::; (clr + It must hold. Furthermore,
binomial checkpointing allows for numerous cases also a decrease in run-time
compared to the uniform checkpointing. For a given r-level checkpointing and
Mr = c, one has to compare Tr and T b. Let rb be the unique integer satisfying
(2). Since at least one checkpoint has to be stored at each level, one obtains
the bound r ::; c. I.e., one must have c >= log2(N) to apply uniform check-
pointing. Therefore, the following combinations of rand rb are possible for the
most important, moderate values of r:
For 3 ::; r ::; 5, one easily checks that Tr > n holds if rb < r. For r = rb, one
can prove the following, more general result:
Theorem 2. Suppose for a given N and a r-level checkpointing with Mr = c
that the corresponding rb satisfying (2) coincide with r. Then, one has
T2 = 2N - c- 2 = n ifr = rb = 2
Tr >n if r = rb > 2.
Proof: For rb = r = 2 the identity T2 = n is clear. For r = rb > 2, the
inequality
5 Conclusions
This article discusses several checkpointing techniques, namely multi-level
checkpointing and binomial checkpointing. A detailed analysis of the num-
ber of time steps the adjoint of which can be calculated, the run-time needed
for the adjoint calculation and the memory requirement is given.
One can conclude that binomial checkpointing allows adjoint calculations
with a surprisingly small fraction of the memory needed by the basic approach.
This storage reduction causes only a very moderate increase in run-time. On
the other hand, we see that r-level checkpointing induces for a given number of
checkpoints an upper bound on the number of time steps the adjoint of which
can be computed. This upper bound can only be increased by introducing a
next level of checkpointing. In addition it is shown that the run-time required
for the adjoint calculation with r-level checkpointing exceeds the run-time
needed for binomial checkpointing for the most important values of r > 2,
whereas for r = 2 both methods yield the same run-time. However, for r = 2
and a given amount of memory, binomial checkpointing allows the adjoint
computation for a larger number of time steps. Hence, even for r = 2 binomial
checkpointing is preferable.
Moreover, it is quite often the case that the number N of time steps is not
known a-priori, for example due to an adaptive time stepping method. Then,
it becomes difficult to distribute the checkpoints for the two- or multi-level
checkpointing such that the minimal run-time is attained. For binomial check-
pointing the extension a-revolve deals with the unknown number of time steps
by using a heuristic for the checkpoint placements. In addition, a-revolve can
also handle time steps with varying temporal complexity. For time steps the
cost of which do not change drastically, the heuristics implemented in a-revolve
work well such that the corresponding adjoint calculation is only a few percent-
ages slower than the one based on revolve [9]. Hence, binomial checkpointing
provides memory-reduced adjoint calculation also in more general situations.
References
1. Charpentier, I. (2001): Checkpointing schemes for adjoint codes: Application to
the meteorological model Meso-NH. SIAM J. Sci. Comput., 22, 2135-2151
2. Gockenbach, M., Reynolds, D., Symes, W. (2002): Efficient and Automatic Im-
plementation of the Adjoint State Method. Trans. Math. Soft., 28, 22-44
3. Griesse, R. (2003): Parametric Sensitivity Analysis in Optimal Control of a
Reaction-Diffusion System-Part II: Practical Methods and Examples. To appear
in Opt. Meth. Soft.
Advantages of Binomial Checkpointing 843
Summary. This paper uses the fictitious boundary method described in [1] for the
solution of incompressible flow with moving rigid bodies in complex geometries. The
method is based on a special treatment of Dirichlet boundary conditions inside of
a FEM approach in the context of a hierarchical multigrid scheme such that the flow
can be efficiently computed on a fixed computational mesh while the solid boundaries
are allowed to move freely through the given mesh. In this paper, we focus on the
calculations of the drag and lift forces acting on the moving solid bodies which are
not captured by the mesh. The comparison between the present and benchmark
results for the flow around a circular cylinder with different Reynolds numbers is
first presented, and then the result for a circular cylinder oscillating in a channel
is given. The simulation results compared with corresponding reference results are
found to be very reasonable and satisfactory.
1 Introduction
(3)
a(x) = {~ for
for
x E [le,
x E [l,
(4)
where x denotes the coordinates of the edge midpoints of cells, [le is the
domain occupied by the rigid bodies, [l is the fluid domain, the whole domain
is [IT = [l U [le. The importance of such a definition of the parameter can be
An Efficient Multigrid FEM Solution Technique 847
seen from the fact that the gradient of a is zero everywhere except at the wall
surface of the rigid bodies, and equal to the normal vector n defined on the
grid [7, 12], i.e.
n = -Va. (5)
The total stress tensor a- of the fluid flow is
(6)
Hence the forces acting over the wall surface of the rigid bodies can be
computed by
FT = 1 DT
a-ndQ = -1
DT
a-VadQ. (7)
The drag force and lift force can be obtained from the Eq.(7),
FD = - 1 [ (-aau-oaa +aua-yayoa)
DT
J.L
x x
- oa]
- p - dQ,
ax (8)
FL =- 1 [ (---+---
DT
ovoa ovoa)
J.L
a a ayayx x
-p-
oa] dQ.
ay (9)
Therefore through Eq.(8), Eq.(9) and Eq.(3) we can calculate the new
drag and lift coefficients (Cd and Cz) via the volume integral over the whole
domain Q T instead of the surface integral over the wall surface of the rigid
bodies in Eq.(2). The integral over each element covering the whole domain
QT is evaluated with the standard 3 x 3 point Gaussian quadrature. Since the
gradient Va is non-zero only at the wall surface of the rigid bodies, thus the
volume integrals need to be computed only in one layer of mesh cells around
the rigid bodies. It is convenient for the present fictitious boundary method to
calculate the Cd and Cz.
3 Numerical tests
This section consists of two parts. The first part presents a quantitative exam-
ination for the benchmark case of flow around a circular cylinder with Re = 20
and 100 solved by the present fictitious boundary method. The second part
gives the computing results for a circular cylinder oscillating in a channel. For
comparison, corresponding reference results are also presented.
848 D. Wan et al.
1111111111111 I II I I
(b) channel mesh I (LEVEL = 1)
.. -....
.:;~....................
310~---;;-';;;'-------;;';-------C~-----;CO'--'-----C~-----;C'---------;C;;;---;! "o~---;;-';;;'-------;;';-------C~-----;Co'--'-----c~-----;C,__----;C;;;--:!
, ,
Fig. 2. Periodical results of Cd and Cl for Re = 100
850 D. Wan et al.
IIII~IIIIIIIIIIIIIIIIIIIIII_IIII
(a) channel mesh (LEVEL = 1) (b) body-conformal mesh (LEVEL = 2)
Fig. 3. Coarse meshes used for the oscillating cylinder in a channel
(a) t = to (b) t = to + ~T
(c) t = to + ~T (d) t = to + ~T
Fig. 4. Vorticity contour plot for an oscillating cylinder in a channel
852 D. Wan et al.
4 Conclusions
The presented fictitious boundary method for simulating incompressible flows
with moving rigid bodies in complex geometries has been validated in a two-
dimensional configuration. We showed that the use of an arbitrarily fixed (un-
structured) FEM background mesh is accurate enough to calculate those sen-
sitive quantities (drag coefficient and lift coefficient) on the wall surface of the
cylinder. Comparisons of the results using a body-fitted mesh and a fixed
structured mesh show good agreement. The advantage of the present method
is that since the body motion is independent of the mesh, problems associated
with mesh reconfiguration and motion are avoided, computations on a fixed
grid are cheaper than on a body-fitted one, and finally, the extension of the
method to 3D is straightforward. It is also worthy to note that the availability
of the present method to accurately compute the forces acting on the moving
rigid bodies provides a good and solid base for further study of particulate flow
as well as the interaction between fluid and structure as proposed by Glowinski
in the paper [4].
References
1. Turek, S., Wan, D.C. and Rivkind, L.S.: The Fictitious Boundary Method for the
implicit treatment of Dirichlet boundary conditions with applications to incom-
pressible flow simulations. Lecture Notes in Computational Science and Engineer-
ing, Volume 35, Springer Verlag, (2003)
An Efficient Multigrid FEM Solution Technique 853
2. Feng, J., Hu, H.H and Joseph, D.D.: Direct simulation of initial value problems
for the motion of solid bodies in a Newtonian fluid; part 2, Couette and Poiseuille
flows. J. Fluid Mesh. 277 (1994) 271
3. Hu, H.H.: Direct simulation of flows of solid-liquid mixtures. Int. J. Multiphase
Flow. 22 (2) (1996) 335
4. Glowinski, R, Pan, T.W., Hesla, T.r. and Periaux, J.: A fictitious domain ap-
proach to the direct numerical simulation of incompressible viscous flow past
moving rigid bodies: application to particulate flow. J. Comput. Phys. 169 (2001)
363
5. Glowinski, R: Handbook of numerical analysis (Volume IX): Numerical methods
for fluids (Part 3). Ciarlet, P.G and Lions, J.L. editors, North-Holland, (2003)
6. Heywood, J., Rannacher, R and Turek, S.: Artificial boundaries and flux and
pressure conditions for the incompressible Navier-Stokes equations. Int. J. Numer.
Meth. Fluids 22 (1996) 325
7. Duchanoy, C. and Jongen, T.RG.: Efficient simulation of liquid-solid flows with
high solids fraction in complex geometries. Compters and Fluids 32 (2003) 1453
8. John, V.: Higher order finite element methods and multigrid solvers in a bench-
mark problem for the 3D Navier-Stokes equations. Int. J. Num. Meth. Fluids 40
(2002) 775
9. Schafer, M. and Turek, S.: Benchmark computations oflaminar flow around cylin-
der. in E.H. Hirschel (editor) Flow Simulation with High-Performance Computers
II. Volume 52 of Notes on Numerical Fluid Mechanics, Vieweg (1996) 547
10. Turek, S.: Efficient Solvers for Incompressible Flow Problems. Springer Verlag,
Berlin-Heidelberg-New York (1999)
11. Turek, S. et al.: FEATFLOW . Finite element software for the incompressible
Navier-Stokes equations: User Manual, Release 1.2 (1999) (www.featflow.de)
12. Brackbill, J.U., Kothe, D.E. and Zemach, C.: A continuum method for modelling
surface tension. J. Comput. Phys. 100 (1992) 335
Higher-Order FEM for a System of Nonlinear
Parabolic PDE's in 2D with A-Posteriori Error
Estimates
1 Introduction
(it stands for au/at) where u = (Ul,"" UNeq ) is the solution, a and fare
smooth vector-valued functions, n is the unit outward normal to the bound-
ary and 0[7 = rp u riN for all i = 1, ... , N eq . Further we denote g =
(gl, ... , 9Neq )· The 'V (nabla) operator is defined as usual, 'V = (0/ aXl, 0/ aX2).
The vector-valued coefficient a = (al,"" aNeq ) is bounded,
where t E (0, T) and the form of the space V C (Hl )Neq is dictated by rp,
1 :::; i :::; N eq . The symbol (.,.) stands for the L2([7) scalar product.
Following the concept of MOL, we discretize the spatial variable first and
leave the temporal variable continuous. Consider a finite element mesh Th,p cov-
ering il, where a polynomial order P(Ki) ~ 1 is associated with each element
Ki E Th,p, 1 :::; i :::; N elem . Let Vh ,p([7) be an appropriate piecewise-polynomial
subspace of V. We pose the semidiscrete problem to find Uh,p E Vh ,p([7) for
all t E J, such that
856 M. Zitka et al.
(Uh,p(t), X(x))+(a(uh,p, 'VUh,p, x, t)'VUh,p(X, t), 'VX(x))
= (f(Uh,p, 'VUh,p, x, t), X(x)) + (g(x, t), X(x ))rN, (6)
Uh,p(X,O) = Vh,p(X),
for all X E Vh,p(D) and t E J. Here, Vh,p E Vh,p(D) is the Hl-projection of
v E V(D).
3 Finite-dimensional spaces
Let Th,p be a mesh on D consisting of N e1em disjoint triangles K i , i =
1, ... ,Ne1em . The polynomial order P(Ki) ::::: 1 is assigned to each element
K i . Local polynomial orders on edges are determined using the minimum rule
(minimum of polynomial orders on adjacent elements). Polynomial orders on
all elements are collected into a vector p = {p(K l ),p(K2 ), ... ,p(KNelem)}' We
further define
By Uh,p we denote the elliptic projection of the exact solution to the space
Vt,p,o (fl) (i.e. V~,p,o (S?) with q = p). The main idea of the error estimation ap-
proach can be outlined as follows: The problem is solved with Uh , p E v'hP,p, o(D).
Vt+~(D)
Then the problem is solved once again with Uh'p ,E p ,
(i.e. V~,p, o(D) with
q = p + 1). The estimate is based on the difference Uh,p - Uh,p'
Let us begin with the identity
or
holds for almost every t E J and all functions X E Vt,~,~(D), and if identity
(11) holds for t = 0 and the same functions X.
Further, let us introduce a function eh,p E Vt,~,~(D) such that Uh,p + eh,p is
the elliptic projection of U to Vt,~,~(D).
858 M. Zitka et al.
Let £h,p and EpN depend on t in a sufficiently smooth way. Then there exists
a constant C > 0 such that
(18)
Remark 1. Analogous propositions can be shown also for II'rJPLlh and II'rJELlll
with minor differences in the proof.
By rearranging terms, substituting 'rJ E V{~,~ (D) for X, applying the Schwarz
inequality and using the fact that
(22)
Higher-Order FEM for Parabolic PDE's 859
where S > O. We omit the term with 11\7'1]11 2 on the left-hand side. Integrating
on both parts of (22), we find
if the corresponding functions are Lebesgue integrable on the interval [0, T].
Employing now (17), assuming that the corresponding functions are again
Lebesgue integrable on [0, TJ, and applying Gronwall's lemma, we obtain the
bound
(24)
Turning now back to the inequality (22) and assuming that 11'1]11 is nondecreas-
ing, we can write
(25)
Using (24) and (25), we finally obtain from (22) that
8= IIEI11. (27)
IIel11
The following theorem contains a principal result related to limh-+o 8:
Theorem 1. Let u(t) E HP+1 n HJ and Uh,p(t) E Vt,p,oU?) be the solutions
of (5) and (6) for all t E J. Let E E vt,;,1U?) be the solution of (10), (11)
(for EpN ), (11), (12) (for EpL ), or (13) (for EEL). Then, under appropriate
regularity assumptions,
lim8=1 (28)
h-+O
for almost every t E J, where 8 is 8 PN , 8 PL or 8 EL .
Proof. The proof is rather technical (see [10]). Its main idea is the same as in
the ID case presented in [5]. 0
5 Numerical results
au au au au )
C = diag ( C 1 (U, -,:;-, -,:;-, x, t), ... ,CNeq (U, -,:;-, -,:;-, x, t)
uXl uX2 uXl uX2
and a source term vector
au au
( F1(U, -,:;-, au au )T
F = -,:;-, x, t), ... FNoq (U, -,:;-, -,:;-, x, t)
uXl uX2 uXl uX2
Remark 2. In (30), we apply a/aXl and a/aX2 to vectors. By this notation we
mean that these operators are applied to each component of the vector.
The vector-valued initial condition has the standard form U;(x,O) = UP(x),
i = 1, ... ,Neq. To each solution component U;, i = 1, ... ,Neq , we can prescribe
either Dirichlet or Neumann boundary conditions. The Dirichlet boundary
conditions have the form
where n = (nl' n2) is the outer unit normal to the boundary. One can prescribe
various types of boundary conditions on different parts of the boundary.
862 M. Zitka et al.
The solver distributes the polynomial orders P(Ki) in the mesh Th,p accordingly
to user's input and uses the above mentioned minimum rule to decide about
local polynomial orders associated with edges. Partially novel Lobatto-based
hierarchic shape functions (see [8]) up to the 10th order were used in order to
obtain finite elements with excellent conditioning properties. For the sake of
efficiency, economical Gauss quadrature points and weights from the CD-ROM
[8] up to the 20th polynomial order were utilized.
The spatial semidiscretization yields a system of nonlinear ordinary differ-
ential equations (ODE's). The user can choose either to use the built-in implicit
Euler scheme which is only first-order accurate, or to utilize absolutely sta-
ble implicit higher-order adaptive ODEPACK subroutines. These subroutines
are very sophisticated and one of their significant advantages is that they are
based on explicit evaluation of the right-hand side of the ODE system - thus,
no linearization whatsoever is needed by the user. The solvers are capable of
numerically obtaining information about the Jacobi matrix of the right-hand
side that is needed for the backward differentiation formula (BDF) on which
they are based.
6.3 Outlook
References
1. Demkowicz, L., Oden, J. T., Rachowicz, W., Hardy, O. (1989): Toward a universal
hp-adaptive finite element strategy. Part 1: Constrained approximation and data
structure, Comput. Methods Appl. Math. Engrg., 77, 79-112
2. Demkowicz, L., Rachowicz, W., Devloo, Ph. (2002): A fully automatic hp-
adaptivity, J. Sci. Comput., 17, 127-155.
3. Moore, P.K. (1994): A posteriori error estimation with finite element semi- and
fully discrete methods for nonlinear parabolic equations in one space dimension.
SIAM J. Numer. Anal., 31, 149-169
4. Rachowicz, W., Oden, J. T., Demkowicz, L. (1989): Toward a universal hp-
adaptive finite element strategy. Part 3: Design of hp meshes, Comput. Methods
Appl. Mech. Engrg., 77, 181-212
5. Segeth, K. (1999): A posteriori error estimation with the finite element method
of lines for a nonlinear parabolic equation in one space dimension. N umer. Math.,
83,455-475
Higher-Order FEM for Parabolic PDE's 863
6. Segeth, K., Solfn, P., Kocifik, M. (2002): Some algorithmic aspects of higher-
order finite element schemes in multi dimensions. In: Software and Algorithms
of Numerical Mathematics 14. (Proceedings of Summer School, Kvilda 2001),
Plzen, University of West Bohemia, 199-221
7. Solfn, P., Demkowicz, L. (2003): Goal-Oriented hp-Adaptivity for Elliptic Prob-
lems, Comput. Methods App!. Mech. Engrg., accepted
8. Solfn, P., Segeth, K., Dolezel, 1. (2003): Higher-Order Finite Element Methods.
Chapman & Hall I CRC Press, Boca Raton, FL
9. Solin P. (2000): On a mesh generation technique based on a special smoothing
procedure for uniform inner point distribution, Acta Technica CSAV 45, 397-417
10. Z{tka, M. (2002): Higher-Order FEM for a System of Nonlinear Parabolic PDE's,
Diploma Thesis, Charles University, Prague