(Lecture Notes in Applied and Computational Mechanics 81) Kerstin Weinberg, Anna Pandolfi (Eds.)-Innovative Numerical Approaches for Multi-Field and Multi-Scale Problems_ in Honor of Michael Ortiz's 6
(Lecture Notes in Applied and Computational Mechanics 81) Kerstin Weinberg, Anna Pandolfi (Eds.)-Innovative Numerical Approaches for Multi-Field and Multi-Scale Problems_ in Honor of Michael Ortiz's 6
Kerstin Weinberg
Anna Pandolfi Editors
Innovative Numerical
Approaches for Multi-
Field and Multi-Scale
Problems
In Honor of Michael Ortiz’s 60th
Birthday
Lecture Notes in Applied and Computational
Mechanics
Volume 81
Series editors
Friedrich Pfeiffer, Technische Universität München, Garching, Germany
e-mail: [email protected]
Peter Wriggers, Universität Hannover, Hannover, Germany
e-mail: [email protected]
About this Series
Editors
Innovative Numerical
Approaches for Multi-Field
and Multi-Scale Problems
In Honor of Michael Ortiz’s 60th Birthday
123
Editors
Kerstin Weinberg Anna Pandolfi
Universität Siegen Politecnico di Milano
Siegen Milan
Germany Italy
During the past two decades research in the field of computational mechanics has
progressed remarkably, mainly because of the development of a sound mathe-
matical background and the introduction of new efficient computational strategies.
Beyond the classical finite element method, several innovative techniques and novel
approaches for the analysis of microstructural evolution, growth, damage, and
structural failure in multi-field and multi-scale problems have emerged vigorously.
With the aim to discuss different computational strategies for multi-field and
multi-scale problems, a remarkable group of scientists gathered in September 2014
to the IUTAM symposium “Innovative numerical approaches for materials and
structures in multi-field and multi-scale problems”. Hosted by the University of
Siegen, the venue of the symposium was the Castle Burg Schnellenberg, a mighty
fortress located in the green heart of Westphalia, Germany. There we discussed the
new horizons and perspectives of multi-field applied mechanics. The symposium
covered a large domain of recent research, from computational materials modeling,
crystal plasticity, micro-structured materials, and biomaterials to multi-scale sim-
ulations of multi-physics phenomena. The pioneering discretization methods for the
solution of coupled nonlinear problems at different length scales were particularly
emphasized.
The special occasion that motivated the organization of the symposium was the
60th birthday of Professor Michael Ortiz. Along his exceptional career, Michael
Ortiz has been at the forefront of computational mechanics, his work being a
constant source of inspiration for many. All participants of this symposium are
grateful to Michael Ortiz for being such an enthusiastic collaborator, a reliable
colleague, an illuminating scientist, and a valuable friend.
The friendship and fellow-feeling felt during the symposium inspired the idea to
collect the presentations of some of the convened researchers in a special book. Our
choice was to organize a book as part of the series ‘Lecture Notes in Applied and
Computational Mechanics’ (LNACM), which aims to document new high-level
developments in applied and computational mechanics. We are happy to present
here 13 high-quality contributions of current and past collaborators of Michael
v
vi Preface
Ortiz. All contributions have undergone full peer review, and we take the occasion
to thank the reviewers for their valuable comments.
It is our hope that the present volume will give the reader an insight into the
exciting new developments of computational solid mechanics which is still wide
open to discovery. The book attempts to provide a flavor of this challenging field
and to contribute to its popularity within the mechanics and physics communities.
vii
viii Contents
1 Introduction
M. Tanaka (B)
Toyota Central R&D Labs., Inc., Nagakute, Japan
e-mail: [email protected]
D. Balzani
Institute of Mechanics and Shell Structures, TU Dresden, Dresden, Germany
e-mail: [email protected]
J. Schröder
Institute of Mechanics, University Duisburg-Essen, Essen, Germany
e-mail: [email protected]
© Springer International Publishing Switzerland 2016 1
K. Weinberg and A. Pandolfi (eds.), Innovative Numerical Approaches
for Multi-Field and Multi-Scale Problems, Lecture Notes in Applied
and Computational Mechanics 81, DOI 10.1007/978-3-319-39022-2_1
2 M. Tanaka et al.
h 2 h 3
f (x + h) = f (x) + h f (x) + f (x) + f (x) + · · · . (1)
2! 3!
The often-used classical FD schemes consider a real-valued small perturbation h and
neglect the higher order terms O(h 2 ) and thus end up in the approximation
f (x + h) − f (x)
f FD (x) ≈ , (2)
h
[ f (x + ih)]
f CSDA (x) ≈ . (3)
h
denotes the operation of taking the imaginary part of complex functions. This
expression provides a high accuracy for very small values of h being remarkably
close to the analytic ones. This is due to the fact that no roundoff errors occur in
the application of the perturbation itself and technically perturbation values up to
e.g. h = 10−99 are possible. The main drawback of this approach is that higher-
order derivatives can not be directly computed and combinations of FD and CSDA
have to be considered, see [13]. This however still suffers from the problems of the
FD method. Automatic differentiation (AD) techniques, which are typically derived
based on repeated application of the chain rule of differentiation, see e.g. [15], are
alternatives to compute highly accurate derivatives. However, these schemes are dif-
ficult to be derived and formulated, in particular for tensorial derivatives. An equiva-
lent with a forward type AD, but more practicable approach can be accomplished by
using dual numbers, which have been originally introduced by Clifford [5]. Fike [6]
developed a higher dimensional version of the dual numbers, the hyper-dual num-
bers (HDNs), which are mainly characterized by the consideration of more than one
imaginary axis. The HDNs of second-order have two non-real unit numbers ε1 and
ε2 with the properties
which implies that each of the numbers squared is 0 and products of them are com-
mutative. In order to better understand calculations with hyper-dual numbers we
4 M. Tanaka et al.
shortly recapitulate some basic characteristics and operations. Given the two HDNs
a = (a1 + a2 ε1 + a3 ε2 + a4 ε1 ε2 ) and b = (b1 + b2 ε1 + b3 ε2 + b4 ε1 ε2 ), a and b are
equal if and only if all of their real and non-real parts are equal, i.e., a1 = b1 , a2 = b2 ,
a3 = b3 , a4 = b4 . We define the symbols ε1 , ε2 and ε1 ε2 which denote the opera-
tions of taking the imaginary parts of HDNs such that ε1 [a] = a2 , ε2 [a] = a3 and
ε1 ε2 [a] = a4 . As proposed by Fike [6] the HDSD for a scalar function f := f (x)
of a single scalar-valued argument x is obtained by inserting hε1 + hε2 into h in (1)
with exactly vanishing higher order terms O(h 3 ). The first derivative f (x) can then
be obtained by taking the coefficient of ε1 or ε2 , i.e.
The second derivative f (x) can be obtained by taking the coefficient, i.e.
In this section, different methods for the numerical approximation of the derivatives
of stress tensors with respect to deformation tensors, known as tangent moduli, are
compared. For this purpose numerical differentiation schemes pointed out in the
previous section are extended to directional derivatives of tensor fields.
Z(X + h A) − Z(X) ∂Z
DZ(X)[ A] := lim = : A. (7)
h→0 h ∂X
∂Z Z(X + h A) − Z(X)
: A≈ , (8)
∂X h
Second-order derivatives can be directly calculated by applying this formula also to
the second derivatives and using first-order FD for the individual first-order deriva-
tive function evaluations. By replacing the FD term by the complex step derivative
approximation, one obtains
∂Z [Z(X + ih A)]
: A≈ . (9)
∂X h
Unfortunately, the direct application of CSDA also to the second derivative yields
an expression which is not free from associated round-off errors and thus it may
also combined with the FD for the second derivative. The only approach which
enables second-order derivative approximation of high accuracy independent of the
choice of the perturbation values is based on hyper dual numbers. Such second-order
derivatives may be important in the context of hyperelasticity when considering the
strain energy function whose first derivative yields the stress and the second derivative
the tangent moduli. Thus, first and second directional derivatives of a scalar function
with respect to deformation tensors are considered now. Let A, B be arbitrary second-
order tensors on R2 or R3 . Then, the directional derivative of a scalar function z(X)
of a second-order tensor argument X in direction A is given by
z(X + h A) − z(X) ∂z
Dz(X)[ A] := lim = : A, (10)
h→0 h ∂X
with ∂z/∂ X denoting a second-order tensor. The second order directional derivative
of a scalar function z(X) of a second-order argument X in the two directions A, B
is
z(X + h A + k B) − z(X + h A) − z(X + k B) + z(X)
D2 z(X)[ A, B] := lim
h,k→0 hk
∂ z
2
= A: : B, (11)
∂ X∂ X
with small perturbation values h, k. Provided that appropriate energy, stress, strain
and directional tensors are substituted in z, Z, X and A, respectively, the required
tangent moduli can be automatically derived as shown in the following.
The material tangent moduli C are defined as the derivative of S with respect to E:
∂S ∂ 2ψ ∂S ∂ 2ψ
C= = , or C = 2 =4 . (14)
∂E ∂ E∂ E ∂C ∂ C∂ C
∗ ∗
KL KL
Now, substitute S in Z, C in X, and C in A in (8) and (9), where C is defined
as
∗ 1 K
C KL
= (G ⊗ G L + G L ⊗ G K ). (15)
2
Then, one obtains for the finite difference and the complex-step-derivative approxi-
mation scheme
∗
S I J (C + h C KL
) − S I J (C)
CFD
I J KL
= , (16)
h
∗
S I J (C + ih C KL
)
CCSDA
I J KL
=2 . (17)
h
Robust Numerical Schemes for an Efficient Implementation … 7
As mentioned above, the function evaluations for the second Piola-Kirchhoff stresses
could be replaced by numerical approximations in terms of the FD or the CSDA
approach. However neither of them would lead to a method without round-off errors.
This can be achieved by using hyper dual numbers. For this purpose consider Eq. (12)
∗ ∗ ∗ ∗
and substitute z by ψ, X by C, A by C IJ
and B by C KL
, where C IJ
and C KL
are
defined in (15), then one obtains
∗
ε1 ψ(C + hε1 C )
IJ
IJ
SHDSD =2 , (18)
h
∗ ∗
ε1 ε2 ψ(C + hε1 C IJ
+ kε2 C KL
)
CHDSD
IJKL
=4 . (19)
hk
Herein, α1 > 0, α2 > 0, α3 > 0, β1 > 0 and β2 > 2 are material parameters, n f is
the number of fiber families and • denotes the Macaulay bracket. The invariants
are given by I1 = tr[C], I3 = det[C], J4(a) = tr[C M (a) ] and J5(a) = tr[C 2 M (a) ] with
the structural tensor M (a) = a0(a) ⊗ a0(a) , where a0(a) denotes the fiber direction.
Herein, Rθ1 , Rθ2 , Rθ3 are the individual rotation tensors and the deformations F 0
contain dilatation and shear deformation. They are chosen as
⎡ ⎤ ⎡ ⎤
cos π4 − sin π4 0 cos π3 0 sin π3
Rθ1 = ⎣ sin π4 cos π4 0 ⎦ , Rθ2 = ⎣ 0 1 0 ⎦,
0 0 1 − sin π3 cos π3
⎡ ⎤ ⎡ ⎤
1 0 0 1.1 γ γ
Rθ3 = ⎣ 0 cos π6 − sin π6 ⎦ , F 0 = ⎣ 0 0.9535 γ ⎦ , (22)
0 sin π6 cos π6 0 0 0.9535
γ is the amplitude of the shear deformation. The material parameters are chosen
as α1 = 1.0, α2 = 1.0, α3 = 0.1, β1 = 1.0 and β2 = 3.0. Two preferred directions
T
are considered, i.e. n f = 2, which are defined as a0(1) = 1/3 1 2 2 and a0(2) =
√ T
1/ 5 2 1 0 . In this example we focus on the numerical performance of computing
the tangent moduli by using different numerical schemes, i.e., the FD approach (16)
and the CSDA scheme (17) starting from the stress tensor, and the HDSD scheme (19)
starting from the strain energy function (20). We compare their results with the ones
of an analytically derived tangent modulus (see [31, 32] for the detailed expressions
of stress and tangent moduli of the model (20)). In order to compare the results the
relative error eC is defined as
21 21
IJKL 2 IJKL 2
eC = (Canalyt ) I J K L − (Capprox ) / (Canalyt ) ,
I,J,K ,L I,J,K ,L
(23)
where (Canalyt ) I J K L denotes the coefficients of the analytic material tangent modulus
tensor in the Cartesian coordinate system, (Capprox ) I J K L denotes the values of the
approximated counterparts. This relative error eC is depicted for each numerical
approximation scheme in Fig. 1. As can be seen, the FD approach shows a quite
sensitive behavior having its optimal accuracy of eC ≈ 10−7 at perturbation value of
≈ 10−7 . For increasing and decreasing perturbation values the error increases. The
CSDA approach already reaches an accuracy of eC ≈ 10−7 for perturbations smaller
than 10−4 and reaches an error at computer accuracy for perturbation values smaller
than approximately 10−9 . The HDSD is independent on the perturbation value and
always yields an error of eC ≈ 10−16 which is computer accuracy.
(a) p0 (b)
y 16
44
44
48 [mm]
Fig. 2 a Geometry with boundary conditions. The beam is fixed in all directions at x = 0 (dashed
area) and a distributed load is applied at the opposite side in vertical direction leading to a bending-
dominated deformation. b Deformed configuration with distribution of Kirchhoff stresses τ 11 in
x-direction
10 M. Tanaka et al.
compute stresses and moduli numerically based on the strain energy function. As
a third option we implement the analytic expression for the first derivative of the
strain energy and compute then numerically the tangent moduli using the CSDA
scheme (17). The Kirchhoff stress distributions τxx in the most deformed configu-
ration are depicted in Fig. 2b. For any numerical differentiation scheme, the stress
distribution is very similar to the analytic implementation of stress and moduli. The
Euclidean norm of the residual vector of the discretized weak form of equilibrium for
the calculations based on the analytic tangent, the FD and HDSD approach is plotted
in Table 1, where h S = h C = 1 is considered for the HDSD. Even the values of the
residuals are almost identical for the HDSD method compared with the ones resulting
from an analytic implementation of stress and moduli. The values for the FD method
differ by orders of magnitudes and thus, no quadratic convergence is observed. Next,
the computing time of the different approaches to calculate the stresses and tangent
moduli is compared in Fig. 3. As expected, the computing time of the FD method
depends on the perturbation values since a different number of iterations results
therefrom. When considering the best performing FD scheme with the perturbation
values h S = 10−4 and h S = 10−6 , the lowest computing time is obtained for coarse
meshes; the HDSD however, is only slightly slower. This behavior changes for an
increasing number of elements and at a reasonable discretization the HDSD becomes
even faster than the FD method with optimal values. This is due to the fact that with
an increasing number of elements the amount of time required for the solution of the
system of equations becomes dominant compared to the calculation of the stiffness
matrix, which is influenced by the numerical approximation of stress and moduli.
Then, for finer meshes the number of iterations and thus the advantage of the HDSD
and CSDA method becomes more important. It is emphasized that the optimal choice
of perturbation values is of course not known in general and thus, the FD method
Table 1 Euclidean norm of the residual of the Cook-type problem based on the FD and HDSD
scheme; red colors indicate that no quadratic convergence could be obtained
FD with hS = 10−4
Analytical HDSD
−4
hC = 10 hC = 10−6 hC = 10−8
5.4504 × 10−1 6.3167 × 10+1 6.3167 × 10+1 6.3167 × 10+1 5.4504 × 10−1
1.8568 × 10+2 9.3803 × 10+1 9.0099 × 10+1 9.0116 × 10+1 1.8568 × 10+2
1.5341 × 10+0 2.0612 × 10+0 2.4092 × 10+0 2.4096 × 10+0 1.5341 × 10+0
3.4458 × 10+0 3.2625 × 10+0 3.2628 × 10+0 3.2630 × 10+0 3.4458 × 10+0
8.4932 × 10−3 6.5687 × 10−3 1.4598 × 10−2 1.4692 × 10−2 8.4932 × 10−3
2.0186 × 10−4 4.2600 × 10−4 5.3153 × 10−4 5.7587 × 10−4 2.0187 × 10−4
5.0665 × 10−8 2.3998 × 10−5 2.0946 × 10−6 5.5211 × 10−6 5.0161 × 10−8
− 2.3392 × 10−6 3.3563 × 10−7 3.3765 × 10−7 −
− 4.2348 × 10−7 − − −
− 3.3564 × 10−7 − − −
Robust Numerical Schemes for an Efficient Implementation … 11
can typically be expected to be slower since not the optimal perturbation values will
be used. This is shown by the computing time resulting from the values h S = 10−4 ,
h C = 10−4 in Fig. 3 leading to a computing time being significantly higher as for
the HDSD and CSDA. Moreover, the HDSD method is still faster than the CSDA
scheme, although the first and second derivatives are calculated numerically in the
case of HDSD, while the CSDA scheme requires the analytic expressions for the
stresses.
In this section a framework for the fully automatic calculation of internal vari-
ables, stresses and consistent tangent moduli for dissipative materials is proposed.
It mainly consists of applying the HDSD scheme to incremental variational formu-
lations (IVFs). Such formulations, see e.g. [18, 22], recast inelasticity theory as an
equivalent optimization problem where the incremental stress potential within a dis-
crete time interval is minimized in order to obtain the values of internal variables.
The IVFs provide a general framework for a broad range of standard dissipative
constitutive models in incremental stress potential regimes. Throughout this section,
isothermal conditions are considered.
Ortiz and Stainier [22] proposed an effective incremental stress potential W eff within
a discrete time interval Δt := tn+1 − tn as
The deformation gradient tensor and the generalized vector of a collection of internal
variables at time tn+1 are denoted by F n+1 and q n+1 , respectively. The incremental
stress potential W comprises a Helmholtz free energy function ψ and a dissipation
potential φ by
Δq
W (F n+1 , q n+1 ) = ψ(F n+1 , q n+1 ) − ψ(F n , q n ) + Δtφ(q n+1 , ); (25)
Δt
Δq := q n+1 − q n . In many cases the minimization problem (24) is solved by using
a Newton-Raphson iteration and q n+1 is determined using the first and second deriv-
atives ∂q W , ∂qq
2
W by updating
−1
q (k+1) (k)
n+1 = q n+1 − ∂qq W
2 (k)
· ∂q W (k) , k = 0, 1, 2, . . . , (26)
until it converges; (k) denotes the iteration number. The stress response P n+1 at time
tn+1 is given via the effective incremental potential function W eff as
where P is the first Piola-Kirchhoff stress tensor and A is the corresponding nominal
tangent modulus tensor. Note that q n+1 is dependent on the current deformation
gradient F n+1 .
At first, find a solution q n+1 to the minimization problem (24) by using the HDNs
ε1 and ε2 . Here, an asterisk (∗) is used as a superscript denoting the perturbed values
(k)
by means of the HDNs. The internal variables at k-th iteration q n+1 are perturbed by
using the HDN units ε1 and ε2 as
∗ (k) (k)
q n+1 = q n+1 + ε1 i r + ε2 i s , (29)
wherein i r denotes a unit vector of a set of internal variables q n+1 such that the j-th
component of i r is defined as i r j = 1 if j = r and 0 else. Then, the r -th components
of ∂q W (k) and accordingly of ∂qq 2
W (k) are obtained by taking the coefficients with
respect to ε1 and ε1 ε2 such as
∗ ∗
∂q W (k) r
= ε1 W (k)
, ∂qq
2
W (k) rs
= ε1 ε2 W (k)
, (30)
Robust Numerical Schemes for an Efficient Implementation … 13
and q n+1 is updated by Eq. (26). As a second step, the stresses P and tangent moduli
A are obtained by differentiating the minimized effective stress potential W eff with
respect to the deformation gradient F n+1 as
Note that Eqs. (31) and (32) are indeterminate since ∂ F q n+1 and ∂ F2 F q n+1 cannot be
computed using only 2nd-order HDNs. In order to avoid the calculations of ∂ F q n+1
and ∂ F2 F q n+1 , we assume the strict stationary condition
∂q W eff = 0, (33)
and also
Inserting (33) and (34) into (31) and (32), the stresses and tangent moduli in this
formulation are obtained by
λ e 2
ψe = b1 + b2e + b3e + μ (b1e )2 + (b2e )2 + (b3e )2 . (37)
2
14 M. Tanaka et al.
1 1
ψ p = y∞ α − (y0 − y∞ ) exp(−ηα) + hα 2 , (38)
η 2
with α being a strain-like isotropic hardening variable and the material parameters
y∞ , y0 , η and h representing the initial yield strength, the plastic yield strength at the
transition from exponential to linear hardening, the degree of exponential hardening,
and the slope of superimposed linear hardening, respectively. In our formulation, we
identify the internal variable q as
T
q = Fp, α , (39)
with Σ denoting the Mandel stress tensor and β denoting the thermodynamic conju-
gate force to α. The dissipation potential φ is obtained by the maximum-dissipation
principle as
φ= sup Σ n+1 : ΔL p + βn+1 · Δα , (41)
(Σ n+1 ,βn+1 )∈E
Herein, ΔL p is a constant plastic velocity gradient in the current time increment and
Δα := αn+1 − αn is a constant increment of hardening variable. The elastic domain
E is defined by a yield function f as
E := (Σ, β) f (Σ, β) = devΣ − 2/3β ≤ 0 ; (42)
23 [MPa]
(a) (b) (c)
300
275
l 250
225
200
l l
Fig. 4 a Initial configuration of SSRVE (the total number of elements is 10,889 and the number of
degrees of freedom is 43,578), b distribution of Kirchhoff stresses τyz in the deformed configuration
resulting from the classical standard return mapping method and c IVF with HDSD scheme
The performance of the proposed implementation scheme for finite strain plas-
ticity is investigated by the finite element simulation of a statistically similar repre-
sentative volume element (SSRVE) of a dual-phase (DP) steel microstructure. The
SSRVE shown in Fig. 4a is obtained by constructing a simplified microstructure in
terms of statistical measures as similar as possible to the real random microstruc-
ture, cf. [2]. To somehow represent a typical DP steel microstructure consisting of
a ferritic matrix phase with an embedded martensitic inclusion phase the material
parameters in Table 2 are used. A macroscopic deformation gradient F̄ including the
shear F̄yz is applied by prescribing the homogeneous deformation field x = F̄ X + w̃
to the SSRVE and periodic deformation fluctuations w̃ and anti-periodic tractions
0
0 0.01 0.02 0.03 0.04
-
Macroscopic deformation gradient F23
16 M. Tanaka et al.
In this section a CSDA-based robust approximation scheme for the numerical calcu-
lation of tangent stiffness matrices is presented in the context of nonlinear thermo-
mechanical finite element problems and its performance is analyzed.
5.1 Formulations
− DivF S − f = 0, (45)
1
S · Ċ + ρ0 r − Divq 0 − ρ0 (ψ̇ + θ˙η) = 0. (46)
2
Herein, the Legendre transform ψ = U − θ η has been performed, where ψ, U , θ
and η denote the Helmholtz free energy, the specific internal energy, temperature
and the specific entropy. The internal dissipation is considered to consist of two
additive parts, i.e. Dint = Dmech + Dtherm , with the thermal part Dtherm = ρ0 θ η̇p and
a mechanical part Dmech . q 0 is the heat flux through the body in the reference con-
figuration, which is related to the Cauchy heat flux q = −kθ gradθ by q 0 = J F −1 q.
Herein, kθ is the heat conduction coefficient and J is the determinant of F. Note
that grad(•) denotes the gradient with respect to coordinates in the reference config-
uration, respectively, and Div(•) denotes the divergence with respect to coordinates
in the reference configuration. Also, f , r and ρ0 are the body force vector, internal
heat source and the reference density of the body, respectively. For the solution of
boundary value problems the standard Galerkin method is typically applied which
requires the weak form of the balance equations. In this context see e.g. [33] or [29]
for further details. Multiplying the balance Eqs. (45), (46) with test functions and
Robust Numerical Schemes for an Efficient Implementation … 17
integrating over the physical domain leads to the weak forms G u := G int
u − Gu = 0
ext
and G θ = G θ − G θ = 0, wherein the internal and external parts of the weak forms
int ext
are given by
1
u :=
G int S · δC dV, (47)
B0e 2
θ :=
G int ( q 0 · Gradδθ + ρ0 θ ∂θθ ψ θ̇ δθ + ρ0 θ ∂θα ψ α̇ δθ
2 2
B0e
e
+ ρ0 θ ∂θ2be ψ · ḃ δθ + Dmech δθ ) dV, (48)
G ext
u := t 0 · δu dA + f · δu dV, (49)
∂B0e B0e
G ext
θ := Q · Nδθ dA. (50)
∂B0e
Herein, B0e is the domain of the reference configuration, δu and δθ denote the test
functions associated with the displacement and temperature fields, respectively. Also
note that t 0 and Q are the surface traction and the heat flux vectors, respectively,
which are assumed here to be independent of the displacements or temperatures.
Exploiting the principle of maximum dissipation and applying the Karush-Kuhn-
Tucker optimality conditions, c.f. [28], theexplicit form for the mechanical dissipa-
tion for isotropic hardening is Dmech = λ 23 y(θ ), with the consistency parameter
λ and the initial yield stress y(θ ). Here, for the finite element implementation the
discretized weak forms of the balance equations using matrix notation read
n ele
int,h
n ele
u ≈ Gu
G int int,h
= (δd eu )T r int,e
u and G int
θ ≈ G θ = (δd eθ )T r θint,e , (51)
e=1 e=1
with n ele denoting the number of finite elements. The external parts of the discretized
weak forms G uext,h and G ext,h
θ are reformulated accordingly. The element vector of
mechanical degrees of freedom is denoted by d eu which can be represented for a
three-dimensional setting by d eu = [(d uI =1 )T (d uI =2 )T . . . (d nen
u ) ] with nen being
T T
the number of nodes per element and d u = [dux duy duz ] denoting the displacements
I T
at a particular node I .
In a three-dimensional physical space the number of degrees of freedom associ-
ated with the displacements is nd f u = 3, i.e. dux , du y , and duz . Then the total number
of mechanical degrees of freedom for one element is tdo f u = nen × nd f u . Analo-
gously, the number of thermal degrees of freedom is nd f θ = 1, i.e. dθ , such that the
total number of thermal degrees of freedom per element is tdo f θ = nen × nd f θ .
After inserting standard approximations for the nodal displacements and temper-
atures and shifting to the Voigt notation to represent tensorial quantities of second
order as vectors, the discretized internal element residual vectors are
18 M. Tanaka et al.
nen
r e,int
u = (B uI )T S dV, (52)
I =1 B0e
nen
r e,int
θ = (B θI )T q 0 + N I ρ0 θ ∂θθ
2
ψ θ̇ + N I ρ0 θ ∂θθ
2
ψ α̇
I =1 B0e
e
+ N I ρ0 θ ∂θ2be ψ · ḃ + N I λ 2/3y(θ )) dV. (53)
Herein, B0e denotes the domain of the initial configuration of finite element e. The
external residual vectors r e,extu and r e,ext
θ are obtained analogously. Here, the same
nodal shape functions N are used for the displacements and the temperature; B uI are
I
the standard mechanical B-matrices associated with node I , c.f. [33], which consist
of the gradients of the nodal shape functions. The thermal B-matrix is given by B θI =
[N,1I N,2I N,3I ]T , where N,iI represents the derivative of the shape function with respect
to physical coordinate X i . Due to the material nonlinearities these equations have to
be solved numerically. For that purpose their linearizations LinG hu = G hu + ΔG hu and
LinG hθ = G hθ + ΔG hθ are required. Here ΔG hu and ΔG hθ are the linear increments of
the weak forms obtained as
n ele
ΔG int,h
u = (δd eu )T keuu Δd eu + keu θ Δd eθ , (54)
e=1
n ele
ΔG int,h
θ = (δd eθ )T keθu Δd eu + keθθ Δd eθ , (55)
e=1
Instead of implementing the analytic expressions for these derivatives the CSDA
scheme can be applied. Remember that the mechanical and thermal residual vectors
depend on all (mechanical and thermal) degrees of freedom, i.e. r eu := r eu (d eu , d eθ )
e
and r eθ := r eθ (d eu , d eθ ). Then the approximations of the k-th column vectors k̃uu(k) and
e e e
k̃θu(k) in keuu and keθu , respectively, and of the j-th column vectors k̃uθ( j) and k̃θθ( j)
in keuθ and keθθ , respectively, can alternatively be approximated by
e
e ∂ r e,int r eu (d eu + i h d̃ u(k) , d eθ )
k̃uu(k) := u ≈ , (57)
∂ due k h
e
e ∂ r e,int r eu (d eu , d eθ + i h d̃ θ( j) )
k̃uθ( j) := u e ≈ , (58)
∂ dθ j h
Robust Numerical Schemes for an Efficient Implementation … 19
e
e ∂ r e,int r eθ (d eu + i h d̃ u(k) , d eθ )
k̃θu(k) := θ ≈ , (59)
∂ due k h
e
e ∂ r e,int r eθ (d eu , d eθ + i h d̃ θ( j) )
k̃θθ( j) := θ ≈ , (60)
∂ dθe j h
where the indices k ∈ [1, tdo f u ] and j ∈ [1, tdo f θ ] on the left hand side of the
equations represent the column index. On the right hand side these indices correspond
to the individual perturbation vectors whose components with indices m ∈ [1, tdo f u ]
and q ∈ [1, tdo f θ ], respectively, are defined as
{d̃u(k)
e
}m = δ(k)m and {d̃θ(
e
j) }q = δ( j)q . (61)
(a) (b)
(M) (T) (V)
-650 MPa 939 K 3%
Loads
0 1 2 3 4
Ferrite
Time (in s)
Martensite
Fig. 6 a RVE, discretized with 3623 quadratic tetrahedral elements, and b loading protocol; M, T
and V indicate the mechanical compressive load, the thermal change and the volumetric expansion
(of the inclusion), respectively
20 M. Tanaka et al.
unloaded again. Second, the temperature is prescribed over the entire microstruc-
ture, increasing first from 293 K up to 939 K and then decreasing back to 293 K,
to reflect some heat treatment. Third, during the cooling procedure additionally a
volumetric change is applied in the inclusion phase to characterize the mechanical
fields resulting from a 3 % phase transformation volume change. This loading pro-
tocol is depicted in Fig. 6b. It is remarked that this procedure is only an idealization
of the production process of DP steel and serves here only as a numerical exam-
ple showing the applicability of the approximation scheme. For the calculation the
strain energy function as proposed in [29] is used along with a linear hardening law.
The material parameters chosen for the two phases are listed in Table 3, wherein
the hardening modulus is denoted by H . For the initial yield stress the temperature
dependency y = ω(θ − θ0 ) + y0 − ỹ0 + ỹ0 is taken into account, where y0 is the
initial yield stress at room temperature θ0 = 293 K. The parameters ω and y0 for the
two phases are also given in Table 3. The resulting accumulation of plastic strains
in the microstructure is visualized in Fig. 7. At time t = 1 s, after the application of
compression, we see a development of plastic strains in the ferrite, which is however,
lower than at time t = 4 s. To demonstrate the performance of the CSDA scheme
the convergence patterns are analyzed at particular times of interest during the entire
simulation. Thus, the times t = 1 s where the total compressive load is active and
t = 4 s at the end of the volume jump and cooling process are considered. The norms
of the residual vectors versus the corresponding Newton iteration, obtained at each of
these times, are plotted in Fig. 8 for the perturbation values of h = 10−5 , h = 10−8 ,
Fig. 7 Values of internal variable α in the microstructure at (a) t = 1 s, after the application of
compressive loads, and at (b) t = 4 s, after the application of the volumetric expansion
Robust Numerical Schemes for an Efficient Implementation … 21
(a) (b)
100 10000
log(h) = -05 log(h) = -05
0.0001 0.01
1e-06 0.0001
1e-08 1e-06
1e-10 1e-08
1e-12 1e-10
1 10 1 10
Fig. 8 Absolute values of the residual norms versus corresponding Newton iteration at times
(a) t = 1 s, (b) t = 4 s. Both axes are depicted in logarithmic scale
h = 10−10 and h = 10−30 . As can be seen, the perturbation h = 10−5 leads to too
inaccurate approximations and therefore gives unsatisfactory results, but smaller per-
turbation values lead to quadratically converging Newton iterations. Summarizing, a
stable convergence of the CSDA approach for the tangent stiffness matrix is demon-
strated in this example for a thermo-elastoplastic problem with a variety of loading
conditions.
6 Conclusion
Acknowledgments Financial funding by the DFG Priority Program 1648 (SPPEXA “Software
for Exascale Computing”), projects BA 2823/8-1 and SCHR 570/19-1 is greatly acknowledged
by D. Balzani and J. Schröder. Furthermore, assistance for the thermo-mechanical calculations by
Ashutosh Gandhi is appreciated.
References
1. Balzani, D., Neff, P., Schröder, J., & Holzapfel, G. A. (2006). A polyconvex framework for
soft biological tissues. Adjustment to experimental data. International Journal of Solids and
Structures, 43(20), 6052–6070.
2. Balzani, D., Scheunemann, L., Brands, D., & Schröder, J. (2014). Construction of two- and
three-dimensional statistically similar RVEs for coupled micro-macro simulations. Computa-
tional Mechanics, 54, 1269–1284.
3. Balzani, D., Gandhi, A., Tanaka, M., & Schröder, J. (2015). Numerical calculation of thermo-
mechanical problems at large strains based on robust approximations of tangent stiffness matri-
ces. Computational Mechanics, 55, 861–871.
4. Bleier, N., & Mosler, J. (2012). Efficient variational constitutive updates by means of a novel
parameterization of the flow rule. International Journal for Numerical Methods in Engineering,
89, 1120–1143.
5. Clifford, W. K. (1873). Preliminary sketch of biquaternions. Proceedings of the London Math-
ematical Society, 4(64), 381–395.
6. Fike, J. A. (2013). Multi-objective optimization using hyper-dual numbers. Ph.D. thesis, Stan-
ford university.
7. Fike, J. A., & Alonso, J. J. (2011). The development of hyper-dual numbers for exact second-
derivative calculations. In 49th AIAA Aerospace Sciences Meeting including the New Horizons
Forum and Aerospace Exposition.
8. Golanski, D., Terada, K., & Kikuchi, N. (1997). Macro and micro scale modeling of thermal
residual stresses in metal matrix composite surface layers by the homogenization method.
Computational Mechanics, 19, 188–201.
9. Kim, S., Ryu, J., & Cho, M. (2011). Numerically generated tangent stiffness matrices using
the complex variable derivative method for nonlinear structural analysis. Computer Methods
in Applied Mechanics and Engineering, 200, 403–413.
10. Kiran, R., & Khandelwal, K. (2015). Automatic implementation of finite strain anisotropic
hyperelastic models using hyper-dual numbers. Computational Mechanics, 55, 229–248.
11. Kiran, R., & Khandelwal, K. (2014). Complex step derivative approximation for numerical
evaluation of tangent moduli. Computers and Structures, 140, 1–13.
12. Klinkel, S. (2000). Theorie und Numerik eines Volumen-Schalen-Elementes bei finiten elastis-
chen und plastischen Verzerrungen. Dissertation thesis, Institut für Baustatik, Universität Karl-
sruhe.
13. Lai, K.-L., & Crassidis, J. L. (2008). Extensions of the first and second complex-step derivative
approximations. Journal of Computational and Applied Mathematics, 219, 276–293.
14. Lyness, J. N. (1968). Differentiation formulas for analytic functions. Mathematics of Compu-
tation, 352–362.
15. Martins, J. R. R. A., & Hwang, J. T. (2013). Review and unification of discrete methods for
computing derivatives of single- and multi-disciplinary computational models. AIAA Journal,
51(11), 2582–2599.
16. Martins, J. R. R. A., Sturdza, P., & Alonso, J. J. (2003). The complex-step derivative approxi-
mation. ACM Transactions on Mathematical Software, 29, 245–262.
17. Miehe, C. (1996). Numerical computation of algorithmic (consistent) tangent moduli in large-
strain computational inelasticity. Computer Methods in Applied Mechanics and Engineering,
134, 223–240.
Robust Numerical Schemes for an Efficient Implementation … 23
18. Miehe, C., & Lambrecht, M. (2003). Analysis of microstructure development in shearbands by
energy relaxation of incremental stress potentials: Large-strain theory for standard dissipative
solids. International Journal for Numerical Methods in Engineering, 58, 1–41.
19. Miehe, C., Schotte, J., & Schröder, J. (1999). Computational micro-macro-transitions and
overall moduli in the analysis of polycrystals at large strains. Computational Materials Science,
16, 372–382.
20. Mosler, J., & Bruhns, O. T. (2009). Towards variational constitutive updates for non-associative
plasticity models at finite strain: Models based on a volumetric-deviatoric split. International
Journal of Solids and Structures, 46, 1676–1684.
21. Mosler, J., & Bruhns, O. T. (2010). On the implementation of rate-independent standard dis-
sipative solids at finite strain—variational constitutive updates. Computer Methods in Applied
Mechanics and Engineering, 199, 417–429.
22. Ortiz, M., & Stainier, L. (1999). The variational formulation of viscoplastic constitutive updates.
Computer Methods in Applied Mechanics and Engineering, 171, 419–444.
23. Pérez-Foguet, A., Rodríguez-Ferran, A., & Huerta, A. (2000). Numerical differentiation for
local and global tangent operators in computational plasticity. Computer Methods in Applied
Mechanics and Engineering, 189, 277–296.
24. Pérez-Foguet, A., Rodríguez-Ferran, A., & Huerta, A. (2000). Numerical differentiation for
non-trivial consistent tangent matrices: An application to the mrs-lade model. International
Journal for Numerical Methods in Engineering, 48, 159–184.
25. Schröder, J. 2013. A numerical two-scale homogenization scheme: the FE2 -method. In Plas-
ticity and beyond—microstructures, chrystal-plasticity and phase transitions (CISM Lecture
Notes). Vienna: Springer.
26. Schröder, J., Neff, P., & Balzani, D. (2005). A variational approach for materially stable
anisotropic hyperelasticity. International Journal of Solids and Structures, 42(15), 4352–4371.
27. Simo, J. C. (1988). A framework for finite strain elastoplasticity based on maximum plastic
dissipation and the multiplicative decomposition: Part I. Continuum formulation. Computer
Methods in Applied Mechanics and Engineering, 66, 199–219.
28. Simo, J., & Hughes, T. J. R. (1998). Computational inelasticity. Berlin: Springer.
29. Simo, J., & Miehe, C. (1992). Associative coupled thermoplasticity at finite strains: Formu-
lation, numerical analysis and implementation. Computer Methods in Applied Mechanics and
Engineering, 98, 41–104.
30. Smit, R. J. M., Brekelmans, W. A. M., & Meijer, H. E. H. (1998). Prediction of the mechanical
behavior of nonlinear heterogeneous systems by multi-level finite element modeling. Computer
Methods in Applied Mechanics and Engineering, 155, 181–192.
31. Tanaka, M., Fujikawa, M., Balzani, D., & Schröder, J. (2014). Robust numerical calculation
of tangent moduli at finite strains based on complex-step derivative approximation and its
application to localization analysis. Computer Methods in Applied Mechanics and Engineering,
269, 454–470.
32. Tanaka, M., Sasagawa, T., Omote, R., Fujikawa, M., Balzani, D., & Schröder, J. (2015). A
highly accurate 1st- and 2nd-order differentiation scheme for hyperelastic material models
based on hyper-dual numbers. Computer Methods in Applied Mechanics and Engineering,
283, 22–45.
33. Zienkiewicz, O. C., & Taylor, R. L. (1967). The finite element method for solid and structural
mechanics. Oxford: Butterworth-Heinemann.
Folding Patterns in Partially Delaminated
Thin Films
Abstract Michael Ortiz and Gustavo Gioia showed in the 90s that the complex
patterns arising in compressed elastic films can be analyzed within the context of the
calculus of variations. Their initial work focused on films partially debonded from
the substrate, subject to isotropic compression arising from the difference in thermal
expansion coefficients between film and substrate. In the following two decades
different geometries have been studied, as for example anisotropic compression. We
review recent mathematical progress in this area, focusing on the rich phase diagram
of partially debonded films with a lateral boundary condition.
1 Introduction
Elastic films deposited on a substrate are often subject, after thermal expansion, to
compressive strains which are released by debonding and buckling, generating a
variety of microstructures. The work of Michael Ortiz and Gustavo Gioia in the 90s
[1, 2] opened the way for the use of the tools of calculus of variations in the study of
these structures. Their starting point was the Föppl-von Kármán plate theory, as given
in (4) below. One of their insights was that the key nonconvexity which gives rise
to the microstructure can be understood in terms of the out-of-plane displacement
alone, leading after some rescalings to the Eikonal functional, as given in (1) below.
This functional contains a term of the form (|Dw|2 − 1)2 , where w is the normal
D. Bourne
Department of Mathematical Sciences, Durham University, Durham, UK
e-mail: [email protected]
S. Conti (B) · S. Müller
Institut Für Angewandte Mathematik, Universität Bonn, Bonn, Germany
e-mail: [email protected]
S. Müller
e-mail: [email protected]
displacement, which favours deformations with the property that the gradient of w
is approximately a unit vector, independently of the orientation. Since the film is
still bound to the substrate at the boundary of the debonded region, the appropriate
boundary condition is w = 0, which prescribes that the average over Ω of the gradient
of w vanishes. Therefore the resulting low-energy deformations have gradient Dw
oscillating between different values. As in many nonconvex variational problems,
oscillations on very small scales may be energetically convenient, see [3, 4]. Corre-
spondingly, the variational problem Ω (|Dw|2 − 1)2 d x is not lower semicontinuous,
and - depending on the boundary data and forcing - does not have minimizers. How-
ever, the curvature term σ 2 |D 2 w|2 penalizes oscillations on an exceedingly fine scale
and thereby ensures existence of minimizers. The solutions then have oscillations
on an intermediate scale, which is determined by the competition between the two
terms. The analysis of the specific functional proposed by Ortiz and Gioia is reviewed
in Sect. 2 below.
The approach of Ortiz an Gioia was later extended to the full vectorial Föppl-von
Kármán energy, and also to three-dimensional elasticity. These refinements explained
the appearance of oscillations on two different length scales, with coarse oscillations
in a direction normal to the boundary, and fine oscillations in the direction tangential
to the boundary, as discussed in Sect. 3 below.
Recently interest has been directed to controlling the microstructures by designing
the geometry of the debonded region appropriately [5, 6]. The key idea is to introduce
a sacrificial layer between the film and the substrate, and then to selectively etch away
a part of it, so that the boundary of the debonded region is straight. The film then
partially rebonds to the surface, leading to complex patterns of tunnels. A study
of these patterns within the Ortiz-Gioia framework, with a variational functional
containing the Föppl-von Kármán energy and a fracture term proportional to the
debonded area, is presented in Sect. 4. The mathematical analysis leading to the upper
bounds of Theorem 6 suggests the presence of different types of patterns in different
parameter ranges. The picture is rather easy in the two extreme cases in which the
bonding energy per unit area is very small or very large. Indeed, in the first one the
patterns observed for completely debonded films give the optimal energy scaling,
in the second one the optimal state corresponds to the film completely bound to the
substrate. In the intermediate regime we expect a richer picture, with bonded areas
separated by thin debonded tunnels. For a certain regime, depending on the relation
between the bonding energy per unit area, the film thickness and the compression
ratio, a construction in which the tunnels branch and refine close to the boundary has
a lower energy than the one with straight tunnels, see discussion in Sect. 4 below.
The microstructure formation in thin films can be understood at a qualitative level
as a form of Euler buckling instability. The relevant experiments, however, are well
beyond the stability threshold, as discussed in Sect. 5 below.
Folding Patterns in Partially Delaminated Thin Films 27
Ortiz and Gioia showed that, if tangential displacements are neglected, the energy
of a compressed thin film can be characterized by the functional
2
Iσ [w] = |Dw|2 − 1 + σ 2 |D 2 w|2 d x, (1)
Ω
Fig. 1 Sketch of a
deformation achieving the
optimal energy in (1). Here
the debonded region
Ω = (0, 1)2 is a square, and
the distance to the boundary
gives a “tent”-form. The
convolution in (3) makes the
folds have smooth transitions
on a small scale
28 D. Bourne et al.
w w
x2 x2
Fig. 2 Sketch of the effect of the mollification in (3) in a direction orthogonal to the fold. Left
panel: the distance from the boundary dist(x, ∂Ω) is a function with slope ±1 and sharp kinks.
Right panel: the mollification defined in (3) still has slope ±1 on large parts of the domain, but has
smooth transitions from one value to the other over a length of the order of σ , see Fig. 3
Ω σ |D w| d x infinite. Therefore Ortiz and Gioia [1, 2] proposed to use a smoothed
2 2 2
A finer analysis of the nonlinear elasticity model that had led to (1) showed that, in
the case of isotropic compression, also the in-plane components exhibit fine-scale
oscillations which refine close to the boundary [22–25]. This analysis was based
on the Föppl-von Kármán model, which includes the tangential components of the
displacement u as well. After rescaling the energy takes the form (in the case of zero
Poisson’s ratio for simplicity)
E σ [u, w] = |Du + Du T + Dw ⊗ Dw − Id|2 + σ 2 |D 2 w|2 d x. (4)
Ω
Folding Patterns in Partially Delaminated Thin Films 29
Here Ω ⊂ R2 is, as above, the debonded region, and the displacements u and w
vanish at the boundary of Ω, corresponding to the fact that the rest of the film is still
bound to the substrate. The isotropic compressive strain has been scaled to 1, and
one can check that E σ [0, w] = 1 + Iσ [w]. The key result from [22, 23] was that the
minimum energy scales proportional to σ :
Theorem 1 (From [22, 23]) Let Ω ⊂ R2 be a bounded domain with piecewise
smooth boundary. Then there are two constants c L , cU > 0 such that
The argument used for proving the lower bound also proves that a finite fraction of
the energy is localized in a thin strip close to the boundary.
Similar statements hold if the plate theory in (4) is replaced by a fully three-
dimensional nonlinear elastic model. For v : Ω × (0, h) → R3 , h > 0, we define
1
E h [v] =
3D
W (Dv)d x (6)
h Ω×(0,h)
where W : R3×3 → [0, ∞) is the elastic stored energy density, which vanishes on
the set of proper rotations SO(3) and has quadratic growth, in the sense that
for some positive constants c and c . The factor 1/ h is included explicitly in (6) to
obtain an energy per unit thickness, corresponding to (4).
In the nonlinear case the thickness h of the film and the compression δ enter the
problem separately, however to leading order and after scaling the optimal energy
only depends on the combination σ = h/δ 1/2 . In order to understand this expression
it is instructive to recall the relation between the three-dimensional problem E h3D
and its two-dimensional counterpart E σ . In particular, a given pair (u, w) in (4)
corresponds to a three-dimensional deformation vδ of the form
where ⎛ ⎞
x1 + 2δu 1 (x1 , x2 )
ψ(x1 , x2 ) = ⎝x2 + 2δu 2 (x1 , x2 )⎠ (9)
(2δ)1/2 w(x1 , x2 )
is, to leading order, the normal to the surface described by ψ and gives the out-of-
plane component of the strain. An expansion of E h3D [vδ ] for small δ shows that the
leading order contribution is proportional to δ 2 E σ [u, w] if the Poisson’s ratio of the
material vanishes. See for example [22, App. A and App. B] for a more detailed
discussion of this point. A rigorous relation between E σ and E h3D was derived in
[26, 27] by means of Γ -convergence, these results however are appropriate for a
different regime, with much smaller energy, and therefore do not apply directly to
the situation of interest here.
Theorem 2 (From [25]) Let Ω ⊂ R2 be a bounded domain with piecewise smooth
boundary, δ ∈ (0, 1), h ∈ (0, δ 1/2 ). Then there are two constants c L , cU > 0 such
that
1 3D
c L σ ≤ min{ E [u] : u(x) = (1 − δ)x for (x1 , x2 ) ∈ ∂Ω} ≤ cU σ (11)
δ2 h
Fig. 3 Sketch of a deformation achieving the optimal upper bound in (5). As in Fig. 1, the debonded
region Ω = (0, 1)2 is a square. The starting point, at a coarse scale, is the “tent”-form illustrated in
Fig. 1. At a finer scale, folds orthogonal to the boundary relax the tangential compression (left panel,
folds are only drawn in a small region). The period of the folds is of order h close to the boundary,
and via a sequence of period-doubling steps becomes coarser in the inside (right, blow-up of the
folds from the left panel)
Folding Patterns in Partially Delaminated Thin Films 31
1 3D
lim E [vh ] = 0 (13)
h→0 hα h
The proof of this is based on the combination of three ingredients. The first one is
an approximation of short maps with Origami maps:
Theorem 4 (From [45]) Let v0 : Ω → R3 be a short map, i.e., a map which obeys
|v0 (x) − v0 (y)| ≤ |x − y| for all x, y ∈ Ω. Then there is a sequence v j of Origami
maps converging uniformly to v0 .
Here we say that a map v : R2 → R3 is an Origami map if it is continuous and
piecewise isometric, i.e., if the domain can be subdivided into pieces such that v is
a linear isometry (a translation plus a rotation) in each piece. The number of pieces
is allowed to diverge only at infinity, in the sense that only finitely many pieces are
allowed in any bounded subset of R2 .
The second step is to approximate any Origami maps with low-energy maps:
32 D. Bourne et al.
x3
x2
x1
Fig. 4 Geometry of the partially delaminated film. The intermediate sacrificial layer is removed
chemically only for x1 > 0. The free-standing film is subject to compression at the Dirichlet bound-
ary x1 = 0 and may rebond to the substrate
Folding Patterns in Partially Delaminated Thin Films 33
Fig. 5 Experimental picture of tube branching in Si1−x Gex film on a thick SiO2 substrate. Left
AFM image of the network near the etching front. Right autocorrelation pattern. Reprinted from
[5, Fig. 2] with permission from Wiley
The three terms represent stretching, bending and bonding energies respectively.
Here u : Ω → R2 are the (scaled) tangential displacements and γ > 0 is the bonding
energy per unit area (related to Griffith’s fracture energy), |{w > 0}| represents the
area of the set where the vertical displacement w is nonzero. Equivalently one could
take the debonded state as reference and consider a negative term proportional to the
rebonded area, −γ |{w = 0}|; the two energies only differ by an additive constant. The
appropriate boundary conditions correspond to the film being bound to a substrate
on one side of the domain; for simplicity we shall focus on Ω = (0, 1)2 with u = 0
and w = 0 on the {x1 = 0} side of Ω. As above, we assume w ≥ 0 everywhere.
The mathematical analysis of the energy (15) leads to the rich phase diagram
sketched in Fig. 6, which contains four different regimes [51] that we now illustrate.
For large specific bonding energy γ the film is completely bound to the substrate.
In particular the film is flat, so that there is no bending energy, but the stretching
energy is not released. The total energy is then proportional to the area of Ω, and
one obtains E σ,γ [0, 0] = 2. This is regime A in Fig. 6 and Theorem 6.
The opposite case of very small bonding energy γ is also easy to understand
after the foregoing discussion: here the bonding term plays no significant role and
A
B
x2
the film is completely detached from the substrate. One recovers the result of the
blistering problem of Theorem 1, E σ,γ [u, w] E σ [u, w] ≤ cσ . The corresponding
deformations are those illustrated in Fig. 3. This is regime D in Fig. 6 and Theorem 6.
For intermediate values of γ the situation is more complex, in particular debonded
channels are formed, which separate wider bonded regions. In regime B the pattern
is periodic and, away from the Dirichlet boundary, depends only on the tangential
variable x2 . A large part of the film is bonded to the substrate, but bonded regions
are separated by thin tubes, see Fig. 7. Denoting by h the period of the oscillations,
and by δ the width of a tube, the total volume fraction of the tubes is δ/ h, therefore
the bonding energy is proportional to γ δ/ h. Each tube has to release a compression
of h over a width δ, therefore the term |Dw|2 is of order h/δ inside the tubes (the
stretching energy is then completely relaxed). This gives |Dw| ∼ (h/δ)1/2 in the
tubes, and hence |D 2 w| ∼ (h/δ)1/2 /δ. Therefore the total energy can be estimated
by
2
δ δ h 1/2 /δ 1/2 δ σ2
γ + σ2 =γ + 2 . (16)
h h δ h δ
δ σ2
h+γ + 2. (17)
h δ
Inserting the value of δ obtained above and minimizing in h we conclude that h
and E are proportional to (σ γ )2/5 . The width of each tube δ is then proportional to
σ 4/5 γ −1/5 . This is regime B in Fig. 6; a precise version of this construction proves
the second bound in Theorem 6.
If the bending term becomes more important, it is convenient to insert period-
doubling steps, just like in the discussion of the functional (4). The resulting pattern
is shown in Fig. 8. In comparison to the pattern of Fig. 3 the key difference is that the
bending is localized to a small region, whereas large parts of the film are bond to the
Folding Patterns in Partially Delaminated Thin Films 35
substrate. The period-doubling steps are only possible at the expense of stretching
energy; balancing the different terms one finds [51] that the resulting energy is pro-
portional to σ 1/2 γ 5/8 . The result of the construction is summarized in the following
statement.
Theorem 6 (From [51]) Let γ > 0, σ ∈ (0, 1). There are u, w which obey the stated
boundary conditions and
⎧
⎪
⎪1 if σ γ > 1 (regime A),
⎨
(σ γ )2/5 if σ −4/9 ≤ γ ≤ σ −1 (regime B),
E σ,γ [u, w] ≤ c (18)
⎪
⎪σ 1/2 γ 5/8 if σ 4/5 ≤ γ ≤ σ −4/9 (regime C),
⎩
σ if γ < σ 4/5 (regime D).
The proof is based on making the constructions sketched above precise, details are
given in [22] for regime D and in [51] for regimes B and C. Regime A, as discussed
above, is immediate.
Optimality of the phase diagram just discussed can be at least partially proven
by providing matching lower bounds on the energy. In particular, one can show the
following.
Theorem 7 (From [51]) Let γ > 0, σ ∈ (0, 1). For any u, w which obey the stated
boundary conditions one has
⎧
⎨1 if σ γ > 1 (regime A),
E γ ,σ [u, w] ≥ c (σ γ )2/3 if σ 1/2 ≤ γ ≤ σ −1 (regime B ), (19)
⎩
σ if γ < σ 1/2 (regime D ).
Whereas the statement in regime D follows from [22], the other two bounds are
proven in [51] using the Korn-Poincaré inequality for SBD2 functions obtained in
[52].
36 D. Bourne et al.
The general form of the linearized Föppl-von Kármán plate theory under isotropic
compression is [53, 54]
1 h2
E FvK [u, w] = Yh (1 − ν)|ε|2 + ν(Trε)2 + (1 − ν)|D 2 w|2 + ν(Δw)2 d x ,
2 Ω 12
(20)
see also [2, 22] for a discussion in the present context and [27] for a rigorous mathematical
derivation. Here ν ∈ [−1, 1/2] is the Poisson ratio, Y Young’s modulus, h the film thickness,
and the strain ε is defined by
where δ is the eigenstrain (i.e., the compression enforced by the substrate). We recall
that we use |M|2 = TrM T M for the matrix norm. For ν = 0, after a rescaling (20)
reduces to (4). We recall that in [22, App. B] it was shown that the scaling behavior of
the functional E FvK is the same for all ν ∈ (−1, 1/2], hence our results hold also for
generic values of the Poisson ratio. Of course, the regime ν ≥ 0 is the most relevant.
For small δ one can linearize around the state u = 0, w = 0. After straightforward
computations this leads to
1
lin
E FvK [u, w] = Yh (1 − ν)|Du + Du T − 2δId|2 − 4(1 − ν)δ|Dw|2
2 Ω
h2
+ ν(2divu − 4δ)2 − 8δν|Dw|2 + (1 − ν)|D 2 w|2 + ν(Δw)2 d x .
12
and
x x Id x⊗x
D 2 w(x) = ϕ (|x|) ⊗ + ϕ (|x|) − .
|x| |x| |x| |x|3
Folding Patterns in Partially Delaminated Thin Films 37
This is positive definite if the first term, of order δ, is not larger then the second term,
of order h 2 /R 2 . Therefore the loss of stability, which corresponds to Euler buckling,
occurs at strains δ ∼ h 2 /R 2 . Inserting the experimental data from [5], namely, h ∼ 20
nm, R ∼ 10 µm, ν ∼ 0.277, leads to δcrit ∼ 4 × 10−6 , which corresponds to a strain
of 0.0004 %. This is over three orders of magnitude smaller than the experimentally
applied strain δExp ∼ 0.011 = 1.1 %. Therefore the experiments we discussed take
place well beyond the loss of linear stability, and a buckling-postbuckling analysis
does not seem appropriate to understand the deformations. Our variational approach
is instead constructed to deal with deformations and microstructures that appear in
the deeply nonlinear regime and is therefore more suitable to study the mentioned
experiments.
References
1. Ortiz, M., & Gioia, G. (1994). Dynamically propagating shear bands in impact-loaded
prenotched plates II. Numerical simulations. Journal of the Mechanics and Physics of Solids,
42, 531.
2. Gioia, G., & Ortiz, M. (1997). Delamination of compressed thin films. Advances in Applied
Mechanics, 33, 119.
3. Dacorogna, B. (1989). Direct methods in the calculus of variations. Applied Mathematical
Sciences, 78. Springer.
4. Müller, S.: F. Bethuel et al. (Ed.) Calculus of variations and geometric evolution problems.
Lecture Notes in Math (1713, pp. 85–210). Springer.
5. Mei, Y., Thurmer, D. J., Cavallo, F., Kiravittaya, S., & Schmidt, O. G. (2007). Semiconductor
sub-micro-/nanochannel networks by deterministic layer wrinkling. Advanced Materials, 19,
2124.
6. Cendula, P., Kiravittaya, S., Mei, Y. F., Deneke, C., & Schmidt, O. G. (2009). Bending and
wrinkling as competing relaxation pathways for strained free-hanging films. Physical Review
B, 79, 085429.
7. Aviles, P., & Giga, Y. (1987). A mathematical problem related to the physical theory of liq-
uid crystal configurations. Proceedings of the Centre for Mathematical Analysis, Australian
National University, 12, 1.
8. DeSimone, A., Kohn, R. V., Müller, S., & Otto, F. (2000). Magnetic microstructures : a paradigm
of multiscale problems. In J.M. Ball (ed.) ICIAM 99: proceedings of the Fourth International
Congress on Industrial and Applied Mathematics (pp. 175–190). Oxford: Oxford University
Press.
9. Aviles, P., & Giga, Y. (1996). The distance function and energy. Proceedings of the Royal
Society of Edinburgh Section A, 126, 923.
38 D. Bourne et al.
10. Aviles, P., & Giga, Y. (1999). On lower semicontinuity of a defect energy obtained by a singular
limit of the Ginzburg-Landau type energy for gradient fields. Proceedings of the Royal Society
of Edinburgh Section A, 129, 1.
11. Ambrosio, L., De Lellis, C., & Mantegazza, C. (1999). Line energies for gradient vector fields
in the plane. Calculus of Variations and Partial Differential Equations, 9, 327.
12. Jin, W., Kohn, R.V. (2000). Singular perturbation and the energy of folds. Journal of Nonlinear
Science, 10, 355.
13. DeSimone, A., Kohn, R. V., Müller, S., & Otto, F. (2001). A compactness result in the gradient
theory of phase transitions. Proceedings of the Royal Society of Edinburgh Section A, 131, 833.
14. Poliakovsky, A. (2007). Upper bounds for singular perturbation problems involving gradient
fields. Journal of the European Mathematical Society, 9(1), 1.
15. Conti, S., & De Lellis, C. (2007). Sharp upper bounds for a variational problem with singular
perturbation. Mathematische Annalen, 338, 119.
16. Poliakovsky, A. (2005). A method for establishing upper bounds for singular perturbation
problems. Comptes Rendus Mathematique Academy of Science Paris, 341, 97.
17. Gioia, G., DeSimone, A., Ortiz, M., & Cuitino, A. M. (2002). Folding energetics in thin-film
diaphragms. Proceedings of the Royal Society of London. Series A, 458, 1223.
18. Conti, S., DeSimone, A., & Müller, S. (2005). Self-similar folding patterns and energy scaling
in compressed elastic sheets. Computer Methods in Applied Mechanics and Engineering, 194,
2534.
19. Gioia, G., & Ortiz, M. (1998). Determination of thin-film debonding parameters from
telephone-cord measurements. Acta Materialia, 46(1), 169.
20. Audoly, B. (1999). Stability of straight delamination blisters. Physical Review Letters, 83,
4124.
21. Audoly, B. (2000). Mode-dependent toughness and the delamination of compressed thin films.
Journal of the Mechanics and Physics of Solids, 48, 2315.
22. Ben Belgacem, H., Conti, S., DeSimone, A., Müller, S. (2000). Rigorous bounds for the Föppl-
von Kármán theory of isotropically compressed plates. Journal of Nonlinear Science, 10, 661.
23. Jin, W., & Sternberg, P. (2001). Energy estimates for the von Kármán model of thin-film
blistering. Journal of Mathematical Physics, 42, 192.
24. Jin, W., & Sternberg, P. (2002). In-plane displacements in thin-film blistering. Proceedings of
the Royal Society of Edinburgh Section A, 132, 911.
25. Ben Belgacem, H., Conti, S., DeSimone, A., Müller, S. (2002). Energy scaling of compressed
elastic films. Archive for Rational Mechanics and Analysis, 164, 1.
26. Friesecke, G., James, R. D., & Müller, S. (2002). A theorem on geometric rigidity and the
derivation of nonlinear plate theory from 3d elasticity. Communications on Pure and Applied
Mathematics, 55, 1461.
27. Friesecke, G., James, R. D., & Müller, S. (2006). A hierarchy of plate models derived from
nonlinear elasticity by Gamma-convergence. Archive for Rational Mechanics and Analysis,
180, 183.
28. Kohn, R. V., & Müller, S. (1992). Branching of twins near an austenite-twinned-martensite
interface. Philosophical Magazine A, 66, 697.
29. Kohn, R. V., & Müller, S. (1994). Surface energy and microstructure in coherent phase transi-
tions. Communications on Pure and Applied Mathematics, 47, 405.
30. Conti, S. (2000). Branched microstructures: scaling and asymptotic self-similarity. Communi-
cations on Pure and Applied Mathematics, 53, 1448.
31. Conti, S. (2006). A lower bound for a variational model for pattern formation in shape-memory
alloys. Continuum Mechanics and Thermodynamics, 17, 469.
32. Zwicknagl, B. (2014). Microstructures in low-hysteresis shape memory alloys: scaling regimes
and optimal needle shapes. Archive for Rational Mechanics and Analysis, 213, 355.
33. Diermeier, J. (2013). Master’s thesis, Universität Bonn.
34. Chan, A., & Conti, S. (2015). Energy scaling and branched microstructures in a model for
shape-memory alloys with SO (2) invariance. Mathematical Models and Methods in Applied
Sciences, 25, 1091.
Folding Patterns in Partially Delaminated Thin Films 39
35. Choksi, R., & Kohn, R. V. (1998). Bounds on the micromagnetic energy of a uniaxial ferro-
magnet. Communications on Pure and Applied Mathematics, 51, 259.
36. Choksi, R., Kohn, R. V., & Otto, F. (1999). Domain branching in uniaxial ferromagnets: a
scaling law for the minimum energy. Communications on Pure and Applied Mathematics, 201,
61.
37. Viehmann, T. (2009). Uniaxial ferromagnets. Ph.D. thesis, Universität Bonn.
38. Choksi, R., Kohn, R. V., & Otto, F. (2004). Energy minimization and flux domain structure in
the intermediate state of a type-I superconductor. Journal of Nonlinear Science, 14, 119.
39. Choksi, R., Conti, S., Kohn, R. V., & Otto, F. (2008). Ground state energy scaling laws during
the onset and destruction of the intermediate state in a type-I superconductor. Communications
on Pure and Applied Mathematics, 61, 595.
40. Conti, S., & Ortiz, M. (2005). Dislocation microstructures and the effective behavior of single
crystals. Archive for Rational Mechanics and Analysis, 176, 103.
41. Conti, S., & Ortiz, M. (2008). Minimum principles for the trajectories of systems governed by
rate problems. Journal of the Mechanics and Physics of Solids, 56, 1885.
42. Lobkovsky, A. E., Gentges, S., Li, H., Morse, D., & Witten, T. A. (1995). Boundary layer
analysis of the ridge singularity in a thin plate. Science, 270, 1482.
43. Witten, T. A. (2007). Stress focusing in elastic sheets. Reviews of Modern Physics, 79, 643.
44. Venkataramani, S. C. (2004). Lower bounds for the energy in a crumpled elastic sheet a minimal
ridge. Nonlinearity, 17, 301.
45. Conti, S., & Maggi, F. (2008). Confining thin elastic sheets and folding paper. Archive for
Rational Mechanics and Analysis, 187, 1.
46. Li, X., Cai, W., An, J., Kim, S. et al. (2009). Large-area synthesis of high-quality and uniform
graphene films on copper foils. Science, 324(5932), 1312.
47. Zang, J., Ryu, S., Pugno, N., Wang, Q., Tu, Q., Buehler, M. J., et al. (2013). Multifunctionality
and control of the crumpling and unfolding of large-area graphene. Nature Materials, 12, 321.
48. Zhang, K., & Arroyo, M. (2013). Adhesion and friction control localized folding in supported
graphene. Journal of Applied Physics, 113, 193501.
49. Zhang, K., & Arroyo, M. (2014). Understanding and strain-engineering wrinkle networks in
supported graphene through simulations. Journal of Mechanics and Physics of Solids, 72, 61.
50. Arroyo, M., Zhang, K., & Rahimi, M. (2014). Mechanics of confined solid and fluid thin
films: Graphene and lipid bilayers. IUTAM Symposium on innovative numerical approaches
for materials and structures in multi-field and multi-scale problems.
51. Bourne, D., Conti, S., & Müller, S. (2015). Energy bounds for a compressed elastic film on a
substrate. Preprint arXiv:1512.07416
52. Chambolle, A., Conti, S., & Francfort, G. (2014). Korn-Poincaré inequalities for functions
with a small jump set. To appear in Indiana University Mathematics Journal. Preprint hal-
01091710v1.
53. Antman, S. S. (1995). Applied Mathematical Sciences. In Nonlinear problems in elasticity,
107. Springer.
54. Ciarlet, P. G. (1997). Theory of plates, Mathematical elasticity, vol. II. Elsevier.
Thermo-mechanical Behavior of Confined
Granular Systems
1 Introduction
A proper estimate of the mechanical strength, and of the thermal and electrical con-
ductivity of a compacted solid is contingent upon the knowledge of microstructure
formation during the deformation stage of the compression. Since thermally assisted
compaction of granular matter is of great importance for a wide set of manufacturing
processes, theoretical modeling and numerical simulations serve as significant tools
to forecast the macroscopic behavior of materials, essentially when experimental
techniques are also unfeasible.
At present, one of the most implemented methodologies to elucidate the col-
lective behavior of particulate materials is the continuum mechanics approach, in
which the granular material is assumed to be statistically homogenous [1]. This is
achieved by treating the system as units of ordered groups, simulating disordered
arrangements by statistical correlation functions or using empirical correlations. The
statistical averaging technique provides homogenized solutions of the highly hetero-
geneous granular media at the cost of imposing two assumptions: (i) affine motion
approximation, namely the motion of each grain follows the macroscopic strain, and
(ii) well-bonded structure, contact number and positioning do not change under the
applied load. Despite the fact that the effective medium theory particularly estimates
the effective elastic moduli of packed bed of spherical particles to a large extent, the
discrepancy between numerical and experimental results is remarkable. Makse and
co-workers questioned the relevance of force laws defined at single contact level,
where they pointed out that the simplification done in effective medium theory is the
misleading element in the formulation [2, 3]. Affine motion assumption demolishes
the ability of the approach to account for the relaxation and rearrangement of parti-
cles that are under shear deformation. Moreover concerning the variety of boundary
conditions and geometrical effects, experimentation techniques become insufficient
in providing sufficient information about the microstructure to feed empirical corre-
lations.
The second most adopted approach treats the particles as individual bodies. Orig-
inating from particle-particle interactions based on constitutive relations of contact
mechanics [4–6], the discrete element method has been widely used in the field of
particle scale research [7]. Pioneers of this approach, Cundall and Strack introduced
an explicit numerical scheme to practice the granular dynamics by defining particles’
interactions over the contact networks and solving for particles’ motion under the
state of force balance equilibrium [8]. The integration of particle motion and energy
to the macroscopic behavior of the assembly, provides the required understanding
of overall behavior of the confined material [9]. The main advantage of this method-
ology is the capability of presenting broad information about the micro-structural
arrangement of the granular media. Although there exists computational challenges
to model a large number of particles system with discrete elements methods, advances
in simulation techniques enhance the implementation of this approach into the field
of multi-physics problems of granular systems.
Recently researchers also focus on multi-scale approaches to describe the macro-
scopic behavior of granular systems. Zheng and Cuitino implemented a quasi-
continuum approach to bridge the gap between micro and meso scale description
by using a discrete-continuum formulation of elastic-inelastic deformations occur-
Thermo-mechanical Behavior of Confined Granular Systems 43
where nmn is the unit normal vector defined from centers of particle n to particle m.
xm and xn are the position of the particles.
Johnson identifies the elastic deformation of locally spherical particles that are
subject to a compression load by contact mechanics considerations in his book [27].
Small-strain deformation of conforming surfaces results in a flat circle of contact
area. Collinear contact force at this elastic contact of the particles m and n is defined
through Young’s moduli, E m and E n ; Poisson’s ratios, ν m and ν n ; particle radii, R m
and R n of particle m and n; and overlap, γ mn , between these particles,
4 mn mn 1/2 mn 3/2
F mn = E (R ) (γ ) (4)
3
where
−1
1 1
R mn
= + n (5)
Rm R
−1
1 − (ν m )2 1 − (ν n )2
E mn = + (6)
Em En
γ mn = R m + R n − xm − xn . (7)
One particular effect of applied thermal load on the system of particles is the
change in radii due to thermal expansion. Similar to previous studies in the literature
[26, 28], in the present study, linear thermal expansion formulation is taken into
consideration.
R m = Rrme f 1 + α m T m − Trme f (8)
Thermo-mechanical Behavior of Confined Granular Systems 45
The total heat flow to an individual particle, Eq. (1), is calculated by adding the
heat flow at each contact of the particle between its neighboring particles Eq. 9. As
discussed by the thermal contact models introduced in the literature [13, 14], Eq. (1)
requires that at each contact of the individual particle, the temperature is equal to the
temperature calculated at the center of the particle. In other words, the temperature
does not very significantly within the particle, which also imposes that the contact
conductance at the interface of conforming particles is relatively smaller than the
heat conductance within the particle.
46 G. Küçük et al.
2k mn a mn
1 (12)
k mn A/R m
where A is the cross sectional area, A = π(R m )2 and Eq. (12) defines the state of
Biot number much less than 1. This assertion is applied by several authors in earlier
studies [12, 29]. The condition of a mn R mn is also enforced by the assumption of
small-strain deformation of elastic bodies in contact.
Given such a setting that the chain of particles is compressed between two parallel
walls, which are maintained at different temperatures, the wall-particle interaction
is one of the important factors of this problem. In this study, analogous to ghost-cell
method, the contact between boundary particle and adjacent wall is simulated as the
contact between a boundary particle and a ghost particle. Based on the rigid wall
assumption, ghost particle and boundary particle are set to have the same material
properties and radius. The temperature difference between the ghost particle and the
wall surface is the same as the temperature deviation between the wall surface and the
boundary particle. The boundary wall is assumed to be located in the midst of these
symmetrically deformed particles. The temperature difference, overlap and contact
area are formulated in Eqs. (13)–(15), where T ws and T w refer to the temperature at
the wall surface and wall temperature, respectively. Subscript g is used to indicate
the ghost particle.
ΔT mg = 2(T m − T ws ) (13)
γ mg = 2(R m − ||xm − xws ||) (14)
R m 1/2
a mg = (γ mg ) (15)
2
At the boundary surfaces, heat transfer between the boundary particle to the wall
can be expressed by two main heat transfer mechanisms, (i) heat is conducted over
flat circle of contact between the particle and adjacent wall surface; (ii) convective
heat transfer, which is dependent on walls’ convection coefficient of h w , takes place
between the wall surface and the wall.
Q m−ws = k m a mg ΔT mg (16)
Q ws−w
= −h w π(a ) (T
mg 2 ws
−T )
w
(17)
The temperature at the wall surface can be obtained for the equilibrium of Eqs. (16)
and (17). The final set of equations, which define the wall-particle interaction, are
the following:
4k m T m + h w πa mg T w
Q = 4ka mg T m − (18)
4k + h w πa mg
m
E
F= R m γ mg . (19)
3(1 − (ν m )2 )
48 G. Küçük et al.
While the particle mechanics approach aims to elucidate the formation and the evo-
lution of the microstructure of the granular media at particle-level, there has been
considerable research directed towards understanding macroscopic behavior of com-
pacted materials. Some of the early work on theoretical modeling of transport prop-
erties are devoted to the estimation of thermal, and electrical conductivity, elastic,
plastic mechanical properties of ordered and disordered arrangements. Originating
from the pair interactions between particles, the macroscopic properties are obtained
using various homogenization techniques and postulating continuum constitutive
laws [31]. In this study, we consider a continuum system that mimics the particle
level description for small strain thermoelasticity, which incorporates the proposed
effective mechanical and thermal properties for granular beds under compaction.
Governing field equations of motion and energy are the following
div (σ ) = 0 (20)
div k grad (T ) = 0 (21)
For the basic problem of one dimensional steady state thermoelasticity of contin-
uum media, where the body forces are neglected, the solution depends linearly on
elastic constants (λ, μ), thermal expansion and conduction coefficients, compaction
strain and thermal gradient. Since ε22 = ε33 = 0 holds, ε11 is referred as ε. Equations
of motion and energy Eqs. (20) and (21) can be rewritten as
model. The effective mechanical properties can be expressed in terms of the applied
stress, σ , and bulk material properties [32],
μ E 1 2E
Cn = 4 =4 = (25)
1−ν 2(1 + ν) 1 − ν 1 − ν2
3 6π σ 1/3
λ̃ + 2μ̃ = Cn (φs Z )2/3 (26)
20π Cn
1
2/3 6π σ 1/3
3λ̃ + 2μ̃ = Cn (φs Z ) (27)
4π Cn
6σ 1/3
k̃ B&O = k
Cn
6σ 1/3
k̃ C&T = 0.9454k
Cn
6σ 1/3
k̃ S&L = 0.8278k
Cn
50 G. Küçük et al.
0.4
PMA
CMA, Chan and Tien’s Sol’n.
0.35 CMA, Bathcelor and O’Brien’s Sol’n.
CMA, Siu and Lee’s Sol’n.
0.3
0.25
Q (W)
0.2
0.15
0.1
0.05
0 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1
ε
Fig. 2 Comparison of continuum solutions adopting different thermal contact models with respect
to particle mechanics solution. Heat versus compaction strain, ε, is evaluated at T2w − T1w = 600 K
4 Results
According to the Hertz theory [35], the collinear contact force between the elastically
compressed particles is a nonlinear function of the overlap, which is generated under
the effect of the external load, between the particles. For the case of thermally-
assisted compaction of granular system of particles, this dependency is altered under
the effect of applied thermal gradient. Figure 3 shows the ratio of force, needed to
compress the system, in particle mechanics approach to the force in conventional
continuum mechanics approach. CMA significantly overestimates the thermal stress
within the chains system, particularly for the range of high thermal gradient and
low mechanical load. Moreover concerning the highly compacted systems CMA
underestimates PMA solution for the system of particles by 10 %.
Similar to compaction force comparison, conventional continuum solution pre-
dicts higher heat transferred values for analogous particles system. It is also shown
in Fig. 4 that as the packing density of the deformed particles system is increased,
conventional continuum mechanics solution becomes more effective in estimating
the particle-level solution.
Under three difference compaction strains, 2.5, 5, 10 %, the effect of wall-particle
interaction is examined through the chain of particles by imposing a thermal gradient
of 300 K between the two boundary walls of the system. For each case, wall heat
transfer coefficient, h w , is ranged from 1 to 107 W/m2 K. Figure 5 indicates the two
limiting cases of perfect insulating and perfect conducting walls.
Thermo-mechanical Behavior of Confined Granular Systems 51
1000 1.1
800 1
0.95
700
0.9
T 2w − T 1 (K)
600
0.85
w
500
0.8
400
0.75
300
0.7
200
0.65
100
0.6
Fig. 3 Comparison of force calculated in PMA and CMA under varying thermal and mechanical
loading conditions
1000 1.04
Q /Q
PMA CMA
900 1.02
800 1
700 0.98
0.96
T 2 − T1 (K)
600
0.94
w
500
0.92
w
400
0.9
300
0.88
200
0.86
100
0.84
Fig. 4 Comparison of heat calculated in PMA and CMA under varying thermal and mechanical
loading conditions
0.014
0.012
0.008
0.006
0.004
ε = 0.025
0.002 ε = 0.05
ε = 0.1
0
0 1 2 3 4 5 6
10 10 10 10 10 10 10
2
h (W/m K)
w
Fig. 5 Correlation between heat and wall conductance at different compaction strains
Under the effect of a modest thermal gradient of 300 K, three different mechanical
loading conditions, 1, 2, 5–10 %, are compared in Fig. 7. In order to compare the
coupled effect of thermal gradient and mechanical deformation, two extreme cases for
wall-particle interactions are also considered. In case of perfect insulating walls, h w is
Thermo-mechanical Behavior of Confined Granular Systems 53
−0.4
−0.6
−0.8
−1
−1.2
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Nondimensional Position [ xi /L0 ]
Fig. 6 Relative displacement of each particle within the chain at ε = 0.025 and T2w − T1w = 300 K
0
ε=0.01, hw=1
Displacement/Total Deformation [ ui / (ε L0 ) ]
−0.5
−0.6
−0.7
−0.8
−0.9
−1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Nondimensional Position [ x i / L 0 ]
Fig. 7 Non-dimensional displacement versus initial position of the particles under different
mechanical loads, with varying heat convection coefficients of the boundary walls
Regarding the system of SS304 spherical particles’ chain Fig. 7 indicates that
nonlinearity in distribution of displacements is more dominant for low mechanical
loading cases.
A recent experimental study on silos of spherical glass particles showed that thermal
cycling, and the difference in thermal expansion properties of the granular material
with respect to its container, significantly affect the packing fractions of granular
materials in the absence of mechanical compaction [25]. In the current study we
focus on the active interval where thermal gradient acts as a dominant mechanism
compared to mechanical deformation. A chain of spherical particles is gradually
consolidated up to a compaction strain of 5 % of their initial length, while thermal
gradient between the two boundary walls is increased to 1000 K.
The ratio of the displacements calculated in PMA to CMA indicates a discrep-
ancy between these two approaches. In Fig. 8 the maximum difference between par-
ticle mechanics approach and conventional continuum mechanics approach is traced.
Under the effect of low mechanical deformation, such as ε < 0.02, and high ther-
mal gradient conditions the continuum solution overestimates the actual position
of particles up to 40 % of the solution provided by particle mechanics approach.
The difference between these two solutions diminishes as the packing density of the
granular system increases.
0.95
Max (UPMA / UCMA)
0.9
0.85
0.8
0.75
0.7
0.65
0.06
1000
800 0.04
600
w w
400
200
0.02 ε
T 2 −T 1 0 0
Fig. 8 Maximum difference between the calculated displacements in particle mechanics and in
conventional continuum mechanics approaches
Thermo-mechanical Behavior of Confined Granular Systems 55
5 Conclusion
Acknowledgments This work has been partially supported by U.S. Army ARDEC grant under
the project titled as: Multifunctional Nanomaterials: Processing, Properties, and Applications. The
authors would also like to acknowledge the support provided by the National Science Foundation
Engineering Research Center for Structured Organic Particle Systems (C-SOPS).
56 G. Küçük et al.
References
22. Majumdar, A., & Tien, C. L. (1991). Fractal network model for contact conductance. Journal of
Heat Transfer (Transactions of the ASME (American Society of Mechanical Engineers), Series
C;(United States), 113(3).
23. Majmudar, T. S., & Behringer, R. P. (2005). Contact force measurements and stress-induced
anisotropy in granular materials. Nature, 435(7045), 1079–1082.
24. Gonzalez, M., & Cuitiño, A. M. (2012). A nonlocal contact formulation for confined granular
systems. Journal of the Mechanics and Physics of Solids, 60(2), 333–350.
25. Chen, K., Cole, J., Conger, C., Draskovic, J., Lohr, M., Klein, K., et al. (2006). Granular
materials: Packing grains by thermal cycling. Nature, 442(7100), 257–257.
26. Vargas, W. L., & McCarthy J. J. (2007). Thermal expansion effects and heat conduction in
granular materials. Physical Review E, 76(4), 041301.
27. Johnson, K. L. (1987). Contact mechanics. Cambridge University press.
28. Lu, Z., Abdou, M., & Ying, A. (2001). 3d micromechanical modeling of packed beds. Journal
of nuclear materials, 299(2), 101–110.
29. Siu, W. W. M., & Lee, S. H.-K. (2004). Transient temperature computation of spheres in
three-dimensional random packings. International Journal of Heat and Mass Transfer, 47(5),
887–898.
30. Walton, K. (1975). The effective elastic moduli of model sediments. Geophysical Journal of
the Royal Astronomical Society, 43(2), 293–306.
31. Markov, K. Z. (2000). Elementary micromechanics of heterogeneous media. In Heterogeneous
media (pp. 1–162). Springer.
32. Norris, A. N., & Johnson, D. L. (1997). Nonlinear elasticity of granular media. Transactions-
American Society of Mechanical Engineers Journal of Applied Mechanics, 64, 39–49.
33. Vargas, W. L., & McCarthy, J. J. (2002). Stress effects on the conductivity of particulate beds.
Chemical Engineering Science, 57(15), 3119–3131.
34. Siu, W. W. M., & Lee, S. H.-K. (2000). Effective conductivity computation of a packed bed
using constriction resistance and contact angle effects. International Journal of Heat and Mass
Transfer, 43(21), 3917–3924.
35. Landau, L. D., & Lifshitz, E. M. (1959). Theory of elasticity. Pergamon Press.
Elastomeric Gels: A Model
and First Results
1 Introduction
2 Governing Equations
In the following sections we present the governing equations of the model, the balance
of mass and the balance of linear momentum and we recall the kinematics needed to
describe the model.
2.1 Kinematics
where the swelling part Fs is assumed isotropic, with λs the swelling stretch. We refer
te reader to [5, 6] for further examples of using the multiplicative decomposition of
the deformation gradient in nonlinear models of polymers and of phase-changing
elastomers.
At a fixed time t ∗ , Fs (X, t ∗ ) represents the distortion of the body only due to
swelling, while Fe (X, t ∗ ) is the subsequent stretching and rotation of the corre-
sponding swollen network structure, representing the mechanical elastic distortion.
Then J e = det Fe , J s = det Fs , and J = J e J s . The right Cauchy–Green strain
tensor is
C = FT F = (Fe Fs )T Fe Fs = FsT FeT Fe Fs = FsT Ce Fs , (2)
where
Ce = FeT Fe (3)
where
then
1 ˙s s−1
Ls = J J I. (9)
3
With reference to Fig. 1, F describes the deformation from the reference state to
the current state, while Fd is the deformation gradient from the dry state (when there
is no solvent) to the current state, F and Fd relate through F0 which is the spontaneous
62 M. de Luca and A. DeSimone
deformation that a dry body undergoes when immersed in a solvent (free swelling
deformation) [13]
Fd = F F0 , (10)
where
F0 = λ0 I. (11)
The free swelling deformation is isotropic, characterized by the free swelling stretch
λ0 . Then
Cd = Fd T Fd = (F F0 )T F F0 = F0 T FT F F0 = F0 T C F0 . (12)
the number of solvent moles n(x, t) absorbed by the solid network per unit current
volume V . The concentration field is a scalar function cc : Ωt × [0, T ] → R. The
mass of solvent in a region Rt ⊂ Ωt is
M(t) = cc (x, t) dV. (14)
Rt
n(X, t)
c(X, t) = , (15)
V0
and
c = J cc . (18)
Similarly,
n
cd = (19)
Vd
cd = J0 J cc = J0 c, (20)
is the concentration in the free swollen reference state per unit reference volume,
while n0
cd0 = (22)
Vd
is the concentration in the free swollen reference state per unit dry volume.
In our model, we impose the constraint of “molecular incompressibility”. This
means that the increase in volume of the body is only due to the solvent that enters
the polymer network. The increase in volume associated with the spontaneous free
swelling deformation described by F0 is
J0 = 1 + ν cd0 , (23)
J s = J0 −1 + νc, or J s J0 = 1 + ν J0 c, (24)
where jc is the relative solvent flux through the boundary of Rt and n is the outward
unit normal to ∂Rt .
Using the formulas for surface and volume change
we obtain
d
J cc (x, t) dV0 = − jc (x, t) · J F−T n0 dA0 , (27)
dt R0 ∂R0
and
d
J cc (x, t) dV0 = − JF−1 jc (x, t) · n0 dA0 . (28)
dt R0 ∂R0
Then, applying the divergence theorem and switching the derivative on the left hand
side with the integral sign we have:
d
c(X, t) dV0 = − Div j(X, t) dV0 , (29)
R0 dt R0
Elastomeric Gels: A Model and First Results 65
where
j(X, t) = JF−1 jc (x, t) (30)
dc
= −Div j. (31)
dt
We introduce the variable φ, the volume fraction occupied by the solid network in
the current configuration, defined as
Vd
φ= , (32)
V
then
V Vd + νn n
φ −1 = = =1+ν = 1 + νcd = 1 + ν J0 c = J0 J s . (33)
Vd Vd Vd
Vd
φ0 = = J0 −1 (34)
V0
φ = J s−1 φ0 . (35)
1
c= (φ −1 − 1), (36)
ν J0
Inserting relation (37) in Eq. (31) we have the mass balance with the polymer fraction
as independent variable:
φ0 φ̇
− = −Div j. (38)
ν φ2
66 M. de Luca and A. DeSimone
It is useful to define the fluid volume fraction to be able to compare our model
with the standard poro-elastic one
Vf
φf = 1 − φ = , (39)
V
f
f V0
φ 0 = 1 − φ0 = , (40)
V0
f
where V0 = V0 − Vd is the fluid volume enclosed by the solid elastic network in the
reference configuration, then thanks to Eqs. (35), (39) and (40) we have
f
φ f = 1 − J s−1 (1 − φ0 ), (41)
and
f
Φ f = J s − 1 + φ0 , (42)
φf φf
e= = (43)
1 − φf φ
The balance of linear and angular momentum are unchanged with respect to the usual
expressions. Time scales associated with fluid diffusion are ususally considerably
larger then those associated with wave propagation, hence inertial effects can be
neglected. The body is subjected to volume forces b and surface forces t, where,
denoting by n the outward unit normal to ∂Ωt , we have
t(n) = T n, (44)
By using the area and volume transformation formulas (26) we obtain the balance
of linear momentum in a fixed region R0 ⊂ Ω0
J T F−T n0 dA0 + J b dV0 = 0. (47)
∂R0 R0
where
P := J T F−T (49)
is the First Piola—Kirchhoff stress tensor and b0 = J b are the reference body forces,
so that
(DivP + b0 ) dV0 = 0. (50)
R0
Due to arbitrariness of R0 , we obtain the local form of the balance of linear momentum
in the reference configuration, which reads
DivP + b0 = 0 (51)
T = TT , (52)
P FT = F P T . (53)
3 Thermodynamics
Thus,
E˙ (R0 ) = I (R0 ) + Q(R0 ) + T (R0 ) (66)
Elastomeric Gels: A Model and First Results 69
By inserting Eqs. (57), (55), (59), and (60) in Eq. (66) we have
ε̇0 dV0 = P : Ḟ dV0 − q0 · n0 dA0 + Q0 dV0 − μ j · n0 dA0 .
R0 R0 ∂R0 R0 ∂R0
(67)
Then, by applying the divergence theorem
ε̇0 dV0 = P : Ḟ − Div q0 + Q0 − Div (μ j) dV0 , (68)
R0 R0
The second law of thermodynamics states that the net entropy production should
always be non negative:
S˙ (R0 ) − J (R0 ) ≥ 0. (70)
From Eq. (69) we can derive Q0 − Div q0 and inserting it in Eq. (73) to obtain
1 1
η̇0 ≥ ε̇0 − P : Ḟ + μ Div j + ∇μ · j + q0 · ∇θ . (74)
θ θ
Then, multiplying by θ each term and bringing all the terms on the right we have
1
ε̇0 − θ η̇0 − P : Ḟ + μ Div j + ∇μ · j + q0 · ∇θ ≤ 0. (75)
θ
We define the Helmotz free energy ψ0 as
ψ0 = ε0 − θ η0 , (76)
70 M. de Luca and A. DeSimone
1
ψ̇0 + θ̇ η0 − P : Ḟ + μ Div j + ∇μ · j + q0 · ∇θ ≤ 0, (78)
θ
that in isothermal condition reduces to
Thanks to the replacement of the mass balance (31) in Eq. (79) we finally have the
thermodynamic constraint
ψ̇0 − P : Ḟ − μ ċ + ∇μ · j ≤ 0. (80)
then
in which we have used relations (49) and (6), and we have defined two new measures
of stress
Pe = J T Fe−T (90)
Me = J FeT T Fe−T . (91)
Elastomeric Gels: A Model and First Results 71
1 s−1 ˙s
P : Ḟ = Pe : Ḟe + Me : J J I (92)
3
1 s−1 ˙s
= Pe : Ḟe + J J trMe (93)
3
= Pe : Ḟe − p̄ J˙s , (94)
1
p̄ = − J s−1 trMe . (95)
3
We define another symmetric measure of stress Se as
1 1
p̄ = − J e trT = − J s−1 Se : Ce . (98)
3 3
Then, recalling relation (3) we find
T
Se : Ċe = Se : (Ḟe Fe + FeT Ḟe ) (99)
T
=S : e
Ḟe F +S :F
e e eT
Ḟ e
(100)
= Se FeT : Ḟ eT + Fe Se : Ḟe (101)
= F S : Ḟ + F S : Ḟ
e e e e e e
(102)
= 2 P : Ḟ .
e e
(103)
We introduce in the model the incompressibility constraint for the elastic polymer
network by imposing
J e = 1. (104)
72 M. de Luca and A. DeSimone
that implies orthogonality between Fe−T and Ḟe . It follows that the introduction of
the incompressibility constraint in the form (105) does not have any effect if added
to Eq. (94) and, we can write
then
Ce−1 : Ċe = 2 Fe−T : Ḟe . (111)
1 e
P : Ḟ = S + p Ce−1 : Ċe − p̄ J˙s . (112)
2
To conclude our computation we observe that
J˙s = ν ċ (113)
1 e
P : Ḟ = S + p Ce−1 : Ċe − ν p̄ ċ. (114)
2
The introduction of Eq. (114) in the free energy inequality (80) gives:
1 e
ψ̇0 − S + p Ce−1 : Ċe − (μ − ν p̄) ċ + ∇μ · j ≤ 0. (115)
2
Elastomeric Gels: A Model and First Results 73
We define
where Seact is called “active stress”, and μact is called “active chemical potential”.
Introducing Eqs. (116) and (117) in Eq. (115), the free energy inequality becomes
1 e
ψ̇0 − S : Ċe − μact ċ + ∇μ · j ≤ 0 (118)
2 act
The independent variables of our model are the elastic deformation represented
by the right Cauchy—Green strain tensor Ce , and the concentration c of the solvent.
Hence we write
∂ ψ̂0 1 ∂ ψ̂0
− Seact : Ċe + − μact ċ + ĵ(Ce , c, ∇μ) · ∇ μ ≤ 0. (124)
∂C e 2 ∂c
Inequality (124) must hold for all values of Ce , c, and ∇ μ. Since Ce and c appear
linearly, the only possibility is that their coefficients vanish. Then inequality (124)
implies
∂ ψ̂0
Seact = 2 , (125)
∂Ce
∂ ψ̂0
μact = , (126)
∂c
74 M. de Luca and A. DeSimone
Recalling definitions (116) and (117), we finally have the expressions for the stress
tensor and the chemical potential:
∂ ψ̂0
Se = 2 − p Ce−1 , (128)
∂Ce
∂ ψ̂0
μ= + ν p̄. (129)
∂c
Combining Eqs. (128) and (98) we obtain
2 ∂ ψ̂0
p = J s p̄ − : Ce , (130)
3 ∂Ce
and the expression of the stress tensor Se depending on the mean (pore) pressure is:
8 ∂ ψ̂0
Se = − J s p̄ Ce−1 . (131)
3 ∂Ce
4 Constitutive Theory
The constitutive theory and the thermodynamic restriction that we propose are based
on ψ0 , the free energy density defined in the reference, free swollen, configuration.
In the literature the free energy density for polymeric swelling material is usually
defined per unit volume in the dry configuration, ψd , and is composed by a term due
to the mixing of the polymer and the solvent, ψd mix , and a term that accounts for the
energy change due to the stretching of the polymer molecules, ψd mech :
Recalling Eqs. (12) and (20) we obtain the free energy density per unit reference
volume as
1
ψ0 (C, c) = ψd (F0 T C F0 , J0 c), (135)
J0
where
ν J0 c 1
ψ0 mix (C, c) = R θ c log +χ , (136)
1 + ν J0 c 1 + ν J0 c
G
ψ0 mech (C, c) = (tr F0 T C F0 − 3). (137)
2 J0
where
Fs = λs I = (J s )1/3 I = (J0 )−1/3 (1 + ν J0 c)1/3 I, (140)
then
G
ψ0 mech (Ce , c) = ((J0 )−2/3 (1 + ν J0 c)2/3 tr F0 T Ce F0 − 3). (141)
2 J0
A1 = a1 ⊗ a1 , and A2 = a2 ⊗ a2 , (143)
76 M. de Luca and A. DeSimone
are the structural tensor that account for the material anisotropy and a1 and a2 are
the material unit vectors defining the direction of the two family of fibres.
To conclude, accounting for all contributions, the energy density for our material
reads
In this section we compute the partial derivatives of the energy density ψ0 that are
necessary to compute the stress and the chemical potential. We start with
where
IAe α = Ce : Aα , with α = 1, 2. (148)
Then
∂ψ0 (Ce , c) ∂ψ0 mix (Ce , c) ∂ψ0 mech (Ce , c) ∂ψ0 fibre (Ce , c)
= + + (149)
∂C e ∂Ce ∂Ce ∂Ce
We compute the partial derivatives of ψ0 with respect to the concentration c:
∂ψ0 mix (Ce , c) ν J0 c 1 χ
= R θ log + + , (150)
∂c 1 + ν J0 c 1 + ν J0 c (1 + ν J0 c)2
and
∂ψ0 fibre (Ce , c)
= 0, (152)
∂c
Elastomeric Gels: A Model and First Results 77
then
∂ψ0 (Ce , c) ∂ψ0 mix (Ce , c) ∂ψ0 mech (Ce , c) ∂ψ0 fibre (Ce , c)
= μ0
+ + + . (153)
∂Ce ∂c ∂c ∂c
The insertion of Eq. (149) in Eq. (128) finally gives the expression of the elastic part
of the second Piola-Kirchhoff stress tensor
G s2/3
Se (Ce , c) = J F0 F0 T − p Ce−1
J0
2
+ 2 k1 (IAe 1 − 1) exp k2 IAe 1 − 1 A1
2
+ 2 k1 (IAe 2 − 1) exp k2 IAe 2 − 1 A2 , (154)
where
J s = J0 −1 (1 + ν J0 c) . (155)
G −2/3 2/3
Se (Ce , φ) = φ φ0 F0 F0 T − p Ce−1
J0
2
+ 2 k1 (IAe 1 − 1) exp k2 IAe 1 − 1 A1
2
+ 2 k1 (IAe 2 − 1) exp k2 IAe 2 − 1 A2 , (156)
since
J s = φ −1 φ0 . (157)
Using Eqs. (96) and (54) we recover the expression of the second Piola-Kirchhoff
stress tensor
S = Fs−1 Se Fs−T = J s−2/3 Se = φ 2/3 φ0 −2/3 Se , (158)
G
S(Ce , φ) = F0 F0 T − p C−1
J0
2
+ 2 φ 2/3 φ0 −2/3 k1 (IAe 1 − 1) exp k2 IAe 1 − 1 A1
2
+ 2 φ 2/3 φ0 −2/3 k1 (IAe 2 − 1) exp k2 IAe 2 − 1 A2 , (159)
78 M. de Luca and A. DeSimone
where
C−1 = φ 2/3 φ0 −2/3 Ce−1 . (160)
The final expression of the chemical potential can be obtained by replacing Eq. (153)
in Eq. (129) to obtain
ν J0 c 1 χ
μ(Ce , c) = μ0 + R θ log + +
1 + ν J0 c 1 + ν J0 c (1 + ν J0 c)2
1 ν G 1/3
+ J0 (1 + ν J0 c)−1/3 tr F0 T Ce F0 + ν p̄, (161)
3 J0
where p̄, defined in Eq. (98), can be explicitly computed once the expression of the
stress tensor Se is known, as in Eq. (154). In terms of the polymer fraction φ, the
chemical potential becomes
In the constitutive theory, we made the assumption that the solvent flux j depends on
the deformation, the concentration, and the gradient of the chemical potential, see
Eq. (122). Following [3] we suppose that the fluid flux obeys a Darcy-tipe relation,
hence it depends linearly on the gradient of the chemical potential
j = −M ∇μ, (163)
where M is the mobility tensor. To respect the constitutive restriction (127) the
mobility M must be a positive definite tensor.
The fluid flux j is the amount of solvent that flows through the boundary of the
body per unit surface and per unit time, then it is measured in mol/(s m2 ). If we
introduce Eq. (129) in Eq. (163) we have
∂ψ0
j = −M ∇ + ν p̄ , (164)
∂c
Elastomeric Gels: A Model and First Results 79
and, in turn,
∂ψ0
ν j = −ν 2 M ∇ + p̄ . (165)
∂νc
We observe that ν j has the dimension of m3 /(m2 s),since the molar volume [ν] =
m3 /mol, that still represents the amount of solvent flowing through the boundary per
unit time and per unit surface, but simplifying we obtain that it has the dimension of
a velocity, i.e. m/s.
Recalling definition (34), (24), and (40), we have
Φ f = νc, (166)
where j/c represents the relative velocity vfs of the fluid with respect to the solid
network that usually appears in the generalized Darcy’s Law
∂ψ0
Φ v = −K ∇
f fs
+ p̄ , (168)
∂Φ f
K = ν 2 M. (169)
∂ψ0
We remark that in Terzaghi’s consolidation theory the term ∂Φ f , which accounts for
adsorption and capillarity, is neglected. In our case this term is not negligible due to
the importance of swelling phenomena.
For isotropic materials the mobility, and also the permeability, can be represented
by scalar quantities, so that the corresponding tensors are
M = m I, K = k I, (170)
and thanks to Eq. (169) we have a scalar relation between mobility and permeability:
k = ν 2 m. (171)
In [4] a linear dependence of the mobility upon the polymer fraction is proposed as
D
m(φ) = γ (φ), (172)
ν Rθ
80 M. de Luca and A. DeSimone
0
0 0.2 0.4 0.6 0.8 1
φ
where k0 is the permeability and e0 the void ratio in the reference state, and M is
an empirical positive coefficient. Using Eqs. (43) and (39) we obtain the relation
between the permeability and φ:
2
1−φ φ
k(φ) = k0 exp M −1 . (175)
1 − φ0 φ0
In the reference state φ = φ0 , and the permeability k assumes its reference value k0 .
In Fig. 2, a comparison between the two laws (173) and (175) is shown. Only for
this purpose arbitrary values of the constant are chosen, namely M = 10−6 , γs = 2,
φ0 = 0.5, k0 = 1. The important common feature of the proposed laws is the increase
of permeability with swelling, which causes a decrease in the polymer fraction.
The first equation is the balance of forces, the second the balance of fluid mass, the
third is the relation between the chemical potential and the polymer fraction and
the last represents the constraint of elastic incompressibility. To close the problem
we need to consider the boundary condition for the displacement and the fluid mass
equations.
Fd = λd I, (177)
Jd − 1
Jd = 1 + ν cd , and cd = , (178)
ν
with Jd = λ3d . When in Eq. (132) we replace Fd and cd with (177) and (178), we
obtain
Rθ 3 1 1 3
ψd (λd ) = λd − 1 log 1 − 3 + χ 1 − 3 + G0 λ2 − 1 .
ν λd λd 2
(179)
82 M. de Luca and A. DeSimone
The minimum of free energy (179) is numerically computed using Newton’s method,
for which expressions of its first and second derivatives with respect to λd are needed:
dψd (λd ) Rθ 1 3 3χ
= 3 λd l log 1 − 3 +
2
+ 4 + 3 G0 λd , (180)
dλd ν λd λd λd
and
d2 ψd (λd ) Rθ 1 9 λd 3 12 χ
= 6 λd log 1 − 3 + 3 − 2 − 5 + 3 G0 . (181)
dλd2 ν λd λd − 1 λd λd
-1
0 1 2 3 4 5
Elastomeric Gels: A Model and First Results 83
-5
0 0.5 1 1.5 2 2.5 3
-1
0 1 2 3 4 5
ple, in Fig. 5, the free energy ψd is computed using χ = 2, and the corresponding
minimizing spontaneous deformation is λ0 = λ∗ = 1.023. The corresponding poly-
mer fraction is φ0 = 0.934, i.e., a very small amount of solvent entered the polymer
network and the material stays almost dry.
7 Numerical Experiments
free. In this section, we want to study the dynamics of the free swelling deformation,
by simulating the hydrogel that, starting from the dry configuration swells until it
reaches the free swelling state. Then, in this case, the initial configuration will be
the dry, not stress free, configuration Ω∞ in Fig. 1. The presented model can be used
to simulate such problem by setting F0 = I, so that F = Fd and ψd = ψ0 , and the
system of equations we are going to solve is the following:
⎧
⎪
⎪DivP + b0 = 0
⎪
⎨− 1 φ̇ = −Div (j)
ν φ2 (182)
⎪
⎪μ = μ0 + R θ log (1 − φ) + φ + χ φ 2 + ν p φ
⎪
⎩ e
J =1
Figure 6 shows the initial dry configuration of the body. It is a cube, whose edge
is 0.1 cm long. For symmetry reasons only one eighth of the cube is shown. The
boundary conditions for the problem are μ = μ0 for all the boundaries of the body,
because we suppose that the body is fully immersed in the pure solvent.
In Fig. 8 the body is fully swollen, the solvent inside the body is in equilibrium
with the solvent outside (they have the same chemical potential), and the polymer
fraction φ = 0.2236 is uniform throughout the body. The corresponding free swelling
deformations in the three directions are the same, confirming that the deformation
is isotropic and uniform, with values λx = λy = λz = 1.6475. The computed values
for the polymer fraction and the deformations coincide with the values for the spon-
taneous deformation computed in the previous section through the minimization of
the free energy.
The numerical simulation of the swelling process allows us to observe the dynam-
ics of the solvent flowing inside the body. The intermediate states between the initial
and the final configuration are not characterized by an isotropic uniform deformation
as shown by Fig. 7. The solvent flows across the boundaries and migrates toward the
inner part of the body due to the gradient of the chemical potential. The diffusion
coefficient D = 5e − 12 m2 /s determines how fast the diffusion process happens.
After 36 h the dynamics of the solvent is over and equilibrium is reached (Fig. 8).
Indentation is one of the most common techniques for the mechanical characteri-
zation of materials. The test can be realized by pressing an indenter with different
possible shapes (spherical, conical, cylindrical flat-ended) on the upper surface of a
block of material. A control parameter can be the indentation depth or the pressure
exerted by the indenter and, from the material response it is possible to compute
material parameters which as the shear modulus and the Poisson coefficient.
When using sharp or spherical indenters the contact area varies during the inden-
tation, while with a flat-ended cylindrical indenter the contact area remains constant
[15]. In the following we will only consider the latter type of indenter, as shown in
Fig. 9.
If the deformation induced by the punch is sufficiently small, then the linear
theory of elasticity is applicable, and provided that the dimensions of the bodies are
large compared with the dimension of the contact area, the stresses in this region are
independent upon the shape of the bodies and the way they are supported far from
the contact area [11].
The displacement variable is u, the small strain tensor is
1
ε= ∇u + ∇uT , (183)
2
and the stress tensor is
ν
σ = 2G ε + tr(ε)I , (184)
1 − 2ν
2Gh
σzz (r, 0) = − √ . (188)
π(1 − ν) a2 − r 2
When the material is incompressible the distribution of pressure under the punch is
obtained by replacing ν = 1/2 in Eq. (188):
4Gh
σzzinc (r, 0) = − √ . (189)
π a2 − r 2
The contact force exerted by the punch on the material can be obtained by integrating
the distribution of pressure over the area of the punch:
2π a 2π a
F=− σzz r dr dα, and F inc = − σzzinc r dr dα, (190)
0 0 0 0
obtaining
4Gah
F= , and F inc = 8Gah, (191)
1−ν
which give the reaction forces that a material exerts against the indenter pressed with
indentation depth h. In Figure 11 we refer to F as F comp , because it is the value for a
compressible material.
To perform the numerical simulation of the indentation test we start from a cylinder
of hydro-gel fully swollen, in its stress-free reference state. The shear modulus we
are using in the numerical computation is G = 105 Pa, and the corresponding initial
polymer fraction φ0 = 0.0615. We suppose that the material is immersed in pure
solvent and it is in equilibrium, so that the chemical potential through all the body
is zero. From this condition a cylindrical flat indenter is pressed on the top center
of the cylinder, as shown in Fig. 10. The radius of the indenter is a = 0.1 cm, the
indentation depth is h = 0.03 cm.
When the indenter is pressed into the hydro-gel, the latter immediately deforms
elastically. The movement of the polymeric network causes a variation in the chemical
potential, that in order to find new equilibrium induces the discharge of the solvent
from the hydro-gel. Figure 10 shows a detail of the loaded area and the flux of the
solvent that is depicted by the arrows, which have the direction of the chemical
potential gradient. The solvent cannot flow through the contact area because the
indenter is considered impermeable. Figure 11 shows the graph of the reaction force
that the hydro-gel exerts against the indenter. It initially responds as an incompressible
elastic material and then the forces relax due to the migration of the solvent until the
new equilibrium is reached. The corresponding F comp is computed using the first of
Eq. (191), with a Poisson coefficient ν = 0.3.
88 M. de Luca and A. DeSimone
Fig. 10 Fully swollen hydro-gel loaded by a cylindrical flat indenter. The indentation causes a
change in the chemical potential, shown by the color bar, that induces a discharge of some solvent
from the body. A detail of the indentation zone is shown, where the arrows have the direction of
the chemical potential gradient. The tip of the indenter is considered impermeable
0.01
h [cm]
0 F gel [N]
F comp [N]
-0.01 F inc [N]
-0.02
-0.03
-0.04
-0.05
-0.06
0 0.5 1 1.5 2 2.5 3 3.5
t [s] 10 4
Fig. 11 Comparison of the reaction force Fgel exerted by the hydro-gel against the indenter with
the analytical values of the reaction forces for a compressible (Fcomp ) and an incompressible (Finc )
material. As expected, when the indentation starts the hydro-gel responds as an elastic (incompress-
ible) material, then after some time the material relaxes due to the discharge of some solvent. h is
the indentation depth
References
1. Argoubi, M., & Shirazi-Adl, A. (1996). Poroelastic creep response analysis of a lumbar motion
segment in compression. Journal of Biomechanics, 29(10), 1331–1339.
2. Biot, M. A. (1972). Theory of finite deformation of porous solids. Indiana University Mathe-
matics Journal, 21(7).
3. Chester, S. A., & Anand, L. (November 2010). A coupled theory of fluid permeation and large
deformations for elastomeric materials. Journal of the Mechanics and Physics of Solids, 58(11),
1879–1906.
Elastomeric Gels: A Model and First Results 89
4. Chester, S. A., & Anand, L. (2011). A thermo-mechanically coupled theory for fluid permeation
in elastomeric materials: Application to thermally responsive gels. Journal of the Mechanics
and Physics of Solids, 59(10), 1978–2006.
5. de Luca, M., & DeSimone, A. (2012). Mathematical and numerical modeling of liquid crystal
elastomer phase transition and deformation. In MRS Proceedings, 1403, 2012. doi:10.1557/
opl.2012.249. Copyright Materials Research Society 2012, Published online by Cambridge
University Press: 2012.
6. de Luca, M., DeSimone, A., Petelin, A., & Čopič, M. (2013). Sub-stripe pattern formation in
liquid crystal elastomers: Experimental observations and numerical simulations. Journal of the
Mechanics and Physics of Solids, 61(11), 2161–2177.
7. Ferguson, S. J., Ito, K., & Nolte, L.-P. (2004). Fluid flow and convective transport of solutes
within the intervertebral disc. Journal of Biomechanics, 37(2), 213–221.
8. Flory, P. J., & Rehner, J, Jr. (1943). Statistical mechanics of cross-linked polymer networks i.
rubberlike elasticity. The Journal of Chemical Physics, 11, 512.
9. Hong, W., Zhao, X., & Suo, Z. (2009). Formation of creases on the surfaces of elastomers and
gels. In Applied Physics Letters, 95
10. Ian, N. (1965). Sneddon. The relation between load and penetration in the axisymmetric boussi-
nesq problem for a punch of arbitrary profile. International Journal of Engineering Science,
3(1), 47–57.
11. Johnson, K. L. (1987). Contact mechanics. Cambridge: Cambridge university press.
12. Kang, M. K., & Huang, R. (2010). A variational approach and finite element implementation for
swelling of polymeric hydrogels under geometric constraints. Journal of Applied Mechanics,
77(6), 61004.
13. Lucantonio, A., Nardinocchi, P., & Teresi, L. (2012). Transient analysis of swelling-induced
large deformations in polymer gels. Journal of the Mechanics and Physics of Solids.
14. Murad, M. A., Bennethum, L. S., & Cushman, J. H. (1995). A multi-scale theory of swelling
porous media: I. application to one-dimensional consolidation. Transport in Porous Media,
19(2), 93–122.
15. Riccardi, B., & Montanari, R. (2004). Indentation of metals by a flat-ended cylindrical punch.
Materials Science and Engineering: A, 381(1), 281–291.
16. Rice, J. R., & Cleary, M. P. (1976). Some basic stress diffusion solutions for fluid-saturated
elastic porous media with compressible constituents. Reviews of Geophysics and Space Physics,
14(2), 227–241.
17. Zhang, J., Zhao, X., Suo, Z., & Jiang, H. (2009). A finite element method for transient analysis
of concurrent large deformation and mass transport in gels. Journal of Applied Physics, 105(9),
093522–093522.
A Tensegrity Paradigm for Minimal Mass
Design of Roofs and Bridges
1 Introduction
Tensegrity structures are very efficient, and tend to provide minimal mass solutions
to structure design under certain conditions. Some tensegrity papers have shown
minimal mass for tensile structures, subject to a stiffness constraint [1]. Some have
shown minimal mass for: compressive loads [2], cantilevered bending loads [3,
4], torsional loads [5], simply-supported bending loads [6], and distributed loads
on simply-supported spans, where significant structure is not allowed below the
roadway, [7]. The present work formulates a parametric design approach to simply-
supported (bridge-like) structures, which produces minimal mass shapes among all
possible tensegrity topologies (configurations of members).
The subject of form-finding of tensegrity structures continues to be an active
research area [8–13]. Particularly interesting is the use of fractal geometry as a form-
finding method for tensegrity structures, which is well described in [2, 3, 5, 14].
Such an optimization strategy exploits the use of fractal geometry to design tenseg-
rity structures, through a finite or infinite number of self-similar subdivisions of
basic modules. The strategy looks for the optimal number of self-similar iterations
to achieve minimal mass or other design criteria. This number is called the optimal
complexity, since this number fixes the total number of parts in the structure. The
‘fractal’ approach to tensegrity form-finding paves the way to an effective implemen-
tation of the tensegrity paradigm in parametric architectural design [9, 10, 15, 16].
Discrete to continuum approaches to trusses and tensegrity structures are available
in [17].
This paper finds the minimum mass design of tensegrity structures carrying simply
supported and distributed bending loads. In [7] numerical solutions where found for a
specified topology, without any theoretical guarantees that those topologies produced
minimal mass. This paper provides more fundamental proofs that provide necessary
and sufficient conditions for minimal mass.
The remainder of the paper is organized as follows. Section 2 describes the topol-
ogy of the tensegrity bridge under examination. For a simply-supported structure of
the simplest complexity, Sect. 3 describes the minimal mass bridge when the admis-
sible topology allows substructure or superstructure (that is, respectively, structure
below and above the roadbed). Section 4 defines deck mass and provides closed-form
solutions to the minimal mass bridge designs when only sub- or super-structure is
allowed. This finalizes the proof that the minimal mass bridge is indeed the substruc-
ture bridge. Section 4 also adds joint mass and shows that the optimal complexity is
finite. In Sect. 5, we describe an application of the above theory to the design of a
tensegirty bridge to be used for a solar panel covering of water canals. Conclusions
are offered at the end.
The tensegrity structures in this paper will be composed of rigid compressive mem-
bers called bars, and elastic tensile members called cables. We will assume that a
tensile member obeys Hooke’s law,
ts = k(s − s0 ), (1)
A Tensegrity Paradigm for Minimal Mass Design of Roofs and Bridges 93
where k is cable stiffness, ts is tension in the cable, s is the length of the cable,
and s0 < s is the rest length of the cable. The tension members cannot support
compressive loads. For our purposes, a compressive member is a solid cylinder,
called a bar. All results herein are trivially modified to accommodate pipes, tubes
of any material, but the concepts are more easily demonstrated and the presentation
is simplified by using the solid bar in our derivations. The minimal mass of a cable
with loaded length s, yielding strength σs , mass density ρs , and maximal tension ts is
ρs
ms = ts s. (2)
σs
The planar bridge topology is considered here to elucidate the fundamental prop-
erties that are important in the vertical plane. We use the following nomenclature,
referring to Fig. 1: (i) a superstructure bridge has no structure below the deck level;
(ii) a substructure bridge has no structure above the deck level; (iii) a nominal bridge
contains both substructure and superstructure; (iv) Y means the design was con-
strained against yielding for both cables and bars; (v) B means the design was con-
strained against yielding for cables and buckling for bars; (vi) n means the number
of self-similar iterations involved in the design (n = 1 in Fig. 1); (vii) p means the
complexity of each iteration in the substructure ( p = 1 in Fig. 1c); (viii) q means
the complexity of each iteration in the superstructure (q = 1 in Fig. 1b); (ix) α is the
aspect angle of the superstructure measured from the horizontal; (x) β is the aspect
angle of the substructure measured from the horizontal.
We define the superstructure bridge of complexity (n, p = 0, q) where the sub-
structure below is deleted. We define the substructure bridge of complexity (n, p,
q = 0) where the superstructure above is deleted. It will be convenient to define the
following constants:
ρb /σb ρb L
ρ= , η= √ . (4)
ρs /σs (ρs /σs ) π E b F
(a)
(b)
(c)
Fig. 1 Basic modules of the tensegrity bridge with: a nominal bridge: n = q = p = 1; b super-
structure: n = q = 1; c substructure: n = p = 1. Compressive members (bars) are heavy black
lines, tensile members (cables) are thin red lines
ρb ρs
nb ns
m= f i bi + ti si , (6)
σb i=1 σs i=1
where (bi , si ) is respectively the length of the ith bar or ith cable, and respectively
( f i , ti ) is the force in the ith bar or cable.
A Tensegrity Paradigm for Minimal Mass Design of Roofs and Bridges 95
Now we consider structures by increasing p, q. This section finds the minimal mass
of substructure, and superstructure bridges with complexity (n, p, q) = (1, p, q), for
any p and q greater then 1.
Refer to Fig. 2 for the notation. The angle between the bars is:
2β
γ = . (7)
p−1
From the equilibrium equations, we obtain the following relations for the forces:
F
f1 = , f2 = 2 f1 , (9)
4 cos β + sin β(p−1
p−2) β
/ sin p−1
f2 β
t2 = , t1 = t2 cos . (10)
2 sin β p−1
p−1
Fig. 2 Notations for forces and lengths of bars and cables for a substructure with complexity n = 1
and p > 1
96 G. Carpentieri et al.
The following theorems and corollaries can be obtained by minimizing the total
masses of the bridge in Fig. 2 (refer to [6] for an extended proof of the above theorem
and following theorems of this section).
Theorem 1 Consider a substructure bridge with topology described by (8), with
complexity (n, p, q) = (1, p, 0) (Fig. 2). At the yielding condition the dimensionless
total mass is:
⎡ ⎤
β β
t0 1 ⎣ ( p − 1) sin β sin p−1 + cos β cos p−1 ⎦
μY (β, p) = +
F 4 cos β sin p−1 β
+ sin β(p−1
p−2)
( p − 1) sin β
+ρ . (11)
4 cos β + sin β(p−1
p−2) β
/ sin p−1
Theorem 2 Consider a substructure bridge with topology defined by (8), with com-
plexity (n, p, q) = (1, p, 0), See Fig. 2. At the buckling condition the dimensionless
total mass is minimized at p = 2 and t0 = 0, where:
1 + tan2 β η tan2 β
μ B (β, p = 2) = + . (14)
4 tan β 2 1 + tan2 β 3/4
Refer to Fig. 3 for the notation. The angle between the bars is:
2α
γ = . (15)
q −1
A Tensegrity Paradigm for Minimal Mass Design of Roofs and Bridges 97
Fig. 3 Notations for forces and lengths of bars and cables for a superstructure with complexity
n = 1 and q > 1
From the equilibrium equations, we obtain the following relations for the forces:
F t2
t2 = , t1 = , (17)
α(q−2) α 2
2 cos α + sin q−1
/ sin q−1
t2 α
f2 = , f 1 = f 2 cos . (18)
2 sin α q −1
q−1
The following theorems and corollaries can be obtained by minimizing the total
masses of the bridge in Fig. 3 (refer to [6] for an extended proof of the above theorem
and following theorems of this section).
1 √ √
μ∗Y (αY∗ , q ∗ ) = (1 + ρ) arctan ρ + ρ , (20)
4
where q ∗ → ∞ and the optimal angle αY∗ is:
√
αY∗ = arctan ρ. (21)
α η cos2 α
μ B (α, q → ∞) = + √ . (23)
4 2 sin α
It is important to consider that, for the solution q → ∞, buckling is not the mode
of failure since the lengths of the bars approaches zero. Also note that at α = 90◦ ,
μ B = π/8.
The left side of Fig. 4 shows a sequence of superstructures under yielding con-
straints, as q increases. From (19) the mass is minimized at q → ∞ and αY∗ =
45◦ (ρ = 1). The right side of Fig. 4 shows a sequence of superstructures under
buckling constraints, as q increases. The mass is minimized at α = 90◦ for q = ∞
(η = 857.71, same steel/steel material as above).
Moreover, the left side of Fig. 5 shows a sequence of substructures under yield-
ing constraints, as p increases. From (11) the mass is minimized at p → ∞ and
βY∗ = 45◦ (ρ = 1). The right side of Fig. 5 shows a sequence of substructures under
buckling constraints, as p increases. The mass is minimized at β = 90◦ for p = 1
(η = 857.71, same steel/steel material as above).
A Tensegrity Paradigm for Minimal Mass Design of Roofs and Bridges 99
Fig. 5 Optimal topologies of substructure bridges with n = 1 under yielding constraints (left) and
buckling constraints (right) for different p, (steel for bars and cables, F = 1 N, L = 1 m)
Proof Michell [18] proved that the minimal mass structure constrained against yield-
ing with hinge/roller boundary conditions has the topology of the right side of Fig. 4
as q → ∞ and α → 90◦ . Theorem 4 provides the same topology for hinge/hinge
constraints.
A Tensegrity Paradigm for Minimal Mass Design of Roofs and Bridges 101
Fig. 6 Minimal mass bridges under, a yielding constrained nominal bridges, b buckling constrained
superstructure bridge and c buckling constrained substructure bridge
Proof Michell [18] obtained these same results by starting with a continuum and
optimizing the shape.
Figure 6a illustrates the minimal mass nominal bridge under yielding constraints
(Theorem 6), leading to complexity (n, p, q) = (1, ∞, ∞). Figure 6b illustrates the
minimal mass superstructure bridge under buckling constraints, leading to complex-
ity (n, p, q) = (1, 0, q → ∞). Figure 6c illustrates the minimal mass substructure
bridge under buckling constraints, leading to complexity (n, p, q) = (1, 1, 0).
The total load that the structure must support includes the mass of the deck, which
increases with the distance that must be spanned between support points of the
structure design (which is determined by the choice of complexity n). We therefore
consider bridges with increasing complexity n. We will show that the smallest n = 1
102 G. Carpentieri et al.
yields smallest structural mass and the largest deck mass. The required deck mass
obviously approaches zero as the required deck span approaches zero, which occurs
as n → ∞. We will show that the mass of the deck plus the mass of the structure is
minimized at a finite value of n.
The deck is composed by 2n simply supported beams connecting the nodes on
the deck.
Let the deck parameters be labeled as: mass m d , mass density ρd , yielding strength
σd , width wd . The mass of one deck section is equal to:
c1 c1 1
m d = 3n + 2n c2 + , (24)
2 2 22n
where:
3 wd g ρd2 L 3 16 σd F
c1 = , c2 = . (25)
8 σd 3 wd g 2 L 3 ρd2
2n m d
μ∗d = . (26)
(ρs /σs ) F L
The total force acting on each internal node on the deck is the sum of the force due
to the external loads and the force due to the deck:
Ftot = F + 2n m d g. (27)
In this case, we make use of the notation illustrated in Fig. 7 in which complexity p is
fixed to be one. Complexity n is defined to be the number of self-similar iterations of
the basic module of Fig. 1c. Each iteration n = 1, 2, . . . generates different lengths
of bars and cables. The lengths at the ith iteration are:
L L
bi = tan β, si = , i = 1 − n. (28)
2i 2i cos β
Observing the multiscale structure of Fig. 7 it’s clear that the number of bars and the
number of cables at the ith self-similar iteration are
n si = 2i , n bi = 2i−1 . (29)
A Tensegrity Paradigm for Minimal Mass Design of Roofs and Bridges 103
Fig. 7 Adopted notations for forces and lengths of bars and cables for a substructure with generic
complexity (n, p, q) = (n, 1, 0)
In this case the total force applied to the bridge structure is given by (27) and then
the forces in each member become:
F + 2n m d g F + 2n m d g
f bi = , t si = . (30)
2i 2(1+i) sin β
Theorem 7 Consider a substructure bridge with deck mass m d and topology defined
by (28), with complexity (n, p, q) = (n, 1, 0), see Fig. 7. The minimal mass design
under yielding constraints is given by:
1 md
μ∗Y = 1 − n 1 + 2n g 1 + ρ, (31)
2 F
Theorem 8 Consider a substructure bridge with topology defined by (28), with com-
plexity (n, p, q) = (n, 1, 0). The minimal mass design under yielding and buckling
constraints is given by:
1 + tan2 β B∗
μ∗B = β1 + ηβ2 tan2 β B∗ , (33)
2 tan β B∗
104 G. Carpentieri et al.
where:
1 md
β1 = 1 − n 1 + 2n g , (35)
2 F
√
1+2 2 1 md
β2 = 1 − 3n/2 1 + 2n g , (36)
7 2 F
1/3
β3 = 216β1 β2 η − β1 + 12 324β1 β2 η − 3β1 β2 η
2 2 3 2 4 4 4 2 2
. (37)
In this case, we make use of the notation illustrated in Fig. 8 in which complexity q
is fixed to be one. Complexity n is the number of self-similar iterations of the basic
module of Fig. 1b at different scales. After the ith self-similar iterations, the length
of the bars and cables for i ranging from 1 to n, are:
L L
bi = , si = i tan α. (38)
2i cos α 2
Fig. 8 Adopted notations for forces and lengths of bars and cables for a superstructure with
complexity (n, p, q) = (n, 0, 1)
A Tensegrity Paradigm for Minimal Mass Design of Roofs and Bridges 105
Observing the multiscale structure of Fig. 8 it’s clear that the number of bars and the
number of cables after the ith self-similar iterations are:
n si = 2i−1 , n bi = 2i . (39)
In this case the total force applied to the bridge structure is given by (27) and then
the forces in each member become:
F + 2n m d g F + 2n m d g
f bi = , t si = . (40)
2(i+1) sin α 2i
Theorem 9 Consider a superstructure bridge with topology defined by (38), with
complexity (n, p, q) = (n, 0, 1), Fig. 8. Under a given total vertical force (27), the
minimal mass design under yielding constraints is given by:
1 md
μ∗Y = 1− n 1 + 2n g ρ (1 + ρ), (41)
2 F
1
α ∗B = arctan , (44)
2
where:
1 md 1
δ1 = 1 + 2n g 1− n , (45)
2 F 2
√
√ 1+2 2 md 1
δ2 = 2 1 + 2n g 1 − 3n/2 . (46)
7 F 2
The proofs of Theorems 9 and 10 are similar to the proofs of Theorems 7 and 8
in Sect. 4.2 and can be founded in [6].
106 G. Carpentieri et al.
Let $ j be the cost per kg of making joints and let $b be the cost per kg of making
bars. Then define Ω = $b /$ j . For perfect joints Ω = 0, for rudely made low cost
joints $ j is small and Ω is larger. Hence Ω is also approximatively the ratio of
material cost per joint divided by material cost per structural member being joined.
Consider the minimal masses of the substructure bridge (μ∗Y ) constrained against
yielding, for the cases with or without deck, see Eq. (31). Assume steel material for
cables, bars and deck beams and set F = 1 N, L = wd = 1 m. Without deck the
optimal aspect angle βY∗ (32) is 35.26◦ . For the case with neither deck nor joint√
mass,
∗
the optimum complexity n is 1, which corresponds to an optimal √ mass μY = 2/2.
As n approaches infinity the mass tends to a limit equal to 2, which is also the
optimal mass for the case with deck mass and perfectly manufactured joints, since
μ∗d approaches zero for n → ∞. Note that with the addition of joint masses as
illustrated in (47), the optimal complexity n ∗ can become a finite value. The above
procedure can be also used for the design under buckling constraints.
Figures 9 (for yielding) and Fig. 10 (for buckling) show the total minimal masses
obtained by using (47). In both Figs. 9 and 10 we also show with red curves the
minimal mass of substructures or superstructures only. In either case, the total mass
of the structure with deck (but no joint mass), is shown by black continuous lines in
Figs. 9 and 10, reaching minimum for an infinite complexity n. It is worth nothing
that, for infinite n, the mass of the deck is zero and the total minimum mass is just
the mass of the bridge structure. Then, with the dotted and dashed lines, we show
that a finite optimal complexity can be achieved if the joint’s masses are considered.
From Fig. 9 note that the minimal mass (μ ∼ = 21) bridge has complexity n = 11 for
Ω = 0.002, and has minimal mass μ ∼ = 15 with complexity n = 12 for Ω = 0.001.
Economic costs would decide if saving 25 % structural mass is worth the extra cost
of improving the joint precision by a factor of 2.
A Tensegrity Paradigm for Minimal Mass Design of Roofs and Bridges 107
Fig. 9 Optimal masses under yielding of the substructures and superstructure (red curve) and total
optimal mass with deck and different joint factors (dashed and dottled curves) for different values
of the complexity n (steel for bars, cables, deck, F = 1 N, L = wd = 1 m)
Fig. 10 Optimal masses under buckling of the substructures (left) and superstructure (right) (red
curves) and total optimal masses with deck and different joint factors (dashed and dotted curves)
for different values of the complexity n (steel for bars, cables, deck, F = 1 N, L = wd = 1 m)
From open canals that bring water to cities all over the world, water evaporation
represents a significant loss of water. This section designs a minimal mass cover for
water canals while using solar panels as the cover. This is not a new idea. Since 2012
India has built solar-panel-covered canals (Gujarata, India, 2012, [19]). The truss
structure used in Gujarata is massive, threatening the economic survivability of the
project. Here we design a minimal mass cover for such canals.
108 G. Carpentieri et al.
The optimal complexity of a deployable support structure for a solar panel cover-
ing of water canals is derived in the following, along with deployable schemes which
are useful for construction, repairs, for sun following, and for servicing. It is shown
that the minimal structure naturally has deployable features so that extra mass is
not needed to add the multifunctional features. The design of bridge structures with
tensegrity architecture will show an optimal complexity depending only on mate-
rial choices and external loads. The minimization problem considers a distributed
load (from weight of solar panels and wind loads), subject to buckling and yield-
ing constraints. The result is shown to be a Class 1 tensegrity substructure (support
structure only below the deck). These structures, composed of axially-loaded mem-
bers (tension and compressive elements), can be easily deployable and have many
portable applications for small spans, or they can be easily assembled for prefabri-
cated component parts for large spans. The focus of this section is an application
of these minimal mass tensegrity concepts to design shading devices to prevent or
reduce evaporation loss, while generating electric power with solar panels as the
cover. While the economics of the proposed designs are far from finalized, we show
a technical solution that uses the smallest material resources, and shows the technical
feasibility of the concept.
Fig. 11 Different configurations of a deployable solar roof for water canals: a open onfiguration,
b transition between open/closed configurations, c closed configuration
(a) (b)
Fig. 13 Details of the canal structure: a deck system, b deformed shape of the deck cross cables
subjected to the solar panel force
The bridge structures are stabilized out of the plane with a set of longitudinal cables
as illustrated in Fig. 12. In particular diagonal cables and horizontal longitudinal
cables (the magenta element showed in Section B-B of Fig. 12) are used to stabilize
out of plane vertical movement.
The deck is composed of different orders of cables (refer to Figs. 12 and 13):
• longitudinal cables: the elements connecting each tensegrity bridge unit along the
length of the canal;
• transversal cables: the elements of each tensegrity bridge lying on the transversal
direction;
• cross cables: the elements that directly carry the solar panel loads and transfer
their weight to the bridge structures.
The total vertical force Ftot can be computed designing the deck diagonal cables
represented in Fig. 13. These cables directly support two different solar panel modules
of sizes by wd /2 (see Fig. 13). We design these cables assuming that, at the fully-
deployed configuration of the structure, the deck diagonal cables are inclined at a
fixed angle αd with respect to the horizontal (Fig. 13). At this configuration the length
and the tensile force in each deck diagonal cable are:
fp wd2 + 2
td = , sd = . (48)
4 sin αd 2 cos αd
A Tensegrity Paradigm for Minimal Mass Design of Roofs and Bridges 111
We can compute the total mass of the deck diagonal cables as:
ρd ρd f p wd2 + 2
m d = 4 td s d = . (49)
σd σd 2 sin αd cos αd
The normalized total mass μ∗d and the total force Ftot can be computed with (26)
and (27), respectively.
The final total mass to be optimized is then the summation of the mass of the
bridge structure (33), the total mass of the deck (26) and the mass of the joints, Ωn n .
Let us now focus our attention on numerical results regarding the optimal design
of real-life roof structures featuring different complexities n. The examined struc-
tures show the following design data: L = 30.48 m, F = 12 kN, wd = 4.88 m,
αd = 1◦ , and the material properties of steel for bars (ρ = 7862 kg/m3 ; σ =
6.9 × 108 N/m2 ; E = 2.06 × 1011 N/m2 ) and Spectra® for cables (ρ = 970 kg/m3 ;
σ = 2.7 × 109 N/m2 ; E = 120 × 109 N/m2 ). We investigate on the optimal values
of the following parameters: μ∗B , μ∗tot and β B∗ , which respectively denote the dimen-
sionless minimal mass of a single bridge unit; the dimensionless minimal mass of the
overall system formed by the bridge and the deck; and the optimal aspect angle of
the bridge structure, under combined yielding and buckling constraints. The optimal
angle β B∗ of the examined structures can be computed from Eq. (34), and/or the plots
in Fig. 14. It is easy to verify that the minimal mas structure shows a markedly stream-
lined profile with β B∗ = 2.18◦ (Fig. 14-left). The global minimum of μ∗tot is attained
in correspondence with the complexity n ∗ = 3, both for Ω = 0 (μ∗tot ∼ = 23.07;
m ∗tot ∼
= 3.0318 kg, cf. Table 1), and for Ω > 0 (Fig. 14-right). The optimal design
leads to 0.10 kg mass per cables and 0.02 kg mass per bars per meter of the canal
lengthwise span (cf. Table 1).
Fig. 14 Left dimensionless masses μ B versus aspect angle β B ; right dimensionless total masses
μtot versus complexity n (η = 7569.04)
112 G. Carpentieri et al.
Table 1 Optimal masses μ∗B (33) and μ∗tot and optimal aspect angles β B∗ (34) of substructure
bridges with steel bars and Spectra® cables, under combined yielding and buckling constraints (B),
for different complexities n
n p Ftot (N) β B∗ (◦ ) μ∗B μ∗tot m ∗tot (kg)
1 1 12019.39 2.06 10.4357 25.4791 3.3480
2 1 12010.97 2.13 15.1186 23.6251 3.1044
3 1 12007.50 2.18 17.2522 23.0724 3.0318
4 1 12006.35 2.21 18.2414 23.1662 3.0441
5 1 12006.03 2.23 18.7080 23.3822 3.0725
6 Concluding Remarks
This paper provides closed form solutions (analytical expressions) for planar minimal
mass tensegrity bridge designs. The forces, locations, and number of members are
optimized to minimize mass subject to buckling and yielding constraints for a planar
structure with fixed-hinge/fixed-hinge boundary conditions.
We present the optimal complexity of the substructure bridge that minimizes the
sum of structural mass, deck mass and joint mass. Making better joints (less joint
mass) results in higher optimal complexity and less mass. So the economic tradeoff
between material cost of the truss structure and costs of making better joints will
lead to the proper trade between mass and labor costs.
We also define a 3D deployable tensegrity structure made of repetitive planar
substructure bridges (spanning the canal in the transversal direction) conveniently
stabilized out of plane with a set of cables, in both the transversal and the longitudinal
direction of the canal. Each planar structure has a self-similar fractal type of topology
generated by the complexity parameter n. The minimal mass solution yields com-
plexity n ∗ which depends upon material properties. Moreover, the topology of the 3D
structure is function of canal width (L), aspect angle (β) of the substructures bridges,
longitudinal aspect angle (αd ) governing the deploy-ability of the structure, the dis-
tance between consecutive repetitive structures in the longitudinal direction (wd ).
The design occupies much less volume and mass than the designs for the most
advanced attempts at energy production and shading over water canals (see Gujarata,
India, [19]). Formulas are given which will allow economic tradeoffs between mate-
rial costs of the structure, the labor cost (assuming price per joint is inversely propor-
tional to mass of the joint) of making more refined joints, and the choice of material
(steel, Spectra® , or other). Implicit in these tradeoffs, the optimized complexity n ∗
of the structure is derived to allow economic decisions on the number of components
(bars and cables) that will minimize mass for the given choice of material and joint
costs.
Numerical and experimental studies on the dynamics of these structures will
follow in subsequent work to impose further design constraints on stiffness issues
(vibrational frequencies, mode shapes [20, 21], displacements for high winds con-
A Tensegrity Paradigm for Minimal Mass Design of Roofs and Bridges 113
ditions, etc.), but the capability of all these choices and adjustments are within the
free parameters of the designs in this paper. The subsequent dynamics approach
will evaluate the value (economics and performance tradeoffs) the use of feedback
control for the deployable and service functions, or to adjust the stiffness of the
structure (varying the prestress of the cables) to modify stiffness or damping after
storm damage.
References
1. Skelton, R. E., & Nagase, K. (2012). Tensile tensegrity structures. International Journal of
Space Structures, 27, 131–137.
2. Skelton, R. E., & de Oliveira, M. C. (2010). Optimal complexity of deployable compressive
structures. Journal of the Franklin, I(347), 228–256.
3. Skelton, R. E., & de Oliveira, M. C. (2010). Optimal tensegrity structures in bending: the
discrete Michell truss. Journal of the Franklin, I(347), 257–283.
4. Nagase, K., & Skelton, R. E. (2014). Minimal mass tensegrity structures. Journal of the Inter-
national Association for Shell and Spatial Structures, 55(1), 37–48.
5. Skelton, R. E., & de Oliveira, M. C. (2010). Tensegrity systems. New York: Springer.
6. Carpentieri, G., Skelton, R. E., & Fraternali, F. (2015). On the minimum mass and optimal
complexity of planar tensegrity bridges, Internal Report of the University of California, San
Diego, Mechanical and Aerospace Engineering, No. 1-2014. www.fernandofraternaliresearch.
com/publications/arxiv_tensegrity_bridges_theory_2014.pdf. Accessed 23 June 2015.
7. Skelton, R. E., Fraternali, F., Carpentieri, G., & Micheletti, A. (2014). Minimum mass design of
tensegrity bridges with parametric architecture and multiscale complexity. Mechanics Research
Communications, 58, 124–132, ISSN 0093-6413, doi:10.1016/j.mechrescom.2013.10.017.
8. Koohestani, K. (2012). Form-finding of tensegrity structures via genetic algorithm. Interna-
tional Journal of Solids and Structures, 49, 739–747.
9. Rhode-Barbarigos, L., Jain, H., Kripakaran, P., & Smith, I. F. C. (2010). Design of tensegrity
structures using parametric analysis and stochastic search. Engineering and Computer, 26(2),
193–203.
10. Sakamoto , T., Ferrè, A., & Kubo, M. (Eds.). (2008). From control to design: Paramet-
ric/algorithmic architecture. Actar, Barcelona.
11. Sokóf, T., & Rozvany, G. I. N.: New analytical benchmarks for topology optimization and their
implications. Part I: bi-symmetric trusses with two point loads between supports. Structural
and Multidisciplinary Optimization, 46, 477–486 (2012).
12. Tilbert, A. G., & Pellegrino, S. (2011). Review of form-finding methods for tensegrity struc-
tures. International Journal of Space Structures, 18, 209–223.
13. Yamamoto, M., Gan, B. S., Fujita, K., & Kurokawa, J. (2011). A genetic algorithm based
form-finding for tensegrity structure. Procedia Engineering, 14, 2949–2956.
14. Fraternali, F., Marino, A., El Sayed, T., & Della Cioppa, A. (2011). On the structural shape
optimization through variational methods and evolutionary algorithms. Mechanics of Advanced
Materials and Structures, 18, 224–243.
15. Fraternali, F., Farina, I., & Carpentieri, G. (2014). A discrete-to-continuum approach to the cur-
vatures of membrane networks and parametric surfaces. Mechanics Research Communications,
56, 18–25.
16. Phocas, M. C., Kontovourkis, O., & Matheou, M. (2012). Kinetic hybrid structure development
and simulation. International Journal of Architectural Computing, 10(1), 67–86.
17. Fraternali, F., & Carpentieri, G. (2013). On the correspondence between 2D force networks
and polyhedral stress functions. International Journal of Space Structures, 29(3), 145–159.
114 G. Carpentieri et al.
1 Introduction
Fig. 1 An arbitrary cut (in red) through a quality tetrahedralization, representing an imaginary
fracture, yields a poor discretization of the fracture faces
Herein we present a few ideas for the simulation of brittle fracture that fall in
the latter category of conforming mesh methods by taking advantage of universal
meshes. Universal meshes are a paradigm for mesh generation that envision the use
of a single “background” mesh (the universal mesh) whose vertices closest to the
crack geometry are perturbed to obtain a subdivision conforming to it. An example of
such a perturbation is illustrated in Fig. 2. Because the same mesh can be deformed to
conform to the geometry of a class of cracks, we say that the mesh is universal for such
a class. The salient features of the method are its robustness, computational efficiency,
and the mesh-independence of the solutions it provides (in fact, convergence).
We provide a description of the algorithm of universal meshes in Sect. 2 followed
by the presentation of the algorithm for the simulation of brittle fracture in Sect. 3.
Later, in Sect. 4, we highlight some applications of universal meshes beyond brittle
fracture. We conclude the manuscript on some recent developments of universal
meshes in three dimensions in Sect. 5.
2 Universal Meshes
We introduce the basic algorithmic ideas behind a universal mesh next. For concrete-
ness, we focus the description on the aspects relevant to crack propagation, bearing
in mind that similar ideas apply equally well to other classes of evolving domains,
such as those encountered in fluid-structure interaction, as discussed later in Sect. 4.
2.1 Algorithm
Ω(t) = D \ Γ (t)
three conditions are satisfied: (1) the background mesh is sufficiently refined in a
neighborhood of Γ (t), (2) all positively cut triangles in Th are acute, and (3) the
curve Γ (t) is sufficiently smooth. This statement was proved for a domain with C 2
boundary (no cracks) in [33, 34]. The numerical examples strongly suggest that this
should also be possible for domains with interfaces, such as cracks.
The obvious way in which a Universal Mesh is useful for the simulation of a prop-
agating crack is by providing a mesh perfectly conforming to the crack at each step
of its evolution. However, there are advantages that are less evident: the conforming
mesh enables us to compute stress intensity factors to any order of accuracy, and
the few mesh changes from step to step make it possible to retain much of the data
structures in the computer implementation. The accuracy in the computation of the
stress intensity factors is a determinant factor in observing convergence of the crack
evolutions for “reasonable” mesh sizes.
In the following we present a numerical algorithm for the simulation of crack
evolution in a restricted set of problems, as introduced in [35]. The presentation of
the algorithm is followed by examples, including the formation of oscillatory crack
paths in quenched plates, which requires very accurate stress intensity factors to
converge.
K I [u] = K c ,
K I I [u] = 0,
C ([0 , − ]) ⊂ C ([0 , ]), ∀− < ,
There are three critical steps in the computation of the evolution of brittle crack
paths: (1) the generation of a triangulation that conforms to the cracked domain, (2)
the calculation of the elasticity fields, and (3) the evaluation of the stress intensity
factors for curvilinear cracks. The steps are highlighted in Fig. 3.
We construct a triangulation that conforms to each cracked domains from a uni-
versal mesh, as described in Sect. 2. To ensure that the elasticity fields are sufficiently
resolved, we draw on a class of finite element methods for domains with corners or
cracks that retain optimal convergence rate for elements of any order in the face of
singular solutions [10], in contrast to standard methods or methods with enrichments.
Lastly, given that we count with higher order solutions of the elasticity fields, we
employ a family of interaction integrals specifically designed to handle curvilinear
cracks [9] which yield stress intensity factors that converge rapidly to the exact ones
(namely, they converge with twice the rate of convergence of the derivatives of the
solution of the elasticity fields; a motivation to the use higher-order finite element
methods).
We approximate the crack set C ([0 , ]) by a cubic spline interpolant Γh of a
(− )/Δ
finite set of crack tips A = {xn }n=0 0 , where 0 is the initial crack length and
Δ > 0 is a crack discretization parameter. For a discrete crack Γh , indicates the
chord length (the length along the polygonal line formed by points in A , plus the
initial crack length 0 ) instead of its length. At any value of ≥ 0 , the algorithm
proceeds through the following three steps:
Universal Meshes for the Simulation of Brittle Fracture … 121
3.3 Examples
We next showcase the application of the algorithm for the simulation of brittle fracture
to two examples. The first example is a crack propagating along a circular arc,
which we compare against an exact solution, and the second is a crack propagating
in a perforated plate undergoing three-point bending, which we compare against
experimental results. We also show (preliminary) results on the application of a
slight modification of this algorithm to the computation of crack paths in a rapidly
quenched plate [8].
122 M.M. Chiaramonte et al.
The displacement and stress fields of an infinite medium that contains a crack shaped
as a circular arc subjected to far-field stresses and traction-free faces was computed
in [25]. The corresponding stress intensity factors can be found in [11]. We use this
solution to construct a loading history that, when is applied as Dirichlet boundary
conditions to a square-shaped domain, as illustrated in Fig. 4a, causes the crack to
(a) (b)
(c)
Fig. 4 The problem of a crack propagating along a circular arc. By imposing appropriate boundary
conditions g(), with the knowledge of the analytical solution as a function of the angle subdued
by the crack, and hence , we can guide the crack to propagate along a circular arc. This prob-
lem provides a benchmark to establish convergence of the computed crack paths. Four level of
refinements were used to perform the convergence of the computed crack path and convergence
rates were observed to be of the order O (h 1 ). a Modeled problem, b convergence of the solution,
c convergence of the crack path
Universal Meshes for the Simulation of Brittle Fracture … 123
propagate along a circular arc. For details on the construction of such a loading
history we refer the interested reader to [35].
Figure 4a shows a square-shaped domain Ω with a pre-existing crack of radius
R = 2 and angular span ϑ0 = π/8.The analyses were carried out with four pro-
gressively refined universal meshes. The coarsest background mesh as well as the
conformed mesh are shown in Fig. 5, and their refinements were constructed by
recursively subdividing each triangle of the background mesh into four similar
ones. The ratio Δ/ h ≈ 2 was kept constant over all simulations, where as usual
h denotes the maximum diameter of an element in a triangulation. As shown in
Fig. 4b, the crack path converges to a circular arc, and the convergence curves for the
L p ([0 , max ]), p = 2, ∞ and H 1 ([0 , max ]) norms are shown in Fig. 4c. Notably,
convergence of the tangents to the crack path is also obtained.
This simple, nonetheless illustrative example, suggests that the algorithm is indeed
convergent, and hence that the computed paths are largely independent of the chosen
mesh.
Fig. 5 In reference to the problem of a crack propagating along a circular arc, as presented in
Sect. 3.3.1, we showcase the universal mesh used for the entire crack propagation simulation (left).
Namely, the single background mesh (left) was used to generate a conforming triangulation to the
family of cracks {Γh } . On the right we showcase the conformed triangulation at the end of the
simulation with the background mesh in red
124 M.M. Chiaramonte et al.
Simulation
Experiment
Fig. 8 Comparison between experimental results and computed crack paths for the problem of
three-point-bending of a perforated plate. Experimental results were digitized from [5, 20]
Fig. 10 Representative results of experiments of wavy crack patterns in rapidly quenched plates
[38]. In both cases shown above, crack propagation along a straight crack is unstable. These cases
correspond to different immersion speeds
Time
Fig. 11 Computed evolution of a thermally driven crack in a quenched plate. The contours show
the temperature profile along the crack with dark blue representing θ − and dark red representing
θ+
5 Outlook
Fig. 12 Vorticity contours for two representative examples of flow past a moving obstacle. The
simulations consist of incompressible viscous flow, computed using a universal mesh together with
Taylor-Hood finite elements. a Flow past a rotating propeller, b flow past pitching airfoils
Universal Meshes for the Simulation of Brittle Fracture … 129
(a) (b)
(c) 10
-1.5
CD
CL
-2
10
-2.5
10
0
10-0.5 10
h
Fig. 13 Numerical simulation of incompressible, viscous flow past an oscillating disk using a
universal mesh. In a, the background mesh adopted during the simulation is shown. In b, a snapshot
of vorticity contours are shown. In c, the convergence of the drag and lift coefficient time series
under mesh refinement is shown. The reported error E is the square root of the integrated squared
error (Ci (t) − C̄i (t))2 , i = L , D, over the time interval [0, 1], relative to a reference solution C̄i (t)
obtained from a fine mesh with h = 0.145. Nearly quadratic convergence is observed. See [16] for
details
The construction of a universal mesh in three dimensions follows the steps described
in Sect. 2. Namely, given a smooth closed surface Γ ⊂ D ⊂ R3 immersed in a mesh
of tetrahedra Th , we first identify a set of faces Γh in Th that lie near Γ . These
faces are then mapped onto Γ via the closest point projection, and nearby nodes are
adjusted via a relaxation step that ensures the quality of the resulting mesh.
In analogy with the algorithm presented in Sect. 2, Γh is chosen as the union
of positive faces of positively cut tetrahedra in Th . A tetrahedron in Th is called
positively cut if it has three nodes on the non-negative side of Γ and one on the
negative side. A face is then called a positive face if it belongs to a positively cut
tetrahedron and all three of its vertices lie on the non-negative side of Γ . The closest
point projection defines a one-to-one mapping between a positive face and its image
on Γ provided that the mesh size is small compared to the local radius of curvature,
and more importantly, provided some dihedral angles in the mesh are acute [21].
130 M.M. Chiaramonte et al.
6
10
5
10
Number of Elements
4
10
3
10
2
10
1
10
0
10
0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Quality
Fig. 14 The top row shows the tetrahedral meshes conforming to the geometry of an elephant in
two different postures. They were obtained from the same background universal mesh, discarding
exterior elements. In the middle row we show two cuts of the mesh displayed in the top right. In
the bottom row we showcase the distribution of the quality of the tetrahedra on a logarithmic scale
Some examples that illustrate the use of a universal mesh in three dimensions
are given in Figs. 14 and 15. In Fig. 14, two meshes of tetrahedra conforming to an
elephant undergoing changes in its posture were obtained from a single universal
mesh. In Fig. 15, the same procedure was used to construct conforming meshes of
tetrahedra of a human upper airway.
Universal Meshes for the Simulation of Brittle Fracture … 131
Fig. 15 The two figures in the middle show conforming meshes for two different configurations
of a human upper airway. The figures on the far left and the far right show a cut through each
tetrahedral mesh. We used a CT scan of a patient as an input for this example. The figure on the
far left shows the contours of the velocity field computed by solving the Navier-Stokes equations
inside. Coupling this tool to a solid mechanics analysis code for the upper airway would be useful
to study the collapse of the upper airway, quite often the area of interest for patients diagnosed with
obstructive sleep apnea
With an eye towards evolving crack fronts in three dimensions, we show next some
early results on how a universal mesh can conform to a smooth curve drawn over
a smooth surface, triangulating the interior of the curve over the surface, and con-
forming the tetrahedra to the surface and the mesh as well, see Fig. 16. To do so, a
planar parametrization of the surface was constructed, and a smooth approximation
of the given curve immersed in it. A conforming surface triangulation to the curve
was then achieved by using a modification of the algorithm in two dimensions, not
described here, and mapping the resulting planar triangulation back to the surface.
132 M.M. Chiaramonte et al.
10 7
10 6
Number of elements
10 5
10 4
10 3
10 2
10 1
10 0
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Quality
Fig. 16 A conforming mesh of the given surface is generated from the background universal mesh,
as shown in the top left and top right figures. We then construct a planar parametrization from the
corresponding surface triangulation, and conform the mapped mesh to the surface in the parametric
planar domain. Mapping then back to the real space we obtain a surface triangulation that conforms
to the curve on the surface, and after a relaxation step of the nodes near the surface to ensure good
quality of the tetrahedra, we obtain the resulting mesh (bottom row). The distribution of the qualities
of tetrahedra in this mesh is shown at the bottom right
Acknowledgments This work was supported by the Office of Technology Licensing Stanford
Graduate Fellowship to Maurizio M. Chiaramonte, the National Science Foundation Graduate
Research Fellowship to Evan S. Gawlik, and the Franklin P. Johnson Jr. Stanford Graduate Fellow-
ship to Hardik Kabaria. Adrian J. Lew acknowledges the support of National Science Foundation;
contract/grant number CMMI-1301396.
References
1. Angelillo, M., Babilio, E., & Fortunato, A. (2012). Numerical solutions for crack growth based
on the variational theory of fracture. Computational Mechanics, 50(3), 285–301.
2. Armando Duarte, C., & Tinsley Oden, J. (1996). An h-p adaptive method using clouds. Com-
puter Methods in Applied Mechanics and Engineering, 139(1–4), 237–262.
3. Azócar, D., Elgueta, M., & Rivara, M. C. (2010). Automatic LEFM crack propagation method
based on local Lepp-Delaunay mesh refinement. Advances in Engineering Software, 41,
111–119.
Universal Meshes for the Simulation of Brittle Fracture … 133
4. Belytschko, T., & Black, T. (1999). Elastic crack growth in finite elements with minimal
remeshing. International Journal for Numerical Methods in Engineering, 45(5), 601–620.
5. Bittencourt, T. N., Wawrzynek, P. A., Ingraffea, A. R., & Sousa, J. L. (1996). Quasi-automatic
simulation of crack propagation for 2D LEFM problems. Engineering Fracture Mechanics,
55, 321–334.
6. Bourdin, B., Francfort, G. A., & Marigo, J.-J. (2000). Numerical experiments in revisited brittle
fracture. Journal of the Mechanics and Physics of Solids, 48(4), 797–826.
7. Camacho, G. T., & Ortiz, M. (1996). Computational modelling of impact damage in brittle
materials. International Journal of Solids and Structures, 33(20–22), 2899–2938.
8. Chiaramonte, M. M., Keer, L. M., & Lew, A. J. (2015). Crack path instabilities in thermoelas-
ticity.
9. Chiaramonte, M. M., Shen, Y., Keer, L. M., & Lew, A. J. (2015). Computing stress intensity
factors for curvilinear cracks. International Journal For Numerical Methods in Engineering.
10. Chiaramonte, M. M., Shen, Y., & Lew, A. J. (2015). The h-version of the method of auxiliary
mapping for higher order solutions of crack and re-entrant corner problem.
11. Cotterell, B., & Rice, J. R. (1965). Slightly curved or kinked cracks. International Journal of
Fracture Mechanics, 16, 155–168.
12. Fraternali, F. (2007). Free discontinuity finite element models in two-dimensions for in-plane
crack problems. Theoretical and Applied Fracture Mechanics, 47(3), 274–282.
13. Gawlik, E. S., & Lew, A. J. (2014). High-order finite element methods for moving boundary
problems with prescribed boundary evolution. Computer Methods in Applied Mechanics and
Engineering, 278, 314–346.
14. Gawlik, E. S., & Lew, A. J. (2015). Supercloseness of orthogonal projections onto nearby finite
element spaces. Mathematical Modelling and Numerical Analysis, 49, 559–576.
15. Gawlik, E. S. & Lew, A. J. (2015). Unified analysis of finite element methods for problems
with moving boundaries. SIAM Journal on Numerical Analysis.
16. Gawlik, E. S., Kabaria, H., & Lew, A. J. (2015). High-order methods for low reynolds number
flows around moving obstacles based on universal meshes. International Journal for Numerical
Methods in Engineering.
17. Germanovich, L., Murdoch, L. C., & Robinowitz, M. (2014). Abyssal sequestration of nuclear
waste and other types of hazardous waste.
18. Giacomini, A., & Ponsiglione, M. (2006). Discontinuous finite element approximation of qua-
sistatic crack growth in nonlinear elasticity. Mathematical Models and Methods in Applied
Sciences, 16(01), 77–118.
19. Gol’dstein, R. V., & Salganik, R. L. (1974). Brittle fracture of solids with arbitrary cracks.
International Journal of Fracture, 10(4), 507–523.
20. Ingraffea, A. R., & Grigoriu, M. (1990). Probabilistic fracture mechanics: A validation of
predictive capability. Technical report, Cornell University.
21. Kabaria, H., & Lew, A. J. (In preparation). Universal meshes for smooth surfaces with no
boundary in three dimensions. International Journal for Numerical Methods in Engineering.
22. Melenk, J. M., & Babuška, I. (1996). The partition of unity finite element method: Basic
theory and applications. Computer Methods in Applied Mechanics and Engineering, 139(1–4),
289–314.
23. Miehe, C., & Gürses, E. (2007). A robust algorithm for configurational-force-driven brittle
crack propagation with r-adaptive mesh alignment. International Journal for Numerical Meth-
ods in Engineering, 72(2), 127–155.
24. Moes, N., Dolbow, J., & Belytschko, T. (1999). A finite element method for crack growth
without remeshing. International Journal for Numerical Methods in Engineering, 46,
131–150.
25. Muskhelishvili, N. I. (Ed.). (1977). Some Basic Problems of the Mathematical Theory of Elas-
ticity: Fundamental Equations, Plane Theory of Elasticity, Torsion, and Bending (translated
from Russian) (2nd ed.). Leyden, The Netherlands: Noordhoff International Publishing.
26. Negri, M. (2005). A discontinuous finite element approach for the approximation of free
discontinuity problems. Advances in Mathematical Sciences and Applications, 15, 283–306.
134 M.M. Chiaramonte et al.
Alexander Mielke
Abstract In the modeling of solids the free energy, the energy, and the entropy play
a central role. We show that the free entropy, which is defined as the negative of
the free energy divided by the temperature, is similarly important. The derivatives
of the free energy are suitable thermodynamical driving forces for reversible (i.e.
Hamiltonian) parts of the dynamics, while for the dissipative parts the derivatives of
the free entropy are the correct driving forces. This difference does not matter for
isothermal cases nor for local materials, but it is relevant in the non-isothermal case
if the densities also depend on gradients, as is the case in gradient thermoplasticity.
Using the total entropy as a driving functional, we develop gradient structures for
quasistatic thermoplasticity, which again features the role of the free entropy. The
big advantage of the gradient structure is the possibility of deriving time-incremental
minimization procedures, where the entropy-production potential minus the total
entropy is minimized with respect to the internal variables and the temperature. We
also highlight that the usage of an auxiliary temperature as an integrating factor in
[30] serves exactly the purpose to transform the reversible driving forces, obtained
from the free energy, into the needed irreversible driving forces, which should have
been derived from the free entropy. This reconfirms the fact that only the usage of
the free entropy as driving functional for dissipative processes allows us to derive a
proper variational formulation.
1 Introduction
The mathematical theory of plasticity has its origin in the 1970 s based on the work
Moreau [22], Johnson [12], and Gröger [10], which treated the small-strain case with
quadrtic energies and fixed elastic domains. They developed a rich theory based on
convex analysis and monotone operators which allowed for significant generaliza-
tions, but still staying in the small-strain regime, see e.g. [1]. Finite-strain elastoplas-
A. Mielke (B)
WIAS, Mohrenstraße 39, 10117 Berlin, Germany
e-mail: [email protected]
ticity also plays a fundamental role in engineering applications, and many algorithms
were derived starting in the 1980s, see e.g. [14, 29]. A major breakthrough was the
discovery in [24, 25] that incremental problems in finite-strain elastoplasticity can be
formulated as minimization problems jointly for the elastic and the plastic updates.
This means that elastoplasticity can be formulated in terms of a generalized gradient
system with a dissipation potential R and a free energy F such that it reads
where u is the displacement, and z contains all internal (dissipative) variables. This
theory even led to the first mathematical existence results for the rate-independent
case, see [9, 13, 20].
However, the whole theory is restricted to the isothermal case, and it remains
a challenging problem to find a corresponding mechanical and mathematical the-
ory for thermoplasticity. There are major differences in the mechanical modeling
between the isothermal and the non-isothermal case. In the former case there is one
free energy, and time-incremental minimization procedures can be formulated by
minimizing the sum of the free energy plus the dissipation in the time step. In the
non-isothermal case one has to take care of the mechanical forces still given by the
free energy, but instead of dissipation one now has to model entropy production.
A time-incremental minimization procedure should involve the entropy production
minus the total entropy. A first step in this direction was done in [30], and here we
connect our work [16, 17] with the latter.
The major observation is that one has to formulate thermoplasticity in a suitable
thermodynamically consistent way, in order to recast it in variational form. For this,
we start from the GENERIC framework (General Equations for Non-Equilibrium
Reversible Irreversible Coupling). For this we use the total energy E and the total
entropy S as functionals with densities E and S, respectively, depending on the
displacement u, the vector z of internal variables, and a thermodynamical variable
r. Using the entropy-production potential P and its Legendre dual P ∗ , we find the
form (cf. [16])
ρü = − Du E (q)−Θ∗Du S (q) = −Du F u, z, Θ(u, z, r) ,
1
ż = ∂ξz Pz∗ q, Dz S (q)− ∗Dz E (q) = ∂ξz Pz∗ q, Dz H (u, z, Θ(u, z, r)) ,
Θ
Δu S(q)[u̇] Δz E(q)[ż] 1 ∂r S
ṙ = − − − div κ(q)∇ ,
∂r S(q) ∂r E(q) ∂r E ∂r E
where F and H are the total free energy and total free entropy expresses in terms
of the temperature θ = Θ(u, z, r), where r is an arbitrary scalar thermodynamic
E (u,z,r)
variable, such that Gibbs’ relation θ = Θ(u, z, r) = DDrrS (u,z,r)
holds.
Free Energy, Free Entropy, and a Gradient Structure for Thermoplasticity 137
The above form clearly shows the role of the free energy as the driving functional
for the reversible elastodynamics, while the free entropy
F(W, θ)
H (u, z, θ) = H(u, ∇u, z, ∇z, θ) dx with H(W, θ) = −
Ω θ
is the driving functional for the dissipative variables z (like the plastic tensor or the
hardening variables). Locally the free entropy is simply given as ‘minus the free
energy divided by the temperature’, but for functional derivatives, which involve
integration by parts, new terms appear and the naive relation Dz H (u, z, θ) =
− 1θ Dz F (u, z, θ) may be wrong. With F (u, z, θ) = Ω F(u, ∇u, z, ∇z, θ) dx we have
1 1
Dz H (u, z, θ) = − Dz F (u, z, θ) + ∂∇z F u, ∇u, z, ∇z, θ ∇ ,
θ θ
where the last term vanishes only in two important cases: (i) in the isothermal case
where ∇θ ≡ 0 and (ii) in the case “local case” where F does not depend on ∇z. In
these two cases, it is correct to use the derivative of the free energy and put the factor
θ into the dual entropy-production potential (thus turning it into a dual dissipation
potential). However, in all other cases, one has to distinguish the free energy and
the free entropy as driving functionals. Moreover, the inverse temperature 1/θ is the
driving force for heat transfer:
An important fact is that the terms on the right-hand side are independent of the choice
of the thermodynamical variable r, see Theorem 3, which gives a great flexibility in
the mathematical approaches.
Turning to the quasistatic case, we drop the interia term ρü and rewrite the remain-
ing system in the form
0 = Du E (q) − A(q)Dr E (q), ż = ∂ηz PZ∗ q; Dz S (q)−B(q)Dr S (q) ,
ṙ = −A(q)∗ u̇ − B(q)∗ ż − C(q)∗ div κ(q)∇(C(q)Dr S(q))
ξr (1)
with A(q)ξr = Dr S (q)
∗ Du S (q), B(q)ξr = Dr Eξr(q) ∗ Dz E (q),
ξr
and C(q)ξr = .
Dr E (q)
138 A. Mielke
Assuming that the first relation, which is static, can be solved in the form u = U(z, r),
Theorem 4 shows that this system can be formulated as a gradient system as follows
ż
= ∂ξ P∗ z, r; DS(z, r) with S(z, r) = S (U(z, r), z, r),
ṙ
where P is the primal entropy-production potential obtained from the dual potential
P ∗ (u, z, r; ξz , ξr ) = PZ∗ (q; ξz −B(q)ξr ) + 21 Ω ∇(C(q)ξr )·κ(q)∇(C(q)ξr ) dx via
Legendre-Fenchel transform.
We discuss the abstract theory along specific thermomechanical examples. The
simplest is a spring-damper system, see Examples 5 and 7. Section 4.1 discusses the
Penrose-Fife model and shows how in [21] the gradient structure is exploited to do a
rigorous homogenization, where the effective entropy functional is obtained by aver-
aging of the free energy density. Section 4.2 treats a time-dependent thermoplastic
model where the gradient structure in terms of the entropy involves a time-dependent
entropy-production because of the elimination u(t) = U(z(t), (t)), where is the
mechanical loading. Finally, a plastic model with thermal expansion is considered
in Sect. 4.3.
For all these models we need a specific and problem-dependent choice of the
thermodynamic variable r, which highlights the importance of a clear modeling in
terms of the free energy and free entropy giving the driving forces Du E(u, z, r) −
Θ(u, z, r)∗Du S(u, z, r), Dz S(u, z, r) − Θ(u,z,r)
1
∗Dz E(u, z, r), and 1/θ = ∂∂rr E(u,z,r)
S(u,z,r)
.
As a first example we treat the heat equation with energy density e and heat flux q:
ė + div q = 0 in Ω, q · n = 0 on ∂Ω.
Free Energy, Free Entropy, and a Gradient Structure for Thermoplasticity 139
Of course, by Gibbs’ relation θds = de we have the compatibility Θ(r) = E (r)/S (r).
Often this last relation is seen as the definition of the temperature. Note that already
here the inverse of the absolute temperature plays the role of an integrating factor
such that 1/θ de is the total differential ds, cf. [5].
In the classical form of the heat equation, the heat flux q is a linear function of
the temperature gradient, which is called Fourier’s law. In terms of r we arrive at
E (r)ṙ − div k(r)∇Θ(r) = 0, (3)
where k ∈ Rd×d is the symmetric and positive definite heat conductivity matrix.
However, for a proper coupling to other mechanical effects, we want to have a
gradient flow formulation in terms of the total entropy S as a driving functional,
while the total energy E should be conserved
S (r) = s(r(x)) dx and E (r) = E(r(x)) dx.
Ω Ω
where K is a selfadjoint positive definite operator that maps the field ξr = DS (r)
to the rate ṙ, where ξr is the thermodynamical driving force associated with r. The
operator K will be called Onsager operator, since Onsager showed that such linear
operators should be symmetric. Indeed, in [23] the symmetry K = K∗ is called
“reciprocal relation”.
The positive semidefiniteness ξr , K(r)ξr ≥ 0 guarantees that the second law
of thermodynamics is satisfied. Note that energy conservation needs the relation
K(r)DE (r) ≡ 0. Using the variational derivative DE (r) ≡ E (r) we see that the
only Onsager operators which are compatible with the usual heat equation (3) and
the gradient structure (4) have the form
1 ξr
K(r)ξr = − div κ(r)∇ ,
E (r) E (r)
As a result we see that the heat equation takes the general structure
1 S (r)
ṙ = − div κ(r)∇ = K(r)DS (r),
E (r) E (r)
since DS (r) ≡ S (r). In this general form we see that S (r)/E (r) = 1/Θ(r) is the
function under the spatial gradient, i.e. the heat flux has the form
q = κ(r)∇ 1/θ) = −k∇θ with κ(r) = k(r)Θ(r)2 .
Thus, we see that κ has to be chosen as k(r)Θ(r)2 . More importantly, we see that
the inverse temperature 1/θ is the driving force for energy flow, independently of the
choice of the scalar thermodynamical variable r.
To connect our theory to the work in [30] we introduce the dual entropy-production
potential (EPP), also called kinetic potential there, namely
ξ ξ
1 1
P ∗ (r, ξ) := ξ, K(r)ξ = ∇ · κ(r)∇ dx.
2 2 Ω E (r) E (r)
(i) ṙ = ∂ξ P ∗ (r, DS (r)), (ii) ṙ = Arg minv P(r, v) − DS (r), v,
(iii) ∂ṙ P(r, ṙ) = DS (r), (iv) DS (r) ∈ Arg minξ P ∗ (r, ξ) − ξ, ṙ.
Here the equivalence of (i) and (iii) is the Fenchel equivalence for the Legendre
transformation, while (ii) and (iv) are simply equivalent to (i) and (iii), respectively,
using the convexity of the EPPs P and P ∗ . To calculate the rate ṙ from the nonlo-
cal minimum principle (ii), the following local inf-sup formulation was introduced
in [30]:
(
ξ, ṙ) solves inf sup ξ, v − P ∗ (r, ξ) − DS (r), v .
v ξ
Free Energy, Free Entropy, and a Gradient Structure for Thermoplasticity 141
We now consider general elastic materials with internal parameters describing dis-
sipative effects such as plasticity, phase transformation, damage, magnetization, or
polarization, see e.g. [20]. We follow the approach presented in [16] but do not
emphasize the very useful framework GENERIC explicitly. Nevertheless we will
see remainders of the reversible (i.e. Hamiltonian) dynamics in the quasistatic elas-
tic force balance and of the irreversible dynamics in the flow rules for the internal
variable z and the heat equation.
We consider a body in the reference configuration Ω, which is a bounded domain
with Lipschitz boundary. The displacement is denoted by u : Ω ⊂ Rd → Rd , and
e(u) = 21 (∇u+∇uT ) is the linearized strain tensor. All the internal variables (also
called dissipative variables) are included in the variable z : Ω → Rm , which may
include plastic strains, phase indicators, or damage variables. By a general scalar
field r : Ω → R we describe the thermodynamical properties, e.g. r can be either
the temperature θ, the internal energy density e, or the entropy density s.
We consider a closed system, which means that we have no-flux boundary con-
ditions. The total energy and total entropy are given by
Ekin-pot (u, u̇, z, r) = Ekin (u̇) + E (u, z, r) and S (u, z, r) = S(∇u, z, ∇z, r) dx
Ω
ρ 2
where Ekin (u̇) := |u̇| dx and E (u, u̇, z, r) = E(u, ∇u, z, ∇z, r) dx,
Ω 2 Ω
∂r E(q)
where the consitutive laws E and S are related by Gibbs’ relation θ = Θ(q) := ∂r S(q)
.
Before we discuss the equations of motions for such material models, we introduce
the free energy f and the free entropy s and discuss their role in continuum mechanics:
As is common for the free energy, we also consider the free entropy only as a function
of r = θ and use the densities (with W = (u, ∇u, z, ∇z))
as fields on Ω and define total free energy F and the total free entropy H via
F (u, z, θ) = F(u, ∇u, z, ∇z, θ) dx and H (u, z, θ) = H(u, ∇u, z, ∇z, θ) dx.
Ω Ω
The major point we want to address here is that F and H can serve as driving
functionals, since their partial derivatives with respect to any of the variables
W = (W1 , ..., Wk ) = (u, ∇u, z, ∇z) are independent of the particular choice of the
thermodynamic quantity r. The physical requirement for a driving force is that it takes
the same physical value, independent of the choice of the thermodynamic quantity.
The main observation is the following lemma which relies on the Gibbs relation.
Proof For the first result, we first establish the Gibbs relation using the chain rule:
∂ρ
E(W, ρ) ∂r E(W, R(W, r))∂ρ R(W, ρ) ∂r E(W, R(W, r))
= = = Θ(W, R(W, ρ)).
∂ρ S(W, ρ) ∂r S(W, R(W, r))∂ρ R(W, ρ) ∂r S(W, R(W, r))
For the driving forces for W we again use the chain rule to obtain
DW
E(W, ρ) = DW E(W, R(W, ρ)) + ∂r E(W, R(W, ρ))DW R(W, ρ),
DW
S(W, ρ) = DW S(W, R(W, ρ)) + ∂r S(W, R(W, ρ))DW R(W, ρ).
i.e. the driving forces coincide as desired. However, for b (z) = 0 we have
However, there is still a major issue when considering fields over a body Ω and
considering the total free energy F and the total free entropy H . If we consider
variations of these functionals the variational derivatives involve integrations by part,
namely
Dz H (u, z, θ) = ∂z H(u, ∇u, z, ∇z, θ) − div ∂∇z H(u, ∇u, z, ∇z, θ) .
Now using the relation H = −F/θ we see that the differentials of F and H are not
simply related by multiplying with temperature, since we have
1 1
Dz H (u, z, θ) = − Dz F (u, z, θ) + ∂∇z F(u, ∇u, z, ∇z, θ)∇ .
θ θ
The last term, which destroys the naive relation Dz H (u, z, θ) = − 1θ Dz F (u, z, θ),
vanishes in two important cases: (i) in the isothermal case where ∇θ ≡ 0 and (ii) in
144 A. Mielke
the case “local case” where F does not depend on ∇z. In all other cases, we have to
be careful and distinguish the free energy and the free entropy as driving functionals.
In many situations it is helpful to use other thermodynamical fields r instead
of θ, in particular the internal-energy density e = E(W, θ) or the entropy density
s = S(W, θ) are often relevant. For these situations it is better to use the total energy
E and the total entropy S as function of (u, z, r). Hence, we need to adapt the nice
cancellation properties derived in Lemma 1 by introducing a multiplication “∗” for
scalar fields α and variational derivatives δz G as follows:
α ∗ δz G(z, ∇z) (x) := α(x)∂z G(z(x), ∇z(x)) − div α∂∇z G(z, ∇z) (x).
We also write α ∗ Dz G (u, z, r) for α ∗ δz G(u, ∇u, z, ∇z, r) and obtain the important
identities (5a, 5b) below. We should consider “α ∗ δz ” or “α ∗ Dz ” as one operator
acting on functions G or functionals G , respectively; see [20, Sect. 5.3] for a fully
abstract definition.
Proof The right-hand side in the first line can be written in full detail as
RHS := ∂u E(W, r) − Θ(W, r)∂u S(W, r) − div ∂∇u E(W, r) − Θ(W, r)∂∇z S(W, r) ,
where W = (u, ∇u, z, ∇z). Using Lemma 1 we can apply the relation for ∂u and ∂∇u
independently and find
RHS = ∂u F(W, Θ(W, r)) − div ∂∇u F(W, Θ(W, r))
= δu F(W, θ))|θ=Θ(W,r) = Du F (u, z, Θ(u, z, r)).
However, in other cases the usage of Dz F leads to equations that are thermody-
namically inconsistent for the local balance laws, while the total energy conservation
and total entropy production may still be valid, see the discussion in [16, Remark 4.1].
Acoording to [11, 16] the GENERIC framework suggests to write the coupling of
elastodynamics for u, dissipative dynamics for z, and energy transport for r in the
form
ρü = − Du E (q)−Θ∗Du S (q) = −Du F (u, z, Θ(u, z, r)),
1
ż = ∂ξz Pz∗ q, Dz S (q)− ∗Dz E (q) = ∂ξz Pz∗ q, Dz H (u, z, Θ(u, z, r)) ,
Θ
Δu S(q)[u̇] Δz E(q)[ż] 1 ∂r S
ṙ = − − − div κ(q)∇ ,
∂r S(q) ∂r E(q) ∂r E ∂r E
where the directional derivatives Δu S(q)[u̇] and Δz E(q)[ż] are defined via
Here the first equation described elastodynamics and contains the Hamiltonian
part. In particular, we see that the reversible (i.e. Hamiltonian) part of the dynamics
is driven by the derivative Du F (u, z, Θ(u, z, r)) of the free energy F . In contrast,
the dissipative effects described by the internal variable z and the thermodynamical
field r are driven by Dz H (u, z, Θ(u, z, r)) and 1/Θ = ∂r S/∂r E, respectively. In
particular, we can define a joint dual entropy-production potential (EPP) P ∗ via
where indicates the slot, where the corresponding argument (here ξr ) has to be
inserted.
We now discuss the two first terms on the right-hand side of the energy balance
for r, namely Δu S(u, z, r)[u̇]/∂r S and Δz E(u, z, r)[ż]/∂r E. The first term can be
seen as the latent-heat production term that is dual to the term ∂∂rr ES ∗ Du S in the
linear momentum balance and thus belongs to the reversible (=Hamiltonian) part of
dynamics. In particular it disappears completely if we choose r = s, which means
that it does not change the entropy. We refer to [16] for more details. In contrast, the
146 A. Mielke
Before restricting to the quasistatic case with ρ = 0 we look at the total energy
balance and the total entropy production using the given abstract form. First we
observe that along solutions q(t) = (u(t), z(t), r(t)) we have
d ρ 2
|u̇| dx + E (q(t))
dt Ω 2
= ρü · u̇ dx + Du E (q), u̇ + Dz E (q), ż + Dr E (q), ṙ
Ω
= −Θ ∗ Du S , u̇ + D∂rz EE , − Δ0u S[u̇] + M(q)∗ ∂η P0∗ q, M(q) D∂zrS
(1)
∂r S S
Dz E Dz S (3)
(2) ∗
= 0 + 0 + M(q) Dr E , ∂η P0 q, M ∂r S = 0,
where we used the momentum balance and (6) in (1). Equality (2) follows from
Gibbs’ relation Θ = ∂r E/∂r S and the definition of “∗”, whereas (3) uses the
special form of M giving M(q)(Dz E , Dr E )T = (0, 1)T and the energy conserva-
tion (0, 1) , ∂η P ∗ (q, ξ) = 0, which follows from the translationally symmetry
P0∗ (q, η+λ(0, 1)T ) = P0∗ (q, η) for all λ ∈ R.
Similarly, the total entropy production can be calculated as follows:
d
S (q(t)) = Du S (q), u̇ + Dz S (q), ż + Dr S (q), ṙ
dt 0 Dz S
= Du S (q), u̇ + D∂zrS
(i) ∗ ∗
S
, Δu S[u̇] + M(q) ∂η P0 q, M(q)
− ∂r S ∂r S
Dz S (ii)
= 0 + 0 + M(q) Dr S , ∂η P0∗ q, M D∂zrS S
≥ 0,
where we used (6) for (i) and the fact that P0∗ is a dual dissipation potential in (ii),
i.e. P0∗ (η) ≥ P0∗ (0) = 0 and convexity of P0∗ imply η, ∂P0∗ (η) ≥ 0.
Subsequently we choose the quasistatic approximation and neglect the kinetic energy,
i.e. we set the density ρ = 0. It was shown already in [17] that, after elimination of the
displacement u, the remaining equation for (z, r) is a gradient system if one uses the
specific choice r = s (the density of the entropy). Here we follow [16] and show that
Free Energy, Free Entropy, and a Gradient Structure for Thermoplasticity 147
the result holds for any choice of r, which is extremely helpful, since traditionally
one prefers r = θ (the temperature) and more recently also the choice r = e (the
density of the internal energy), but general r gives more flexibility, see e.g. Sects. 4.2
and 4.3. To simplify the formulas we restrict our subsequent discussion to the simpler
case
∗ ∗ 1
P0 (q; ηz , ηr ) = PZ (q; ηz ) + ∇ηr · κ(q)∇ηr dx.
Ω 2
still displaying the driving forces in terms of free energy and free entropy. However,
the special GENERIC structure discussed in [16, Sect. 2.4] guides us to write the
system in the form
where the operators A(q), B(q), and C(q) are defined via
ξr ξr ξr
Aξr = Dr S (q)
∗ Du S (q), Bξr = Dr E (q)
∗ Dz E (q), Cξr = Dr E (q)
. (8d)
For the following we assume that (7a) or (8a) can be solved uniquely in the form
u = U(z, r). As a shorthand, we also write q = Q(z, r) = (U(z, r), z, r).
Theorem 4 Assume that the mapping ξr → ξr + Dr U(z, r)∗ A(Q(z, r))ξr is invert-
ible and denote the inverse by J(z, r). Defining the functionals
Proof The last relation follows from the definition of P∗ and the identities
which is the desired Eq. (8c), because of u̇ = Dz U(z, r)ż + Dr U(z, r)ṙ.
z(t) u(t)
where α is the thermal expansion coefficient. The classical force balances are
1 2 1 2c
E(u, z, θ) = u + (u−z)2 + θ3/2 and S(u, z, u) = 2cθ1/2 − αu.
2 2 3
The driving force for the damper is the derivative of the free entropy with respect to
z, which is
This is consistent with the choice of the EPP which differs from the dissipa-
tion potential R(ż) = μ2 ż2 by a factor of temperature, namely P(θ; ż) = 2θ
μ 2
ż and
θ 2
P ∗ (θ; ξ) = 2μ ξ . Together, the equations take the form (8), namely
0 = Du F(u, z, θ) = 2u−z+αθ,
θ 1
ż = ∂z H(u, z, θ) = (u−z),
μ μ
αθ1/2 z−u
θ̇ = −A(u, z, θ)∗ u̇ − B(u, z, θ)∗ ż, where A(q) = − and B(q) = 1/2 .
c cθ
There is no heat conduction term, since the temperature is the same in the whole
system.
For the reduction we immediately find u = U(z, θ) = 21 (z−αθ). Inserting this
relation into the above system we see that the reduced ODE is given by
1 α2 −z
ż = − (z+αθ) and 1+ θ1/2 θ̇ = ż, (10)
2μ 2c 2cθ1/2
1 2 α2 2 2c 3/2 α α2
E(z, θ) = z + θ + θ , S(z, θ) = − z + θ + 2cθ1/2 ,
4 4 3 2 2
θ 2 z
P∗ (θ; ξz , ξθ ) = ξz − B(z, θ)ξθ , B(z, r) = .
2μ 2cθ + α2 θ
1/2
is the same as (10), while the individual driving forces Dz S(z, θ) = −α/2 and
Dθ S(z, θ) = c/θ1/2 +α2 /2 are different from what one might naively expect.
The above abstract result is a beautiful and mathematically clean reduction of the
quasistatically coupled system of elastostatics and dissipative material behavior to
a perfect gradient system driven by the physical entropy S. However, in practice it
is of limited use because of the involved nonlocal functionals. In particular, U(z, r)
depends nonlocally on (z, r), since it is obtained by solving an elliptic boundary
value problem. As a consequence, the operators J, B, and C are nonlocal as well.
Fortunately, there are cases, where the nonlocality disappears or is reduced to a
minimum. The most important case occurs if the entropy functional is independent
of u:
α ∗ Du S (u, z, r) = 0 for all α =⇒ A(q) ≡ 0.
As a consequence we obtain B(z, r) = B(Q(z, r)), C(z, r) = C(Q(z, r)), and J = id.
Moreover, the elastostatic equation reduces to Du E (u, z, r) = 0. Here we see the
advantage of using general thermodynamical variables r, since the form of A(u, z, r)
strongly depends on r: we have A(u, z, r) ≡ 0 only for specific choices, see Example
7 and Sects. 4.2 and 4.3.
Using the Fenchel equivalence ξ ∈ ∂P(v) ⇔ v ∈ P ∗ (ξ), we find that the system
(8) with A(q) ≡ 0 for q = (u, z, r) can be rewritten as follows:
Dz S (q) I B(q) ∂PZ (q; ż)
Du E (q) = 0, ∈ ∂(ż,ṙ) P(q; ż, ṙ) = .
Dr S (q) 0 I ∂Pheat (q; ṙ+B∗ ż)
We see that both relations are variational in the sense that derivatives of functionals
determine the solutions.
In particular, we can discretize the system in time such that we obtain time-
incremental minimization principles that are useful for proving existence of solutions
or for numerical simulation of concrete models.
Note that we do not enforce energy conservation, which could be done as well.
However, it is better to use the errors in the energy conservation as quality control
for the numerical accuracy, see the example below.
μ 1
(1) Find (zk+1 , sk+1 ) as minimizer of z − z k )2 − s
2Θ(u , s ) τ
k k
subject to s −
B(uk , zk , sk )z = sk − B(uk , zk , sk )zk ;
(2) find uk+1 as minimizer of E(u, zk+1 , sk+1 ).
Here the first minimization problem is quadratic, and the explicit solution can be
determined. In the second minimization problem the functional is cubic in u, so the
unique minimizer uk+1 = U(zk+1 , sk+1 ) can be obtained by solving (12a). Thus, we
find the incremental update formulas
Free Energy, Free Entropy, and a Gradient Structure for Thermoplasticity 153
Fig. 2 Numerical solution q(t) = (u(t), z(t), s(t)) of the ODE (12) for parameters μ = α = 1 and
c = 1/2. Here ε(t) = 106 (
E(q(t))−
E(q(0)))
Fig. 3 Numerical simulation via (13) for time steps τ = 1/30, 1/100, 1/300, 1/1000. Energy
conservation is checked via ετ (kτ ) = (
E(qk )−
E(q0 ))/(τ
E(q0 )), hence ετ (1) ≈ 5τ
Θ(qk ) k Θ(qk ) k 2
zk+1 = zk + τ B(q ), sk+1 = sk + τ B(q ) , uk+1 = U(zk+1 , sk+1 ).
μ μ
(13)
Inserting the explicit form of B we find the relation zk+1 = zk + μτ (uk −zk ), which
is an explicit discretization of (12b). Nevertheless, by construction of our algorithm
we know that it is entropy increasing. Figure 2 shows the numerical solution of
the ODE (12), while Fig. 3 shows numerical approximations obtain via the TIMP
(11), which yields (13). We observe that the scheme is only of first order in the
time step. However, we expect that it is stable even when treating fully nonlinear
thermomechanical systems.
In this section we give three examples of temperature dependent models that can be
rewritten in terms of entropic gradient flows.
This model is without any elastic deformation, so there is no need to eliminate the
variable u and the condition Du S ≡ 0 is trivially satisfied.
154 A. Mielke
ν
E(z, θ) = cθ + ψ0 (z) and S(z, ∇z, θ) = c log θ + c − ψ1 (z) − |∇z|2 . (14)
2
The equations in the variables z and θ take the form
⎧
⎨ ż = mδ S(z, θ)− 1 ∂ E = m νΔz−ψ (z)− 1 ψ (z)
z z
(PF) θ
1
θ 0 (15)
⎩
cθ̇ = −ψ0 (z)ż + div k(z, θ)∇θ .
Almost all mathematical work is restricted to the case E(z, θ) = cθ + λz, which is
physically only relevant in a small temperature range. In particular, the logarithmic
entropy σ(θ) = c log θ is only good for gases, while for solids one should have
σ(0) = 0, e.g. σ(θ) = cθα for α ∈ ]0, 1[ is more appropriate.
In [21] we consider
the internal energy e as thermodynamic variable r = e,
namely E (z, e) = Ω e(x) dx and S (z, e) = Ω S(z, ∇z, e) dx. Indeed, the above
case (14) can be rewritten in terms of e via s = S(z, ∇z, e) = c log(e−ψ0 (z)) −
c log c − ψ1 (z) − α2 |∇z|2 , but much more general functions S are possible.
The Penrose-Fife system (15) can be formulated as gradient system via the EPP
1 1
P ∗ (z, e; ξz , ξe ) = ξ, K(z, e)ξ = m(x, z, e))ξz (x)2 + ∇ξe (x) · κ(z, e)∇ξe (e) dx.
2 2 Ω
There is one special case where the gradient system can be rewritten as an evo-
lutionary variational inequality (EVI), cf. [2, 18]. For this we have to assume that
K (or equivalently P ∗ ) does not depend on the state (z, e). Moreover, one needs
to assume that (z, e) → −S (z, e) is λ-convex, i.e. for some λ ∈ R the function
(z, e) → −S (z, e) − λP(z, e) is convex, where P is the primal EPP. Under these
assumptions, a curve q = (z, e) : [0, T ] → XPF := H1 (Ω) × L1 (Ω) is a solution of
(PF), if and only if
t−s
(EVI) eλ(t−s) P(q(t)−w) + P(q(s)−w) ≤ 0 eλr dr S (q(t)) − S (w)
for all 0 ≤ s < t and all w = ( e) ∈ XPF .
z,
Free Energy, Free Entropy, and a Gradient Structure for Thermoplasticity 155
This variational formulation of the Penrose-Fife model is ideal for coarse graining.
Assuming that the entropy density S, the mobility m, and the heat conduction tensor
κ depend periodically on a microscopical variable in the form
1
Sε = S( 1ε x, z, e) − ∇z·A( 1ε x)∇z, mε (x) = M( 1ε x), κε (x) = H( 1ε x),
2
one can pass to the homogenization limit ε → 0 using the abstract methods for
evolutionary -convergence described in [18]. In [21] it is shown that solutions
(zε , eε ) for the gradient system (XPF , Sε , Pε ) converge to the unique solution (z0 , e0 )
of the limiting gradient system (XPF , S0 , P0 ), if this is true for the initial conditions.
The effective entropy functional S0 and the effective EPP P0 are
S0 (z, e) = Seff (z, e)− 21 ∇z·Ahom ∇z dx and
Ω
∗
P0 (ξz , ξe ) = 2
1
mharm ξz2 +∇ξe ·Hhom ∇ξe dx.
Ω
Here Ahom and Hhom are the classical homogenized effective tensors obtained from
the periodic functions A and H, respectively. Moreover, mharm is the harmonic mean
of M. More interesting is the homogenization of the nonlinear function S to obtain
Seff . Here one takes advantage of the concavity of the mapping e → S(y, z, e). Doing
a partial Legendre transform of −S with dual variable τ , one obtains the free energy
evaluated at θ = −1/τ . After simply averaging F(y, z, −1/τ ) over the periodicity
cell, one can reverse the Legendre transform and obtains Seff (z, e). We refer to [21]
for more details.
where |e|2C = e:Ce and Gibbs’ relation ∂θ Φ = θ∂θ S. Setting E(e, z, θ) = 21 |e−z|2C +
Φ(z, θ), we will explicitly use the decouplings ∂e ∂θ E = 0 = ∂e S.
We note that Du S ≡ 0 implies that the elastic equilibrium takes the form
0 = Du E (t, u, z, θ) − θ ∗ Du S (u, z, θ) = Du E (t, u, z, θ) = − div C(e(u)−z) − (t).
156 A. Mielke
Respecting the energy conservation, we can take the dual EPP P ∗ in the form
ξθ ξθ
P ∗ (q; ξu , ξz , ξθ ) = P0∗ q; ξz − ∗ Dz E (t, q), ,
∂θ Φ(z, θ) ∂θ Φ(z, θ)
where A is a bounded, symmetric, non-negative linear operator from L2 (Ω; Rd×d 0,sym )
into itself and a(t) = K(t) for a suitable bounded linear operator K. Thus, Theorem 4
yields the gradient-flow equation
ż
= ∂ξ P ∗ (z, θ; DS (z, θ))
θ̇
I 0 ∗ I −1
∂θ Φ
∗ Dz E Dz S
= −1 ∂ξ P0 z, θ; .
D
∂θ Φ z
E 1
∂θ Φ 0 1
∂θ Φ Dθ S
1
e(x) := z(x) : (Az)(x) + Φ(z(x), θ(x)),
2
where we note that the nonlocal operator A has to be taken with care. This relation
can be inverted to express the temperature as function of z and e as follows. Denote by
e) the unique solution of
θ = Θ(z, e = Φ(z, θ) and define e)),
e) = S(z, Θ(z,
S(z,
which gives ∂e S(z, e) by Gibbs’ relation ∂θ Φ = θ∂θ S. Then, with
e) = 1/Θ(z, e=
e − 21 z : Az, the total energy, total entropy, and the dual EPP read
E(t, z, e) = e − a(t):z dx, S(z, e) =
S(z, e− 21 z:Az) dx, and
Ω Ω
∗ ∗ ∗ I a(t) ξz
P (t, z, e; ξz , ξe ) := P0 (z, e; ξz +ξe a(t), ξe ) = P0 z, e; .
0 1 ξe
The energy balance dtd E (t, z(t), e(t)) = ∂t E (t, z(t), e(t)) along solutions still follows
∗ (z, e; ξ+(0, λ)T ) = P
from the relation P ∗ (z, e; ξ) for all constants λ ∈ R.
0 0
Free Energy, Free Entropy, and a Gradient Structure for Thermoplasticity 157
We emphasize that (TIMP)∗ is not equivalent to the one proposed in (11), since
here we eliminated u beforehand by using the nonlocal operator A. So, (TIMP)∗
should be preferable if A is available. Again, we observe that the concavity of S
implies that the minimum problem is convex. In the case of viscoplasticity, P 0 is
even strictly convex, so there is a unique minimizer in each time step. Thus, it should
be possible to show existence of solutions for the thermo-viscoplastic system in
(16). Unfortunately, the methods developed in [21] and based on the (EVI) are not
applicable because of the nonquadratic behavior of P 0 due to σyield > 0.
158 A. Mielke
Finally, we consider a classical plasticity model (see e.g. [4]) where thermal expan-
sion leads to a stronger coupling of elastostatics and heat conduction. As usual we
again start with a free energy containing a thermal expansion tensor E ∈ Rd×d
sym in the
form
1 σθ c
F(∇u, z, ∇z, θ) = |e(u)−z|2C + ψ1 (θ)E:e(u) + H(z) + |∇z|2 − θ1+α ,
2 2 α(1+α)
with c > 0 and α ∈ ]0, 1[. We obtain the energy and entropy functionals
1+α
1 1 (θ)E:e(u) + H(z) + cθ
E (u, z, θ) = |e(u)−z|2C + ψ dx,
2 1+α
Ω
c α ν
S (u, z, θ) = θ − ψ1 (θ)E:e(u) − |∇z|2 dx,
Ω α 2
This choice now guarantees that A(u, z, r) ≡ 0 and the reduction to a local gradient
system for (z, r) can be done as described at the end of Sect. 3.3. In particular the
solution u = U(z, r) can be obtained as the unique minimizer of the convex functional
u → E (u, z, r). The corresponding Euler-Lagrange equation reads
− div C(e(u)−z) + cα (r+E:e(u))1/α E = 0.
Acknowledgments The author is grateful to Michael Ortiz for many stimulating discussions on
the interaction of mathematics and mechanics. Moreover, he thanks Ulisse Stefanelli for the fruitful
collaboration concerning variational methods for evolutionary systems. The resarch was partially
supported by the European Research Council via the grant ERC-2010-AdG no.267802 Analysis of
Multiscale Systems Driven by Functionals.
Free Energy, Free Entropy, and a Gradient Structure for Thermoplasticity 159
References
1. Alber, H. -D. (1998). Materials with memory (Vol. 1682), Lecture Notes in Mathematics.
Berlin: Springer.
2. Ambrosio, L., Gigli, N., & Savaré, G. (2005). Gradient flows in metric spaces and in the space
of probability measures., Lectures in Mathematics Basel: ETH Zürich. Birkhäuser Verlag.
3. Bartels, S., & Roubíček, T. (2008). Thermoviscoplasticity at small strains. ZAMM—Journal of
Applied Mathematics and Mechanics, 88, 735–754.
4. Bartels, S., & Roubíček, T. (2011). Thermo-visco-elasticity with rate-independent plasticity
in isotropic materials undergoing thermal expansion. Mathematical Modelling and Numerical
Analysis (M2AN), 45, 477–504.
5. Carathéodory, C. (1909). Untersuchungen über die Grundlagen der Thermodynamik. Mathe-
matische Annalen, 67, 355–386.
6. Carstensen, C., Hackl, K., & Mielke, A. (2002). Non-convex potentials and microstructures
in finite-strain plasticity. Proceedings of the Royal Society of London Series A, 458(2018),
299–317.
7. Dal Maso, G., DeSimone, A., & Mora, M. G. (2006). Quasistatic evolution problems for linearly
elastic-perfectly plastic materials. Archive for Rational Mechanics and Analysis, 180(2), 237–
291.
8. Dal Maso, G., DeSimone, A., & Solombrino, F. (2011). Quasistatic evolution for cam-clay plas-
ticity: a weak formulation via viscoplastic regularization and time parametrization. Calculus
of Variations and Partial Differential Equations, 40(2), 125–181.
9. Gürses, E., Mainik, A., Miehe, C., & Mielke, A. (2006). Analytical and numerical methods
for finite-strain elastoplasticity. In R. Helmig, A. Mielke, & B. Wohlmuth (Eds.), Multifield
problems in solid and fluid mechanics (pp. 443–481). Berlin: Springer.
10. Gröger, K. (1978). Zur Theorie des quasi-statischen Verhaltens von elastisch-plastischen Kör-
pern. ZAMM—Journal of Applied Mathematics and Mechanics, 58(2), 81–88.
11. Hütter, M., & Svendsen, B. (2012). Thermodynamic model formulation for viscoplastic solids
as general equations for non-equilibrium reversible-irreversible coupling. Continuum Mechan-
ics and Thermodynamics, 24, 211–227.
12. Johnson, C. (1976). Existence theorems for plasticity problems. Journal de Mathematiques
Pures et Appliques (9), 55(4), 431–444.
13. Mainik, A., & Mielke, A. (2009). Global existence for rate-independent gradient plasticity at
finite strain. Journal of Nonlinear Science, 19(3), 221–248.
14. Miehe, C., & Stein, E. (1992). A canonical model of multiplicative elasto–plasticity. For-
mulation and aspects of numerical implementation. European Journal of Mechanics endash;
A/Solids, 11, 25–43.
15. Mielke, A. (2003). Energetic formulation of multiplicative elasto-plasticity using dissipation
distances. Continuum Mechanics and Thermodynamics, 15, 351–382.
16. Mielke, A. (2011). Formulation of thermoelastic dissipative material behavior using GENERIC.
Continuum Mechanics and Thermodynamics, 23(3), 233–256.
17. Mielke, A. (2011). On thermodynamically consistent models and gradient structures for ther-
moplasticity. GAMM—Mitteilungen, 34(1), 51–58.
18. Mielke, A. (2016). On evolutionary Γ -convergence for gradient systems. In A. Muntean,
J. Rademacher, & A. Zagaris (Eds.), Macroscopic and large scale phenomena: coarse grain-
ing, mean field limits and ergodicity, Lecture Notes in Applied Math. Mechanics, 3, 187–249.
Springer.
19. Mielke, A. (2013). Thermomechanical modeling of energy-reaction-diffusion systems, includ-
ing bulk-interface interactions. Discrete and Continuous Dynamical Systems—Series S, 6(2),
479–499.
20. Mielke, A., & Roubíček, T. (2015). Rate-independent systems: theory and application. Applied
Mathematical Sciences, 193. Springer.
21. Mielke, A., & Stefanelli, U. (2015). Homogenizing the penrose-fife system via its gradient
structure. In preparation.
160 A. Mielke
22. Moreau, J.-J. (1974). On unilateral constraints, friction and plasticity. In New Variational Tech-
niques in Mathematical Physics (Centro Internaz. Mat. Estivo (C.I.M.E.), II Ciclo, Bressanone,
1973) (pp. 171–322). Rome: Edizioni Cremonese.
23. Onsager, L. (1931). Reciprocal relations in irreversible processes, I+II. Physical Review, 37,
405–426. (part II, 38:2265–2279).
24. Ortiz, M., & Repetto, E. (1999). Nonconvex energy minimization and dislocation structures in
ductile single crystals. Journal of the Mechanics and Physics of Solids, 47(2), 397–462.
25. Ortiz, M., & Stainier, L. (1999). The variational formulation of viscoplastic constitutive updates.
Computer Methods in Applied Mechanics and Engineering, 171(3–4), 419–444.
26. Ortiz, M., Repetto, E., & Stainier, L. (2000). A theory of subgrain dislocation structures. Journal
of the Mechanics and Physics of Solids, 48, 2077–2114.
27. Penrose, O., & Fife, P. C. (1990). Thermodynamically consistent models of phase-field type
for the kinetics of phase transitions. Physica D, 43(1), 44–62.
28. Penrose, O., & Fife, P. C. (1993). On the relation between the standard phase-field model and
a "thermodynamically consistent" phase-field model. Physica D, 69(1–2), 107–113.
29. Simo, J., & Ortiz, M. (1985). A unified approach to finite deformation elastoplastic analysis
based on the use of hyperelastic constitutive relations. Computer Methods in Applied Mechanics
and Engineering, 49, 221–245.
30. Yang, Q., Stainier, L., & Ortiz, M. (2006). A variational formulation of the coupled thermo-
mechanical boundary-value problem for general dissipative solids. Journal of the Mechanics
and Physics of Solids, 54, 401–424.
Comparison of Isotropic Elasto-Plastic
Models for the Plastic Metric
Tensor C p = F pT F p
1 Introduction
Since the early days of the introduction of the multiplicative decomposition into
computational elasto-plasticity, the need was felt to reduce the level of complexity
and to discard the concept of a plastic rotation in the completely isotropic setting.
This means to consider a flow rule not for the plastic distortion F p (9-dimensional)
[5, 6, 11, 30, 34, 36, 43, 44], but to consider directly a flow rule for the plastic
metric tensor C p = F pT F p ∈ PSym(3) (6-dimensional) [1, 10, 38, 39, 45], which
is then automatically invariant under left-multiplication of F p with a plastic rotation.
The plastic distortion is in general incompatible F p = ∇ψ p , as is the plastic metric
C p = ∇ψ pT ∇ψ p . A formulation in the plastic metric C p is particular attractive
because it circumvents problems associated with the intermediate configuration
introduced by the multiplicative decomposition, which is trivially non-unique since
F = Fe · F p = Fe · Q T · Q · F p = Fe∗ · F p∗ , Q ∈ SO(3).
Several proposals with the aim of removing the non-uniqueness of the intermediate
configuration have been given in the literature. Our comparative study is related to
the following models: Simo’s model [41] (Reese and Wriggers [36], Miehe [21]);
Miehe’s model [22]; Lion’s model [17] (Helm [12]), (Dettmer-Reese [6]); Simo
and Hughes’ model [42]; Helm’s model [12] (Vladimirov, Pietryga and Reese [45],
Shutov and Kreißig [39], Reese and Christ [35], Brepols, Vladimirov and Reese [1],
Shutov and Ihlemann [38]); Grandi and Stefanelli’s model [10] (Frigeri and Stefanelli
[7]). All these models are given with respect to different configurations, either the
reference configuration, the intermediate configuration or the current configuration.
In order to be able to compare them, it is necessary to transform all to the same
configuration for that purpose. In our case we choose the reference configuration.
Moreover, any explicit dependence on F p instead of C p in the model formulation
must be able to be subsumed into a dependence on C p alone in the isotropic case. A
major body of our work consists in showing this for the models under consideration.
The paper is structured as follows. After a paragraph giving some definitions
which generalize the concepts from small strain-additive plasticity to finite strain
plasticity we established some auxiliary results. Then we discuss existing
6-dimensional flow rules from the literature. The main properties of the investigated
isotropic plasticity models are summarized in Figs. 1 and 2. Finally, in the appendix,
we obtain explicit formulas for some of the isotropic plasticity models.
In this paper, we use the standard Euclidean scalar product on R3×3 given by
X, Y := tr(X Y T ), and thus the Frobenius tensor norm is X 2 = X, X . The iden-
tity tensor on R3×3 will be denoted by 1, so that tr(X ) = X, 1. We let Sym(3) and
PSym(3) denote the symmetric and positive definite symmetric tensors respectively.
We adopt the usual abbreviations of Lie-group theory. Here and in the following
the superscript T is used to denote transposition, sym X = 21 (X + X T ) denotes the
symmetric part of the matrix X ∈ R3×3 , while dev3 X = X − 13 tr(X ) · 1 represents
the deviatoric (trace free) part of the matrix X .
The classical concept of associated perfect plasticity is uniquely defined in the
case of small strain-additive plasticity. In this case the total symmetric strain is decom-
posed additively into elastic and plastic parts ε = εe + ε p and the rate-independent
evolution law for the symmetric plastic strain ε p is given in subdifferential format
d
[ε p ] ∈ ∂ χ (Σlin ), tr(ε p ) = 0,
dt
Comparison of Isotropic Elasto-Plastic Models for the Plastic … 163
and Σlin := −Dε p [Wlin (ε − ε p )] is the thermodynamic driving stress of the plastic
process. Here, Σlin is clearly symmetric.
In such a way, the principle of maximum dissipation (equivalent to the convexity
of the elastic domain and normality of the flow direction) is satisfied. The structure
of associated flow rules in geometrically nonlinear theories is by far not as trivial as
in the geometrically linear models. However, in this work we use:
Definition 1 (geometrically nonlinear associated plastic flow) We call a plastic flow
rule for some plastic variable P (whether symmetric or not) associated, whenever
the flow rule can be written as
d √ d −1 √
[P] P −1 ∈ ∂ χ (Σ) or P [P ] P ∈ f = ∂ χ (Σ),
dt dt
d d d
[W (F F p−1 (t)] = [W (C C −1
p (t)] = [Ψ (C, C p (t)] ≤ 0
dt dt dt
for all constant in time F (viz. C = F T F), depending in which format the elastic
energy is given.
Definition 4 (Loss of ellipticity in the elastic domain) We say that the elasto-plastic
formulation preserves ellipticity in the elastic domain whenever the purely elastic
response in elastic unloading of the material remains rank-one convex for arbitrary
large given plastic pre-distortion, see [9, 29].
driving the plastic evolution (see e.g. [2–4, 20, 28]), while τ is the Kirchhoff stress
tensor and Σe is defined in Remark 1.
Remark 1 We also need to consider the following elasto-plastic stress tensors:
Note that (1) is not at variance with symmetry of Σ and Σe in case of isotropy.
Using the fact that for given Fe ∈ GL+ (3) it holds FeT S Fe−T 2 ≥ 21 S2 for all
S ∈ Sym(3), the constant being independent of Fe [30], we obtain the estimate
1
dev3 Σe = FeT (dev3 τe )Fe−T ≥ √ dev3 τe ,
2
1
dev3 Σe = dev3 (FeT τe Fe−T ) = FeT τe Fe−T − tr(FeT τe Fe−T ) · 1
3
1
= Fe (τe − tr(τe )) · 1)Fe−T ,
T
3
we have
dev3 Σe = FeT (dev3 τe )Fe−T , dev3 τe = Fe−T (dev3 Σe )FeT , tr(Σe ) = tr(τe ).
where
2 ∂W ∂W 2 ∂W
α1 = 1/2 I2 (Ce ) + I3 (Ce ) , α2 = 1/2 ,
I3 (Ce ) ∂ I 2 (C e ) ∂ I 3 (C e ) I3 (Ce ) 1 (Ce )
∂ I
1/2 ∂W
α3 = −2 I3 (Ce )
∂ I2 (Ce )
dev3 Σe 2 = FeT (dev3 τe ) Fe−T , FeT (dev3 τe ) Fe−T = Be (dev3 τe ), (dev3 τe ) Be−1
= dev3 τe 2 .
166 P. Neff and I.-D. Ghiba
Σ (C C −1
:= 2 C DC [W −1 −1
p )] = 2 C D[ W (C C p )] C p ∈/ Sym(3), (3)
which is not symmetric, in general. For instance, for the simplest Neo-Hooke energy
W (Fe ) = tr(Ce ) = tr(C C −1 −1
p ) we have D W (C C p )=1 and Σ=2 C C p ∈
−1
/ Sym(3).
Lemma 2 Any isotropic and objective free energy W defined in terms of Fe can be
expressed as
(C C −1
W (Fe ) = W T −1
p ) = W (F F(F p F p ) ).
T
(4)
Proof It is clear that any objective elastic energy W (Fe ) which is isotropic w.r.t. Fe ,
can be expressed in terms of the invariants of Ce , i.e.
Therefore, we obtain
F pT ,
Σe = F p−T Σ = F T τe F −T .
Σ
Comparison of Isotropic Elasto-Plastic Models for the Plastic … 167
(C C −1
D F [W T −1
p )], H = D F [ W (F F C p )], H
= D[W (C C −1 −1 −1
p )], F H C p + H F C p
T T
(C C −1
= 2 F sym[D[W −1
p )]C p ], H ,
(C C −1
for all H ∈ R3×3 . In view of Lemma 2 we have W (Fe ) = W p ). Therefore, we
obtain
(C C −1
2 F sym[D[W −1 −T
p )]C p ] = D Fe [W (Fe )] F p ,
and further
(C C −1
FeT D Fe [W (Fe )] F p−T = 2 FeT F sym[D[W −1 −T −1 −1
p )]C p ] = 2 F p C sym[D[ W (C C p )]C p ].
(C C −1
Σe = FeT D Fe [W (Fe )] = 2 F p−T C DC [W −T T
p )] F p = F p Σ F p .
T
Proof Using Cardano’s formula and due to the symmetry of C p , the continuity of
the map t → C p (t) implies the continuity of mappings t → λi (t), i = 1, 2, 3, where
λi (t) ∈ R are the eigenvalues of C p (t). Since λi (0) > 0 and λ1 (t)λ2 (t)λ3 (t) = 1 for
all t > 0, it follows that λi (t) > 0 for all t > 0 and the proof is complete.
We can slightly weaken the assumption in the previous lemma: det C p (t) > 0 for
all t > 0 is sufficient.
168 P. Neff and I.-D. Ghiba
In the remainder of this paper we discuss different proposal from the literature for
plasticity models in C p . Simo [41] (see also Reese and Wriggers [36] and Miehe [21,
p. 72, Proposition 5.25]) considered the spatial flow rule in the form
1
p ∂ τe Φ(τe ) · Be ,
− Lv (Be ) = λ+ (6)
2
the tensor τe = 2 ∂ Be W (Be ) · Be is the symmetric Kirchhoff stress tensor, the yield
function Φ(τe ) = dev3 τe − 23 σy and the plastic multiplier λ+
p satisfies the
Karush-Kuhn-Tucker (KKT)-optimality constraints
λ+
p ≥ 0, Φ(τe ) ≤ 0, λ+
p Φ(τe ) = 0. (7)
d −1
p F [∂ τe Φ(τe ) · Be ] F
[C ] = −2 λ+ −1 −T
dt p
+ −1 dev3 τe
= −2 λp F · Be F −T , (8)
dev3 τe
d −1
[C p ] ∈ −2 F −1 ∂ τe χ (dev3 τe ) · Be F −T , (9)
dt
We deduce (see the model Eq. (5.25) from [21]) an equivalent definition for
Lv (Be ) given by
1 dev3 τe
− Lv (Be ) = λ+ · Be .
2 p
dev3 τe
Comparison of Isotropic Elasto-Plastic Models for the Plastic … 169
Since C p = F T Be−1 F we have Lv (Be )=F d
dt
[C −1
p ] F T =F d
dt
[F −1 Be F −T ] FT .
On the other hand, from (8) it follows that
d −1 dev3 τe
[C p ] C p = −2 λ+ F −1
· Be F
−T
F T Be−1 F
dt p
dev3 τe
∈ −2 F −1 ∂ τe χ (dev3 τe ) F. (10)
Since
d d d
[det C p−1 ] = Cof C p−1 , [C p−1 ] = det C p−1 C p , [C p−1 ]
dt dt dt
−1 d −1
= det C P 1, [C p ]C p , (11)
dt
from the flow rule (8) together with det C p (0) = 1 and tr(F −1 dev3 τe F) = 0 it
follows at once that det C p (t) = 1 for all t ≥ 0.
The next step is to prove that the flow rule (6) implies dtd [W (Fe )] ≤ 0 at fixed F,
i.e. the reduced dissipation inequality is satisfied. We compute for fixed in time F
d d d
[W (F F p−1 )] = D Fe W (Fe ), F [F p−1 ] = D Fe W (Fe ), F F p−1 F p [F p−1 ]
dt dt dt
d d
= FeT D Fe W (Fe ), F p [F p−1 ] = Σe , F p [F p−1 ]
dt dt
d
= −Σe , sym( [F p ]F p−1 ), (12)
dt
Dp
d d d d
[C p ] = [F pT F p ] = [F pT ]F p + F pT [F p ]
dt dt dt dt
−T d d −1
= Fp Fp
T
[F ] F p + F p
T T
[F p ]F p F p = 2 F pT D p F p ,
dt p dt
where D p := sym dtd [F p ]F p−1 . Hence, we easily deduce the representation D p =
1
F −T d [C p ]F p−1 . Therefore, with (12) we obtain
2 p dt
d 1 d
[W (F F p−1 )] = −Σe , F p−T [C p ]F p−1 . (13)
dt 2 dt
170 P. Neff and I.-D. Ghiba
d 1 d 1 d
[W (F F p−1 )] = − FeT τe Fe−T , F p−T [C p ]F p−1 = FeT τe Fe−T , F p [C −1 ]F pT
dt 2 dt 2 dt p
1 d 1 T d −1
= F pT FeT τe Fe−T F p , [C −1 −1
p ] = F τe Be F, [C ]
2 dt 2 dt p
1 d
= τe , F [C −1 ] F T Be−1 . (14)
2 dt p
d dev3 τe
[W (F F p−1 )] = −λ+
p τe , = −λ+
p dev3 τe ≤ 0. (15)
dt dev3 τe
Therefore, in the isotropic case, the flow rule (9) has a subdifferential structure:
d −1
[C ] ∈ −2 F p−1 [∂ Σe χ (dev3 Σe )] F p−T , (17)
dt p
In view of the above equivalent representations of the flow rule, we may summarize
the properties of the Simo-Miehe 1992 model:
(i) from (8) it follows, in the isotropic case (in which τe and Be commute), that
C p (t) ∈ Sym(3);
(ii) plastic incompressibility: from (10) and (11) it follows that det C p (t) = 1, since
the right hand side is trace-free;
Comparison of Isotropic Elasto-Plastic Models for the Plastic … 171
(iii) for the isotropic case, the right hand-side of (8) is a function of C −1
p and C
−1 T −1 −1 −1 T −1
alone, since Be = F C p F and F Be F = F F C p F F = C p C;
(iv) from (i) and (ii) together and using Lemma 4 it follows that C p (t) ∈ PSym(3);
(v) it is thermodynamically correct;
(vi) the right hand side in the representation (8) is not the subdifferential of the
indicator function of some convex domain in some stress space. However, this
model is an associated plasticity model in the isotropic case, see Propositions 1
and 3.
Shutov [37] interpreted that Miehe in [22] considered the flow rule1
d −1
[C ] C p = −λ+
Φ(Σ),
p DΣ (18)
dt p
where Σ (C C −1
= 2 C DC [W p )] and
2 2
= dev3 τe −
Φ(Σ) σy = tr((dev3 2)
Σ) − σy .
3 3
2 ] = dev3 (Σ)
tr[(dev3 Σ) ⇔ (dev3 Σ)
dev3 Σ, T = dev3 Σ,
dev3 Σ
⇔ skew(dev3 Σ)
dev3 Σ, =0 ⇔ ∈ Sym(3)
dev3 Σ
⇔ ∈ Sym(3).
Σ
For the simplest Neo-Hooke elastic energy considered in Appendix A.2, W (Fe ) =
(C C −1
tr(Ce ) = W −1 −1
p ) = 2 tr(C C p ), we have Σ = C C p , which is not symmetric.
1
Hence tr[(dev3 Σ) 2 ] = dev3 Σ.
Let us again remark that Σ
is not necessarily
symmetric for general C p . However, using Lemma 3, we deduce
1 Miehe [22] only defines the elastic domain Ee (Σ, 2 σy2 ) := Σ ∈ R3×3 tr((dev3 Σ)
2 ) ≤ 2 σy2
3 3
in terms of τe , i.e. Ee (τe , 23 σy2 ) = τ ∈ Sym(3) dev3 τ 2 ≤ 23 σy2 . He uses the same notation
for the referential quantities. Therefore, we have two interpretations at hand Φ(Σ) = dev3 τe −
2 2
σy = tr((dev3 Σ) ) − σy . On the other hand, in the isotropic case, we have also Φ(Σ)
2 2 2 =
3 3
dev3 Σe − 2
3 σy2 .
172 P. Neff and I.-D. Ghiba
C p = F pT Σe F p−T C p = F pT Σe F p ∈ Sym(3)
Σ ⇒ · C p ∈ Sym(3),
dev3 Σ
(19)
C −1
p Σ
= C −1
p F pT Σe F p−T = F p−1 Σe F p−T ∈ Sym(3) ⇒ C −1
p
dev3 Σ ∈ Sym(3).
2 ). First,
In the following, we discuss first the sign of the quantity2 F 2 := tr((dev3 Σ)
we deduce
2 ] = (dev3 Σ)
tr[(dev3 Σ) (dev3 Σ),
1 = Σ 1 = C −1
(dev3 Σ),
p Σ (dev3 Σ) C p , 1
= C −1 T −1 T
p Σ (dev3 Σ · C p ) , 1 = C p Σ C p (dev3 Σ) , 1 (20)
= C −1
p Σ C p , dev3 Σ.
C −1 −1 −1 −1 −1
p Σ C p , dev3 Σ = U p U p Σ U p U p , dev3 Σ = U p Σ U p , U p dev3 Σ U p
1
U p , U p−1 Σ
= U p−1 Σ U p − tr(Σ) · 1
3
1
U p , U p−1 Σ
= U p−1 Σ U p − tr(U p−1 Σ U p ) · 1
3
U p , dev3 (U p−1 Σ
= U p−1 Σ U p ) = dev3 (U p−1 Σ
U p ), dev3 (U p−1 Σ
U p )
U p )2 ≥ 0,
= dev3 (U p−1 Σ (21)
d −1 λ+
[C p ] C p = −
p
T
(dev3 Σ)
dt 2]
tr[(dev3 Σ)
d −1 λ+
⇔ Cp [C p ] = −
p
dev3 Σ. (22)
dt 2]
tr[(dev3 Σ)
2 ] = −2 and tr[(dev3 Σ)
we obtain tr[(dev3 Σ) 2 ] = 2 , respectively. Hence, tr[(dev3 Σ)
2 ] is not
3
positive for all Σ ∈ R .
3×3
Comparison of Isotropic Elasto-Plastic Models for the Plastic … 173
d −1 λ+
[C p ] = −
p
C −1 dev3 Σ
dt 2] p
tr[(dev3 Σ)
d λ+
⇔ [C p ] =
p
C p ∈ Sym(3).
(dev3 Σ) (23)
dt 2]
tr[(dev3 Σ)
Remark 3 Although the flow rule considered in this interpretation of the Miehe 1995
model [22] has a subdifferential structure, the yield-function Φ is not convex. Hence,
the flow rule is not a convex flow rule. In order to see the non-convexity of Φ(Σ)
we observe first by looking at sublevel-sets that
2 2
=
Φ(Σ) 2) −
tr((dev3 Σ) σ is convex ⇔ Σ)
Φ( = tr[(dev3 Σ)
2 ] is convex.
3 y
Σ)
The second derivative for the simpler function Φ( is
2 H ∈ R3×3 .
Φ(Σ).(H, H ) = (dev3 H ) , dev3 H = tr[(dev3 H ) ], ∀ Σ,
T 2
DΣ
We know that tr[(dev3 H )2 ] is not positive for all H ∈ R3×3 , since for the previous
considered matrix H , such that
⎛ ⎞
− 21 1 2
dev3 H = ⎝ −2 − 21 3 ⎠ ,
−1 −3 − 21
Σ)
we obtain tr[(dev3 H )2 ] = −2. Therefore Φ( is not convex, and thus Φ(Σ)
cannot
be convex.
This derivation was given by Lion [17, Eq.(47.2)] in the general form (see also [12,
Eq.(6.33)]) and by Dettmer-Reese [6] in the isotropic case. Following [6] we consider
a perfect plasticity model for the plastic metric C p based on the flow rule
d −1
[C ] ∈ −F p−1 ∂ χ (dev3 Σe ) F p−T ∈ Sym(3) for Σe ∈ Sym(3). (24)
dt p
174 P. Neff and I.-D. Ghiba
Again, in this model it is not clear from the outset, that it is a formulation in
C p alone. The goal of such a 6-dimensional formulation is to avoid any explicit
computation of the plastic distortion F p . However, the right hand side of the above
proposed flow rule is, in fact, a multivalued function in C and C −1
p alone. Hence, we
can express the flow rule (24) entirely in the form3
d −1
[C ] C p ∈ f (C, C −1
p ). (25)
dt p
In order to show this remarkable property (satisfied only for isotropic response), and
to determine the explicit form of the function f (C, C −1 p ), in view of (2) we remark
that
dev3 Σe 1 1
F p−1 F p−T = F p−1 Σe F p−T − tr(Σe ) C −1 ,
dev3 Σe dev3 Σe 3 p
It is clear that F p−1 Σe F p−T = 2 F p−1 Ce (α1 1 + α2 Ce + α3 Ce2 ) F p−T ∈ Sym(3), and
tr(Σe ) = 2 tr(α1 C C −1 −1 −1 −1 −1 −1
p + α2 C C p C C p + α3 C C p C C p C C p ),
Σe 2 = 4C −1 −1
p f , f C p .
where
f := α1 C + α2 C C −1 −1 −1
p C + α3 C C p C C p C.
Hence, we deduce
F p−1 Σe F p−T = 2 C −1 −1 −1
p f (C, C p ) C p ∈ Sym(3), (26)
tr(Σe ) = 2 tr(
f (C, C −1 −1
p ) C p ), Σe 2 = 4C −1 −1 −1 −1
p f (C, C p ), f (C, C p ) C p ,
1
dev3 Σe = 2 tr[( p ) C p ) ] − [tr(
f (C, C −1 −1 2
f (C, C −1 −1 2
p )C p )] ,
3
f (C, C −1 −1 −1 / Sym(3)
p ) is not necessarily symmetric. Moreover C p f (C, C p ) ∈
3 Note carefully, that
in general.
Comparison of Isotropic Elasto-Plastic Models for the Plastic … 175
where
f (C, C −1 −1 −1 −1
p ) := α1 C + α2 C C p C + α3 C C p C C p C ∈ Sym(3), (27)
and αi = αi (I1 (Ce ), I2 (Ce ), I3 (Ce )), according to (2). Therefore, the multivalued
function f (C, C −1
p ) is given by
−1
−λ+ −1
p dev3 ( C p f (C, C p )) +
f (C, C −1 λ ∈ R+ .
p ) = p
tr[( p )C p ) ] − 3 [tr(
f (C, C −1 −1 2 1
f (C, C −1 −1 2
p )C p )]
(28)
In Appendix A.1 we give the specific expression for the functions f (C, C −1
p ) and
f (C, C −1
p ) in case of the Neo-Hooke energy.
On the other hand, in view of Eq. (24) we also have
−1 χ
= F pT ∂ χ (dev3 Σe )F p−T .
d d −1
p = −C p [C p ] ∈ C p F p ∂ (dev3 Σe ) F p
[C p ] C −1 −T
dt dt
d
[C p ] ∈ F pT ∂ χ (dev3 Σe ) F p−T C p = F pT ∂ χ (dev3 Σe )F p ∈ Sym(3), (29)
dt
which establishes symmetry of C p whenever C p (0) ∈ Sym(3).
Another important question is whether the solution C p of the flow rule (24) is
such that det C p (t) = 1, for all t ≥ 0. Let C p be the solution of the flow rule (24).
Then, we have
d −1 −1 dev3 Σe −1 dev3 Σe
Cp [C ] = −λ+
p C p Fp F −T = −λ+ T
p Fp Fp Fp F −T
dt p dev3 Σe p dev3 Σe p
λ+ −T
p dev3 (F p Σe F p )
T
=− , (30)
2 dev3 Σe
d −1 d
[C p ] C p , 1 = [C −1 ], C p = 0.
dt dt p
On the other hand, the flow rule (24) together with det C p (0) = 1 leads to
det C p (t) = 1, for all t ≥ 0.
176 P. Neff and I.-D. Ghiba
Let us remark that, in view of (25) and (28) we have for the flow rule (24)
d −1 −λ+
[C ] =
p
dev3 [ C −1 −1 −1
p f (C, C p )] · C p ,
dt p tr[(
p )C p ) ] − 3 [tr[
f (C, C −1 −1 2 1
f (C, C −1 −1 2
p )C p ]]
∈Sym(3)
/
∈Sym(3)
d −1 dev3 Σ
[C p ] = −λ+ · C −1
p ,
dt p
dev3 Σ
1
[tr(
f (C, C −1 −1 2 −1 −1 2 −1 −1 2
p )C p ) ] − [tr( f (C, C p )C p )] = dev3 ( f (C, C p )C p ) .
3
To see this, assume to the contrary that equality holds. Then we deduce
1
f (C, C −1
[tr( −1 2 −1 −1 2 −1 −1 2
p )C p ) ] − [tr( f (C, C p )C p )] = dev3 ( f (C, C p )C p )
3
⇔ f (C, C −1 −1 −1 −1 T −1 −1 −1 −1
p )C p , ( f (C, C p )C p ) = ± f (C, C p )C p , f (C, C p )C p .
(31)
Since
f (C, C −1
p ) ∈ Sym(3), we obtain
tr[(
f (C, C −1 −1 2 −1 −1 −1 −1
p )C p ) ] = C p f (C, C p )C p ( f (C, C p ), 1
= C −1 −1 −1 −1
p f (C, C p ), f (C, C p )C p . (32)
C −1 −1 −1 −1 −1 −1 −1 −1 −1 −1
p f (C, C p ), f (C, C p )C p = U p f (C, C p )U p , U p f (C, C p )U p
= U p−1
f (C, C −1 −1 2
p )U p , (33)
f (C, C −1 −1 −1 −1 T −1 −1 −1 −1
p )C p , ( f (C, C p )C p ) = f (C, C p )C p , f (C, C p )C p
⇔ f (C, C −1 −1
p )C p ∈ Sym(3), (34)
d −1 −λ+ −1 −1
p dev3 [ f (C, C p ) C p ]
Cp [C ] = . (35)
dt p tr[( p )C p ) ] − 3 [tr[
f (C, C −1 −1 2 1
f (C, C −1 −1 2
p )C p ]]
d d −1 d −1
[W (C C −1 −1
p )] = D[ W (C C p )], C [C ] = C DC [W (C C −1
p )] C p , [C p ]
dt dt p dt
1 d 1 d
= Σ C p , [C −1 ] = C −1 C p , C p [C −1
Σ ]
2 dt p 2 p dt p
1 d 1 λ+
= − C −1 −1 C −1
p
p Σ C p , [C p ] C p = − p Σ C p , dev3 Σ,
2 dt 4 dev3 Σ
(36)
leads to
which, using the formula F pT Σe F p−T = Σ,
d 1 λ+
[W (C C −1 C −1 F pT Σe F p−T C p , dev3 (F pT Σe F p−T )
p
)] = −
dt p
4 dev3 (F pT Σe F p−T ) p
1 λ+p 1
=− −T
Σe , Σe − tr(Σe ) · 1
4 dev3 (F p Σe F p )
T 3
1 λ+ p
=− dev3 Σe 2 ≤ 0. (37)
4 dev3 (F pT Σe F p−T )
d 1 λ+
[W (C C −1
p )] = −
p
2]
tr[(dev3 Σ) (38)
dt
4 dev3 Σ
1 λ+
=−
p
U p )2 ≤ 0,
dev3 (U p−1 Σ (39)
4 dev3 Σ
(iv) from (i) and (ii) together and using Lemma 4 it follows that C p (t) ∈ PSym(3);
(v) it is thermodynamically correct;
(vi) it is an associated plasticity model in the sense of Definition 1, see Proposition 3.
Remark 5 (Simo-Miehe 1992 model vs. Lion 1997 model) In the anisotropic case,
the flow rule proposed by Simo and Miehe [41] (and later by Reese and Wriggers
[36] and Miehe [21]) is not completely equivalent with the flow rule proposed by
Lion (see also [1, 6]), since dev3 τe = dev3 Σe does not hold true in general.
However, the difference is nearly absorbed by the positive plastic multipliers. The
models may differ due to different yield conditions, but the flow rules are similar,
having the same performance with respect to the thermodynamic consistency. Both
models are consistent according to our Definition 2, but we may not switch between
them, since different elastic domains are considered, namely EΣe and Eτe , respectively.
This is in fact the main difference between these two models. Having different elastic
domains we have different boundary points, since a point of the boundary of Eτe is not
necessarily on the boundary of Eτe . Hence, in these two flow rules we have a different
behaviour corresponding to the indicator function of different domains. The material
may reach the boundary of the elastic domain Eτe , while it is strictly inside the elastic
domain EΣe , for the same local response.
However, we have the following result:
Proposition 1 In the isotropic case the flow rule proposed by Simo and Miehe [41]
is equivalent with the flow rule proposed by Lion [17].
Proof We compare the flow rules (17) and (24) and the proof is complete.
The book [42] has been edited years after the untimely death of J.C. Simo. In this
book also a finite strain plasticity model is proposed. However, this model has a
subtle fundamental deficiency which we aim to describe in the interest of the reader.
The flow rule considered in [42, p. 310] is
d −1 2 devn τe Cp
[C ] = − λ+ tr(Be ) F −1 F −T , Cp = , (40)
dt p 3 p devn τe det C p
1/3
with f 1 (C, C −1 −1 −1 −1 −1 −1 −1
p ) : =α1 C p +α2 C p C C p + α3 C p C C p C C p ∈ Sym(3).Thus,
for elastically isotropic materials we deduce
2
F −1 [devn τe ] F −T =2 f 1 (C, C −1 −1
p ) − tr(F f 1 (C, C p ) F ) C
T −1
3
2
=2 f 1 (C, C −1 −1
p ) − f 1 (C, C p ), C C
−1
∈ Sym(3), (43)
3
1
dev3 τe =2 f 1 (C, C −1 −1 2 −1
p ) · C, C · f 1 (C, C p ) − f 1 (C, C p ), C .
2
9
n τe
Hence, F −1 dev
devn τe
F −T ∈ Sym(3) is a function of C, C p−1 . Therefore the flow rule
(40) can entirely be expressed in terms of C and C p−1 alone.
Remark 6 It is not true, in general, that the right hand side of the flow rule (40) is in
concordance with det C p (t) = 1, assuming that det C p (0) = 1.
Proof From (40) we obtain by right multiplication with C p
d −1 2 −1 −1 devn τe
[C p ] C p = − (det C p )−1/3 λ+ −1
p tr(C C p ) F p Fe F −T F p . (44)
dt 3 devn τe e
d −1 −1 d −1 −1 d −1 −1 d −1
[det C p ] = Cof C p , [C p ] = det C P C p , [C p ] = det C P 1, [C p ]C p .
dt dt dt dt
Hence, we deduce
d −1 2 −1 devn τe
[det C p ] = − (det C p )−1/3 λ+ −1
p tr(C C p ) Fe F −T , 1. (45)
dt 3 devn τe e
n τe
Since Fe−1 dev F −T is not necessarily a trace free matrix, we can not conclude
devn τe e
−1
that det C p (t) = const. for all t > 0. For instance, for elastically isotropic materials
(see (43)) we have
2
dev3 τe =2 f 1 (C, C −1 −1
p ), C p − f 1 (C, C p ), CC , C p
−1
3
1 −1
=2 α1 C −1 p , C p − C , CC −1
, C p
3 p
1 −1
+ 2 α2 C −1 p C C −1
p , C p − C C C −1
, CC −1
, C p
3 p p
1 −1
+ 2 α3 C −1 p C C −1
p C C −1
p , C p − C C C −1
C C −1
, CC −1
, C p ,
3 p p p
(46)
n τe
which shows that Fe−1 dev F −T is not necessarily a trace free matrix, see Appendix
devn τe e
A.4.
In this section we consider the model proposed by Helm [12], Vladimirov, Pietryga
and Reese [45, Eq. 25] (see also [35] and [39, Eq. 55] and the model considered by
Brepols, Vladimirov and Reese [1, page16], Shutov and Ihlemann [38, Eq. 80]). We
prove later that this model is similar to the model considered by Miehe [22] in 1995,
provided certain interpretations are included. Vladimirov, Pietryga and Reese [45,
Eq. 25] considered the following flow rule
d
dev3 Σ
[C p ] = λ+
p · Cp , (47)
dt 2)
tr((dev3 Σ)
= 2 C DC [W
where Σ (C C −1
p )] is not necessarily symmetric for general C p ∈
d
· C p ∈ Sym(3), see Sect. 3. Therefore, we have [C p ] ∈
PSym(3), while (dev3 Σ)
dt
Sym(3). The flow rule (47) implies
Comparison of Isotropic Elasto-Plastic Models for the Plastic … 181
d d −1 (C C −1 d −1
[W (C C −1 −1
p )] = D[ W (C C p )], C [C p ] = C DC [W p )] C p , [C ]
dt dt dt p
1 d 1 d
= Σ C p , [C −1 ] = C −1 Σ C p , C p [C −1 ] (48)
2 dt p 2 p dt p
1 λ+
=−
p
C p , dev3 Σ.
C −1 Σ
2) p
2 tr((dev3 Σ)
d 1 λ+
[W (C C −1
p )] = −
p
2]
tr[(dev3 Σ)
dt 2)
2 tr((dev3 Σ)
λ+
=−
p
U p ) ≤ 0,
dev3 (U p−1 Σ (49)
2
which shows thermodynamical consistency.
Summarizing, the Helm 2001 (Reese 2008 and Shutov-Ihlemann 2014) model
has the following properties:
(i) from (49) it follows that it is thermodynamically correct;
(ii) plastic incompressibility: from (47) and (11) it follows that det C p (t) = 1;
(iii) for the isotropic case, the right hand-side of the flow rule (47) is a function of
C −1
p and C alone;
(iv) from dtd [C p ] ∈ Sym(3). it follows, in the isotropic case, that C p (t) ∈ Sym(3);
(v) from (ii) and (iii) together and using Lemma 4 it follows that C p (t) ∈ PSym(3);
(vi) it has formally subdifferential structure, see Proposition 4. However, the elastic
domain Ee (Σ, 2 σy2 ) is not convex w.r.t Σ,
see Remark 3.
3
Proposition 2 The flow rule considered by Helm [12] coincides with the flow rule
(47), i.e. with the interpretation of Miehe’s proposal [22] presented in Sect. 3.
Proof The proof follows from (47) and combined with (23).
In this section we present a model based on one representation used by Grandi and
Stefanelli [10] and previously used by Frigeri and Stefanelli [7, p. 7]. We start by
computing
182 P. Neff and I.-D. Ghiba
d d −1 (C C −1 d −1
W (C C −1 −1
p ) = D W (C C p ), C [C p ] = sym[C D W p )], [C ]
dt dt dt p
−1 −1 d
= C p sym[C D W (C C −1
p )] C p , C p [C −1 p ] C p .
dt
◦
:= 21 Σ ∈ Sym(3)
(50)
d ◦
◦ χ (dev3 Σ),
p ] C p ∈ −∂ Σ
C p [C −1 (51)
dt
◦
where χ (dev3 Σ) is the indicator function of the convex elastic domain
◦
◦ 1 2 ◦ ◦ 1
E e (Σ, σy ) := Σ ∈ Sym(3) | dev3 Σ2 ≤ σy2 ,
3 3
(C C −1
then C p ∈ Sym(3) and the reduced dissipation inequality dtd W p ) ≤ 0 is sat-
isfied. Thus, the model is thermodynamically correct. We also remark that the flow
rule (51) implies
d −1 d d
tr( [C p ] C p ) = [C −1
p ] Cp C p , 1 = C p [C −1
p ] C p , 1 = 0.
dt dt dt
Hence, we obtain det C p (t) = 1 and further C p (t) ∈ PSym(3).
Using Lemma 3, we give next some new representations of the stress-tensor
◦ −1 −1 −1 −1
(C C −1
Σ := 2 C p sym[C D W p )] C p
(C C −1
= 2 sym C p (C D W p )) C p
Note that Σe is symmetric in case of elastic isotropy. Hence, for the isotropic case,
we have
◦ ◦
Σ = R Tp Σe R p , Σ = R Tp FeT τe Fe−T R p . (52)
◦ ◦
However, we have Σ2 =R Tp Σe R p 2 =Σe 2 , tr(Σ) = tr(R Tp Σe R p ) = tr(Σe ).
Together, we obtain that
◦
dev3 Σ = dev3 Σe .
In conclusion, for isotropic elastic materials we have the equivalence of the elastic
domains
◦ ◦ 1 1
E e (Σ, σy2 ) = Ee (Σe , σy2 ). (53)
3 3
Therefore, the flow rule (51) proposed by Grandi and Stefanelli [10] has the
following properties:
(i) it is thermodynamically correct;
(ii) from this flow rule it follows C p (t) ∈ Sym(3) and det C p (t) = 1. Hence, it
follows that C p (t) ∈ PSym(3);
◦ ◦
(iii) the elastic domain E e is convex w.r.t. Σ;
(iv) it is an associated plasticity model in the sense of Definition 1;
(v) it preserves ellipticity in elastic loading if the energy is elliptic throughout the
◦
domain E e which makes it useful in association with the exponentiated Hencky
energy WeH [8, 9, 27, 29, 30, 32, 33].
We finish this section by comparing the Helm 2001 model and the Lion 1997 flow
rule with the Grandi-Stefanelli 2015 model.
Proposition 3 In the isotropic case, the Lion 1997 flow rule (i.e. the Dettmer-Reese
2004 model [6]) is equivalent with the Grandi-Stefanelli 2015 flow rule.
Proof We recall that the flow rule of the Lion 1997 model is
d −1 −1 dev3 Σe
[C p ] = −λ+
p Fp F −T , λ+
p ∈ R+ , (54)
dt dev3 Σe p
for Σe ∈
/ int(Ee (Σe , 13 σy2 )). Since F p = R p C p and in the isotropic case Σe =
◦
R p Σ R Tp , using (53) we rewrite the Lion’s flow rule in the form
◦
d −1 −1 T dev3 (R p Σ R Tp ) −1
[C p ] = −λ+
p Cp Rp ◦ Rp Cp , λ+
p ∈ R+ ,
dt dev (R Σ R T ) 3 p p
184 P. Neff and I.-D. Ghiba
◦ ◦ ◦
for R p Σ R Tp ∈
/ int(E e (Σ, 13 σy2 )), which is equivalent with
◦
d −1 dev3 Σ
C p [C p ] C p = −λ+
p ◦ , λ+
p ∈ R+ , (55)
dt dev3 Σ
◦ ◦ ◦ ◦ ◦ ◦ ◦
for R p Σ R Tp ∈
/ int(E e (Σ, 13 σy2 )). Moreover, R p Σ R Tp ∈ int(E e (Σ, 13 σy2 )) ⇔ Σ ∈
◦ ◦
int(E e (Σ, 13 σy2 )). Therefore, the flow rule (55) becomes
d −1 ◦
C p [C p ] C p ∈ −∂ ◦ χ (dev3 Σ), (56)
dt Σ
Proposition 4 In the isotropic case, the Helm 2001 flow rule is equivalent with the
Grandi-Stefanelli 2015 flow rule, i.e. it is also equivalent with the Lion 1997 flow
rule and the Dettmer-Reese 2004 model.
Proof We have
◦ −1
C p ) = sym( C p −1 Σ
Σ = sym( C p Σ C p C −1
p C p)
−1 −1
C p ) C p ),
= sym( C p (Σ
C p = F pT Σe F p−T C p = F pT Σe F p ∈ Sym(3)
Σ
Using the above identity, we may rewrite the Helm 2001-flow rule (47) in the form
◦ −1
d dev3 ( C p Σ C p )
[C p ] = λ+ · Cp . (57)
dt p ◦ −1 2
tr((dev3 ( C p Σ C p )) )
Comparison of Isotropic Elasto-Plastic Models for the Plastic … 185
We also have
◦ −1 ◦
tr( C p Σ C p ) = tr(Σ),
◦ −1 ◦ −1
dev3 ( C p Σ C p ) = C p (dev3 Σ) C p ,
◦ −1 ◦ −1 ◦ −1
tr([dev3 ( C p Σ C p )]2 ) = tr([ C p (dev3 Σ) C p ]2 ) = tr( C p (dev3 Σ)2 C p )
◦ ◦
= tr((dev3 Σ)2 ) = (dev3 Σ)2 , 1
◦ ◦ ◦
= dev3 Σ, dev3 Σ = dev3 Σ2 .
Remark 7 The equivalence is true for an isotropic formulation only. However, the
Grandi-Stefanelli model will provide a consistent flow-rule for a plastic metric also
in the anisotropic case.
An existence proof for the energetic formulation [7] of the model given by Grandi
and Stefanelli [10] together with a full plastic strain regularization can be given along
the lines of Mielke’s energetic approach [18, 19, 24–26].
8 Summary
Fig. 1 Idealized, isotropic perfect plasticity models involving a 6-dimensional flow rule for C p
w.r.t. the reference configuration are considered. By definition, the trajectory for the plastic metric
C p (t) should remain in PSym(3). λ+ p is the plastic multiplier. We have recast all flow rules in the
√ √
format dtd [P −1 ] P ∈ −∂χ or P dtd [P −1 ] P ∈ −∂χ
Fig. 2 An inconsistent model and a 9-dimensional flow rule for F p . They are associative, since
both flow rules are in the format dtd [P] P −1 ∈ −∂χ or dtd [ε p ] ∈ ∂χ
Acknowledgments We would like to thank Prof. Stefanie Reese (RWTH Aachen), Prof. Jörn
Ihlemann (TU Chemnitz), Dr. Alexey Shutov (TU Chemnitz) and Prof. Alexander Mielke (WIAS-
Berlin) for in-depth discussion of flow rules in C p . I.D. Ghiba acknowledges support from the
Romanian National Authority for Scientific Research (CNCS-UEFISCDI), Project No. PN-II-ID-
PCE-2011-3-0521.
Appendix
A.1 The Lion 1997 Model for the Neo-Hooke Elastic Energy
For a quick consistency check we exhibit the consistency of this model directly for
a Neo-Hooke elastic energy and we give the concrete expression for the functions
f (C, C −1 −1
p ) and f (C, C p ). To this end, we consider the energy
NH (Ce ) = μ tr Ce det C p =1 Ce
W 1/3
+ h(det C) = μ tr + h(det C).
det Ce det C 1/3
d −1 −1 dev C e
[C p ] = −λ+
p Fp F −T
dt dev Ce p
λ+ 1
F p−1 Ce F p−T − tr(Ce ) · F p−1 F p−T
p
=−
dev Ce 3
+
λp −1 −1 1 −1 −1
=− C p C C p − tr(C p C) · C p . (59)
dev Ce 3
188 P. Neff and I.-D. Ghiba
We also deduce
1 1
dev Ce 2 = Ce 2 − [tr(Ce )]2 = F p−T C F p−1 2 − [tr(F p−T C F p−1 )]2
3 3
−T −1 −T −1 1 −T −1
= F p C F p , F p C F p − F p C F p ), 12
3
1 1
= C p C, C C p − [tr(C p C)]2 = [tr(C −1
−1 −1 −1
p C) ] −
2
[tr(C −1p C)] .
2
3 3
Therefore, we obtain
d −1 λ+ 1
C −1 −1
C −1
p
[C p ] = − p C − tr(C p C) · 1 p
dt tr[(C p C)2 ] − 13 [tr(C −1
−1 2 3
p C)]
λ+ 1
C −1 −1 −1
p
=− p C C p − tr(C C p ) · 1 .
[tr(C −1 −1 3
p C) ] − 3 tr[(C p C)]
2 1 2
(60)
f (C, C −1
p ) = C,
⎧ ⎫
⎨ −λ+ ⎬
f (C, C −1 ) =
p
dev3 ( C −1 C) λ+ ∈ R+ .
p
⎩ p p
⎭
tr[(C C −1 −1 2
p ) ] − 3 [tr(C C p )]
2 1
We clearly see that even for this simple energy, we have tr[(C C −1 −1 2
p ) ] − 3 [tr(C C p )]
2 1
= dev3 (C C −1
p ) , since if we assume the contrary we deduce
2
1 1
tr[(C C −1 −1 2 −1 2
p ) ] − [tr(C C p )] = C C p − [tr(C C p )]
2 −1 2
3 3
⇔ C C −1 −1 T −1 −1
p , (C C p ) = ±C C p , C C p . (61)
tr[(C C −1 −1 −1 −1 −1 −1
p ) ] = (C C p ) (C C p ), 1 = C p C C p (C C p ) C p , 1
2
= C −1 −1 −1 T −1 −1
p C C p C p (C C p ) , 1 = C p C, C C p . (62)
C −1 −1 −1 −1 −1 −1 −1
p C C p C p , C C p = U p U p C C p U p U p , C C p
= U p−1 C C −1 −1 −1
p Up, Up C C p Up
C C −1 −1 T −1 −1
p , (C C p ) = C C p , C C p ⇔ C C −1 −1
p , skew(C C p ) = 0
⇔ skew(C C −1
p )=0 ⇔ C C −1
p ∈ Sym(3). (64)
In conclusion, tr[(C C −1 −1 2 −1 2
p ) ] − 3 [tr(C C p )] = dev3 (C C p ) and the flow-rule
2 1
χ
p ] ∈ − ∂ (dev Σ).
does not have a subdifferential structure of the form C p dtd [C −1
(C C −1
For the simplest Neo-Hooke elastic energy W (Fe ) = tr(Ce ) = W p )=
1
2
−1
tr(C C p ), we have
(C C −1 1 −1
DC [W p )] = C ⇒ = C C −1
Σ p ∈/ Sym(3), (65)
2 p
and the flow rule (47) implies
d dev3 (C C −1
p )
[C p ] = λ+
p · Cp
dt tr[(dev3 (C C −1
p )) ]
2
λ+ 1
[ C − tr(C C −1
p
= p ) · C p ] ∈ Sym(3) ⇒ C p ∈ Sym(3),
tr[(dev3 (C C −1
p )) ]
2 3
and also
d dev3 (C C −1
p )
[C p ] C −1 +
p = λp ⇒ det C p = 1. (66)
dt tr[(dev3 (C C −1
p )) ]
2
d λ+
[W (C C −1
p )] = −
p
C −1
p Σ C p , dev3 Σ
dt −1 2
4 tr[(dev3 (C C p )) ]
λ+
C −1 −1
p
=− p C, dev3 (C C p )
tr[(dev3 (C C −1
p )) ]
2
190 P. Neff and I.-D. Ghiba
λ+
(C C −1 −1
p
=− p ) , dev3 (C C p )
T
(67)
tr[(dev3 (C C −1
p )) ]
2
λ+
C −1/2 dev3 (C C −1 1/2 −1/2
dev3 (C C −1
p
=− p )C C p )C
1/2
, 1
tr[(dev3 (C C −1
p )) ]
2
λ+
dev3 (C 1/2 C −1 ) dev3 (C 1/2 C −1
p
=− p C
1/2 T
p C
1/2
), 1
tr[(dev3 (C C −1
p )) ]
2
λ+
dev3 (C 1/2 C −1
p
=− p C ) ,
1/2 2
tr[(dev3 (C C −1
p )) ]
2
We recall that, in view of Lemma 2, any isotropic free energy W defined in terms
of Fe can be expressed as W (Fe ) = W (C C −1
p ). In order to assume that the reduced
dissipation inequality is satisfied, we compute
d d −1
W (C C −1 −1
p ) = D W (C C p ), C [C ]
dt dt p
(C C −1 −1 d
d −1
= C D W p ) Cp , [C −1 ] C p = Σ, [C ] C p .
dt p dt p
Here, Σ = 2 C DC [W (C C −1
p )], as in the Reese 2008 and Shutov-Ihlemann 2014
model. It is tempting to assume the flow rule in the associated form (see e.g. the
habilitation thesis of Miehe [21, p. 73, Satz 5.32] or [23] and also [22, Table 1])
d −1
[C ] C p ∈ − ∂ Σ χ (dev3 Σ),
(68)
dt p
where χ (dev3 Σ)
is the indicator function of the convex elastic domain
2 2 2
Ee (Σ, σy ) := Σ 2 ≤ σy2 .
∈ R3×3 | dev3 Σ
3 3
and C −1
5 Surprisingly, this follows even if C
p do not commute in general. If C and C p commute, then
X =C C −1
p ∈ Sym(3) and the quantity does have a sign, since then X T , dev3 X = dev3 X 2
≥ 0.
Comparison of Isotropic Elasto-Plastic Models for the Plastic … 191
Note that this flow rule (68) is not the formulation which Miehe seemed to intend.
We have discussed the correct interpretation in Sect. 3.
Regarding such a formulation we can summarize our observations:
(i) this flow rule is thermodynamically correct;
(ii) the right hand side is a function of C and C −1
p only, i.e. Σ = Σ(C, C p );
−1
(iii) plastic incompressibility: from this flow rule it follows that det C p (t) = 1, since
the right hand side is trace-free;
(iv) however, the computed tensor C p (t) will not be symmetric since Σ C −1
p ∈/
Sym(3) in general. For instance, for the simplest Neo-Hooke energy W (Fe ) =
tr(Ce ) = tr(C C −1
p ) we have Σ = 2 C C p ∈
−1
/ Sym(3), Σ C −1
p = 2 C Cp ∈
−2
/
Sym(3), in general, and the flow rule becomes
d −1 λ+ 1
[C C −2 tr(C C −1 −1
p
[C p ] = −2 p − p ) · Cp ] ∈
/ Sym(3);
dt dev(C C −1
p ) 3
(69)
d −1 dev3 Σ
[C p ] C p = −λ+ , (70)
dt p
dev3 Σ
which is not equivalent with the flow rule (22) considered by Miehe in [22], since
∈
Σ / Sym(3). Let us remark that we have the symmetries dev3 Σ · C p ∈ Sym(3),
−1
C p dev3 Σ ∈ Sym(3), but these do not assure that the flow rule (70) implies C p ∈
Sym(3).
n τe
In order to see that the quantity Fe−1 dev F −T which appears in the Simo-Hughes
devn τe e
flow rule is not necessarily a trace free matrix, we consider two energies: the
192 P. Neff and I.-D. Ghiba
μ λ μ λ
WSVK = Ce − 12 + [tr(Ce − 1)]2 = Be − 12 + [tr(Be − 1)]2 ,
4 8 4 8
and the corresponding Kirchhoff stress tensor is given by
λ
τeSVK (U ) = D Be [W SVK (Be )] = μ (Fe−T Ce FeT − 1) + tr(Ce − 1) · 1
2
λ
= μ (Fe FeT − 1) + tr(Fe FeT − 1) · 1.
2
Hence, we deduce
1
Fe−1 [devn τeSVK ] Fe−T = μ Fe−1 devn [ Fe FeT ] Fe−T = μ Fe−1 [ Fe FeT − tr(Fe FeT ) · 1] Fe−T
3
1 −1 −T
= μ [ 1 − tr(Fe Fe ) · Fe Fe ]
T
3
1
= μ [ 1 − tr(Fe FeT ) · Fe−1 Fe−T ],
3
and further
1
Fe−1 [devn τeSVK ] Fe−T , 1 = μ 1 − tr(Fe FeT ) · Fe−1 Fe−T ], 1
3
1 −1 −T
=μ 3− tr(Fe Fe ) tr(Fe Fe )
T
3
1 −1
=μ 3− tr(Ce ) tr(Ce )
3
1
=μ 3− tr(Ce ) tr(Cof Ce ) .
3 det Ce
We remark that Fe−1 [devn τeSVK ] Fe−T , 1 = 0 if and only if tr(Ce ) tr(Cof Ce ) =
9 det Ce , which does not hold true in general. Since Ce and CofCe are coaxial and
symmetric, the problem can be reduced to the diagonal case, i.e. we may assume Ce =
diag(λ1 , λ2 , λ3 ), λi > 0. Hence the condition tr(Ce ) tr(Cof Ce ) = 9 det Ce , becomes
9 λ1 λ2 λ3 = (λ1 + λ2 + λ3 )(λ1 λ2 + λ2 λ3 + λ3 λ2 )
⇔ 0 = λ1 (λ2 − λ3 )2 + λ2 (λ3 − λ1 )2 + λ3 (λ1 − λ3 )2
On the other hand, the energy considered by Simo and Hughes [42, p. 307] is
μ Be κ
WSimo (Be ) = 1/3
− 1, 1 + (det Be − 1) − log(det Be ) ,
2 det Be 4
Hence, we deduce
1
Fe−1 [devn τeSimo ] Fe−T , 1 = μ 1/3
Fe−1 [dev3 Be ] Fe−T , 1
det Be
1
=μ 1/3
dev3 Be , Fe−T Fe−1
det Be
1 1 1
=μ 1/3
dev3 Be , Be−1 =μ 1/3
Be , Be−1 − tr(Be ) tr(Be−1 )
det Be det Be 3
1 1
=μ 1/3
3 − tr(Be ) tr(Be−1 )
det Be 3
1 1
=μ 4/3
3 det Be − tr(Be ) tr(Cof Be ) .
det Be 3
Therefore Fe−1 [devn τeSimo ] Fe−T , 1 = 0 if and only if 9 det Be = tr(Be ) tr(Cof Be ).
Similar as above, it follows that this holds true if and only if Fe = λ · SO(3) ∈
R+ · SO(3).
References
1. Brepols, T., Vladimirov, I., & Reese, S. (2014). Numerical comparison of isotropic hypo-and
hyperelastic-based plasticity models with application to industrial forming processes. Interna-
tional Journal of Plasticity, 63, 18–48.
2. Cleja-Ţigoiu, S. (2003). Consequences of the dissipative restrictions in finite anisotropic elasto-
plasticity. International Journal of Plasticity, 19(11), 1917–1964.
3. Cleja-Ţigoiu, S., & Iancu, L. (2013). Orientational anisotropy and strength-differential effect
in orthotropic elasto-plastic materials. International Journal of Plasticity, 47, 80–110.
4. Cleja-Ţigoiu, S., & Maugin, G. A. (2000). Eshelby’s stress tensors in finite elastoplasticity.
Acta Mechanica, 139(1–4), 231–249.
5. Cuitino, A., & Ortiz, M. (1992). A material-independent method for extending stress update
algorithms from small-strain plasticity to finite plasticity with multiplicative kinematics. Engi-
neering Computations, 9(4), 437–451.
6. Dettmer, W., & Reese, S. (2004). On the theoretical and numerical modelling of Armstrong-
Frederick kinematic hardening in the finite strain regime. Computer Methods in Applied
Mechanics and Engineering, 193(1), 87–116.
194 P. Neff and I.-D. Ghiba
7. Frigeri, S., & Stefanelli, U. (2012). Existence and time-discretization for the finite-strain Souza-
Auricchio constitutive model for shape-memory alloys. Continuum Mechanics and Thermo-
dynamics, 24(1), 63–77.
8. Ghiba, I. D., Neff, P., & Silhavy, M. (2015). The exponentiated Hencky-logarithmic strain
energy. International Journal of Non-Linear Mechanics, 71, 48–51.
9. Ghiba, I. D., Neff, P., & Martin, R. J. (2015). An ellipticity domain for the distortional Hencky-
logarithmic strain energy. In Proceedings of the Royal Society of London A, 471(2184). doi:10.
1098/rspa.2015.0510.
10. Grandi, D., & Stefanelli, U. (2015). Finite plasticity in P T P. PreprintarXiv:1509.08681
11. Gupta, A., Steigmann, D. J., & Stölken, J. S. (2011). Aspects of the phenomenological theory
of elastic-plastic deformation. Journal of Elasticity, 104(1–2), 249–266.
12. Helm, D. (2001). Formgedächtnislegierungen: experimentelle Untersuchung, phänomenol-
ogische Modellierung und numerische Simulation der thermomechanischen Materialeigen-
schaften. Ph.D-Thesis: Universität Kassel.
13. Kröner, E. (1955). Der fundamentale Zusammenhang zwischen Versetzungsdichte und Span-
nungsfunktionen. Zeitschrift fur Angewandte Mathematik und Physik, 142(4), 463–475.
14. Kröner, E. (1958). Kontinuumstheorie der Versetzungen und Eigenspannungen. Berlin:
Springer.
15. Kröner, E. (1959). Allgemeine Kontinuumstheorie der Versetzungen und Eigenspannungen.
Archive for Rational Mechanics and Analysis, 4(1), 273–334.
16. Lee, E. H. (1969). Elastic-plastic deformation at finite strains. Journal of Applied Mechanics,
36(1), 1–6.
17. Lion, A. (1997). A physically based method to represent the thermo-mechanical behaviour of
elastomers. Acta Mechanica, 123(1–4), 1–25.
18. Mainik, A., & Mielke, A. (2005). Existence results for energetic models for rate-independent
systems. Calculus of Variations and Partial Differential Equations, 22(1), 73–99.
19. Mainik, A., & Mielke, A. (2009). Global existence for rate-independent gradient plasticity at
finite strain. Journal of nonlinear science, 19(3), 221–248.
20. Maugin, G. (1994). Eshelby stress in elastoplasticity and ductile fracture. International Journal
of Plasticity, 10(4), 393–408.
21. Miehe, C. (1992). Kanonische Modelle multiplikativer Elasto-Plastizität. Thermodynamis-
che Formulierung und numerische Implementation. Habilitationsschrift: Universität Hannover,
Germany.
22. Miehe, C. (1995). A theory of large-strain isotropic thermoplasticity based on metric transfor-
mation tensors. Archive of Applied Mechanics, 66, 45–64.
23. Miehe, C. (1998). A constitutive frame of elastoplasticity at large strains based on the notion
of a plastic metric. International Journal of Solids and Structures, 35(30), 3859–3897.
24. Mielke, A. (2003). Energetic formulation of multiplicative elasto-plasticity using dissipation
distances. Continuum Mechanics and Thermodynamics, 15(4), 351–382.
25. Mielke, A. (2004). Existence of minimizers in incremental elasto-plasticity with finite strains.
SIAM Journal on Mathematical Analysis, 36, 384–404.
26. Mielke, A., & Müller, S. (2006). Lower semicontinuity and existence of minimizers in incre-
mental finite-strain elastoplasticity. ZAMM - Journal of Applied Mathematics and Mechanics,
86, 233–250.
27. Montella, G., Govindjee, S., & Neff, P. (2015) The exponentiated Hencky strain energy in
modelling tire derived material for moderately large deformations. To appear in Journal of
Engineering Materials and Technology-Transactions of the ASME. Preprint arXiv:1509.06541.
28. Neff, P., Chełmiński, K., & Alber, H. D. (2009). Notes on strain gradient plasticity: finite
strain covariant modelling and global existence in the infinitesimal rate-independent case.
Mathematical Models and Methods in Applied Sciences, 19, 307–346.
29. Neff, P., & Ghiba, I. D. (2014). Loss of ellipticity for non-coaxial plastic deformations in
additive logarithmic finite strain plasticity. International Journal of Non-Linear Mechanics,
81, 122–128. Preprint arXiv:1410.2819.
Comparison of Isotropic Elasto-Plastic Models for the Plastic … 195
30. Neff, P., & Ghiba, I. D. (2015). The exponentiated Hencky-logarithmic strain energy. Part III:
Coupling with idealized isotropic finite strain plasticity. Continuum Mechanics and Thermo-
dynamics, 28, 477–487. doi:10.1007/s00161-015-0449-y, the special issue in honour of D. J.
Steigmann.
31. Neff, P., & Knees, D. (2008). Regularity up to the boundary for nonlinear elliptic systems arising
in time-incremental infinitesimal elasto-plasticity. SIAM Journal on Mathematical Analysis,
40(1), 21–43.
32. Neff, P., Ghiba, I. D., & Lankeit, J. (2015). The exponentiated Hencky-logarithmic strain
energy. Part I. Journal of Elasticity, 121, 143–234.
33. Neff, P., Ghiba, I. D., Lankeit, J., Martin, R., & Steigmann, D. J. (2015). The exponentiated
Hencky-logarithmic strain energy. Part II: Coercivity, planar polyconvexity and existence of
minimizers. Zeitschrift fur Angewandte Mathematik und Physik, 66, 1671–1693.
34. Ortiz, M., & Simo, J. C. (1986). An analysis of a new class of integration algorithms for elasto-
plastic constitutive relations. International Journal for Numerical Methods in Engineering,
23(3), 353–366.
35. Reese, S., & Christ, D. (2008). Finite deformation pseudo-elasticity of shape memory alloys-
Constitutive modelling and finite element implementation. International Journal of Plasticity,
24(3), 455–482.
36. Reese, S., & Wriggers, P. (1997). A material model for rubber-like polymers exhibiting plastic
deformation: computational aspects and a comparison with experimental results. Computer
Methods in Applied Mechanics and Engineering, 148, 279–298.
37. Shutov, A.V. (2014). Personal comunication. 8/2014.
38. Shutov, A. V., & Ihlemann, J. (2014). Analysis of some basic approaches to finite strain elasto-
plasticity in view of reference change. International Journal of Plasticity, 63, 183–197.
39. Shutov, A. V., & Kreißig, R. (2008). Finite strain viscoplasticity with nonlinear kinematic
hardening: Phenomenological modeling and time integration. Computer Methods in Applied
Mechanics and Engineering, 197(21), 2015–2029.
40. Shutov, A. V., Landgraf, R., & Ihlemann, J. (2013). An explicit solution for implicit time
stepping in multiplicative finite strain viscoelasticity. Computer Methods in Applied Mechanics
and Engineering, 265, 213–225.
41. Simo, J.C. (1993). Recent developments in the numerical analysis of plasticity. In E. Stein
(ed.), Progress in computational analysis of inelastic structures (pp. 115–173). Springer.
42. Simo, J.C., & Hughes, J.R. (1998). Computational Inelasticity., volume 7 of Interdisciplinary
Applied Mathematics. Springer, Berlin.
43. Simo, J. C., & Ortiz, M. (1985). A unified approach to finite deformation elastoplastic analysis
based on the use of hyperelastic constitutive equations. Computer Methods in Applied Mechan-
ics and Engineering, 49, 221–245.
44. Steigmann, D. J., & Gupta, A. (2011). Mechanically equivalent elastic-plastic deformations
and the problem of plastic spin. Theoretical and Applied Mechanics, 38(4), 397–417.
45. Vladimirov, I., Pietryga, M., & Reese, S. (2008). On the modelling of non-linear kinematic
hardening at finite strains with application to springback-comparison of time integration algo-
rithms. International Journal for Numerical Methods in Engineering, 75(1), 1–28.
Quasi-Static Evolutions in Brittle Fracture
Generated by Gradient Flows: Sharp Crack
and Phase-Field Approaches
Matteo Negri
Abstract In this paper we will describe how gradient flows, in a suitable norm,
are natural and helpful to generate quasi-static evolutions in brittle fracture. First,
we will consider the case of a brittle crack running along a straight line according
to Griffith’s law. Then, we will see how the same approach leads to quasi-static
evolutions in the phase field setting, taking into account the alternate minimization
scheme. In the latter, the norm associated to the gradient flow is not “user supplied”,
however, the algorithm itself together with the separate quadratic structure of the
energy defines a family of norms which, in the limit, characterizes the quasi-static
evolution. Mathematically speaking, all of these evolutions are (parametrized) BV -
evolutions.
1 Introduction
The idea of using monotone descent paths (among which the gradient flow) for
quasi-static crack propagation goes back to the foundation of fracture mechanics:
according to Griffith’s principle [12] “the system can pass from the unbroken to the
broken condition by a process involving a continuous decrease of potential energy”.
Choosing the gradient flow, in a suitable norm, as optimal and most common descent
path, Griffith’s criterion would be: “the system follows the gradient flow of the
potential energy”.
As a matter of fact, gradient flows usually refer to time dependent problems while
Griffith’s law does not make any reference to time, neither directly or indirectly
(e.g. in terms of velocities). In our rate-independent setting, the gradient flows will
provide in fact a parametrization of the path connecting “the unbroken” with “the
broken condition”. Such a parametrization will appear both in the construction of the
solution, by time discretization, and in the quasi-static evolution itself, specifically
in the instantaneous “catastrophic” propagations.
M. Negri (B)
Department of Mathematics, University of Pavia, Via Ferrata 1, 27100 Pavia, Italy
e-mail: [email protected]
possible and worth studying, e.g. [22]. In this work, proofs and fine mathematical
details are not included; the interested reader can make reference for instance to
[14, 19, 22, 23].
2 Sharp Crack
In order to avoid technical issues as much as possible, we will state our results only
for a representative example, cf. Fig. 1. Denote by Ω the open set in Fig. 1 (obtained
removing a couple of symmetric holes from a rectangle). Let ∂Ω = ∂ D Ω ∪ ∂ N Ω
where ∂ N Ω is the boundary of the rectangle while ∂ D Ω denotes (the union of) the
boundaries of the circular holes. Assume that the initial crack K 0 is given by the
line segment (0, l0 ] × {0} for l0 > 0. In our simple setting the crack will propagate
horizontally, thus our family of admissible cracks will be given by the line segments
of the form K l = (0, l] × {0}. Clearly such a family is simply parametrized by the
scalar l ∈ [l0 , L), which gives as well the position of the crack tip.
Consider on ∂ D Ω a proportional boundary condition of the form u = ±t ê where
the sign ± is chosen as in Fig. 1.
We consider in-plane elasticity with linearised energy density
W (Du) = 1
2
Du : C[Du] = 1
2
ε(u) : σ (u)
where ε(u) = (Du + Du T )/2 and C[Du] = σ (u) = 2με(u) + λtr(ε(u))I , for
λ, μ > 0 the Lamé coefficients. For t ∈ [0, T ] and l ∈ [l0 , L) the space of admissible
configurations is
Ut,l = {u ∈ H 1 (Ω\K l , R2 ) : u = ±t ê ∂ D Ω} .
Fig. 1 An ASTM-compact
tension geometry: the set
Ω\K 0
0 l0 L
200 M. Negri
Note that the Neumann homogeneous boundary condition holds on the boundary
∂ N Ω of the rectangle and on both the crack faces, above denoted by K l± .
Let us now turn to dissipation. Since we are interested in brittle fracture the energy
dissipated by the crack will be provided by a potential K : [l0 , L) → R+ which is
simply of the form K (l) = G c (l − l0 ), being G c > 0 the material toughness.
In the sequel we will always work with the reduced total energy F : [0, T ] ×
[l0 , L) → R+ given by
F (t, l) = E (t, l) + K (l).
where P ext is the power of the external forces while G denotes as usual the energy
release rate. Moreover, G(t, ·) is non-negative and locally Lipschitz continuous in
[l0 , L).
A proof can be adapted e.g. from [19] or [23]. In this setting, by irreversibility,
equilibrium reads
In other terms, we advance the crack up to the closest equilibrium point. This is in
some sense a return mapping algorithm on the set {G(tk+1 , l) ≤ G c } of equilibrium
points at time tk+1 , which is usually at the core of many crack tracking algorithms.
Now, let us see how to recast the incremental problem as a gradient flow. By
irreversibility the crack cannot heal, for this reason it is convenient to introduce the
one sided “slope”
|∂l F (t, l)|−
where | · |− denotes the negative part. Next, let us introduce an auxiliary parameter
s ∈ R+ and an auxiliary function l : R+ → [l0 , L). We set (tk+1 ) = sups l(s) where
l solves the gradient flow
˙ = |∂l F (tk+1 , l(s))|−
l(s)
l(0) = (tk ).
In this simple setting the gradient flow boils down to an autonomous Cauchy problem
for a non-linear, first order ODE. Intuitively, l grows when ∂l F (tk+1 , l) < 0, i.e. when
G(tk+1 , l) > G c and thus when the crack is not in equilibrium. It is easy to see that
there exists a unique solution and that the definition (tk+1 ) = sups l(s) coincides
with (3) (for a proof, see [19]).
At this point we have defined (tk ) for tk = kΔt. Now consider a sequence of time
steps Δtn 0 and denote by n the corresponding discrete evolutions, defined in
the discrete points tn,k = kΔtn . Denote again by n : [0, T ] → [l0 , L) the piecewise
affine interpolate of n (tn,k ). By Helly’s Theorem it follows that (up to subsequences)
n converges pointwise to a limit evolution . Our goal is now the characterization
of : we will provide two characterizations, the first in terms of KKT conditions,
the second in terms of parametrized BV-evolutions. In order to better understand
the meaning of these characterizations it is useful to show first an explicit example,
which has been computed numerically.
2.3 Example
First, let us comment on Fig. 2. Remember that the “loading” is monotone increasing.
The set of critical points of the energy, i.e. {(t, l) : G(t, l) = G c } is represented with
a dotted curve. This curve splits the (t, l)-plane into two regions: on the left is the set
202 M. Negri
Fig. 2 Quasi-static
evolutions: of energetic type
+
(dashed) and of BV-type
(solid) G < Gc
G > Gc
−
t# t∗
− +
{(t, l) : G(t, l) < G c } of stable points while on the right is the set {(t, l) : G(t, l) >
G c } of unstable points. From the picture it is clear that up to time t ∗ the crack is
not moving, since it is inside the stable region. At time t ∗ a “catastrophic evolution”
occurs: in the instantaneous transition from − to + the system crosses the unstable
region, since G(t ∗ , l) > G c for every l ∈ (− , + ). The energy landscape at time t ∗
is reported in Fig. 3: it is clear that F (t ∗ , ·) is not convex and that makes a transition
from − to + following a descent path, in particular F (t ∗ , − ) > F (t ∗ , + ).
Figure 2 shows (in dashed bold line) also the energetic evolution, obtained by
global energy minimization; this evolution presents a discontinuity as well, however
the qualitative behaviour is quite different: the system crosses first the stable and
then the unstable region, in particular a propagation occurs even if G(t # , − ) < G c .
The next Theorem provides the first characterization of the limit evolution. For a
proof, see [19].
Theorem 1 The limit evolution , obtained letting Δtn 0, is non-decreasing and
belongs to BV (0, T ). Moreover
Let us make some comments on the above Theorem. Clearly (4) and (5) play the
role of the “classical” KKT conditions, with some technical differences: the left limit
− is used instead of and the weak measure theoretic derivative d is used instead
of the speed . ˙ Both these technical details are due to the fact that in general the
evolution belongs to BV (0, T ) and thus it may have jump discontinuities. However
the qualitative meaning of (4) and (5) is quite clear and consistent with standard
KKT conditions. What is instead not common in the study of quasi-static evolutions
is condition (6) which characterizes the behaviour in the jumps in terms of unstable
branches of propagation (cf. also Fig. 2).
Finally, in terms of derivatives of the energy, the above Theorem reads as follows.
Corollary 1 The limit , obtained letting Δtn 0, is non-decreasing and belongs
to BV (0, T ). Moreover
where + (t)
diss(F (t, ·)) = |∂l F (t, l)|− dl
− (t)
evolution; they are indeed equivalent to the KKT conditions of Theorem 1. Without
entering too much into the technical details (for a complete proof the reader can
follow [20] or [22]) let us see how (7)–(9) follow from (10)–(11). First,
+ F (t, ·),
t∈J ()
where dac denotes the (weak) measure theoretic derivative of in [0, T ] \ J ().
Comparing with (11) we get
t
∂l F (τ, − (τ )) dac (τ ) + F (t, ·) = − diss(F (t, ·)).
0 t∈J () t∈J ()
Since the measure dac is supported in [0, T ]\J () it follows that
and that
+ (t) + (t)
F (t, ·) = ∂l F (t, l) dl = − |∂l F (t, l)|− dl, for every t ∈ J ().
− (t) − (t)
The former leads to (8), thanks to the continuity of G, while the latter leads to
that is (9).
We have seen that the limit evolution can have jump discontinuities in time. For this
reason it is convenient, both for theoretical and numerical purposes, to represent
Quasi-Static Evolutions in Brittle Fracture Generated … 205
s
F (t (s), l(s)) = F (0, l(0)) + ∂t F (t (r ), l(r )) t (r ) dr +
s 0
It is not difficult to see that, upon choosing the right parametrization, (12)–(13)
follows from (10)–(11) and vice versa (for a proof see [22]).
In this section we will deal with the phase-field approach for fracture, which in the
last decade has been an effective and popular method for the simulation of crack
propagation, see e.g. [1, 5, 7, 13, 16, 17] and many others. In particular we will
consider evolutions defined by the alternate minimization scheme; we will see that
in the limit they will provide indeed parametrized BV-evolution, with respect to a
suitable family of norms.
206 M. Negri
3.1 Setting
For sake of simplicity let us consider the same geometry and the same boundary
conditions of Sect. 2.1. In the phase-field framework it is however necessary to re-
define the space of admissible displacements by
Ut = {u ∈ H 1 (Ω, R2 ) : u = ±t ê on ∂ D Ω}
Z = {z ∈ H 1 (Ω) : 0 ≤ z ≤ 1}.
In the sequel we will work indeed with the spaces U and Z which are independent
of time. For ε > 0 and ηε > 0, typically with ηε = o(ε), the phase field elastic and
dissipated energy [2] will be respectively
Eε (t, u, z) = 1
2
t 2 (z 2 + ηε )W (Du) d x,
Ω
Kε (z) = 1
G
2 c
(z − 1)2 /2ε + ε|∇z|2 d x.
Ω
For our purposes the spaces of admissible variations for Z and U will be provided
respectively by
To conclude this section, let us see how to define a notion of energy release,
with respect to a variation ξ , and how to get the power of external forces. To this
end, denoting u(t, z) ∈ argmin {Eε (t, u, z) : u ∈ U } we will call “energy release
functional” (with respect to a variation ξ )
In this way the displacement field changes “simultaneously” with the variation of
the phase field variable. Actually, by minimality of u(t, z), the derivative can be
represented explicitly as (see e.g. [15])
Gε (t, z)[ξ ] = −∂z Eε (t, u, z) [ξ ] = − t 2 zξ W (Du) d x . (17)
Ω
Note that these variations normalize the “variation of crack length” since
dKε (z)[ξ̂ ] = G c (z − 1)ξ̂ /4ε + ε∇z · ∇ ξ̂ d x ≤ G c .
Ω
σ z (w) = (z 2 + ηε )σ (w).
As we did in Sect. 2.2, given Δt > 0, let tk = kΔt and set the initial conditions
u(t0 ) = u 0 and z(t0 ) = z 0 . Known u(tk−1 ) and z(tk−1 ) we will introduce a couple of
auxiliary sequences, u m and z m , with u 0 = u(tk−1 ) and z 0 = z(tk−1 ) defined recur-
sively by the following alternate minimization scheme [5]
208 M. Negri
u m ∈ argmin Fε tk , · , z m−1 : u ∈ U ,
(20)
z m ∈ argmin Fε (tk , u m , · ) : z ∈ Z with z ≤ z m−1 },
where the constraint z ≤ z m−1 models the irreversibility of the crack. Then we define
the updates u(tk ) = limm→+∞ u m and z(tk ) = limm→+∞ z m . More precisely, we have
the following result (for a proof see [14]).
In other terms, u(tk ) is an equilibrium point for Fε (tk , ·, z(tk )) while z(tk ) is an
equilibrium point for Fε (tk , u(tk ), ·) (for the latter remember the irreversibility con-
straint).
In this way, given Δt > 0 a time discrete evolution is provided in terms of the
equilibrium configurations (u(tk ), z(tk )) for tk = kΔt. In order to understand the
limit evolution, obtained by letting Δt → 0, we have first to recast the alternate
minimization scheme as a gradient flow. This is the goal of the next section.
In order to better understand the gradient flow structure behind (20) let us start with
an example: the minimization of a quadratic functional in a finite dimensional√setting.
Let F(x) = 21 x T Ax + b T x + c for x ∈ Rn and A T = A > 0. Let x A = x T Ax
be the “natural” norm induced by the symmetric, positive define matrix A, with
associated scalar product ·, · A .
Our problem is the following: given x0 find the increment x∗ in such a way that
x0 + x∗ is the minimizer of F. Since F is quadratic we can write
In the last term of the previous row we have introduced the notation ∇ AT F(x0 ) which
denotes the gradient of F (computed in x0 ) with respect to the norm · A , that is
the (unique) vector such that d F(x0 )[x ] = ∇ A F(x0 ), x A for every x ∈ Rn . For
our purposes it is just important to remark that ∇ A F(x0 ) = ∇ F(x0 )A−1 and that
A F(x0 ) = −∇ A F(x0 )/∇ A F(x0 ) A = argmin d F(x0 )[x ] : x A ≤ 1 .
−∇
In other terms, the normalized gradient ∇ A F(x0 ) provides the steepest descent
direction with respect to the norm · A . Now, let k = ∇ A F(x0 ) A and define
x(s) = x0 − λ(s) ∇A F(x0 ) where λ(s) = 1 − e−ks for s ∈ [0, +∞). Then, by homo-
geneity, x(s) solves the gradient flow
x (s) = −∇ A F(x(s)) for s ∈ [0, +∞),
(21)
x(0) = x0 .
Note that the right hand side is evaluated in x(s), and not in x0 , and that lims→+∞
x(s) = x0 + x∗ is exactly the minimizer. In other terms, the linear interpolation of
the points x0 and x0 + x∗ with parametrization λ(s) is the solution of the gradient
flow (21).
It is now time to turn back to the phase field setting.
With the above definitions the quadratic structure of the energy looks very clear,
indeed we can write the energy as
210 M. Negri
Fε (t, u, z) = 21 |u|2t,z + cz for cz = 21 G c (z − 1)2 /2ε + ε|∇z|2 d x,
Ω
⎧
⎪
⎨b(z) = G c z/2ε d x,
Fε (t, u, z) = 21 z2t,u − b(z) + ct,u for Ω
⎪
⎩ct,u = 1 ηε t 2 W (Du) + G c /2ε d x.
2
Ω
Note that cz is independent of u and vice versa ct,u is independent of z, while b(·)
is linear. As a consequence, the partial derivatives as well take a particularly simple
form, being
We will see in Sect. 3.4.3 how to write more explicitly the gradient flows originating
from alternate minimization.
Finally, it will be very convenient, if not necessary, to define a couple of slopes,
with respect to the “norms” defined above, that is
Consider a sequence Δtn 0. For each Δtn , let u(tn,k ) and z(tn,k ) (for tk = kΔtn )
be given by the alternate minimization scheme (20). In order to define the limit as
Δtn 0, it is natural to introduce, for every time tk , an arc length interpolation
of the alternate minimizing path (u m , z m ) which links (u(tk−1 ), z(tk−1 )) = (u 0 , z 0 )
with (u(tk ), z(tk )) = limm→+∞ (u m , z m ). In the end, the interpolation provides a
parametrization of the evolution, i.e. a map (0, Sn ) s → (tn (s), u n (s), z n (s)) with
tn (0) = 0, u n (0) = u 0 , vn (0) = v0 and tn (S) = T . In this respect, it is technically
quite delicate to show that the lengths Sn of the interpolating curve are uniformly
finite with respect to Δtn (for the detail the reader should make reference to the
forthcoming [14]). We can then apply an abstract results developed in [22] which
yields the following Theorem.
Theorem 3 Given Δtn 0, let s → (tn (s), u n (s), z n (s)) be the parameteriza-
tions of the discrete evolutions provided by the alternate minimization scheme
(20). Then, up to subsequences, there exists a limit normalized parametrization
s → (t (s), u(s), z(s)) with t (s) ≥ 0, z (s) ≤ 0 and t (s) + |u (s)|t (s),z(s) + z (s)
t (s),u(s) ≤ 1. Moreover, for every s with t (s) > 0 the following equilibrium
conditions holds
Quasi-Static Evolutions in Brittle Fracture Generated … 211
In the next section we will explain better the meaning of the previous Theorem,
which is by itself quite technical. However, the analogy with Theorem 2 should be
quite evident.
Let us start considering the points where t (s) > 0, which correspond in the para-
metric setting to continuity points in time. Equation (22) gives equilibrium. Indeed,
by definition of the slopes
Let us write more explicitly the equilibrium conditions. Introducing the phase field
stress σ z (u) = (z 2 + ηε ) σ (u) we get
∂u Eε (t (s), u(s), z(s))[φ] = σ z(s) (u(s)) : ε(φ) d x = 0, forφ ∈ Φ,
Ω
and thus ⎧
⎪
⎨div(σ z(s) (u(s))) = 0 Ω
u(s) = ±ê ∂D Ω (24)
⎪
⎩
σ z(s) (u(s)) n̂ = 0 ∂ N Ω,
reads
− Gε (t (s), z(s))[ξ ] + ∂z Kε (z(s))[ξ ] ≥ 0 for ξ ∈ Ξ (25)
where
∂z Kε (z(s))[ξ ] = G c (z − 1)ξ/4ε + ε∇z · ∇ξ d x .
Ω
Employing the normalized set Ξ̂z(s) and taking the supremum with respect to ξ̂ ∈
Ξ̂z(s) from (25) it follows
G ε (t (s), z(s)) ≤ G c , (26)
u(s) ∈ argmin Eε (t (s), u, z(s)) : u ∈ U
z(s) ∈ argmin Fε (t (s), u(s), z) : z ∈ Z , z ≤ z(s) .
Note that z(s) is only a constrained minimizer and that in general it is not true that
(u(s), z(s)) ∈ argmin Fε (t (s), u, z) : u ∈ U , z ∈ Z , z ≤ z(s)
In this section we will discuss a couple of thermodynamic issues: the first is simply the
energy balance (which will also lead to a KKT condition) while the second originates
from the relationship between irreversibility constraint and dissipated energy. In both
the cases we will assume that t (s) > 0 in a parametrization interval [s1 , s2 ] (so that
the corresponding evolution is continuous in time). By (19) and (22) we can rewrite
(23) as
s2
Fε (t (s2 ), u(s2 ), z(s2 )) = Fε (t (s1 ), u(s1 ), z(s1 )) + Pεext (t (r ), u(r ), z(r )) t (r ) dr,
s1
which is the usual energy balance in parametrized integral form. Using the chain rule
the above energy identity reads: for every s ∈ (s1 , s2 )
Fε (t (s), u(s), z(s)) = ∂t Fε (t (s), u(s), z(s)) t (s) + ∂u Fε (t (s), u(s), z(s))[u (s)] +
+∂z Fε (t (s), u(s), z(s))[z (s)]
= Pεext (t (s), u(s), z(s)) t (s) .
Quasi-Static Evolutions in Brittle Fracture Generated … 213
Here, note that equilibrium gives ∂z Fε (t (s), u(s), z(s))[ξ ] ≥ 0 for every ξ ∈ Ξ .
Using the notation of (25) the above identity in KKT fashion becomes
Gε (t (s), z(s)) − ∂z Kε (z(s)) [z (s)] = 0 , (27)
includes a gradient term which is not monotone. For instance, consider z 1 ≤ z 2 with
z 1 constant and with z 2 (highly) oscillating. Then
(z 1 − 1)2 /4ε d x ≥ (z 2 − 1)2 /4ε d x
Ω Ω
while
ε|∇z 1 |2 d x < ε|∇z 2 |2 d x.
Ω Ω
Note that
∂z Eε (t (s), u(s), z(s))[z (s)] = t 2 (s) z(s) z (s) W (Du(s)) d x ≤ 0,
Ω
because the only negative term is z (s). It follows that ∂z Kε (z(s))[z (s)] ≥ 0 and
thus the energy s → Kε (z(s)) is non-decreasing.
214 M. Negri
In this section we want to collect some properties of the evolution in the jumps, i.e.,
in the parametrization intervals [s1 , s2 ] where t (s) is constant, and thus t (s) = 0. It
is fair to say that at the current stage the picture is not fully clear and detailed. In
order to understand the main qualitative features it is not too restrictive to assume
that for s ∈ [s1 , s2 ] it holds
Under these assumptions, (23) implies (by the chain rule and convexity arguments)
that for a.e. s ∈ [s1 , s2 ] we have
u (s) ∈ argmin {∂u Fε (t (s), u(s), z(s))[φ] : φ ∈ Φ, |φ|t (s), z(s) = 1},
In other terms, u and z are the steepest descent direction for Fε with respect to
the “intrinsic norms”. If the mathematical meaning is formally clear, the physical
behaviour is understood only for the gradient flow of u. Indeed (cf. [14]) on the jumps
the displacement field evolves like a “phase-field visco-elastic flow”
⎧
⎪
⎨div(σ z(s) (u(s) + u (s))) = 0 Ω
u(s) = ±ê ∂D Ω
⎪
⎩
σ z(s) (u(s) + u (s)) n̂ = 0 ∂ N Ω.
On the contrary, it seems not easy to provide a meaningful PDE for the evolution of
the phase field variable z since Ξ is not a space, but just a convex set.
4 Open Problems
In a broader perspective, the most interesting, and probably most difficult, open
problem is the convergence of the parametrized quasi-static BV -evolutions, say
s → (tε (s), u ε (s), z ε (s)), as ε → 0. On the base of Γ -convergence [2, 6] it is
expected a sharp crack evolution, possibly in the space SBD [4] or GSBD [8]. How-
ever, Γ -convergence has been crafted to study the convergence of energies and global
minimizers but it is not enough to provide convergence of equilibrium points and
slopes which are the main ingredients for gradient flows and BV -evolutions. In gen-
eral to have convergence of BV -evolutions it is necessary to have at least a sort of
Γ -liminf inequality for the slopes [22], as it is for gradient flows [25]. However, it
is not yet known any reasonable notion of slope in SBD spaces.
Quasi-Static Evolutions in Brittle Fracture Generated … 215
In this direction some partial results have been published: for instance [11] (in the
one dimensional setting) [21, 27] (with geometrical restriction on the crack) and [3]
(with a regularized energy).
References
1. Abdollahi, A., & Arias, I. (2012). Phase-field modeling of crack propagation in piezoelectric
and ferroelectric materials with different electromechanical crack conditions. Journal of the
Mechanics and Physics of Solids, 60(12), 2100–2126.
2. Ambrosio, L., & Tortorelli, V. M. (1990). Approximation of functionals depending on jumps
by elliptic functionals via Γ -convergence. Journal of the Mechanics and Physics of Solids,
43(8), 999–1036.
3. Babadjian, J. F. & Millot, V. (2014) Unilateral gradient flow of the Ambrosio-Tortorelli func-
tional by minimizing movements. Journals Annales de l’Institut Henri Poincar.é Annales:
Analyse Non Lineaire, 31(4), 779–822.
4. Bellettini, G., Coscia, A., & Dal Maso, G. (1998). Compactness and lower semicontinuity
properties in SBD(Ω). Mathematische Zeitschrift, 228(2), 337–351.
5. Bourdin, B., Francfort, G. A., & Marigo, J.-J. (2000). Numerical experiments in revisited brittle
fracture. Journal of the Mechanics and Physics of Solids, 48(4), 797–826.
6. Chambolle, A. (2003). A density result in two-dimensional linearized elasticity and applica-
tions. Archive for Rational Mechanics and Analysis, 167(3), 211–233.
7. Conti, S., Focardi, M. & Lurlano, F. Phase-field approximation of cohesive fracture energies.
8. Dal Maso, G. (2013). Generalised functions of bounded deformation. Journal of the European
Mathematical Society (JEMS), 15(5):1943–1997.
9. Efendiev, M. A., & Mielke, A. (2006). On the rate-independent limit of systems with dry
friction and small viscosity. Archive for Rational Mechanics and Analysis, 13(1), 151–167.
10. Francfort, G. A., & Marigo, J.-J. (1998). Revisiting brittle fracture as an energy minimization
problem. Journal of the Mechanics and Physics of Solids, 46(8), 1319–1342.
11. Francfort, G. A., Le, N. Q., & Serfaty, S. (2009). Critical points of Ambrosio-Tortorelli converge
to critical points of Mumford-Shah in the one-dimensional Dirichlet case. Archive for Rational
Mechanics and Analysis, 15(3), 576–598.
12. Griffith, A. A. (1920). The phenomena of rupture and flow in solids. Journal of the Mechanics
and Physics of Solids, 18, 163–198.
13. Hesch, C., & Weinberg, K. (2014). Thermodynamically consistent algorithms for a finite-
deformation phase-field approach to fracture. Journal of the Mechanics and Physics of Solids,
99(12), 906–924.
14. Knees, D. & Negri, M. Convergence of alternate minimization schemes for phase field fracture
and damage.
15. Knees, D., Rossi, R., & Zanini, C. (2013). A vanishing viscosity approach to a rate-independent
damage model. Journal of the Mechanics and Physics of Solids, 23(4), 565–616.
16. Larsen, C. J., Ortner, C., & Süli, E. (2010). Existence of solutions to a regularized model of
dynamic fracture. Journal of the Mechanics and Physics of Solids, 20(7), 1021–1048.
17. Miehe, C., Welschinger, F., & Hofacker, M. (2010). Thermodynamically consistent phase-field
models of fracture: Variational principles and multi-field FE implementations. Journal of the
Mechanics and Physics of Solids, 83(10), 1273–1311.
18. Mielke, A. (2011). Differential, energetic, and metric formulations for rate-independent
processes. In nonlinear PDE’s and applications, volume 2028 of lecture notes in mathematics
(pp. 87–170). Heidelberg: Springer.
19. Negri, M. (2010). A comparative analysis on variational models for quasi-static brittle crack
propagation. Journal of the Mechanics and Physics of Solids, 3, 149–212.
216 M. Negri
20. Negri, M. (2010). From rate-dependent to rate-independent brittle crack propagation. Journal
of the Mechanics and Physics of Solids, 98(2), 159–178.
21. Negri, M. (2013). From phase-field to sharp cracks: Convergence of quasi-static evolutions in
a special setting. Journal of the Mechanics and Physics of Solids, 26, 219–224.
22. Negri, M. (2014). Quasi-static rate-independent evolutions: Characterization, existence,
approximation and application to fracture mechanics. Journal of the Mechanics and Physics
of Solids, 20(4), 983–1008.
23. Negri, M., & Ortner, C. (2008). Quasi-static propagation of brittle fracture by Griffith’s criterion.
Journal of the Mechanics and Physics of Solids, 18(11), 1895–1925.
24. Pandolfi, A., & Ortiz, M. (2012). An eigenerosion approach to brittle fracture. Journal of the
Mechanics and Physics of Solids, 92(8), 694–714.
25. Sandier, E., & Serfaty, S. (2004). Gamma-convergence of gradient flows with applications to
Ginzburg-Landau. Journal of the Mechanics and Physics of Solids, 57(12), 1627–1672.
26. Schmidt, B., Fraternali, F., & Ortiz, M. (2008). Eigenfracture: An eigendeformation approach
to variational fracture. Journal of the Mechanics and Physics of Solids, 7(3), 1237–1266.
27. Sicsic, P., & Marigo, J. -J. (2013). From gradient damage laws to Griffith’s theory of crack
propagation. Journal of Elasticity, 113(1), 55–74.
Improving the Material-Point Method
Abstract The material-point method (MPM) was introduced about 20 years ago and
is a versatile method for solving problems in continuum mechanics. The flexibility
of the method is achieved by combining two discretizations of the material. One is
a Lagrangian description based on representing the continuum by a set of material
points that are followed throughout the calculation. The second is a background grid
that is used to solve the continuum equations efficiently. In its original form, some
applications of the method appeared to be second order accurate while other tests
showed poor or no convergence. This paper provides a framework for analyzing the
errors in MPM. Moreover, the analysis suggests modifications to the algorithm to
improve accuracy. The analysis also points to connections between MPM and other
meshfree methods.
1 Introduction
mials up to a given order. These methods most often use a weak formulation of the
governing equations resulting in an implementation that is similar in structure to finite
element methods but with shape functions developed from the scattered data points.
A continuing research topic is development of efficient and stable methods for the
integration of the weak form equations [7]. More recently, the Optimal Transporta-
tion Meshfree (OTM) [12] scheme was developed based on max-ent interpolation [1].
The equations of motion are formulated from optimal transportation theory which
discretizes the inertial action in space and time within a variational framework.
The material-point method (MPM) [20] was developed contemporaneously with
the meshfree technologies but its formulation was inspired by the particle-in-cell
method [6, 10]. Unlike general meshfree methods, MPM is restricted to solving
problems in continuum mechanics. The method discretizes the continuum based on
representing it by a set of Lagrangian material points that are followed throughout the
calculation. Like meshfree methods, functions are reconstructed from these scattered
data points. However, these functions are then evaluated to provide information for
a grid where continuum equations of motion are solved efficiently.
Equations of motion are solved in an updated Lagrangian frame on the computa-
tional grid, using standard finite difference or finite element methods. Advection is
modeled by moving the material points in the computed velocity field. Each numer-
ical material point carries its material properties without error while it is advected.
Since all the properties of the continuum are assigned to the numerical material
points, the information carried by these points is enough to characterize the flow
and the grid carries no permanent information. Accuracy and consistency of MPM
have been examined with fragmentary results in the literature, e.g. [18, 19]. The goal
of this paper is to establish a framework for studying the accuracy of MPM. The
accuracy of each step in the algorithm is examined as well as how the steps interact,
in order to make improvements.
For time-dependent problems, there are four steps in the algorithm: (i) choose a
convenient computational grid; (ii) map information from the material points to the
grid; (iii) solve the field equations on the grid; and (iv) update the material points
based on the grid solution. Similar accuracy should be maintained by the function
reconstruction in step (ii) and the grid-based solution method chosen to advance the
solution in step (iii). Finally, a scheme of consistent accuracy must be employed to
update the material points in step (iv) in order to maintain the overall accuracy of the
method. Our analysis brings to light connections between MPM and other meshfree
methods. Moreover, our analysis shows how to exploit the function reconstruction
techniques used in meshfree methods to improve MPM.
2 Preliminaries
tioned into disjoint subsets, Γ0 = Γu Γt and Γu Γt = ∅, such that displacement
is prescribed on Γu and traction is prescribed on Γt .
Let the spatial (current) configuration of the same body be Ω(t) ∈ Rd , with points
labeled x. Assume there exists a smooth mapping, the motion of the body, ϕ : Ω(0) ×
[0, T] → Rd , such that Ω(t) = ϕ(Ω(0), t) and x = ϕ(X, t), where [0, T] ⊂ R is
the time interval of interest. The current boundary is Γ (t) = ∂Ω(t) with Γ (t) =
Γt (t) ∪ Γu (t) and Γt (t) ∩ Γu (t) = ∅, where Γu (t) = ϕ(Γu , t) and Γt (t) = ϕ(Γt , t).
The deformation gradient of the motion is defined as
∇ · σ + ρb = ρa in Ω(t) (2)
σ ·n =t on Γt (t) (3)
u=u on Γu (t) (4)
In these equations, the mass density is ρ(x, t), the acceleration is a(x, t), the velocity
is v(x, t), the displacement is u(x, t), the specific body force is b(x, t) and the
Cauchy stress is σ (x, t). Boundary conditions can consist of applied traction, t, on
a portion of the boundary denoted by Γt (t), with n being the unit outward normal
to the boundary, and prescribed displacement, u, on a portion denoted by Γu (t). A
constitutive equation for the stress is required to complete the specification. For this
work, we will use a neo-Hookean constitutive model extended to the compressible
range,
J σ = λ(ln J )I + μ F F T − I , (5)
where μ and λ are the Lamé constants and I is the second order identity tensor.
The philosophy of MPM is to solve the equations of motion on a background
grid, but to keep track of trajectories of a set of material points, where the material
points represent the geometry of the body Ω(t) during deformation and carry the
solution. The process is accomplished by mapping information between the grid and
the material points. The details are given in the next section.
220 D. Sulsky and M. Gong
A body occupying Ω(0) is discretized into a finite set of N p regions, Ω p (0), with
a material point at the centroid, with position x p (0), p = 1, 2, . . . , N p . Each of
these points represents a volume of material, V p (0), the volume of Ω p (0), with a
mass, m p = Ω p (0) ρ0 (X ) d V . The material points move in time and the associated
volume moves with them. Mass conservation requires that m p be constant in time.
The aim is to determine the position, x p (t), the density, ρ p (t), the velocity, v p (t),
and the stress, σ p (t), at time t associated with these points given the initial values,
ρ p (0) = ρ0 (x p (0)), v p (0) = v0 (x p (0)) and σ p (0) = σ0 (x p (0)).
Mapping the information from the material points to the grid involves reconstructing
a function from scattered data on the material points, and evaluating the reconstructed
function on the background grid. That is, given data, u p , p = 1, 2, . . . , N p at material
points, we wish to determine basis functions ψ p[r ] (x) so that the reconstructed function
can be written in the form
Np
u R (x) = ψ p[r ] (x)u p . (6)
p=1
Np
u(x) = ψ p[r ] (x)u(x p ) ∀u ∈ Pr . (8)
p=1
(2, 0, 0, . . . , 0), (1, 1, 0, . . . , 0), . . . , (0, 0, . . . , 0, r ). The basis functions in (6) are
written in terms of a weight function wh (x) with compact support defined by h, and
a local polynomial basis,
ψ p[r ] (x) = wh (x − x p ) (x − x p )α aα (x), p = 1, 2, . . . , N p . (9)
|α|≤r
Np
m α (x) = wh (x − x p )(x − x p )α . (11)
p=1
Np
M(x) = wh (x − x p )h(x − x p )h(x − x p )T , (13)
p=1
The moment matrix is symmetric and positive semi-definite and the sum of rank one
matrices. A necessary condition for M(x) to be nonsingular is that for any x there
be at least Nr nonzero terms in the sum (13). The resulting shape functions ψ p[r ] (x)
form a partition of unity and if wh ∈ C k then ψ p[r ] ∈ C k , p = 1, 2, . . . , N p .
For this work, we restrict our attention to functions u(x) in a Hilbert space and
assume the boundary of the domain is smooth. Under appropriate hypotheses on the
distribution of material points so that the moment matrix is nonsingular, we have the
following error estimates for the function reconstruction [9]. For any u ∈ H m+1 (Ω)
and w ∈ C k
in particular, for l = 0
wh (x − x p )
ψ p[0] (x) = Np
, (19)
p=1 w h (x − x p )
for some weight function, w. The standard MPM uses a linear hat function weighted
by the material-point mass for the weight function
wh (x − x p ) = m p s(x − x p ). (20)
In one dimension, the linear hat function, written in terms of natural coordinates, is
s = ŝ(ξ ),
1 − |ξ | |ξ | ≤ 1
ŝ(ξ ) = , (21)
0 otherwise
where ξ = (x p − x)/ h is the natural coordinate with h being the support of s and the
mesh spacing. In two space dimensions, the weight function is the tensor product of
one-dimensional weight functions, s(ξ, η) = ŝ(ξ ) ⊗ ŝ(η), with a similar construc-
tion in three dimensions.
0.8 0.8
f (x)
f (x)
0.6 0.6
0.4 0.4
0.2 0.2
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
x x
Fig. 1 Shepard interpolation using the functions f (x) = 1 and f (x) = x (shown with solid lines)
and a randomly-placed sampling points or b equally-spaced sampling points. The circles on the x-
axis show the positions of the sampling points. The other circles in the plots show the reconstructed
values f R (x I ). With equally-spaced points, the reconstructed function is exact for constant functions
and exact at the interior nodes for linear functions
element, sample the function f (x) at two randomly selected points. The sampling
points, x p , p = 1, 2, . . . , N p = 2Ne , play the role of material points. Set f p = f (x p ),
p = 1, 2, . . . , N p . Now reconstruct the function using the material-point data to
obtain f R (x). In MPM, we are interested in the value of the reconstructed function
at nodes, f R (x I ).
By construction, Shepard interpolation reproduces constant functions. If we set
f (x) = 1 and take a = 0, b = 1 and Ne = 8, Fig. 1a shows the exact function (solid
line) and the reconstructed values at nodes, x I , (circles). The circles along the x-axis
indicate the position of the randomly selected material points. The figure shows that
f (x) = 1 is reconstructed exactly. The figure also shows the reconstructed nodal
values for f (x) = x which are not exact. However, for equally-distributed sampling
points, the Shepard interpolation reproduces linear functions except on the boundary
of the domain. Figure 1b shows the reconstruction of f (x) = 1 and f (x) = x when
the material points are equally spaced with N pe = 2 material points per element
located at x p = x I −1 + (2 p e − 1)h/N pe , p e = 1, 2, . . . , N pe , I = 1, 2, . . . , Ne and
p = (I − 1)N pe + p e . For linear functions, f R (x I ) = x I exactly in the interior of
the domain but there is an O(h) error at the boundary points.
Figure 2 shows the same type of reconstruction except using the smooth func-
tion f (x) = sin π x on [0, 1], again with Ne = 8 and N pe = 2. For this example, the
reconstruction is not exact whether or not the sampling points are equally spaced
or randomly selected. However, the errors get smaller as h gets smaller, but the rate
of convergence differs. We examine the L 2 convergence rate as given by the error
estimate (17). For each h, we compute the L 2 error excluding the endpoints of the
domain by approximating the integral
x Ne
( f (x) − f R (x))1/2 d x (22)
x1
224 D. Sulsky and M. Gong
(a) 1 (b) 1
0.9 0.9
0.8 0.8
0.7 0.7
0.6 0.6
f (x)
f (x)
0.5 0.5
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
x x
(c) 0
unequally spaced
equally spaced
-2
-4
log 10(L2 error)
-6
1
-8
-10 2
-12
-14
-6 -5 -4 -3 -2 -1
log 10(h)
Fig. 2 Shepard interpolation using the function f (x) = sin π x (solid line) with a randomly placed
sampling points and b equally spaced sampling points. The circles on the x-axis show the positions
of the sampling points. The other circles in the plots show the reconstructed values f R (x I ). c Shows
the convergence rate with h using random and equally spaced sampling points
using the trapezoidal rule based on the nodal values of the integrand. With randomly-
selected sampling points, the convergence rate is O(h) as expected from (17). With
equally-spaced points the convergence rate is one order higher than expected, O(h 2 ).
Since there are O(h) errors at the end points, if the end point values are included in
the error estimate, the rate would be O(h 3/2 ) for one space dimension.
In MPM, Eq. (2) is solved in weak form. In order to simplify the task, assume u = 0
on Γu (t) and Γt (t) = ∅. That is, displacement is prescribed over the entire boundary
and the domain boundary is fixed in time. We also assume a nice boundary, such as a
piecewise polygonal boundary with the background mesh conforming to the domain
boundary. Under these conditions, the weak form of the governing equation is
Improving the Material-Point Method 225
σ · ∇η − ρ(b − a) · η dv = 0 (23)
Ω(t)
D h n
d
N n N
vh (x, t) = u (x, t) = u I (t)N I (x) = v I (t)N I (x). (25)
Dt I =1
dt I =1
D h n
d
N n N
a h (x, t) = v (x, t) = v I (t)N I (x) = a I (t)N I (x). (26)
Dt I =1
dt I =1
Over a time step, the grid is Lagrangian. Thus, over the step, the shape function
does not change with time. This fact is significant since a time derivative of the
shape function is not required in the formulas for velocity and acceleration. Also,
the natural coordinates for a material point in an element remain constant over the
Lagrangian time step.
The approximations, (28)–(30), along with a representation for an admissible,
smooth, virtual displacement η,
Nn
ηh (x) = η I N I (x), (27)
I =1
Nn
− ∇ηh : σ dv = − ηI · ∇ N I (x) · σ (x, t)dv (29)
Ω(t) I =1 Ω(t)
Nn
ηh · ρ(x, t)bdv = ηI · ρ(x, t)b(x, t)N I (x)dv. (30)
Ω(t) I =1 Ω(t)
Np
∼ ρ(x p (t), t)N I (x p (t))N J (x p (t))V p (t)
(31)
p=1
Np
= m p N I (x p )N J (x p ),
p=1
where the material-point mass is related to the density and volume through the
equation m p = ρ(x p (t), t)V p (t).
The numerical simulations use the simpler lumped-mass matrix. This matrix is
diagonal with the diagonal entries obtained by summing over the corresponding row
of the consistent mass matrix and using the property NJ =1
n
N J (x) = 1,
Np
M I (t) = ρ(x, t)N I (x)dv ∼ m p N I (x p ). (32)
Ω(t) p=1
Np (33)
∼− ∇ N I (x)|x=x p σ p (t)V p (t).
p=1
In the above equation, the notation for the stress at the material point position,
σ p (t) = σ (x p (t), t) has been introduced.
Improving the Material-Point Method 227
Finally, the nodal values of the external forces arise from the body forces, Eq. (30),
Np (34)
∼ m p b(x p (t), t)N I (x p (t)).
p=1
The weak form of the momentum balance equates (28) to the sum of the forces
(29)–(30). Since the weak form must hold for arbitrary η I , except at nodes on the
boundary where the displacement is prescribed, we obtain the semi-discrete equation
for the nodal acceleration at unconstrained nodes
Nn
dv J (t)
M I J (t) = FI int (t) + FI ext (t). (35)
J =1
dt
dv I (t)
M I (t) = FI int (t) + FI ext (t). (36)
dt
The momentum equation is solved for the acceleration at unconstrained nodes, which
is then integrated in time to obtain the corresponding velocity and displacement.
dv I (t)
= a h (x I , t)
dt (37)
du I (t)
= vh (x I , t).
dt
Nodes constrained by the displacement boundary conditions move according to those
prescribed constraints.
Once the equation of motion on the grid is solved, we need to update the material-
point information. The material points move along with the flow within an element
as it deforms in a Lagrangian manner over the time step. The material-point velocity
and position are updated as
dv p (t)
= a h (x p , t)
dt (38)
d x p (t)
= v (x p , t).
h
dt
228 D. Sulsky and M. Gong
d F p (t)
= L p (t)F p (t), L p (t) = ∇vh (x, t)|x=x p (t) . (39)
dt
The density and volume are obtained from the algebraic update
After the material-point information is updated, a new grid is generated for the next
time step. Usually in MPM the grid points that have moved during the Lagrangian
step are moved back to their previous locations. Note, however that one can generate
any convenient grid instead.
4 Time Discretization
where the mass comes from (32). The internal force comes from Eq. (33)
Np
FI int,n = − ∇ N I (x)|x=x np σ pn V pn , (42)
p=1
Improving the Material-Point Method 229
ext,n
Np
FI = m p b(x np , t n )N I (x np ). (43)
p=1
n+ 21 n− 21
vI = vI + Δt n a nI . (45)
n+ 21 u n+1 − u nI 1
vI = I
= 1 (u I
n+1
− u nI ). (46)
t n+1 −t n
Δt n+ 2
Likewise, this formula can be converted into an update for the displacement
1 n+ 21
u n+1
I = u nI + Δt n+ 2 v I . (47)
The material points are transported in the same fields as the nodes. Since a nI =
n+ 21 1
a h (x I , t n ) and v I = vh (x I , t n+ 2 ), the material points are updated using
n+ 21 n− 21 n− 21
Nn
vp = vp + Δt n a h (x np , t n ) = v p + Δt n N I (x np )a nI ,
I =1
n+ 21
Nn
n+ 21 (48)
n+ 21 h n+ 21 n+ 21
x n+1
p = x np + Δt v (x p ,t )= x np + Δt N I (x np )v I ,
I =1
u n+1
p = x n+1
p − x 0p ,
Information from the grid is also used to update the material-point stress state. In this
work, we will consider hyperelastic models that are specified through the deformation
gradient. The deformation gradient is updated using the chain rule
∂ x n+1 n
F pn+1 = F , (49)
∂xn p
230 D. Sulsky and M. Gong
and
∂ x n+1 1 n+ 1
= I + Δt n+ 2 L p 2 , (50)
∂x n
where
n+ 21
Nn
n+ 21
Lp = ∇ N I (x np )v I . (51)
I =1
Once the deformation gradient is advanced in time, we can compute the stress, σ pn+1
at material points from the constitutive equation. Likewise the density is ρ n+1 p =
ρ0 (x 0p )/J pn+1 with J pn+1 = det(F pn+1 ). Similarly, the volume corresponding to the
material point is V pn+1 = J pn+1 V p (0).
5 Numerical Example
To keep the presentation simple while illustrating the ideas, a one-dimensional exam-
ple will be used to demonstrate the convergence properties of MPM. The example
is constructed using the method of manufactured solutions described in [18, 19] and
outlined in the next section.
The standard MPM and its improvements will be examined through the solution of
a one-dimensional bar problem. We would like a problem with an exact solution so
that we may accurately compute the order of convergence of the method. An example
consisting of a longitudinally vibrating bar can be obtained by writing the problem
in the reference configuration. Consider the time dependent problem for a 1D, elastic
bar of length, L, with fixed ends, and constant cross-section. The equivalent equation
to (2) for the bar problem in the reference configuration is
∂ P(X, t) ∂ V (X, t)
+ ρ0 (X )B(X, t) = ρ0 (X ) , (52)
∂X ∂t
with boundary conditions
V (0, t) = V (L , t) = 0. (53)
In these equations V is the velocity, ρ0 is the original mass density, B is the body
force per unit mass, and P is the 1st Piola-Kirchoff stress. The relationship between
P and the Cauchy stress is
Jσ = P FT . (55)
For the manufactured solution, determine the body force so that the displacement,
U , and velocity, V , are given by
A
U (X, t) = sin π X sin Cπ t, V (X, t) = A sin π X cos Cπ t. (57)
Cπ
The corresponding initial conditions are
∂V
= −C 2 π 2 U. (60)
∂t
Therefore, to have a solution to equation (52), the body force must be
λ μ
ρ0 B = π 2 U 2
(1 − ln F11 ) + 2
(F 2
11 + 1) − ρ0 C 2
. (61)
F11 F11
This solution is valid for any choice of C 2 . We will use C 2 = E/ρ0 , where E is the
Young’s modulus. With ν being Poison’s ratio, we can set μ = E/(2(1 + ν)) and
λ = Eν/((1 + ν)(1 − 2ν)) in the usual way.
If we introduce dimensionless variables so that lengths are scaled by L, density
by ρ0 , velocity by C, and stresses by E, we need only consider a bar of unit length
with dimensionless density ρ0 = 1, dimensionless modulus, E = 1 and therefore
232 D. Sulsky and M. Gong
C = 1. The only free parameter is the dimensionless amplitude of the initial velocity
 = A/C which controls the amount of deformation in the bar.
The material-point solution is an approximation to the solution of the partial
n+ 1 1
differential equations where u np ∼ U (x 0p , t n ), v p 2 ∼ V (x 0p , t n+ 2 ), σ pn ∼ P(x 0p , t n ),
and F pn ∼ F(x 0p , t n ). In one space dimension, the two-norm of the error can be
computed using one-point quadrature to approximate the integral defining the norm.
In order to measure the error in mesh-based quantities, the exact material-point
quantities are projected to the grid. For example, the exact material-point velocity,
1
V (x 0p , t n+ 2 ), is used to reconstruct a velocity field which is then evaluated at grid
n+ 21
nodes and compared with the numerical solution, v I .
Standard MPM with leapfrog time differencing is given by the equations in Sect. 4
with function reconstruction based on ψ [0] as given in Sect. 3.1.1. To test convergence,
the method is applied to solve the problem of Sect. 5.1. Since we wish to consider
smooth solutions, the choice of the dimensionless parameter  should be less than
one to avoid the formation of shocks. The last consideration in running the problem
is stability. A necessary condition for the stability of the leapfrog scheme is that the
Courant-Friedrichs-Levy (CFL) condition be satisfied. For this difference scheme,
the CFL stability requirement is
cΔt
max ≤ 1. (62)
h
The maximum is over all x at time t and c is √ the maximum√ wave speed.√ In the
one-dimensional case, the wave speed is c = E/ρ = C ρ0 /ρ = C J√. For the
manufactured
√ solution of Sect. 5.1, we can compute the maximum of J to be
1 + A/C. To insure stability for all cases, we choose Δt = 0.7h.
Figure 3a shows a plot of the logarithm of the L 2 norm of the error in various
computed quantities versus the logarithm of the mesh size h for standard MPM.
The error is the error in the computed solution to the problem from Sect. 5.1 solved
using E = 1, ν = 0, ρ0 = 1, L = 1 and  = 0.001π . The mesh sizes considered are
h = 2−(i+2) , i = 1, 2, 3, 4. There are initially two, equally-spaced, material points
per element. The computed quantities considered in the figure are σ p , ρ p , v p , and
u p —the stress, density, velocity and displacement stored at material points, and,
v I , the velocity stored on the grid. The simulations are run while the time satisfies
t ≤ tmax and tmax is set to a dimensionless time of 2. Thus, the simulations terminate at
a time tend such that tend ≤ tmax and tend + Δt > tmax . The error norms are computed
at tend for the density, stress and displacement, and at tend − Δt/2 for the velocity.
The observed convergence rate for the deformation gradient, density and stress is
1.8. The rate is the same for these quantities since the density and stress are algebraic
Improving the Material-Point Method 233
(a) (b)
Fig. 3 Convergence of various computed quantities using the standard MPM for the problem in
Sect. 5.1 with a  = 0.001π and b  = 0.1π
functions of the deformation gradient. The figure shows data for the density only. The
observed convergence rate for the displacement is 1.7. The convergence behavior for
the velocity is a bit erratic but a least squares fit gives an overall convergence rate of
2.4.
The calculation is repeated in Fig. 3b except with a larger value of  = 0.1π .
The results shown in the plot indicate that MPM fails to converge. The difference
in these outcomes is due to the change in the parameter Â. Note that the maximum
displacement of a material point is Â/π and the distance of a material point to the
boundary of an element is initially h/4. Thus, when the mesh size is smaller than
h = 4 Â/π , material points cross element boundaries during the numerical solution
procedure. In Fig. 3a, the mesh sizes used are such that no material points cross
cell boundaries and in Fig. 3b all of the mesh sizes used result in material points
crossing cell boundaries. These results emphasize the conditional consistency of the
standard MPM and may explain seemingly contradictory reports of convergence and
of non-convergence of the method in the literature.
In the standard MPM, there are two main steps that account for inaccuracies of the
algorithm. The primary error corresponds to the quadrature rule used in standard
MPM to approximate the nodal forces or nodal masses. The material points are used
as the quadrature points. Since the material points move, they can be located arbi-
trarily in the computational element, degrading accuracy. Indeed, for large enough
motion relative to the grid size, the method is inconsistent. The other source of error
corresponds to the mapping from the material-point information to the grid. As we
saw in Sect. 3.1, the standard algorithm uses Shepard interpolation which is first
order for arbitrarily placed material points. The improvements considered in this
section are intended to achieve consistent second order accuracy of the method.
234 D. Sulsky and M. Gong
Since the function reconstruction outlined in Sect. 3.1 can, in principle, be used to
reconstruct functions to any order of accuracy from scattered data, we use this prop-
erty to obtain second order accuracy in MPM. The guiding principle is to regard the
material points as sampling points, and use the function reconstruction to approxi-
mate quantities as needed. For example, use the material-point velocity to reconstruct
the velocity field and evaluate the reconstructed velocity field at grid nodes to start
a time step. Similarly, reconstruct a stress field from the material-point values and
evaluate the reconstructed stress field at quadrature points when computing internal
forces.
To obtain the nodal velocity, standard MPM uses a shape function determined
from the general form given in Eq. (9) using r = 0 with the weight function wh
chosen to be the linear hat function given in Eq. (21), weighted by material-point
mass. With r = 0, in general, only constant functions are reproduced exactly. To
increase the order of accuracy, the improved method uses ψ p[1] (x) determined from
the same general form, but using a polynomial basis consisting of the constant and
linear monomials. There is also the choice of weight function; and to construct
ψ p[1] (x) we use the linear hat function, Eq. (21), without the mass weighting. The
reconstructed function is then continuous and the consistency error is O(h 2 ).
Smoother reconstructed functions can be achieved using higher order splines for
the weight function. For example, the cubic spline would give a C 2 reconstructed
function. The cubic spline expressed in natural coordinates is
⎧2
⎨ 3 − ξ 2 + 21 |ξ |3 |ξ | ≤ 1
ŝ3 (ξ ) = 43 − 2|ξ | + ξ 2 − 16 |ξ |3 1 ≤ |ξ | ≤ 2 . (63)
⎩
0 otherwise
To distinguish a shape function constructed with the cubic spline as weight from
ψ p[1] (x) which uses the linear hat function for the weight function, introduce the
notation C [1]
p (x) to denote the former. Note that both shape functions have the same
O(h ) consistency error since they are constructed to reproduce constant and linear
2
functions. In addition to being smoother, the wider support of the shape function
C [1]
p (x) is also useful if a situation arises where there is only one material point in
an element; since, in that case, the moment matrix for ψ p[1] (x) would be singular.
The linear hat function or the cubic spline are convenient weight functions for
reproducing nodal values since it is easy to determine which nodes are in the support
of the shape function associated with a material point. To determine function values
at the centers of elements (the quadrature points for one-point quadrature), it is
numerically simpler to use a quadratic spline rather than the linear hat function or
cubic spline since the quadratic spline does not require logic to determine which
element centers are in the support of the weight function of a given material point.
The shape function Q [1]p (x) used to reconstruct values at the quadrature points is the
Improving the Material-Point Method 235
⎧3
⎨ 4 − ξ2 |ξ | ≤ 21
ŝ2 (ξ ) = 2 ( 2 − |ξ |) 21 ≤ |ξ | ≤
1 3 2 3
. (64)
⎩ 2
0 otherwise
The quadratic and cubic spline weight functions render the reconstruction nonlocal
to the element and they require the use of a structured (logically rectangular) grid.
We again consider the function f (x) = sin π x on [0, 1]. Divide the domain into
Ne = 8 elements and use N pe = 2 randomly chosen material points per element as
sampling points. Figure 4a shows the reconstructed values f R (x I ) at nodes, using
(a) 1 (b) 1
0.9 0.9
0.8 0.8
0.7 0.7
0.6 0.6
f (x)
f (x)
0.5 0.5
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
x x
(c) -4
nodal values
-5 element values
-6
log 10(L2 error)
-7
-8
-9
-10 2
-11
-12
-13
-7 -6 -5 -4 -3 -2
log 10(h)
Fig. 4 Function reconstruction using the functionf (x) = sin π x (solid line) with randomly chosen
sampling points. The circles on the x-axis show the positions of the sampling points. The other
circles in the plots show the reconstructed values a f R (x I ) using ψ p[1] (x) and b f R (xe ) using
Q [1]
p (x). c Shows the convergence rate with h for nodal and element values
236 D. Sulsky and M. Gong
ψ [1] (x) as the shape function. Similarly, Fig. 4b shows function values reconstructed
at element centers, using Q [1]
p (x) as the shape function. The rate of convergence for
both methods is shown in Fig. 4c. It is observed that the errors are nearly identical
for both methods and both have O(h 2 ) consistency errors.
n− 21
Np
n− 21
ψ p[1] (x I )v p
1
vI = v R (x I , t n− 2 ) = .
p=1
The density and the stress are needed at quadrature points for the elements in
order to compute the mass matrix and the internal forces, respectively. For the
one-dimensional problem, we use one-point quadrature with the quadrature point
located at the center of the element. The reconstructed values are
Np
ρen = ρ (xe , t ) =
R n
Q [1]
p (x e )ρ p ,
n
p=1
Np
σen = σ R (xe , t n ) = Q [1]
p (x e )σ p ,
n
p=1
Ne
f Iext,n = ρ(x, t n )b(x, t n )N I (x) d x ∼ N I (xe )ρen b(xe , t n )h. (66)
Ω e=1
Ne
M In = ρ(x, t n )N I (x) d x ∼ ρen N I (xe )h. (67)
Ω e=1
(3) Update the information on the material points. The update follows the same
equations as the standard method given in Eq. (48) for the velocity, position
and displacement. The density and deformation gradient are updated following
Eq. (49).
(4) Generate a new grid.
In this section we repeat the analysis of Sect. 6 using the improved method. We restrict
attention to the larger deformation case where  = 0.1π . Nodal values are obtained
using ψ p[1] and element values are obtained using Q [1] p . Figure 5 shows the logarithm
of the L 2 error as a function of the logarithm of the mesh size for the material-point
quantities, stress, velocity and displacement, and also for the grid stress, velocity
and displacement. The mesh sizes used in this figure are h = 2−(i+3) , i = 1, 2, 3, 4.
Fig. 5 Convergence of
various computed quantities
using the improved MPM for
the problem in Sect. 5.1 with
 = 0.1π . The observed
rates of convergence are σe
(1.98), σ p (1.87), v I (2.02),
v p (1.98), u i (2.01), u p
(2.01)
238 D. Sulsky and M. Gong
The observed accuracy is second order. This figure should be compared with Fig. 3b
which shows failure of the standard method to converge for this large deformation
case.
Similar behavior is obtained using C [1]
p for computing nodal values from the material
point data.
8 Conclusions
This paper has presented a framework for improving the accuracy of MPM. The
material points are viewed as providing data for a finite element method. In this
finite element method, in essence, a new grid is used each step with the configuration
at the n-th time step being the reference configuration for the step. Thus, it is necessary
to initialize the values on the grid at the beginning of each step. The initialization
is accomplished by evaluating a function at nodes that is reconstructed from the
material-point data. This function reconstruction from scattered data appears in other
contexts, notably other meshfree methods for continuum mechanics. We therefore
draw on the literature to find methods that perform the reconstruction to any desired
accuracy. A potential advantage to using MPM over other meshfree methods that
rely on the same function reconstruction arises from the use of the finite element
mesh to take gradients and perform quadrature. In MPM the shape functions for
function reconstruction are not used as the basis functions for solving the equations
of motion, avoiding costly differentiation of these functions. A potential advantage
of MPM over using the finite element method on its own is in avoiding problematic
mesh distortion when materials undergo large deformations.
To illustrate the ideas for improving MPM in one space dimension, two-noded dis-
placement elements have been used with function reconstruction based on reproduc-
ing linear polynomials. B-splines have been used as the weight function in building
the shape functions used for the function reconstruction. We have also used four-
noded quadrilaterals in two space dimensions and will report the results elsewhere.
Clearly, higher-order element formulations need to be paired with higher-order func-
tion reconstruction for overall higher-order accuracy of the method. Also, if higher-
order accuracy is needed in time, then the leapfrog scheme considered in this paper
also needs to be replaced with a method of appropriate order.
For small deformations relative to the grid size and equally-spaced, or nearly
equally-spaced material points, it is possible to obtain apparent second-order accu-
racy with the standard MPM algorithm. We have demonstrated that second-order
accuracy is possible with large deformations using the improved method. However,
there is more work to be done to achieve second-order accuracy in general. The prin-
cipal remaining issue is related to the fact that MPM represents the geometry of a
body implicitly through the distribution of material points. This representation makes
the treatment of general boundary conditions difficult, especially if one is interested
in accuracy greater than first order. We also find apparent instabilities arise if we drive
the simulations to very small mesh sizes. The source of the instability is unknown
Improving the Material-Point Method 239
at this time. Two potential sources are ill-conditioning of the moment matrices or a
ringing instability [5].
We have shown a path to obtaining methods with specified accuracy. It is of interest
to provide a mathematical analysis of the methods to make precise the conditions for
convergence at a given rate. This topic, as well as a stability analysis of the methods,
remain topics of continued research.
Acknowledgments This work was partially supported by the National Science Foundation under
grant ARC 1023667 to the University of New Mexico.
References
1. Arroyo, M., & Ortiz, M. (2006). Local maximum-entropy approximation schemes: A seamless
bridge between finite elements and meshfree methods. International Journal for Numerical
Methods in Engineering, 65(13), 2167–2202.
2. Atluri, S. N., & Zhu, T. (1998). A new meshless local Petrov-Galerkin (MLPG) approach in
computational mechanics. Computational Mechanics, 22, 117–127.
3. Beltschko, T., Lu, Y. Y., & Gu, L. (1994). Element-free Galerkin methods. International Journal
of Numerical Methods in Engineering, 37, 229–256.
4. Belytschko, T., Krongauz, Y., Organ, D., Fleming, M., & Krysl, P. (1996). Meshless meth-
ods: An overview and recent developments. Computer Methods in Applied Mechanics and
Engineering, 139, 3–47.
5. Brackbill, J. U. (1988). The ringing instability in particle-in-cell calculations of low-speed flow.
Journal of Computational Physics, 75, 469–492.
6. Brackbill, J. U., & Ruppel, H. M. (1986). FLIP: A method for adaptively zoned, particle-in-cell
calculations of fluid flows in two dimensions. Journal of Computational Physics, 65, 314–343.
7. Duan, Q., Gao, X., Wang, B., Li, X., Zhang, H., Belytschko, T., et al. (2014). Consistent
element-free Galerkin method. International Journal for Numerical Methods in Engineering,
99, 79–101.
8. Duarte, A. C., & Oden, J. T. (1996). H-p clouds–an h-p meshless method. Numerical Methods
for Partial Differential Equations, 12(6), 673–705.
9. Han, W., & Meng, X. (2001). Error analysis of the reproducing kernel particle method. Com-
puter Methods in Applied Mechanics and Engineering, 190, 6157–6181.
10. Harlow, F. H. (1957). Hydrodynamic problems involving large fluid distortions. Journal of the
Association for Computing Machinery, 4, 137.
11. Li, S., & Liu, W. K. (2002). Meshfree and particle methods and their applications. Applied
Mechanics Reviews, 55(1), 1–34.
12. Li, B., Habbal, F., & Ortiz, M. (2010). Optimal transportation meshfree approximation schemes
for fluid and plastic flows. International Journal for Numerical Methods in Engineering, 83,
1541–1579.
13. Liu, W., Jun, S., Li, S., Adee, J., & Beltschko, T. (1995). Reproducing kernel particle methods
for structural dynamics. International Journal of Numerical Methods in Engineering, 38, 1655–
1680.
14. Liu, W. K., Jun, S., & Zhang, Y. (1995). Reproducing kernel particle methods. International
Journal for Numerical Methods in Engineering, 20, 1081–1106.
15. Melenk, J. M., & Babuska, I. (1996). Partition of unity finite element method. Computer
Methods in Applied Mechanics and Engineering, 139, 314–389.
16. Nguyen, V. P., Rabczuk, T., Bordas, S., & Duflot, M. (2008). Meshless methods: A review and
computer implementation aspects. Mathematics and Computers in Simulation, 79(3), 793.
240 D. Sulsky and M. Gong
1 Introduction
some time is required for drainage to take place. When this drainage (i.e. dissipa-
tion of excess pore pressures) is complete, the solid phases totally takes the external
pressure. This process is known as soil consolidation [1]. The implementation of
the Biot’s equations [2] is a well-known way to solve problems in porous media
from a macro-scale point of view. The advantage of this method is the possibility of
accounting for coupling between the fluid phase and the solid skeleton. The u − pw
formulations, where u denotes the solid phase displacement, and pw is the pore fluid
pressure [3], have traditionally been employed for simulating coupled problems in
saturated porous media since the final equations work with less degrees of freedom
(three in 2D, four in 3D problems) compared with that of a complete formulation. The
recent u − w formulation, where w represents the relative fluid displacement with
respect to the solid phase, which is usually referred as the displacement-based or com-
plete formulation, has been employed in several numerical schemes (López-Querol
et al. [4], and recently adopted by Cividini and Gioda [5]). Such a methodology is
assumed in this work, first for its simplicity in imposing impervious boundary con-
ditions compared to the u − pw approaches; second, as the free surface comes out
naturally as the zero-pressure contour, no detection algorithm is necessary; third, it
facilities the modelling of large and/or nonlinear deformations of the solid phase as
well as the possible separation between the solid and fluid phases in the case of local
failure (liquefaction or slope instability). Since meshfree numerical schemes have
been known to perform particularly well in the regime of large deformations, we
endeavour to apply such schemes to coupled problems in saturated porous media,
using the u − w formulation.
There are many different flavours of meshfree methods available. The present
research has been carried out using the principle of maximum entropy [6], the shape
functions developed by Arroyo and Ortiz [7], in particular, the OTM framework [8],
for its numerous advantages in comparison with its alternatives. For example, the
exact mass transport, the satisfaction of the continuity equation, exact linear and
angular momentum conservation in order to solve different problems as spurious
modes, tensile instabilities and unknown convergence or stability properties. Since
the deformation and velocity fields are interpolated from nodal values using max-
ent shape functions, the Kronecker-delta property at the boundary makes it possible
for the direct imposition of essential boundary conditions. In the current work, an
Eulerian framework is employed to solve the Biot’s equations for porous media. In
addition, the parameters pertinent to the local maximum entropy are obtained effi-
ciently and independent of the support size through the Nelder-Mead algorithm [9].
Locking in near-incompressible materials is not unusual for the numerical meth-
ods based on either finite elements approaches [10–17] or meshfree approximation
schemes [18]. In the case of flow through saturated media, in a displacement formu-
lation, since both the undrained soil phase and the fluid phase are nearly incompress-
ible, locking may also occur. The recent approach developed by Ortiz and Suku-
mar [18], avoids locking by averaging the volumetric part of the strain tensor with
the value of the pressure. However, since the pressure term is part of the constitutive
model employed, the constitutive model is necessarily modified. As an alternative,
we implement a volumetric-strain average instead of a pressure average approach.
Meshfree Methods Applied to Consolidation Problems … 243
2 Mathematical Framework
In this section, we first summarise the governing equations for unconfined seep-
age problems, in particular, the Biot’s equations, formulated in a u − w framework,
which have been successfully utilised by López-Querol et al. [20, 21], Cividini and
Gioda [5]; next, the B-bar implementation in axisymmetric framework for elastic
and porous media are given in detail.
The Biot’s equations [22] are based on formulating the mechanical behaviour of a
solid-fluid mixture, the coupling between different phases, and the continuity of flux
through a differential domain of saturated porous media. For clarity, bold symbols
for notation of vectors and matrices, and regular letters to denote scalar variables, are
used. Let ρ and ρf respectively represent the mixture and fluid phase densities; b and
κ stand for the external acceleration vector and the permeability coefficient in [m3 ·
s/kg] (in civil engineering, however, the notion of hydraulic conductivity, k = κρf g, in
[m/s], is often used instead), the three equations of Biot, which represent the mixture
equilibrium, the fluid phase equilibrium and the continuity equation respectively, are
expressed as follows:
ST dσ − ρd ü − ρf d ẅ + ρdb = 0, (1)
ρf
−∇dpw − κ −1 d ẇ − ρf d ü − d ẅ + ρf db = 0, (2)
n
d ṗw
∇ · d ẇ + m · d ε̇ s + = 0, (3)
Q
244 P. Navas et al.
Additionally in Eq. (3), m represents the unit matrix expressed in Voigt form, whereas
Q stands for the mixture compressibility, which is calculated as
−1
Q = Ks−1 (1 − n) + nKf−1 , (5)
where Ks and Kf are the bulk modulus of the solid grains and the compressive modulus
of the fluid phase.
A 2D approach is considered in the derivations presented hereinafter, therefore
the differential operator, S, and the unit matrix m are written as
⎛∂ ⎞ ⎛ ⎞
∂x
0 1
∂
S = ⎝ 0 ∂y ⎠ , m = ⎝1⎠ . (6)
∂ ∂
∂y ∂x
0
Assuming tensile stresses (except pore pressure pw , which is positive for com-
pression) and strains as positive, the Terzaghi’s effective stress [23] is defined as
follows
σ = σ − pw m, (7)
where σ and σ are the respective vectorial form in Voigt notation for the effective
and total stress tensor. For the case of linear elasticity, the incremental relationship
between stresses and strains is governed by:
dσ = De dε s , (8)
where De denotes the elastic tensor, which under plane strain conditions, it is given
by: ⎛ ⎞
1−ν ν 0
λ
De = ⎝ ν 1 − ν 0 ⎠ (9)
ν 0 0 1−2ν2
Note that an isotropic medium is assumed in the above equations. The final system
of equations, once the elementary matrices have been assembled, can be expressed
as:
Kdu + Cd u̇ + Md ü = df , (13)
Voigt notation is assumed in order to obtain the final B-bar matrix. The process to
obtain B matrix in index notation is:
where ⎡ ⎤
1−ν ν ν 0
λ⎢ ν 1 − ν ν 0 ⎥
D= ⎢⎣
⎥. (18)
ν ν ν 1 − ν 0 ⎦
1−2ν
0 0 0 2
Stiffness matrix is calculated by taking into account that the volume integral is
extended around the whole ring of material as follows:
Kp = 2π BT DB r dr dz, (19)
where the superscript p represents the fact that the matrix is calculated for each
material point within a patch. The final expression is written as:
T
Kp = 2π B DB r A, (20)
The external forces in Eq. (13) are calculated in the same way:
2π rfr
f= , (21)
2π rfz
where fr and fz respectively denote radial and vertical components of the external
force.
If a B-Bar based algorithm is implemented in this problem, the starting point is
similar to the plane strain one, which is based on the transformation of the ε tensor
to the ε tensor, a tensor where the volumetric part is obtained as an average of the
neighbour integration points, as we can see in the following equation:
1 1 p
ε=ε− tr(ε)I + [tr(ε)] I, (22)
d d
p
where d is the dimension of the problem, in this case 3; and [tr(ε)] is the average
trace of ε of the neighbour integration points in a chosen patch, calculated by:
p
Nb
[tr(ε)] = tr[ε(i) ] wi . (23)
i=1
In addition, the trace could be obtained from the strain vector as:
⎡ ⎤
εr
⎢ εz ⎥
εx + εy = 1 1 1 0 ⎢ ⎥
⎣ εθ ⎦ = εkk = mk εk . (24)
γrz
where ∂N1
N1 ∂N1 ∂N2 N2 ∂N2
T= ∂r
+ r ∂z ∂r
+ r ∂z
··· . (26)
Thus, the final lth-component of the tensor ε (in Voigt form) for a single integration
point i is calculated in the same way as in 2D problems:
⎡ ⎛ ⎞⎤
1 Nb
(j)
ε (i)
l = ⎣Blk(i) − ml ⎝Tk(i) − [Tk w(j) ]⎠⎦ ukh
d j=1
(i)
= Blk ukh .
248 P. Navas et al.
d ṗw
∇ · d ẇ + mT d ε˙s + =0
Q
d ṗw = −Q [∇ · d ẇ + mT d ε˙s ]
σ = De ε s + Q [tr(εs ) + tr(ε w )] I.
If: ⎤ ⎡
εrs
⎢ s⎥
⎡ s ⎤ ⎢ εsz ⎥
ε ⎢ε ⎥
⎢ θs ⎥
ε=⎣ ⎦=⎢ ⎢γrz
⎥=Su
⎥
εw ⎢ εw ⎥
⎢ rw ⎥
⎣ εz ⎦
εθw
If the problem needs a B-Bar based algorithm, it is necessary to calculate the average
value of ε. Thus, the main equation yields:
1 1 1 1
ε=ε− tr(ε s )I + [tr(ε s )]p I − tr(ε w )I + [tr(ε w )]p I.
d d d d
In Voigt notation the equation, the lth-component of the strain tensor is:
⎛ ⎞
1⎝ Nb
(j)
Nb
[εkk w(j) ] mls − εkk [εkk w(j) ] mlw ⎠ ,
w(j)
εl = εl + −εkk mls + w
mlw +
d j=1 j=1
where
⎡ ⎤ ⎡ s⎤
εrs εr
⎢ εs ⎥ ⎢ εs ⎥
⎢ sz ⎥ ⎢ sz ⎥
⎢ε ⎥ ⎢ε ⎥
⎢ θ⎥ ⎢ θs ⎥
εkk
s
= mks εk = 1 1 1 0 0 0 0 ⎢ γ
⎢ rz
s⎥
⎥ , ε w
kk = m w
k kε = 0 0 0 0 1 1 1 ⎢γ ⎥ .
⎢ rz ⎥
⎢ εw ⎥ ⎢ εw ⎥
⎢ rw ⎥ ⎢ rw ⎥
⎣ εz ⎦ ⎣ εz ⎦
εθw εθw
where ∂N1
N1 ∂N1 ∂N2 N2 ∂N2
Ts = ∂r
+ r ∂z
00 ∂r
+ r ∂z
0 0 ··· ,
∂N1 N1 ∂N1 ∂N2 N2 ∂N2
Tw = 0 0 ∂r
+ r ∂z
00 ∂r
+ r ∂z
··· .
Thus, the final lth-component for the new strain tensor ε at a single integration point i
in Voigt notation is calculated as:
⎛ ⎞
1 s ⎝ s(i) h s(j) (j) h ⎠
Nb
εl (i)
= Blk(i) ukh
− ml Tk uk − [Tk w ]uk
d j=1
⎛ ⎞
1 Nb
− mlw ⎝Tkw(i) ukh − [Tk w(j) ]ukh ⎠
w(j)
d j=1
⎡ ⎛ ⎞
1 Nb
= ⎣Blk(i) − mls ⎝Tks(i) − [Tk w(j) ]⎠
s(j)
d j=1
⎛ ⎞⎤
1 Nb
− miw ⎝Tkw(i) − [Tk w(j) ]⎠⎦ ukh
w(j)
d j=1
≡ Blk ukh .
Meshfree Methods Applied to Consolidation Problems … 251
In this case, the analytical solution for this problem is available, and thus it is com-
pared with the solution proposed by the present method. Although the analytical
solution is presented in non-dimensional terms, the geometry of the problem carried
out is shown in Fig. 1. It consists of a column of 30 m of soil resting on an imperme-
able rigid base layer and loaded by a vertical, homogeneous loading at the top. The
lateral displacements are restricted for both the solid and fluid phase. At the base
layer, the solid phase is fixed, whereas the vertical movement of the fluid phase is
1m
Nodes
P P=P·e
Material Points
x
y (m)
y 5
HT=30 m
0
0 1
x (m)
Base layer, impermeable
Fig. 1 Geometry, loading condition of the consolidation column, and the discretised nodes and
material points (shown for the first 5 m only). The same geometry has been employed for both static
and dynamic simulations
252 P. Navas et al.
100 kPa
0.1 s t
prevented. The column is discretised into 240 nodes and 183 material points. The
external loading, in this case, is static at the end, but gradually applied as depicted
in Fig. 2. The behaviour of the consolidation is led by the vertical consolidation
coefficient, cv , which is function of the vertical permeability coefficient, kv :
kv (1 + e) kv kv
cv = = Em = (27)
ρw gav ρw g ρw gmv
In Eq. (27), Em is the oedometric modulus, related with the Young’s modulus E
according to the following equation:
2ν 2
E = Em 1− . (29)
1−ν
Typical values adopted for clays are 2 MPa for the Young’s modulus and 0.33 for
the Poisson’s ratio.
The basic equation for one-dimensional consolidation derived by Terzaghi in
1923 [23] is
∂ 2u ∂u
cv 2 = . (30)
∂z ∂t
The solution searched is a measure of the consolidation of the soil. It depends on the
vertical time factor, Tv , defined by:
cv t
Tv = . (31)
H2
Meshfree Methods Applied to Consolidation Problems … 253
ue (z) 2
m=∞ z
Uv (z) = 1 − =1− sin M 1 − exp(−M 2 Tv ) (32)
u0e m=0
M H
where π
M= (2m + 1), m = 0, 1, 2, . . . , ∞. (33)
2
In Fig. 3, it is given the comparison between the analytical and the numerical solution
along the depth of the column of soil for different values of Tv . As it is seen, all the
values for Tv are dimensionless. With this comparison, we consider the current model
employed is sufficiently validated.
This solution has been also compared with PLAXIS, in order to have an idea on the
accuracy of this commercial software, since it is going to be employed hereinafter
for several other theoretical examples. The direct conclusion obtained in Fig. 4 is
that the accuracy decreases at the final stages of the consolidation. For low values
of Tv PLAXIS solution of Uv along the column of soil is similar to the calculated
Present Research
1
Analytical solution
0.9
Tv=0.01
0.8
0.03
Tv=
0.7
5
0.0
0.6 Tv=
0.1
Tv= 0.2
Tv=
z/H
0.5
0.4
0.3 .5
Tv= =0
Tv
0.3
.7
Tv=
=0
1.0
Tv
0.9
Tv=
0.2
Tv=
0.1
0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Consolidation degree, Uv
Present Research
Analytical solution
1
PLAXIS solution
0.9
Tv=0.01
0.8
0.7
0.6
0.2
Tv=
z/H
0.5
0.4 .5
=0
Tv
0.3
0.2
0.1
0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Consolidation degree, Uv
Fig. 4 Solutions of Uv for several values of Tv obtained in the current work compared with the
analytical ones and those obtained with PLAXIS software
in this research, but is getting noticebly different for higher values of Tv . Therefore,
PLAXIS solutions will provide a good approach of the trend of the pressure along the
consolidation without an accurate degree of precision, at least, for one dimensional,
static problems.
The solution for the monotonic loading at the upper side of the column shows
the dissipation of the pore pressure along time. Figure 5 provides the comparison
between the solution obtained with the present methodology, the u − w solution
obtained with a quadratic FEM model, and the one calculated using the software
GeHoMadrid [25]. Hardly any perceptible difference can be noticed from these
three solutions.
In this example, using the same geometry for the soil column, the consolidation of a
soil column vertically subjected to harmonic pressure is obtained. This problem was
first analytically solved by Zienkiewicz et al. [3] in 1980s, and recently by López-
Meshfree Methods Applied to Consolidation Problems … 255
100
80
López-Querol (2006)
60
PW (kPa)
GeHo-Madrid
Present research
40
20
0 1 2 3 4
t (s)
Fig. 5 Pore pressure evolution solutions at the top of the consolidation column
Table 1 Material parameters employed for the dynamic consolidation problem of a soil column
G (MPa) ν (–) n (–) ρ (kg/m3 ) ρf (kg/m3 ) Kf (MPa) ω (rad/s) Ks (MPa)
312.5 0.2 0.333 3003 1000 103 3.379 1034
Querol [26], among others, through employing quadratic finite element method. The
parameters for the material are those presented in Table 1, where ω, is the applied
loading frequency. A periodic surface load with the amplitude of 100 kPa, a frequency
of 3.379 rad/s, is imposed. This dynamic load as well as all the material properties
in Table 1 are chosen to be the same as those of Zienkiewicz et al. [3].
The variation of the pore pressure with depth is illustrated for different values of
π1 , a dimensionless parameter defined as follows,
Vc2
π1 = k ρf , (34)
g ρ
ω HT2
where HT is the column height and Vc is the compressive wave velocity calculated
as:
Kf 1 2G(1 − ν)
Vc = D+ , D= , (35)
n ρ 1 − 2ν
where D the bulk modulus of the soil skeleton (dry mixture). Note that, for the
given material properties and loading frequency, π1 is proportional to the hydraulic
conductivity k. Once k (thus π1 ) is known, transient calculations can be carried out to
obtain the envelop of the pore pressure history for different points along the column
depth, or the isochrone.
256 P. Navas et al.
P/PT
0 0.5 1 1.5
0 10-2
10-1
Zienkiewicz et al. (1980)
Present research
100 López-Querol (2006)
H/HT 0.5 2
10
103 101
Fig. 6 Isochrones of the pressure in the whole column for different π1 values: comparison for
solutions taken from Zienkiewicz et al. [3], López-Querol [26] and those obtained in the present
research. The depth is normalised by the column height, HT , whereas the pore pressure is made
non-dimensional by PT , 100 kPa
After performing six different calculations for six different levels of π1 , from 10−2
to 103 , we obtain the isochrones of the pore pressure depicted in Fig. 6. Additionally
plotted are the results obtained by López-Querol [26] using quadratic finite elements
formulating the problem in displacements as well, along with the analytical solutions
provided by Zienkiewicz et al. [3]. It is noteworthy that the three different approaches
achieve quite similar isochrone maps; nevertheless, whenever more scattering is
observed, the current meshfree solution is closer to the analytical one.
The second problem carried out in this research is about radial consolidation. The
physical equation which governs this problem is different from the Terzaghi’s equa-
tion [23] shown in the previous section, i.e.:
∂ 2 ur 1 ∂ur ∂ur
ch + = , (36)
∂r 2 r ∂r ∂t
re
rw
Impervious layer
In Fig. 7, a scheme of drains with induced radial flux is shown, where r and z
directions are defined as depicted. There is an analytical solution for this problem
given by Barron in 1948 [27] who defined the radial consolidation degree as a function
of the non-dimensional time Tr :
2Tr
Ur (Tr ) = 1 − exp − , (38)
F(nr )
where
nr2 3nr2 − 1
F(nr ) = log(nr ) − . (39)
nr2 − 1 4nr2
where rw is the drain radius and re is the radius of influence for each type of problem.
In this case a quadrangular net of drains shown in the scheme of Fig. 8 is studied.
re
Drains
258 P. Navas et al.
0.2
0.3
n r=5
0.4
Ur
0.5
n r=15
Present research
0.6
Analytical solution
0.7
n r=10
0.8
00
0.9
Fig. 10 Comparison Tr
between PLAXIS and 0.01 0.1 1 10 100
present research solutions of 0
Ut = Ur + Uv along the 0.1
non-dimensional time Tr for
case A (permeable boundary) 0.2
Present research
and B (impervious boundary) 0.3
PLAXIS
0.4
0.5
Ut
A
0.6
0.7
B
0.8
0.9
1
Meshfree Methods Applied to Consolidation Problems … 259
A dynamic loading on the surface has been applied, aiming to simulate its effect on the
development of excess pore water pressure in the domain. The frequency of loading
is the same as for the case of the soil column, while the amplitude is 50 kN. The
geometry is the same as represented in Fig. 7. Vertical displacements of water in the
entire domain have been prevented. Figure 11 represents the evolution of pore water
pressure at the lateral, lowest corner, which clearly demonstrates the cyclic response
of this result as well. From this figure it can be concluded that the generation and
dissipation of excess pore water pressure are balanced in every cycle, and steady state
is achieved from the very beginning of the loading. Additionally, Fig. 12 represents
1.5
0.5
P/P0
−0.5
−1
−1.5
0 1 2 3 4 5 6 7 8 9 10
t (s)
Fig. 11 Evolution of normalised excess pore water pressure during external cyclic loading (for the
dynamic, radial consolidation problem)
Maximum Minimum
10 1 10 −0.8
9 0.98 9 −0.82
8 0.96 8 −0.84
7 0.94 7 −0.86
6 0.92 6 −0.88
z (m)
z (m)
5 0.9 5 −0.9
4 0.88 4 −0.92
3 0.86 3 −0.94
2 0.84 2 −0.96
1 0.82 1 −0.98
0 0.8 0 −1
0 1 2 3 4 5 0 1 2 3 4 5
p/p0 p/p0
r (m) r (m)
Fig. 12 Maximum and minimum normalised excess pore water pressures in dynamic, radial con-
solidation
260 P. Navas et al.
maximum and minimum values of excess pore water pressure in the entire domain
during the cyclic loading. This figure demonstrates the 2D nature of the problem, the
drain being clearly displayed on the left hand side of the domain. Higher excess pore
water pressures can be seen at the lower, left corner for both maximum and minimum
cases. Moreover, the figure demonstrates the alternate positive and negative values
for the pressures in the domain, as in Fig. 11.
The insertion of singular points inside the domain may vary the consolidation behav-
iour. The existence of a sink in the middle of horizontal soil layer is expected to
accelerate the consolidation of the porous media, since this means an output of water
at the permeable top boundary. To reproduce the sink, excess pore water pressure
is not allowed to develop in several nodes in the centre of the domain (see Fig. 13).
A square with one meter edge length is proposed for the study of this problem.
Material properties are given in Table 2. Figures 14 and 15 show the comparison of
results obtained with the present model as well as with PLAXIS for point (0, 0).
Figure 14 provides the evolution in time of the degree of consolidation, U, at one of
the corners at the bottom of the domain, whereas Fig. 15 represents the solutions in
the whole domain after two seconds. In spite of slight differences in the final part of
the evolution, it can be concluded that both results are fairly similar, demonstrating
the good performance of the present formulation for this kind of problems.
0.5 m.
1 m.
Drain
0.5 m.
Impervious
1 m.
Table 2 Material parameters employed for the consolidation problem of a soil with a sink
E (MPa) ν (–) n (–) ρ ρf Kf Ks k (m/s) ksink
(kg/m3 ) (kg/m3 ) (MPa) (MPa) (m/s)
100 0.0 0.333 3003 1000 103 1034 10−3 10
Meshfree Methods Applied to Consolidation Problems … 261
t (s)
0.1 1 10 100
Consolidation degree, U
0
0.2
0.4
Present research
0.6
PLAXIS
0.8
Fig. 14 Evolution of consolidation degree at the bottom corner for the sink problem. Present model
versus PLAXIS
0.4 0.4
0.3 0.3
0.3 0.3
0.2 0.2
0.2 0.2
0.1 0.1 0.1 0.1
0 0 0 0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
x x
Fig. 15 Field of consolidation degrees in the domain after 2 s. Present model versus PLAXIS
As for the case of radial consolidation, a dynamic simulation has been carried out. The
geometry is the same as in Fig. 13. Figure 16 represents the evolution of normalised
excess pore water pressure at a bottom corner, clearly showing the cyclic nature of
the solution, which is steady from the beginning. Moreover, Fig. 17 shows maximum
and minimum values in the whole domain. This figure clearly demonstrates the
location of the sink, with zero water pressures in the centre of the domain for both
cases. The alternate positive and negative values are also clear from the plot. Once
again, these results demonstrate the suitability of the present formulation for dynamic
consolidation problems in saturated soils.
262 P. Navas et al.
1.5
0.5
P/P0
−0.5
−1
−1.5
0 1 2 3 4 5 6 7 8 9 10
t (s)
Fig. 16 Evolution of normalised excess pore water pressure during external cyclic loading (dynamic
consolidation in a soil with a sink)
z/H
0.5 0 0.5 0
0.4 −0.2 0.4 −0.2
0.3 −0.4 0.3 −0.4
0.2 −0.6 0.2 −0.6
0.1 −0.8 0.1 −0.8
0 −1 0 −1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
x/L x/L
Fig. 17 Maximum and minimum normalised excess pore water pressures for dynamic soil consol-
idation with a sink
4 Conclusions
Acknowledgments This research has been partially funded by the Spanish Ministry of Economy
and Competitiveness through the projects BIA2012–31678 and MAT2012–35416. The first author
also acknowledges the financial support via the fellowship No. BES2013–063924.
References
18. Ortiz, A., Puso, M. A., & Sukumar, N. (2010). Maximum-entropy meshfree method for com-
pressible and near-incompressible elasticity. Computer Methods in Applied Mechanics and
Engineering, 199, 1859–1871.
19. Navas, P., López-Querol, S., Yu, R. C., & Li, B. (2016). B-bar based algorithm applied to
meshfree numerical schemes to solve unconfined seepage problems through porous media.
International Journal for Numerical and Analytical Methods in Geomechanics, 40, 962–984.
20. López-Querol, S., Navas, P., Peco, J., & Arias-Trujillo, J. (2011). Changing impermeabil-
ity boundary conditions to obtain free surfaces in unconfined seepage problems. Canadian
Geotechnical Journal, 48, 841–845.
21. Navas, P., & López-Querol, S. (2013). Generalized unconfined seepage flow model using
displacement based formulation. Engineering Geology, 166, 140–141.
22. Biot, M. A. (1956). Theory of propagation of elastic waves in a fluid-saturated porous solid. I.
Low-Frequency range. Journal of the Acoustical Society of America, 28(2), 168–178.
23. Terzaghi, K. V. (1925). Principles of Soil Mechanics. Engineering News-Record, 95, 19–27.
24. Zienkiewicz, O. C., & Taylor, R. L. (1994). El método de los elementos finitos. Vol 1: Formu-
lación básica y problemas lineales. Barcelona: CIMNE.
25. Fernández Merodo, J. A., Mira, P., Pastor, M., & Li, T. (1999). GeHoMadrid User Manual.
Madrid: CEDEX. Technical Report.
26. López-Querol, S. (2006). Modelización geomecánica de los procesos de densificación, licue-
facción y movilidad cíclica de suelos granulares sometidos a solicitaciones dinámicas. PhD
thesis, University of Castilla-La Mancha, Ciudad Real, Spain.
27. Barron, R. A. (1948). Consolidation of fine-grained soils by drain wells. Transationc ASCE,
113, 718–754.
A Multiscale Microstructural Model
of Permeability in Fractured Solids
Permeability is the property of materials that measures the ability for fluids (gas or
liquid) to flow through a porous solid material, essentially related to the void topology
and not to the properties of the fluid.
The void space of a porous medium, where liquid or gaseous fluids are allowed
to flow, can be thought as composed of a spatial network of interconnected random
passages, channels or tubes of varying length, cross section and orientation, and
junctions, where channels meet [1]. Considering an elementary volume with a suffi-
ciently large number of channels, experimentally it is observed that the network of
channels produces average gradients of pressure, density, viscosity and solute con-
p v2
h= +z+ , (2)
ρg 2g
where p is the gauge pressure, ρ is the density of the fluid, and g is the gravitational
acceleration. The first term, p/ρg, is called pressure head, the second term, z, is called
elevation head, and the third term, v2 /2g, is called velocity head. The pressure head is
the equivalent gauge pressure of a column of water at the base of the piezometer. The
elevation head is the relative potential energy in terms of an elevation. The velocity
head is related to the kinetic energy of the fluid, and in most applications, where
the fluid velocity is very small, is disregarded. In the following we will discard the
kinetic contribution.
Fluid flow across packed porous media is in general characterized by laminar
regime (Reynolds number Re ≤ 1) and a drop in the hydraulic head. The hydraulic
gradient ∇h measures the change of the hydraulic head field and is related to the
direction of the flow in the porous medium. In a cartesian orthogonal reference system
it is defined as
∂h ∂h ∂h
∇h = e1 + e2 + e3 . (3)
∂x1 ∂x2 ∂x3
Analytical models of fluid flow in rocks use constitutive relations that link the average
velocity of the fluid across the medium to the pressure drop. In particular, Darcy’s
law states that the discharge rate in a porous media is proportional to the hydraulic
head gradient and inversely proportional to the fluid viscosity
A Multiscale Microstructural Model of Permeability … 267
ρg
q = −κ ∇h, (4)
μ
where q is the discharge rate and μ the fluid viscosity, and κ the medium permeability.
In anisotropic media the permeability is described by a symmetric and positive
definite second order tensor. Symmetry of κ is the consequence of the Onsager recip-
rocal relations, while positive definiteness follows from the fact that a fluid cannot
flow against the pressure drop. These two properties render the tensor diagonaliz-
able. Eigenvalues of the permeability tensor represent the principal permeabilities,
and the corresponding eigenvectors define the principal directions of flow, i.e., the
directions where flow is parallel to the pressure drop.
where La is the average length of the channels and L the macroscopic length of the
flow path. Relation (5) incorporates a characteristic microstructural length parameter
similar to that used in other analyses of permeability [5].
The estimation of the shape coefficients appearing in the equation has been pro-
moting an active research [6]. The specific internal surface area has been evaluated
with several methods, for example by combining the contributions of plastic clayey
and granular silty-sandy fractions [7], or scanning electron microscope images [8].
Simplified and more tractable tortuosity models have been introduced, for example
considering pore channels of variable shape but constant cross-section [9] or by intro-
ducing alternative definitions of tortuosity [10]. By using the analog of networks of
electrical resistors, τ can be linked empirically to the electrical conductivity of rocks
and the brine (salt solution) saturation through a quantitative relation due to Archie
[11]. Starting from Archie’s law, permeability models have been able to include the
pore connectedness in the correlation between permeability and local electric field
[12–15].
By describing the connected pore space as a bundle of tortuous leaky hydraulic
tubes, alternative permeability models have been proposed by Civan [16, 17]
β
n
k=γn , (7)
1−n
where the parameter γ and the fractal exponent β have been originally determined
by experimental data fitting, and successively derived analytically, showing how the
two parameters γ and β can be linked to physically meaningful parameters of the
porous media.
The scalar nature of variables and parameters used in this class of models leads
to scalar definitions, and the correct tensor nature of the permeability is disregarded.
Therefore, such models are not meaningful if applied to soils characterized by the
presence of sedimentation layers or fissures. The complexity of the relationship
between the permeability tensor and a scalar property such as the porosity in rocks
has been clearly pointed out [18]. In particular, permeability depends not only on the
actual stress and on the strain during the loading history, but also on the evolution of
the crack patterns, which is anisotropic in nature. The Kozeny-Carman permeability
models are primarily intended for applications to static porous materials, whose
effective or conductive pore structure and properties remain unchanged during fluid
flow. Hence, these models do not allow for the modification of the porous medium
microstructure due to fluid-porous matrix interactions, or by the presence of a variable
confining pressure.
A Multiscale Microstructural Model of Permeability … 269
Fractures and faults represent the most ubiquitous and efficient ways for flows in
natural rock formations. The availability of fault and fracture mappings in reser-
voirs is an important recent achievement in geology, but the understanding of the
influence of these structures on fluid flows is still far from being satisfactory, in
particular when the mechanical coupling is significant. Difficulties emerge from the
complexity of the topology and geometry of faults. Each group or class of faults is
characterized by orientation, spacing, distribution, and connectivity, which affect the
entrapment of fluids, limiting or advantaging the migration and flow of fluids in a
given environment [24].
Clearly, the complexity of natural fracture networks is associated to the stress state
history, which is hardly known. Furthermore, cracks and fracture can evolve due to
the action of gravity, superposed localized pressures, and shear tractions resulting
from the viscosity of the flowing fluids.
The observation that intact rocks contain distributed flaws and cracks, arranged
between particles of various shape, has motivated the use of fracture mechanics to
study their organization and the conditions that promote their growth [25]. The most
widely used fracture mechanics models to describe the progressive microfracturing
of rocks upon increasing loading are the open crack and the sliding crack models.
However, standard approaches of fracture mechanics have limiting drawbacks related
to the explicit treatment of cracking.
270 A. Pandolfi et al.
F = Fm Ff . (8)
The deformation gradient Ff can be easily linked to the fault kinematic activity.
Consider a material vector dX, shorter than the system size but longer than the
internal scale L, that spans two material points P and Q in the material configuration.
The number of faults m traversed by the vector is
272 A. Pandolfi et al.
Fig. 1 Inelastic kinematics of the fault system. The opening displacement Δ applied to all the
faults at distance L leads to a deformed configuration characterized by the inelastic deformation
gradient Ff . a Reference configuration. b Spatial configuration
1
m= dX · N.
L
Let us now apply an opening displacement Δ to each fault, Fig. 1a. In the spatial
configuration the two points P and Q are joined by the vector dx given by
1 1
Δ = dX +
dx = dX + mΔ (dX · N) Δ = (I + Δ ⊗ N) dX,
L L
where we set
1
Ff ≡ I + Δ ⊗ N. (9)
L
Note that, once N and L are supplied, Ff and Δ are in one-to-one correspondence,
and the opening displacements Δ follow from Ff through the relation
Δ = L (Ff − I) N. (10)
Note that the matrix may, in turn, accommodate a second fault family:
1 2
F = Fm1 Ff , Fm1 = Fm2 Ff . (11)
This decomposition can be applied recursively for as many levels as necessary, assign-
ing the innermost level a purely elastic behavior
k k−1 2 1
F = Fe Ff Ff . . . Ff Ff . (12)
1
A(Fm , Δ , q) = W m (Fm ) + Δ, q),
Φ(Δ (13)
L
A Multiscale Microstructural Model of Permeability … 273
where W m is the strain-energy density per unit volume of the matrix, Φ is the cohe-
sive energy per unit fault surface, suitably divided by the scale length L to result in a
specific energy per unit of volume, and q is an internal variable describing the max-
imum opening displacement ever experienced by the faults. Note that the separation
of the variables excludes strong coupling between the two energies. The particular
form of the energy densities W m and Φ can be selected with a certain degree of
freedom according to the particular material considered.
In keeping with the irreversible nature of fracture, decohered faults permanently
damage the material. This can be expressed by the internal variable q evolution
constraint q̇ ≥ 0. Another important constraint is the impenetrability of the faults
surfaces upon closure, i.e., the component of the opening displacement along the
fault normal cannot be negative, thus N · Δ ≥ 0.
The behavior of irreversible materials can be characterized variationally by
recourse to time discretization [33, 36], that requires to consider a process of defor-
mation at distinct successive times t0 , . . . , tn+1 = tn + Δt, . . . . We assume that the
state of the material at time tn is known and the deformation Fn+1 at time tn+1 is
assigned. The problem is to determine the state of the material at time tn+1 . We
define an effective, incremental, strain-energy density taking the infimum of the
constrained energy with respect to Δ n+1 and qn+1 , as
The subindex n used in Wn signifies the dependence on the initial state. The irre-
versibility and the impenetrability constraints render the effective strain-energy den-
sity Wn dependent on the initial conditions at time tn , and account for all the inelastic
behaviors, such as irreversibility, hysteresis, and path dependency. Thus, Wn (Fn+1 )
acts as a potential for the first Piola-Kirchhoff stress tensor Pn+1 at time tn+1 [36],
i.e., as
∂ Wn (Fn+1 )
Pn+1 = . (15)
∂Fn+1
The stable equilibrium configurations are the minimizers of the corresponding effec-
tive energy. The fault geometrical features N and L, until now considered as known,
can be determined with the aid of the time-discretized variational formulation.
Energy optimization will ascertain whether the insertion of faults is energetically
favorable, and the optimal orientation of the faults. For a given deformation Fn+1 ,
we test two end states of the material, one with faults and another without faults,
and choose the end state which results in the lowest incremental energy density
Wn (Fn+1 ). The orientation of the faults N and the remaining state variables are
obtained variationally from the extended minimum problem:
274 A. Pandolfi et al.
The actual orientation of the faults is defined by the surrounding stress state. In
particular, faults can be originated under tensile stress if the maximum tensile stress
reaches the tensile resistance of the material. Under compressive stress, it is likely
that the material would fail in shear.
The length L can be computed variationally by accounting for the misfit energy
Δ, L) contained in the boundary layers that forms where the faults meet a
E mis (Δ
confining boundary. In fact, the compatibility between the faults and their container
is only on average, and this gives rise to boundary layers that penetrate into the
faulted region to a certain depth. The addition to the energy furnishes a selection
mechanism among all possible microstructures leading to the relaxed energy [33].
The total free-energy density of the faulted region becomes
The optimal fault separation is determined by two competing demands. On one hand,
the cohesive energy favors a large value of L resulting in fewer faults per unit volume.
On the other hand, the misfit energy favors a small value of L resulting in a narrow
boundary layer.
Internal friction is an important dissipation mechanism in brittle materials, espe-
cially in geological applications. We assume that friction operates at the faults con-
currently with cohesion, but if faults loose cohesion completely upon the attainment
of a critical opening displacement, friction may become the sole dissipation mech-
anism at the faults. In order to retain the variational structure of the model, we
define an incremental strain energy density Wn (Fn+1 ) that includes a dual kinetic
potential, attending the frictional dissipation, with suitable convexity and regularity
properties [37].
So far we have consider either an intact material or a single family of parallel
faults. The material with a single fault family is referred to as rank-1 faulting pattern
material. More complex microstructures, can effectively be generated by applying
the previous construction recursively. In the first level of recursion, we simply replace
A Multiscale Microstructural Model of Permeability … 275
the elastic strain-energy density W (Fm ) of the matrix by Wn (Fm ), i.e., by the effec-
tive strain-energy density of a rank-1 faulting pattern. This substitution can now be
iterated, resulting in a recursive definition of Wn (Fn+1 ). The recursion stops when the
matrix between the faults remains elastic. The resulting microstructures are shown
in Fig. 1a, and consist of faults within faults. The level of recursion is the rank of the
microstructure.
According to the particular loading history, at the time tn and at the generic point
P the material is characterized by a particular opening displacement Δ , which will
be respectful of the equilibrium and compatibility conditions. The model is therefore
able to account for a variable opening of the faults.
Under the assumption of a perfectly impermeable matrix and considering the pres-
ence of a single fault family, the permeability tensor for the material fractured brittle
damage model can be directly derived from the particular geometry of faults. The
permeability of a particular geometry of parallel and equidistant faults has been exam-
ined by Irmay [38]. Snow [39, 40] and Parsons [41] obtained analytical expressions
for the anisotropic permeability, similar to the one described here, by considering
networks of parallel fissures.
According to Fig. 2, the opening displacement Δ decomposes into a normal com-
ponent ΔN and a sliding component ΔT computed as
ΔN = N · Δ , Δ T = (I − N ⊗ N) Δ , ΔT |,
ΔT = |Δ (19)
The fluid fills layers of variable thickness ΔN and the average fluid flow will take
place in the direction of the layer.
We begin by considering the solution of the Navier-Stokes’ equation for average
velocity vs along the direction s in a fault of constant width ΔN :
Δ2N ρg ∂h
vs = , (20)
12 μ ∂s
where ∂h/∂s is the hydraulic head gradient in the direction s. By considering a porous
medium where pores are in the form of parallel faults, the discharge qs in the direction
of the flow is
ΔN Δ2N ρg ∂h
qs = n vs = , (21)
L 12 μ ∂s
where
ΔN
n= (22)
L
can be intended as a measure of the local porosity. The permeability in the direction
of the fault plane becomes
ΔN Δ2N
ks = . (23)
L 12
Now we restate the above equations in vector form. By introducing the unit vector d
in the direction of the fluid flow
∂x1 ∂x2 ∂x3
d= e1 + e2 + e3 (24)
∂s ∂s ∂s
and using Eq. (3), the directional gradient can be expressed as
Δ2N ρg
vs = ∇h · d, (26)
12 μ
Δ2N ρg
vs = (∇h · d) d. (27)
12 μ
ΔN Δ2N ρg
qs = n vs = d⊗d ∇h, (28)
L 12 μ
ΔN Δ2N
kf = d ⊗ d. (29)
L 12
A Multiscale Microstructural Model of Permeability … 277
To account for a generic direction of the flow in the layer of normal N, in (29) the
tensor d ⊗ d must be replaced by the projection (I − N ⊗ N), leading to
ΔN Δ2N
kf = (I − N ⊗ N) . (30)
L 12
If Q fault families are present in the porous medium, each of which character-
ized by a normal NK , a separation L K and a normal opening displacement ΔKN , the
equivalent permeability is given by the sum of the corresponding permeabilities:
Q 2
ΔKN ΔKN
kf = K
I − NK ⊗ NK . (31)
K=1
L 12
4 Examples
We want to study the response of the brittle damage model to the action of external
loadings and to analyze the correspondent variation of the permeability. For the sake
of simplicity, we assume that the permeability of the intact matrix is null, thus the
permeability will be exclusively related to the formations of the faults.
We specialize the strain energy density W to a standard neo-Hookean material
extended to the compressible range, i.e.,
1 1
W (Fm ) = λ log2 J m + μ (FmT Fm ) : I − 3 − 2 log J m , (33)
2 2
where λ and μ are the Lamé coefficients, and J m = det Fm is the determinant of
Fm . Following [34], the cohesive energy on a fault with orientation N is assumed to
depend on an effective opening displacement Δ defined as
Δ= Δ · N)2 + β 2 |Δ
(1 − β 2 ) (Δ Δ |2 , (34)
Tc Δ (1 − 0.5 Δ/Δc ) if Δ ≤ Δc
Δ, q) = Φ(Δ, q) =
Φ(Δ , (35)
Gc = 0.5 Tc Δc otherwise
where Gc is the critical energy release rate of the material, Tc is the tensile resis-
tance and Δc the maximum opening displacement associated to the presence of
cohesive tractions, see Fig. 3. Irreversibility is enforced by recording the maximum
ever attained effective opening displacement q = Δmax , and assuming unloading and
reloading to/from the origin, see Fig. 3, with the kinetic equation
Δ̇ if Δ = q and Δ̇ ≥ 0,
q̇ = (36)
0 otherwise.
∂Φ T
T= = Δ · N) N + β 2Δ ,
1 − β 2 (Δ (37)
Δ
∂Δ Δ
where
∂Φ
T= = 1 − β −2 (T · N)2 + β −2 |T|2 . (38)
∂Δ
The material is characterized by the constants reported in Table 1. We assume an
intact material, with no pre-existent or natural faults, and limit our attention to the
constitutive response. We feed the material with an assigned deformation gradient,
whose significant components grow according to a prescribed history. The material is
allowed to form up to three families of faults, which cannot have the same orientation
NK .
(a) (b)
12 9E-08
10
K11
K22
8 6E-08 K33
4 3E-08
0 0
0 0.01 0.02 0.03 0 0.01 0.02 0.03
Fig. 4 a Uniaxial response σ3 versus ε3 of the material, stress in MPa. The drop in stiffness follows
the linearly decreasing law of the cohesive layers. b Variation of the material permeability, expressed
in m2 , in direction e1 , e2 , and e3 as a function of the deformation ε3 related to the opening of the
faults. After the formation of the faults, the matrix is unstressed
Compression
0 Extension
2E-08 Reclosure
-20
1E-08
-40 Compression
Extension
Reclosure
-60 0
-0.008 -0.004 0 0.004 0.008 0.012 -0.008 -0.004 0 0.004 0.008 0.012
(c) (d)
3E-08 3E-08
Compression Compression
Extension Extension
2E-08 Reclosure 2E-08 Reclosure
1E-08 1E-08
0 0
-0.008 -0.004 0 0.004 0.008 0.012 -0.008 -0.004 0 0.004 0.008 0.012
Fig. 5 Multi-stage multiaxial response of the material, undergoing an isotropic compression, fol-
lowed by an isotropic extension, and by a final anisotropic compression. a Stress σ1 (MPa) versus
strain ε1 . Although the material fails, generating three nested families of faults, the material pre-
serves its ability of sustaining load. b Permeability (m2 ) in direction e1 as a function of the strain.
c Permeability (m2 ) in direction e2 as a function of the strain. d Permeability (m2 ) in direction e3
as a function of the strain
reaches the strength of the material and causes triple tensile failure (open circles);
note that the three stress components show the behavior observed in the uniaxial
loading, cf. Fig. 4a. The final compressive stretch is characterized by a null stress
until faults close completely. Afterwards, the contact algorithm provides the com-
pressive tractions that guarantee the equilibrium of the system (grey circles). The
resulting reduction of the stiffness of the material due to the damage is remarkable.
Figures 5b–d show the permeability in direction e1 , e2 , and e3 , respectively. The
permeability is null until the material fails (black circles). Then the permeability
reaches a maximum corresponding to the maximum extension imposed to the mate-
rial (open circles). The different values of the maximum permeability for the three
directions is the combined result of the different spacing of the fault families and
of the stress anisotropy deriving from the formation of faults. Upon faults closure,
permeability decreases until it goes to zero (gray circles). Note that the anisotropy
A Multiscale Microstructural Model of Permeability … 281
The brittle damage model hereby introduced may be used in applications where
porous brittle materials experience a stress history leading to the formation of dam-
aged zones characterized by micro or macro cracks. In particular, the model has the
potential to be successfully employed in numerical simulations of localized stimu-
lation of oil or gas reservoirs by means of hydraulic fracture.
Brittle materials are sensitive and fail under tensile stresses or under non-isotropic
compressive stresses characterized by high shear components. The model here illus-
trated accounts for the progressive damage that can occur in brittle porous materials
failing under tensile or shear stress. As far as deep rock formations are concerned,
while the shear critical case can occur in natural situations under non-isotropic stress
states, the tensile critical state is more likely to happen under the action of a highly
pressurized fluid acting on a localized area, which changes the compressive stress
state in a tensile stress.
The mechanical features of the damage model have been described widely in the
original paper [33]. More importantly, unlikely standard damage theory models, the
brittle damage model is based on cohesive theories of fracture, therefore it describes
the formation of cohesive surfaces within the medium in the correct physical way.
Moreover, the model is characterized by multiple scales, represented by the L K
spacings between faults in each set, and is able to describe the microstructures that
can be observed in layered geologic media. The model is characterized variationally,
by introducing a generalized free energy density, so that stress and material tangent
stiffness are obtained analytically, and preserve a symmetric structure that makes
them appealing in numerical applications. Furthermore, inclusion of friction and
other forms of dissipative phenomena is easily achieved through the expedient of
introducing dual dissipation potential in the free energy density.
The porous version of the damage model retains the same mechanical features as
the original model. Thus, the microstructures resulting from the damage evolution
can be characterized with a permeability tensor, which is of fundamental importance
for the evaluation of the permeation and of the flow of fluids within the porous
material.
282 A. Pandolfi et al.
References
1. Bear, J. (1972). Dynamics of fluids in porous media. New York: American Elsevier Publishing
Company Inc.
2. Kozeny, J. (1927). Über kapillare leitung des wassers im boden. Sitzungsberichte der Kaiser-
lichen Akademie der Wissenschaften, 136(2a), 36.
3. Carman, P. C. (1937). Fluid flow through granular beds. Transactions, Institution of Chemical
Engineers, 15, 17.
4. Carman, P. C. (1956). Flow of gases through porous media. London.
5. Blair, S. C., Berge, P. A., & Berryman, J. G. (1996). Using two-point correlation functions
to characterize microgeometry and estimate permeabilities of sandstones and porous glass.
Journal of Geophysical Research - Solid Earth, 101, 20359–20375.
6. Göktepe, A. B., & Sezer, A. (2010). Effect of particle shape on density and permeability
of sands. Proceedings of the Institution of Civil Engineers: Geotechnical Engineering, 163,
307–320.
7. Sanzeni, A., Colleselli, F., & Grazioli, D. (2013). Specific surface and hydraulic conductivity
of fine-grained soils. Journal of Geotechnical and Geoenvironmental Engineering, 139, 1828–
1832.
8. Schaap, M. G., & Lebron, I. (2001). Using microscope observations of thin sections to estimate
soil permeability with the Kozeny-Carman equation. Journal of Hydrology, 251, 186–201.
9. Lasowska, A., Cies̀licki, K., & Smolarski, A. Z. (2000). The influence of channel tortuousity
on hydraulic resistance. Proceedings of the Institution of Civil Engineers: Geotechnical Engi-
neering, 48, 161–173.
10. Saripalli, K. P., Serne, R. J., Meyer, P. D., & McGrail, B. P. (2002). Prediction of diffusion
coefficients in porous media using tortuosity factors based on interfacial areas. Ground Water,
40, 346–352.
11. Archie, G. E. (2003). The electrical resistivity log as an aid in determining some reservoir
characteristicsn. In SPE reprint series (pp. 9–16).
12. Katz, A. J., & Thompson, A. H. (1987). Prediction of rock electrical conductivity from mercury
injection measurements. Journal of Geophysical Research, 92, 599–607.
13. Schwartz, L. M., Sen, P. N., & Johnson, D. L. (1989). Novel geometrical effects in electrolytic
conduction in porous media. Physica A: Statistical Mechanics and its Applications, 157, 493–
496.
14. Revil, A., & Cathles, L. M. (1999). Permeability of shaly sand. Physica A: Statistical Mechanics
and its Applications, 35, 651–662.
15. Cosentini, R. M., Della Vecchia, G., Foti, S., & Musso, G. (2012). Estimation of the hydraulic
parameters of unsaturated samples by electrical resistivity tomography. Géotechnique, 62(7),
583–594.
16. Civan, F. (2001). Scale effect on porosity and permeability: Kinetics, model, and correlation.
AIChE Journal, 47, 271–287.
17. Civan, F. (2002). Relating permeability to pore connectivity using a power-law flow unit equa-
tion. Petrophysics, 43, 457–476.
18. Schulze, O., Popp, T., & Kern, H. (2001). Development of damage and permeability in deform-
ing rock salt. Engineering Geology, 61(2), 163–180.
19. Mishra Parker, J. C., & Singhal, S. N. (1989). Estimation of soil hydraulic properties and their
uncertainty from particle size distribution data. Journal of Hydrology, 108, 1–18.
20. Pachepsky, Y. A., Timlin, D. J., & Ahuja, L. R. (1999). Estimating saturated soil hydraulic
conductivity using water retention data and neural networks. Soil Science, 164, 552–560.
21. Latief, F. D. E., & Fauzi, U. (2012). Kozeny-Carman and empirical formula for the permeability
of computer rock models. International Journal of Rock Mechanics and Mining Sciences, 50,
117–123.
22. Dardis, O., & McCloskey, J. (1998). Permeability porosity relationships from numerical sim-
ulations of fluid flow. Geophysical Research Letters, 25(9), 1471–1474.
A Multiscale Microstructural Model of Permeability … 283
23. Bird, M. B., Butler, S. L., Hawkes, C. D., & Kotzer, T. (2014). Numerical modeling of fluid
and electrical currents through geometries based on synchrotron X-ray tomographic images of
reservoir rocks using Avizo and COMSOL. Computers and Geosciences, 73, 6–16.
24. Aydin, A. (2000). Fractures, faults, and hydrocarbon entrapment, migration and flow. Marine
and Petroleum Geology, 17(7), 797–814.
25. Yuan, S. C., & Harrison, J. P. (2006). A review of the state of the art in modelling progressive
mechanical breakdown and associated fluid flow in intact heterogeneous rocks. International
Journal of Rock Mechanics and Mining Sciences, 43(7), 1001–1022.
26. Oda, M. (1985). Permeability tensor for discontinuous rock masses. Geotechnique, 35(4),
483–495.
27. Tang, C. A., Tham, L. G., Lee, P. K. K., Yang, T. H., & Li, L. C. (2002). Coupled analysis
of flow, stress and damage (fsd) in rock failure. International Journal of Rock Mechanics and
Mining Sciences, 39(4), 477–489.
28. Arson, C., & Gatmiri, B. (2008). On damage modelling in unsaturated clay rocks. Physics and
Chemistry of the Earth, Parts A/B/C, 33, S407–S415.
29. Arson, C., & Pereira, J.-M. (2013). Influence of damage on pore size distribution and perme-
ability of rocks. 37(8), 810–831.
30. Yuan, S. C., & Harrison, J. P. (2005). Development of a hydro-mechanical local degradation
approach and its application to modelling fluid flow during progressive fracturing of heteroge-
neous rocks. International Journal of Rock Mechanics and Mining Sciences, 42(7), 961–984.
31. Shao, J.-F., Zhou, H., & Chau, K. T. (2005). Coupling between anisotropic damage and perme-
ability variation in brittle rocks. International Journal for Numerical and Analytical Methods
in Geomechanics, 29(12), 1231–1247.
32. Maleki, K., & Pouya, A. (2010). Numerical simulation of damage- permeability relationship
in brittle geomaterials. Computers and Geosciences, 37(5), 619–628.
33. Pandolfi, A., Conti, S., & Ortiz, M. (2006). A recursive-faulting model of distributed damage
in confined brittle materials. Journal of Mechanics and Physics of Solids, 54, 1972–2003.
34. Ortiz, M., & Pandolfi, A. (1999). Finite-deformation irreversible cohesive elements for three-
dimensional crack propagation analysis. International Journal for Numerical Methods in Engi-
neering, 44, 1267–1282.
35. Pandolfi, A., & Ortiz, M. (2002). An efficient adaptive procedure for three-dimensional frag-
mentation simulations. Engineering with Computers, 18(2), 148–159.
36. Ortiz, M., & Stainier, L. (1999). The variational formulation of viscoplastic constitutive updates.
Computer Methods in Applied Mechanics and Engineering, 171, 419–444.
37. Pandolfi, A., Kane, C., Marsden, J. E., & Ortiz, M. (2002). Time-discretized variational for-
mulation of non- smooth frictional contact. International Journal for Numerical Methods in
Engineering, 53(8), 1801–1829.
38. Irmay, S. (1937). Flow of liquid through cracked media. Bulletin of the Research Council of
Istrael, 1(5A), 84.
39. Snow, D. T. (1965). A parallel plate model of fractured permeable media. PhD thesis, California:
University of California at Berkeley.
40. Snow, D. T. (1969). Anisotropic permeability of fractured media. Water Resources Research,
5(6), 1263–1279.
41. Parsons, R. C. (1966). Permeability of idealized fractured rocks. Journal of the Society of
Petroleum Engineers, 6, 126–136.
Thermal Diffusion in a Polymer Blend
1 Introduction
Composed materials, such as metallic alloys, solid solutions and multiphase plastics,
play an increasing role in industrial design. In particular, polymer blends became
a matter of interest in recent years. In order to combine beneficial characteristics
of single polymers, specific multiphase blends are composed. Such mixtures are
subjected to a great variety of microstructural changes such as separation of phases
and coarsening processes, cf. [6, 9, 13, 17, 22, 24, 25].
When a multicomponent system at a critical composition is quenched from an
initially homogeneous state, usually at high temperature, the blend destabilizes and
becomes susceptible to external perturbations. At a critical point the mixture decom-
poses into two phases of different composition—an effect which is known from
metallic alloys and corresponds to two minima in the free energy function, see
Fig. 1. Polymer blends with a close-to-critical-point composition are very suscep-
tible to external fields such as convective, strain, electric, thermal or just random
fields. Such driven systems have been studied and simulated by the authors in [1, 3,
4]. External fields allow to control the microstructure evolution in the mixture.
In this text we will focus on the effect of thermal diffusion (Ludwig-Soret effect)
induced by non-uniform temperature gradients on the microstructural evolution in
a blend consisting of polydimethylsiloxane (PDMS) and polyethyl-methylsiloxane
(PEMS). Despite of a wide range of applications, comparatively little studies have
been performed for the phase behavior of such polymer blends subjected to non-
uniform temperature fields. Lee et al. worked on spinodal decomposition in the
presence of local temperature gradients in [19–21] and Köhler et al. published out-
standing studies on periodically driven and thermally patterned polymer mixtures in
[11, 25, 26]. The experimental setup to perform thermal patterning experiments is
explained in detail by Voit in [24].
Motivated by these publications we will present here a robust numerical scheme
to approximate the coupled heat-diffusion equations at hand by means of B-spline
based finite element analyses. To do so, we first derive the required thermal dif-
fusion equation and the resulting extended Cahn-Hilliard phase-field model. Then,
the numerical discretization where we use B-splines to exactly fulfill the continu-
ity requirements of the problem is presented. Finally, two- and three-dimensional
simulations of a thermal patterning experiment in a polymer blend are illustrated.
of the classical TIP is the local equilibrium hypothesis, where thermodynamic state
variables in non-equilibrium states are considered to be the same as in equilibrium.
In the thermodynamic community it is still under debate that other variables, not
found at equilibrium, are able to influence non-equilibrium processes. For detailed
remarks on limitations of classical TIP the reader is referred to [18, pp. 63–65] or [8].
For thermal diffusion in solids and liquids, however, we have no reason to assume
additional state variables and so the TIP framework is fully acceptable.
At equilibrium the entropy per unit mass s is a well-defined function depending
on state variables such as the internal energy u, the specific volume ν and the mass
fractions ck of a multicomponent system with n components. In such a situation the
total differential of s in equilibrium is given by the common Gibbs relation
1
n
ds = du + pdν − μk dck (1)
T k=1
2.1 Balances
We follow the local equilibrium hypothesis and make use of the substantial time
derivative d (•) /dt = ∂ (•) /∂t + v · ∇ (•) with the barycentric velocity field v and
the spatial gradient ∇ to rewrite the balance (1) in the form
dν dck
n
ds du
T = +p − μk . (2)
dt dt dt k=1
dt
We see here that Eq. (2) already includes the mass balance of the kth component by
dck /dt and the energy balance by du/dt and, thus, compliance with the basic laws
of thermodynamics is inherently guaranteed. Introducing the total mass density of
our multicomponent system by
n
ρk
ρ= ρk with ck = , (3)
k=1
ρ
dck r
ρ = − div Jk + vk j k̂ j , (k = 1, 2, . . . , n) , (4)
dt j=1
288 K. Weinberg et al.
n
vk j = 0, ( j = 1, 2, . . . , r ) . (5)
k=1
Now we need to deduce an appropriate formulation for the rate of the specific internal
energy du/dt implying the conservation of the total specific energy. To this end we
start with the equation of motion
dv n
ρ = − div P + ρk Fk , (6)
dt k=1
∂ψk
Fk = −∇ψk , = 0. (7)
∂t
Multiplying Eq. (6) by the barycentric velocity field v and making use of the relation
div (Pv) = div P · v + P : ∇v, we obtain the balance equation for the kinetic
energy
d 21 v2 n
ρ = − div (Pv) + P : ∇v + ρk Fk · v. (8)
dt k=1
Accordingly, from the local form of the continuity mass equation dρ/dt = −ρ div v
we get for an arbitrary scalar field Φ the identity
dΦ ∂ (ρΦ)
ρ = + div (ρΦv) , (9)
dt ∂t
which is now used for a reformulation of Eq. (8) into
n
∂ 21 ρv2 1 2
= − div ρv · v + Pv + P : ∇v + ρk Fk · v. (10)
∂t 2 k=1
Thermal Diffusion in a Polymer Blend 289
In the next step we reformulate Eq. (4) by means of identity (9) into
∂ρk r
= − div (Jk + ρk v) + vk j k̂ j , (k = 1, 2, . . . , n) . (11)
∂t j=1
By combining Eqs. (11) and (7) with the differential formulae (see [16])
establish an equation for the rate of change of the potential energy density ρψ ≡
we
k ρk ψk which is of the form
⎛ ⎞
∂ρψ
n
n
n
n
r
= − div ⎝ρψv + ψk Jk ⎠ − ρk F k · v − J k · Fk + ψk vk j k̂ j .
∂t
k=1 k=1 k=1 k=1 j=1
(13)
Since we consider only chemical reactions where the potential energy is conserved,
for each reaction it holds
n
ψk vk j = 0, ( j = 1, 2, . . . , r ) , (14)
k=1
and the last term of Eq. (13) vanishes. As a result we obtain an equation for the rate
of change of the mechanical energy as a sum of kinetic energy 21 ρv2 and potential
energy ρψ,
∂ρ 1 2
v +ψ 1 2 n
2
= − div ρ v + ψ v + Pv + ψk Jk
∂t 2 k=1
(15)
n
+ P : ∇v − J k · Fk .
k=1
Clearly, since a source term appears at the right-hand side of the equation (underlined
in 15), the mechanical energy is not a conserved quantity.
According to the first law of thermodynamics (conservation of energy) the total
energy within an arbitrary control volume Ω in the system can only change due to
energy fluxes Je through the boundary ∂Ω:
d ∂ρe
ρe dΩ = dΩ = − Je · n dΩ. (16)
dt Ω Ω ∂t ∂Ω
290 K. Weinberg et al.
Here e is the total specific energy and n denotes the unit outward normal on ∂Ω. An
application of Gauss’ theorem provides the local form of energy conservation
∂ρe
= − div Je . (17)
∂t
In general the total energy flux Je includes a convective term ρev, an energy
flux Pv
due to mechanical work performed on the system, a potential energy flux k ψk Jk
due to diffusion and finally a heat flow Jθ
n
Je = ρev + Pv + ψk Jk + Jθ . (18)
k=1
If we keep in mind that the total specific energy is defined as the sum of specific
kinetic energy 21 v2 , the specific potential energy ψ and the specific internal energy
u, we can subtract Eq. (15) from Eq. (17) and use the formula for the energy flux (18)
to obtain the balance equation for the internal energy
∂ρu n
= − div (ρuv + Jθ ) − P : ∇v + J k · Fk . (19)
∂t k=1
This equation shows that also the internal energy u is not a conserved quantity. Again
there is a source term, which is equal but of opposite sign to the source term in the
balance equation (15) of the mechanical energy. Therefore, such a formulation of the
balance equation for the internal energy inherently guarantees the conservation of
total energy. In our notation it is convenient to write Eq. (19) in an alternative form.
For this purpose we split the total pressure tensor P into a spherical/hydrostatic part
pI = tr (P) /3I and a deviatoric part S̄,
P = pI + S̄ (20)
where I is the identity tensor. With relations (20) and (9), Eq. (19) becomes
du n
ρ = − div Jθ − p div v − S̄ : ∇v + J k · Fk . (21)
dt k=1
Here we have used that I : ∇v = tr (∇v) = div v. Another version of the mass
continuity equation in terms of the specific volume ν ≡ ρ −1
dν
ρ = div v (22)
dt
Thermal Diffusion in a Polymer Blend 291
du dν n
= −ν div Jθ − p − ν S̄ : ∇v + ν J k · Fk . (23)
dt dt k=1
In order to find an explicit form of the entropy balance equation we have to insert
the expressions for du/dt (23) and dck /dt (4) into Eq. (2), which becomes
1 1 1
n n r
ds 1 1
ρ = − div Jθ − S̄ : ∇v + J k · Fk + μk div Jk − A j k̂ j .
dt T T T T T
k=1 k=1 j=1
(24)
Here we have introduced the so-called chemical affinity A j of the j-th reaction
defined by
n
Aj = vk j μk , ( j = 1, 2, . . . , r ) . (25)
k=1
Now we intend to bring Eq. (24) into the typical structure of a balance equation
ds
ρ = − div Js + πs , (26)
dt
where Js is a general entropy flux and πs is an entropy source strength. According
to the second law of thermodynamics the entropy source πs vanishes for reversible
(or equilibrium) thermodynamic processes and it holds πs > 0 for irreversible ther-
modynamic transformations. Consequently, it must hold πs ≥ 0 for a general ther-
modynamic process.
By means of the relations (12) we obtain the entropy balance equation in the
required form
⎛
n ⎞
Jθ − μk Jk
ds ⎜ ⎟
ρ = − div ⎜
k=1 ⎟ − 1 Jθ ∇T
dt ⎝ T ⎠ T2
(27)
1
n μ 1 1
r
k
− Jk · T ∇ − Fk − S̄ : ∇v − A j k̂ j .
T k=1
T T T j=1
From comparison with (26) it is possible to identify the entropy flux and the entropy
source term as
1 n
Js = Jθ − μk Jk , (28)
T k=1
292 K. Weinberg et al.
1 1
n μ 1 1
r
k
πs = − J θ ∇T − J k · T ∇ − F k − S̄ : ∇v − A j k̂ j ≥ 0.
T2 T k=1 T T T j=1
(29)
At the first glance the applied separation into flux quantities and entropy source con-
tribution seems to be arbitrary, but Js and πs have to satisfy a number of requirements
which determine this separation uniquely; for a discussion we refer to [2].
Let us now have a closer look at the expressions for the entropy flux Js (28) and
the entropy source (29). Expression (28) indicates that the entropy flow consists
of a reduced heat flow Jθ /T and a current due to diffusion. The entropy source
expression (29) demonstrates that it can be divided into four contributions which all
are connected to a flow quantity. The first term of (29) arises from heat conduction
and is connected to heat flow Jθ , the second is a weighted mass diffusion flow Jk ,
the third one connects the momentum flow/viscous pressure S̄ to gradients of the
velocity field and the fourth term is a sum of chemical rates J j multiplied by their
affinities A j . In miscellaneous physical applications the terms in the entropy source
are classified into thermodynamic fluxes, and quantities which multiply the fluxes
are called thermodynamic forces or affinities.
In order to derive the model for thermal diffusion based on our entropy source
formulation (29), it is convenient to split off all thermodynamic forces proportional
to the temperature gradient which multiply the diffusive flux Jk . By means of the
thermodynamic relation
μ 1 hk
k
d = (dμk )T =const. − 2 dT, (30)
T T T
where h k := μk − T ∂μk /∂ T is the partial specific enthalpy of component k. Please
note that relation (30) takes into account that the chemical potentials μk depend on
a spatially non-uniform temperature field. Now we introduce a generalized heat flux
Jθ as
n
Jθ = Jθ − h k Jk (31)
k=1
1 1
n r
1 1
πs = − 2 Jθ ∇T − Jk · ((∇μk )T =const. − Fk ) − S̄ : ∇v − A j k̂ j .
T T k=1 T T j=1
(32)
Thermal Diffusion in a Polymer Blend 293
The generalized heat flux Jθ involves with the term k h k Jk a transfer of heat due
to diffusion. Therefore the quantity Jθ can be interpreted as an irreversible heat flow.
An alternative form of the conservation of mass accounts for the fact, that the sum
of all mass fluxes is zero,
n
Jk = 0. (33)
k=1
1
r
1 1
πs = − J ∇T − S̄ : ∇v − A j k̂ j (34)
T2 θ T T j=1
1
n−1
− Jk · (∇ (μk − μn ))T =const. − Fk + Fn .
T k=1
Now we will leave the general framework in order to reduce the general form of the
entropy source (34) to a multicomponent isotropic mixture where the concentrations
and the temperature are non-uniformly distributed over the system. In this text we
consider only isobaric, mechanically equilibrated systems where no external forces
and chemical reactions are supposed to be present. In this case the entropy source
reduces to the simple equation
1
n−1
1
πs = − J θ ∇T − Jk · [∇ (μk − μn )]T, p=const. . (35)
T2 T k=1
n
ρk δμk = −ρsδT + δp (36)
k=1
n
ρk ∇μk = 0. (37)
k=1
In this context δ (•) denotes the variation with respect to spatial coordinates. With
the help of Eq. (37) we can eliminate μn from (35). This gives
1
n−1
1
πs = − J θ ∇T − Jk · Akm [∇μm ]T, p=const. , (38)
T2 T k=1
294 K. Weinberg et al.
and assume a linear dependency between the fluxes and affinities in form of:
n−1
J• = L •θ X• + L •k Xk . (41)
k=1
This leads to a quadratic expression for the entropy source strength. To guarantee the
positive definiteness of the entropy source strength πs it is therefore appropriate to
take the following choice of the phenomenological equations for the thermodynam-
ical fluxes Jθ and Ji :
1
n−1
L θθ
Jθ = − ∇T − L θk Akm [∇μm ]T, p=const. , (42)
T2 T k,m=1
1
n−1
L iθ
Ji = − 2 ∇T − L ik Akm [∇μm ]T, p=const. . (43)
T T k,m=1
Let us finally focus on the case of a binary mixture (n = 2). In this situation the
coefficients Akm reduce to
c1 1
A11 = 1 + = (45)
c2 c2
Thermal Diffusion in a Polymer Blend 295
L θθ L θ1
Jθ = − ∇T − [∇μ1 ]T, p=const. , (46)
T2 T c2
L 1θ L 11
J1 = − 2 ∇T − [∇μ1 ]T, p=const. . (47)
T T c2
L θθ
Λ= thermal conductivity (48)
T2
L θ1
Dθ = Dufour coefficient (49)
ρc1 c2 T 2
L 1θ
DT = thermal diffusion coefficient (50)
ρc1 c2 T 2
L 11
M= mobility coefficient (51)
ρT c2
The Onsager reciprocal relations imply equality between the thermal mass diffusivity
DT and the Dufour coefficient Dθ which leads to thermodynamical fluxes in the form:
JT = −ρ DT c (1 − c) ∇T. (54)
296 K. Weinberg et al.
In a single phase it is not possible to achieve thermal diffusion, since JT vanishes for
c = 0 and c = 1. Usually the thermally activated diffusive mass current may occur
in either direction, dependent on the materials involved. Thermophilic substances
diffuse in the direction of the temperature gradient. Thermophobic materials diffuse
in the direction opposite to the temperature gradient. Typically the heavier/larger
species in a mixture exhibits a thermophobic behavior while the lighter/smaller
species exhibit thermophilic behavior. In addition to the sizes of the various types of
particles and the steepness of the temperature gradient, the heat conductivity and heat
absorption of the particles play a significant role in thermal diffusion. However JT
leads to a buildup of a concentration gradient, which is accompanied by a generalized
Fickean type mass diffusion current,
J D = −ρM∇μ. (55)
Consequently, the entire diffuse mass flux is then J = J D +JT . Note that the mobility
coefficient M may be replaced by a tensor valued mobility to account for anisotropic
effects.
At this point it should be clear that the Ludwig-Soret effect can be used in various
technical applications to influence the microstructure of materials. Thermal diffusion
is a powerful tool in pharmacology to discover and design new types of drugs [5]. As
well it is employed as a technique for manipulating single biological macromolecules
such as DNA in polymer micro- and nanochannels, cf. [23, 27].
m κi (T )
where Ψ con is the configurational energy density and Ψ int (φ, T ) = i=1 2
∇φi 2
characterizes the contribution of interfacial energy. The temperature dependent para-
meters κi are related to surface energy density γi and length li of the transition regions
between the domains of each phase. In this context T is the temperature of the sys-
tem, k B denotes the Boltzmann constant and v characterizes a corresponding unit
volume. In (56) the free energy density Ψ = Ψ con + Ψ int is a dimensionless quantity
which is multiplied by the coefficient k B T /v to obtain an energy density J/m3 .
Thermal Diffusion in a Polymer Blend 297
The corresponding unit volume v comprises an average volume that a monomer with
Kuhn length (persistence length) l undergoes within the scenario of a random walk.
In the following we consider specifically a PDMS-PEMS mixture. Both compo-
nents are polymeric organosilica with a wide range of technical applications, such as
material for contact lenses, adhesives, coating implementations, silicone based lubri-
cants, and even as a food additive in defoaming agents. The structural formulas of
PDMS and PEMS show their similar molecular structure. Both are terminated with
trimethylsiloxane endgroups, one methyl respectively ethyl group in the repeating
unit makes the only difference.
⎡ ⎤
CH3 CH3 CH3
⎢ ⎥
⎢ ⎥
CH3 Si O⎢
⎢ Si O⎥
⎥ Si CH3 (PDMS)
⎣ ⎦
CH3 CH3 CH3
n
⎡ ⎤
CH3 CH3 CH3
⎢ ⎥
⎢ ⎥
CH3 Si O⎢
⎢ Si O⎥
⎥ Si CH3 (PEMS)
⎣ ⎦
CH3 C2 H5 CH3
n
c (1 − c)
Ψ con (c) = g A c + g B (1 − c) + ln (c) + ln (1 − c) + χ c (1 − c) .
NPDMS NPEMS
(57)
Here NPDMS and NPEMS represent the degrees of polymerization, and χ is a temper-
ature dependent material parameter characterizing the chemical interaction between
the constituents of the mixture. The value of χ is usually approximated by a rela-
tion of the form χ = a + bT −1 , where a and b are experimentally obtained fitting
parameters. The terms g A c and g B (1 − c) quantify the free energy of the individual
components.
In order to investigate the evolution of the concentration field c (x, t) and the
corresponding temperature field T (x, t) in a representative domain Ω within the
time t¯ we make use of the diffusion equation, which expresses the conservation of
mass in the system:
∂c
= −∇ · j + ξ (x, t) . (58)
∂t
298 K. Weinberg et al.
The flux j is given by flux (53) divided by the average density ρ, ξ (x, t) is a random
variable arising from thermal fluctuations. The chemical potential is related to Ψ =
Ψ con + Ψ int by the variational derivative:
Using this relation, Eq. (58) results in a modified Cahn-Hilliard equation ([7], cf. [2])
coupled with a heat diffusion equation with specific heat capacity c p , conductivity
k
and heat source q. Thus,
in the strong form the problem reads: Find c : Ω × 0, t¯ →
R and T : Ω × 0, t¯ → R such that
∂c
= ∇ · (M∇μ + DT c(1 − c)∇T ) , in Ω × [0, t¯] (60)
∂t
∂T
cpρ = ∇ · (k∇T ) + q, in Ω × [0, t¯]. (61)
∂t
Initial concentration and temperature are given by
with properties ∂Ω = ∂Ωe ∪ ∂Ωn and ∂Ωe ∩ ∂Ωn = ∅. Vector n denotes the unit
outward normal to ∂Ω. The specific heat capacity c p and thermal conductivity k are
assumed to have the form
where PPDMS , PPEMS , Q PDMS and Q PEMS are polynomial functions of degree 3
which fit the specific heat capacity and thermal conductivity of the two species
for T ∈ [50 K, 340 K] and T ∈ [230 K, 410 K], respectively. We use an Onsager
coefficient of M = 1.815 × 10−18 m5 / (J s) and a thermal diffusivity of DT =
−2 × 10−13 m2 / (s K). Additional parameters are summarized in Table 1. The mate-
rial data as well as the experimental results are obtained from [17, 24–26].
Since PDMS is a thermophilic polymer its thermal diffusion coefficient has a
negative sign. The corresponding unit volume is assumed as v = 4π (σ A /2)3 /3.
The critical temperature of the considered polymer blend is Tcrit. = 313 K and the
critical interaction parameter χcrit. evaluates to χcrit. = 0.0084318. At this point it is
Thermal Diffusion in a Polymer Blend 299
important to mention that the chosen value for a = −1.8557 × 10−3 significantly
differs from the value given in [25]. There the authors suggest a = 2.9× 10−3 , which
is obviously erroneous, because it does not fit χcrit. . The approximation procedure
for the values of the interaction parameter χ prescribes negative values for a and
positive values for b in such a manner that it holds
⎧
⎪
⎨χcrit. , for T = Tcrit.
χ (T ) = χ (T ) < χcrit. , for T > Tcrit. (65)
⎪
⎩
χ (T ) > χcrit. , for T < Tcrit. .
Our value for a perfectly fits these conditions. The corresponding plot of the config-
urational energy density is presented in Fig. 2.
4 Numerical Approximation
For finite element analysis we reformulate our coupled diffusion model in a vari-
ational form. Note that the mass diffusion equation involves spatial derivatives of
fourth order. Thus, we define the spaces of admissible test functions V c = {δc ∈
H 2 (Ω) | ∇δc · n = 0 on ∂Ω × (0, t˜)} and V T = {δT ∈ H 1 (Ω) | δT =
0 on ∂Ωe × (0, t˜)} where H 1 (Ω) and H 2 (Ω) are the Sobolev space of square
integrable functions with square integrable derivatives of first and of second order,
respectively. The weak forms of the boundary-value problems (60) and (61) follows
as
300 K. Weinberg et al.
∂T
ρ cp δT dV + k∇T · ∇δT dV − α I δT dV = 0,
∂t
Ω Ω Ω
∂c
δc dV + M ∇∂c Ψ · ∇δc dV + DT c(1 − c)∇T · ∇δc dV
con
(66)
∂t
Ω Ω Ω
+ λM ΔcΔδc dV = 0,
Ω
d
B A = Bpi (ξ ) = Bpi (ξ 1 , . . . , ξ d ) = Nil , pl (ξ l ), (67)
l=1
with multi-index i=[i 1 , . . . , i d ] and supp(B A ) = [ξi11 , ξi11 + p1 +1 ]×· · ·×[ξidd , ξidd + pd +1 ].
It provides the necessary support for the required continuity. The recursive definition
of a univariate B-spline is given as follows
ξ − ξill ξill + pl +1 − ξ
Nil , pl (ξ ) = Nil , pl −1 (ξ ) + Nil +1, pl −1 (ξ ), (68)
ξill + pl − ξill ξill + pl +1 − ξill +1
∂c cn+1 − cn
= (70)
∂t Δt
with time step Δt = tn+1 −tn . The time integration is performed by an implicit Crank-
Nicholson scheme, known to be second-order accurate. Now the fully discretized
problem reads
ρ
c p,n+1/2 (Tn+1 − Tn )B A dV + kn+1/2 ∇Tn+1/2 · ∇ B A dV − α I B A dV = 0,
Δt
Ω Ω Ω
1
(cn+1 − cn )B A dV + M (∂c2 Ψn+1/2
con ∇c A
n+1/2 + ∂c ∂T ∇Tn+1/2 ) · ∇ B dV
Δt
Ω Ω
+ DT cn+1/2 (1 − cn+1/2 )∇Tn+1/2 · ∇ B A dV + λM Δcn+1/2 ΔB A dV = 0,
Ω Ω
(71)
for all A ∈ {1, . . . , n}.
5 Simulation Results
is used. The laser spot points at the middle of the upper surface of an aged cuboid
of side length l x = l y = 1.5 µm and hight 0.75 µm, see Fig. 4, with the diameter of
the spot being about 3/5 of the domain size in the two-dimensional and 1/5 in the
tree-dimensional model. The laser intensity and the optical absorption coefficient are
given by I0 = 3.75 × 107 W m−2 and α = 500 m−1 , respectively. In both settings
we consider a critical PDMS-PEMS blend with a composition of 55 % PDMS and
302 K. Weinberg et al.
Fig. 4 Temperature field with laser intensity (72) in K (left) and computational mesh (right) for
the three-dimensional simulation
Fig. 5 Micrographs of a PDMS-PEMS polymer blend exposed to a focused laser spot. The specimen
size is about 100 µm × 100 µm. Subfigures are adapted from [24]
45 % PEMS. Therefore, the initial concentration is set c0 (x) = 0.55 including slight
randomly perturbed inhomegeneities (±1 %).
Köhler et al. studied the microstructural evolution of a PDMS-PEMS blend sub-
jected to a focused laser spot in [24, 25]. In their experimental set-up the blend
under consideration was initially quenched into the spinodal regime for approxi-
mately 120 min before exposing it to the laser, see Fig. 5a. At this moment the blend
has already completed phase decomposition and reached an intermediate state of
coarsening. The recorded experimental observations indicate the following scenario.
Starting from a homogeneous one-phase configuration the early stages of microstruc-
tural evolution are driven by spinodal decomposition of phases. After the sample is
exposed to laser illumination, thermal forcing asserts itself gradually against the sup-
pression of phase decomposition. Over time the inhomogeneous temperature field
increasingly coins the microstructure. Figure 5b, c show the sample after an exposure
of 100 and 200 s, respectively. Due to the local heating, a circle of PDMS evolves in
the middle of the micrograph. Since a PDMS enrichment takes place at the heated
domain within the sample an outer ring of PEMS-rich material surrounds the concen-
Thermal Diffusion in a Polymer Blend 303
tric PDMS circle. The structures slowly propagate through the sample like spherical
waves in a medium. Outside the heated spot a more irregular spinodal pattern unfolds
due to the decreasing impact of thermal diffusion, see Fig. 5 and [24, 25].
The simulation setting is also arranged in such a manner that the polymer blend
is at first homogeneously quenched from the one-phase regime, see Fig. 6. Here
and below the reddish areas denote the PDMS-rich α-phase and the blue domains
represent the PEMS-rich β-phase. Heating with the laser spot starts at a system time
t = 0.03 s. Similar to the experimental observations a spherical PDMS-rich phase
emerges in the center of the domain, where the laser induces an enrichment of the
thermophilic PDMS. The PDMS is attracted from the surrounding area, and—like
in the experiments—a second ring of a PEMS-rich phase evolves around the heated
spot. It seems that the PEMS-ring spreads like a spherical wave starting at the spot
in the middle of the domain Ω. In the outer part of Ω the thermal effect decreases
and more irregular spinodal pattern of phases become prevalent. Note that since
the relaxation into thermal equilibrium occurs on a time scale that is by a factor
of 1000 faster then the time scale at which diffusion takes place, we simplified the
two-dimensional model by assuming quasistatic equilibrium for the heat equation.
As we continue in time, a typical coarsening process dominates the evolution of
phases outside the laser spot. The microstructure within the exposure of the laser spot
is characterized by a PDMS-rich concentric ring structure, which grows towards the
center of the laser illumination, see Fig. 7. After the PDMS enrichment within the
Fig. 6 Phase decomposition and early stages of coarsening for a critical PDMS-PEMS polymer
blend subjected to a focused laser spot
304 K. Weinberg et al.
Fig. 7 Phase coarsening of the PDMS-PEMS polymer blend subjected to a focused laser spot (red
PDMS, blue PEMS)
heated spot is finished and the PEMS material disappeared from the center of the
spot at t = 6 s, the phase coarsening process takes the leading role in microstructural
evolution. In the very late stages additional PDMS material from the surroundings
is deposited at the PDMS circle.
In order to integrate the focused laser spot into the three-dimensional simulation,
a laser intensity I0 (x) = 3.2 × 1011 W m−2 is focused on a domain Ω̄ which is
defined by
! 2 2 "
3 3 3 29 3
Ω̄ := x | x1 − µm + x2 − µm ≤ µm, µm ≤ x3 ≤ µm .
4 4 20 40 4
a ring of PDMS forms around the heated region. The subsequent pictures show an
enrichment of PDMS in the heated center whereas the microstructural evolution in the
remaining region is mainly driven by spinodal decomposition of phases. The last row
of Fig. 8 illustrates the distribution across the depth where we see that thermophoresis
mainly takes place at the heated surface, whereas the decomposition of phases in the
remaining domain is a slower effect.
306 K. Weinberg et al.
6 Conclusions
Acknowledgments The authors gratefully acknowledge the support of the Deutsche Forschungs-
gemeinschaft (DFG) under the grants WE2525/2-3, WE2525/4-1 and WE2525/8-1.
References
1. Anders, D., & Weinberg, K. (2011). A variational approach to the decomposition of unstable
viscous fluids and its consistent numerical approximation. ZAMM—Zeitschrift für angewandte
Mathematik und Mechanik, 91(8), 609–629.
2. Anders, D., & Weinberg, W. (2012). Thermophoresis in binary blends. Mechanics of Materials,
47, 33–50.
3. Anders, D., Reichert, R., & Weinberg, K. (2011). Isogeometric analysis of thermal diffusion
in binary blends. Computational Materials Science, 52(1), 182–188.
4. Anders, D., Hoffmann, A., Scheffler, H.-P., & Weinberg, K. (2011). Application of operator-
scaling anisotropic random fields to binary mixtures. Philosophical Magazine, 91(29), 3766–
3792.
5. Baaske, P., Wienken, C. J., Reineck, P., Duhr, S., & Braun, D. (2010). Optical thermophoresis
for quantifying the buffer dependence of aptamer binding. Angewandte Chemie International
Edition, 49, 2238–2241.
6. Baumgärtner, A., & Heermann, D. W. (1986). Spinodal decomposition of polymer films. Poly-
mer, 27(11), 1777–1780.
7. Cahn, J. W. (1961). On spinodal decomposition. Acta Metallurgica, 9(9), 795–801.
8. Cimmelli, V. A., Jou, D., Ruggeri, T., & Ván, P. (2014). Entropy principle and recent results
in non-equilibrium theories. Entropy, 16(3), 1756.
9. de Gennes, P. G. (1980). Dynamics of fluctuations and spinodal decomposition in polymer
blends. Journal of Chemical Physics, 72(9), 4756–4763.
10. de Groot, S. R., & Mazur, P. (1962). Non-equilibrium thermodynamics. Amsterdam: North-
Holland.
11. Enge, W., & Köhler, W. (2004). Thermal diffusion in a critical polymer blend. Physical Chem-
istry Chemical Physics, 6, 2373–2378.
12. Flory, P. J. (1942). Thermodynamics of high polymer solutions. Journal of Chemical Physics,
10(1), 51–61.
13. Hashimoto, T., Kumaki, J., & Kawai, H. (1983). Time-resolved light scattering studies on
kinetics of phase separation and phase dissolution of polymer blends. 1. Kinetics of phase
separation of a binary mixture of polystyrene and poly(vinyl methyl ether). Macromolecules,
16(4), 641–648.
14. Hesch, C., Schuß, S., Dittmann, M., Franke, M., & Weinberg, K. (2016). Isogeometric analysis
and hierarchical refinement for higher-order phase-field models. Computer Methods in Applied
Mechanics and Engineering, 303, 185–207.
Thermal Diffusion in a Polymer Blend 307
15. Huggins, M. L. (1942). Theory of solutions of high polymers. Journal of the American Chemical
Society, 64(7), 1712–1719.
16. Itskov, M. (2007). Tensor algebra and tensor analysis for engineers (with applications to
continuum mechanics). Berlin: Springer.
17. Krekhov, A. P., & Kramer, L. (2004). Phase separation in the presence of spatially periodic
forcing. Physical Review E, 70, 061801.
18. Lebon, G., Jou, D., & Casas-Vázquez, J. (2007). Understanding non-equilibrium thermody-
namics. In Foundations, Applications, Frontiers. Springer: Berlin.
19. Lee, K.-W. D., Chan, P. K., & Feng, X. (2002). A computational study of the thermal-induced
phase separation phenomenon in polymer solutions under a temperature gradient. Macromole-
cular Theory and Simulations, 11, 996–1005.
20. Lee, K.-W. D., Chan, P. K., & Feng, X. (2003). A computational study of the polymerization-
induced phase separation phenomenon in polymer solutions under a temperature gradient.
Macromolecular Theory and Simulations, 12(6), 413–424.
21. Lee, K.-W. D., Chan, P. K., & Feng, X. (2004). Morphology development and characterization of
the phase-separated structure resulting from the thermal-induced phase separation phenomenon
in polymer solutions under a temperature gradient. Chemical Engineering Science, 59(7), 1491–
1504.
22. Strobl, G. R. (1985). Structure evolution during spinodal decomposition of polymer blends.
Macromolecules, 18(3), 558–563.
23. Thamdrup, L. H., Larsen, N. B., & Kristensen, A. (2010). Light-induced local heating for
thermophoretic manipulation of DNA in polymer micro- and nanochannels. Nano Letters,
10(2), 826–832.
24. Voit, A. (2007). Photothermische Strukturierung binärer Polymermischungen. Ph.D. thesis,
University of Bayreuth.
25. Voit, A., Krekhov, A., Enge, W., Kramer, L., & Köhler, W. (2005). Thermal patterning of a
critical polymer blend. Physical Review Letters, 94, 214501.
26. Voit, A., Krekhov, A., & Köhler, W. (2007). Quenching a UCST polymer blend into phase
separation by local heating. Macromolecules, 40, 9–11.
27. Würger, A. (2007). Das Salz in der DNA-Suppe. Physik Journal, 7(22–24), 2014.