Sobolev
Sobolev
Sobolev Spaces
S. Kesavan
The Institute of Mathematical Sciences,
Chennai.
e-mail: [email protected]
2
1 Distributions
In this section we will, very briefly, recall concepts from the theory of dis-
tributions that we will need in the sequel. For details, see Kesavan [1],
Chapter 1.
Throughout these lectures, we will be working with an open set Ω ⊂ RN .
Let us briefly motivate our study of distributions and Sobolev spaces.
One of the important partial differential equations that we often study is the
Laplace’s equation:
−∆u = f in Ω
together with some appropriate boundary condition. It turns out that in
elasticity and structural engineering, the importance of the solution u stems
from the fact that it minimizes, amongst ‘admissible functions’ v, the energy
functional Z Z
1 2
J(v) = |∇v| dx − f v dx.
2 Ω Ω
Now it can happen, in many applications, that the function f is not
continuous but just, say, in L2 (Ω). Then, for the second term in the above
expression for J to make sense, it follows that v must also belong to L2 (Ω).
The first term in J will make sense if all the first partial derivatives of u, i.e.
∂u
∂xi
, 1 ≤ i ≤ N are all in L2 (Ω) as well.
But, when we are dealing with functions in L2 (Ω), what do we mean by its
derivatives? This is where we need to generalize the notion of a function and
its derivatives and interpret partial differential equations in the new set-up.
The framework for this comes from the theory of distributions. Just as we
can think of a real number as a linear operator on R acting by multiplication,
we can consider certain functions as linear operators on some special space
of functions and then generalize this.
Definition 1.1 Let f : Ω → R be a continuous function. Its support,
denoted supp(f ), is the closure of the set where f is non-zero. The function
is said to be of compact support in Ω if the support is a compact set
contained inside Ω.
The space D(Ω) is a very rich space and can be made into a locally convex
topological vector space.
Definition 1.3 The dual of D(Ω) is called the space of distributions,
denoted D0 (Ω), on Ω and its elements are called distributions on Ω.
Example
R 1.1 A function u : Ω → R is said to be locally integrable
if K |u| dx < ∞ for every compact subset K of Ω. A locally integrable
function defines a distribution in the following way: if ϕ ∈ D(Ω), then
Z
u(ϕ) = uϕ dx
Ω
3
where we use the same symbol u for the function as well as the distribution
it generates, for, if u and v generate the same distribution, then it can be
shown that u = v a.e. Now every smooth function and every function in any
of the spaces Lp (Ω), for 1 ≤ p ≤ ∞, are all locally integrable and so they all
can be considered as distributions.
Example 1.3 Let µ be a measure on Ω such that µ(K) < ∞ for every
compact subset K contained in Ω. Then it defines a distribution given by
Z
Tµ (ϕ) = ϕ dµ
Ω
for every ϕ ∈ D(Ω). The Dirac distribution mentioned above is just the
distribution generated by the Dirac measure supported at the origin.
T (ϕ) = ϕ0 (0)
for every ϕ ∈ D(R). This distribution does not fall into any of the categories
of the preceding examples and is an entirely new object. It is called the
dipole distribution.
Similarly, Z Z Z
00 0 0
u ϕ dx = − u ϕ dx = uϕ00 dx
R R R
and so on. We can use this to define the derivative of any distribution T on
R as follows: if T is a distribution on R, then, for any positive integer k,
k
define the distribution ddxTk by
dk T
k
(ϕ) = (−1)k T (ϕ(k) )
dx
k
for any ϕ ∈ D(R), where ϕ(k) = ddxϕk . We can do this on any open subset
Ω ⊂ RN , for any space dimension N . To describe it we first establish some
4
useful notation.
Notation
A multi-index α is an N -tuple of non-negative integers. Thus,
α = (α1 , · · · , αN )
x α
= xα1 1 xα2 2 · · · xαNN
for x = (x1 , x2 , · · · , xn ) ∈ RN ;
∂ |α|
Dα = .
∂xα1 1 · · · ∂xαNN
dδ
Example 1.5 The dipole distribution (cf. Example 1.4) is nothing but − dx ,
where δ is the Dirac distribution (cf. Example 1.2).
5
same? The answer lies in the integration by parts formula. If, for every
ϕ ∈ D(Ω) we have Z Z
u ϕ dx = − uϕ0 dx,
0
then the classical derivative is also the distributional derivative. This obvi-
ously happens when u is a smooth function. It is also true for absolutely
continuous functions, cf. Kesavan [1].
In this context, we can also ask the following question. We know that
if u is a differentiable function on R such that u0 ≡ 0, then u is a constant
function. Does the same hold for the distribution derivative? That is, if T is
a distribution so that dTdx
is the zero distribution, then, is T the distribution
generated by a constant function? In other words, does there exist a constant
c ∈ R such that, for every ϕ ∈ D(R), we have
Z
T (ϕ) = c ϕ dx?
R
which establishes our claim with c = T (ϕ0 ). Note that whatever the function
ϕ0 we may choose in D(R) whose integral is unity, the value of T (ϕ0 ) is the
same (why?) so that c is well-defined.
We conclude this section by mentioning two very useful collections of
smooth functions.
6
Consider the following function defined on RN .
(
− 1 2
ρ(x) = e 1−|x| , if|x| < 1,
0, if|x| ≥ 1,
where |x| is the euclidean length of the vector x ∈ RN . It can be shown that
this is a function in D(RN ), with support in B(0; 1), the ball centred at the
origin and of unit radius. Let
Z
k = ρ(x) dx.
RN
Definition 1.5 The family of mollifiers {ρε }ε>0 is the collection of func-
tions defined by
ρε (x) = k −1 ρ(x/ε).
Then it is easy to see that ρε ∈ D(RN ) with support B(0; ε), the ball centered
at the origin and of radius ε and is such that
Z Z
ρε (x) dx = ρε (x) dx = 1
RN B(0;ε)
∞
Definition 1.6 Let {Ui }m i=1 be open sets whose union is U . Then a C -
partition of unity, subordinate to the collection {Ui }m i=1 is a collection
∞
{ψi }mi=1 of C functions defined on U such that
(i) supp(ψi ) ⊂ Ui , for all 1 ≤ i ≤ m;
(ii) 0P ≤ ψi (x) ≤ 1, for all x ∈ U and for all 1 ≤ i ≤ m;
(iii) m i=1 ψi (x) = 1, for all x ∈ U.
2 Sobolev Spaces
Let Ω ⊂ RN be an open set and let ∂Ω denote its boundary.
Definition 2.1 Let m be a positive integer and let 1 ≤ p ≤ ∞. The Sobolev
Space W m,p (Ω) is defined by
The space W m,p (Ω) is a vector space contained in Lp (Ω) and we endow it
with the norm k.km,p,Ω defined as follows.
p1
X
kukm,p,Ω = kDα ukpLp (Ω)
|α|≤m
if 1 ≤ p < ∞ and
kukm,∞,Ω = max kDα ukL∞ (Ω) .
|α|≤m
7
Notations and conventions
• When p = 2, we will write H m (Ω) instead of W m,2 (Ω). The corre-
sponding norm k.km,2,Ω will be written as k.km.,Ω and it is generated by
the inner-product
X Z
(u, v)m,Ω = Dα uDα v dx.
|α|≤m Ω
8
and so it follows that u ∈ W 1,p (Ω) and that un → u in W 1,p (Ω). This proves
the completeness of the spaces W 1,p (Ω).
The image of W 1,p (Ω) in (Lp (Ω))N +1 via the isometry described above
is thus a closed subspace and so it inherits the reflexivity and separability
properties of (Lp (Ω))N +1 . This completes the proof of the theorem.
Remark 2.1 Let {un } be a sequence in W 1,p (Ω), 1 < p < ∞. Let un → u
in Lp (Ω) and let { ∂u
∂xi
n
} be bounded in Lp (Ω) for all 1 ≤ i ≤ N . Then, since
Lp (Ω) is reflexive, it follows that there exists a subsequence {unk } such that
∂unk
* vi
∂xi
weakly in Lp (Ω). Thus if ϕ ∈ D(Ω), then we have
Z Z
∂unk ∂ϕ
ϕ dx = − unk dx
Ω ∂xi Ω ∂xi
from which we deduce that
Z Z
∂ϕ
vi ϕ dx = − u dx
Ω Ω ∂xi
∂u
so that it follows that u ∈ W 1,p (Ω) and that ∂xi
= vi . This is a very useful
1
observation. Since L (Ω) is separable, the same idea will also work for the
case p = ∞.
Remark 2.2 Theorem 2.1 is true for all spaces W m,p (Ω).
Definition 2.2 The closure of the subspace D(Ω) in W m,p (Ω) is the closed
subspace W0m,p (Ω) .
We will see later that, in general, W0m,p (Ω) is a proper closed subspace
of W m,p (Ω). We will, however, see in the next section that the two spaces
coincide when Ω = RN .
3 The case Ω = RN .
Let p = 2 and consider the case H m (RN ). If u belongs to this space, then it
follows that u, Dα u ∈ L2 (RN ), for all |α| ≤ m. Now, for a square integrable
function on RN , we can define its Fourier transform, which will also be square
integrable on RN , and by the Plancherel theorem, the L2 -norms of both the
function and its Fourier transform will be the same. Further, we also know
that
D
d α u(ξ) = (2πi)|α| ξ α u
b(ξ)
for ξ ∈ RN . Thus it follows that if u ∈ H m (RN ), then u
b(.) and ξ 7→
α 2 N
ξ ub(ξ), |α| ≤ m are all in L (R ), and conversely.
9
We can see easily that the same powers of ξ occur both in (1 + |ξ|2 )m
and in the sum |α|≤m |ξ α |2 and so we have the existence of two constants
P
M1 > 0 and M2 > 0, which depend only on m and N , such that
X
M1 (1 + |ξ|2 )m ≤ |ξ α |2 ≤ M2 (1 + |ξ|2 )m .
|α|≤m
By the Plancherel theorem, it also follows that the norm defined by the
following relation is equivalent to the norm in H m (RN ) and we will denote
it by the same symbol:
Z
2
kukm,RN = (1 + |ξ|2 )m |b
u(ξ)|2 dξ.
RN
supp(ρε ∗ u) ⊂ K + B(0; ε)
which is compact and is contained within a fixed compact set, say, K +B(0; 1)
if we restrict ε to be less than or equal to unity. On this compact set, u is
uniformly continuous and the δ corresponding to η in the previous step is now
independent of the point x and so the pointwise convergence is now uniform.
(iii) From the step (ii) above it is immediate that if u is continuous with
compact support in RN , then ρε ∗ u converges to u in Lp (RN ) as well, since
the entire family is supported in a single compact set and the convergence
there is uniform. Now, we know that continuous functions with compact
support are dense in Lp (RN ) for 1 ≤ p < ∞ and so given u ∈ Lp (RN ), we
can find, for every η > 0, a continuous function g with compact support such
that
η
|u − g|0,p,RN < .
3
Then, for ε sufficiently small, we have
η
|ρε ∗ g − g|0,p,RN < .
3
10
Then
Step 1. Let {ρε }ε>0 be the family of mollifiers. Then, if u ∈ W 1,p (RN ), it
follows from the preceding lemma that ρε ∗ u → u in Lp (RN ). Also, since
Dα (ρε ∗ u) = ρε ∗ Dα u, for any multi-index α, it follows, again from the
preceding lemma, that ρε ∗ u → u in W 1,p (RN ) as well. Further, notice that
ρε ∗ u ∈ C ∞ (RN ), by the properties of convolutions.
ζk (x) = ζ(x/k).
ϕk = ζk uk .
The first term on the right hand-side of the above inequlaity can be made as
small as we please for large k since uk → u in Lp (RN ). The second term can
also be made as small as we please for large k, since it represents the tail of
a convergent integral over RN . Thus, ϕk → u in Lp (RN ).
11
Then, since the derivatives of uk converge to those of u in Lp (RN ), we have
∂u
that the first term in the above relation converges to ∂xi
in Lp (RN ), exactly
as in Step 2 above. Further
∂ζk 1 ∂ζ x
= .
∂xi k ∂xi k
Since the derivatives of ζ are uniformly bounded, we deduce that
∂ζk
uk → 0
∂xi
in Lp (RN ). Thus, it follows that
∂ϕk ∂u
→
∂xi ∂xi
in Lp (RN ) for all 1 ≤ i ≤ N . Hence it follows that ϕk → u in W 1,p (RN ) and
so u ∈ W01,p (RN ). This completes the proof.
12
Proof: Let, as before, u e denote the extension of u by zero outside Ω. Then
if {ρε }ε>0 denotes the family of mollifiers, we know that (cf. Lemma 3.1)
ρε ∗ u
e→u e → u in Lp (Ω). Now, let Ω0 ⊂⊂ Ω. Then
e in Lp (RN ) and so ρε ∗ u
we can find another relatively compact open subset Ω00 such that
Ω0 ⊂⊂ Ω00 ⊂⊂ Ω.
Let ψ ∈ D(Ω) be such that ψ ≡ 1 in Ω00 . Let d = d(∂Ω0 , ∂Ω00 ) > 0. Now,
supp(ρε ∗ ψu
f − ρε ∗ u
e) = supp(ρε ∗ (1 − ψ)e
u)
⊂ B(0; ε) + supp(1 − ψ)
ρε ∗ ψu e in Ω0 .
f = ρε ∗ u
∂(ρε ∗ u
e) ∂(ρε ∗ ψu)
f ∂u
= → .
∂xi ∂xi ∂xi
Thus, for a sequence εk ↓ 0, we have constructed vk = ρεk ∗ u
e such that
vk ∈ C (R ), vk → u in L (Ω) and such that, for all 1 ≤ i ≤ N , ∂v
∞ N p k
∂xi
∂u
→ ∂x i
p 0 0
in L (Ω ), for any relatively compact open subset Ω contained in Ω. Now let
ζk be as in the proof of Theorem 3.1 and set uk = ζk vk which will have the
same convergence properties as the sequence {vk }.
Theorem 4.2 If there exists an extension operator on W 1,p (Ω), then given
u ∈ W 1,p (Ω), there exists a sequence {un } in D(RN ) such that un → u in
W 1,p (Ω).
13
Proof: There exists a sequence {un } in D(RN ) converging to P u in W 1,p (Ω),
as proved in Theorem 3.1. Since P u = u in Ω, the result follows immediately.
Corollary 4.1 If there exists an extension operator on W 1,p (Ω), then C ∞ (Ω)
is dense in W 1,p (Ω).
RN
+ = {x = (x1 , · · · , xN ) ∈ R
N
| xN > 0}
∂(G ◦ u) ∂u
= (G0 ◦ u) , 1 ≤ i ≤ N,
∂xi ∂xi
where (G ◦ u)(x) = G(u(x)).
Proof: Since G(0) = 0, by the mean value theorem, it follows that |G(s)| ≤
M s, for all s ∈ R. Thus, it follows that both G ◦ u and (G0 ◦ u) ∂x∂u
i
are in
p
L (Ω). Thus it suffics to prove the formula for the derivative of G ◦ u given
in the statement above to prove the theorem.
14
Case 1: Let 1 ≤ p < ∞. Let {un } be a sequence in D(RN ) as in Friedrich’s
theorem. Then, it is clear that G ◦ un → G ◦ u in Lp (Ω) and also that, for a
subsequence that we will henceforth work with and therefore continue to de-
note as {un }, G0 (un (x)) → G0 (u(x)) almost everywhere. Now, let ϕ ∈ D(Ω).
Choose a relatively compact subset Ω0 contained in Ω such that supp(ϕ) ⊂ Ω0 .
Since un is smooth, we have by the classical Green’s theorem (integration by
parts)
Z Z Z
∂ϕ ∂ϕ ∂un
(G ◦ un ) dx = (G ◦ un ) dx = − (G0 ◦ un ) ϕ dx.
Ω ∂xi Ω0 ∂xi Ω0 ∂xi
Then, with the convergences observed above and those guaranteed by Friedrich’s
theorem, we get, on passing to the limit,
Z Z Z
∂ϕ 0 ∂u ∂u
(G ◦ u) dx = − (G ◦ u) ϕ dx = − (G0 ◦ u) ϕ dx
Ω ∂xi Ω0 ∂xi Ω ∂xi
which proves the result.
Remark 4.1 The condition G(0) = 0 was used only to prove that if u ∈
Lp (Ω),then so does G ◦ u. If Ω were bounded, then constant functions are
in Lp (Ω) for all 1 ≤ p ≤ ∞ and the mean value theorem yields |G(u(x))| ≤
|G(0)| + M |u(x)| which shows that G ◦ u ∈ Lp (Ω) and so the condition
G(0) = 0 is no longer necessary in that case.
∂|u| ∂u
= sgn(u) , 1≤i≤N
∂xi ∂xi
where
+1, if u(x) > 0,
sgn(u)(x) = 0, if u(x) = 0,
−1, if u(x) < 0.
√
Proof: Let ε > 0. Let fε (t) = t2 + ε. Then fε ∈ C 1 (R) and
t
fε0 (t) = √
t2 + ε
which shows that |fε0 (t)| ≤ 1. Since Ω is bounded, the chain rule holds (even
though fε (0) 6= 0). Thus, u ∈ W 1,p (Ω) implies that fε ◦ u ∈ W 1,p (Ω) and
∂(fε ◦ u) u ∂u
= √ , 1 ≤ i ≤ N.
∂xi u + ε ∂xi
2
15
√
Now, |fε (t) − |t| | ≤ ε. Hence fε ◦ u → |u| in Lp (Ω). Further,
∂(fε ◦ u) p ∂u p
Z Z
∂xi dx ≤
dx
Ω ∂xi
Ω
which is bounded. Thus, since 1 < p < ∞, we deduce that (cf. Remark 2.1)
|u| ∈ W 1,p (Ω) and that
∂|u| ∂(fε ◦ u)
= lim
∂xi ε→0 ∂xi
in Lp (Ω) if that limit exists. But
u ∂u ∂u
√ → sgn(u)
u2 + ε ∂xi ∂xi
pointwise and since the p-th powers of the absolute values of all these func-
∂u p
tions are bounded by | ∂xi
| , it follows from the dominated convergence theo-
∂u
rem that the L -norms of these functions converge to the Lp -norm of ∂x
p
i
. The
pointwise convergence and the convergence of the norm implies convergence
in Lp and this completes the proof.
Theorem 4.5 Let Ω ⊂ RN be a bounded open set and let 1 < p < ∞. Let
u ∈ W 1,p (Ω). Then, for any t ∈ R, and for any 1 ≤ i ≤ N , we have that
∂u
∂xi
= 0 almost everywhere on the set {x ∈ Ω | u(x) = t}.
∂u ∂|u| ∂u
sgn(u) = = .
∂xi ∂xi ∂xi
∂u
It then follows that on the set {x ∈ Ω | u(x) = 0}, we have that ∂xi
= 0
almost everywhere. If u ∈ W 1,p (Ω), then u = u+ − u− , where
1 1
u+ = (|u| + u), u− = (|u| − u).
2 2
Then
Remark 4.2 Notice that the set {x ∈ Ω | u(x) = t} may itself be of measure
zero, in which case the above result gives no new information. However, if
the function takes a constant value on a set of positive measure, then its
derivative vanishes almost everywhere in that set.
16
Theorem 4.6 (Stampacchia) Let Ω ⊂ RN be a bounded open set and let
1 < p < ∞. Let f : R → R be a Lipschitz continuous function. Then, if
u ∈ W 1,p (Ω), we have f ◦ u ∈ W 1,p (Ω) and if f 0 is continuous except at a
finite number of points {t1 , · · · , tk }, then
0 ∂u
∂(f ◦ u) def (f ◦ u) ∂x (x), if u(x) 6∈ {t1 , · · · , tk },
(x) = vi = i
∂xi 0, otherwise.
Proof: First of all, it follows from the Lipschitz continuity of f and the
boundedness of Ω that f ◦ u ∈ Lp (Ω). If M is the Lipschitz constant of f ,
then we also have that |f 0 (t)| ≤ M . Then it also follows that vi ∈ Lp (Ω) for
1 ≤ i ≤ N.
Let {ρε }ε>0 denote the family of mollifiers in R. Choose a sequence εn ↓ 0
and set fn = ρεn ∗ f . Then fn ∈ C ∞ (R) and (cf. Lemma 3.1) fn (t) → f (t)
for all t ∈ R. Now,
R
|fn (t) − fn (t0 )| = |s|<εn (f (t − s) − f (t0 − s))ρεn (s) ds
Thus, it follows that |fn0 (t)| ≤ M for all t ∈ R. We also have that fn ◦u → f ◦u
in Lp (Ω) since for any x ∈ Ω,
Z
|fn (u(x)) − f (u(x))| = (f (u(x) − s) − f (u(x)))ρεn (s) ds ≤ M εn
|s|<εn
17
and so by the dominated convergence theorem, we deduce that the limit in
Lp (Ω) of ∂((f∂xni◦u) is indeed vi for all 1 ≤ i ≤ N , which completes the proof.
Proposition 4.1 Let u ∈ W 1,p (Ω), 1 ≤ p < ∞. Let K ⊂ Ω be compact. If
u vanishes on Ω\K, then u ∈ W01,p (Ω).
Proof: Choose relatively compact sets Ω00 and Ω0 such that
K ⊂ Ω00 ⊂⊂ Ω0 ⊂⊂ Ω.
Let ψ ∈ D(Ω) such that ψ ≡ 1 on Ω00 and such that supp(ψ) ⊂ Ω0 . Then
ψu = u. Let {un } be a sequence in D(RN ) such that un → u in Lp (Ω) and
∂un ∂u
∂xi
→ ∂x i
in Lp of all relatively compact open subsets of Ω.Then ψun ∈ D(Ω)
and ψun → ψu = u in W 1,p (Ω) since all the supports are contained in
Ω0 ⊂⊂ Ω. This shows that u ∈ W01,p (Ω).
Proposition 4.2 Let Ω be bounded and let 1 < p < ∞. If f : R → R is a
Lipschitz continuous function such that f (0) = 0 and whose derivative exists
and is continuous except at a finite number of points, then, if u ∈ W01,p (Ω),we
also have f ◦ u ∈ W01,p (Ω).
Proof: Let {un } be a sequence in D(Ω) converging in W 1,p (Ω) to u. Then,
since f is Lipschitz continuous, we have
5 Poincaré’s inequality
In the previous section, we saw that the existence of an extension operator for
W 1,p (Ω) depended on the nature of the domain. In general, the extension by
zero will not map W 1,p (Ω) into W 1,p (RN ). For example, let Ω = (0, 1) ⊂ R
and consider the function u ≡ 1 in Ω. Then u e ∈ Lp (R) for any 1 ≤ p ≤ ∞.
But
de
u
= δ0 − δ1
dx
18
where δ0 (ϕ) = ϕ(0) and δ1 (ϕ) = ϕ(1) for ϕ ∈ D(R), and this cannot come
from any locally integrable function, as already observed earlier.
We will now show that, irrespective of the nature of the domain, the
extension by zero provides an extension operator from W01,p (Ω) to W 1,p (RN )
for all 1 ≤ p < ∞.
Theorem 5.1 Let 1 ≤ p < ∞. Let u e denote the extension by zero outside Ω
for any function u defined on Ω ⊂ RN . Then, if u ∈ W01,p (Ω), we have that
e ∈ W 1,p (RN ) and that
u
∂e
u ∂u
g
= , 1 ≤ i ≤ N.
∂xi ∂xi
Proof: The extension by zero maps functions in Lp (Ω) into the space Lp (RN ).
Thus it suffices to prove the above formula for the derivatives of u
e.
Since u ∈ W01,p (Ω), let {un } be a sequence in D(Ω) converging to u in
W 1,p (Ω). Let ϕ ∈ D(RN ). Then, by classical integration by parts for smooth
functions, we have
Z Z
∂ϕ ∂un
un dx = − ϕ dx.
Ω ∂xi Ω ∂xi
|u|0,p,Ω ≤ C|u|1,p,Ω
19
where x = (x0 , xN ) and x0 = (x1 , · · · , xN −1 ). Then, by Hölder’s inequality,
p1
∂u 0 p
1
|u(x)| ≤ (x , t) dt |xN + a| p0
∂xN
where p0 = p/(p − 1) is the conjugate exponent. Thus,
∂u 0 p
Z a
p
p
|u(x)| ≤ (2a) p0 (x , t) dt
−a ∂xN
|u|0,p,Ω = |e
u|0,p,Ωe ≤ 2a|e
u|1,p,Ωe = 2a|u|1,p,Ω
∂u ∂u
since e
∂xi
= ∂x
f
i
. This completes the proof.
20
• This inequality also shows that if Ω is bounded, then W 1,p (Ω) and
W01,p (Ω) cannot be equal (which was the case for Ω = RN ). For the
constant function u ≡ 1 on Ω is in W 1,p (Ω) and |u|1,p,Ω = 0 but |u|0,p,Ω >
0. Hence the constant function cannot belong to W01,p (Ω).
|u|1,p,Ω ≤ C|u|2,p,Ω .
|u|0,p,Ω ≤ C 0 |u|2,p,Ω .
6 Imbedding theorems
We know that if u ∈ W 1,p (Ω), then, u ∈ Lp (Ω). We now ask the question if
the information that the first order partial derivatives of u are also in Lp (Ω)
will give us more information on the function u, vis-à-vis its smoothness or
integrability with respect to other exponents. In particular we would like to
∗
know if W 1,p (RN ) is continuously imbedded in Lp (RN ), where p 6= p∗ . In
other words, we are looking for an inequality of the form
21
for all u ∈ W 1,p (RN ), where C > 0 is a constant which depends only on p
and N . A simple analysis will show us when this will be possible, if at all,
and what is the value of p∗ that we should expect. Let λ > 0 be a fixed real
number. Then consider the scaling x 7→ λx which maps RN onto itself. Let
u ∈ W 1,p (RN ). Then if we set uλ (x) = u(λx), it follows that uλ ∈ W 1,p (RN )
as well. We have
∂uλ ∂u
(x) = λ (λx).
∂xi ∂xi
Consequently, by the change of variable formula, we get
∂uλ p
Z Z p Z p
p
∂u p−N
∂u
∂xi (x)
dx = λ
∂xi (λx)
dx = λ ∂xi dx.
(x)
RN R N RN
Thus,
N
|uλ |1,p,RN = λ1− p |u|1,p,RN .
Similarly
N
|uλ |0,p∗ ,RN = λ− p∗ |u|0,p∗ ,RN .
hence, for all λ > 0, we must have
|uλ |1,p,RN N N |u|1,p,RN
0 < C ≤ = λ1− p + p∗ .
|uλ |0,p∗ ,RN |u|0,p∗ ,RN
If 1 − Np + pN∗ is strictly positive, then we let λ → 0 to get a contradiction and
if it is strictly negative, we can let λ → ∞ to get a contradiction. Thus, if
at all we can hope for such an inequality, it follows that we must have that
this number is zero, i.e.
1 1 1
∗
= − .
p p N
For this to be possible, it is necessary that p < N as well. In that case notice
that we also have p∗ > p.
We now split our investigation into three cases, viz., p < N, p = N and
p > N.
Theorem 6.1 (Sobolev’s Inequality) Let 1 ≤ p < N and define p∗ as above.
Then, there exists C = C(p, N ) > 0 such that for all u ∈ W 1,p (RN ),
|u|0,p∗ ,RN ≤ C|u|1,p,RN .
In particular, we have the continuous inclusion
∗
W 1,p (RN ) ,→ Lp (RN )
.
The proof of this result is rather technical and we refer the reader to
Kesavan [1].
Corollary 6.1 Let 1 ≤ p < N . Then we have the continuous inclusions
W 1,p (RN ) ,→ Lq (RN )
for all q ∈ [p, p∗ ].
22
Proof: Let p < q < p∗ . Then, there exists α ∈ (0, 1) such that
1 α 1−α
= + .
q p p∗
p p∗
αq N (1−α)q
Then |u| ∈ L (R ) and |u|
αq ∈ L (RN ) if u ∈ W 1,p (RN ).Thus,
(1−α)q
(The inequality in the second line above comes from the generalised AM-GM
inequality.) This completes the proof.
Corollary 6.2 Let Ω ⊂ RN , and let 1 ≤ p < N . Then, there exists C > 0
such that, for all u ∈ W01,p (Ω),
|u|0,p∗ ,Ω ≤ C|u|1,p,Ω
|u|0,q,Ω ≤ Ckuk1,p,Ω , for all p < q < p∗ .
If Ω = RN 1
+ or if Ω has bounded boundary and is of class C , then we also
have the continuous inclusions
for all p ≤ q ≤ p∗ .
Proof: The extension by zero imbeds W01,p (Ω) into W 1,p (RN ) and now it is
easy to see the conclusions for the space W01,p (Ω). If Ω = RN + or if Ω has
1
bounded boundary and is of class C , then there exists an extension operator
|u|1,p,Ω ≥ S|u|0,p∗ ,Ω .
|u|1,p,Ω
S(p, N, Ω) = inf .
u∈W0 (Ω) |u|0,p∗ ,Ω
1,p
u6=0
23
Now if Ω1 ⊂ Ω2 are two bounded open subsets of RN , then the extension
by zero outside Ω1 maps an element of W01,p (Ω1 ) into W01,p (Ω2 ), keeping
the norms unchanged in their value. It follows from this immediately that
S(p, N, Ω1 ) ≥ S(p, N, Ω2 ). Now let B1 = B(0; r1 ) and B2 = B(0; r2 ) be two
concentric balls centered at the origin. Then the scalings
r2 r1
x 7→ x and x 7→ x
r1 r2
map functions in W01,p (B1 ) into functions in W01,p (B2 ) and vice-versa. It then
follows that
S(p, N, B1 ) = S(p, N, B2 ).
Since the Lebesgue measure is translation invariant, this is also true if the
balls are concentric but are centered elsewhere in RN . Now if Ω is any
bounded open set, we can always find two concentric balls one within Ω and
the other containing Ω. Thus it follows that S(p, N, Ω) is independent of
the domain. In fact for all bounded domains Ω, we have that S(p, N, Ω) =
S(p, N ), where
|u|1,p,RN
S(p, N ) = inf .
u∈W 1,p (RN ) |u|0,p∗ ,RN
u6=0
The value of this best constant has been worked out (independently) by
Aubin and Talenti. When p = 2, the only minimizers of this optimization
problem have been shown to be the functions U (x), Uε (x − x0 ), where ε >
0, x0 ∈ RN and
N −2
U (x) = C(1 + |x|2 )− 2
N −2
Uε (x) = Cε (ε + |x|2 )− 2
where C and Cε are positive normalization constants.
In particular, this shows that for any Ω a bounded open subset of RN ,the
minimization problem stated above can never have a solution. For, if there
were one, then the extension by zero outside Ω would be a minimizer for the
problem in RN as well, but it has been shown that the only minimizers in
RN are the functions U and Uε above and they never vanish anywhere. This
is true for all 1 ≤ p < ∞.
24
Once again, we refer the reader to Kesavan [1] for the proof.
We do not have the inclusion of any of the spaces mentioned in the pre-
ceding theorem in L∞ of the corresponding domain.
We now turn to the case p > N . To motivate the theorem, let us consider
the case N = 1 and p > 1.
Example 6.2 Let 1 < p < ∞. Let I = (0, 1) and consider u ∈ W 1,p (I).
Then u0 ∈ Lp (I) and so is integrable. Thus the function
Z x
u(x) = u0 (t) dt
0
is absolutely continuous and its derivative (both in the classical and distri-
butional sense) is u0 . Thus, we have that (u − u)0 = 0 and it follows that
u = u + c, a.e.
25
In addition, we have, for x, y ∈ I,
Z y
u(x) − u(y) = u0 (t) dt
x
and notice that p10 = 1 − p1 . Thus all the functions in W 1,p (I) are not only
absolutely continuous, but are also Hölder continuous with exponent 1− p1 .
and there exists a constant C = C(N, p) > 0 such that, for all u ∈ W 1,p (RN ),
and for almost all x, y ∈ RN , we have
N
|u(x) − u(y)| ≤ C|u|1,p,RN |x − y|1− p .
1 1 1 1 2
∗∗
= ∗− = − .
p p N p N
More genrally, we have the following result.
1 1 m
W m,p (RN ) ,→ Lq (RN ), = − .
q p N
1 m
(ii) If p
− N
= 0, then
26
1 m
(iii) If p
− N
< 0, then
In the last case, set k to be the integral part and θ to be the fractional part
of m − Np . Then there exists C > 0 such that for all u ∈ W m,p (RN ), we have
Remark 6.1 If m > Np , and if |α| < k, where k is the integral part of
m − Np , it follows that Dα u is Lipschitz continuous, by virtue of the mean
value theorem and the fact that the highest order derivatives are bounded.
7 Compactness Theorems
In the previous section we saw various continuous inclusions of the Sobolev
spaces in the Lebesgue spaces and spaces of smooth functions. We now
investigate the compactness of these inclusions.
For example consider the case p > N and W 1,p (Ω) where Ω ⊂ RN is
bounded. Then we saw (cf. Theorem 6.3) that the functions in W 1,p (Ω)
are in C(Ω) and that they are also Hölder continuous with exponent 1 − p1 .
Thus, it follows that if B is the unit ball in W 1,p (Ω), then the elements of B
are uniformly bounded and that they are equicontinuous as well. Since Ω is
compact, it then follows that B is relatively compact in C(Ω), by the Ascoli-
Arzela theorem. Consequently the inclusion of W 1,p (Ω) in C(Ω) is compact.
This argument works for all the cases covered in Theorem 6.4 as well, when
p > N.
When p ≤ N , we have continuous inclusions into the Lebesgue spaces.
To examine the compactness of these inclusions, we need an analogue of the
Ascoli-Arzela theorem which describes criteria for the relative compactness
of subsets in the Lebesgue spaces. This comes from the theorem of Fréchet
and Kolmogorov. We omit the proofs and refer the reader to Kesavan [1].
27
Theorem 7.1 (Rellich-Kondrasov) Let Ω ⊂ RN be a bounded open set. Let
1 ≤ p < ∞.Then, the following inclusions are compact:
(i) if p < N, W 1,p (Ω) ,→ Lq (Ω), for all 1 ≤ q < p∗ ;
(ii) if p = N , W 1,N (Ω) ,→ Lq (Ω), for all 1 ≤ q < ∞;
(iii) if p > N , W 1,p (Ω) ,→ C(Ω).
Example 7.1 We do not have compactness if Ω is unbounded. For example,
consider Ω = R. Let I = (0, 1) and let Ij = (j, j + 1) for j ∈ Z. Let f be a
C 1 function supported in I. Define
fj (x) = f (x − j), j ∈ Z.
Then all the fj ’s are in W 1,p (R) for any 1 ≤ p < ∞ and the sequence {fj } is
clearly bounded in that space. However, since they all have disjoint supports,
we have that if i 6= j, then
1
|fi − fj |0,q,R = 2 q |f |0,q,R .
Thus the sequence {fj } cannot have a convergent subsequence in any Lq (R)
for 1 ≤ q < ∞ and so none of the inclusions
W 1,p (R) ,→ Lq (R)
can be compact.
where
J(v) = |v|1,Ω .
Denote the infimum by m ≥ 0. Let {vn } be a minimizing sequence. Then, it
is clearly bounded in H01 (Ω) and so it has a weakly convergent subsequence
{vnk } converging (weakly) to, say, v ∈ H01 (Ω). Since the inclusion
∗
H01 (Ω) ,→ L2 (Ω)
is assumed to be compact, it follows that the subsequence converges in norm
∗
in L2 (Ω) and so we have that |v|0,2∗ ,Ω = 1. Now |.|1,Ω is a norm on H01 (Ω)
equivalent to the usual norm (Poincaré’s inequality) and since the norm is
weakly lower semi-continuous, we have that
m ≤ |v|1,Ω ≤ lim inf |vnk |1,Ω = m.
k→∞
28
8 The spaces W s,p(Ω)
In this section, we will try to define the spaces W s,p (Ω) when 1 < p < ∞ and
when s ∈ R is an arbitrary real number. We start with the case of negative
integers.
Proposition 8.1 Let Ω ⊂ RN be an open set and let 1 < p < ∞. Let F
belong to the dual space of W 1,p (Ω) (respectively, W01,p (Ω)). Then, there exist
0
f0 , f1 , · · · , fN ∈ Lp (Ω), where p0 is the conjugate exponent of p, such that,
for all v ∈ W 1,p (Ω) (respectively, v ∈ W01,p (Ω)), we have
Z N Z
X ∂v
F (v) = f0 v dx + fi dx.
Ω i=1 Ω
∂xi
If we define
N
X ∂v
kvk1,p,Ω = |v|0,p,Ω +
∂xi
i=1 0,p,Ω
and of course, kΦk = kF k by definition. Now, for any v ∈ W 1,p (Ω) (respec-
tively, W01,p (Ω)), we have v = S(T (v)) and so
Z N Z
X ∂v
F (v) = Φ(T (v)) = f0 v dx + fi dx.
Ω i=1 Ω
∂xi
29
If Ω is bounded then we can work (thanks to Poincaré’s inequality) with the
isometry T : W01,p (Ω) → (Lp (Ω))N given by
∂v ∂v
T (u) = ,···,
∂x1 ∂xN
and so we may take f0 = 0. This completes the proof.
Now, if ϕ ∈ D(Ω), then, by definition of the distribution derivative,
Z N
X ∂f
F (ϕ) = f0 ϕ dx − < ,ϕ >
Ω i=1
∂x i
where the bracket < ., . > denotes the action of a distribution on an element
of D(Ω). Since D(Ω) is dense in W01,p (Ω), a continuous linear functional on
W01,p (Ω) is completely defined by its action on D(Ω) and so, in this case, we
can identify F with the distribution
N
X ∂fi
f0 − .
i=1
∂xi
This identification is not possible for functionals on W 1,p (Ω) since D(Ω) is not
dense there. Now, if m > 1 is any positive integer, any first order derivative
of an element in W m,p (Ω) falls in W m−1,p (Ω). For consistency, we would like
0
this to be true for m = 0 as well. Since derivatives of Lp (Ω) functions are in
0
the dual of W01,p (Ω), we therefore define this dual space as W −1,p (Ω).
Definition 8.1 Let 1 < p < ∞ and let m ≥ 1 be an integer. Then, the
0
space W −m,p (Ω) is the dual of the space W0m,p (Ω), where p0 is the conjugate
exponent of p.
Let 1 < p < ∞ and let s > 0. Then the spaces W s,p (Ω) can be defined in a
variety of ways when s is not an integer. Usually interpolation theory in Lp
spaces is used. We will not go into these aspects here. The space W0s,p (Ω) will
0
be the closure of D(Ω) in W s,p (Ω) and its dual will be the space W −s,p (Ω)
where p0 is the conjugate exponent of p.
We will henceforth concentrate on the case p = 2. We saw that H m (RN )
can be defined via the Fourier transform. We can immediately generalize
this.
Definition 8.2 Let s > 0 be a real number. Then
s
b(ξ) ∈ L2 (RN )}.
H s (RN ) = {u ∈ L2 (RN ) | (1 + |ξ|2 ) 2 u
The norm in this space is given by
Z 21
kuks,RN = (1 + |ξ|2 )s |b
u(ξ)|2 dξ .
RN
30
Let Ω ⊂ RN be a bounded open set with ‘sufficiently smooth’ boundary ∂Ω
and is such that Ω always lies to the same side of each connected component
of the boundary. Then, at each point of the boundary, there is a neighbour-
hood U and a bijective mapping T : Q → U , where Q is the unit cube in
RN centered at the origin, with the properties given in Definition 4.2. Since
∂Ω is compact, it can be covered by a finite number {Ui }m i=1 of such neigh-
∞
bourhoods. Let {Ti }i=1 be the corresponding maps. Let {ψi }m
m
i=1 be a C
m
partition of unity subordinate to the collection {Ui }i=1 .
Being a smooth (N − 1)-dimensional manifold, ∂Ω can be provided with
the (N − 1)- dimensional surface measure induced on it from RN . Thus, we
can easily define the spaces Lp (∂Ω), for 1 ≤ p ≤ ∞. Now, given u ∈ L2 (Ω),
we can write m
X
u = ψi u.
i=1
The function ψi u is supported inside the open set Ui . Consider the functions
It can be checked that this definition is independent of the choice of the atlas
{Ui }m
i=1 on ∂Ω. We also define
H −s (∂Ω) = (H s (∂Ω))∗
9 Trace theory
Sobolev spaces are the ideal functional analytic setting to study boundary
value problems. In that case given u ∈ W 1,p (Ω), where Ω ⊂ RN is a bounded
open set, we would like to assign meanings expressions like ‘u restricted to
∂Ω’ or ‘the outer normal derivative ∂u ∂ν
of u on ∂Ω’ and so on. But when
u ∈ W 1,p (Ω), it is a priori an element of Lp (Ω) and so is only defined almost
everywhere. Since ∂Ω has measure zero in RN , it is therefore not meaningful
to talk of the values u takes on the boundary.
However, since we have additional information on the derivatives of u,
when u is in some Sobolev space, we can indeed give such notions a meaning
consistent with our intuitive understanding of terms such as boundary value
and exterior normal derivative. We will now make this more precise.
While the theory outlined below can be done for all 1 < p < ∞, for
simplicity, we will restrict ourselves to the case p = 2.
31
N −1
Theorem 9.1 There exists a continuous linear map γ0 : H 1 (RN 2
+ ) → L (R )
1 N N
such that, if v ∈ H (R+ ) is continuous on R+ , then γ0 (v) is the restriction
N −1
of v to ∂RN+ = R .
Proof: Let v ∈ D(RN ). Then
R∞
|v(x0 , 0)|2 = − 0 ∂x∂N (|v(x0 , xN )|2 ) dxN
R ∞ ∂v 0
= −2 0 ∂x N
(x , xN ).v(x0 , xN ) dxN
R ∞ ∂v 0 2
≤ 0 ∂x (x , x ) + |v(x0 , xN )|2 dxN .
N
N
Since we know that (cf. Example 4.1 and Corollary 4.1) the restrictions of
functions in D(RN ) are dense in H 1 (RN
+ ), the above inequality implies that
the map
v 7→ v|RN −1
N −1
extends uniquely to a continuous linear map γ0 from H 1 (RN 2
+ ) into L (R ).
1 N N N
Now let v ∈ H (R+ ) be continuous on R+ . Extend v to all of R by
reflection on RN −1 (cf. Example 4.1). Choose a sequence εm ↓ 0 and let ρεm
be the corresponding mollifiers. Let ζ ∈ D(RN ) be such that 0 ≤ ζ ≤ 1, ζ ≡
1 on B(0; 1) and such that supp(ζ) ⊂ B(0; 2). Define ζk (x) = ζ(x/k). Then
(cf. Lemma 3.1), ρεm ∗ v(x) → v(x) for all x ∈ RN and also, since ζm ≡ 1 on
B(0; m), we have vm (x) → v(x) for all x ∈ RN , where
vm = ζm .(ρεm ∗ v).
We also know that (cf. Theorem 3.1) vm → v in H 1 (RN ) and hence in
N −1
H 1 (RN 2
+ ) as well. Thus γ0 (vm ) → γ0 (v) in L (R ). But, since vm ∈ D(RN ),
it follows that γ0 (vm ) is the restriction of vm to RN −1 and we saw that this
converges pointwise to the restriction of v to RN −1 . Thus, it follows that
γ0 (v) is the restriction of v to RN −1 . This completes the proof.
It can be shown (cf. Kesavan [1]) that the range of γ0 is the space
1
H (RN −1 ) and that its kernel is H01 (RN
2
+ ). Thus we can interpret elements of
1 N 1 N
H0 (R+ ) as those of H (R+ ) which ‘vanish on the boundary’.
In the space H 2 (RN+ ), apart from γ0 , we can imitate the proof of the
N −1
preceding theorem to show the existence of a map γ1 : H 2 (RN 2
+ ) → L (R )
2 N 1 N
such that if v ∈ H (R+ ) ∩ C (R+ ), then
∂v
γ1 (v) = − .
∂xN N −1
R
32
1 3
Its range would be H 2 (RN −1 ) while that of γ0 will be H 2 (∂Ω). The kernel
of the map
3 1
N −1
(γ0 , γ1 ) : H 2 (RN
+ ) → H (R
2 ) × H 2 (RN −1 )
is H0m (RN
+ ).
Let us now turn to the case of a sufficently smooth bounded open set Ω
of RN . Let {Ui }m m
i=1 together with the associated maps {Ti }i=1 be an atlas
m ∞
for the boundary ∂Ω. Let {ψi }i=1 be an associated C partition of unity
subordinate to the collection {Ui }m 1
i=1 . If u ∈ H (Ω), then (after extension by
zero) we have that for each 1 ≤ i ≤ m, the functions (ψi u|Ui ∩Ω )◦Ti ∈ H 1 (RN
+)
1
N −1
and so we can define its trace γ0 as an element of H 2 (R ). Coming back
−1
by Ti , we can define the trace on Ui ∩ ∂Ω. Piecing these together, we get
the trace γ0 u ∈ L2 (∂Ω) and, by our definition of the spaces on ∂Ω, the range
1
will be precisely H 2 (∂Ω). Similarly, we can define higher order traces, which
generalize the notion of exterior normal derivatives of various orders.
Theorem 9.2 (Trace Theorem) Let Ω ⊂ RN be a bounded open set of class
C m+1 . Then there exist maps γ0 , γ1 , · · · , γm−1 from H m (Ω) into L2 (∂Ω) such
that
(i) if v ∈ H m (Ω) is sufficently smooth, then,
∂ m−1 v
∂v
γ0 (v) = v|∂Ω , γ1 (v) = , · · · , γm−1 (v) =
∂ν ∂Ω ∂ν m−1 ∂Ω
(iii) The kernel of the map (γ0 , γ1 , · · · , γm−1 ) is the space H0m (Ω).
33
Proof: We know that C ∞ (Ω) is dense in H 1 (Ω). If {un } and {vn } are
sequences of smooth functions converging in H 1 (Ω) to u and v respectively,
we have, by the classical Green’s theorem,
Z Z Z
∂vm ∂um
um dx = − vm dx + um vm νi dσ
Ω ∂xi Ω ∂xi ∂Ω
for any 1 ≤ i ≤ N . The result now follows on passing to the limit as n → ∞.
If one of them is in H01 (Ω), then the integrand in the boundary integral van-
ishes.
We conclude this section by showing that the kernel of the trace map γ0
is indeed H01 (Ω) in the one-dimensional case.
Theorem 9.4 Let 1 < p < ∞. Let I = (0, 1) ⊂ R. Then
W01,p (I) = {u ∈ W 1,p (I) | u(0) = u(1) = 0}.
Proof: Let u ∈ W01,p (I). Then there exists a sequence {un } in D(I) such
that un → u in W 1,p (I). But this implies that (cf. Example 6.2) that un → u
uniformly on I and so it follows that u(0) = u(1) = 0.
34
References
[1] Kesavan, S. Topics in Functional Analysis and Applications,
New Age International (formerly Wiley-Eastern), 1989.
35