Convex Functions - Pages From Royden-Fitzpatrick-130-134
Convex Functions - Pages From Royden-Fitzpatrick-130-134
Definition A real-valued function rp on (a, b) is said to be convex provided for each pair of
points Xl. X2 in (a, b) and each A with 0 ~ A ~ 1,
(38)
If we look at the graph of rp, the convexity inequality can be fonnulated geometrically by
saying that each point on the chord between (Xl, rp( Xl» and (X2, rp( X2) ) is above the graph
of rp.
Observe that for two points Xl < X2 in (a, b), each point X in (Xl, X2) may be expres-
sed as
X2 -x
X = Axl + (1- A)X2 where A = - - .
X2 - Xl
Thus the convexity inequality may be written as
rp(x) ~ X2 --
[- X ] rp(xt} + [ -
x-- Xl ] rp(X2) for Xl < X < X2 in (a, b).
X2 - Xl X2 -Xl
(39)
Therefore convexity may also be fonnulated geometrically by saying that for Xl < X < X2, the
slope of the chord from (Xl, rp(Xl» to (X, rp(x» is no greater than the slope of the chord
from (x, rp(x» to (X2, rp(X2».
Proof Let Xl, X2 be in (a, b) with Xl < X2, and let X belong to (Xl. X2). We must show that
However, apply the Mean Value Theorem to the restriction of rp to each of the intervals
[Xl, X] and [x, X2] to choose points q E (Xl. x) and C2 E (x, X2) for which
D
Section 6.6 Convex Functions 131
Example Each of the following three functions is convex since each has a nonnegative
second derivative:
q>(X) = xP on (0, 00) for p 2: 1; q>(x) = IF on (-00, 00); q>(x) = In(1/x) on (0, 00).
The following final geometric reformulation of convexity will be useful in the estab-
lishment of differentiability properties of convex functions.
The Chordal Slope Lemma Let II' be convex on (a, b). If Xl < X < X2 belong to (a, b), then
for PI = (Xl> q>(xt}), P = (X, q>(x», P2 = (X2, q>(X2»,
we denote this limit by g'(xo) and call it the left-hand derivative of gat Xo. Similarly, we
define g' (x6) and call it the right-hand derivative of g at Xo. Of course, g is differentiable at
Xo if and only if it has left-hand and right-hand derivatives at Xo that are equal. The continuity
and differentiability properties of convex functions follow from the following lemma, whose
proof follows directly from the Chordal Slope Lemma.
Lemma 16 Let II' be a convex function on (a, b). Then II' has left-hand and right-hand
derivatives at each point X E (a, b). Moreover, for points u, v in (a, b) with u < v, these
one-sided derivatives satisfy the following inequality:
CoroUary 17 Let cp be a convex function on (a, b). Then cp is Lipschitz, and therefore
absolutely continuous, on each closed, bounded subinterval [c, d] of (a, b).
(41)
and therefore
Icp(u) - cp( v)1 ::; Mlu - vi for all u, v E [c, d],
where M = max{lcp'(c+)I, Icp'(d-)I}. Thus the restriction of cp to [u, v] is Lipschitz. A
Lipschitz function on a closed, bounded interval is absolutely continuous. 0
We infer from the above corollary and Corollary 6 that any convex function defined
on an open interval is differentiable almost everywhere on its domain. In fact, much more
can be said.
(42)
Proof Define a = f~ f(x)dx. Choose m to lie between the left-hand and right-hand
derivative of Ip at the point a. Then y = m( t - a) + Ip( a) is the equation of a supporting line
at (a, Ip( a» for the graph of Ip. Hence
= [f
m f(x)dx - a] + Ip(a} = Ip(a}. D
A few words regarding the assumption, for Jensen's Inequality, of the integrability of
Ip 0 f over [0, 1] are in order. We have shown that a convex function is continuous and
therefore Proposition 7 of Chapter 3 tells us that the composition Ip 0 f is measurable if Ip is
convex and f is integrable. If Ip 0 f is nonnegative, then it is unnecessary to assume the Ip 0 f
is integrable since equality (42) trivially holds if the right-hand integral equals +00. In the
case Ip 0 f fails to be nonnegative, if there are constants ct and C2 for which
(43)
then we infer from the integral comparison test that Ip 0 f is integrable over [0, 1] if f is. In
the absence of the growth assumption (43), the function Ip 0 f may not be integrable over
[0, 1] (see Problem 71).
PROBLEMS
61. Show that a real-valued function Ip on (a, b) is convex if and only if for points Xl, ... , xn in
(a, b) and nonnegative numbers At. ... , An such that ~LI Ak = 1,
63. A function on a general interval I is said to be convex provided it is continuous on I and (38)
holds for all Xl, X2 E I. Is a convex function on a closed, bounded interval [a, b] necessarily
Lipschitz on [a, b]?
64. Let Ip have a second derivative at each point in (a, b). Show that Ip is convex if and only if Ip"
is nonn~ive.
°
65. Suppose a :::: and b > 0. Show that the function Ip( t) = (a + bt)P is convex on [0, 00) for
1 S p < 00.
66. For what functions Ip is Jensen's Inequality always an equality?
67. State and prove a version of Jensen's Inequality on a general closed, bounded interval [a, b].
68. Let f be integrable over [0, 1]. Show that
69. Let {an} be a sequence of nonnegative numbers whose sum is 1 and Un} a sequence of positive
numbers. Show that
00 00
IT ~. S ~ an Cn·
n=l n=l
70. Let g be a positive measurable function on [0, 1]. Show that log (J~ g( X ) dx) :::: J~ log (g( x ) ) dx
whenever each side is defined.
71. (Nemytskii) Let Ip be a continuous function on R. Show that if there are constants for which
(45) holds, then Ip 0 f is integrable over [0, 1] whenever f is. Then show that if Ip 0 f is
integrable over [0, 1] whenever f is, then there are constants C1 and C2 for which (45) holds.