Taylor Series
Taylor Series
Calculus of variations
Lecture notes for MA342H
P. Karageorgis
1 / 43
Introduction
For instance, J(y) could represent area, length, energy and so on.
We are interested in finding the minimum/maximum values of J(y)
over all functions y(x) that satisfy some given constraints.
A function y(x) for which the minimum/maximum value is attained is
called an extremal of J(y). We shall assume that both y(x) and the
Lagrangian function L are twice continuously differentiable.
One may impose various constraints on y(x), but the most common
one amounts to specifying its values at the endpoints x = a, b. This
requires the graph of y(x) to pass through two given points.
2 / 43
Directional derivative
3 / 43
Directional derivative: Proof
4 / 43
Euler-Lagrange equation
d
Ly′ = Ly .
dx
5 / 43
Euler-Lagrange equation: Proof
(x − x1 )3 (x2 − x)3
if x1 ≤ x ≤ x2
φ(x) = .
0 otherwise
y ′ Ly′ − L = constant.
d d
y ′ Ly′ = y ′′ Ly′ + y ′ Ly =
L.
dx dx
8 / 43
Example 1. Shortest path
d y ′ (x)
Ly′ = Ly = 0 =⇒ Ly′ = p = c1 .
dx 1 + y ′ (x)2
This shows that y(x) must be a linear function. In other words, the
shortest path between any two points is given by a line.
9 / 43
Example 2. Minimal surface, page 1
2πy(x)y ′ (x)2
c = y ′ Ly′ − L = p
p
− 2πy(x) 1 + y ′ (x)2
1 + y ′ (x)2
c y(x) y(x)2
= −p =⇒ 1 + y ′ (x)2 = .
2π 1 + y ′ (x)2 a2
10 / 43
Example 2. Minimal surface, page 2
11 / 43
Example 3. Brachistochrone, page 1
Consider a particle that slides along the graph of a function y(x) from
one point to another under the influence of gravity. We assume that
there is no friction and wish to minimise the overall travel time
Z bp
1 + y ′ (x)2 dx
J(y) = .
a v(y(x))
12 / 43
Example 3. Brachistochrone, page 2
Since the Lagrangian does not depend on x directly, one has
p
y ′ (x)2 1 + y ′ (x)2
c = y ′ Ly′ − L = p p −p
2g(y0 − y) 1 + y ′ (x)2 2g(y0 − y)
by the Beltrami identity. We clear denominators to get
p p
c 2g(y0 − y) 1 + y ′ (x)2 = −1
a2
2 1
x0 − x = a θ − sin(2θ) = (φ − sin φ) ,
2 2
a2 a2
y0 − y = a2 sin2 θ = (1 − cos(2θ)) = (1 − cos φ) ,
2 2
where φ = 2θ. These are the parametric equations of a cycloid.
14 / 43
Case 1. Fixed boundary conditions
Suppose that we wish to find the extremals of
Z b
J(y) = L(x, y(x), y ′ (x)) dx
a
for every test function φ. Since the boundary terms vanish, one can
find the possible extremals by solving the Euler-Lagrange equation
d
Ly′ = Ly .
dx
Thus, we get a second-order ODE subject to two boundary conditions.
15 / 43
Case 1. Example
1 = y(0) = b, 3 = y(1) = a + b.
16 / 43
Case 2. Variable boundary conditions
for every function φ. This obviously includes all test functions, so the
Euler-Lagrange equation remains valid, but we must also have
Ly′ = 0 when x = a, b.
d d
Ly′ = Ly , Lz ′ = Lz .
dx dx
In particular, one can find the extremals by solving a system of two
second-order equations subject to four boundary conditions.
19 / 43
Case 3. Example
20 / 43
Case 4. Isoperimetric constraints
Let us denote by I(y) the integral that appears in the last equation.
Then the extremals of J(y) are either critical points of I(y) or else
critical points of J(y) − λI(y) for some Lagrange multiplier λ ∈ R.
In particular, one has to solve the Euler-Lagrange equation for two
different Lagrangians, namely M and also L − λM .
21 / 43
Case 4. Example, page 1
Let us denote by I(y) the integral that appears in the last equation.
Its critical points must satisfy the Euler-Lagrange equation
d
L = y(x) sin x =⇒ Ly′ = Ly =⇒ 0 = sin x
dx
and this means that I(y) has no critical points at all.
22 / 43
Case 4. Example, page 2
To find the critical points of J(y) − λI(y), we note that
d
L = y ′ (x)2 − λy(x) sin x =⇒ Ly′ = Ly
dx
=⇒ 2y ′′ = −λ sin x.
Then the extremal y(x) must be either a critical point of I(y) or else
a critical point of J(y) − λI(y) for some Lagrange multiplier λ ∈ R.
25 / 43
Lagrange multipliers: Proof, page 2
26 / 43
Second variation
The first variation of J(y) is defined as the limit
J(y + εφ) − J(y)
J ′ (y)φ = lim
ε→0 ε
and one can use a Taylor series expansion to derive the formula
Z b
J ′ (y)φ = Ly φ + Ly′ φ′ dx.
a
and one can use a Taylor series expansion to derive the formula
Z b
′′
J (y)φ = Lyy φ2 + 2Lyy′ φφ′ + Ly′ y′ (φ′ )2 dx.
a
27 / 43
Second variation: Sketch of proof
28 / 43
Necessary condition
ε2 ′′
J(y + εφ) = J(y) + εJ ′ (y)φ + J (y)φ + . . .
2
and the fact that J ′ (y)φ = 0 for all critical points of J(y).
It may happen that J ′′ (y)φ > 0 for all functions φ, even though J(y)
has no local minimum. In particular, the given condition is necessary
but not sufficient for the existence of a local minimum.
29 / 43
Necessary condition: Sketch of proof
ε2 ′′ ε2
J(y + εφ) = J(y) + εJ ′ (y)φ + J (y)φ + R(y, ε, φ)
2 2
for some remainder term R such that R(y, ε, φ) → 0 as ε → 0.
Since y(x) is a point of a local minimum, this implies that
ε2 ′′ ε2
0 ≤ J(y + εφ) − J(y) = J (y)φ + R(y, ε, φ)
2 2
for all small enough ε. Letting ε → 0, we conclude that
J ′′ (y)φ = lim J ′′ (y)φ + R(y, ε, φ) ≥ 0.
ε→0
30 / 43
Legendre condition
xy ′ (x) x
Ly′ = p =⇒ Ly ′ y ′ = .
1 + y ′ (x)2 (1 + y ′ (x)2 )3/2
Since Ly′ y′ changes sign on the given interval, one finds that J(y)
has neither a local minimum nor a local maximum.
31 / 43
Legendre condition: Sketch of proof, page 1
If it happens that Ly′ y′ < 0 at some point, then Ly′ y′ < 0 on some
interval [x0 − ε, x0 + ε] by continuity. Consider the test function
( )
sin3 π(x−xε
0)
if |x − x 0 | ≤ ε
φ(x) = .
0 otherwise
This function is bounded for all ε > 0, but its derivative becomes
unbounded as ε → 0. One thus expects the second variation
Z b
′′
J (y)φ = Lyy φ2 + 2Lyy′ φφ′ + Ly′ y′ (φ′ )2 dx
a
32 / 43
Legendre condition: Sketch of proof, page 2
33 / 43
Poincaré inequality
34 / 43
Sufficient condition
for all test functions φ. Then J(y) attains a local minimum at y(x).
35 / 43
Sufficient condition: Sketch of proof, page 1
We use Taylor’s theorem with remainder to write
ε2 b
Z
J(y + εφ) = J(y) + Lyy φ2 + 2Lyy′ φφ′ + Ly′ y′ (φ′ )2 dx
2 a
with the second derivatives of L evaluated at a point of the form
ε2 ′′
J(y + εφ) = J(y) + J (y)φ
2
ε2 b
Z
+ R1 φ2 + 2R2 φφ′ + R3 (φ′ )2 dx
2 a
for some functions R1 , R2 , R3 which approach zero as ε → 0. We
now estimate the integral that appears on the right hand side.
36 / 43
Sufficient condition: Sketch of proof, page 2
In this case, the critical points are lines, namely functions y(x) whose
derivative is constant, say y ′ (x) = c. One can easily check that
1 1
Ly′ y′ = = =δ
(1 + y ′ (x)2 )3/2 (1 + c2 )3/2
for some constant δ > 0, while Lyy = Lyy′ = 0. This implies that
Z b
′′
J (y)φ = δφ′ (x)2 dx,
a
x∗ = x + ε, y∗ = y.
x∗ = x, y∗ = y + ε.
39 / 43
Noether’s theorem
Q = α L − y ′ Ly′ + βLy′
∂x∗ ∂y∗
α= , β= .
∂ε ε=0 ∂ε ε=0
40 / 43
Noether’s theorem: Sketch of proof, page 1
L∗ = L + ∆x · Lx + ∆y · Ly + ∆y ′ · Ly′ .
41 / 43
Noether’s theorem: Sketch of proof, page 2
42 / 43
Noether’s theorem: Sketch of proof, page 3
Q = αL + (β − αy ′ )Ly′
43 / 43