2015 Book SchwingerSQuantumActionPrincip
2015 Book SchwingerSQuantumActionPrincip
Kimball A. Milton
Schwinger’s Quantum
Action Principle
From Dirac’s Formulation
Through Feynman’s Path
Integrals, the Schwinger-
Keldysh Method,
Quantum Field Theory, to
Source Theory
123
SpringerBriefs in Physics
Editorial Board
Egor Babaev, University of Massachusetts, Massachusetts, USA
Malcolm Bremer, University of Bristol, Bristol, UK
Xavier Calmet, University of Sussex, Sussex, UK
Francesca Di Lodovico, Queen Mary University of London, London, UK
Maarten Hoogerland, University of Auckland, Auckland, New Zealand
Eric Le Ru, Victoria University of Wellington, Wellington, New Zealand
Hans-Joachim Lewerenz, California Institute of Technology, Pasadena, USA
James Overduin, Towson University, Towson, USA
Vesselin Petkov, Concordia University, Concordia, Canada
Charles H.-T. Wang, University of Aberdeen, Aberdeen, UK
Andrew Whitaker, Queen’s University Belfast, Belfast, UK
More information about this series at http://www.springer.com/series/8902
Kimball A. Milton
Schwinger’s Quantum
Action Principle
From Dirac’s Formulation Through
Feynman’s Path Integrals,
the Schwinger-Keldysh Method,
Quantum Field Theory, to Source Theory
123
Kimball A. Milton
Homer L. Dodge Department of Physics and
Astronomy
The University of Oklahoma
Norman
USA
Starting from the earlier notions of stationary action principles, we show how Julian
Schwinger’s Quantum Action Principle descended from Dirac’s formulation, which
independently led Feynman to his path-integral formulation of quantum mechanics.
The connection between the two is brought out, and applications are discussed. The
Keldysh-Schwinger time-cycle method of extracting matrix elements in nonequi-
librium situations is described. The variational formulation of quantum field theory
and the development of source theory constitute the latter part of this work. In this
document, derived from Schwinger’s lectures over four decades, the continuity of
concepts, such as that of Green’s functions, becomes apparent.
v
Contents
1 Historical Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
vii
viii Contents
References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
Chapter 1
Historical Introduction
Variational principles for dynamical systems have a long history. Although precursors
go back at least to Leibnitz (see for example (Euler 1752)) and Euler (1744) the
“principle of least action” was given modern form by de Maupertuis (1744, 1746).
We will not attempt to trace the history here; a brief useful account is given in
Sommerfeld’s lectures (Sommerfeld 1964). The most important names in the history
of the development of dynamical systems, or at least those that will bear most directly
on the following discussion, are those of Lagrange (1788) and Hamilton (1834, 1835).
Here we are concentrating on the work of Julian Schwinger (1918–1994), who
had profound and pervasive influence on 20th century physics, and whose many
students have become leaders in diverse fields.1 For biographical information about
his life and work see (Mehra 2000; Milton 2007). Therefore, we will take up the
story in the modern era. Shortly after Dirac’s work with Fock and Podolsky (Dirac
1932), in which the demonstration of the equivalence between his theory of quantum
electrodynamics, and that of Heisenberg and Pauli (Heisenberg 1929), P. A. M.
Dirac wrote a paper on “The Lagrangian in Quantum Mechanics” (Dirac 1933).
This paper had a profound influence on Richard Feynman’s doctoral dissertation at
Princeton on “The Principles of Least Action in Quantum Mechanics” (Feynman
1942), and on his later work on the formulations of the “Space-Time Approach to
Quantum Electrodynamics” (Feynman 1949). Dirac’s paper further formed the basis
for Schwinger’s development of the quantum action principle, which first appeared
in his final operator field formulation of quantum field theory (Schwinger 1951b),
which we will describe in Chap. 6.
The response of Feynman and Schwinger to Dirac’s inspiring paper was
completely different. Feynman was to give a global “solution” to the problem of
1 For complex reasons, Schwinger’s influence on modern physics is not widely appreciated. His
contributions to our current understanding of nature are underrepresented in textbooks, with some
notable exceptions (Toms 2007).
© The Author(s) 2015 1
K.A. Milton, Schwinger’s Quantum Action Principle,
SpringerBriefs in Physics, DOI 10.1007/978-3-319-20128-3_1
2 1 Historical Introduction
This section grew out of lectures given by Schwinger at UCLA around 1974,
which were substantially transformed into Chap. 8 of Classical Electrodynamics
(Schwinger 1998). (Remarkably, considering his work on waveguide theory during
World War II, now partially recorded in Ref. (Milton 2006), he never gave lectures
on this subject at Harvard after 1947.)
We start by reviewing and generalizing the Lagrange-Hamilton principle for a
single particle. The action, W12 , is defined as the time integral of the Lagrangian,
L, where the integration extends from an initial configuration or state at time t2 to a
final state at time t1 : t1
W12 = dt L . (2.1)
t2
The integral refers to any path, any line of time development, from the initial to
the final state, as shown in Fig. 2.1. The actual time evolution of the system is
selected by the principle of stationary action: In response to infinitesimal variations
of the integration path, the action W12 is stationary—does not have a corresponding
infinitesimal change—for variations about the correct path, provided the initial and
final configurations are held fixed,
δW12 = 0. (2.2)
This means that, if we allow infinitesimal changes at the initial and final times,
including alterations of those times, the only contribution to δW12 then comes from
the endpoint variations, or
δW12 = G 1 − G 2 , (2.3)
time ↑
• t2
state variables →
of the action principle, where, for simplicity, we will restrict our attention to a single
particle.
Here, the independent variables are r and t, so that two kinds of variations can be
considered. First, a particular motion is altered infinitesimally, that is, the path is
changed by an amount δr:
r(t) → r(t) + δr(t). (2.5)
Second, the final and initial times can be altered infinitesimally, by δt1 and δt2 ,
respectively. It is more convenient, however, to think of these time displacements as
produced by a continuous variation of the time parameter, δt (t),
t → t + δt (t), (2.6)
Because of this redefinition of the time variable, the limits of integration in the action,
1 1 (dr)2
W12 = m − dt V , (2.10)
2 2 dt
are not changed, the time displacement being produced through δt (t) subject to (2.7).
The resulting variation in the action is now
2
1 ∂
· ·
dr d dδt 1 dr
δW12 = dt m δr − δr ∇V − m + V − δt V
2 dt dt dt 2 dt ∂t
1 2
=
2
dt
d
dt
m
dr
dt
·
δr −
1
2
m
dr
dt
+ V δt
2
d2 ∂
·
+ δr −m 2 r − ∇V + δt
dt
d 1
dt 2
m
dr
dt
+V − V
∂t
,
(2.11)
where, in the last form, we have integrated by parts in order to isolate δr and δt.
Because δr and δt are independent variations, the principle of stationary action
implies that the actual motion is governed by
d2
m r = − ∇V, (2.12a)
dt 2
2
d 1 dr ∂
m +V = V, (2.12b)
dt 2 dt ∂t
while the total time derivative gives the change at the endpoints,
·
G = p δr − Eδt, (2.12c)
with
2
dr 1 dr
momentum = p = m , energy = E = m + V. (2.12d)
dt 2 dt
Therefore, we have derived Newton’s second law [the equation of motion in second-
order form], (2.12a), and, for a static potential, ∂V /∂t = 0, the conservation of
energy, (2.12b). The significance of (2.12c) will be discussed later in Sect. 2.4.
6 2 Review of Classical Action Principles
Using the above definition of the momentum, we can rewrite the Lagrangian as
L=p · dr
dt
− H (r, p, t), (2.13)
p2
H= + V (r, t). (2.14)
2m
We are here to regard r, p, and t as independent variables in
1
W12 =
2
·
[p dr − dt H ]. (2.15)
dr ∂ H p
= = , (2.17a)
dt ∂p m
dp ∂H
=− = −∇V, (2.17b)
dt ∂r
dH ∂H
= , (2.17c)
dt ∂t
·
G = p δr − H δt. (2.17d)
In contrast with the Lagrangian differential equations of motion, which involve sec-
ond derivatives, these Hamiltonian equations contain only first derivatives; they are
called first-order equations. They describe the same physical system, because when
(2.17a) is substituted into (2.17b), we recover the Lagrangian-Newtonian equation
(2.12a). Furthermore, if we insert (2.17a) into the Hamiltonian (2.14), we identify
H with E. The third equation (2.17c) is then identical with (2.12b). We also note the
equivalence of the two versions of G.
2.2 Hamiltonian Viewpoint 7
But probably the most direct way of seeing that the same physical system is
involved comes by writing the Lagrangian in the Hamiltonian viewpoint as
2
m dr 1 dr 2
L= −V − p−m . (2.18)
2 dt 2m dt
dr
p=m . (2.19)
dt
But, if we accept this as the definition of p, the corresponding term in L disappears
and we explicitly regain the Lagrangian description. We are justified in completely
omitting the last term on the right side of (2.18), despite its dependence on the
variables r and t, because of its quadratic structure. Its explicit contribution to δL is
−
1
m
p−m
dr
dt
· d
δp − m δr + m
dt
dr dδt
dt dt
, (2.20)
and the equation supplied by the stationary action principle for p variations, (2.19),
also guarantees that there is no contribution here to the results of r and t variations.
Here we take r, p, and the velocity, v, as independent variables, so that the Lagrangian
is written in the form
L=p · dr
dt
1
− v + mv 2 − V (r, t) ≡ p
2
dr
dt
·
− H (r, p, v, t), (2.21)
where
1
·
H (r, p, v, t) = p v − mv 2 + V (r, t).
2
(2.22)
dp ∂H
=− = −∇V, (2.24a)
dt ∂r
dr ∂ H
= = v, (2.24b)
dt ∂p
∂H
0= − = −p + mv, (2.24c)
∂v
dH ∂H
= , (2.24d)
dt ∂t
·
G = p δr − H δt. (2.24e)
Notice that there is no equation of motion for v since dv/dt does not occur in the
Lagrangian, nor is it multiplied by a time derivative. Consequently, (2.24c) refers to
a single time and is an equation of constraint.
From this third approach, we have the option of returning to either of the other
two viewpoints by imposing an appropriate restriction. Thus, if we write (2.22) as
p2 1
H (r, p, v, t) = + V (r, t) − (p − mv)2 , (2.25)
2m 2m
and we adopt
1
v= p (2.26)
m
as the definition of v, we recover the Hamiltonian description, (2.13) and (2.14).
Alternatively, we can present the Lagrangian (2.21) as
2 2
L=
m
2
dr
dt
− V + (p − mv) · dr
dt
−v −
m dr
2 dt
−v . (2.27)
dr
v= , p = mv, (2.28)
dt
the resultant form of L is that of the Lagrangian viewpoint, (2.4). It might seem that
only the definition v = dr/dt, inserted in (2.27), suffices to regain the Lagrangian
description. But then the next to last term in (2.27) would give the following additional
2.3 A Third, Schwingerian, Viewpoint 9
There is more content to the principle of stationary action than equations of motion.
Suppose one considers a variation such that
δW12 = 0, (2.30)
independently of the choice of initial and final times. We say that the action, which is
left unchanged, is invariant under this alteration of path. Then the stationary action
principle (2.3) asserts that
δW12 = G 1 − G 2 = 0, (2.31)
or, there is a quantity G(t) that has the same value for any choice of time t; it is
conserved in time. A differential statement of that is
d
G(t) = 0. (2.32)
dt
The G functions, which are usually referred to as generators, express the interrelation
between conservation laws and invariances of the system.
Invariance implies conservation, and vice versa. A more precise statement is the
following:
If there is a conservation law, the action is stationary under an infinitesimal transformation
in an appropriate variable.
Fig. 2.2 δω × r is δω × r
perpendicular to δω and r,
δω
and represents an r
infinitesimal rotation of r
about the δω axis
Then the variation (which as a rigid displacement in time, amounts to a shift in the
time origin)
δt = constant (2.34)
will give δW12 = 0 [see the first line of (2.16), with δr = 0, δp = 0, dδt/dt = 0,
∂ H/∂t = 0]. The conclusion is that G in (2.17d), which here is just
G t = −H δt, (2.35)
δr = δω × r, (2.37)
where the constant vector δω gives the direction and magnitude of the rotation (see
Fig. 2.2). Now specialize (2.14) to
p2
H= + V (r ), (2.38)
2m
where r = |r|, a rotationally invariant structure. Then
1
W12 =
2
·
[p dr − H dt] (2.39)
·
G δω = p δr = δω r × p. · (2.40)
d
L = r × p, L = 0. (2.41)
dt
Of course, this is also contained within the equation of motion,
d ∂V
L = −r × ∇V = −r×r̂ = 0, (2.42)
dt ∂r
since V depends only on |r|.
Conservation of linear momentum appears analogously when there is invariance
under a rigid translation. For a single particle, (2.17b) tells us immediately that p is
conserved if V is a constant, say zero. Then, indeed, the action
1 p2
W12 =
2
·
p dr −
2m
dt (2.43)
δr = δ = constant, (2.44)
and
G δ = p δ · (2.45)
is conserved. But the general principle acts just as easily for, say, a system of two
particles, a and b, with Hamiltonian
pa2 p2
H= + b + V (ra − rb ). (2.46)
2m a 2m b
· ·
G δ = pa δra + pb δrb = (pa + pb ) δ · (2.49)
12 2 Review of Classical Action Principles
d
P = pa + pb , P = 0. (2.50)
dt
Something a bit more general appears when we consider a rigid translation that
grows linearly in time:
δra = δrb = δv t, (2.51)
using the example of two particles. This gives each particle the common additional
velocity δv, and therefore must also change their momenta,
The action is not invariant; its variation has end-point contributions. But there is still
·
a conservation law, not of G = P δvt, but of N δv, where ·
N = Pt − (m a ra + m b rb ). (2.54)
m a ra + m b rb
R= , M = ma + mb, (2.55)
M
the statement of conservation of
N = Pt − MR, (2.56)
namely
dN dR
0= =P−M , (2.57)
dt dt
is the familiar fact that the center of mass of an isolated system moves at the constant
velocity given by the ratio of the total momentum to the total mass of that system.
2.5 Nonconservation Laws: The Virial Theorem 13
The action principle also supplies useful nonconservation laws. Consider, for constant
δλ,
δr = δλr, δp = −δλp, (2.58)
·
which leaves p dr invariant,
· ·
δ(p dr) = (−δλp) dr + p (δλdr) = 0. · (2.59)
p2
H = T ( p) + V (r), T ( p) = , (2.60)
2m
is given by the noninvariant form
δ H = δλ(−2T + r ∇V ). · (2.61)
Therefore we have, for an arbitrary time interval, for the variation of the action (2.15),
1 1
· ·
d
δW12 = dt[δλ(2T − r ∇V )] = G 1 − G 2 = dt (p δλr) (2.62)
2 2 dt
·
d
(r p) = 2T + V. (2.65)
dt
We apply this to a bound system produced by a force of attraction. On taking the
time average of (2.65) the time derivative term disappears. That is because, over an
·
arbitrarily long time interval τ = t1 − t2 , the value of r p(t1 ) can differ by only a
·
finite amount from r p(t2 ), and
d
(r p) ≡·
1 t1
dt
d
r p=· ·
r p(t1 ) − r p(t2 ) ·
→ 0, (2.66)
dt τ t2 dt τ
14 2 Review of Classical Action Principles
2T = −V , (2.67)
·
Again p dr is invariant:
·
δ(p dr) = −δλ
p rp r
−
· · dr + p · δλ
dr r dr
− δλr 3
·= 0, (2.69)
r r3 r r
d
r L2
dt r
·
p =
mr 3
−
dV
dr
, (2.71)
which, when applied to the Coulomb potential, gives the bound-state time average
relation
L2 1 V
3
=− . (2.72)
m r r
This chapter again grew out of Schwinger’s UCLA lectures. These evolved, tortur-
ously, into Chap. 9 of Classical Electrodynamics (Schwinger 1998). Here we use
Gaussian units.
It was stated in our review of mechanical action principles in the previous chapter that
the third viewpoint, which employs the variables r, p, and v, was particularly conve-
nient for describing electromagnetic forces on charged particles. With the explicit,
and linear, appearance of v in what plays the role of the potential function when
magnetic fields are present, we begin to see the basis for that remark. Indeed, we
have only to consult (2.21) to find the appropriate Lagrangian:
· ·
dr 1 e
L=p − v + mv 2 − eφ + v A, (3.1)
dt 2 c
where φ and A are the scalar and vector potentials, respectively. To recapitulate, the
equations resulting from variations of p, r, and v are, respectively,
dr
= v, (3.2a)
dt
·
d 1
p = − e∇ φ − v A , (3.2b)
dt c
e
p = mv + A. (3.2c)
c
We can now move to either the Lagrangian or the Hamiltonian formulation. For the
first, we simply adopt v = dr/dt as a definition (but see the discussion in Sect. 2.3)
and get
2
·
1 dr e dr
L= m − eφ + A. (3.3)
2 dt c dt
1 e
v= p− A , (3.4)
m c
and find
L =p · dr
dt
− H, (3.5a)
1 e 2
H= p − A + eφ. (3.5b)
2m c
this introduces the Lagrange function, or Lagrangian density, L. The total Lagrangian
must be the sum of the particle part, (3.1), and the field part, (3.6), where the latter
must be chosen so as to give the Maxwell equations, in Gaussian units:
1 ∂
·
4π
∇×B= E+ j, ∇ E = 4πρ, (3.7a)
c ∂t c
1 ∂
−∇ × E =
c ∂t
B, ∇ B = 0. · (3.7b)
The homogeneous equations here are equivalent to the construction of the electro-
magnetic field in term of potentials, or,
1 ∂
A = − E − ∇φ, (3.8a)
c ∂t
B = ∇ × A. (3.8b)
3.2 Electrodynamic Action 17
Thus, we recognize that A(r, t), E(r, t), in analogy with r(t), p(t), obey equations
of motion while φ(r, t), B(r, t), as analogues of v(t), do not. There are enough
clues here to give the structure of Lfield , apart from an overall factor. The anticipated
complete Lagrangian for microscopic electrodynamics is
L= pa ·dra
dt
1 ea
− va + m a va2 − ea φ(ra ) + va
2 c
· A(ra )
a
1 ∂
· ·
1 1 2
+ (dr) E − A − ∇φ − B ∇ × A + (B − E 2 ) . (3.9)
4π c ∂t 2
The terms that are summed in (3.9) describe the behavior of charged particles
under the influence of the fields, while the terms that are integrated describe the field
behavior. The independent variables are
ra (t), va (t), pa (t), φ(r, t), A(r, t), E(r, t), B(r, t), t. (3.10)
We now look at the response of the Lagrangian to variations in each of these variables
separately, starting with the particle part:
· · ·
d dpa va
δra : δL = (δra pa ) + δra − − ∇ a ea φ(ra ) − A(ra ) ,
dt dt c
(3.11a)
·
ea
δva : δL = δva −pa + m a va + A(ra ) , (3.11b)
c
δL = δpa ·
dra
δpa : − va . (3.11c)
dt
so that
ea
· 1
−ea φ(ra ) + va A(ra ) = (dr) −ρφ + j A .
c c
· (3.14)
a
The volume integrals extend over sufficiently large regions to contain all the fields of
interest. Consequently, we can integrate by parts and ignore the surface terms. The
responses of the Lagrangian (3.9) to field variations, and the corresponding equations
of motion deduced from the action principle are
·
1
δφ : δL = (dr) δφ(∇ E − 4πρ), (3.15a)
4π
·
∇ E = 4πρ,
(3.15b)
δA : δL = −
1 d
4π c dt
(dr) δA E ·
1 ∂E 4π
+
1
4π
(dr) δA
c ∂t
+
c
·
j−∇×B , (3.15c)
1 ∂ 4π
∇×B= E+ j, (3.15d)
c ∂t c
1 ∂
·
1
δE : δL = (dr) δE − A − ∇φ − E , (3.15e)
4π c ∂t
1 ∂
E= − A − ∇φ, (3.15f)
c∂t
·
1
δB : δL = (dr) δB (−∇ × A + B), (3.15g)
4π
B = ∇ × A. (3.15h)
We therefore recover Maxwell’s equations, two of which are implicit in the con-
struction of E and B in terms of potentials. By making a time variation of the action
[variations due to the time dependence of the fields vanish by virtue of the stationary
action principle—that is, they are already subsumed in Eqs. (3.15)],
d dH
δt : δW = dt (−H δt) + δt , (3.16)
dt dt
·
ea 1
H= pa − A(ra ) va − m a va2 + ea φ(ra )
a
c 2
+
1
4π
· 1
·
(dr) E ∇φ + B ∇ × A + (E 2 − B 2 ) ,
2
(3.17)
which is a constant of the motion, d H/dt = 0. The generators are inferred from the
total time derivative terms in (3.11a), (3.15c), and (3.16),
δW12 = G 1 − G 2 , (3.18a)
to be
· ·
1
G= δra pa − (dr) E δA − H δt. (3.18b)
a
4π c
3.3 Energy
1 ea 2 2
E + B2
H= pa − A(ra ) + ea φ(ra ) + (dr) − ρφ .
a
2m a c 8π
(3.20)
The first term here is the energy of the particles moving in the field [particle energy—
see (3.5b)], so we might call the second term the field energy. The ambiguity of these
terms (whether the potential energy of particles is attributed to them or to the fields,
or to both) is evident from the existence of a simpler form of the Hamiltonian
1 ea 2 E 2 + B2
H= pa − A(ra ) + (dr) , B = ∇ × A, (3.21)
a
2m a c 8π
where we have used the equivalence of the two terms involving φ, given in (3.14).
This apparently startling result suggests that the scalar potential has disappeared
from the dynamical description. But, in fact, it has not. If we vary the Lagrangian
(3.19), where H is given by (3.21), with respect to E we find
20 3 Classical Field Theory—Electrodynamics
1 ∂
·
1
δL = − (dr) δE A + E = 0. (3.22)
4π c ∂t
Do we conclude that 1c ∂t∂ A + E = 0? That would be true if the δE(r, t) were arbitrary.
They are not; E is subject to the restriction—the constraint—(3.15b), which means
that any change in E must obey
∇ δE = 0. · (3.23)
The proper conclusion is that the vector multiplying δE in (3.22) is the gradient of a
scalar function, just as in (3.15f),
1 ∂
A + E = −∇φ, (3.24)
c ∂t
for that leads to
δL = −
1
4π
·
(dr) (∇ δE)φ = 0, (3.25)
as required.
The fact that the energy is conserved,
dH
= 0, (3.26)
dt
where
1 E 2 + B2
H= m a va2 + (dr) U, U = , (3.27)
a
2 8π
is a simple sum of particle kinetic energy and integrated field energy density, can be
verified directly by taking the time derivative of (3.20). The time rate of change of
the particle energy is computed directly:
d 1 ∂
·
ea
m a va2 + ea φ(ra ) = ea φ(ra ) − va A(ra ) . (3.28)
dt a 2 a
∂t c
We can compute the time derivative of the field energy by using the equation of
energy conservation,
d
dt
(dr) U = − (dr) j E, ·
(3.29)
to be
E 2 + B2 ∂ ∂
·
d
(dr) − ρφ = (dr) −j E − φ ρ − ρ φ
dt 8π ∂t ∂t
3.3 Energy 21
∂ 1 ∂
=− (dr) ρ φ − j
∂t c ∂t
A ·
∂ ∂
=− ea
∂t
1
φ(ra ) − va
c ∂t
·
A(ra ) .
a
(3.30)
by charge conservation. Observe that (3.28) and (3.30) are equal in magnitude and
opposite in sign, so that their sum is zero. This proves the statement of energy
conservation (3.26).
The action principle not only provides us with the field equations, particle equations
of motion, and expressions for the energy, but also with the generators (3.18b).
The generators provide a connection between conservation laws and invariances
of the action (recall Sect. 2.4). Here we will further illustrate this connection by
deriving momentum and angular momentum conservation from the invariance of the
action under rigid coordinate translations and rotations, respectively. [In a similar
way we could derive energy conservation, (3.26), from the invariance under time
displacements—see also Sect. 3.6].
Under an infinitesimal rigid coordinate displacement, δ, a given point which is
described by r in the old coordinate system is described by r + δ in thenew one.
(See Fig. 3.1.) The response of the particle term in (3.18b) is simple: δ · a pa ; for
the field part, we require the change, δA, of the vector potential induced by the rigid
that is, the new function F of the new coordinate equals the old function F of the
old coordinate. The change in the function F at the same coordinate is given by
so that
δ F(r) = F(r − δ) − F(r) = −δ ∇ F(r), · (3.34)
If the positions of all the particles, the ra , are displaced by δ, the charge density
changes to
ρ(r) + δρ(r) = ea δ(r − ra − δ), (3.36)
a
where
·
δ(r − ra − δ) = δ(r − ra ) − δ ∇r δ(r − ra ), (3.37)
and therefore
δρ(r) = −δ ∇ρ(r), · (3.38)
where the last rearrangement makes use of (3.15b) and (3.15h), and the vector identity
·
δ × (∇ × A) = ∇(δ A) − (δ ∇)A. · (3.40)
Including the particle part from (3.18b) we find the generator corresponding to a
rigid coordinate displacement can be written as
3.4 Momentum and Angular Momentum Conservation 23
·
G = δ P, (3.41)
where
ea 1
P= pa − A(ra ) + (dr) E × B ≡ m a va + (dr) G, (3.42)
a
c 4π c a
with G the momentum density. Since the action is invariant under a rigid displace-
ment,
0 = δW = G 1 − G 2 = (P1 − P2 ) δr, · (3.43)
we see that
P1 = P2 , (3.44)
that is, the total momentum, P, is conserved. This, of course, can also be verified by
explicit calculation:
d 1 1
(dr) E×B=− (dr) ρE + j × B
dt 4π c c
1
=− ea E(ra ) + va × B(ra ) , (3.45)
a
c
since a vector transforms in the same way as r, so the new function at the initial
numerical values of the coordinates is
·
A(r) = A(r) − (δr ∇)A(r) + δω × A(r). (3.48)
·
δA = −(δr ∇)A + δω × A. (3.49)
·
G = δω J, (3.50)
24 3 Classical Field Theory—Electrodynamics
1
J= ra × m a va + (dr) r × E×B , (3.51)
a
4π c
ea
ea ∂
L → L ≡L +
c
∇λ
dra
dt
·− va +
c ∂t
ea
λ + va ∇λ
c
·
a
ea ∂
·
dra
=L + λ+ ∇λ
a
c ∂t dt
d
=L + w, (3.54)
dt
where ea
w= λ(ra , t). (3.55)
a
c
3.5 Gauge Invariance and the Conservation of Charge 25
G = G + δw. (3.57)
This alteration reflects the fact that the Lagrangian itself is ambiguous up to a total
time derivative term. [This term may also be seen as arising from the field term in
the generator (3.18b).]
To ascertain the implication of gauge invariance, we rewrite the change in the
Lagrangian given in the first line of (3.54) by use of (3.12c),
∂
·
1
L−L= (dr) ρ λ + j ∇λ , (3.58)
c ∂t
and apply this result to an infinitesimal gauge transformation, λ → δλ. The change
in the action is then
t1
∂
δW12 = G δλ1 − G δλ2 −
t2
dt (dr) δλ
1
c ∂t
ρ+∇ j , · (3.59)
In view of the arbitrary nature of δλ(r, t), the stationary action principle now demands
that, at every point,
∂
∂t
ρ + ∇ j = 0,· (3.61)
that is, gauge invariance implies local charge conservation. (Of course, this same
result follows from Maxwell’s equations.) Then, the special situation δλ = constant,
where δA = δφ = 0, and W12 is certainly invariant, implies a conservation law, that
of
1
G δλ = δλ Q, (3.62)
c
in which
Q= (dr) ρ (3.63)
We have just derived the local conservation law of electric charge. Electric charge is
a property carried only by the particles, not by the electromagnetic field. In contrast,
the mechanical properties of energy, linear momentum, and angular momentum are
attributes of both particles and fields. For these we have conservation laws of total
quantities. What about local conservation laws? The usual development of electro-
dynamics refers to local non-conservation laws; they concentrated on the fields and
characterized the charged particles as sources (or sinks) of field mechanical proper-
ties. It is natural to ask for a more even-handed treatment of both charges and fields.
We shall supply it, in the framework of a particular example. The property of gauge
invariance will be both a valuable guide, and an aid to simplifying the calculations.
The time displacement of a complete physical system identifies its total energy.
This suggests that time displacement of a part of the system provides energetic
information about that portion. The ultimate limit of this spatial subdivision, a local
description, should appear in response to an (infinitesimal) time displacement that
varies arbitrarily in space as well as in time, δt (r, t).
Now we need a clue. How do fields, and potentials, respond to such coordinate-
dependent displacements? This is where the freedom of gauge transformations enters:
The change of the vector and scalar potentials, by ∇λ(r, t), −(1/c)(∂/∂t)λ(r, t),
respectively, serves as a model for the potentials themselves. The advantage here is
that the response of the scalar λ(r, t) to the time displacement can be reasonably
taken to be
(λ + δλ)(r, t + δt) = λ(r, t), (3.64a)
or
∂
δλ(r, t) = −δt (r, t) λ(r, t). (3.64b)
∂t
Then we derive
∂ 1 ∂
δ(∇λ) = − δt
(∇λ) + − λ c∇δt, (3.65a)
∂t c ∂t
1 ∂ 1 ∂ 2 1 ∂ ∂
δ − λ = − δt − λ − − λ δt, (3.65b)
c ∂t c ∂t 2 c ∂t ∂t
∂
δA = − δt A + φc∇δt, (3.66a)
∂t
∂ ∂
δφ = − δt φ − φ δt, (3.66b)
∂t ∂t
or, equivalently,
3.6 Gauge Invariance and Local Conservation Laws 27
1 ∂ ∂ ∂
δE = − (cδtE) = −δt E − E δt, (3.68a)
c ∂t ∂t ∂t
∂
δB = ∇ × (cδtE) = −δt B − E × ∇cδt; (3.68b)
∂t
1
Lfield = (E 2 − B 2 ). (3.69)
8π
Then we can apply the field variation (3.68b) directly, and get
∂ 1 2∂
δLfield = − δt
∂t
Lfield −
4π
E
∂t
δt −
c
4π
E × B ∇δt ·
∂ ∂
= − (δtLfield ) −
∂t
1
8π
(E 2 + B 2 ) δt −
∂t
c
4π
E × B ∇δt. · (3.70)
Before commenting on these last, not unfamiliar, field structures, we turn to the
charged particles and put them on a somewhat similar footing in terms of a continuous,
rather than a discrete, description.
We therefore present the Lagrangian of the charges in (3.9) in terms of a corre-
sponding Lagrange function,
L charges = (dr) Lcharges , (3.71a)
where
Lcharges = La (3.71b)
a
and
·
1 ea
La = δ(r − ra (t)) m a va (t)2 − ea φ(ra , t) + va (t) A(ra , t) ; (3.71c)
2 c
28 3 Classical Field Theory—Electrodynamics
the latter adopts the Lagrangian viewpoint, with va = dra /dt accepted as a definition.
Then, the effect of the time displacement on the variables ra (t), taken as
the last step exhibits both the explicit and the implicit dependences of δt (ra , t) on
t. In computing the variation of φ(ra , t), for example, we combine the potential
variation given in (3.66b) with the effect of δra :
∂ ∂ ∂
·
d
δφ(ra (t), t) = −δt φ − φ δt − δtva ∇ a φ = −δt φ − φ δt, (3.74a)
∂t ∂t dt ∂t
and, similarly,
∂
δA(ra (t), t) = −δt
∂t
· d
A + φc∇δt − δtva ∇ a A = −δt A + φc∇δt. (3.74b)
dt
The total effect of these variations on La is thus
∂
d
dt
2 ea
δLa = −δt La + δ(r − ra (t)) −m a va − A va + ea φ
c ∂t
·
δt + va ∇δt , ·
(3.75a)
or
∂
·
d
δLa = − (δtLa ) − δ(r − ra (t))E a δt + va ∇δt , (3.75b)
dt ∂t
1
Ea = m a va2 . (3.76)
2
We have retained the particle symbol d/dt to the last, but now, being firmly back
in the field, space-time viewpoint, it should be written as ∂/∂t, referring to all t
dependence, with r being held fixed. The union of these various contributions to the
variation of the total Lagrange function is
∂ ∂
δLtot = −
∂t ∂t
·
(δtLtot ) − Utot δt − Stot ∇δt, (3.77)
1
Utot = (E 2 + B 2 ) + δ(r − ra (t))E a (3.78a)
8π a
and c
Stot = E×B+ δ(r − ra (t))E a va , (3.78b)
4π a
are physically transparent forms for the total energy density and total energy flux
vector.
To focus on what is new in this development, we ignore boundary effects in the
stationary action principle, by setting the otherwise arbitrary δt (r, t) equal to zero
at t1 and t2 . Then, through partial integration, we conclude that
t1 ∂
δW12 =
t2
dt (dr) δt
∂t
·
Utot + ∇ Stot = 0, (3.79)
∂
∂t
·
Utot + ∇ Stot = 0. (3.80)
Chapter 4
Quantum Action Principle
This Chapter, and the following one, are based on lectures given by the author in
quantum field theory courses at the University of Oklahoma over several years, based
in turn largely on lectures given by Schwinger at Harvard in the late 1960s.
After the above reminder of classical variational principles, we now turn to the
dynamics of quantum mechanics. We begin by considering the transformation func-
tion a , t + dt|b , t. Here |b , t is a state specified by the values b = {b } of a
complete set of dynamical variables B(t), while |a , t + dt is a state specified by
values a = {a } of a (different) complete set of dynamical variables A(t + dt),
defined at a slightly later time.1 We suppose that A and B do not possess any explicit
time dependence—that is, their definition does not depend upon t. Here
where the infinitesimal time translation operator is related to the generator of time
translations as follows,
U = 1 + i G = 1 − i dt H. (4.2)
where δdyn corresponds to changes in initial and final times, δt2 and δt1 , and in the
structure of H , δ H . [By reintroducing dt in the state on the left in the second line,
we make a negligible error of O(dt 2 ).]
However, we can also consider kinematical changes. To understand these, consider
a system defined by coordinates and momenta, {qa (t)}, { pa (t)}, a = 1, . . . , n, which
satisfy the canonical commutation relations,
n
U = 1 + i Gq , Gq = pa δqa . (4.8)
a=1
1
U −1 qa U = qa − [qa , G q ]
i
= qa − δqa , (4.9)
while
1
U −1 pa U = pa − [ pa , G q ] = pa . (4.10)
i
The (dual) symmetry between position and momentum,
q → p, p → −q, (4.11)
4 Quantum Action Principle 33
δkin | = | − | = |i G, (4.13a)
δkin | = | − | = −i G| , (4.13b)
dpa (t) 1
= [ pa (t), H (q(t), p(t), t)]
dt i
∂H
=− (t), (4.15)
∂qa
so
dpa (t) ∂H
pa (t + dt) − pa (t) = dt = −dt (t). (4.16)
dt ∂qa
dqa ∂H
= (4.17)
dt ∂ pa
implies that
∂H
qa (t + dt) − qa (t) = dt (t). (4.18)
∂ pa
δq a , t + dt|b , t
∂H
= ia , t + dt| pa (t)[δqa (t + dt) − δqa (t)] − dt δqa (t) + O(dt 2 )|b , t
a
∂q a
= ia , t + dt|δq pa (t).[qa (t + dt) − qa (t)] − dt H (q(t), p(t), t) |b , t,
a
(4.19)
34 4 Quantum Action Principle
δ p a , t + dt|b , t
= − ia , t + dt| [qa (t + dt)δpa (t + dt) − qa (t)δpa (t)]|b , t
a
∂H
= − ia , t + dt| qa (t)[δpa (t + dt) − δpa (t)] + dt (t)δpa (t)|b , t
a
∂ pa
= ia , t + dt|δ p − qa (t).( pa (t + dt) − pa (t)) − dt H (q(t), p(t), t) |b , t.
a
(4.20)
We see here two alternative forms of the quantum Lagrangian. Note that the two
forms differ by a total time derivative,
d
Lq − L p = pa .qa . (4.23)
dt a
We now can unite the kinematic transformations considered here with the dynamic
ones considered earlier, in Eq. (4.6):
δ = δdyn + δkin : δa , t + dt|b , dt = ia , t + dt|δ[dt L]|b , t. (4.24)
Suppose, for concreteness, that our states are defined by values of q, so that
δ p a , t + dt|b t = 0. (4.25)
4 Quantum Action Principle 35
∂H
δp Lq = δpa q̇a − = 0. (4.26)
a
∂ pa
t1 |t2 = t1 |t1 − dtt1 − dt|t1 − 2dt · · · t2 + 2dt|t2 + dtt2 + dt|t2 , (4.27)
So in this way we deduce the general form of Schwinger’s quantum action principle:
t1
δt1 |t2 = it1 |δ dt L|t2 . (4.28)
t2
where the generator G a depends on p and q at time ta . Comparing with the action
principle (4.28) we see
t1
δ dt L = G 1 − G 2 , (4.30)
t2
which has exactly the form of the classical action principle (2.3), except that the
Lagrangian L and the generators G are now operators. If no changes occur at the
endpoints, we have the principle of stationary action,
t1
δ pa .dqa − H dt = 0. (4.31)
t2 a
We will again assume δpa , δqa are not operators (that is, they are proportional to the
unit operator); then we recover Hamilton’s equations,
∂H dH
= , (4.34a)
∂t dt
∂H dqa
= , (4.34b)
∂ pa dt
∂H dpa
=− . (4.34c)
∂qa dt
(Schwinger also explored the possibility of operator variations, see, for example, his
les Houches lectures (Schwinger 1970a).) We learn from the generators,
G t = −H δt, G q = pa δqa , (4.35)
a
dF 1
δF = δt + [F, G], (4.36)
dt i
so we deduce
dF ∂F 1
= + [F, H ], (4.37a)
dt ∂t i
∂F 1
= [F, pa ]. (4.37b)
∂qa i
1 ∂H
ṗa = − [H, pa ] = − . (4.39)
i ∂qa
4 Quantum Action Principle 37
which implies
∂F 1
= − [F, qa ]. (4.41)
∂ pa i
[qa , qb ] = 0, (4.42)
1 ∂H
q̇a = [qa , H ] = . (4.43)
i ∂ pa
It is easy to show that the effect of changing the Lagrangian by a total time derivative
(which is what is done in passing from L q to L p ) is to change the generators.
We now turn to examples.
[y, y † ] = 1, (4.44a)
1
H =ω y y +
†
. (4.44b)
2
dy 1 1
= [y, H ] = ωy, (4.45a)
dt i i
dy † 1 † 1
= [y , H ] = − ωy † . (4.45b)
dt i i
2 We follow Schwinger’s usage of y for the annihilation operator, instead of the more usual a.
38 4 Quantum Action Principle
y|y = y |y , (4.46a)
y |y = y y |,
† † † †
(4.46b)
while y | and |y † do not exist.3 These are the famous “coherent states,” to whom the
name Roy Glauber (1963) is invariably attached, although they were discovered by
Erwin Schrödinger (1926), and Glauber’s approach, as he acknowledged, followed
that of his mentor, Schwinger (1953).
The transformation function we seek is therefore
y † , t1 |y , t2 . (4.48)
1 ∂H ∂H
ẏ = ωy = , i ẏ † = −ωy † = − . (4.49)
i ∂iy † ∂y
L = iy † .ẏ − H. (4.51)
Because we use y as our state variable at the initial time, and y † at the final time, we
must exploit our freedom to redefine our generators to write
1
W12 = dt L − iy † (t1 ).y(t1 ). (4.52)
2
i 1 i d 2
iy † .ẏ = (q − i p).(q̇ +i ṗ) = ( p.q̇ − q. ṗ) + (q + p 2 ), (4.50)
2 2 4 dt
where the first term in the final form is the average of the Legendre transforms in L q and L p .
4.1 Harmonic Oscillator 39
δy † , t1 |y , t2 = iy † , t1 | − iδy1† y1 − iy2† δy2 − ωy1† y1 (δt1 − δt2 )|y , t2 , (4.54)
and we have dropped the zero-point energy. Now use the equations of motion (4.45a)
and (4.45b) to deduce that
and hence
From this we can deduce that the transformation function has the exponential form
y † , t1 |y , t2 = exp y † e−iω(t1 −t2 ) y , (4.58)
where both states are expressed at the common time t2 , so, upon inserting a complete
set of energy eigenstates, we obtain (t = t1 − t2 )
y † |Ee−i Et E|y , (4.60)
E
∞
(y † )n −inωt (y )n
√ e √ . (4.61)
n=0
n! n!
40 4 Quantum Action Principle
E n =nω, n = 0, 1, 2, . . . , (4.62a)
(y † )n
y † |E n = √ , (4.62b)
n!
(y )n
E n |y = √ . (4.62c)
n!
These correspond to the usual construction of the eigenstates from the ground state:
(y † )n
|E n = √ |0. (4.63)
n!
where K (t) is an external force (Kraft is force in German). The equation of motion is
dy ∂H
i = † = [y, H ] = ωy + K (t), (4.65)
dt ∂y
while y † satisfies the adjoint equation. In the presence of K (t), we wish to compute
the transformation function y † , t1 |y , t2 K .
Consider a variation of K . According to the action principle
δ K y † , t1 |y , t2 K = y † , t1 |iδ K W12 |y , t2 K
t1
= − iy † , t1 | dt[δ K y † + δ K ∗ y]|y , t2 K . (4.66)
t2
We can solve this differential equation by noting that the equation of motion (4.65)
can be rewritten as
d iωt
i e y(t) = eiωt K (t), (4.67)
dt
which is integrated to read
t
e iωt
y(t) − e iωt2
y(t2 ) = −i dt eiωt K (t ), (4.68)
t2
4.2 Forced Harmonic Oscillator 41
or t
y(t) = e−iω(t−t2 ) y2 − i dt e−iω(t−t ) K (t ), (4.69)
t2
δ K y † , t1 |y , t2 K
= δ K lny † , t1 |y , t2 K
y † , t1 |y , t2 K
t1 t1
=−i dt δ K (t) y † e−iω(t1 −t) − i dt e−iω(t −t) K ∗ (t )
t2 t
t1 t
∗ −iω(t−t2 ) −iω(t−t )
−i dt δ K (t) e y −i dt e K (t ) . (4.73)
t2 t2
Notice that in the terms bilinear in K and K ∗ , K always occurs earlier than K ∗ .
Therefore, these terms can be combined to read
t1
− δK dt dt K ∗ (t)η(t − t )e−iω(t−t ) K (t ), (4.74)
t2
Since we already know the K = 0 value from Eq. (4.58), we may now immediately
integrate our differential equation:
where we now suppose that the forces turn off at the initial and final times, t2 and t1 ,
respectively.
A check of this result is obtained by computing the probability of the system
remaining in the ground state:
∞
|0, t1 |0, t2 | = exp
K 2
dt dt K ∗ (t)e−iω(t−t )
−
−∞
× [η(t − t ) + η(t − t)]K (t )
∞
∗ −iω(t−t )
= exp − dt dt K (t)e K (t )
−∞
= exp −|K (ω)|2 , (4.78)
is thus satisfied. We see here a resonance effect: If the oscillator is driven close to
its natural frequency, so K (ω) is large, there is a large probability of finding the
system in an excited state, and therefore of not remaining in the ground state. Let us
calculate this transition amplitude to an excited state. By setting y = 0 in Eq. (4.76)
we obtain
4.2 Forced Harmonic Oscillator 43
∞
y † , t1 |0, t2 K = exp −iy † dt e−iω(t1 −t) K (t) 0, t1 |0, t2 K
−∞
= y † , t1 |n, t1 n, t1 |0, t2 K , (4.81)
n
where we have inserted a sum over a complete set of energy eigenstates, which
possess the amplitude [see Eq. (4.62b)]
(y † )n
y † |n = √ . (4.82)
n!
(−i)n
n, t1 |0, t2 K = √ e−inωt1 [K (ω)]n 0, t1 |0, t2 K . (4.83)
n!
6A Poisson probability distribution has the form p(n) = λn e−λ /n!. The mean value of n for this
distribution is
∞
∞
∞
λn e−λ
n̄ = n p(n) = =λ p(n) = λ. (4.85)
(n − 1)!
n=0 n=0 n=0
.
44 4 Quantum Action Principle
∞
y(t) = e−iω(t−t2 ) y2 + dt G(t − t )K (t ). (4.89)
−∞
which demonstrates that knowledge of 0, t1 |0, t2 K for all K determines everything:
δ(t−t
y † , t1 |y , t2 K = 0, t1 |0, t2 K (t)+iy 2 )+iy
† δ(t−t
1) . (4.93)
Although much more familiar, the path integral formulation of quantum mechanics
(Feynman 1942, 1949, 1965) is rather vaguely defined. We will here provide a formal
“derivation” based on the Schwinger principle, in the harmonic oscillator context.
Consider a forced oscillator, defined by the Lagrangian (note in this section, H does
not include the source terms)
δ
i 0, t1 |0, t2 K = 0, t1 |y † (t)|0, t2 K , (4.96a)
δ K (t)
δ
i 0, t1 |0, t2 K = 0, t1 |y(t)|0, t2 K , (4.96b)
δ K ∗ (t)
where we have introduced the concept of the functional derivative. The equation of
motion
∂H ∂H
i ẏ − † − K = 0, −i ẏ † − − K ∗ = 0, (4.97)
∂y ∂y
δ
K (t ) = δ(t − t ), (4.100)
δ K (t)
so
t1
iδ
d iδ iδ iδ
i K (t), dt i . −H ,
t2 δ K (t ) dt δ K ∗ (t ) δ K ∗ (t ) δ K (t )
d iδ ∂ iδ iδ
=i ∗
− H ∗
, , (4.101)
dt δ K (t) ∂(iδ/δ K (t)) δ K (t) δ K (t)
which corresponds to the first two terms in the equation of motion (4.97), under the
correspondence
δ δ
y↔i , y† ↔ i . (4.102)
δK ∗ δK
Since [[K , W ], W ] = 0, we can write the functional equation (4.98) as
∗ ,iδ/δ K ] ∗ ,iδ/δ K ]
0 = ei W [iδ/δ K K e−i W [iδ/δ K 0, t1 |0, t2 K . (4.103)
The above equation has a solution (up to a constant), because both equations (4.97)
must hold,
∗
0, t1 |0, t2 K = ei W [iδ/δ K ,iδ/δ K ] δ[K ]δ[K ∗ ], (4.104)
46 4 Quantum Action Principle
where δ[K ], δ[K ∗ ] are functional delta functions. The latter have functional Fourier
decompositions (up to a multiplicative constant),
δ[K ] = [dy † ]e−i dt K (t)y † (t)
, (4.105a)
dt K ∗ (t)y(t)
δ[K ∗ ] = [dy]e−i , (4.105b)
where y, y † are now numerical, and the functional integration is over all possible
functions,
over all possible “paths.” Of course, the classical paths, the ones for which
W − dt (K y † + K ∗ y) is an extremum, receive the greatest weight, at least in the
classical limit, where → 0.
4.3.1 Example
Then
∞ ∞ dν
dt y † (t)y(t) = y(ν)y ∗ (−ν), (4.108a)
−∞ −∞ 2π
∞ ∞
dν
dt iy † (t)ẏ(t) = νy(ν)y ∗ (−ν). (4.108b)
−∞ −∞ 2π
4.3 Feynman Path Integral Formulation 47
since the functional integral in the third equality, obtained by shifting the integration
variable,
K (ν)
y(ν) − →y(ν), (4.110a)
ν−ω
K ∗ (−ν)
y ∗ (−ν) − →y ∗ (−ν), (4.110b)
ν−ω
∗
is 0, t1 |0, t2 K = K = 0 = 1. How do we interpret the singularity at ν = ω in the
remaining integral? We should have inserted a convergence factor in the original
functional integral:
dν dν
exp i [. . . ] → exp i · · · + iy(ν)y ∗ (−ν) , (4.111)
2π 2π
which is evaluated by closing the ν contour in the upper half plane if t − t < 0, and
in the lower half plane when t − t > 0. Since the pole is in the lower half plane we
get
G(t − t ) = −iη(t − t )e−iω(t−t ) , (4.114)
1 ω1
q = √ (y + y ), p =
†
(y − y † ), (4.115a)
2ω 2i
ω ip ω ip
y= q+ , y =
†
q− . (4.115b)
2 ω 2 ω
Then the numerical Lagrangian appearing in (4.106) may be rewritten as (see footnote
4 above)
L = iy † ẏ − ωy † y − K y † − K ∗ y
ω p ṗ ω2 p2
=i q −i q̇ + i − q + 2
2
2 ω ω 2 ω
ω ip ω ∗ ip
− K q− − K q+
2 ω 2 ω
ω d p 2 1 d 1
=i q 2 + 2 + pq̇ − ( pq) − ( p 2 + ω2 q 2 )
4 dt ω 2 dt 2
2 2
− K q − K p
ω ω
d
= w + L(q, q̇, t), (4.116)
dt
where, if we set q̇ = p, the Lagrangian is
1 2 1 2 2
L(q, q̇, t) = q̇ − ω q + Fq, (4.117)
2 2
if √
K = 0, F = − 2ωK . (4.118)
and so from the penultimate line of Eq. (4.116), the path integral (4.106) becomes
t1
0, t1 |0, t2 F = [dy][dy † ] exp i dt L(y, y † )
t2
t1
1 2 1 2 2
= [dq][dp] exp i dt pq̇ − p − ω q + Fq . (4.121)
t2 2 2
Here we have discretized time so that p(ti ) = pi , so the final functional integral
√ over
p is just an infinite product of constants, each one of which equals e−iπ/4 2π/Δt.
Thus we arrive at the form originally written down by Feynman [Feynman 1965],
t1
0, t1 |0, t2 = F
[dq] exp i dt L(q, q̇, t) , (4.123)
t2
with the Lagrangian given by Eq. (4.117), where an infinite normalization constant
has been absorbed into the measure.
In a general sense, the exponent in Eq. (4.124) is an integrated form of the action. In
solving the equation of motion, we found in Eq. (4.89)
y(t) = e−iω(t−t2 ) y(t2 ) + dt G(t − t )K (t ), (4.126)
where the first term is effectively zero here. The net effect is to replace an operator
by a number:
y (t) = dt G(t − t )K (t ). (4.127)
Recall that the action was was the integral of the Lagrangian (4.94), or
∂
W = dt y † i y − ωy † y − y † K (t) − yK ∗ (t) , (4.129)
∂t
so we see one term in Eq. (4.128) here, and the equation of motion (4.65) cancels
out the rest! So let’s add something which gives the equation for y :
d
i dt y † i dt y −ωy † y −y † K −y K ∗
0t1 |0t2 = e
K
= ei W . (4.130)
This is the starting point for the development of source theory, which will be treated
in Chap. 7.
Chapter 5
Time-Cycle or Schwinger-Keldysh
Formulation
which expresses the expectation value in terms of the matrix elements of the operator
F in a complete set of states defined at time t1 , {|a t1 }. Suppose the operator F has
no explicit time dependence. Then we can use the action principle to write
t1
δa t1 |b t2 = ia t1 |δ dt L |b t2 , (5.2a)
t2
and so t1
δb t2 |a t1 = −ib t2 |δ dt L |a t1 , (5.2b)
t2
which can be obtained from the first equation by merely exchanging labels,
t1 t2
=− . (5.3)
t2 t1
t2
If we consider
b t2 |b t2 = b t2 |a t1 a t1 |b t2 , (5.4)
a
We can interpret the above as a cycle in time, going from time t2 to t1 and then
back again, as shown in Fig. 5.1. But, now imagine that the dynamics is different on
the forward and return trips, described by different Lagrangians L + and L − . Then
t1 t1
δb t2 |b t2 = ib t2 |δ dt L + − dt L − |b t2 . (5.6)
t2 t2
H = H0 + λ(t)F, (5.7)
t2
Let’s illustrate this with a driven harmonic oscillator, as described by Eq. (4.64),
so now
∗
H+ = ω y † y + K + (t)y + K + (t)y † , (5.10a)
∗
H− = ω y y + †
K− (t)y + K − (t)y , †
(5.10b)
which describes the oscillator evolving forward in time from t2 to t1 under the influ-
ence of the force K + , and backward in time from t1 to t2 under the influence of K − ,
as shown in Fig. 5.2. From the variational principle we can learn all about y and y † .
We have already solved this problem by a more laborious method above, in Sect. 4.2.
It suffices to solve this problem with initial and final ground states. If we consider
only a K ∗ variation,
K + ,K −
t1 ∗ ∗
δ K ∗ 0t2 |0t2 = −i0t2 | dt δK + (t)y+ (t) − δK − (t)y− (t) |0t2 . (5.11)
t2
Now we must solve the equations of motion, so since effectively y(t2 ) → 0, we have
from Eq. (4.69),
t
y+ (t) = − i dt e−iω(t−t ) K + (t ), (5.12a)
t2
t1 t
y− (t) = − i dt e−iω(t−t ) K + (t ) − i dt e−iω(t−t ) K − (t ). (5.12b)
t2 t1
so naming the advanced and retarded Green’s functions by extending the definition
in Eq. (4.86),
54 5 Time-Cycle or Schwinger-Keldysh Formulation
η(t − t)
G a,r (t, t ) = ie−iω(t−t ) , (5.14)
−η(t − t )
∗ (t)G (t−t )K (t )
∗ (t)e−iω(t−t ) K (t )
dt dt K − dt dt K −
× ei a − e + .
(5.16)
The effect on the second transformation function is the same as moving the n, t1 state
to a later time,
nt1 |0t2 K (t+T ) = nt1 + T |0t2 K (t) = e−inωT nt1 |0t2 K (t) , (5.19)
t2
5 Time-Cycle or Schwinger-Keldysh Formulation 55
which gives us the probabilities directly. From the formula (5.16) we have, using
Eq. (5.17),
dt dt K ∗ (t)e−iω(t−t ) [K (t +T )−K (t )]
0t2 |0t2 K − K + = e
dt dt K ∗ (t)e−iω(t−t ) [e−iωT −1]K (t )]
=e
e−iωT −1
= e|γ|
2
, (5.21)
where
γ= dt eiωt K (t). (5.22)
(|γ|2 )n
p(n, 0) K = e−|γ|
2
. (5.23)
n!
The above Eq. (5.21) can be directly used to find certain average values. For
example,
2 −iωT −1
e−inωT 0K = e|γ| e . (5.24)
or
n 1
=√ . (5.28)
n n
For large quantum numbers, which corresponds to the classical limit, the fluctuations
become relatively small.
56 5 Time-Cycle or Schwinger-Keldysh Formulation
we see that since we can change the source functions at will, and make very localized
changes, it makes sense to define the variational derivatives
δ
∗ (t) | = |y+ (t)| K ± ,
K±
i (5.30a)
δK +
δ †
−i | K ± = |y− (t)| K ± . (5.30b)
δK − (t)
All expectation values of operator products at any time can be obtained in this way—
in particular, correlation functions. Repeating this operation we get
δ δ δ
(−i) i ∗ (t ) t2 |t2
K±
=−i t2 |y+ (t )|t2 K ±
δK − (t) δK + δK − (t)
†
=t2 |y− (t)y+ (t )|t2 K ± . (5.31)
The operators are multiplied in the order of the time development. The only place
where K − appears is in the latter part of the time development. See Fig. 5.4.
The distinction between ± disappears if we now set K + = K − :
δ δ
∗ (t ) 0t2 |0t2 K ±
= 0t2 |y † (t)y(t )|0t2 K . (5.32)
δK − (t) δK + K + =K − =K
As an example, set t = t = t1 ; then, from Eq. (5.16), this reads for the number
operator N (t) = y † (t)y(t),
∗
N (t1 )0K = dt G r (t1 − t )K (t )
dt K (t)G a (t − t1 )
=i dt e−iω(t−t1 ) K ∗ (t)(−i) dt e−iω(t1 −t ) K (t ) = |γ|2 ,
(5.33)
t2
5 Time-Cycle or Schwinger-Keldysh Formulation 57
t1
K+
K−
t2
t2
Fig. 5.5 Time cycle with different forces, K + and K − . on the forward and backward moving
segments. Now the initial time of the time cycle, t2 , is different from the final time of the time cycle,
t2 , with τ = t2 − t2 . It is assumed that the time t1 is later than both t2 and t2 , and that the forces are
localized as shown
Here, by introducing a complete set of states at the time of the operator, we have
expressed the formula in terms of the matrix elements of stationary operators.
58 5 Time-Cycle or Schwinger-Keldysh Formulation
Remarkably, we see that the two expressions are equal; in effect, there is a peri-
odicity present here:
y+ (t2 ) = y− (t2 ), (5.37)
as far as traces are concerned. Now, the equations of motion (4.65) for the operators
read
d
i − ω y(t) = K (t), (5.38)
dt
which has solution (5.12b) with the addition of the initial term, or
t1
−iω(t−t2 )
y− (t) = e y+ (t2 ) − i dt e−iω(t−t ) K + (t )
t2
t1
−iω(t−t )
+i dt e K − (t ). (5.39)
t
In particular,
y− (t2 ) = e−iωτ y+ (t2 ) − i dt e−iω(t2 +τ −t ) (K + − K − )(t ). (5.40)
Note that the integrals sweep over the full force history. Let us let t2 = 0 for simplicity,
although we will keep the label. Because of the periodicity condition (5.37) this reads
eiωτ − 1 y+ (t2 ) = −i dt eiωt (K + − K − )(t) = −i(γ+ − γ− ), (5.41)
or
1
y+ (t2 ) = (−i)(γ+ − γ− ). (5.42)
eiωτ − 1
The denominator, which refers to the free harmonic oscillator, is immediately eval-
uated as
∞
1
trt2 |t2 = e−inωτ = . (5.44)
1 − e−iωτ
n=0
1
− δ(γ ∗ − γ−
∗
)(γ+ − γ− ). (5.46)
eiωτ − 1 +
we must multiply by
1
exp[− |γ+ − γ− |2 ]. (5.48)
eiωτ −1
iωτ −1)−1
∗ −K ∗ )(t)e−iω(t−t ) (K −K )(t )
dt dt (K +
× e−(e − + − , (5.49)
which is the exponential of a bilinear structure. This is a generating function for the
amplitudes nt2 |nt2 K ± . But it is useful as it stands.
Put τ = −iβ; then this describes a thermodynamic average over a thermal mixture
at temperature T , where β = 1/kT in terms of Boltzmann’s constant,
e−βnω | n
n
−βnω (5.50)
ne
1 1
→ βω = nβ , (5.51)
eiωτ − 1 e −1
because
ne−inωτ ∂ ∂ 1 1
n −inωτ = ln( e−inωτ ) = ln −iωτ
= iωτ .
n e ∂(−iωτ ) n
∂(−iωτ ) 1 − e e −1
(5.52)
60 5 Time-Cycle or Schwinger-Keldysh Formulation
Now consider a time cycle with displacement T : the system evolves from time t2
to time t1 under the influence of the force K + (t), and backwards in time from t1 to
t2 under the force K − (t):
This is again as illustrated in Fig. 5.5, with these replacements. What is the physical
meaning of this? Insert in Eq. (5.49) a complete set of states at time t1 :
nt2 |nt2 K ± = nt2 |n t1 K − n t1 |nt2 K + . (5.54)
n
We did this before for the ground state. The effect is the same as moving the starting
and ending times. Appearing here is
n t1 |nt2 K (t+T ) = n t1 + T |nt2 + T K (t) = e−in ωT n t1 |nt2 K (t) einωT . (5.55)
Therefore,
nt2 |nt2 K (t),K (t+T ) = e−i(n −n)ωT p(n , n) K = e−i(N −n)ωT nK . (5.56)
n
Therefore, as a generalization for finite τ of Eq. (5.26), we have from Eq. (5.49)
−1
−in ωτ
e e−inωτ e−i(N −n)ωT nK
n n
1
= exp e−iωT − 1 |γ|2 − e iωT
− 1 e −iωT
− 1 |γ| 2
, (5.57)
eiωτ − 1
where T gives the final state, and τ the initial state. This used the observation
−iωT
dt e iωt
K (t + T ) = e dt eiωt K (t). (5.58)
e−inωτ
− iωT N − nnK −in ωτ
= −iωT |γ|2 , (5.59)
n n e
or
N − nβK = |γ|2 , (5.60)
5 Time-Cycle or Schwinger-Keldysh Formulation 61
which generalizes the earlier result (5.25). Now apply Eq. (5.59) as a generating
function,
N − nnK = |γ|2 , (5.61)
1
e−inωτ e−i(N −N )ωT nK
e−inωτ
−iωT 1 −iωT
= exp e − 1 + iωT |γ| − iωτ
2
e −1 e iωT
− 1 |γ| .
2
e −1
(5.63)
1 1
e−inωτ (N − N )2 nK = |γ|2 + 2 |γ|2 , (5.64)
e−inωτ eiωτ −1
or
(N − N )2 βK = |γ|2 [1 + 2nβ ]. (5.65)
If, instead, we multiply Eq. (5.64) through by n e−inωτ , we can use this as a
generating function, and learn from Eq. (5.52) that
Note the simplicity of the derivation of this result, which does not involve complicated
functions like Laguerre polynomials.
Chapter 6
Relativistic Theory of Fields
σ + dσ
σ
Fig. 6.1 Spacetime volume bounded by two spacelike surfaces, σ and σ + dσ . Points in the blue
and green shaded regions cannot interfere
Since measurements made at points in the space-like shell dσ cannot interfere (e.g.,
in the two shaded areas, shown in Fig. 6.1) their contribution to W12 is additive,
σ1 σ1
W12 = (d x)L(x) = (d x)L[χa (x)], (6.4)
σ2 σ2
where the χa (x) = χa (x, y, z, t) are the dynamical variables of the system, neces-
sarily Hermitian operators; the relativistic requirements automatically introduce the
concept of fields. The relativistic notation used has the form
x 0 = t = −x0 , x k = xk , where k = 1, 2, 3,
∂
(d x) = d x 0 d x1 d x2 d x3 , ∂μ = μ , (6.5)
∂x
where the metric used is ⎛ ⎞
−1 0 0 0
⎜0 1 0 0⎟
gμν =⎜
⎝0
⎟. (6.6)
0 1 0⎠
0 0 0 1
The action principle again states that for a given dynamical system, the variations
arise only from the end-point, that is,
As before, from this requirement follow the equations of motion and the generators
of infinitesimal transformations which yield the commutation relations of the field
operators. The Lagrangian density L which will yield first order field equations is
1
μ
L= χ A ∂μ χ − ∂μ χ Aμ χ − H (χ ) , (6.8)
2
where the Aμ are a set of four numerical matrices, and space and time derivatives
appear on a symmetric basis. The symmetrization of the kinematical term relates
to the possibility of adding to L the relativistic analogue of our previous total time
derivative, a four dimensional divergence. If
L = L − ∂μ f μ , (6.9)
6 Relativistic Theory of Fields 65
then
σ1
W 12 = W12 − (d x)∂μ f = W12 − dσμ f + dσμ f μ ,
μ μ
(6.10)
σ2 σ1 σ2
and
G(σ1 ) = G(σ1 ) − dσμ f μ , G(σ2 ) = G(σ2 ) − dσμ f μ . (6.11)
σ1 σ2
The rank of Aμ is that of the number of independent fields. Note that the variation
δa1 σ1 |a2 σ2 is independent of any coordinate system, since L is a Lorentz scalar.
We can now infer some fundamental properties from the requirement of invariance
of L. Consider the coordinate transformation (Poincaré transformation)
x μ = μ ν x ν − μ (6.13)
where
gμν μ λ ν κ = gλκ , μ λ μ κ = δ λ κ . (6.14)
We can divide the transformations into two subsets by considering the effect of (6.13)
on g00 ,
2 3
2
2
2
− g00 =1= 0 −
0
k 0 0 0 = 1 + k 0 ≥ 1. (6.15)
k=1 k
1
μ
L= χ A ∂ μ χ − ∂ μ χ Aμ χ − H (χ )
2
1 T μ ν
= χ L A μ L ∂ν χ − ∂ν χ ν μ L T Aμ L χ − H (Lχ ) . (6.17)
2
Thus
L T Aμ L = μ ν Aν , H (Lχ ) = H (χ ) . (6.18)
H = χ Bχ = χ L T B Lχ , (6.19)
showing that the combination B −1 Aμ transforms like a vector; this is implied by the
required invariance of the kinematical term of L.
Consider now a general inhomogeneous infinitesimal Lorentz transformation, of
the form
x μ = x μ − μ + μν x ν , (6.21)
where the μ and μ ν are infinitesimals, and the rotational nature of the μν is
expressed by the relations μν = −νμ . Then L can be written as L = 1 + 2i μν S μν ,
where only the rotational μν terms appear, since the translations μ do not effect the
gradient operators ∂ν , and no corresponding changes in the χα are required to keep
L invariant. The S μν are (imaginary) operators, acting on the field variable, which
will express the spin character of the fields.
The variation δWα allows us to change the field components at each space-time
point (call these changes δα χα ), and to change the region of integration by displacing
the boundary surfaces σ1 and σ2 . In the previous non-relativistic treatment, instead of
varying the end-point times t1 and t2 ,we used t (τ ). Proceeding similarly here, we can
express the variation of a space-like surface by varying the space-time coordinates
under an infinitesimal Lorentz transformation (δxμ = μ − μν x ν ) so chosen that
on σ1 and σ2 the required displacement is obtained. The change in the action is
σ1
1
μ
δW12 = δ0 (d x) χ A ∂μ χ − ∂μ χ Aμ χ − H
2
σ2
6 Relativistic Theory of Fields 67
+ dσμ δx μ L − dσμ δ0 x μ L
σ1 σ2
σ1
= (d x) δ0 χ Aμ ∂μ χ − ∂μ χ Aμ δ0 χ − δ0 H
σ2
σ1
1
+ (d x) ∂μ χ Aμ δ0 χ − δ0 χ Aμ χ
2
σ2
+ dσμ δx μ L − dσμ δx μ L
σ1 σ2
σ1
μ 1
μ μ
= (d x) δ0 L + dσμ δx L + χ A δ0 χ − δ0 χ A χ
2
σ2 σ1
1
μ
− dσμ δx μ L + χ A δ0 χ − δ0 χ Aμ χ . (6.22)
2
σ2
Applying the action principle, the interior variation δ0 L must vanish, giving the
field equations of motion. What remains is the difference of two generators, G(σ1 ) −
G(σ2 ), where
μ 1
μ μ
G(σ ) = dσμ δx L + χ A δ0 χ − δ0 χ A χ . (6.23)
2
σ
To re-write (6.23) we recognize that the total change in the fields is due to both the
δ0 variation of the fields at a given space-time point on σ , and also to the variation
induced by the infinitesimal Lorentz transformation of coordinates as σ is displaced
to σ + dσ ; the latter is obtained from
i
χ (x) = Lχ (x) = χ (x) + μν S μν χ (x), (6.24)
2
But
χ (x) = χ(x) − δx μ ∂μ χ (x), (6.25)
and therefore
i
χ (x) − χ (x) = δx μ ∂μ χ (x) + μν S μν χ (x). (6.26)
2
68 6 Relativistic Theory of Fields
The right hand side of (6.26) is then the variation induced in the χα by the coordinate
transformation; the total variation of the fields is then
i
δχ (x) = δ0 χ (x) + δx μ ∂μ χ (x) + μν S μν χ (x). (6.27)
2
Solving for δ0 χ , and substituting into (6.23), we obtain for the generator
1
μ
G(σ ) = dσμ δx μ L + χ A δχ − δχ Aμ χ
2
1
− dσμ χ Aμ ∂ ν χ − ∂ ν χ Aμ χ δxν
2
i
− λν χ Aμ S λν χ − S λν χ Aμ χ . (6.28)
4
∂μ T μν = 0. (6.32)
The generator (6.29) can be split into two parts, one representing changes induced
by the coordinate variation, and the other giving the variation induced by a change
in the field variables,
6 Relativistic Theory of Fields 69
1
μ
Gχ = χ A δχ − δχ Aμ χ ,
dσμ (6.33a)
2
1
G x = dσμ T μν δxν = ν P ν + μν J μν , (6.33b)
2
where
Pν ≡ dσμ T μν , J μν ≡ dσλ x μ T λν − x ν T λμ ; (6.34)
P ν and J μν are the generators for translations and rotations, respectively, and their
commutation relations are determined by the group of transformations they represent.
Specifically, P ν is recognized as the 4-momentum operator, and J μν as the relativistic
generalization of the angular momentum operator.
The field equations are obtained by the vanishing of δ0 L in (6.22),
∂ H ∂r H
2 A μ ∂μ χ = or − 2 AμT ∂μ χ = , (6.35)
∂χ ∂χ
in terms of left and right derivatives, corresponding to the two equivalent ways of
writing G χ :
Gχ = dσμ χ Aμ δχ , or Gχ = − dσμ δχ Aμ χ . (6.36)
If we continued with these two pairs of expressions,we would obtain two forms for
the commutation rules of χ ; their equivalence then leads to the requirement that the
Aμ and χ must decompose:
χ = φ + ψ, Aμ = a μ + s μ , (6.37)
where the a μ are anti-symmetric and real, the s μ are symmetric and imaginary, and
the φ and ψ represent the kinematically independent fields of the Bose-Einstein and
Fermi-Dirac types, respectively. The field equations are then
∂ H ∂r H
2a μ ∂μ φ = = , (6.38a)
∂φ ∂φ
∂ H ∂r H
2s μ ∂μ ψ = =− . (6.38b)
∂ψ ∂ψ
The first of Eq. (6.40) combined with Eq. (6.38b) implies that H must be an even
function of ψ.
The field equations may be written as equations of motion by singling out the
time differentiation,
∂ H
2 A 0 ∂0 χ = − 2 A k ∂k χ . (6.41)
∂χ
1
G x = ν P ν + μν J μν , (6.42)
2
which, when applied to the space-time coordinates xμ , generates the new x μ ,
x = x μ − μ + μν x ν , (6.43)
which is Eq. (6.21). Accompanying this transformation, we have the apparent change
in the fields given by (6.26),
i
− δχ (x) = χ(x) − χ (x) = δx μ ∂μ χ (x) + μν S μν χ (x), (6.44a)
2
6.1 Inference of Particle Properties 71
or
1
− δχ (x) = μ ∂μ + μν x μ ∂ ν − x ν ∂ μ − i S μν χ (x), (6.44b)
2
From (6.45b) the identification of S μν with the intrinsic spin characteristics of the
particle is evident. Considering the time component (6.45a) we obtain the standard
commutator equation of motion
∂χ
χ , P 0 = −i∂ 0 χ = i∂0 χ = i . (6.46)
∂t
To determine the manner in which the particle interpretation enters, we now represent
the fields by Fourier integrals,
μp
χ (x) = (dp)ei x μ χ ( p), (6.47)
where the χ ( p) are operator functions of the numbers pμ . Substituting into (6.45a)
and equating coefficients, we obtain
χ ( p), P ν = p ν χ ( p), (6.48a)
or
P ν χ ( p) = χ ( p) P ν − p ν . (6.48b)
if p 0 < 0. Since χ is Hermitian, χ can be split, in a Lorentz covariant way, into two
parts, χ = χ (+) + χ (−) , where the superscripts indicate the positive and negative
nature of the p 0 terms which enter in the respective Fourier transforms, and where
(χ (+) )† = χ (−) . Then χ (+) applied to a state of definite energy acts as an energy
annihilation operator, and χ (−) as an energy creation operator.
The χ (+) and χ (−) can then be used to create any physical state from the vacuum
state, where we take the latter as the unique lowest-energy state. This vacuum state
must necessarily correspond to the eigenvalue P k = 0, (k = 1, 2, 3), since if
one of the P k were not zero, a rotation of the coordinate system could yield three
non-zero momentum components, requiring (a super-position of) the corresponding
eigenvectors for its description. But this precludes a description of the vacuum by a
single non-degenerate state; we must therefore require that each P k = 0. Since the
vacuum is to have the lowest energy possible, and we may arbitrarily take this to be
zero, we then characterize the vacuum state as that unique state for which P μ = 0,
where χ (+) |0 = 0.
So far we have considered χ as representing general fields; to introduce particle
properties consider the Fourier transform of χ , and imagine the numbers pμ related
by the relation:
− pμ p μ = m 2 , p 0 = ± m 2 + p2 . (6.51)
Then χ (+) , for example, remains an operator which annihilates energy, but now is
correlated with a momentum decrease. This is just the usual particle interpretation;
if χ (+) has this character it may be spoken of as a particle annihilation operator, and
conversely for χ (−) .
and since
∂ H ∂r H
= , (6.53)
∂φ ∂φ
for these variations of fields of the first kind, both terms in the above bracket must
be equivalent, implying that B (1) = B (1)T . Since B (1) is also Hermitian, it is real.
6.2 The Connection Between Spin and Statistics 73
For δψ variations, the identical argument shows that B (2) = −B (2)T , i.e., B (2) is
imaginary. It is precisely here that the connection between spin and statistics arises:
We can construct matrices for B −1 a μ only for particles of integral spin, and for
B −1 s μ only when the particle spin is an integer plus one-half. [See Eq. (6.37).]
Writing the equations of motion (6.35) for either field
∂ H
2 A μ ∂μ χ = = 2Bχ , (6.54)
∂χ
i B −1 Aμ pμ χ ( p) = χ ( p). (6.56)
Now consider the matrix (of finite order) i B −1 Aμ pμ ≡ M, which must satisfy its
algebraic minimal characteristic equation M n + a1 M
n−1 + · · · + an =
0, wherethe
an are numbers. From Eq. (6.20) we know that L −1 B −1 Aμ L = μ ν B −1 Aν . If
we insert the proper combinations L −1 L in each term, and use the relation
L −1 B −1 Aμ pμ L = B −1 Aν μ ν pμ = B −1 Aμ p ν , (6.57)
then the result of a Lorentz transformation, giving us back the identical minimal equa-
tion (as it must), shows that the a j are Lorentz invariants, or functions of invariants.
Since the only 4-vectors available are the pμ , we take a j = a j (− pμ p μ ). Further-
more, the minimal equation must be valid independently of the value of the numbers
pμ , i.e., it must be an algebraic identity in pμ ; the coefficients a j (− p μ pμ ) must
then be of the form (− p μ pμ ) j times a numerical factor c j , which is independent of
the pμ . We can identify the two possible cases, corresponding to the degree of the
minimal equation being either even (n = 2k) or odd (n = 2k + 1); in either case the
power to which (B −1 Aμ pμ ) is raised must decrease in steps of two,
2k
2k−2
n = 2k : i B −1 Aμ pμ + − pμ p μ c1 i B −1 Aμ pμ + ...
+ − pμ p μ ck = 0,
k
(6.58a)
2k+1
2k−1
n = 2k + 1 : i B −1 Aμ pμ + − pμ p μ c1 i B −1 Aμ pμ + ...
+ − pμ p μ ck i B −1 Aμ pμ = 0.
k
(6.58b)
†
B † = B, i Aμ = i Aμ , (6.59a)
†
and i B −1 Aμ pμ = i Aμ B −1 pμ = B i B −1 Aμ pμ B −1 . (6.59b)
Taking the adjoint of the minimal equation then corresponds to making a similarity
transformation on the matrices and complex conjugating every c j term. Since each
B can be combined with a B −1 term, and since we must still have the same unique
minimal equation, it follows that each c j = c∗j . All of the above is a direct inference
from the requirement of Lorentz invariance.
If we now apply (6.58a) and (6.58b) to the field χ ( p), and use (6.56) we obtain
for either case
2
k
1 + − pμ p μ c1 + − pμ p μ c2 + · · + − pμ p μ ck χ ( p) = 0. (6.60)
1 μ ν
γ , γ = −g μν (6.65)
2
as a necessary condition for Eq. (6.62) to be satisfied algebraically by pμ .
Before proceeding further, we remark that the construction of 5 matrices, satisfy-
ing (6.65), each of a definite symmetry (to represent the γ μ and γ 5 = γ 0 γ 1 γ 2 γ 3 ),
6.3 Fermi-Dirac Fields of Spin 1/2 75
can be achieved in only one way: Three of the matrices must be symmetric, and the
remaining two antisymmetric. This statement, which is easily verified, together with
(6.62) shows that we can construct matrices satisfying (6.65) only for Fermi-DIrac
fields, as follows: Since α μ is real, and β imaginary, α μ = βγ μ = −βγ μ∗ , i.e., all
the γ μ are imaginary. From (6.65) we note that
2
2
γ 0 = −g 00 = +1 γ k = −g kk = −1. (6.66)
We are looking for matrices of definite symmetry; equating j and , (6.67) can be
satisfied only if n0 j = − 0jn , nk j = + kjn ; i.e., γ 0 is antisymmetric (and imag-
inary, and therefore Hermitian), and the three γ k . are symmetric (and imaginary,
and therefore
skew-Hermitian).
γ 5 is then antisymmetric and real. It then follows
that β, γ = β, γ = 0, and since β and γ 0 have the same properties, we may
k 0
identify them: β = γ 0 .
If we now attempt to repeat this for Bose-Einstein fields, then the α μ are anti-
symmetric and imaginary, β is symmetric and real, all the γ μ are the same as for the
Fermi-Dirac case, and again β, γ = 0. But this is a direct violation of the require-
k
ments that there be but three independent symmetric matrices, satisfying (6.66).
Equation (6.65), from which the spin 21 formalism is obtained, therefore refers only
to Fermi-Dirac fields.
The Lagrangian for the Fermi-Dirac spin 21 field is then
1 m
L= ψ, iα μ ∂μ ψ − [ψ, βψ] . (6.68)
2 2
More precisely, this is the Lagrangian for the uncharged Dirac-Majorana field—in
order to represent charge we shall later have to double the number of components
of ψ. For notation, we introduce the use of the dot · to symbolize the proper sym-
metrization brackets to be used for Fermi-Dirac or Bose-Einstein fields. Then
where, since ψ is Hermitian, the adjoint of these equations must, and do, yield
identical equations.
To obtain the commutation relations, we use:
G ψ = dσμ χ Aμ δχ → dσ0 ψs 0 δψ, (6.71)
i
χ , G χ = δψ(x), (6.72)
2
or i
dσ0 ψ(x), ψ(x )s 0 δψ(x ) = δψ(x), (6.73)
2
where the factor 21 enters because the ψ are “non-canonical,” and [ ] → { } since δψ
anti-commutes with ψ. This then yields
i 0
−1 (0)
ψ(x), ψ(x ) = s δ (x − x0 ), (6.74)
2
or
1
ψα (x), ψβ (x ) = δαβ δ (0) (x − x ), (6.75)
2
where x and x are points on the same space-like surface, and (s 0 )−1 = −i.
From this equation, which is an identity in pμ , we can make statements only about
the symmetric part of the matrix products. If we completely symmetrize the matrix
factor with respect to μ, σ , and ν, it must vanish. This is the sum of three terms of
the form
β μ β σ β ν + β ν β σ β μ = −g μσ β ν − g νσ β μ , (6.77)
6.4 Spin Zero and One 77
which are the familiar Kemmer-Duffin commutation relations (Petiau 1936; Duffin
1938; Kemmer 1939) which are used to describe a particle of spin zero and one. All
the β matrices are singular. These matrices have 126 independent elements and hence
are reducible to three sub-matrices of dimensionality 10, 5, and l; the sub-matrix of
dimensionality 1 is trivially the null matrix; the one of dimensionality 5 and rank 2
describes a particle of spin 0; the matrix of dimensionality 10 and rank 6 describes
a particle of spin 1.
The Lagrangian can be written, by choosing B = m, as
L = φ· iβ μ ∂μ φ − mφ· φ, (6.78)
which verifies what has been put into the theory as the connection between energy
and momentum. Thus each component of φ satisfies the Klein-Gordon equation.
The generator is
G φ = −i dσ0 φβ 0 δφ. (6.81)
which yields
φa (x), β 0 φ(x ) (0)
= −δab δ (x − x ), (6.83)
b
where δab is a diagonal matrix having six ones and four zeros along the diagonal,
which refer to the independent and dependent components, respectively. This equa-
tion cannot be solved for the commutation relations because β 0 is a singular matrix.
It eliminates the dependent components
of φ from the commutation relations.
If we multiply the field equation by 1 − (β 0 )2 and use the commutation rela-
tions, we find
β k β 0 β 0 ∂k + im 1 − β 0 β 0 φ = 0, (6.84)
78 6 Relativistic Theory of Fields
which is independent of time, and is the equation expressing the dependent compo-
nents of φ in terms of the independent components.
Let us re-examine the spin zero and spin one fields. Integral spin fields may be
described in terms of ordinary tensors. They do not require the special spinor proper-
ties of the half-integral spin fields. We have seen that the spin zero representation of
the Kenner-Duffin algebra has a dimensionality of five. If the only tensor in addition
to the field components which is introduced to form bilinear products in a scalar
Lagrangian is the four-divergence, we must describe the field with tensors differing
in rank by one. We shall construct the spin zero field with a scalar and four-vector
as the necessary 5 components, and the spin one field with a four-vector and an
anti-symmetrical second rank tensor as representing the 10 components.
1
m 2
L= φ· ∂μ φ μ − φ·μ ∂μ φ − φ − φμ φ μ . (6.85)
2 2
The field equations are determined by varying φ and φ μ ,
which imply
1 ν μ
∂ μ ∂μ φ = m∂ μ φμ = m 2 φ, ∂ μ ∂μ φ ν = ∂ ∂ ∂μ φ = m 2 φ ν , (6.87)
m
again yielding the Klein-Gordon equation.
The generator is
1
1
G= dσμ φδφ μ − φ μ δφ → dσ0 φδφ 0 − φ 0 δφ . (6.88)
2 2
Thus only φ and φ 0 are independent variables. This is reflected in the fact that the
Klein-Gordon equation is second order, and hence we must specify both the wave
function and its time derivative.
The equations of motion have to be examined to see if the other components of
the field are determined. We have from the field equations, as equations of motion,
∂0 φ = mφ0 ∂0 φ 0 = mφ − ∂k φ k , (6.89)
6.5 Spin Zero 79
and the following which is not an equation of motion, but does show that the φ k are
determined in terms of the two independent components,
∂k φ = mφk . (6.90)
From the generator we see that we have one set of canonically conjugate variables,
φ and φ0 . This means that the field has only one internal degree of freedom, and
must be a spin zero field. This pair of conjugate variables must obey the canonical
commutation relations (at equal times), as is easily verified by using the generators
for their respective change
φ(x), φ(x ) = φ 0 (x), φ 0 (x ) = 0, φ 0 (x), φ(x ) = iδ (0) (x − x ), (6.91)
1 μ
φ(x), φ(x ) = 0 φ (x), φ(x ) = δ (μ) (x − x ),
i
1 μ 1
φ (x), φ ν (x ) = − ∂ μ δ (ν) (x − x ) + ∂ ν δ (μ) (x − x ) . (6.93)
i m
The description of a spin one field requires the use of a ten-component wave function.
As we shall see, not all of these components are independent. We shall use a four-
vector and an anti-symmetrical tensor of rank two. The Lagrangian is chosen to
be
1
μν μν
m μ 1 μν
L= φ ν · ∂μ G − G · ∂μ φ ν − φ φμ − G G μν . (6.94)
2 2 2
The factor of 1/2 in the last term is inserted because the unrestricted sum over μ and
ν counts each component twice. The field equations are
∂μ φν − ∂ν φμ = mG μν , ∂μ G μν = mφ ν . (6.95)
80 6 Relativistic Theory of Fields
The generator is
1
G= dσ0 φk δG 0k − G 0k δφk . (6.96)
2
Thus only six of the ten components can be varied independently, φk and G 0k . Let
us see if they have equations of motion and if the other 4 components are determined
in terms of the independent ones. The equations of motion are
∂0 φk = ∂k φ0 + mG 0k , ∂0 G 0k = mφ k − ∂l G lk , (6.97)
∂k φ − ∂ φk = mG k , mφ 0 = ∂k G k0 = −∂k G 0k , (6.98)
which determine the four dependent field components. Thus all ten components of
the field are determined.
From the generator we see that we have 3 sets of canonically conjugate field
variables; thus the field has 3 internal degrees of freedom. This corresponds to the
three sub-states of a spin one field.
The commutation relations obeyed by the field components are obviously the
canonically conjugate relations,
φ k (x), φ (x ) = G 0k (x), G 0 (x ) = 0,
1 0k
G (x), φ (x ) = δk δ (0) (x − x ). (6.99)
i
The commutation relations obeyed by the dependent components can be realized
from the above, by using their definitions in terms of the independent components.
For instance,
1 0 1 1
φ (x), φ (x ) = − δk ∂k δ (0) (x − x ) = − ∂ δ (0) (x − x ). (6.100)
i m m
These relations can be generalized to refer to an arbitrary coordinate system,
1 μ 1 μ (ν)
φ (x), φ ν (x ) = − ∂ δ (x − x ) + ∂ ν δ (μ) (x − x ) ,
i m
1 μν μ
G (x), φλ (x ) = −δλν δ (μ) (x − x ) − δλ δ (ν) (x − x ),
i
1 μν 1 μλ ν (κ)
G (x), G λκ (x ) = − g ∂ δ (x − x ) + ∂ κ δ (ν) (x − x )
i m
plus antisymmetrical terms . . . . (6.101)
6.7 Electromagnetic Field 81
The electromagnetic field is a spin one, massless field. A re-definition of the field
variables in the limit as the mass approaches zero will be made in the spin-one
Lagrangian. Set
√ 1
mG μν ≡ F μν , √ φν ≡ A ν . (6.102)
m
1
1
L= Aν· ∂μ F μν − F·μν ∂μ Aν + F μν Fμν . (6.103)
2 4
The field equations are
∂μ Aν − ∂ν Aμ = Fμν , ∂μ F μν = 0, (6.104)
Thus, as in the case of a spin one non-zero mass field, only six of the field components
can be varied independently. Their equations of motion are
We must now examine the rest of the field equations to see if the dependent
components are determined,
∂k A − ∂ Ak = Fk , ∂k F 0k = 0. (6.107)
Thus F kl is determined, but because of the vanishing of the photon mass, A0 is not
determined and a further restriction is placed on the electric field, F 0k . This is exactly
the freedom of making a gauge transformation, which is allowed by the vanishing
of the photon mass. That is,
Aμ → Aμ + ∂μ . (6.108)
It is convenient to split the vector part of Aμ into a longitudinal part, which is the
gradient of a scalar, and a transverse part, which is the curl of a vector. Thus we see
that the gauge transformation affects only the longitudinal part of Ak . F 0k is purely
transverse because it has no divergence.
The equations of motion also break up into longitudinal and transverse components,
The first equation is entirely consistent with the possibility of making a gauge trans-
formation which implies
ALk = ∂k , A0 = ∂0 . (6.110)
By setting
Ak = ∂k + ATk (6.111)
Since F 0k and ATk appear as canonical variables, we can impose the canonical
commutation relations
ATk (x), AT (x ) = F 0k (x), F 0l (x ) = 0, (6.113a)
T
F 0k (x), AT (x ) = δk δ (0) (x − x ) , (6.113b)
where only the transverse component of the term on the right side of (6.113b) can
appear. If we define a term that subtracts off the longitudinal part of the delta function,
the commutation relation can be written as
F 0k (x), AT (x ) = δk δ (0) (x − x ) − ∂ k ∂ D0 (x − x ). (6.114)
∇ 2 D0 (x − x ) = −δ (0) (x − x ), (6.116)
or
1
D0 (x − x ) = , (6.117)
4π |x − x |
which shows that the longitudinal part of Ak is intimately connected with the
Coulomb potential.
6.8 Introduction of Charge 83
It was previously mentioned that the description of charge requires a doubling of the
number of component fields χ .
The simplest case to consider is the case of one internal degree of freedom. Calling
the two basic fields χ(1) and χ(2) , L [Eq. (6.8)] becomes
Since both χ(1) and χ(2) have identical space-time properties, the kinematical portion
of L is invariant under rotations and reflections in a new two-dimensional space
composed of the components χ(1) and χ(2) in accordance with the sum-of-squares
notation of (6.118). The rotations in this space, given by
χ (1) = cos λχ(1) + sin λχ(2) , χ (2) = − sin λχ(1) + cos λχ(2) , (6.119)
where
χ(1) 0 −i
χ= q= . (6.121)
χ(2) i 0
or
χ (1) = χ(2) , χ (2) = −χ(1) . (6.123)
Re-defining Aμ as
Aμ 0
, (6.124)
0 Aμ
L = χ· Aμ ∂μ χ − H(χ ), (6.125)
where we have assumed that H has the required invariance under reflections and
rotations in this space. Use of (6.120) and q T = −q is then sufficient to guarantee
the invariance of L under rotations. The reflection transformations in (6.122) can be
compactly written as
π
χ = Cei 2 q χ χ = Cχ , (6.126)
84 6 Relativistic Theory of Fields
where
01
C= (6.127)
10
and incorporating this infinitesimal variation into the action principle will yield a
quantity which is conserved. The notation will suggest the interpretation of this
quantity as electrical charge, but until the manner in which it appears in interactions
between the various fields is specified, the nature of this charge is irrelevant.
If we now imagine that δλ of (6.128) is a continuous function of space time, the
finite change in λ which builds up between an initial t (σ2 ) and final t (σ1 ) times
corresponds to different successive choices of χ . The stipulated invariance of L then
implies that δλ L = 0. If δλ were constant, δλ L would vanish trivially, in accordance
with (6.120) and (6.126); we obtain something new since the dependence of δλ on
x μ introduces terms depending on ∂μ δλ:
δλ L = −iχ· Aμ qχ ∂μ δλ ≡ j μ ∂μ δλ = ∂μ j μ δλ − δλ∂μ j μ , (6.129)
where
The action principle now requires that the coefficient of δλ vanish, yielding the
differential conservation law
∂μ j μ = 0. (6.131)
The term ∂μ [ j μ δλ] gives the generator for changes on the boundary space-like
surfaces σ1 . σ2 . Taking δλ as constant over each surface, which corresponds to
different, but definite, choices of the χ on σ1,2 , we have
G λ = δλ dσμ j μ ≡ Qδλ. (6.132)
Further, if δλ(σ1 ) = δλ(σ2 ) then it is obvious that we get the same description as we
would have obtained with δλ = 0 in both surfaces, i.e.,
This result also follows from (6.131); the quantity Q, called the “charge", is therefore
conserved.
6.8 Introduction of Charge 85
1 1
δχ = [χ , G] = [χ , Qδλ] = −iqχ δλ, (6.134)
i i
or
[χ , Q] = qχ , (6.135a)
or
χ(1) , Q = −iχ(2) , χ(2) , Q = iχ(1) . (6.135b)
1
[χ , Pν ] = ∂ν χ
i
ν
e−iα·P χ (x)eiα·P = eα ∂ν χ (x) = χ (x + a). (6.137)
There must also exist a unitary operator, in analogy with (6.136), which accom-
plishes the improper transformations of Eq. (6.123):
C −1 χ(1) C = χ(1)
C −1 χ C = cχ , or . (6.138)
C −1 χ(2) C = −χ(2)
C −1 QC = −Q. (6.139)
Thus, C has the interpretation of a charge reflection operator. Since C 2 = +1, its
eigenvalues are C = ±1. From (6.139) we see that [Q, C] = 0 and the two opera-
tors cannot be simultaneously diagonalized, (in the representation of χ(1) , χ(2) , C is
diagonal) except for states of zero charge. If we arbitrarily assign the vacuum state
the eigenvalue C = +1 , then this state of zero charge, Q = 0, has both a definite
charge and charge symmetry.
To obtain a state (other than Q = 0 ) of definite charge, where the operator Q is
diagonalized, we must utilize non-Hermitian operators. Define
C −1 χ± C = χ∓ . (6.142)
showing that χ+ creates a state for which the eigenvalue of Q is Q −1, and conversely
for χ− These, then, represent charge annihilation and creation operators respectively.
If we imagine that every physical state can be created by the action of these operators
on the vacuum |Q = 0 then the only possible values of Q are 0, ±1, ±2, …etc.
(Note that charge annihilation represents the destruction of positive charge or the
creation of negative charge, and vice versa.) To connect this with our previous particle
description in Sect. 6.1, where we had χ (+) and χ (−) , we now obviously have the
four possibilities:
(+)
• χ+ destroys a particle and decreases charge by 1,
(−)
• χ− creates a particle and increases charge by 1.
(−) † (+)
These are inverse operations: χ− = χ+ .
(+)
• χ− destroys a particle but increases charge by 1.
(−)
• χ+ creates a particle but decreases charge by 1.
(+) † (−)
These are also inverse operations: χ− = χ+ . This exhaustive description
permits us to describe particles that carry charge.
In terms of the non-Hermitian variables defined by (6.140), the Lagrangian (6.118)
becomes
1
L= χ−· Aμ ∂μ χ+ + χ+· Aμ ∂μ χ− − H, (6.144)
2
and the generator is
1
G= dσμ χ− Aμ δχ+ + χ+ Aμ δχ− , (6.145)
2
which shows that χ+ and χ− are a canonically conjugate set of field variables. The
current (6.130) is
i
jμ = χ+ A μ χ − − χ − A μ χ + . (6.146)
2
For variables of the second kind the commutation rules are
6.8 Introduction of Charge 87
χ+ (x), χ− (x ) A0 = iδ (0) (x − x ),
χ± (x), χ± (x ) = 0. (6.147)
1 † μ
L= ψ· α i ∂μ ψ − i ∂μ ψ † α μ ψ − mψ·† βψ. (6.150)
2 ·
Setting
α μ = βγ μ , ψ † β = ψ, (6.152)
the Lagrangian is
1
L= ψ · γ μ i∂μ ψ − i∂μ ψ · γ μ ψ − mψ · ψ. (6.153)
2
If we wish to interpret j μ as the electric current, we can write down the Lagrangian
for the electromagnetic field plus the charged Dirac field and specify the coupling
so that the usual field equations arise, in which j μ acts as the source for the vector
potential. The form of the coupling term is of course limited by Lorentz invariance
and the spin 21 and spin 1 algebra. Thus the most general Lagrangian which can be
formed in this case is
88 6 Relativistic Theory of Fields
1
L= ψ · γ μ i∂μ ψ − i∂μ ψ · γ μ ψ − mψ · ψ
2
1
+ Aν· ∂μ F μν − F·μν ∂μ Aν + F μν Fμν
4
μ 1 μν
+ e A· jμ + μF· ψ · σμν ψ . (6.154)
2
The symmetrization between Aμ and j μ is necessary because in general, the sources
for the Aμ are partly the jμ and hence these need not commute. The last term, a
Pauli moment, even though it appears covariant, may, in fact, not be covariant due to
its operator properties. These show up only in higher order terms in the perturbation
expansion.
The Lagrangian given by Eq. (6.154) is invariant under the rotation and reflection
of χ(1) and χ(2) , which correspond to the replacements
1
ψ → eiλ(x) ψ, ψ → e−iλ(x) ψ, Aμ → Aμ + ∂μ λ(x), (6.155)
e
and to the interchange of ψ and ψ, respectively. Note that in the second case—that
of reflection—we must also replace Aμ → −Aμ , Fμν → −Fμν , in order to have L
unaltered.
We obtain the equations of motion and generators by variations
1
δψ : γμ ∂μ − e Aμ· + m ψ = 0, (6.156a)
i
μ
δψ : ψ γ i∂μT − e Aμ· + m = 0, (6.156b)
δ Fμν : ∂μ Aν − ∂ν Aμ = Fμν , (6.156c)
μν μ μ
δ A μ : ∂ν F = eψ · γ ψ ≡ j . (6.156d)
∂0 Ak = ∂k A0 + F0k , Fk = ∂k A − ∂ Ak ,
∂0 F k0
= j − ∂ F ,
k k0
∂k F 0k = j 0 . (6.159)
Since we have the freedom of making a gauge transformation, wherein the longitu-
dinal components of Aμ are then arbitrary, it is advantageous to re-write equations
(6.159) in terms of transverse (T) and longitudinal (L) components:
demonstrating that the longitudinal part of the electromagnetic field is not a kine-
matically independent quantity, but depends on the j 0 of all the charged particles
present. Note that item (5) is not an independent statement, since it is the result of
the conservation law for j μ (x). Since ALk = ∂k (x), using item (5) shows that
A0 = ∂0 (x) + dσ D(0) (x − x ) j0 (x ), (6.162)
By the addition of a surface term, the last term in the integrand in (6.163) becomes
equal to = j 0 δ, which does not refer to the kinematically independent part of the
electromagnetic field, and gives the generator for the change in the Dirac field when
an infinitesimal gauge transformation is made—See Eq. (6.132). The electromagnetic
field commutation rules are again as given in equations (6.113a) and (6.113b); the
anti-commutation relation for the Dirac fields works out to be
ψ(x), ψ(x ) = γ 0 δ (0) (x − x ) = −γ0 δ (0) (x − x ) → −γμ δ (μ) (x − x ). (6.164)
C −1 ψC = ψ † , C −1 ψ † C = ψ. (6.165)
90 6 Relativistic Theory of Fields
In order to maintain the invariance of L, we must now also require that this unitary
operator reverse the sign of the Aμ .
C −1 Aμ C = −Aμ . (6.166)
Again, C 2 = +1, and therefore its eigenvalues are C = ±1. From the arguments of
the preceding section, use of the transformation:
C −1 QC = −Q (6.167)
permits a classification, in terms of the quantum number c , for systems of zero net
charge. For example, consider the production of photons by repeated application of
the operator A (actually A(−) , a photon creation operator) to the vacuum state |0,
characterized by c = +1, and assume that no coupling terms are present in L (i.e.,
e = 0). If A is applied n times to |0 then by (6.166)
and we have a state |(A · · · A ) whose eigenvalue of C depends on the (even or odd)
number of photons present: C = (−1)n = ±1. Whichever +1 or -1 eigenvalue of c
we begin with persists as the coupling is turned on, even though what is now called
the state of n photons is a superposition of many different states.
As an example of this, let us consider the decay of positronium. In the absence of
the coupling term between Aμ and ψ (i.e., e = 0), the system of (e+ + e− ) is stable,
and has a definite eigenvalue c . When the coupling is introduced, the only permitted
final states will be those of the same eigenvalue. From our previous interpretation,
ψ is the operator which creates a particle e− and annihilates
e+ , and conversely
for
ψ † . To obtain a state with e− at x and e+ at x , we form ψ † (x)· ψ(x ) |0, and
recognize that the actual state is a superposition of these with the amplitude wave
function ψ(x, x ). Using (6.165) we then have
C|e+ e− = C (d x)(d x )ψ(x, x ) ψ † (x)· ψ(x ) |0
=− (d x)(d x )ψ(x, x ) ψ † (x )· ψ(x) |0
=− (d x)(d x )ψ(x , x) ψ † (x)· ψ(x ) |0. (6.169)
|a , b | = |a b | (7.1)
represents a idealized process in which the state b is annihilated and a new state a
is produced. Wouldn’t it be useful to separate these processes? Recall how oscillator
states, as discussed in Chap. 4, were created by forces. We generalize to sources which
can create and destroy particles. A scattering process, which occurs in a more or less
localized scattering region, can be abstracted into a two-stage process, in which first
an incoming particle is absorbed, and then a new particle is created, as sketched
in Fig. 7.1. In the individual processes the source K acts to create or annihilate the
particle.
What does this mean quantum mechanically? We must describe the processes
by probability amplitudes; we want to construct everything from the one particle
production mechanism,
1p+ |0− K , 0+ |1p− K , (7.2)
where the first amplitude represents the process in which a single particle state of
momentum p is created, where before the source acts only the vacuum state is present,
while the second represents the process in which a single particle of momentum p is
absorbed after which the vacuum state is present.
Because the processes occur in space and time, the source must be a function
K (r, t), which exhibits a certain degree of localizability. How does the effectiveness
of the source vary with different degrees of freedom? The complementary measure is
the corresponding function in momentum space K (p, E), where, nonrelativistically,
E = p 2 /(2m). We expect the relationship between the production amplitude and
the source function to be, at least for a weak source,
(dp)
1p+ |0− K = (−i)K (p). (7.3)
(2π )3
Here (dp) = dp1 dp2 dp3 . The −i factor is purely conventional for later convenience.
The square root of the momentum-space element is present to properly account for
the density of states in the continuum picture. To compute the annihilation amplitude
0+ |1p− , we can use orthogonality, 1p− |0− = 0. This must be maintained by the
dynamics. The completeness relation
1 = |0+ 0+ | + |1p+ 1p+ | + |1p+ 1p + 1p+ 1p + | + . . . , (7.4)
p p,p
implies
0 = 1p− |0+ K 0+ |0− K + 1p− |1p + K 1p + |0− K + . . . , (7.5)
p
where we shall consider a weak source, so we will drop the higher terms. To lowest
order (in powers of the source)
so
0 = 1p− |0+ K + 1p+ |0− K , (7.7)
7 Nonrelativistic Source Theory 93
or ∗
0+ |1p− K = − 1p+ |0− K . (7.8)
K̄ (r, t) = K (r + R, t + T ). (7.10)
Relativity (here Galilean) means that the same effect occurs by displacing the space-
time coordinate system to which the initial and final states are referred, the generator
of such an infinitesimal displacement being
G = P · δ − H δt, (7.11)
U = eiP·R−iH T . (7.12)
0−
Now using the one-particle creation and annihilation amplitudes (7.9), together with
the Fourier transform (7.17) we see that the one-particle exchange term here is
(dp) ip·(r−r )−iE(t−t )
−i (dr)dt (dr )dt K 1∗ (r, t) −i e K 2 (r , t ). (7.20)
(2π )3
But the source is a unitary whole—results can depend only on the total source K
and not its parts. This is a statement of the uniformity of space and time. This will
introduce terms that will refer to each component source separately; what we don’t
want is a term that involves K 2∗ and K 1 ; therefore, we define the retarded Green’s
function
(dp) ip·(r−r )−iE(t−t )
G(r − r , t − t ) = −iη(t − t ) e , (7.21)
(2π )3
7 Nonrelativistic Source Theory 95
0−
Now we want to remove the restriction to weak sources. Suppose we have a beam
of noninteracting particles, detected for example by spatially separated sources, as
illustrated in Fig. 7.3. Each particle is produced and detected by a single pair of weak
sources K (α) , α = 1, 2, 3, . . . . There is no interaction between different pairs of
sources. Thus the vacuum persistence amplitude for this arrangement is
(α)
0+ |0− K = 1 − i (dr)dt (dr )dt K ∗ (r, t)G(r − r , t − t )K (r , t ) ,
α
(7.24)
because each source is weak. But only K , not K (α) should enter; physics shouldn’t
depend on the channel. So
(K ∗ G K )(α) K (α)∗ G K (α) K ∗G K
0+ |0− K = e−i = e−i α = e−i , (7.25)
α
where the last step depends upon the arrangement that prohibits cross coupling
between the component sources. This looks just like the structure we saw for the
harmonic oscillator (4.91), except now the integrals are over space as well as time:
96 7 Nonrelativistic Source Theory
K2
0−
(dr)dt (dr )dt K ∗ (r,t)G(r−r ,t−t )K (r ,t )
0+ |0− K = e−i . (7.26)
We will apply this to generalized (not weak) emission and absorption processes, but
still assuming that the particles are not interacting.
Figure 7.4 shows the exchange of noninteracting particles between causally sep-
arated sources, K = K 1 + K 2 . The total vacuum persistence amplitude is
K 1∗ G K 2
0+ |0− K = 0+ |0− K 1 0+ |0− K 2 e−i . (7.27)
The fact that there is a causal relation between the two sources means that
(dp) ip·(r−r )−iE(t−t )
G(r − r , t − t ) = −i e . (7.28)
(2π )3
∞ (−iK ∗ )n p (−iK ∗ )n p
1p 2p
0+ |0− K = 0+ |0− K 1 0+ |0− K 2 , (7.32)
p n =0 n p ! n p !
p
where the occupation numbers in a given momentum cell are given by {n} = {n p }.
From this we infer the probability amplitudes for producing and absorbing particles
by a strong source:
(−iK p )n p
{n}|0− K = 0+ |0− K , (7.33a)
p n p !
(−iK p∗ )n p
0+ |{n} K = 0+ |0− K . (7.33b)
p n p !
Independently,
|0+ |0− | = exp − (dr)dt (r )dt K ∗ (r, t)iG((r − r , t − t )K (r , t )
K 2
∗ ∗
− (dr)dt (r )dt K (r, t)[iG((r − r, t − t)] K (r , t ) .
(7.35)
which defines new objects which refer to the pre-existing situation. Here
ψ(r, t) = (dr )dt G(r − r , t − t )K (r , t) (7.40)
ψ ∗ (r, t) = (dr )dt K ∗ (r , t)G(r − r, t − t)K (r , t ); (7.41)
G being the retarded Green’s function. The differential equation satisfied by the
Green’s function (7.23) is
∂
i − T G(r − r , t − t ) = δ(r − r )δ(t − t ), (7.43)
∂t
where T = −∇ 2 /(2m) is the kinetic energy differential operator. Therefore, the field
ψ satisfies
∂
i − T ψ(r, t) = K (r, t). (7.44)
∂t
This is analogous to the equation satisfied by the harmonic oscillator variable (4.65),
or
d
i − ω y(t) = K (t). (7.45)
dt
Now because
∂
−i − T G(r − r, t − t) = δ(r − r )δ(t − t ), (7.46)
∂t
which is the complex conjugate equation. The boundary conditions are different in
the two cases: ψ is a retarded solution, while ψ ∗ is an advanced solution.
What does W have to do with action? Let us write the alternative forms
7 Nonrelativistic Source Theory 99
W =− (dr)dt K ∗ (r, t)ψ(r, t) = − (dr)dt ψ ∗ (r, t)K (r, t)
∗ ∂
= − (dr)dt ψ (r, t) i − T ψ(r, t). (7.48)
∂t
But the definition (7.40) of ψ and ψ ∗ shows that the last variation is zero, which is
a statement of the stationary action principle:
∂ ∗
δ ψ,ψ ∗ W = (dr)dt δψ i −T ψ−K
∂t
∂ ∗ ∗
+ (dr)dt δψ −i − T ψ − K = 0. (7.51)
∂t
That is, the stationary action principle, that W is unchanged under infinitesimal field
variations, supplies the equations of motion (7.44) and (7.47).
Fundamental to physics is the notion of the uniformity of space and time, that the
laws of physics are independent of the locale. This is reflected in the indistinguisha-
bility of identical particles, which, in turn, is reflected in the probability amplitude
of a source producing a multi-particle distribution, Eq. (7.33a), which says all that
is possible. Experimentally, we know of two kinds of statistics. Here n p is unlim-
ited, so this cannot refer to Fermi-Dirac statistics. In the Bose-Einstein case we have
stimulated emission. Let us see this.
Figure 7.5 shows the interchange of noninteracting particles between sources K 2
and K 1 , but now with a weak source K 0 in between. The total source is composed
of three causally separated pieces,
K = K1 + K0 + K2. (7.52)
K0
K2
0−
Compare this with the multi-particle exchange description between the three sources,
0+ |0− K = 0+ |{n}− K 1 {n}+ |{n}− K 0 {n}+ |0− K 2 . (7.55)
{n}{n}
Now recall the connection between K p and {n}|0− K , Eq. (7.33b), so we infer
∗
0+ |{n} K 1 (−iK 1p )= n p + 10+ |{n} + 1p K 1 . (7.56)
In the last factor, the 1 represents spontaneous emission, and the n p is the enhance-
ment effect of stimulated emission.
The corresponding analysis in the absorption case gives
{n}|{n + 1p } K = −iK p∗ n p + 1, (7.59)
or
√
{n − 1p }|{n} K = −iK p∗ n p , (7.60)
that is, the probability of absorbing one particle is proportional to the incident inten-
sity.
Let us come back to space and time. Adopting a more telegraphic notation, we
can write the vacuum persistence amplitude as
∗
0+ |0− K = e−i K G K = 1 − i d1 d1 K ∗ (1)G(1 − 1 )K (1 )
1
− d1 d1 d2 d2 K ∗ (1)K ∗ (2)G(1 − 1 )G(2 − 2 )K (1 )K (2 ) + . . . .
2
(7.61)
Here the numbers represent space-time points, 1 = r1 , t1 , etc. Because the product
of sources is symmetrical, we can replace
1
G(1 − 1 )G(2 − 2 ) → [G(1 − 1 )G(2 − 2 ) + G(1 − 2 )G(2 − 1 )], (7.62)
2
Diagrammatically, the two terms can be represented as in Fig. 7.6. This builds in the
symmetry in the labels. The third term in Eq. (7.61) is
1
− d1 d2 K ∗ (1)K ∗ (2)ψ(1)ψ(2), ψ(1)ψ(2) = ψ(1, 2) = ψ(2, 1), (7.63)
2
Fig. 7.6 Exchange of two particles between spatially and temporally separated sources. Bose-
Einstein symmetry implies that the particles are exchanged between either spatially separated source.
No interaction is to be inferred where the lines cross
102 7 Nonrelativistic Source Theory
7.1 Interactions
Real particles interact with each other. Thus we should have processes such as
sketched in Fig. 7.7. The particles emitted by the sources are scattered by inter-
actions represented by the black box labeled T. Let us begin with a simpler situation,
scattering from a fixed center, for example, a heavy nucleus. This process can be
represented with only two sources, as shown in Fig. 7.8. This diagram schematically
represents the tracks of particles as seen in a detector. In the absence of interactions
W = − K ∗G0 K , (7.64)
where now we have used the superscript 0 to designate the free particle propagator
or Green’s function previously denoted simply by G. This process in represented
by the dotted line in Fig. 7.8. Now we want to add something to this, the scattering
process, represented generically by T. From a source point of view, both processes
contribute to the vacuum amplitude
W = − K ∗ G 0 K − K ∗ G 0 TG 0 K , (7.65)
T
7.1 Interactions 103
for the same reason as before. Because we have a well-defined causal situation,
ψ 0 (1) = G 0 (1 − 1 )K (1 ), t1 > t1 , (7.68)
we can write
(dp) ip·r−iEt
ψ (r, t) =
0
e (−i)K p ≡ ψp (r, t)(−i)K p . (7.69)
p
(2π ) 3
p
Similarly,
ψ (1) =
0∗
K ∗ (1 )G 9 (1 − 1), t1 < t1 (7.70)
104 7 Nonrelativistic Source Theory
V T
implies
(dp) −ip·r+iEt
ψ (r, t) =
0∗
e (−i)K p∗ ≡ ψp∗ (r, t)(−i)K p∗ . (7.71)
p
(2π ) 3
p
The term iW is the only term which describes the process being considered. The
vacuum amplitude includes
∗
0+ |1p K 1p+ |1p − 1p + |0 K ≈ (−iK p∗ )1p+ |1p − (−iK p ). (7.72)
p p
where ψ 0 = G 0 K and Tψ 0 may be thought of as an effective source. In this way
we get a superposition of effects. Alternatively, we may emphasize the last scattering
act,
ψ = ψ + G 0 V ψ,
0
(7.76)
So we see, indeed, that V is the potential energy. Thus, we indeed have a multiple
scattering process.
This also generalizes the concept of the Green’s function. If we write
ψ= GK, (7.81)
But here, the particles do their thing, and the scattering is represented by
1
W4 = − (K ∗ G 0 )1 (K ∗ G 0 )2 T(12, 1 2 )(G 0 K )1 (G 0 K )2 . (7.86)
2
Again, the scattering amplitude 1p1 1p1 |1p1 1p2 is given simply in terms of T.
Moreover, we want to analyze the scattering in terms of elementary processes, as
sketched in Fig. 7.10. The terms in the vacuum amplitude of interest are contained
in ei(W2 +W4 ) which when expanded contains the following terms with four sources:
1 2 1 ∗ 0 ∗ 0 i
− W2 + iW4 = − K ψ K ψ − (K ∗ G 0 )(K ∗ G 0 )T(G 0 K )(G 0 K )
2 2 2
1
=− K ∗ (1)K ∗ (1 )ψ(1, 2), (7.87)
2
V T
7.1 Interactions 107
where
ψ(1, 2) = ψ (1)ψ (2) + i
0 0
G 01 G 02 Tψ 0 ψ 0 . (7.88)
This says that the source detects what comes directly, and what comes from the
scattering process. On the other hand, we can look at this from the last scattering
point of view,
ψ(1, 2) = ψ 0 (1)ψ 0 (2) + i G 01 G 02 V ψ(1 , 2 ), (7.89)
where V is the measure of the single-scattering process. Again one can iterate. Look
at the corresponding differential equation,
∂ ∂
i − T1 i − T2 ψ(1, 2) = K (1)K (2) + i(V ψ)(1, 2), (7.90)
∂t1 ∂t2
where a matrix notation is used in the last term. This can be written as
∂ ∂
i − T1 i − T2 − iV ψ(1, 2) = K (1)K (2). (7.91)
∂t1 ∂t2
V (1, 2; 1 , 2 ) = δ(t1 − t2 )δ(t1 − t2 )δ(t1 − t1 )V12 (r1 − r2 , r1 − r2 ). (7.92)
V12 (r1 − r2 , r1 − r2 ) = δ((r1 − r2 ) − (r1 − r2 ))V (r1 − r2 ), (7.93)
but we won’t assume this in the following. Then the Green’s function defined by
ψ(1, 2) = G(1, 2; 1 , 2 )K (1 )K (2), (7.94)
satisfies
∂ ∂
i − T1 i − T2 − iV G = δ(1 − 1 )δ(2 − 2 ) sym , (7.95)
∂t1 ∂t2
where the subscript denotes symmetrization. Now because of Eq. (7.89), the integral
equation for the Green’s function is (symmetrization suppressed)
G = G 01 G 02 + iG 0 G 0 V G. (7.96)
108 7 Nonrelativistic Source Theory
where in the last three-particle and higher interactions have been omitted. Here we
include in the second term only the effects of interaction, through
The integral equation (7.96) written in terms of the last interaction can also be
written in term of the first interaction,
G = G 0 G 0 + GiV G 0 G 0 . (7.103)
1 1
G= G0G0 = G0G0 , (7.104)
1 − G G iV
0 0 1 − iV G 0 G 0
7.1 Interactions 109
which are formally identical. This assumes the instantaneous interaction given by
Eq. (7.92). Let us write
where
G 0 (t − t ) → −iδ(r − r ) as t → t + 0, (7.109)
Sources must be able to create composite structures. We could have started with
composite particles, after all, so this is a aspect of self-consistency. Write the integral
equation for G symmetrically,
G = G 0 G 0 + G 0 G 0 iV [G 0 G 0 + GiV G 0 G 0 ] = G 0 G 0 + G 0 G 0 iV G 0 G 0
+ G 0 G 0 iV GiV G 0 G 0 . (7.110)
We are still working with the instantaneous interaction approximation. So G has the
property of depending on two times. Write again
The description of the bound states is contained in G 1+2 . The part of the vacuum
amplitude involving G 1+2 is
110 7 Nonrelativistic Source Theory
i
K ∗ K ∗ G 0 G 0 V12 G 1+2 V12 G 0 G 0 K K
0+ |0− = e− 2 . (7.112)
1
R= (r1 + r2 ), r = r1 − r2 . (7.113)
2
The Green’s function equation (7.107) becomes
∂ P2 p2
i − − − V (r) G(r, R, t; r , R , t ) = δ(t − t)δ(r − r )δ(R − R ),
∂t 2M 2μ
(7.114)
where we see the appearance of the total mass M, the reduced mass μ, the total
momentum P and the relative momentum p. We are interested in the motion of the
center of mass. Let the internal motion be described by an eigenfunction φk governed
by a Schrödinger equation,
p2
Ek − − V φk (r) = 0, (7.115)
2μ
Now from Eq. (7.112) we identify the effective source for an atomic state
1 r
K n (R, t) = √ (dr)φ ∗ (r)V12 (r)G 0 (R + − r1 , t − t1 )
2 2
r
× G 0 (R − − r2 , t − t2 )K (r1 , t1 )K (r2 , t2 )d1 d2. (7.118)
2
We must be very explicit that E n < 0, meaning that the sources are extended: They
must put out less energy than two free particles would have. So G 0 does not refer to
the propagation of a free particle, but rather it propagates an excitation which does
7.2 Bound States 111
not get very far. As a result, the Green’s functions become real in effect, by virtue of
the integration, and
1
K n∗ (R, t) = √ K ∗ K ∗ G 0 G 0 V12 φn (7.119)
2
is actually the complex conjugate of Eq. (7.118). The consistency of the vacuum
persistence amplitude
K n∗ (R,t)G n (R−R ,t−t )K n (R ,t )
0+ |0− = e−i n (7.120)
with
∂ 1
W = ψ i − T ψ − K ∗ ψ − ψ ∗ K − ψ ∗ ψ ∗ V ψψ
∗
∂t 2
1 1
− ψ ∗ ψ ∗ V χ − χ ∗ V ψψ
2 2
i ∗ ∂ ∂
− χ i −T i − T − iV χ . (7.123)
2 ∂t ∂t
Here we have introduce the 2-particle field χ . What does this imply under field
variations? Varying with respect to ψ ∗ gives
∂
i − T ψ = K + ψ ∗ V (ψψ + χ ), (7.124)
∂t
This says that χ is the part of the two-particle field that has interacted at least once,
thus
χ = GiV ψψ. (7.126)
1 One of Schwinger’s last publications (Schwinger 1993) described the centrality of Green’s func-
tions to his life work.
© The Author(s) 2015 113
K.A. Milton, Schwinger’s Quantum Action Principle,
SpringerBriefs in Physics, DOI 10.1007/978-3-319-20128-3_8
114 8 Concluding Remarks
Acknowledgments I thank the Laboratoire Kastler Brossel, ENS, UPMC, CNRS, for its hospitality
during the completion of the first part of manuscript. I especially thank Astrid Lambrecht and Serge
Reynaud. The work was supported in part with funding from the Simons Foundation, CNRS,
and the Julian Schwinger Foundation. I thank my many students at the University of Oklahoma,
where much of the material reported here was used as the basis of lectures in electrodynamics,
quantum mechanics, and quantum field theory. I especially thank Herb Fried for permission to use
his transcription of Schwinger’s 1956 lectures as the basis for Chap. 6 here, and Walter Becker for
his LATEX conversion of those notes.
References
P.M. Bakshi, K.T. Mahanthappa, Expectation value formalism in quantum field theory. J. Math.
Phys. 4, 1 (1963)
P.A.M. Dirac, The physical interpretation of the quantum dynamics. Proc. Roy. Soc. Lond. A
113(765), 621–641 (1927)
P.A.M. Dirac, V.A. Fock, B. Podolsky, On quantum electrodynamics. Phys. Zeits. Sowjetunion 2,
468 (1932)
P.A.M. Dirac, The Lagrangian in quantum mechanics. Phys. Zeits. Sowjetunion 3, 64 (1933)
R.J. Duffin, On The characteristic matrices of covariant systems. Phys. Rev. 54, 1114 (1938). doi:10.
1103/PhysRev.54.1114
L. Euler, Methodus Inveniendi Lineas Curvas Maximi Minive Proprietate Gaudentes (Bousquet,
Lausanne and Geneva, 1744)
L. Euler, Investigation of the letter, allegedly written by Leibniz, translated by Wikisource (1752),
http://en.wikisource.org/wiki/Investigation_of_the_letter_of_Leibniz
R.P. Feynman, The principles of least action in quantum mechanics. Ph.D dissertation, Princeton
University, Princeton, NJ. (University Microfilms, Ann Arbor, Publications No. 2948) (1942)
R.P. Feynman, Space-time approach to quantum electrodynamics. Phys. Rev. 76, 769 (1949)
R.P. Feynman, A.R. Hibbs, Quantum Mechanics and Path Integrals (McGraw-Hill, New York,
1965)
R.J. Glauber, Coherent and incoherent states of radiation field. Phys. Rev. 131, 2766–2788 (1963)
W.R. Hamilton, On a general method in dynamics. Part I. Phil. Trans. Roy. Soc. 124, 247–308
(1834)
W.R. Hamilton, On a general method in dynamics. Part II. Phil. Trans. Roy. Soc. 125, 95–144 (1835)
W. Heisenberg, W. Pauli, Zur Quantendynamik der Wellenfelder. Z. Phys. 56, 1–61 (1929)
L.V. Keldysh, Diagram technique for nonequilibrium processes. Zh. Eksp. Teor. Fiz. 47, 1515–1527
(1964) (English translation: Soviet Physics JETP 20: 1018–1026 (1965).)
N. Kemmer, The particle aspect of meson theory. Proc. Roy. Soc. A 173, 91–116 (1939). doi:10.
1098/rspa.1939.0131
J.-L. Lagrange, Méchanique Analytique. p. 226 (1788)
K.T. Mahanthappa, Multiple production of photons in quantum electrodynamics. Phys. Rev. 126,
329 (1962)
P.C. Martin, J. Schwinger, Theory of many-particle systems. Phys. Rev. 115, 1342 (1959)
P.L.M. de Maupertuis, Accord de différentes lois de la nature qui avaient jusqu’ici paru incompati-
bles”. Mém. As. Sc. Paris p. 417 (1744)