Fitzpatrick
Fitzpatrick
Richard Fitzpatrick
Associate Professor of Physics
The University of Texas at Austin
Contents
1 Introduction 5
1.1 Major sources . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2 Fundamental concepts 6
2.1 The breakdown of classical physics . . . . . . . . . . . . . . . . . . 6
2.2 The polarization of photons . . . . . . . . . . . . . . . . . . . . . . 7
2.3 The fundamental principles of quantum mechanics . . . . . . . . . 9
2.4 Ket space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.5 Bra space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.6 Operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
2.7 The outer product . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.8 Eigenvalues and eigenvectors . . . . . . . . . . . . . . . . . . . . . 20
2.9 Observables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.10 Measurements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
2.11 Expectation values . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
2.12 Degeneracy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
2.13 Compatible observables . . . . . . . . . . . . . . . . . . . . . . . . 27
2.14 The uncertainty relation . . . . . . . . . . . . . . . . . . . . . . . . 28
2.15 Continuous spectra . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
4 Quantum dynamics 55
4.1 Schrödinger’s equations of motion . . . . . . . . . . . . . . . . . . 55
4.2 Heisenberg’s equations of motion . . . . . . . . . . . . . . . . . . . 59
2
4.3 Ehrenfest’s theorem . . . . . . . . . . . . . . . . . . . . . . . . . . 61
4.4 Schrödinger’s wave-equation . . . . . . . . . . . . . . . . . . . . . 65
5 Angular momentum 71
5.1 Orbital angular momentum . . . . . . . . . . . . . . . . . . . . . . 71
5.2 Eigenvalues of angular momentum . . . . . . . . . . . . . . . . . . 74
5.3 Rotation operators . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
5.4 Eigenfunctions of orbital angular momentum . . . . . . . . . . . . 81
5.5 Motion in a central field . . . . . . . . . . . . . . . . . . . . . . . . 84
5.6 Energy levels of the hydrogen atom . . . . . . . . . . . . . . . . . . 86
5.7 Spin angular momentum . . . . . . . . . . . . . . . . . . . . . . . 89
5.8 Wave-function of a spin one-half particle . . . . . . . . . . . . . . . 91
5.9 Rotation operators in spin space . . . . . . . . . . . . . . . . . . . 93
5.10 Magnetic moments . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
5.11 Spin precession . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
5.12 Pauli two-component formalism . . . . . . . . . . . . . . . . . . . . 99
5.13 Spin greater than one-half systems . . . . . . . . . . . . . . . . . . 105
5.14 Addition of angular momentum . . . . . . . . . . . . . . . . . . . . 110
3
6.16 The electric dipole approximation . . . . . . . . . . . . . . . . . . . 162
6.17 Energy-shifts and decay-widths . . . . . . . . . . . . . . . . . . . . 165
4
1 INTRODUCTION
1 Introduction
The textbooks which I have consulted most frequently while developing course
material are:
The principles of quantum mechanics, P.A.M. Dirac, 4th Edition (revised), (Ox-
ford University Press, Oxford, UK, 1958).
The Feynman lectures on physics, R.P. Feynman, R.B. Leighton, and M. Sands,
Volume III (Addison-Wesley, Reading MA, 1965).
Quantum mechanics, E. Merzbacher, 2nd Edition (John Wiley & Sons, New York
NY, 1970).
Modern quantum mechanics, J.J. Sakurai, (Benjamin/Cummings, Menlo Park
CA, 1985).
5
2 FUNDAMENTAL CONCEPTS
2 Fundamental concepts
6
2.2 The polarization of photons 2 FUNDAMENTAL CONCEPTS
It is known experimentally that when plane polarized light is used to eject photo-
electrons there is a preferred direction of emission of the electrons. Clearly, the
polarization properties of light, which are more usually associated with its wave-
like behaviour, also extend to its particle-like behaviour. In particular, a polariza-
tion can be ascribed to each individual photon in a beam of light.
7
2.2 The polarization of photons 2 FUNDAMENTAL CONCEPTS
cos2 α are absorbed. Thus, we conclude that a photon has a probability sin 2 α of
being transmitted as a photon polarized in the plane perpendicular to the optic
axis, and a probability cos2 α of being absorbed. These values for the probabilities
lead to the correct classical limit for a beam containing a large number of photons.
Note that we have only been able to preserve the individuality of photons,
in all cases, by abandoning the determinacy of classical theory, and adopting a
fundamentally probabilistic approach. We have no way of knowing whether an
individual obliquely polarized photon is going to be absorbed by or transmitted
through a polaroid film. We only know the probability of each event occurring.
This is a fairly sweeping statement, but recall that the state of a photon is fully
specified once its energy, direction of propagation, and polarization are known.
If we imagine performing experiments using monochromatic light, normally in-
cident on a polaroid film, with a particular oblique polarization, then the state of
each individual photon in the beam is completely specified, and there is nothing
left over to uniquely determine whether the photon is transmitted or absorbed by
the film.
The above discussion about the results of an experiment with a single obliquely
polarized photon incident on a polaroid film answers all that can be legitimately
asked about what happens to the photon when it reaches the film. Questions as
to what decides whether the photon is transmitted or not, or how it changes its
direction of polarization, are illegitimate, since they do not relate to the outcome
of a possible experiment. Nevertheless, some further description is needed in
order to allow the results of this experiment to be correlated with the results of
other experiments which can be performed using photons.
8
2.3 The fundamental principles of quantum mechanics 2 FUNDAMENTAL CONCEPTS
There is nothing special about the transmission and absorption of photons through
a polaroid film. Exactly the same conclusions as those outlined above are ob-
tained by studying other simple experiments, such as the interference of photons
(see Dirac, Sect. I.3), and the Stern-Gerlach experiment (see Sakurai, Cha. 1;
Feynman, Cha. 5). The study of these simple experiments leads us to formulate
the following fundamental principles of quantum mechanics:
1. Dirac’s razor: Quantum mechanics can only answer questions regarding the
outcome of possible experiments. Any other questions lie beyond the realms
of physics.
2. The principle of superposition of states: Any microscopic system (i.e., an atom,
molecule, or particle) in a given state can be regarded as being partly in
each of two or more other states. In other words, any state can be regarded
as a superposition of two or more other states. Such superpositions can be
performed in an infinite number of different ways.
3. The principle of indeterminacy: An observation made on a microscopic system
causes it to jump into one or more particular states (which are related to
9
2.4 Ket space 2 FUNDAMENTAL CONCEPTS
The first of these principles was formulated by quantum physicists (such as Dirac)
in the 1920s to fend off awkward questions such as “How can a system suddenly
jump from one state into another?”, or “How does a system decide which state to
jump into?”. As we shall see, the second principle is the basis for the mathemat-
ical formulation of quantum mechanics. The final principle is still rather vague.
We need to extend it so that we can predict which possible states a system can
jump into after a particular type of observation, as well as the probability of the
system making a particular jump.
|Ai. (2.1)
Suppose that state A is, in fact, the superposition of two different states, B and
10
2.4 Ket space 2 FUNDAMENTAL CONCEPTS
11
2.4 Ket space 2 FUNDAMENTAL CONCEPTS
that it comes in irreducible packets called photons, electrons, atoms, etc. If we ob-
serve a microscopic system then we either see a state (i.e., a photon, or an atom,
or a molecule, etc.) or we see nothing—we can never see a fraction or a multiple
of a state. In classical physics, if we observe a wave then the amplitude of the
wave can take any value between zero and infinity. Thus, if we were to represent
a classical wave by a vector, then the magnitude, or length, of the vector would
correspond to the amplitude of the wave, and the direction would correspond to
the frequency and wave-length, so that two vectors of different lengths pointing
in the same direction would represent different wave states.
We have seen, in Eq. (2.3), that any plane polarized state of a photon can
be represented as a linear superposition of two orthogonal polarization states
in which the weights are real numbers. Suppose that we want to construct a
circularly polarized photon state. Well, we know from classical physics that a cir-
cularly polarized wave is a superposition of two waves of equal amplitude, plane
polarized in orthogonal directions, which are in phase quadrature. This suggests
that a circularly polarized photon is the superposition of a photon polarized in
the x-direction (state B) and a photon polarized in the y-direction (state C), with
equal weights given to the two states, but with the proviso that state C is 90 ◦
out of phase with state B. By analogy with classical physics, we can use complex
numbers to simultaneously represent the weighting and relative phase in a linear
superposition. Thus, a circularly polarized photon is represented by
where c1 and c2 are complex numbers. We conclude that a ket space must be
a complex vector space if it is to properly represent the mutual interrelations
between the possible states of a microscopic system.
Suppose that the ket |Ri is expressible linearly in terms of the kets |Ai and |Bi,
so that
|Ri = c1 |Ai + c2 |Bi. (2.8)
12
2.4 Ket space 2 FUNDAMENTAL CONCEPTS
We say that |Ri is dependent on |Ai and |Bi. It follows that the state R can be
regarded as a linear superposition of the states A and B. So, we can also say that
state R is dependent on states A and B. In fact, any ket vector (or state) which
is expressible linearly in terms of certain others is said to be dependent on them.
Likewise, a set of ket vectors (or states) are termed independent if none of them
are expressible linearly in terms of the others.
13
2.5 Bra space 2 FUNDAMENTAL CONCEPTS
A snack machine inputs coins plus some code entered on a key pad, and (hope-
fully) outputs a snack. It also does so in a deterministic manner: i.e., the same
money plus the same code produces the same snack (or the same error message)
time after time. Note that the input and output of the machine have completely
different natures. We can imagine building a rather abstract snack machine which
inputs ket vectors and outputs complex numbers in a deterministic fashion. Math-
ematicians call such a machine a functional. Imagine a general functional, labeled
F, acting on a general ket vector, labeled A, and spitting out a general complex
number φA . This process is represented mathematically by writing
hF|(|Ai) = φA . (2.9)
Let us narrow our focus to those functionals which preserve the linear dependen-
cies of the ket vectors upon which they operate. Not surprisingly, such functionals
are termed linear functionals. A general linear functional, labeled F, satisfies
where |Ai and |Bi are any two kets in a given ket space.
14
2.5 Bra space 2 FUNDAMENTAL CONCEPTS
where the α∗i are the complex conjugates of the αi . hA| is termed the dual vector
to |Ai. It follows, from the above, that the dual to chA| is c∗ |Ai, where c is a
complex number. More generally,
DC
c1 |Ai + c2 |Bi ←→ c∗1 hA| + c∗2 hB|. (2.17)
15
2.5 Bra space 2 FUNDAMENTAL CONCEPTS
Recall that a bra vector is a functional which acts on a general ket vector, and
spits out a complex number. Consider the functional which is dual to the ket
vector
XN
|Bi = βi |ii (2.18)
i=1
acting on the ket vector |Ai. This operation is denoted hB|(|Ai). Note, however,
that we can omit the round brackets without causing any ambiguity, so the oper-
ation can also be written hB||Ai. This expression can be further simplified to give
hB|Ai. According to Eqs. (2.11), (2.12), (2.16), and (2.18),
N
X
hB|Ai = β∗i αi . (2.19)
i=1
Mathematicians term hB|Ai the inner product of a bra and a ket. 2 An inner prod-
uct is (almost) analogous to a scalar product between a covariant and contravari-
ant vector in some curvilinear space. It is easily demonstrated that
hB|Ai = hA|Bi∗ . (2.20)
Consider the special case where |Bi → |Ai. It follows from Eqs. (2.12) and (2.20)
that hA|Ai is a real number, and that
hA|Ai ≥ 0. (2.21)
The equality sign only holds if |Ai is the null ket [i.e., if all of the α i are zero in
Eq. (2.11)]. This property of bra and ket vectors is essential for the probabilistic
interpretation of quantum mechanics, as will become apparent later.
Given a ket |Ai which is not the null ket, we can define a normalized ket | Ãi,
where
1
|Ãi = q |Ai, (2.23)
hA|Ai
2
We can now appreciate the elegance of Dirac’s notation. The combination of a bra and a ket yields a “bra(c)ket”
(which is just a number).
16
2.6 Operators 2 FUNDAMENTAL CONCEPTS
2.6 Operators
We have seen that a functional is a machine which inputs a ket vector and spits
out a complex number. Consider a somewhat different machine which inputs a
ket vector and spits out another ket vector in a deterministic fashion. Mathemati-
cians call such a machine an operator. We are only interested in operators which
preserve the linear dependencies of the ket vectors upon which they act. Such
operators are termed linear operators. Consider an operator labeled X. Suppose
that when this operator acts on a general ket vector |Ai it spits out a new ket
vector which is denoted X|Ai. Operator X is linear provided that
17
2.6 Operators 2 FUNDAMENTAL CONCEPTS
for all kets in the ket space in question. Operator X is termed the null operator if
for all ket vectors in the space. Operators can be added together. Such addition
is defined to obey a commutative and associate algebra:
X + Y = Y + X, (2.29)
X + (Y + Z) = (X + Y) + Z. (2.30)
X Y 6= Y X. (2.33)
So far, we have only considered linear operators acting on ket vectors. We can
also give a meaning to their operating on bra vectors. Consider the inner product
of a general bra hB| with the ket X|Ai. This product is a number which depends
linearly on |Ai. Thus, it may be considered to be the inner product of |Ai with
some bra. This bra depends linearly on hB|, so we may look on it as the result of
some linear operator applied to hB|. This operator is uniquely determined by the
original operator X, so we might as well call it the same operator acting on |Bi. A
suitable notation to use for the resulting bra when X operates on hB| is hB|X. The
equation which defines this vector is
for any |Ai and hB|. The triple product of hB|, X, and |Ai can be written hB|X|Ai
without ambiguity, provided we adopt the convention that the bra vector always
goes on the left, the operator in the middle, and the ket vector on the right.
Consider the dual bra to X|Ai. This bra depends antilinearly on |Ai and must
therefore depend linearly on hA|. Thus, it may be regarded as the result of some
18
2.7 The outer product 2 FUNDAMENTAL CONCEPTS
linear operator applied to hA|. This operator is termed the adjoint of X, and is
denoted X† . Thus,
DC
X|Ai ←→ hA|X† . (2.35)
It is readily demonstrated that
hB|X† |Ai = hA|X|Bi∗ , (2.36)
plus
(X Y)† = Y † X† . (2.37)
It is also easily seen that the adjoint of the adjoint of a linear operator is equiva-
lent to the original operator. A Hermitian operator ξ has the special property that
it is its own adjoint: i.e.,
ξ = ξ† . (2.38)
So far we have formed the following products: hB|Ai, X|Ai, hA|X, X Y, hB|X|Ai.
Are there any other products we are allowed to form? How about
|BihA| ? (2.39)
This clearly depends linearly on the ket |Ai and the bra |Bi. Suppose that we
right-multiply the above product by the general ket |Ci. We obtain
|BihA|Ci = hA|Ci|Bi, (2.40)
since hA|Ci is just a number. Thus, |BihA| acting on a general ket |Ci yields
another ket. Clearly, the product |BihA| is a linear operator. This operator also
acts on bras, as is easily demonstrated by left-multiplying the expression (2.39)
by a general bra hC|. It is also easily demonstrated that
(|BihA|)† = |AihB|. (2.41)
Mathematicians term the operator |BihA| the outer product of |Bi and hA|. The
outer product should not be confused with the inner product, hA|Bi, which is just
a number.
19
2.8 Eigenvalues and eigenvectors 2 FUNDAMENTAL CONCEPTS
In general, the ket X|Ai is not a constant multiple of |Ai. However, there are
some special kets known as the eigenkets of operator X. These are denoted
|x 0 i, |x 00 i, |x 000 i . . . , (2.42)
and have the property
X|x 0 i = x 0 |x 0 i, X|x 00 i = x 00 |x 00 i . . . , (2.43)
where x 0 , x 00 , . . . are numbers called eigenvalues. Clearly, applying X to one of its
eigenkets yields the same eigenket multiplied by the associated eigenvalue.
(i) The eigenvalues are all real numbers, and the eigenkets corresponding to
different eigenvalues are orthogonal. Since ξ is Hermitian, the dual equation to
Eq. (2.44) (for the eigenvalue ξ 00 ) reads
hξ 00 |ξ = ξ 00∗ hξ 00 |. (2.45)
If we left-multiply Eq. (2.44) by hξ 00 |, right-multiply the above equation by |ξ 0 i,
and take the difference, we obtain
(ξ 0 − ξ 00∗ )hξ 00 |ξ 0 i = 0. (2.46)
Suppose that the eigenvalues ξ 0 and ξ 00 are the same. It follows from the above
that
ξ 0 = ξ 0∗ , (2.47)
where we have used the fact that |ξ 0 i is not the null ket. This proves that the
eigenvalues are real numbers. Suppose that the eigenvalues ξ 0 and ξ 00 are differ-
ent. It follows that
hξ 00 |ξ 0 i = 0, (2.48)
20
2.9 Observables 2 FUNDAMENTAL CONCEPTS
(ii) The eigenvalues associated with eigenkets are the same as the eigenvalues
associated with eigenbras. An eigenbra of ξ corresponding to an eigenvalue ξ 0 is
defined
hξ 0 |ξ = hξ 0 |ξ 0 . (2.49)
(iii) The dual of any eigenket is an eigenbra belonging to the same eigenvalue,
and conversely.
2.9 Observables
21
2.9 Observables 2 FUNDAMENTAL CONCEPTS
There is nothing special about the polarization states of a photon. So, more
generally, we can say that when a dynamical variable of a microscopic system
is measured the system is caused to jump into one of a number of independent
states (note that the perpendicular and parallel polarization states of our photon
are linearly independent). In general, each of these final states is associated with
a different result of the measurement: i.e., a different value of the dynamical
variable. Note that the result of the measurement must be a real number (there
are no measurement machines which output complex numbers). Finally, if an
observation is made, and the system is found to be a one particular final state,
with one particular value for the dynamical variable, then a second observation,
made immediately after the first one, will definitely find the system in the same
state, and yield the same value for the dynamical variable.
How can we represent all of these facts in our mathematical formalism? Well,
by a fairly non-obvious leap of intuition, we are going to assert that a measure-
ment of a dynamical variable corresponding to an operator X in ket space causes
the system to jump into a state corresponding to one of the eigenkets of X. Not
surprisingly, such a state is termed an eigenstate. Furthermore, the result of the
measurement is the eigenvalue associated with the eigenket into which the system
jumps. The fact that the result of the measurement must be a real number implies
that dynamical variables can only be represented by Hermitian operators (since only
Hermitian operators are guaranteed to have real eigenvalues). The fact that the
eigenkets of a Hermitian operator corresponding to different eigenvalues (i.e., dif-
ferent results of the measurement) are orthogonal is in accordance with our ear-
lier requirement that the states into which the system jumps should be mutually
22
2.9 Observables 2 FUNDAMENTAL CONCEPTS
23
2.10 Measurements 2 FUNDAMENTAL CONCEPTS
2.10 Measurements
24
2.11 Expectation values 2 FUNDAMENTAL CONCEPTS
X
hA| = hA|ξ 0 ihξ 0 |, (2.52)
ξ0
X X
hA|Ai = hA|ξ 0 ihξ 0 |Ai = |hA|ξ 0 i|2 , (2.53)
ξ0 ξ0
where the summation is over all the different eigenvalues of ξ, and use has been
made of Eq. (2.20), and the fact that the eigenstates are mutually orthogonal.
Note that all of the above results follow from the extremely useful (and easily
proved) result X
|ξ 0 ihξ 0 | = 1, (2.54)
ξ0
where 1 denotes the identity operator. The relative probability of a transition to
an eigenstate |ξ 0 i, which is equivalent to the relative probability of a measure-
ment of ξ yielding the result ξ 0 , is
P(ξ 0 ) ∝ |hA|ξ 0 i|2 . (2.55)
The absolute probability is clearly
25
2.12 Degeneracy 2 FUNDAMENTAL CONCEPTS
X X
0 0 0
= ξ hA|ξ ihξ |Ai = hA|ξ|ξ 0 ihξ 0 |Ai, (2.58)
ξ0 ξ0
which reduces to
hξi = hA|ξ|Ai (2.59)
with the aid of Eq. (2.54).
Consider the identity operator, 1. All states are eigenstates of this operator
with the eigenvalue unity. Thus, the expectation value of this operator is always
unity: i.e.,
hA|1|Ai = hA|Ai = 1, (2.60)
for all |Ai. Note that it is only possible to normalize a given ket |Ai such that
Eq. (2.60) is satisfied because of the more general property (2.21) of the norm.
This property depends on the particular correspondence (2.16), that we adopted
earlier, between the elements of a ket space and those of its dual bra space.
2.12 Degeneracy
Suppose that two different eigenstates |ξa0 i and |ξb0 i of ξ correspond to the same
eigenvalue ξ 0 . These are termed degenerate eigenstates. Degenerate eigenstates
are necessarily orthogonal to any eigenstates corresponding to different eigen-
values, but, in general, they are not orthogonal to each other (i.e., the proof of
orthogonality given in Sect. 2.8 does not work in this case). This is unfortunate,
since much of the previous formalism depends crucially on the mutual orthogo-
nality of the different eigenstates of an observable. Note, however, that any linear
combination of |ξa0 i and |ξb0 i is also an eigenstate corresponding to the eigenvalue
ξ 0 . It follows that we can always construct two mutually orthogonal degenerate
eigenstates. For instance,
26
2.13 Compatible observables 2 FUNDAMENTAL CONCEPTS
This result is easily generalized to the case of more than two degenerate eigen-
states. We conclude that it is always possible to construct a complete set of mu-
tually orthogonal eigenstates for any given observable.
27
2.14 The uncertainty relation 2 FUNDAMENTAL CONCEPTS
We have seen that the condition for two observables ξ and η to be simultane-
ously measurable is that they should possess simultaneous eigenstates (i.e., every
eigenstate of ξ should also be an eigenstate of η). Suppose that this is the case.
Let a general eigenstate of ξ, with eigenvalue ξ 0 , also be an eigenstate of η, with
eigenvalue η 0 . It is convenient to denote this simultaneous eigenstate |ξ 0 η 0 i. We
have
ξ|ξ 0 η 0 i = ξ 0 |ξ 0 η 0 i, (2.63)
η|ξ 0 η 0 i = η 0 |ξ 0 η 0 i. (2.64)
We can left-multiply the first equation by η, and the second equation by ξ, and
then take the difference. The result is
where |Ai is a general ket. The only way that this can be true is if
ξ η = η ξ. (2.67)
We have seen that if ξ and η are two noncommuting observables, then a deter-
mination of the value of ξ leaves the value of η uncertain, and vice versa. It is
28
2.14 The uncertainty relation 2 FUNDAMENTAL CONCEPTS
The variance of ξ is a measure of the uncertainty in the value of ξ for the particu-
lar state in question (i.e., it is a measure of the width of the distribution of likely
values of ξ about the expectation value). If the variance is zero then there is no
uncertainty, and a measurement of ξ is bound to give the expectation value, hξi.
which is analogous to
|a|2 | b|2 ≥ |a · b|2 (2.71)
in Euclidian space. This inequality can be proved by noting that
where c is any complex number. If c takes the special value −hB|Ai/hB|Bi then
the above inequality reduces to
Let us substitute
29
2.14 The uncertainty relation 2 FUNDAMENTAL CONCEPTS
into the Schwarz inequality, where the blank ket | i stands for any general ket.
We find
h(∆ξ)2 ih(∆η)2 i ≥ |h∆ξ ∆ηi|2 , (2.76)
where use has been made of the fact that ∆ξ and ∆η are Hermitian operators.
Note that
1 1
∆ξ ∆η = [∆ξ, ∆η] + {∆ξ, ∆η} , (2.77)
2 2
where the commutator, [∆ξ, ∆η], and the anti-commutator, {∆ξ, ∆η}, are defined
[∆ξ, ∆η] ≡ ∆ξ ∆η − ∆η ∆ξ, (2.78)
{∆ξ, ∆η} ≡ ∆ξ ∆η + ∆η ∆ξ. (2.79)
The commutator is clearly anti-Hermitian,
([∆ξ, ∆η])† = (∆ξ ∆η − ∆η ∆ξ)† = ∆η ∆ξ − ∆ξ ∆η = − [∆ξ, ∆η] , (2.80)
whereas the anti-commutator is obviously Hermitian. Now, it is easily demon-
strated that the expectation value of a Hermitian operator is a real number,
whereas the expectation value of an anti-Hermitian operator is a pure imaginary
number. It is clear that the right hand side of
1 1
h∆ξ ∆ηi = h[∆ξ, ∆η]i + h{∆ξ, ∆η}i, (2.81)
2 2
consists of the sum of a purely real and a purely imaginary number. Taking the
modulus squared of both sides gives
1 1
|h∆ξ ∆ηi|2 = |h[ξ, η]i|2 + |h{∆ξ, ∆η}i|2 , (2.82)
4 4
where use has been made of h∆ξi = 0, etc. The final term in the above expression
is positive definite, so we can write
1
h(∆ξ)2 ih(∆η)2 i ≥ |h[ξ, η]i|2 , (2.83)
4
where use has been made of Eq. (2.76). The above expression is termed the
uncertainty relation. According to this relation, an exact knowledge of the value
of ξ implies no knowledge whatsoever of the value of η, and vice versa. The one
exception to this rule is when ξ and η commute, in which case exact knowledge
of ξ does not necessarily imply no knowledge of η.
30
2.15 Continuous spectra 2 FUNDAMENTAL CONCEPTS
ξ|ξ 0 i = ξ 0 |ξ 0 i. (2.84)
But, ξ 0 can now take a continuous range of values. Let us assume, for the sake of
simplicity, that ξ 0 can take any value. The orthogonality condition (2.50) gener-
alizes to
hξ 0 |ξ 00 i = δ(ξ 0 − ξ 00 ), (2.85)
where δ(x) denotes the famous Dirac delta-function. Note that there are clearly a
nondenumerably infinite number of mutually orthogonal eigenstates of ξ. Hence,
the dimensionality of ket space is nondenumerably infinite. Note, also, that eigen-
states corresponding to a continuous range of eigenvalues cannot be normalized
so that they have unit norms. In fact, these eigenstates have infinite norms: i.e.,
they are infinitely long. This is the major difference between eigenstates in a
finite-dimensional and an infinite-dimensional ket space. The extremely useful
relation (2.54) generalizes to
Z
dξ 0 |ξ 0 ihξ 0 | = 1. (2.86)
Note that a summation over discrete eigenvalues goes over into an integral over
a continuous range of eigenvalues. The eigenstates |ξ 0 i must form a complete set
if ξ is to be an observable. It follows that any general ket can be expanded in
31
2.15 Continuous spectra 2 FUNDAMENTAL CONCEPTS
These results also follow simply from Eq. (2.86). We have seen that it is not possi-
ble to normalize the eigenstates |ξ 0 i such that they have unit norms. Fortunately,
this convenient normalization is still possible for a general state vector. In fact,
according to Eq. (2.89), the normalization condition can be written
Z
hA|Ai = dξ 0 |hA|ξ 0 i|2 = 1. (2.90)
We have now studied observables whose eigenvalues can take a discrete num-
ber of values as well as those whose eigenvalues can take any value. There are
number of other cases we could look at. For instance, observables whose eigen-
values can only take a finite range of values, or observables whose eigenvalues
take on a finite range of values plus a set of discrete values. Both of these cases
can be dealt with using a fairly straight-forward generalization of the previous
analysis (see Dirac, Cha. II and III).
32
3 POSITION AND MOMENTUM
3.1 Introduction
33
3.2 Poisson brackets 3 POSITION AND MOMENTUM
where u and v are regarded as functions of the coordinates and momenta qi and
pi . It is easily demonstrated that
[qi , qj ] = 0, (3.4)
[pi , pj ] = 0, (3.5)
[qi , pj ] = δij . (3.6)
34
3.2 Poisson brackets 3 POSITION AND MOMENTUM
[u, c] = 0, (3.9)
[u1 + u2 , v] = [u1 , v] + [u2 , v], (3.10)
[u, v1 + v2 ] = [u, v1 ] + [u, v2 ] (3.11)
[u1 u2 , v] = [u1 , v]u2 + u1 [u2 , v], (3.12)
[u, v1 v2 ] = [u, v1 ]v2 + v1 [u, v2 ], (3.13)
and
[u, [v, w]] + [v, [w, u]] + [w, [u, v]] = 0. (3.14)
The last relation is known as the Jacobi identity. In the above, u, v, w, etc.,
represent dynamical variables, and c represents a number. Can we find some
combination of noncommuting operators u and v, etc., which satisfies all of the
above relations?
Well, we can evaluate the Poisson bracket [u1 u2 , v1 v2 ] in two different ways,
since we can use either of the formulae (3.12) or (3.13) first. Thus,
[u1 u2 , v1 v2 ] = [u1 , v1 v2 ]u2 + u1 [u2 , v1 v2 ] (3.15)
= {[u1 , v1 ]v2 + v1 [u1 , v2 ]} u2 + u1 {[u2 , v1 ]v2 + v1 [u2 , v2 ]}
= [u1 , v1 ]v2 u2 + v1 [u1 , v2 ]u2 + u1 [u2 , v1 ]v2 + u1 v1 [u2 , v2 ],
and
[u1 u2 , v1 v2 ] = [u1 u2 , v1 ]v2 + v1 [u1 u2 , v2 ] (3.16)
= [u1 , v1 ]u2 v2 + u1 [u2 , v1 ]v2 + v1 [u1 , v2 ]u2 + v1 u1 [u2 , v2 ].
Note that the order of the various factors has been preserved, since they now
represent noncommuting operators. Equating the above two results yields
[u1 , v1 ](u2 v2 − v2 u2 ) = (u1 v1 − v1 u1 )[u2 , v2 ]. (3.17)
Since this relation must hold for u1 and v1 quite independent of u2 and v2 , it
follows that
u1 v1 − v1 u1 = i h̄ [u1 , v1 ], (3.18)
u2 v2 − v2 u2 = i h̄ [u2 , v2 ], (3.19)
35
3.2 Poisson brackets 3 POSITION AND MOMENTUM
where h̄ does not depend on u1 , v1 , u2 , v2 , and also commutes with (u1 v1 −v1 u1 ).
Since u1 , etc., are quite general operators, it follows that h̄ is just a number. We
want the quantum mechanical Poisson bracket of two Hermitian operators to
be an Hermitian operator itself, since the classical Poisson bracket of two real
dynamical variables is real. This requirement is satisfied if h̄ is a real number.
Thus, the quantum mechanical Poisson bracket of two dynamical variables u and
v is given by
uv − vu
[u, v] = , (3.20)
i h̄
where h̄ is a new universal constant of nature. Quantum mechanics agrees with
experiments provided that h̄ takes the value h/2π, where
h = 6.6261 × 10−34 J s (3.21)
is Planck’s constant. Somewhat confusingly, the notation [u, v] is convention-
ally reserved for the commutator u v − v u in quantum mechanics. We will use
[u, v]quantum to denote the quantum Poisson bracket. Thus,
[u, v]
[u, v]quantum = . (3.22)
i h̄
It is easily demonstrated that the quantum mechanical Poisson bracket, as defined
above, satisfies all of the relations (3.8)–(3.14).
The strong analogy we have found between the classical Poisson bracket,
defined in Eq. (3.3), and the quantum mechanical Poisson bracket, defined in
Eq. (3.22), leads us to make the assumption that the quantum mechanical bracket
has the same value as the corresponding classical bracket, at least for the simplest
cases. In other words, we are assuming that Eqs. (3.4)–(3.6) hold for quantum
mechanical as well as classical Poisson brackets. This argument yields the funda-
mental commutation relations
[qi , qj ] = 0, (3.23)
[pi , pj ] = 0, (3.24)
[qi , pj ] = i h̄ δij . (3.25)
These results provide us with the basis for calculating commutation relations be-
tween general dynamical variables. For instance, if two dynamical variables, ξ
36
3.3 Wave-functions 3 POSITION AND MOMENTUM
and η, can both be written as a power series in the qi and pi , then repeated
application of Eqs. (3.8)–(3.13) allows [ξ, η] to be expressed in terms of the fun-
damental commutation relations (3.23)–(3.25).
Equations (3.23)–(3.25) provide the foundation for the analogy between quan-
tum mechanics and classical mechanics. Note that the classical result (that every-
thing commutes) is obtained in the limit h̄ → 0. Thus, classical mechanics can be
regarded as the limiting case of quantum mechanics when h̄ goes to zero. In classi-
cal mechanics, each pair of generalized coordinate and its conjugate momentum,
qi and pi , correspond to a different classical degree of freedom of the system. It is
clear from Eqs. (3.23)–(3.25) that in quantum mechanics the dynamical variables
corresponding to different degrees of freedom all commute. It is only those variables
corresponding to the same degree of freedom which may fail to commute.
3.3 Wave-functions
Consider a simple system with one classical degree of freedom, which corre-
sponds to the Cartesian coordinate x. Suppose that x is free to take any value
(e.g., x could be the position of a free particle). The classical dynamical vari-
able x is represented in quantum mechanics as a linear Hermitian operator which
is also called x. Moreover, the operator x possesses eigenvalues x 0 lying in the
continuous range −∞ < x 0 < +∞ (since the eigenvalues correspond to all the
possible results of a measurement of x). We can span ket space using the suit-
ably normalized eigenkets of x. An eigenket corresponding to the eigenvalue x 0
is denoted |x 0 i. Moreover, [see Eq. (2.85)]
hx 0 |x 00 i = δ(x 0 − x 00 ). (3.26)
The eigenkets satisfy the extremely useful relation [see Eq. (2.86)]
Z +∞
dx 0 |x 0 ihx 0 | = 1. (3.27)
−∞
This formula expresses the fact that the eigenkets are complete, mutually orthog-
onal, and suitably normalized.
37
3.3 Wave-functions 3 POSITION AND MOMENTUM
A state ket |Ai (which represents a general state A of the system) can be
expressed as a linear superposition of the eigenkets of the position operator using
Eq. (3.27). Thus, Z +∞
|Ai = dx 0 hx 0 |Ai|x 0 i (3.28)
−∞
The quantity hx |Ai is a complex function of the position eigenvalue x 0 . We can
0
write
hx 0 |Ai = ψA (x 0 ). (3.29)
Here, ψA (x 0 ) is the famous wave-function of quantum mechanics. Note that state
A is completely specified by its wave-function ψA (x 0 ) [since the wave-function
can be used to reconstruct the state ket |Ai using Eq. (3.28)]. It is clear that
the wave-function of state A is simply the collection of the weights of the cor-
responding state ket |Ai, when it is expanded in terms of the eigenkets of the
position operator. Recall, from Sect. 2.10, that the probability of a measurement
of a dynamical variable ξ yielding the result ξ 0 when the system is in state A is
given by |hξ 0 |Ai|2 , assuming that the eigenvalues of ξ are discrete. This result is
easily generalized to dynamical variables possessing continuous eigenvalues. In
fact, the probability of a measurement of x yielding a result lying in the range
x 0 to x 0 + dx 0 when the system is in a state |Ai is |hx 0 |Ai|2 dx 0 . In other words,
the probability of a measurement of position yielding a result in the range x 0 to
x 0 + dx 0 when the wave-function of the system is ψA (x 0 ) is
This formula is only valid if the state ket |Ai is properly normalized: i.e., if
hA|Ai = 1. The corresponding normalization for the wave-function is
Z +∞
|ψA (x 0 )|2 dx 0 = 1. (3.31)
−∞
38
3.4 Schrödinger’s representation - I 3 POSITION AND MOMENTUM
where use has been made of Eqs. (3.27) and (3.29). Thus, the inner product of
two states is related to the overlap integral of their wave-functions.
giving
ψB (x 0 ) = f(x 0 ) ψA (x 0 ), (3.34)
where use has been made of Eq. (3.26). Here, f(x 0 ) is the same function of the
position eigenvalue x 0 that f(x) is of the position operator x: i.e., if f(x) = x 2 then
f(x 0 ) = x 0 2 . It follows, from the above result, that a general state ket |Ai can be
written
|Ai = ψA (x)i, (3.35)
where ψA (x) is the same function of the operator x that the wave-function ψ A (x 0 )
is of the position eigenvalue x 0 , and the ket i has the wave-function ψ(x 0 ) = 1.
The ket i is termed the standard ket. The dual of the standard ket is termed the
standard bra, and is denoted h. It is easily seen that
DC
hψ∗A (x) ←→ ψA (x)i. (3.36)
Consider the simple system described in the previous section. A general state ket
can be written ψ(x)i, where ψ(x) is a general function of the position operator x,
and ψ(x 0 ) is the associated wave-function. Consider the ket whose wave-function
39
3.4 Schrödinger’s representation - I 3 POSITION AND MOMENTUM
is dψ(x 0 )/dx 0 . This ket is denoted dψ/dxi. The new ket is clearly a linear func-
tion of the original ket, so we can think of it as the result of some linear operator
acting on ψi. Let us denote this operator d/dx. It follows that
d dψ
ψi = i. (3.37)
dx dx
Any linear operator which acts on ket vectors can also act on bra vectors.
Consider d/dx acting on a general bra hφ(x). According to Eq. (2.34), the bra
hφ d/dx satisfies
d d
! !
hφ ψi = hφ ψi . (3.38)
dx dx
Making use of Eqs. (3.27) and (3.29), we can write
Z +∞ Z +∞ 0
d 0 0 0 0 0 dψ(x )
hφ |x i dx ψ(x ) = φ(x ) dx . (3.39)
−∞ dx −∞ dx 0
The right-hand side can be transformed via integration by parts to give
Z +∞ Z +∞
d 0 0 0 dφ(x 0 ) 0
hφ |x i dx ψ(x ) = − 0
dx ψ(x 0 ), (3.40)
−∞ dx −∞ dx
assuming that the contributions from the limits of integration vanish. It follows
that
d 0 dφ(x 0 )
hφ |x i = − , (3.41)
dx dx 0
which implies
d dφ
hφ = −h . (3.42)
dx dx
The neglect of contributions from the limits of integration in Eq. (3.40) is rea-
sonable because physical wave-functions are square-integrable [see Eq. (3.31)].
Note that
d dψ DC dψ∗ d
ψi = i ←→ h = −hψ∗ , (3.43)
dx dx dx dx
where use has been made of Eq. (3.42). It follows, by comparison with Eqs. (2.35)
and (3.36), that
d † d
!
=− . (3.44)
dx dx
40
3.4 Schrödinger’s representation - I 3 POSITION AND MOMENTUM
Let us evaluate the commutation relation between the operators x and d/dx.
We have
d d(x ψ) d
x ψi = i = x ψi + ψi. (3.45)
dx dx dx
Since this holds for any ket ψi, it follows that
d d
x−x = 1. (3.46)
dx dx
Let p be the momentum conjugate to x (for the simple system under consideration
p is a straight-forward linear momentum). According to Eq. (3.25), x and p
satisfy the commutation relation
x p − p x = i h̄. (3.47)
It can be seen, by comparison with Eq. (3.46), that the Hermitian operator
−i h̄ d/dx satisfies the same commutation relation with x that p does. The most
general conclusion which may be drawn from a comparison of Eqs. (3.46) and
(3.47) is that
d
p = −i h̄ + f(x), (3.48)
dx
since (as is easily demonstrated) a general function f(x) of the position operator
automatically commutes with x.
We have chosen to normalize the eigenkets and eigenbras of the position oper-
ator so that they satisfy the normalization condition (3.26). However, this choice
of normalization does not uniquely determine the eigenkets and eigenbras. Sup-
pose that we transform to a new set of eigenbras which are related to the old set
via
0
hx 0 |new = e i γ hx 0 |old , (3.49)
where γ 0 ≡ γ(x 0 ) is a real function of x 0 . This transformation amounts to a
rearrangement of the relative phases of the eigenbras. The new normalization
condition is
0 00 0 00
hx 0 |x 00 inew = hx 0 |e i γ e−i γ |x 00 iold = e i (γ −γ ) hx 0 |x 00 iold
0 00
= e i (γ −γ ) δ(x 0 − x 00 ) = δ(x 0 − x 00 ). (3.50)
41
3.4 Schrödinger’s representation - I 3 POSITION AND MOMENTUM
Thus, the new eigenbras satisfy the same normalization condition as the old
eigenbras.
where γ ≡ γ(x) is a real function of the position operator x. The dual of the
above equation yields the transformation rule for the standard bra,
The transformation rule for a general operator A follows from Eqs. (3.51) and
(3.52), plus the requirement that the triple product hAi remain invariant (this
must be the case, otherwise the probability of a measurement yielding a certain
result would depend on the choice of eigenbras). Thus,
yields
d
!
p = −i h̄ . (3.57)
dx new
42
3.5 Schrödinger’s representation - II 3 POSITION AND MOMENTUM
Equation (3.56) fixes γ to within an arbitrary additive constant: i.e., the special
eigenkets and eigenbras for which Eq. (3.57) is true are determined to within an
arbitrary common phase-factor.
The orthogonality condition for the eigenkets [i.e., the generalization of Eq. (3.26)]
is
hq10 · · · qN0 |q100 · · · qN00 i = δ(q10 − q100 ) δ(q20 − q200 ) · · · δ(qN0 − qN00 ). (3.59)
The completeness condition [i.e., the generalization of Eq. (3.27)] is
Z +∞ Z +∞
··· dq10 · · · dqN0 |q10 · · · qN0 ihq10 · · · qN0 | = 1. (3.60)
−∞ −∞
43
3.5 Schrödinger’s representation - II 3 POSITION AND MOMENTUM
The standard bra h is the dual of the standard ket. A general state ket is written
hφ(q1 · · · qN ), (3.64)
where
φ(q10 · · · qN0 ) = hφ|q10 · · · qN0 i. (3.65)
The probability of an observation of the system finding the first coordinate in the
range q10 to q10 + dq10 , the second coordinate in the range q20 to q20 + dq20 , etc., is
P(q10 · · · qN0 ; dq10 · · · dqN0 ) = |ψ(q10 · · · qN0 )|2 dq10 · · · dqN0 . (3.66)
44
3.5 Schrödinger’s representation - II 3 POSITION AND MOMENTUM
It can be seen, by comparison with Eqs. (3.23)–(3.25), that the linear oper-
ators −i h̄ ∂/∂qi satisfy the same commutation relations with the q’s and with
each other that the p’s do. The most general conclusion we can draw from this
coincidence of commutation relations is (see Dirac)
∂ ∂F(q1 · · · qN )
pi = −i h̄ + . (3.73)
∂qi ∂qi
However, the function F can be transformed away via a suitable readjustment of
the phases of the basis eigenkets (see Sect. 3.4, and Dirac). Thus, we can always
construct a set of simultaneous eigenkets of q1 · · · qN for which
∂
pi = −i h̄ . (3.74)
∂qi
This is the generalized Schrödinger representation.
46
3.6 The momentum representation 3 POSITION AND MOMENTUM
The above result is easily generalized to a system with more than one degree
of freedom. Suppose the system is specified by N coordinates, q1 · · · qN , and
N conjugate momenta, p1 · · · pN . Then, in the momentum representation, the
coordinates can be written as
∂
qi = i h̄ . (3.88)
∂pi
We also have
qi i = 0, (3.89)
and
∂
hp10 · · · pN0 |qi = i h̄ 0 hp10 · · · pN0 |. (3.90)
∂pi
47
3.7 The uncertainty relation 3 POSITION AND MOMENTUM
yields
hp 0 |p 00 i = |c 0 |2 h δ(p 0 − p 00 ). (3.95)
This is consistent with Eq. (3.80), provided that c 0 = h−1/2 . Thus,
48
3.7 The uncertainty relation 3 POSITION AND MOMENTUM
Consider a general state ket |Ai whose coordinate wave-function is ψ(x 0 ), and
whose momentum wave-function is Ψ(p 0 ). In other words,
ψ(x 0 ) = hx 0 |Ai, (3.97)
Ψ(p 0 ) = hp 0 |Ai. (3.98)
It is easily demonstrated that
Z +∞
ψ(x 0 ) = dp 0 hx 0 |p 0 ihp 0 |Ai
−∞
Z +∞
1
= 1/2 Ψ(p 0 ) exp(i p 0 x 0 /h̄) dp 0 (3.99)
h −∞
and
Z +∞
0
Ψ(p ) = dx 0 hp 0 |x 0 ihx 0 |Ai
−∞
Z +∞
1
= 1/2 ψ(x 0 ) exp(−i p 0 x 0 /h̄) dx 0 , (3.100)
h −∞
where use has been made of Eqs. (3.27), (3.81), (3.94), and (3.96). Clearly, the
momentum space wave-function is the Fourier transform of the coordinate space
wave-function.
49
3.8 Displacement operators 3 POSITION AND MOMENTUM
2 2 h̄2
h(∆x) ih(∆p) i ≥ (3.102)
4
for any general state. It is easily demonstrated that the minimum uncertainty
states, for which the equality sign holds in the above relation, correspond to
Gaussian wave-packets in both coordinate and momentum space.
The situation is not so clear with state kets. The final state of the system
only determines the direction of the displaced state ket. Even if we adopt the
convention that all state kets have unit norms, the final ket is still not completely
determined, since it can be multiplied by a constant phase-factor. However, we
know that the superposition relations between states remain invariant under the
displacement. This follows because the superposition relations have a physical
significance which is unaffected by a displacement of the system. Thus, if
in the undisplaced system, and the displacement causes ket |Ri to transform to
ket |Rdi, etc., then in the displaced system we have
50
3.8 Displacement operators 3 POSITION AND MOMENTUM
Incidentally, this determines the displaced kets to within a single arbitrary phase-
factor to be multiplied into all of them. The displaced kets cannot be multiplied
by individual phase-factors, because this would wreck the superposition relations.
Since Eq. (3.104) holds in the displaced system whenever Eq. (3.103) holds in
the undisplaced system, it follows that the displaced ket |Rdi must be the result
of some linear operator acting on the undisplaced ket |Ri. In other words,
where D an operator which depends only on the nature of the displacement. The
arbitrary phase-factor by which all displaced kets may be multiplied results in D
being undetermined to an arbitrary multiplicative constant of modulus unity.
We now adopt the ansatz that any combination of bras, kets, and dynamical
variables which possesses a physical significance is invariant under a displace-
ment of the system. The normalization condition
hA|Ai = 1 (3.106)
for a state ket |Ai certainly has a physical significance. Thus, we must have
hAd|Adi = 1. (3.107)
Since this must hold for any state ket |Ai, it follows that
D† D = 1. (3.109)
Hence, the displacement operator is unitary. Note that the above relation implies
that
|Ai = D† |Adi. (3.110)
The equation
v |Ai = |Bi, (3.111)
51
3.8 Displacement operators 3 POSITION AND MOMENTUM
where the operator v represents a dynamical variable, has some physical signifi-
cance. Thus, we require that
vd = D v D † . (3.114)
Note that the arbitrary numerical factor in D does not affect either of the results
(3.109) and (3.114).
to exist. Let
D−1
dx = lim , (3.116)
δx→0 δx
where dx is denoted the displacement operator along the x-axis. The fact that D
can be replaced by D exp(i γ), where γ is a real phase-angle, implies that d x can
be replaced by
D exp(i γ) − 1 D − 1 + iγ
lim = lim = dx + i a x , (3.117)
δx→0 δx δx→0 δx
where ax is the limit of γ/δx. We have assumed, as seems reasonable, that γ tends
to zero as δx → 0. It is clear that the displacement operator is undetermined to
an arbitrary imaginary additive constant.
52
3.8 Displacement operators 3 POSITION AND MOMENTUM
53
3.8 Displacement operators 3 POSITION AND MOMENTUM
A finite translation along the x-axis can be constructed from a series of very
many infinitesimal translations. Thus, the operator D(∆x) which translates the
system a distance ∆x along the x-axis is written
!N
∆x px
D(∆x) = lim 1−i , (3.127)
N→∞ N h̄
where use has been made of Eqs. (3.118) and (3.126). It follows that
We can also construct displacement operators which translate the system along
the y- and z-axes. Note that a displacement a distance ∆x along the x-axis com-
mutes with a displacement a distance ∆y along the y-axis. In other words, if the
system is moved ∆x along the x-axis, and then ∆y along the y-axis then it ends
up in the same state as if it were moved ∆y along the y-axis, and then ∆x along
the x-axis. The fact that translations in independent directions commute is clearly
associated with the fact that the conjugate momentum operators associated with
these directions also commute [see Eqs. (3.24) and (3.128)].
54
4 QUANTUM DYNAMICS
4 Quantum dynamics
Up to now, we have only considered systems at one particular instant of time. Let
us now investigate how quantum mechanical systems evolve with time.
Consider a system in a state A which evolves in time. At time t the state of the
system is represented by the ket |Ati. The label A is needed to distinguish the
ket from any other ket (|Bti, say) which is evolving in time. The label t is needed
to distinguish the different states of the system at different times.
The final state of the system at time t is completely determined by its initial
state at time t0 plus the time interval t − t0 (assuming that the system is left
undisturbed during this time interval). However, the final state only determines
the direction of the final state ket. Even if we adopt the convention that all state
kets have unit norms, the final ket is still not completely determined, since it
can be multiplied by an arbitrary phase-factor. However, we expect that if a
superposition relation holds for certain states at time t0 then the same relation
should hold between the corresponding time-evolved states at time t, assuming
that the system is left undisturbed between times t0 and t. In other words, if
|Rt0 i = |At0 i + |Bt0 i (4.1)
for any three kets, then we should have
|Rti = |Ati + |Bti. (4.2)
This rule determines the time-evolved kets to within a single arbitrary phase-
factor to be multiplied into all of them. The evolved kets cannot be multiplied by
individual phase-factors, since this would invalidate the superposition relation at
later times.
According to Eqs. (4.1) and (4.2), the final ket |Rti depends linearly on the
initial ket |Rt0 i. Thus, the final ket can be regarded as the result of some linear
operator acting on the initial ket: , i.e.,
|Rti = T |Rt0 i, (4.3)
55
4.1 Schrödinger’s equations of motion 4 QUANTUM DYNAMICS
where T is a linear operator which depends only on the times t and t0 . The
arbitrary phase-factor by which all time evolved kets may be multiplied results
in T (t, t0 ) being undetermined to an arbitrary multiplicative constant of modulus
unity.
Since we have adopted a convention in which the norm of any state ket is
unity, it make sense to define the time evolution operator T in such a manner
that it preserves the length of any ket upon which it acts (i.e., if a ket is prop-
erly normalized at time t then it will remain normalized at all subsequent times
t > t0 ). This is always possible, since the length of a ket possesses no physical
significance. Thus, we require that
hAt0 |At0 i = hAt|Ati (4.4)
for any ket A, which immediately yields
T † T = 1. (4.5)
Hence, the time evolution operator T is a unitary operator.
Up to now, the time evolution operator T looks very much like the spatial
displacement operator D introduced in the previous section. However, there are
some important differences between time evolution and spatial displacement.
In general, we do expect the expectation value of some observable ξ to evolve
with time, even if the system is left in a state of undisturbed motion (after all,
time evolution has no meaning unless something observable changes with time).
The triple product hA|ξ|Ai can evolve either because the ket |Ai evolves and the
operator ξ stays constant, the ket |Ai stays constant and the operator ξ evolves,
or both the ket |Ai and the operator ξ evolve. Since we are already committed to
evolving state kets, according to Eq. (4.3), let us assume that the time evolution
operator T can be chosen in such a manner that the operators representing the
dynamical variables of the system do not evolve in time (unless they contain some
specific time dependence).
We expect, from physical continuity, that as t → t0 then |Ati → |At0 i for any
ket A. Thus, the limit
|Ati − |At0 i T −1
lim = lim |At0 i (4.6)
t→t0 t − t0 t→t0 t − t0
56
4.1 Schrödinger’s equations of motion 4 QUANTUM DYNAMICS
should exist. Note that this limit is simply the derivative of |At 0 i with respect to
t0 . Let
T (t, t0 ) − 1
τ(t0 ) = lim . (4.7)
t→t0 t − t0
It is easily demonstrated from Eq. (4.5) that τ is anti-Hermitian: i.e.,
τ† + τ = 0. (4.8)
The fact that T can be replaced by T exp(i γ) (where γ is real) implies that τ is
undetermined to an arbitrary imaginary additive constant (see previous section).
Let us define the Hermitian operator H(t0 ) = i h̄ τ. This operator is undetermined
to an arbitrary real additive constant. It follows from Eqs. (4.6) and (4.7) that
d|At0 i |Ati − |At0 i
i h̄ = i h̄ lim = i h̄ τ(t0 )|At0 i = H(t0 )|At0 i. (4.9)
dt0 t→t0 t − t0
When written for general t this equation becomes
d|Ati
i h̄ = H(t)|Ati. (4.10)
dt
Equation (4.10) gives the general law for the time evolution of a state ket in
a scheme in which the operators representing the dynamical variables remain
fixed. This equation is denoted Schrödinger’s equation of motion. It involves a
Hermitian operator H(t) which is, presumably, a characteristic of the dynamical
system under investigation.
We saw, in the previous section, that if the operator D(x, x0 ) displaces the
system along the x-axis from x0 to x then
D(x, x0 ) − 1
px = i h̄ x→x
lim , (4.11)
0 x − x0
where px is the operator representing the momentum conjugate to x. We now
have that if the operator T (t, t0 ) evolves the system in time from t0 to t then
T (t, t0 ) − 1
H(t0 ) = i h̄ lim . (4.12)
t→t0 t − t0
57
4.1 Schrödinger’s equations of motion 4 QUANTUM DYNAMICS
where use has been made of Eqs. (4.5) and (4.6). (Here, we assume that Hamil-
tonian operators evaluated at different times commute with one another). It is
now clear how the fact that H is undetermined to an arbitrary real additive con-
stant leaves T undetermined to a phase-factor. Note that, in the above analysis,
time is not an operator (we cannot observe time, as such), it is just a parameter
(or, more accurately, a continuous label). Since we are only dealing with non-
relativistic quantum mechanics, the fact that position is an operator, but time is
only a label, need not worry us unduly. In relativistic quantum mechanics, time
and space coordinates are treated on the same footing by relegating position from
being an operator to being just a label.
58
4.2 Heisenberg’s equations of motion 4 QUANTUM DYNAMICS
We have seen that in Schrödinger’s scheme the dynamical variables of the system
remain fixed during a period of undisturbed motion, whereas the state kets evolve
according to Eq. (4.10). However, this is not the only way in which to represent
the time evolution of the system.
where use has been made of Eqs. (4.3), (4.5), and the fact that T (t0 , t0 ) = 1.
Clearly, the transformed state ket does not evolve in time. Thus, the transforma-
tion (4.16) has the effect of bringing all kets representing states of undisturbed
motion of the system to rest.
The transformation must also be applied to bras. The dual of Eq. (4.16) yields
The transformation rule for a general observable v is obtained from the require-
ment that the expectation value hA|v|Ai should remain invariant. It is easily seen
that
vt = T † v T. (4.19)
Thus, a dynamical variable, which corresponds to a fixed linear operator in Schrö-
dinger’s scheme, corresponds to a moving linear operator in this new scheme. It
is clear that the transformation (4.16) leads us to a scenario in which the state
of the system is represented by a fixed vector, and the dynamical variables are
represented by moving linear operators. This is termed the Heisenberg picture, as
opposed to the Schrödinger picture, which is outlined in Sect. 4.1.
59
4.2 Heisenberg’s equations of motion 4 QUANTUM DYNAMICS
Equation (4.25) shows how the dynamical variables of the system evolve in
the Heisenberg picture. It is denoted Heisenberg’s equation of motion. Note that
the time-varying dynamical variables in the Heisenberg picture are usually called
Heisenberg dynamical variables to distinguish them from Schrödinger dynamical
variables (i.e., the corresponding variables in the Schrödinger picture), which do
not evolve in time.
Here, [· · ·]classical is the classical Poisson bracket, and H denotes the classical
Hamiltonian. The strong resemblance between Eqs. (4.26) and (4.27) provides
us with further justification for our identification of the linear operator H with
the energy of the system in quantum mechanics.
Note that if the Hamiltonian does not explicitly depend on time (i.e., the sys-
tem is not subject to some time-dependent external force) then Eq. (4.15) yields
T (t, t0 ) = exp [−i H (t − t0 )/h̄] . (4.28)
This operator manifestly commutes with H, so
Ht = T † H T = H. (4.29)
Furthermore, Eq. (4.25) gives
dH
i h̄ = [H, H] = 0. (4.30)
dt
Thus, if the energy of the system has no explicit time-dependence then it is rep-
resented by the same non-time-varying operator H in both the Schrödinger and
Heisenberg pictures.
We have now derived all of the basic elements of quantum mechanics. The only
thing which is lacking is some rule to determine the form of the quantum mechan-
ical Hamiltonian. For a physical system which possess a classical analogue, we
61
4.3 Ehrenfest’s theorem 4 QUANTUM DYNAMICS
generally assume that the Hamiltonian has the same form as in classical physics
(i.e., we replace the classical coordinates and conjugate momenta by the corre-
sponding quantum mechanical operators). This scheme guarantees that quantum
mechanics yields the correct classical equations of motion in the classical limit.
Whenever an ambiguity arises because of non-commuting observables, this can
usually be resolved by requiring the Hamiltonian H to be an Hermitian oper-
ator. For instance, we would write the quantum mechanical analogue of the
classical product x p, appearing in the Hamiltonian, as the Hermitian product
(1/2)(x p + p x). When the system in question has no classical analogue then we
are reduced to guessing a form for H which reproduces the observed behaviour
of the system.
62
4.3 Ehrenfest’s theorem 4 QUANTUM DYNAMICS
63
4.3 Ehrenfest’s theorem 4 QUANTUM DYNAMICS
64
4.4 Schrödinger’s wave-equation 4 QUANTUM DYNAMICS
Let us now consider the motion of a particle in three dimensions in the Schrödinger
picture. The fixed dynamical variables of the system are the position operators
x ≡ (x1 , x2 , x3 ), and the momentum operators p ≡ (p1 , p2 , p3 ). The state of the
system is represented as some time evolving ket |Ati.
65
4.4 Schrödinger’s wave-equation 4 QUANTUM DYNAMICS
where V(x 0 ) is a scalar function of the position eigenvalues. Combining Eqs. (4.49),
(4.50), (4.52), and (4.53), we obtain
∂hx 0 |Ati h̄2 02 0
Suppose that the ket |Ati is an eigenket of the Hamiltonian belonging to the
eigenvalue H 0 :
H|Ati = H 0 |Ati. (4.56)
Schrödinger’s equation of motion (4.10) yields
d|Ati
i h̄ = H 0 |Ati. (4.57)
dt
This can be integrated to give
Note that |Ati only differs from |At0 i by a phase-factor. The direction of the
vector remains fixed in ket space. This suggests that if the system is initially in an
eigenstate of the Hamiltonian then it remains in this state for ever, as long as the
66
4.4 Schrödinger’s wave-equation 4 QUANTUM DYNAMICS
Since it is conventional to set the potential at infinity equal to zero, the above
relation implies that bound states are equivalent to negative energy states. The
boundary condition (4.61) is sufficient to uniquely specify the solution of Eq. (4.60).
In deriving Eq. (4.65) we have, naturally, assumed that the potential V(x 0 ) is
real. Suppose, however, that the potential has an imaginary component. In this
case, Eq. (4.65) generalizes to
∂ρ 2 Im(V)
+ ∇0 · j = ρ, (4.69)
∂t h̄
giving Z Z
∂ 2
ρ(x , t) d x = Im V(x 0 ) ρ(x 0 , t) d3 x 0 .
0 3 0
(4.70)
∂t h̄
Thus, if Im(V) < 0 then the total probability of observing the particle anywhere
in space decreases monotonically with time. Thus, an imaginary potential can
be used to account for the disappearance of a particle. Such a potential is often
employed to model nuclear reactions in which incident particles can be absorbed
by nuclei.
h̄ |∇ 02 S| |∇ 0 S|2 . (4.76)
69
4.4 Schrödinger’s wave-equation 4 QUANTUM DYNAMICS
70
5 ANGULAR MOMENTUM
5 Angular momentum
Lx = y p z − z p y , (5.1)
Ly = z p x − x p z , (5.2)
Lz = x p y − y p x . (5.3)
Let us assume that the operators (Lx , Ly , Lz ) ≡ L which represent the compo-
nents of orbital angular momentum in quantum mechanics can be defined in an
analogous manner to the corresponding components of classical angular momen-
tum. In other words, we are going to assume that the above equations specify
the angular momentum operators in terms of the position and linear momentum
operators. Note that Lx , Ly , and Lz are Hermitian, so they represent things which
can, in principle, be measured. Note, also, that there is no ambiguity regard-
ing the order in which operators appear in products on the right-hand sides of
Eqs. (5.1)–(5.3), since all of the products consist of operators which commute.
[xi , xj ] = 0, (5.4)
[pi , pj ] = 0, (5.5)
[xi , pj ] = i h̄ δij , (5.6)
where i and j stand for either x, y, or z. Consider the commutator of the operators
Lx and Lz :
71
5.1 Orbital angular momentum 5 ANGULAR MOMENTUM
The cyclic permutations of the above result yield the fundamental commutation
relations satisfied by the components of an angular momentum:
[Lx , Ly ] = i h̄ Lz , (5.8)
[Ly , Lz ] = i h̄ Lx , (5.9)
[Lz , Lx ] = i h̄ Ly . (5.10)
L × L = i h̄ L. (5.11)
The three commutation relations (5.8)–(5.10) are the foundation for the whole
theory of angular momentum in quantum mechanics. Whenever we encounter
three operators having these commutation relations, we know that the dynamical
variables which they represent have identical properties to those of the compo-
nents of an angular momentum (which we are about to derive). In fact, we shall
assume that any three operators which satisfy the commutation relations (5.8)–
(5.10) represent the components of an angular momentum.
Suppose that there are N particles in the system, with angular momentum
vectors Li (where i runs from 1 to N). Each of these vectors satisfies Eq. (5.11),
so that
Li × Li = i h̄ Li . (5.12)
However, we expect the angular momentum operators belonging to different par-
ticles to commute, since they represent different degrees of freedom of the sys-
tem. So, we can write
Li × Lj + Lj × Li = 0, (5.13)
PN
for i 6= j. Consider the total angular momentum of the system, L = i=1 Li . It is
clear from Eqs. (5.12) and (5.13) that
N
X N
X N
X N
1X
L×L = Li × Lj = Li × L i + (Li × Lj + Lj × Li )
i=1 j=1 i=1
2 i,j=1
N
X
= i h̄ Li = i h̄ L. (5.14)
i=1
72
5.1 Orbital angular momentum 5 ANGULAR MOMENTUM
Thus, the sum of two or more angular momentum vectors satisfies the same
commutation relation as a primitive angular momentum vector. In particular,
the total angular momentum of the system satisfies the commutation relation
(5.11).
The immediate conclusion which can be drawn from the commutation rela-
tions (5.8)–(5.10) is that the three components of an angular momentum vector
cannot be specified (or measured) simultaneously. In fact, once we have speci-
fied one component, the values of other two components become uncertain. It is
conventional to specify the z-component, Lz .
73
5.2 Eigenvalues of angular momentum 5 ANGULAR MOMENTUM
We can write
L2 |l, mi = f(l, m) h̄2 |l, mi, (5.26)
without loss of generality, where f(l, m) is some real dimensionless function of l
and m. Later on, we will show that f(l, m) = l (l + 1). Now,
hl, m|L2 − Lz2 |l, mi = hl, m|f(l, m) h̄2 − m2 h̄2 |l, mi = [f(l, m) − m2 ]h̄2 , (5.27)
where |Ai is a general ket, and ξ is an Hermitian operator. The proof follows
from the observation that
74
5.2 Eigenvalues of angular momentum 5 ANGULAR MOMENTUM
where |Bi = ξ|Ai, plus the fact that hB|Bi ≥ 0 for a general ket |Bi [see Eq. (2.21)].
It follows from Eqs. (5.27)–(5.29) that
Consider the effect of the shift operator L+ on the eigenket |l, mi. It is easily
demonstrated that
where use has been made of Eq. (5.26), plus the fact that L2 and Lz commute. It
follows that the ket L+ |l, mi has the same eigenvalue of L2 as the ket |l, mi. Thus,
the shift operator L+ does not affect the magnitude of the angular momentum of
any eigenket it acts upon. Note that
where use has been made of Eq. (5.22). The above equation implies that L+ |l, mi
is proportional to |l, m + 1i. We can write
L+ |l, mi = c+
l,m h̄ |l, m + 1i, (5.34)
where c+ +
l,m is a number. It is clear that when the operator L acts on a simulta-
neous eigenstate of L2 and Lz , the eigenvalue of L2 remains unchanged, but the
eigenvalue of Lz is increased by h̄. For this reason, L+ is called a raising operator.
The shift operators step the value of m up and down by unity each time they
operate on one of the simultaneous eigenkets of L2 and Lz . It would appear, at
first sight, that any value of m can be obtained by applying the shift operators a
sufficient number of times. However, according to Eq. (5.31), there is a definite
upper bound to the values that m2 can take. This bound is determined by the
75
5.2 Eigenvalues of angular momentum 5 ANGULAR MOMENTUM
L2 |l, mmax i = (Lz2 + h̄ Lz )|l, mmax i = mmax (mmax + 1) h̄2 |l, mmax i. (5.40)
But, when L− operates on |n, mmax i it generates |n, mmax − 1i, |n, mmax − 2i, etc.
Since the lowering operator does not change the eigenvalue of L2 , all of these
states must correspond to the same value of f, namely mmax (mmax + 1). Thus,
At this stage, we can give the unknown quantum number l the value mmax , with-
out loss of generality. We can also write the above equation in the form
Thus,
hl, m|L− L+ |l, mi = h̄2 [l (l + 1) − m (m + 1)]. (5.45)
76
5.2 Eigenvalues of angular momentum 5 ANGULAR MOMENTUM
We have already seen that the inequality (5.31) implies that there is a maxi-
mum and a minimum possible value of m. The maximum value of m is denoted
77
5.3 Rotation operators 5 ANGULAR MOMENTUM
l. What is the minimum value? Suppose that we try to lower the value of m
below its minimum value mmin . Since there is no state with m < mmin , we must
have
L− |l, mmin i = 0. (5.53)
According to Eq. (5.35), this implies that
c−
l,mmin = 0. (5.54)
It can be seen from Eq. (5.52) that mmin = −l. We conclude that m can take a
“ladder” of discrete values, each rung differing from its immediate neighbours by
unity. The top rung is l, and the bottom rung is −l. There are only two possible
choices for l. Either it is an integer (e.g., l = 2, which allows m to take the values
−2, −1, 0, 1, 2), or it is a half-integer (e.g., l = 3/2, which allows m to take the
values −3/2, −1/2, 1/2, 3/2). We will prove in the next section that an orbital
angular momentum can only take integer values of l.
Consider a particle described by the spherical polar coordinates (r, θ, ϕ). The
classical momentum conjugate to the azimuthal angle ϕ is the z-component of
angular momentum, Lz . According to Sect. 3.5, in quantum mechanics we can
78
5.3 Rotation operators 5 ANGULAR MOMENTUM
always adopt Schrödinger’s representation, for which ket space is spanned by the
simultaneous eigenkets of the position operators r, θ, and φ, and L z takes the
form
∂
Lz = −i h̄ . (5.57)
∂ϕ
We can do this because there is nothing in Sect. 3.5 which specifies that we
have to use Cartesian coordinates—the representation (3.74) works for any well-
defined set of coordinates.
Consider an operator R(∆ϕ) which rotates the system an angle ∆ϕ about the z-
axis. This operator is very similar to the operator D(∆x), introduced in Sect. 3.8,
which translates the system a distance ∆x along the x axis. We were able to
demonstrate in Sect. 3.8 that
D(δx) − 1
px = i h̄ lim , (5.58)
δx→0 δx
where px is the linear momentum conjugate to x. There is nothing in our deriva-
tion of this result which specifies that x has to be a Cartesian coordinate. Thus,
the result should apply just as well to an angular coordinate. We conclude that
R(δϕ) − 1
Lz = i h̄ lim . (5.59)
δϕ→0 δϕ
79
5.3 Rotation operators 5 ANGULAR MOMENTUM
Note that R(∆ϕ) has all of the properties we would expect of a rotation operator
R(0) = 1, (5.63)
R(∆ϕ) R(−∆ϕ) = 1, (5.64)
R(∆ϕ1 ) R(∆ϕ2 ) = R(∆ϕ1 + ∆ϕ2 ). (5.65)
We conclude that m must be an integer. This implies, from the previous section,
that l must also be an integer. Thus, orbital angular momentum can only take on
integer values of the quantum numbers l and m.
Thus, the eigenstate is invariant to rotations about the z-axis. Clearly, its wave-
function must be symmetric about the z-axis.
by analogy with Eq. (5.62). Here, Rx (∆ϕx ) denotes an operator which rotates the
system by an angle ∆ϕx about the x-axis, etc. Suppose that the system is in an
eigenstate of zero overall orbital angular momentum (i.e., an l = 0 state). We
know that the system is also in an eigenstate of zero orbital angular momentum
about any particular axis. This follows because l = 0 implies m = 0, according
80
5.4 Eigenfunctions of orbital angular momentum 5 ANGULAR MOMENTUM
to the previous section, and we can choose the z-axis to point in any direction.
Thus,
Clearly, a zero angular momentum state is invariant to rotations about any axis.
Such a state must possess a spherically symmetric wave-function.
Note that a rotation about the x-axis does not commute with a rotation about
the y-axis. In other words, if the system is rotated an angle ∆ϕx about the x-axis,
and then ∆ϕy about the y-axis, it ends up in a different state to that obtained
by rotating an angle ∆ϕy about the y-axis, and then ∆ϕx about the x-axis. In
quantum mechanics, this implies that Ry (∆ϕy ) Rx (∆ϕx ) 6= Rx (∆ϕx ) Ry (∆ϕy ), or
Ly Lx 6= Lx Ly , [see Eqs. (5.68)–(5.70)]. Thus, the noncommuting nature of the
angular momentum operators is a direct consequence of the fact that rotations
do not commute.
81
5.4 Eigenfunctions of orbital angular momentum 5 ANGULAR MOMENTUM
82
5.4 Eigenfunctions of orbital angular momentum 5 ANGULAR MOMENTUM
The spherical harmonics also form a complete set for representing general func-
tions of θ and ϕ.
By definition,
L2 Ylm = l (l + 1) h̄2 Ylm , (5.93)
where l is an integer. It follows from Eqs. (5.82) and (5.89) that
Lz Ylm = m h̄ Ylm , (5.94)
where m is an integer lying in the range −l ≤ m ≤ l. Thus, the wave-function
ψ(r, θ, ϕ) = R(r) Ylm (θ, φ), where R is a general function, has all of the expected
features of the wave-function of a simultaneous eigenstate of L2 and Lz belonging
to the quantum numbers l and m. The well-known formula
dPlm 1 m+1 mξ m
= √ P − P
dξ 1 − ξ2 l 1 − ξ2 l
(l + m)(l − m + 1) m−1 mξ m
= − √ P l + P (5.95)
1 − ξ2 1 − ξ2 l
can be combined with Eqs. (5.83) and (5.89) to give
q
L+ Ylm = l (l + 1) − m (m + 1) h̄ Ylm+1 , (5.96)
q
L− Ylm = l (l + 1) − m (m − 1) h̄ Ylm−1 . (5.97)
83
5.5 Motion in a central field 5 ANGULAR MOMENTUM
These equations are equivalent to Eqs. (5.55)–(5.56). Note that a spherical har-
monic wave-function is symmetric about the z-axis (i.e., independent of ϕ)
√ when-
0
ever m = 0, and is spherically symmetric whenever l = 0 (since Y0 = 1/ 4π).
h̄2 2
H=− ∇ + V(r). (5.99)
2M
When written in spherical polar coordinates, the above equation becomes
h̄2 1 ∂ 2 ∂ ∂2
1 ∂ ∂ 1
H=− r + sin θ + + V(r). (5.100)
2 M r2 ∂r ∂r r2 sin θ ∂θ ∂θ r2 sin2 θ ∂ϕ2
Comparing this equation with Eq. (5.85), we find that
h̄2 1 ∂ 2 ∂ L2
H= − 2 r + 2 2 + V(r). (5.101)
2M r ∂r ∂r h̄ r
84
5.5 Motion in a central field 5 ANGULAR MOMENTUM
[L, H] = 0. (5.102)
[L2 , H] = 0. (5.103)
According to Sect. 4.2, the previous two equations ensure that the angular mo-
mentum L and its magnitude squared L2 are both constants of the motion. This
is as expected for a spherically symmetric potential.
H ψ = E ψ, (5.104)
where E is a number. Since L2 and Lz commute with each other and the Hamil-
tonian, it is always possible to represent the state of the system in terms of the
simultaneous eigenstates of L2 , Lz , and H. But, we already know that the most
general form for the wave-function of a simultaneous eigenstate of L2 and Lz is
(see previous section)
ψ(r, θ, ϕ) = R(r) Ylm (θ, ϕ). (5.105)
Substituting Eq. (5.105) into Eq. (5.101), and making use of Eq. (5.93), we ob-
tain
2
h̄ 1 d 2 d l (l + 1)
− r + + V(r) − E R = 0. (5.106)
2M r2 dr dr r2
This is a Sturm-Liouville equation for the function R(r). We know, from the gen-
eral properties of this type of equation, that if R(r) is required to be well-behaved
at r = 0 and as r → ∞ then solutions only exist for a discrete set of values of E.
These are the energy eigenvalues. In general, the energy eigenvalues depend on
the quantum number l, but are independent of the quantum number m.
85
5.6 Energy levels of the hydrogen atom 5 ANGULAR MOMENTUM
Consider a hydrogen atom, for which the potential takes the specific form
e2
V(r) = − . (5.107)
4π0 r
The radial eigenfunction R(r) satisfies Eq. (5.106), which can be written
h̄2 1 d 2 d e2
l (l + 1)
− 2 r + 2
− − E R = 0. (5.108)
2µ r dr dr r 4π0 r
Here, µ = me mp /(me +mp ) is the reduced mass, which takes into account the fact
that the electron (of mass me ) and the proton (of mass mp ) both rotate about a
common centre, which is equivalent to a particle of mass µ rotating about a fixed
point. Let us write the product r R(r) as the function P(r). The above equation
transforms to
d2 P 2 µ l (l + 1)h̄2 e2
− 2 − − E P = 0, (5.109)
dr2 h̄ 2 µ r2 4π0 r
which is the one-dimensional Schrödinger equation for a particle of mass µ mov-
ing in the effective potential
e2 l (l + 1) h̄2
Veff (r) = − + . (5.110)
4π0 r 2 µ r2
The effective potential has a simple physical interpretation. The first part is the
attractive Coulomb potential, and the second part corresponds to the repulsive
centrifugal force.
Let v
2
u
u −h̄
u
a= t , (5.111)
2µE
and y = r/a, with
P(r) = f(y) exp(−y). (5.112)
Here, it is assumed that the energy eigenvalue E is negative. Equation (5.109)
transforms to
2 2
d d l (l + 1) 2 µ e a
−2 − + 2
f = 0. (5.113)
dy2 dy y2 4π0 h̄ y
86
5.6 Energy levels of the hydrogen atom 5 ANGULAR MOMENTUM
cn [n (n − 1) − l (l + 1)] = cn−1 2 (n − 1) −
. (5.116)
4π0 h̄2
Now, the power law series (5.114) must terminate at small n, at some positive
value of n, otherwise f(y) behaves unphysically as y → 0. This is only possible if
[nmin (nmin −1)−l (l+1)] = 0, where the first term in the series is cnmin ynmin . There
are two possibilities: nmin = −l or nmin = l + 1. The former predicts unphysical
behaviour of the wave-function at y = 0. Thus, we conclude that nmin = l + 1.
Note that for an l = 0 state there is a finite probability of finding the electron at
the nucleus, whereas for an l > 0 state there is zero probability of finding the
electron at the nucleus (i.e., |ψ|2 = 0 at r = 0, except when l = 0). Note, also,
that it is only possible to obtain sensible behaviour of the wave-function as r → 0
if l is an integer.
For large values of y, the ratio of successive terms in the series (5.114) is
cn y 2y
= , (5.117)
cn−1 n
according to Eq. (5.116). This is the same as the ratio of successive terms in the
series
X (2 y)n
, (5.118)
n n!
which converges to exp(2 y). We conclude that f(y) → exp(2 y) as y → ∞. It
follows from Eq. (5.112) that R(r) → exp(r/a)/r as r → ∞. This doesRnot cor-
respond to physically acceptable behaviour of the wave-function, since |ψ|2 dV
must be finite. The only way in which we can avoid this unphysical behaviour is
87
5.6 Energy levels of the hydrogen atom 5 ANGULAR MOMENTUM
where
n 4π0 h̄2
a= 2
= 5.3 × 10−11 n meters, (5.122)
µe
and R(x) is a well-behaved solution of the differential equation
1 d 2d l (l + 1) 2 n
x − + − 1 R = 0. (5.123)
x2 dx dx x2 x
Finally, the Ylm are spherical harmonics. The restrictions on the quantum numbers
are |m| ≤ l < n. Here, n is a positive integer, l is a non-negative integer, and m
is an integer.
88
5.7 Spin angular momentum 5 ANGULAR MOMENTUM
The next energy level corresponds to n = 2. The other quantum numbers are
allowed to take the values l = 0, m = 0 or l = 1, m = −1, 0, 1. Thus, there are
n = 2 states with non-zero angular momentum. Note that the energy levels given
in Eq. (5.120) are independent of the quantum number l, despite the fact that l
appears in the radial eigenfunction equation (5.123). This is a special property
of a 1/r Coulomb potential.
In addition to the quantized negative energy state of the hydrogen atom, which
we have just found, there is also a continuum of unbound positive energy states.
Up to now, we have tacitly assumed that the state of a particle in quantum me-
chanics can be completely specified by giving the wave-function ψ as a function
of the spatial coordinates x, y, and z. Unfortunately, there is a wealth of experi-
mental evidence which suggests that this simplistic approach is incomplete.
Consider an isolated system at rest, and let the eigenvalue of its total angular
momentum be j (j + 1) h̄2 . According to the theory of orbital angular momentum
outlined in Sects. 5.4 and 5.5, there are two possibilities. For a system consisting
of a single particle, j = 0. For a system consisting of two (or more) particles, j is
a non-negative integer. However, this does not agree with observations, because
we often find systems which appear to be structureless, and yet have j 6= 0. Even
worse, systems where j has half-integer values abound in nature. In order to
explain this apparent discrepancy between theory and experiments, Gouldsmit
and Uhlenbeck (in 1925) introduced the concept of an internal, purely quantum
mechanical, angular momentum called spin. For a particle with spin, the total
angular momentum in the rest frame is non-vanishing.
Let us denote the three components of the spin angular momentum of a par-
ticle by the Hermitian operators (Sx , Sy , Sz ) ≡ S. We assume that these operators
obey the fundamental commutation relations (5.8)–(5.10) for the components of
an angular momentum. Thus, we can write
S × S = i h̄ S. (5.125)
89
5.7 Spin angular momentum 5 ANGULAR MOMENTUM
Spin angular momentum clearly has many properties in common with orbital
angular momentum. However, there is one vitally important difference. Spin
angular momentum operators cannot be expressed in terms of position and mo-
mentum operators, like in Eqs. (5.1)–(5.3), since this identification depends on
an analogy with classical mechanics, and the concept of spin is purely quantum
mechanical: i.e., it has no analogy in classical physics. Consequently, the re-
striction that the quantum number of the overall angular momentum must take
integer values is lifted for spin angular momentum, since this restriction (found
in Sects. 5.3 and 5.4) depends on Eqs. (5.1)–(5.3). In other words, the quantum
number s is allowed to take half-integer values.
90
5.8 Wave-function of a spin one-half particle 5 ANGULAR MOMENTUM
91
5.8 Wave-function of a spin one-half particle 5 ANGULAR MOMENTUM
92
5.9 Rotation operators in spin space 5 ANGULAR MOMENTUM
Let us, for the moment, forget about the spatial position of the particle, and
concentrate on its spin state. A general spin state A is represented by the ket
in spin space. In Sect. 5.3, we were able to construct an operator Rz (∆ϕ) which
rotates the system by an angle ∆ϕ about the z-axis in position space. Can we also
construct an operator Tz (∆ϕ) which rotates the system by an angle ∆ϕ about
the z-axis in spin space? By analogy with Eq. (5.62), we would expect such an
operator to take the form
To demonstrate that the operator (5.148) really does rotate the spin of the
system, let us consider its effect on hSx i. Under rotation, this expectation value
changes as follows:
93
5.9 Rotation operators in spin space 5 ANGULAR MOMENTUM
First, we can use the explicit formula for Sx given in Eq. (5.135). We find that
Eq. (5.151) becomes
h̄
exp( i Sz ∆ϕ/h̄) ( |+ih−| + |−ih+| ) exp(−i Sz ∆ϕ/h̄), (5.152)
2
or
h̄ i ∆ϕ/2
|+ih−| e i ∆ϕ/2 + e −i ∆ϕ/2 |−ih+| e −i ∆ϕ/2 ,
e (5.153)
2
which reduces to
Sx cos ∆ϕ − Sy sin ∆ϕ, (5.154)
where use has been made of Eqs. (5.135)–(5.137).
Sx 1 −
+ · · · − Sy ϕ −
+ · · · , (5.157)
2! 3!
or
Sx cos ∆ϕ − Sy sin ∆ϕ, (5.158)
where use has been made of Eq. (5.125). The second proof is more general than
the first, since it only uses the fundamental commutation relation (5.125), and is,
therefore, valid for systems with spin angular momentum higher than one-half.
94
5.9 Rotation operators in spin space 5 ANGULAR MOMENTUM
where the Rkl are the elements of the conventional rotation matrix for the rotation
in question. It is clear, from our second derivation of the result (5.159), that this
property is not restricted to the spin operators of a spin one-half system. In fact,
we have effectively demonstrated that
X
hJk i → Rkl hJl i, (5.163)
l
where the Jk are the generators of rotation, satisfying the fundamental commuta-
tion relation J × J = i h̄ J, and the rotation operator about the kth axis is written
Rk (∆ϕ) = exp(−i Jk ∆ϕ/h̄).
Consider the effect of the rotation operator (5.148) on the state ket (5.147).
It is easily seen that
95
5.10 Magnetic moments 5 ANGULAR MOMENTUM
Note that a ket rotated by 2π radians differs from the original ket by a minus
sign. In fact, a rotation by 4π radians is needed to transform a ket into itself.
The minus sign does not affect the expectation value of S, since S is sandwiched
between hA| and |Ai, both of which change sign. Nevertheless, the minus sign
does give rise to observable consequences, as we shall see presently.
Does spin angular momentum also give rise to a contribution to the magnetic
moment of a charged particle? The answer is “yes”. In fact, relativistic quantum
mechanics actually predicts that a charged particle possessing spin should also
possess a magnetic moment (this was first demonstrated by Dirac). We can write
q
µ= (L + g S) , (5.168)
2m
where g is called the gyromagnetic ratio. For an electron this ratio is found to be
e2
1
ge = 2 1 +
. (5.169)
2π 4π0 h̄ c
96
5.11 Spin precession 5 ANGULAR MOMENTUM
It can be seen, by comparison with Eq. (5.148), that the time evolution operator
is precisely the same as the rotation operator for spin, with ∆ϕ set equal to ω t.
It is immediately clear that the Hamiltonian (5.171) causes the electron spin to
precess about the z-axis with angular frequency ω. In fact, Eqs. (5.159)–(5.161)
imply that
The time evolution of the state ket is given by analogy with Eq. (5.164):
97
5.11 Spin precession 5 ANGULAR MOMENTUM
Note that it takes time t = 4π/ω for the state ket to return to its original state. By
contrast, it only takes times t = 2π/ω for the spin vector to point in its original
direction.
98
5.12 Pauli two-component formalism 5 ANGULAR MOMENTUM
We have seen, in Sect. 5.4, that the eigenstates of orbital angular momentum
can be conveniently represented as spherical harmonics. In this representation,
the orbital angular momentum operators take the form of differential operators
involving only angular coordinates. It is conventional to represent the eigenstates
of spin angular momentum as column (or row) matrices. In this representation,
the spin angular momentum operators take the form of matrices.
The column matrix (5.184) is called a two-component spinor, and can be written
h+|Ai c+
χ≡ = = c + χ+ + c − χ− , (5.186)
h−|Ai c−
99
5.12 Pauli two-component formalism 5 ANGULAR MOMENTUM
where the c± are complex numbers. The row matrix (5.185) becomes
|A 0 i = Sk |Ai. (5.188)
|A 0 i → ≡ χ 0. (5.189)
h−|A 0 i
However,
or
h+|A 0 i
h+|Sk |+i h+|Sk |−i h+|Ai
= . (5.192)
h−|A 0 i h−|Sk |+i h−|Sk |−i h−|Ai
It follows that we can represent the operator/ket relation (5.188) as the matrix
relation
h̄
!
0
χ = σk χ, (5.193)
2
where the σk are the matrices of the h±|Sk |±i values divided by h̄/2. These
matrices, which are called the Pauli matrices, can easily be evaluated using the
explicit forms for the spin operators given in Eqs. (5.135)–(5.137). We find that
0 1
σ1 = , (5.194)
1 0
0 −i
σ2 = , (5.195)
i 0
1 0
σ3 = . (5.196)
0 −1
100
5.12 Pauli two-component formalism 5 ANGULAR MOMENTUM
Let us examine how the Pauli scheme can be extended to take into account the
position of a spin one-half particle. Recall, from Sect. 5.8, that we can represent
a general basis ket as the product of basis kets in position space and spin space:
|x 0 , y 0 , z 0 , ±i = |x 0 , y 0 , z 0 i|±i = |±i|x 0 , y 0 , z 0 i. (5.201)
The ket corresponding to state A is denoted ||Aii, and resides in the product
space of the position and spin ket spaces. State A is completely specified by the
two wave-functions
ψ+ (x 0 , y 0 , z 0 ) = hx 0 , y 0 , z 0 |h+||Aii, (5.202)
ψ− (x 0 , y 0 , z 0 ) = hx 0 , y 0 , z 0 |h−||Aii. (5.203)
Consider the operator relation
||A 0 ii = Sk ||Aii. (5.204)
101
5.12 Pauli two-component formalism 5 ANGULAR MOMENTUM
where use has been made of the fact that the spin operator Sk commutes with
the eigenbras hx 0 , y 0 , z 0 |. It is fairly obvious that we can represent the operator
relation (5.204) as a matrix relation if we generalize our definition of a spinor by
writing
0
ψ (r )
||Aii → + 0 ≡ χ, (5.207)
ψ− (r )
and so on. The components of a spinor are now wave-functions, instead of com-
plex numbers. In this scheme, the operator equation (5.204) becomes simply
h̄
!
0
χ = σk χ. (5.208)
2
102
5.12 Pauli two-component formalism 5 ANGULAR MOMENTUM
What about combinations of position and spin operators? The most commonly
occurring combination is a dot product: e.g., S · L = (h̄/2) σ · L. Consider the
hybrid operator σ·a, where a ≡ (ax , ay , az ) is some vector position operator. This
quantity is represented as a 2 × 2 matrix:
X +a3 a1 − i a 2
σ·a ≡ ak σ k = . (5.215)
k
a1 + i a 2 −a3
Since, in the Schrödinger representation, a general position operator takes the
form of a differential operator in x 0 , y 0 , or z 0 , it is clear that the above quantity
must be regarded as a matrix differential operator which acts on spinors of the
general form (5.207). The important identity
(σ·a) (σ·b) = a·b + i σ·(a × b) (5.216)
follows from the commutation and anti-commutation relations (5.199) and (5.200).
Thus,
X X XX 1 1
!
σ j aj σk bk = {σj , σk } + [σj , σk ] aj bk
j k j k
2 2
XX
= (σjk + i jkl σl ) aj bk
j k
103
5.12 Pauli two-component formalism 5 ANGULAR MOMENTUM
by analogy with Eq. (5.148), where n is a unit vector pointing along the axis of
rotation, and ∆ϕ is the angle of rotation. Here, n can be regarded as a trivial
position operator. The rotation operator is represented
exp (−i S·n ∆ϕ/h̄) → exp (−i σ·n ∆ϕ/2) (5.219)
in the Pauli scheme. The term on the right-hand side of the above expression is
the exponential of a matrix. This can easily be evaluated using the Taylor series
for an exponential, plus the rules
(σ·n)n = 1 for n even, (5.220)
(σ·n)n = (σ·n) for n odd. (5.221)
These rules follow trivially from the identity (5.216). Thus, we can write
!2 !4
(σ·n)2 ∆ϕ (σ·n)4 ∆ϕ
104
5.13 Spin greater than one-half systems 5 ANGULAR MOMENTUM
where the Rkl are the elements of a conventional rotation matrix. This is easily
demonstrated, since
i σ3 ∆ϕ −i σ3 ∆ϕ
! !
exp σ1 exp = σ1 cos ∆ϕ − σ2 sin ∆ϕ (5.226)
2 2
plus all cyclic permutations. The above expression is the 2 × 2 matrix analogue
of (see Sect. 5.9)
i Sz ∆ϕ −i Sz ∆ϕ
! !
exp Sx exp = Sx cos ∆ϕ − Sy sin ∆ϕ. (5.227)
h̄ h̄
The previous two formulae can both be validated using the Baker-Hausdorff
lemma, (5.156), which holds for Hermitian matrices, in addition to Hermitian
operators.
In the absence of spin, the Hamiltonian can be written as some function of the
position and momentum operators. Using the Schrödinger representation, in
which p → −i h̄ ∇, the energy eigenvalue problem,
H |Ei = E |Ei, (5.228)
can be transformed into a partial differential equation for the wave-function
ψ(r 0 ) ≡ hr 0 |Ei. This function specifies the probability density for observing the
particle at a given position, r 0 . In general, we find
H ψ = E ψ, (5.229)
where H is now a partial differential operator. The boundary conditions (for a
bound state) are obtained from the normalization constraint
Z
|ψ|2 dV = 1. (5.230)
This is all very familiar. However, we now know how to generalize this scheme
to deal with a spin one-half particle. Instead of representing the state of the par-
ticle by a single wave-function, we use two wave-functions. The first, ψ+ (r 0 ),
105
5.13 Spin greater than one-half systems 5 ANGULAR MOMENTUM
specifies the probability density of observing the particle at position r 0 with spin
angular momentum +h̄/2 in the z-direction. The second, ψ− (r 0 ), specifies the
probability density of observing the particle at position r 0 with spin angular mo-
mentum −h̄/2 in the z-direction. In the Pauli scheme, these wave-functions are
combined into a spinor, χ, which is simply the row vector of ψ+ and ψ− . In gen-
eral, the Hamiltonian is a function of the position, momentum, and spin opera-
tors. Adopting the Schrödinger representation, and the Pauli scheme, the energy
eigenvalue problem reduces to
H χ = E χ, (5.231)
Suppose that the Hamiltonian has no dependence on the spin operators. In this
case, the Hamiltonian is represented as diagonal 2 × 2 matrix partial differential
operator in the Schrödinger/Pauli scheme [see Eq. (5.214)]. In other words, the
partial differential equation for ψ+ decouples from that for ψ− . In fact, both
equations have the same form, so there is only really one differential equation.
In this situation, the most general solution to Eq. (5.231) can be written
c
χ= ψ(r 0 ) + . (5.232)
c−
Here, ψ(r 0 ) is determined by the solution of the differential equation, and the
c± are arbitrary complex numbers. The physical significance of the above ex-
pression is clear. The Hamiltonian determines the relative probabilities of finding
the particle at various different positions, but the direction of its spin angular
momentum remains undetermined.
Suppose that the Hamiltonian depends only on the spin operators. In this
case, the Hamiltonian is represented as a 2 × 2 matrix of complex numbers in
the Schrödinger/Pauli scheme [see Eq. (5.197)], and the spinor eigenvalue equa-
tion (5.231) reduces to a straight-forward matrix eigenvalue problem. The most
106
5.13 Spin greater than one-half systems 5 ANGULAR MOMENTUM
Here, the ratio c+ /c− is determined by the matrix eigenvalue problem, and the
wave-function ψ(r 0 ) is arbitrary. Clearly, the Hamiltonian determines the direc-
tion of the particle’s spin angular momentum, but leaves its position undeter-
mined.
where ||Aii denotes a state ket in the product space of the position and spin op-
erators. The state of the particle can be represented more succinctly by a spinor,
χ, which is simply the 2 s + 1 component row vector of the ψsz (r 0 ). Thus, a spin
one-half particle is represented by a two-component spinor, a spin one particle
by a three-component spinor, a spin three-halves particle by a four-component
spinor, and so on.
107
5.13 Spin greater than one-half systems 5 ANGULAR MOMENTUM
Sk → s h̄ σk , (5.236)
Hence,
hs, j|Sz |s, li j
(σz )jl = = δjl , (5.239)
s h̄ s
where use has been made of the orthonormality property of the |s, ji. Thus, σ z
is the suitably normalized diagonal matrix of the eigenvalues of Sz . The matrix
elements of σx and σy are most easily obtained by considering the shift operators,
S± = S x ± i S y . (5.240)
q
s (s + 1) − j (j + 1) δj,l−1
+ , (5.243)
2s
q
s (s + 1) − j (j − 1) δj,l+1
(σy )jl =
2is
q
s (s + 1) − j (j + 1) δj,l−1
− . (5.244)
2is
According to Eqs. (5.239) and (5.243)–(5.244), the Pauli matrices for a spin one-
half (s = 1/2) particle are
0 1
σx = , (5.245)
1 0
0 −i
σy = , (5.246)
i 0
1 0
σz = , (5.247)
0 −1
as we have seen previously. For a spin one (s = 1) particle, we find that
0 1 0
1
σx = √
,
1 0 1 (5.248)
2 0 1 0
0 −i 0
1
σy = √
i 0 −i
, (5.249)
2 0 i 0
1 0 0
σz =
0 0 0 .
(5.250)
0 0 −1
In fact, we can now construct the Pauli matrices for a spin anything particle.
This means that we can convert the general energy eigenvalue problem for a
spin-s particle, where the Hamiltonian is some function of position and spin op-
erators, into 2 s + 1 coupled partial differential equations involving the 2 s + 1
wave-functions ψsz (r 0 ). Unfortunately, such a system of equations is generally
too complicated to solve exactly.
109
5.14 Addition of angular momentum 5 ANGULAR MOMENTUM
In order to answer this question, we are going to have to learn how to add
angular momentum operators. Let us consider the most general case. Suppose
that we have two sets of angular momentum operators, J1 and J2 . By definition,
these operators are Hermitian, and obey the fundamental commutation relations
J1 × J1 = i h̄ J1 , (5.251)
J2 × J2 = i h̄ J2 . (5.252)
J = J 1 + J2 . (5.254)
110
5.14 Addition of angular momentum 5 ANGULAR MOMENTUM
Now
J 2 = J12 + J22 + 2 J1 ·J2 . (5.256)
We know that
and also that all of the J1i operators commute with the J2i operators. It follows
from Eq. (5.256) that
[J 2 , J12 ] = [J 2 , J22 ] = 0. (5.259)
This implies that the quantum numbers j1 , j2 , and j can all be measured simulta-
neously. In other words, we can know the magnitude of the total angular momen-
tum together with the magnitudes of the component angular momenta. However,
it is clear from Eq. (5.256) that
This suggests that it is not possible to measure the quantum numbers m1 and
m2 simultaneously with the quantum number j. Thus, we cannot determine the
projections of the individual angular momenta along the z-axis at the same time
as the magnitude of the total angular momentum.
It is clear, from the preceding discussion, that we can form two alternate
groups of mutually commuting operators. The first group is J12 , J22 , J1z , and J2z .
The second group is J12 , J22 , J 2 , and Jz . These two groups of operators are in-
compatible with one another. We can define simultaneous eigenkets of each
operator group. The simultaneous eigenkets of J12 , J22 , J1z , and J2z are denoted
111
5.14 Addition of angular momentum 5 ANGULAR MOMENTUM
|j1 , j2 ; m1 , m2 i, where
The simultaneous eigenkets of J12 , J22 , J 2 and Jz are denoted |j1 , j2 ; j, mi, where
Each set of eigenkets are complete, mutually orthogonal (for eigenkets corre-
sponding to different sets of eigenvalues), and have unit norms. Since the op-
erators J12 and J22 are common to both operator groups, we can assume that the
quantum numbers j1 and j2 are known. In other words, we can always determine
the magnitudes of the individual angular momenta. In addition, we can either
know the quantum numbers m1 and m2 , or the quantum numbers j and m, but
we cannot know both pairs of quantum numbers at the same time. We can write
a conventional completeness relation for both sets of eigenkets:
XX
|j1 , j2 ; m1 , m2 ihj1 , j2 ; m1 , m2 | = 1, (5.270)
m1 m2
XX
|j1 , j2 ; j, mihj1 , j2 ; j, m| = 1, (5.271)
j m
where the right-hand sides denote the identity operator in the ket space corre-
sponding to states of given j1 and j2 . The summation is over all allowed values of
m1 , m2 , j, and m.
The operator group J12 , J22 , J 2 , and Jz is incompatible with the group J12 , J22 ,
J1z , and J2z . This means that if the system is in a simultaneous eigenstate of the
former group then, in general, it is not in an eigenstate of the latter. In other
112
5.14 Addition of angular momentum 5 ANGULAR MOMENTUM
words, if the quantum numbers j1 , j2 , j, and m are known with certainty, then a
measurement of the quantum numbers m1 and m2 will give a range of possible
values. We can use the completeness relation (5.270) to write
XX
|j1 , j2 ; j, mi = hj1 , j2 ; m1 , m2 |j1 , j2 ; j, mi|j1 , j2 ; m1 , m2 i. (5.272)
m1 m2
Thus, we can write the eigenkets of the first group of operators as a weighted
sum of the eigenkets of the second set. The weights, hj1 , j2 ; m1 , m2 |j1 , j2 ; j, mi,
are called the Clebsch-Gordon coefficients. If the system is in a state where a
measurement of J12 , J22 , J 2 , and Jz is bound to give the results j1 (j1 + 1) h̄2 , j2 (j2 +
1) h̄2 , j (j + 1) h̄2 , and jz h̄, respectively, then a measurement of J1z and J2z will
give the results m1 h̄ and m2 h̄ with probability |hj1 , j2 ; m1 , m2 |j1 , j2 ; j, mi|2 .
m = m 1 + m2 . (5.273)
which proves the assertion. Thus, the z-components of different angular mo-
menta add algebraically. So, an electron in an l = 1 state, with orbital angular
momentum h̄, and spin angular momentum h̄/2, projected along the z-axis, con-
stitutes a state whose total angular momentum projected along the z-axis is 3h̄/2.
What is uncertain is the magnitude of the total angular momentum.
|j1 − j2 | ≤ j ≤ j1 + j2 . (5.276)
We can assume, without loss of generality, that j1 ≥ j2 . We know, from Eq. (5.273),
that for given j1 and j2 the largest possible value of m is j1 + j2 (since j1 is the
113
5.14 Addition of angular momentum 5 ANGULAR MOMENTUM
largest possible value of m1 , etc.). This implies that the largest possible value of j
is j1 + j2 (since, by definition, the largest value of m is equal to j). Now, there are
(2 j1 + 1) allowable values of m1 and (2 j2 + 1) allowable values of m2 . Thus, there
are (2 j1 + 1) (2 j2 + 1) independent eigenkets, |j1 , j2 ; m1 , m2 i, needed to span the
ket space corresponding to fixed j1 and j2 . Since the eigenkets |j1 , j2 ; j, mi span
the same space, they must also form a set of (2 j1 + 1) (2 j2 + 1) independent kets.
In other words, there can only be (2 j1 + 1) (2 j2 + 1) distinct allowable values of
the quantum numbers j and m. For each allowed value of j, there are 2 j + 1
allowed values of m. We have already seen that the maximum allowed value of
j is j1 + j2 . It is easily seen that if the minimum allowed value of j is j1 − j2 then
the total number of allowed values of j and m is (2 j1 + 1) (2 j2 + 1): i.e.,
1 +j2
jX
(2 j + 1) ≡ (2 j1 + 1) (2 j2 + 1). (5.277)
j=j1 −j2
Third, the sum of the modulus squared of all of the Clebsch-Gordon coeffi-
cients is unity: i.e.,
XX
|hj1 , j2 ; m1 , m2 |j1 , j2 ; j, mi|2 = 1. (5.278)
m1 m2
114
5.14 Addition of angular momentum 5 ANGULAR MOMENTUM
Making use of the well-known properties of the shift operators, which are speci-
fied by Eqs. (5.55)–(5.56), we obtain
q
j (j + 1) − m (m ± 1) |j1 , j2 ; j, m ± 1i =
X X q
j1 (j1 + 1) − m10 (m10 ± 1) |j1 , j2 ; m10 ± 1, m20 i
m10 m20
q
+ j2 (j2 + 1) − m20 (m20 ± 1) |j1 , j2 ; m10 , m20 ± 1i
×hj1 , j2 ; m10 , m20 |j1 , j2 ; j, mi. (5.281)
Taking the inner product with hj1 , j2 ; m1 , m2 |, and making use of the orthonormal-
ity property of the basis eigenkets, we obtain the desired recursion relations:
q
j (j + 1) − m (m ± 1) hj1 , j2 ; m1 , m2 |j1 , j2 ; j, m ± 1i =
q
j1 (j1 + 1) − m1 (m1 ∓ 1) hj1 , j2 ; m1 ∓ 1, m2 |j1 , j2 ; j, mi
q
+ j2 (j2 + 1) − m2 (m2 ∓ 1) hj1 , j2 ; m1 , m2 ∓ 1|j1 , j2 ; j, mi. (5.282)
It is clear, from the absence of complex coupling coefficients in the above re-
lations, that we can always choose the Clebsch-Gordon coefficients to be real
numbers. This is a convenient choice, since it ensures that the inverse Clebsch-
Gordon coefficients, hj1 , j2 ; j, m|j1 , j2 ; m1 , m2 i, are identical to the Clebsch-Gordon
coefficients. In other words,
hj1 , j2 ; j, m|j1 , j2 ; m1 , m2 i = hj1 , j2 ; m1 , m2 |j1 , j2 ; j, mi. (5.283)
The inverse Clebsch-Gordon coefficients are the weights in the expansion of the
|j1 , j2 ; m1 , m2 i in terms of the |j1 , j2 ; j, mi:
XX
|j1 , j2 ; m1 , m2 i = hj1 , j2 ; j, m|j1 , j2 ; m1 , m2 i|j1 , j2 ; j, mi. (5.284)
j m
It turns out that the recursion relations (5.282), together with the normal-
ization condition (5.278), are sufficient to completely determine the Clebsch-
Gordon coefficients to within an arbitrary sign (multiplied into all of the coeffi-
cients). This sign is fixed by convention. The easiest way of demonstrating this
assertion is by considering some specific examples.
115
5.14 Addition of angular momentum 5 ANGULAR MOMENTUM
Let us add the angular momentum of two spin one-half systems: e.g., two
electrons at rest. So, j1 = j2 = 1/2. We know, from general principles, that
|m1 | ≤ 1/2 and |m2 | ≤ 1/2. We also know, from Eq. (5.276), that 0 ≤ j ≤ 1,
where the allowed values of j differ by integer amounts. It follows that either
j = 0 or j = 1. Thus, two spin one-half systems can be combined to form either a
spin zero system or a spin one system. It is helpful to arrange all of the possibly
non-zero Clebsch-Gordon coefficients in a table:
m1 m2
1/2 1/2 ? ? ? ?
1/2 -1/2 ? ? ? ?
-1/2 1/2 ? ? ? ?
-1/2 -1/2 ? ? ? ?
j1 =1/2 j 1 1 1 0
j2 =1/2 m 1 0 -1 0
The box in this table corresponding to m1 = 1/2, m2 = 1/2, j = 1, m = 1
gives the Clebsch-Gordon coefficient h1/2, 1/2; 1/2, 1/2|1/2, 1/2; 1, 1i, or the in-
verse Clebsch-Gordon coefficient h1/2, 1/2; 1, 1|1/2, 1/2; 1/2, 1/2i. All the boxes
contain question marks because we do not know any Clebsch-Gordon coefficients
at the moment.
The normalization condition (5.278) implies that the sum of the squares of all
116
5.14 Addition of angular momentum 5 ANGULAR MOMENTUM
the rows and columns of the above table must be unity. There are two rows and
two columns which only contain a single non-zero entry. We conclude that these
entries must be ±1, but we have no way of determining the signs at present.
Thus,
m1 m2
1/2 1/2 ±1 0 0 0
1/2 -1/2 0 ? 0 ?
-1/2 1/2 0 ? 0 ?
-1/2 -1/2 0 0 ±1 0
j1 =1/2 j 1 1 1 0
j2 =1/2 m 1 0 -1 0
and
√ √
h1/2, −1/2|1, 0i + h−1/2, 1/2|1, 0i = 2 h−1/2, −1/2|1, −1i = ± 2. (5.286)
Here, the j1 and j2 labels have been suppressed for ease of notation. We also
know that
h1/2, −1/2|1, 0i2 + h−1/2, 1/2|1, 0i2 = 1, (5.287)
from the normalization condition. The only real solutions to the above set of
equations are
√ √
2 h1/2, −1/2|1, 0i = 2 h−1/2, 1/2|1, 0i
= h1/2, 1/2|1, 1i = h1/2, 1/2|1, −1i = ±1. (5.288)
117
5.14 Addition of angular momentum 5 ANGULAR MOMENTUM
We could fill in the remaining unknown entries of our table by using the re-
cursion relation again. However, an easier method is to observe that the rows
and columns of the table must all be mutually orthogonal. That is, the dot prod-
uct of a row with any other row must be zero. Likewise, for the dot product of
a column with any other column. This follows because the entries in the table
give the expansion coefficients of one of our alternative sets of eigenkets in terms
of the other set, and each set of eigenkets contains mutually orthogonal vectors
with unit norms. The normalization condition tells us that the dot product of a
row or column with itself must be unity. The only way that the dot product of the
fourth column with the second column can be zero is if the unknown entries are
equal and opposite. The requirement that the dot product of the fourth column
with itself
√ is unity tells us that the magnitudes of the unknown entries have to
be 1/ 2. The unknown entries are undetermined to an arbitrary sign multiplied
into them both. Thus, the final form of our table (with the conventional choice
of arbitrary signs) is
m1 m2
1/2 1/2 1 0√ 0 0√
1/2 -1/2 0 1/√2 0 1/ √2
-1/2 1/2 0 1/ 2 0 -1/ 2
-1/2 -1/2 0 0 1 0
j1 =1/2 j 1 1 1 0
j2 =1/2 m 1 0 -1 0
The table can be read in one of two ways. The columns give the expansions of
118
5.14 Addition of angular momentum 5 ANGULAR MOMENTUM
Note that our table is really a combination of two sub-tables, one involving
j = 0 states, and one involving j = 1 states. The Clebsch-Gordon coefficients cor-
responding to two different choices of j are completely independent: i.e., there is
no recursion relation linking Clebsch-Gordon coefficients corresponding to differ-
ent values of j. Thus, for every choice of j1 , j2 , and j we can construct a table of
Clebsch-Gordon coefficients corresponding to the different allowed values of m 1 ,
m2 , and m (subject to the constraint that m1 + m2 = m). A complete knowledge
of angular momentum addition is equivalent to a knowing all possible tables of
Clebsch-Gordon coefficients. These tables are listed (for moderate values of j 1 , j2
and j) in many standard reference books.
119
6 APPROXIMATION METHODS
6 Approximation methods
6.1 Introduction
Consider the simplest non-trivial system, in which there are only two indepen-
dent eigenkets of the unperturbed Hamiltonian. These are denoted
120
6.2 The two-state system 6 APPROXIMATION METHODS
It is assumed that these states, and their associated eigenvalues, are known. Since
H0 is, by definition, an Hermitian operator, its two eigenkets are orthonormal and
form a complete set. The lengths of these eigenkets are both normalized to unity.
Let us now try to solve the modified energy eigenvalue problem
In fact, we can solve this problem exactly. Since, the eigenkets of H 0 form a
complete set, we can write
Right-multiplication of Eq. (6.4) by h1| and h2| yields two coupled equations,
which can be written in matrix form:
E1 − E + e11 e12 h1|Ei 0
∗
= . (6.6)
e12 E2 − E + e22 h2|Ei 0
Here,
the solution of Eq. (6.6) (obtained by setting the determinant of the matrix equal
to zero) is q
(E1 + E2 ) ± (E1 − E2 )2 + 4 |e12 |2
E= . (6.11)
2
Let us expand in the supposedly small parameter
|e12 |
= . (6.12)
|E1 − E2 |
121
6.3 Non-degenerate perturbation theory 6 APPROXIMATION METHODS
We obtain
1 1
E ' (E1 + E2 ) ± (E1 − E2 )(1 + 2 2 + · · ·). (6.13)
2 2
The above expression yields the modifications to the energy eigenvalues due to
the perturbing Hamiltonian:
|e12 |2
E10 = E1 + + ···, (6.14)
E1 − E 2
|e12 |2
E20 = E2 − + ···. (6.15)
E1 − E 2
Note that H1 causes the upper eigenvalue to rise, and the lower eigenvalue to
fall. It is easily demonstrated that the modified eigenkets take the form
e12∗
|1i 0 = |1i + |2i + · · · , (6.16)
E1 − E 2
e12
|2i 0 = |2i − |1i + · · · . (6.17)
E1 − E 2
Thus, the modified energy eigenstates consist of one of the unperturbed eigen-
states with a slight admixture of the other. Note that the series expansion in
Eq. (6.13) only converges if 2 || < 1. This suggests that the condition for the
validity of the perturbation expansion is
|E1 − E2 |
|e12 | < . (6.18)
2
In other words, when we say that H1 needs to be small compared to H0 , what we
really mean is that the above inequality needs to be satisfied.
Let us now generalize our perturbation analysis to deal with systems possessing
more than two energy eigenstates. The energy eigenstates of the unperturbed
Hamiltonian, H0 , are denoted
122
6.3 Non-degenerate perturbation theory 6 APPROXIMATION METHODS
where n runs from 1 to N. The eigenkets |ni are orthonormal, form a complete
set, and have their lengths normalized to unity. Let us now try to solve the energy
eigenvalue problem for the perturbed Hamiltonian:
(H0 + H1 )|Ei = E |Ei. (6.20)
We can express |Ei as a linear superposition of the unperturbed energy eigenkets,
X
|Ei = hk|Ei|ki, (6.21)
k
where
emk = hm|H1 |ki. (6.23)
123
6.4 The quadratic Stark effect 6 APPROXIMATION METHODS
giving
emn
hm|Ei ' − . (6.30)
Em − E n
Substituting the above expression into Eq. (6.22), evaluated for m = n, and
neglecting O(3 ) terms, we obtain
X |enk |2
(En + enn − E) − = 0. (6.31)
k6=n
E k − E n
and a eigenket X
0 ekn
|ni = |ni + |ki + O(2 ). (6.33)
E − Ek
k6=n n
Note that
e∗nm emn
hm|ni 0 = δmn + + + O(2 ) = δmn + O(2 ). (6.34)
Em − E n En − E m
Thus, the modified eigenkets remain orthonormal and properly normalized to
O(2 ).
Suppose that a one-electron atom [i.e., either a hydrogen atom, or an alkali metal
atom (which possesses one valance electron orbiting outside a closed, spher-
ically symmetric shell)] is subjected to a uniform electric field in the positive
z-direction. The Hamiltonian of the system can be split into two parts. The un-
perturbed Hamiltonian,
p2
H0 = + V(r), (6.35)
2 me
and the perturbing Hamiltonian,
H1 = e |E| z. (6.36)
124
6.4 The quadratic Stark effect 6 APPROXIMATION METHODS
Now, since
Lz = x p y − y p x , (6.38)
it follows that
[Lz , z] = 0. (6.39)
Thus,
hn, l, m|[Lz , z]|n 0 , l 0 , m 0 i = 0, (6.40)
giving
(m − m 0 )hn, l, m|z|n 0 , l 0 , m 0 i = 0, (6.41)
since |n, l, mi is, by definition, an eigenstate of Lz with eigenvalue m h̄. It is
clear, from the above relation, that the matrix element hn, l, m|z|n 0 , l 0 , m 0 i is zero
unless m 0 = m. This is termed the selection rule for the quantum number m.
125
6.4 The quadratic Stark effect 6 APPROXIMATION METHODS
Thus,
This reduces to
+2 (L2 z − z L2 ) .
i
(6.46)
Lx x + Ly y + Lz z = 0. (6.47)
Hence, we obtain
[L2 , [L2 , z]] = 2 h̄2 (L2 z + z L2 ). (6.48)
Finally, the above expression expands to give
126
6.4 The quadratic Stark effect 6 APPROXIMATION METHODS
Note that all of the terms in Eq. (6.37) which vary linearly with the electric field-
strength vanish by symmetry, according to the selection rules. Only those terms
which vary quadratically with the field-strength survive. The polarizability of an
atom is defined in terms of the energy-shift of the atomic state as follows:
1
∆E = − α |E|2 . (6.55)
2
Consider the ground state of a hydrogen atom. (Recall, that we cannot address
the n > 1 excited states because they are degenerate, and our theory cannot
handle this at present). The polarizability of this state is given by
X |h1, 0, 0|z|n, 1, 0i|2
2
α = 2e . (6.56)
n>1
En00 − E100
127
6.4 The quadratic Stark effect 6 APPROXIMATION METHODS
Here, we have made use of the fact that En10 = En00 for a hydrogen atom.
128
6.5 Degenerate perturbation theory 6 APPROXIMATION METHODS
Let us now consider systems in which the eigenstates of the unperturbed Hamil-
tonian, H0 , possess degenerate energy levels. It is always possible to represent
degenerate energy eigenstates as the simultaneous eigenstates of the Hamilto-
nian and some other Hermitian operator (or group of operators). Let us denote
this operator (or group of operators) L. We can write
H0 |n, li = En |n, li, (6.65)
and
L |n, li = Lnl |n, li, (6.66)
where [H0 , L] = 0. Here, the En and the Lnl are real numbers which depend on
the quantum numbers n, and n and l, respectively. It is always possible to find a
sufficient number of operators which commute with the Hamiltonian in order to
ensure that the Lnl are all different. In other words, we can choose L such that
the quantum numbers n and l uniquely specify each eigenstate. Suppose that for
each value of n there are Nn different values of l: i.e., the nth energy eigenstate
is Nn -fold degenerate.
0
X |en 0 l 0 nl |2
Enl = En + enlnl + + O(3 ), (6.67)
E − En 0
n 0 ,l 0 6=n,l n
and X en 0 l 0 nl
|n, li 0 = |n, li + |n 0 , l 0 i + O(2 ), (6.68)
E − En 0
n 0 ,l 0 6=n,l n
where
en 0 l 0 nl = hn 0 , l 0 |H1 |n, li. (6.69)
129
6.5 Degenerate perturbation theory 6 APPROXIMATION METHODS
It is fairly obvious that the summations in Eqs. (6.67) and (6.68) are not well-
behaved if the nth energy level is degenerate. The problem terms are those
involving unperturbed eigenstates labeled by the same value of n, but different
values of l: i.e., those states whose unperturbed energies are En . These terms
give rise to singular factors 1/(En − En ) in the summations. Note, however, that
this problem would not exist if the matrix elements, enl 0 nl , of the perturbing
Hamiltonian between distinct, degenerate, unperturbed energy eigenstates cor-
responding to the eigenvalue En were zero. In other words, if
then all of the singular terms in Eqs. (6.67) and (6.68) would vanish.
In general, Eq. (6.70) is not satisfied. Fortunately, we can always redefine the
unperturbed energy eigenstates belonging to the eigenvalue En in such a manner
that Eq. (6.70) is satisfied. Let us define Nn new states which are linear combina-
tions of the Nn original degenerate eigenstates corresponding to the eigenvalue
En :
XNn
(1)
|n, l i = hn, k|n, l(1) i|n, ki. (6.71)
k=1
Note that these new states are also degenerate energy eigenstates of the unper-
turbed Hamiltonian corresponding to the eigenvalue En . The |n, l(1) i are chosen
in such a manner that they are eigenstates of the perturbing Hamiltonian, H 1 .
Thus,
H1 |n, l(1) i = λnl |n, l(1) i. (6.72)
The |n, l(1) i are also chosen so that they are orthonormal, and have unit lengths.
It follows that
hn, l 0(1) |H1 |n, l(1) i = λnl δll 0 . (6.73)
Thus, if we use the new eigenstates, instead of the old ones, then we can employ
Eqs. (6.67) and (6.68) directly, since all of the singular terms vanish. The only
remaining difficulty is to determine the new eigenstates in terms of the original
ones.
130
6.5 Degenerate perturbation theory 6 APPROXIMATION METHODS
Now
Nn
X
|n, lihn, l| = 1, (6.74)
l=1
where 1 denotes the identity operator in the sub-space of all unperturbed energy
eigenkets corresponding to the eigenvalue En . Using this completeness relation,
the operator eigenvalue equation (6.72) can be transformed into a straightfor-
ward matrix eigenvalue equation:
Nn
X
hn, l 0 |H1 |n, l 00 ihn, l 00 |n, l(1) i = λnl hn, l 0 |n, l(1) i. (6.75)
l 00 =1
U x = λ x, (6.76)
Provided that the determinant of U is non-zero, Eq. (6.76) can always be solved
to give Nn eigenvalues λnl (for l = 1 to Nn ), with Nn corresponding eigenvectors
xnl . The eigenvectors specify the weights of the new eigenstates in terms of the
original eigenstates: i.e.,
(xnl )k = hn, k|n, l(1) i, (6.78)
for k = 1 to Nn . In our new scheme, Eqs. (6.67) and (6.68) yield
X |en 0 l 0 nl |2
0
Enl = En + λnl + + O(3 ), (6.79)
E − En
n 0 6=n,l 0 n
0
and X en 0 l 0 nl
|n, l(1) i 0 = |n, l(1) i + |n 0 , l 0 i + O(2 ). (6.80)
E − En 0
n 0 6=n,l 0 n
There are no singular terms in these expressions, since the summations are over
n 0 6= n: i.e., they specifically exclude the problematic, degenerate, unperturbed
energy eigenstates corresponding to the eigenvalue En . Note that the first-order
energy shifts are equivalent to the eigenvalues of the matrix equation (6.76).
131
6.6 The linear Stark effect 6 APPROXIMATION METHODS
Let us examine the effect of an electric field on the excited energy levels of a
hydrogen atom. For instance, consider the n = 2 states. There is a single l =
0 state, usually referred to as 2s, and three l = 1 states (with m = −1, 0, 1),
usually referred to as 2p. All of these states possess the same energy, E200 =
−e2 /(32π 0 a0 ). As in Sect. 6.4, the perturbing Hamiltonian is
H1 = e |E| z. (6.81)
In order to apply perturbation theory, we have to solve the matrix eigenvalue
equation
U x = λ x, (6.82)
where U is the array of the matrix elements of H1 between the degenerate 2s and
2p states. Thus,
0 h2, 0, 0|z|2, 1, 0i 0 0
h2, 1, 0|z|2, 0, 0i 0 0 0
U= e |E| , (6.83)
0 0 0 0
0 0 0 0
where the rows and columns correspond to the |2, 0, 0i, |2, 1, 0i, |2, 1, 1i, and
|2, 1, −1i states, respectively. Here, we have made use of the selection rules,
which tell us that the matrix element of z between two hydrogen atom states is
zero unless the states possess the same m quantum number, and l quantum num-
bers which differ by unity. It is easily demonstrated, from the exact forms of the
2s and 2p wave-functions, that
h2, 0, 0|z|2, 1, 0i = h2, 1, 0|z|2, 0, 0i = 3 a0 . (6.84)
132
6.6 The linear Stark effect 6 APPROXIMATION METHODS
√
1/ √2
−1/ 2
x2 = , (6.86)
0
0
0
0
x3 = , (6.87)
1
0
0
0
x4 = . (6.88)
0
1
It follows from Sect. 6.5 that the simultaneous eigenstates of the unperturbed
Hamiltonian and the perturbing Hamiltonian take the form
|2, 0, 0i + |2, 1, 0i
|1i = √ , (6.89)
2
|2, 0, 0i − |2, 1, 0i
|2i = √ , (6.90)
2
|3i = |2, 1, 1i, (6.91)
|4i = |2, 1, −1i. (6.92)
In the absence of an electric field, all of these states possess the same energy, E 200 .
The first-order energy shifts induced by an electric field are given by
∆E1 = +3 e a0 |E|, (6.93)
∆E2 = −3 e a0 |E|, (6.94)
∆E3 = 0, (6.95)
∆E4 = 0. (6.96)
Thus, the energies of states 1 and 2 are shifted upwards and downwards, respec-
tively, by an amount 3 e a0 |E| in the presence of an electric field. States 1 and
2 are orthogonal linear combinations of the original 2s and 2p(m = 0) states.
133
6.7 Fine structure 6 APPROXIMATION METHODS
Note that the energy shifts are linear in the electric field-strength, so this is a
much larger effect that the quadratic effect described in Sect. 6.4. The energies
of states 3 and 4 (which are equivalent to the original 2p(m = 1) and 2p(m = −1)
states, respectively) are not affected to first-order. Of course, to second-order the
energies of these states are shifted by an amount which depends on the square of
the electric field-strength.
Note that the linear Stark effect depends crucially on the degeneracy of the 2s
and 2p states. This degeneracy is a special property of a pure Coulomb potential,
and, therefore, only applies to a hydrogen atom. Thus, alkali metal atoms do not
exhibit the linear Stark effect.
Let us now consider the energy levels of hydrogen-like atoms (i.e., alkali metal
atoms) in more detail. The outermost electron moves in a spherically symmetric
potential V(r) due to the nuclear charge and the charges of the other electrons
(which occupy spherically symmetric closed shells). The shielding effect of the
inner electrons causes V(r) to depart from the pure Coulomb form. This splits the
degeneracy of states characterized by the same value of n, but different values of
l. In fact, higher l states have higher energies.
It is fairly obvious that the first group of operators (H0 , L2 , S2 , Lz and Sz ) does not
commute with HLS , whereas the second group (H0 , L2 , S 2 , J 2 , and Jz ) does. In
fact, L·S is just a combination of operators appearing in the second group. Thus,
it is advantageous to work in terms of the eigenstates of the second group of
operators, rather than those of the first group.
Here, the kets on the left-hand side are |j, mj i kets, whereas those on the right-
hand side are |ml , ms i kets (the j1 , j2 labels have been dropped, for the sake of
clarity). We have made use of the fact that the Clebsch-Gordon coefficients are
automatically zero unless mj = ml + ms . We have also made use of the fact that
both the |j, mj i and |ml , ms i kets are orthonormal, and have unit lengths. We
now need to determine
which reduces to
v
+ m + 1/2
u
ul
hm − 1/2, 1/2|l + 1/2, mi = hm + 1/2, 1/2|l + 1/2, m + 1i. (6.107)
u
t
l + m + 3/2
136
6.7 Fine structure 6 APPROXIMATION METHODS
We can use this formula to successively increase the value of ml . For instance,
v v
+m+ + m + 3/2
u u
ul 1/2 u
ul
hm − 1/2, 1/2|l + 1/2, mi =
u
t t
l + m + 3/2 l + m + 5/2
×hm + 3/2, 1/2|l + 1/2, m + 2i. (6.108)
This procedure can be continued until ml attains its maximum possible value, l.
Thus,
v
ul + m + 1/2
u
hm − 1/2, 1/2|l + 1/2, mi = t
hl, 1/2|l + 1/2, l + 1/2i. (6.109)
2l + 1
Consider the situation in which ml and m both take their maximum values,
l and 1/2, respectively. The corresponding value of mj is l + 1/2. This value is
possible when j = l + 1/2, but not when j = l − 1/2. Thus, the |ml , ms i ket |l, 1/2i
must be equal to the |j, mj i ket |l + 1/2, l + 1/2i, up to an arbitrary phase-factor.
By convention, this factor is taken to be unity, giving
137
6.7 Fine structure 6 APPROXIMATION METHODS
v
ul − m + 1/2
u
|l − 1/2, mi = − t
|m − 1/2, 1/2i
2l + 1
v
+ m + 1/2
ul
u
+ t
|m + 1/2, −1/2i. (6.114)
2l + 1
It is convenient to define so called spin-angular functions using the Pauli two-
component formalism:
v
ul ± m + 1/2 m−1/2
u
j=l±1/2,m
Yl = ± t
Yl (θ, ϕ) χ+
2l + 1
v
ul ∓ m + 1/2 m+1/2
u
+ t
Yl (θ, ϕ) χ−
2l + 1
m−1/2
q
1 ± l ± m + 1/2 Yl (θ, ϕ)
= √ q
m+1/2
.
(6.115)
2l + 1 l ∓ m + 1/2 Yl (θ, ϕ)
These functions are eigenfunctions of the total angular momentum for spin one-
half particles, just as the spherical harmonics are eigenfunctions of the orbital
angular momentum. A general wave-function for an energy eigenstate in a
hydrogen-like atom is written
ψnlm± = Rnl (r) Y j=l±1/2,m . (6.116)
The radial part of the wave-function, Rnl (r), depends on the radial quantum num-
ber n and the angular quantum number l. The wave-function is also labeled by
m, which is the quantum number associated with Jz . For a given choice of l, the
quantum number j (i.e., the quantum number associated with J 2 ) can take the
values l ± 1/2.
The |l ± 1/2, mi kets are eigenstates of L·S, according to Eq. (6.102). Thus,
h̄2
L·S |j = l ± 1/2, mj = mi = [j (j + 1) − l (l + 1) − 3/4] |j, mi, (6.117)
2
giving
l h̄2
L·S |l + 1/2, mi = |l + 1/2, mi, (6.118)
2
(l + 1) h̄2
L·S |l − 1/2, mi = − |l − 1/2, mi. (6.119)
2
138
6.7 Fine structure 6 APPROXIMATION METHODS
It follows that
Z
l+1/2,m † l+1/2,m l h̄2
(Y ) L·S Y dΩ = , (6.120)
2
Z
l−1/2,m † l−1/2,m (l + 1) h̄2
(Y ) L·S Y dΩ = − , (6.121)
2
where the integrals are over all solid angle.
Let us now apply degenerate perturbation theory to evaluate the shift in energy
of a state whose wave-function is ψnlm± due to the spin-orbit Hamiltonian HLS .
To first-order, the energy-shift is given by
Z
∆Enlm± = (ψnlm± )† HLS ψnlm± dV, (6.122)
where the integral is over all space. Equations (6.100) (remember the factor of
two), (6.116), and (6.120)–(6.121) yield
1 1 dV l h̄2
* +
∆Enlm+ = + , (6.123)
2 me2 r dr 2
1 1 dV (l + 1) h̄2
* +
∆Enlm− = − , (6.124)
2 me2 r dr 2
where Z
1 dV 1 dV
+ *
= (Rnl )∗ Rnl r2 dr. (6.125)
r dr r dr
Equations (6.123)–(6.124) are known as Lande’s interval rule.
Let us now apply the above result to the case of a sodium atom. In chemist’s
notation, the ground state is written
The inner ten electrons effectively form a spherically symmetric electron cloud.
We are interested in the excitation of the eleventh electron from 3s to some
higher energy state. The closest (in energy) unoccupied state is 3p. This state
has a higher energy than 3s due to the deviations of the potential from the pure
139
6.8 The Zeeman effect 6 APPROXIMATION METHODS
Coulomb form. In the absence of spin-orbit interaction, there are six degenerate
3p states. The spin-orbit interaction breaks the degeneracy of these states. The
modified states are labeled (3p)1/2 and (3p)3/2 , where the subscript refers to the
value of j. The four (3p)3/2 states lie at a slightly higher energy level than the two
(3p)1/2 states, because the radial integral (6.125) is positive. The splitting of the
(3p) energy levels of the sodium atom can be observed using a spectroscope. The
well-known sodium D line is associated with transitions between the 3p and 3s
states. The fact that there are two slightly different 3p energy levels (note that
spin-orbit coupling does not split the 3s energy levels) means that the sodium D
line actually consists of two very closely spaced spectroscopic lines. It is easily
demonstrated that the ratio of the typical spacing of Balmer lines to the splitting
brought about by spin-orbit interaction is about 1 : α2 , where
e2 1
α= = (6.127)
2 0 h c 137
is the fine structure constant. Note that Eqs. (6.123)–(6.124) are not entirely
correct, since we have neglected an effect (namely, the relativistic mass correction
of the electron) which is the same order of magnitude as spin-orbit coupling.
Suppose that the energy-shifts induced by the magnetic field are much smaller
than those induced by spin-orbit interaction. In this situation, we can treat H B as
140
6.8 The Zeeman effect 6 APPROXIMATION METHODS
Let us apply this theory to the sodium atom. We have already seen that the
non-Coulomb potential splits the degeneracy of the 3s and 3p states, the latter
states acquiring a higher energy. The spin-orbit interaction splits the six 3p states
into two groups, with four j = 3/2 states lying at a slightly higher energy than
two j = 1/2 states. According to Eq. (6.136), a magnetic field splits the (3p)3/2
141
6.8 The Zeeman effect 6 APPROXIMATION METHODS
quadruplet of states, each state acquiring a different energy. In fact, the energy of
each state becomes dependent on the quantum number m, which measures the
projection of the total angular momentum along the z-axis. States with higher
m values have higher energies. A magnetic field also splits the (3p)1/2 doublet
of states. However, it is evident from Eq. (6.136) that these states are split by a
lesser amount than the j = 3/2 states.
Suppose that we increase the strength of the magnetic field, so that the energy-
shift due to the magnetic field becomes comparable to the energy-shift induced
by spin-orbit interaction. Clearly, in this situation, it does not make much sense to
think of HB as a small interaction term operating on the eigenstates of H0 + HLS .
In fact, this intermediate case is very difficult to analyze. Let us consider the
extreme limit in which the energy-shift due to the magnetic field greatly exceeds
that induced by spin-orbit effects. This is called the Paschen-Back limit.
Let us apply this result to a sodium atom. In the absence of a magnetic field,
the six 3p states form two groups of four and two states, depending on the values
142
6.8 The Zeeman effect 6 APPROXIMATION METHODS
Let us apply the above result to a sodium atom. In the presence of an intense
magnetic field, the 3p states are split into five groups with (ml , ms ) quantum
numbers (1, 1/2), (0, 1/2), (1, −1/2), or (−1, 1/2), (0, −1/2), and (−1, −1/2), re-
spectively, in order of decreasing energy. The spin-orbit term increases the en-
ergy of the highest energy state, does not affect the next highest energy state,
decreases, but does not split, the energy of the doublet, does not affect the next
lowest energy state, and increases the energy of the lowest energy state. The net
result is that the five groups of states are no longer equally spaced in energy.
The sort of magnetic field-strength needed to get into the Paschen-Bach limit
is given by
e me
BPB ∼ α2 ' 25 tesla. (6.143)
0 h a 0
143
6.9 Time-dependent perturbation theory 6 APPROXIMATION METHODS
Suppose that the Hamiltonian of the system under consideration can be written
H = H0 + H1 (t), (6.144)
where the cn are complex numbers. Thus, the initial state is some linear su-
perposition of the unperturbed energy eigenstates. In the absence of the time-
dependent perturbation, the time evolution of the system is given by
X
|A, t0 , ti = cn exp([−i En (t − t0 )/h̄] |ni. (6.147)
n
144
6.9 Time-dependent perturbation theory 6 APPROXIMATION METHODS
Clearly, with H1 = 0, the probability of finding the system in state |ni at time t is
exactly the same as the probability of finding the system in this state at the initial
time t0 . However, with H1 6= 0, we expect Pn (t) to vary with time. Thus, we can
write X
|A, t0 , ti = cn (t) exp[−i En (t − t0 )/h̄] |ni, (6.149)
n
2
where Pn (t) = |cn (t)| . Here, we have carefully separated the fast phase oscilla-
tion of the eigenkets, which depends on the unperturbed Hamiltonian, from the
slow variation of the amplitudes cn (t), which depends entirely on the perturba-
tion (i.e., cn is constant if H1 = 0). Note that in Eq. (6.149) the eigenkets |ni are
time-independent (they are actually the eigenkets of H0 evaluated at the time t0 ).
We also have
∂ X dcm
!
i h̄ |A, t0 , ti = i h̄ + cm (t) Em exp[−i Em (t − t0 )/h̄] |mi, (6.152)
∂t m dt
where use has been made of the time-independence of the kets |mi. According
to Eq. (6.150), we can equate the right-hand sides of the previous two equations
to obtain
X dcm X
i h̄ exp[−i Em (t − t0 )/h̄]|mi = cm (t) exp[−i Em (t − t0 )/h̄] H1 |mi.
m dt m
(6.153)
Left-multiplication by hn| yields
dcn X
i h̄ = Hnm (t) exp[i ωnm (t − t0 )] cm (t), (6.154)
dt m
where
Hnm (t) = hn|H1 (t)|mi, (6.155)
145
6.10 The two-state system 6 APPROXIMATION METHODS
and
En − E m
ωnm = . (6.156)
h̄
Here, we have made use of the standard orthonormality result, hn|mi = δ nm .
Suppose that there are N linearly independent eigenkets of the unperturbed
Hamiltonian. According to Eq. (6.154), the time variation of the coefficients
cn , which specify the probability of finding the system in state |ni at time t, is de-
termined by N coupled first-order differential equations. Note that Eq. (6.154) is
exact—we have made no approximations at this stage. Unfortunately, we cannot
generally find exact solutions to this equation, so we have to obtain approximate
solutions via suitable expansions in small quantities. However, for the particu-
larly simple case of a two-state system (i.e., N = 2), it is actually possible to
solve Eq. (6.154) without approximation. This solution is of enormous practical
importance.
Suppose, for the sake of simplicity, that the diagonal matrix elements of the in-
teraction Hamiltonian, H1 , are zero:
146
6.10 The two-state system 6 APPROXIMATION METHODS
i (ω − ω21 )/2
−q exp[ i (ω − ω21 ) t/2]
γ2 /h̄2 + (ω − ω21 )2 /4
r !
× sin γ2 /h̄2 + (ω − ω21 )2 /4 t . (6.165)
Now, the probability of finding the system in state 1 at time t is simply P 1 (t) =
|c1 |2 . Likewise, the probability of finding the system in state 2 at time t is P 2 (t) =
|c2 |2 . It follows that
γ2 /h̄2
P2 (t) = 2 2
γ /h̄ + (ω − ω21 )2 /4
147
6.10 The two-state system 6 APPROXIMATION METHODS
r !
2 2
× sin γ2 /h̄ + (ω − ω21 )2 /4 t , (6.166)
According to the above result, the system starts off at t = 0 in state 1. After a
time interval π h̄/2 γ it is certain to be in state 2. After a further time interval
π h̄/2 γ it is certain to be in state 1, and so on. Thus, the system periodically
flip-flops between states 1 and 2 under the influence of the time-dependent per-
turbation. This implies that the system alternatively absorbs and emits energy
from the source of the perturbation.
The absorption-emission cycle also take place away from the resonance, when
ω 6= ω21 . However, the amplitude of oscillation of the coefficient c2 is reduced.
This means that the maximum value of P2 (t) is no longer unity, nor is the mini-
mum value of P1 (t) zero. In fact, if we plot the maximum value of P2 (t) as a func-
tion of the applied frequency, ω, we obtain a resonance curve whose maximum
(unity) lies at the resonance, and whose full-width half-maximum (in frequency)
is 4 γ/h̄. Thus, if the applied frequency differs from the resonant frequency by
substantially more than 2 γ/h̄ then the probability of the system jumping from
state 1 to state 2 is very small. In other words, the time-dependent perturbation
is only effective at causing transitions between states 1 and 2 if its frequency of
oscillation lies in the approximate range ω21 ± 2 γ/h̄. Clearly, the weaker the
perturbation (i.e., the smaller γ becomes), the narrower the resonance.
148
6.11 Spin magnetic resonance 6 APPROXIMATION METHODS
The eigenstates of the unperturbed Hamiltonian are the ‘spin up’ and ‘spin
down’ states, denoted |+i and |−i, respectively. Thus,
e h̄ B0
H0 |±i = ± |±i. (6.174)
2 me
The time-dependent Hamiltonian can be written
e B1 h
exp( i ωt) S− + exp(−i ωt) S+ ,
i
H1 = (6.175)
2 me
where S+ and S− are the conventional raising and lowering operators for the spin
angular momentum. It follows that
h+|H1 |+i = h−|H1 |−i = 0, (6.176)
and
e h̄ B1
h−|H1 |+i = h+|H1 |−i∗ = exp( i ωt). (6.177)
2 me
149
6.12 The Dyson series 6 APPROXIMATION METHODS
It can be seen that this system is exactly the same as the two-state system
discussed in the previous section, provided that we make the identifications
Let us now try to find approximate solutions of Eq. (6.154) for a general system.
It is convenient to work in terms of the time evolution operator, U(t 0 , t), which is
defined
|A, t0 , ti = U(t0 , t) |Ai. (6.182)
150
6.12 The Dyson series 6 APPROXIMATION METHODS
Here, |A, t0 , ti is the state ket of the system at time t, given that the state ket at
the initial time t0 is |Ai. It is easily seen that the time evolution operator satisfies
the differential equation
∂U(t0 , t)
i h̄ = (H0 + H1 ) U(t0 , t), (6.183)
∂t
subject to the boundary condition
U(t0 , t0 ) = 1. (6.184)
In the absence of the external perturbation, the time evolution operator re-
duces to
U(t0 , t) = exp[−i H0 (t − t0 )/h̄]. (6.185)
Let us switch on the perturbation and look for a solution of the form
U(t0 , t) = exp[−i H0 (t − t0 )/h̄] UI (t0 , t). (6.186)
It is readily demonstrated that UI satisfies the differential equation
∂UI (t0 , t)
i h̄ = HI (t0 , t) UI (t0 , t), (6.187)
∂t
where
HI (t0 , t) = exp[+i H0 (t − t0 )/h̄] H1 exp[−i H0 (t − t0 )/h̄], (6.188)
subject to the boundary condition
UI (t0 , t0 ) = 1. (6.189)
Note that UI specifies that component of the time evolution operator which is due
to the time-dependent perturbation. Thus, we would expect UI to contain all of
the information regarding transitions between different eigenstates of H 0 caused
by the perturbation.
Suppose that the system starts off at time t0 in the eigenstate |ii of the un-
perturbed Hamiltonian. The subsequent evolution of the state ket is given by
Eq. (6.149), X
|i, t0 , ti = cm (t) exp[−i Em (t − t0 )/h̄] |mi. (6.190)
m
151
6.12 The Dyson series 6 APPROXIMATION METHODS
It follows that
cn (t) = hn|UI (t0 , t)|ii, (6.192)
where use has been made of hn|mi = δnm . Thus, the probability that the system
is found in state |ni at time t, given that it is definitely in state |ii at time t 0 , is
simply
Pi→n (t0 , t) = |hn|UI (t0 , t)|ii|2 . (6.193)
This quantity is usually termed the transition probability between states |ii and
|ni.
Note that the differential equation (6.187), plus the boundary condition (6.189),
are equivalent to the following integral equation,
Z
i t
UI (t0 , t) = 1 − HI (t0 , t 0 ) UI (t0 , t 0 ) dt 0 . (6.194)
h̄ t0
We can obtain an approximate solution to this equation by iteration:
Z Z t0
i t
i
UI (t0 , t) ' 1 − HI (t0 , t 0 ) 1 − HI (t0 , t 00 ) UI (t0 , t 00 ) dt 0
h̄ t0 h̄ t0
Zt
i
' 1− HI (t0 , t 0 ) dt 0
h̄ t0
! Z Z t0
−i 2 t
+ dt 0 HI (t0 , t 0 ) HI (t0 , t 00 ) dt 00 + · · · . (6.195)
h̄ t0 t0
where the superscript (1) refers to a first-order term in the expansion, etc. It
follows from Eqs. (6.192) and (6.195) that
c(0)
n (t) = δin , (6.197)
152
6.12 The Dyson series 6 APPROXIMATION METHODS
Zt
i
c(1)
n (t) = − hn|HI (t0 , t 0 )|ii dt 0 , (6.198)
h̄ t0
!2 Z t Z t0
−i 0
c(2)
n (t) = dt hn|HI (t0 , t 0 ) HI (t0 , t 00 )|ii dt 00 . (6.199)
h̄ t0 t0
c(0)
n (t) = δin , (6.200)
Zt
i
c(1)
n (t) = − exp[ i ωni (t 0 − t0 )] Hni (t 0 ) dt 0 , (6.201)
h̄ t0
!2 X Z t Z t0
−i
c(2)
n (t) = dt 0 dt 00 exp[ i ωnm (t 0 − t0 )]
h̄ m t0 t0
where
En − E m
ωnm = , (6.203)
h̄
and
Hnm (t) = hn|H1 (t)|mi. (6.204)
The transition probability between states i and n is simply
153
6.13 Constant perturbations 6 APPROXIMATION METHODS
154
6.13 Constant perturbations 6 APPROXIMATION METHODS
giving Z
2t
Pi→ (t) = |Hni |2 ρ(En ) sinc2 (x) dx, (6.216)
h̄
where
x = (En − Ei ) t/2 h̄, (6.217)
and use has been made of Eq. (6.210). We know that in the limit t → ∞ the
function sinc(x) is only non-zero in an infinitesimally narrow range of final ener-
gies centred on En = Ei . It follows that, in this limit, we can take ρ(En ) and |Hni |2
out of the integral in the above formula to obtain
2π
Pi→[n] (t) = |Hni |2 ρ(En ) t , (6.218)
h̄ En 'Ei
155
6.13 Constant perturbations 6 APPROXIMATION METHODS
where Pi→[n] denotes the transition probability between the initial state |ii and
all final states |ni which have approximately the same energy as the initial state.
Here, |Hni |2 is the average of |Hni |2 over all final states with approximately the
same energy as the initial state. In deriving the above formula, we have made
use of the result Z∞
sinc2 (x) dx = π. (6.219)
−∞
Note that the transition probability, Pi→[n] , is now proportional to t, instead of t2 .
Let us now calculate the second-order term in the Dyson series, using the
constant perturbation (6.206). From Eq. (6.202) we find that
!2 X Zt Z t0
−i
c(2)
n (t) = Hnm Hmi dt 0 exp( i ωnm t 0 ) dt 00 exp( i ωmi t )
h̄ m 0 0
Z
i X Hnm Hmi t
= [exp( i ωni t 0 ) − exp( i ωnm t 0 ] ) dt 0
h̄ m Em − Ei 0
i t X Hnm Hmi
= [exp( i ωni t/2) sinc(ωni t/2)
h̄ m Em − Ei
156
6.13 Constant perturbations 6 APPROXIMATION METHODS
Thus,
it X Hnm Hmi
cn (t) = c(1)
n + cn(2) = exp( i ωni t/2) Hni +
sinc(ω t/2)
ni
h̄ m E m − E i
X Hnm Hmi
− exp( i ωim t/2) sinc(ωnm t/2) , (6.224)
m E m − E i
where use has been made of Eq. (6.208). It follows, by analogy with the previous
analysis, that
2
2π X Hnm Hmi
wi→[n] = Hni + ρ(En )
, (6.225)
h̄ m Em − E i
En 'Ei
where the transition rate is calculated for all final states, |ni, with approximately
the same energy as the initial state, |ii, and for intermediate states, |mi whose
energies differ from that of the initial state. The fact that Em 6= Ei causes the
last term on the right-hand side of Eq. (6.224) to average to zero (due to the
oscillatory phase-factor) during the evaluation of the transition probability.
157
6.14 Harmonic perturbations 6 APPROXIMATION METHODS
Let us initiate the system in the eigenstate |ii of the unperturbed Hamiltonian,
H0 , and switch on the harmonic perturbation at t = 0. It follows from Eq. (6.201)
that
Z
−i t †
(1)
cn = 0
Vni exp(i ωt ) + Vni exp(−i ωt ) exp( i ωni t 0 ) dt 0 , (6.227)
0
h̄ 0
1 1 − exp[ i (ωni + ω) t] 1 − exp[ i (ωni − ω) t] †
= Vni + Vni ,
h̄ ωni + ω ωni − ω
where
Clearly, (6.231) corresponds to the first term on the right-hand side of Eq. (6.227),
and (6.232) corresponds to the second term. The former term describes a process
by which the system gives up energy h̄ω to the perturbing field, whilst making a
transition to a final state whose energy level is less than that of the initial state
158
6.15 Absorption and stimulated emission of radiation 6 APPROXIMATION METHODS
by h̄ω. This process is known as stimulated emission. The latter term describes
a process by which the system gains energy h̄ω from the perturbing field, whilst
making a transition to a final state whose energy level exceeds that of the initial
state by h̄ω. This process is known as absorption. In both cases, the total energy
(i.e., that of the system plus the perturbing field) is conserved.
Equation (6.233) specifies the transition rate for stimulated emission, whereas
Eq. (6.234) gives the transition rate for absorption. These equations are more
usually written
2π
wi→n = |Vni |2 δ(En − Ei + h̄ω), (6.235)
h̄
2π † 2
wi→n = |V | δ(En − Ei − h̄ω). (6.236)
h̄ ni
† 2
It is clear from Eqs. (6.228)-(6.229) that |Vni | = |Vni |2 . It follows from
Eqs. (6.233)–(6.234) that
wi→[n] wn→[i]
= . (6.237)
ρ(En ) ρ(Ei )
In other words, the rate of stimulated emission, divided by the density of final
states for stimulated emission, equals the rate of absorption, divided by the den-
sity of final states for absorption. This result, which expresses a fundamental
symmetry between absorption and stimulated emission, is known as detailed bal-
ancing, and is very important in statistical mechanics.
159
6.15 Absorption and stimulated emission of radiation 6 APPROXIMATION METHODS
electromagnetic radiation.
where and n are unit vectors which specify the direction of polarization and
the direction of propagation, respectively. Note that ·n = 0. The Hamiltonian
becomes
H = H0 + H1 (t), (6.249)
with
p2
H0 = + V(r), (6.250)
2 me
and
e A·p
H1 ' − , (6.251)
me
where the A2 term, which is second order in A0 , has been neglected.
U= + , (6.255)
2 2 2 µ0
161
6.16 The electric dipole approximation 6 APPROXIMATION METHODS
where E0 and B0 = E0 /c = 2 A0 ω/c are the peak electric and magnetic field-
strengths, respectively. The incident power per unit area of the electromagnetic
field is
c U = 2 0 c ω2 |A0 |2 . (6.256)
Now,
h̄ ω wi→n
σabs = , (6.257)
cU
so
π e2
σabs = 2
|hn| exp[ i (ω/c) n·r] ·p |ii|2 δ(En − Ei − h̄ω). (6.258)
0 m e ω c
162
6.16 The electric dipole approximation 6 APPROXIMATION METHODS
∆l = ±1, (6.264)
∆m = 0. (6.265)
Here, l is the quantum number describing the total orbital angular momentum
of the electron, and m is the quantum number describing the projection of the
orbital angular momentum along the z-axis. It is easily demonstrated that hn|x|ii
and hn|y|ii are only non-zero if
∆l = ±1, (6.266)
∆m = ±1. (6.267)
∆l = ±1, (6.268)
∆m = 0, ±1. (6.269)
These are termed the selection rules for electric dipole transitions. It is clear, for
instance, that the electric dipole approximation allows a transition from a 2p
state to a 1s state, but disallows a transition from a 2s to a 1s state. The latter
transition is called a forbidden transition.
Forbidden transitions are not strictly forbidden. Instead, they take place at a
far lower rate than transitions which are allowed according to the electric dipole
approximation. After electric dipole transitions, the next most likely type of tran-
sition is a magnetic dipole transition, which is due to the interaction between the
electron spin and the oscillating magnetic field of the incident electromagnetic
radiation. Magnetic dipole transitions are typically about 10 5 times more unlikely
than similar electric dipole transitions. The first-order term in Eq. (6.259) yields
163
6.16 The electric dipole approximation 6 APPROXIMATION METHODS
so-called electric quadrupole transitions. These are typically about 10 8 times more
unlikely than electric dipole transitions. Magnetic dipole and electric quadrupole
transitions satisfy different selection rules than electric dipole transitions: for
instance, the selection rules for electric quadrupole transitions are ∆l = 0, ±2.
Thus, transitions which are forbidden as electric dipole transitions may well be
allowed as magnetic dipole or electric quadrupole transitions.
Integrating Eq. (6.263) over all possible frequencies of the incident radiation
yields Z X
σabs (ω) dω = 4π2 α ωni |hn|·r|ii|2 . (6.270)
n
Suppose, for the sake of definiteness, that the incident radiation is polarized in
the x-direction. It is easily demonstrated that
h̄2
[x, [x, H0 ] ] = − . (6.271)
me
Thus,
2 2 h̄2
hi|[x, [x, H0 ] ]|ii = hi|x H0 + H0 x − 2 x H0 x|ii = − , (6.272)
me
giving
X h̄2
2 (hi|x|niEi hn|x|ii − hi|x|niEn hn|x|ii) = − . (6.273)
n me
It follows that
2 me X
ωni |hn|x|ii|2 = 1. (6.274)
h̄ n
This is known as the Thomas-Reiche-Kuhn sum rule. According to this rule,
Eq. (6.270) reduces to
Z
2π2 α h̄ π e2
σabs (ω) dω = = . (6.275)
me 2 0 m e c
Note that h̄ has dropped out of the final result. In fact, the above formula is ex-
actly the same as that obtained classically by treating the electron as an oscillator.
164
6.17 Energy-shifts and decay-widths 6 APPROXIMATION METHODS
We have examined how a state |ni, other than the initial state |ii, becomes popu-
lated as a result of some time-dependent perturbation applied to the system. Let
us now consider how the initial state becomes depopulated.
In the remote past, t → −∞, the system is assumed to be in the initial state |ii.
Thus, ci (t → −∞) = 1, and cn6=i (t → −∞) = 0. Basically, we want to calculate
the time evolution of the coefficient ci (t). First, however, let us check that our
previous Fermi golden rule result still applies when the perturbing potential is
turned on slowly, instead of very suddenly. For cn6=i (t) we have from Eqs. (6.200)–
(6.201) that
c(0)
n (t) = 0, (6.277)
Zt
i
c(1)
n (t) = − Hni exp[ (η + i ωni )t 0 ] dt 0
h̄ −∞
i exp[ (η + i ωni )t ]
= − Hni , (6.278)
h̄ η + i ωni
where Hni = hn|H1 |ii. It follows that, to first-order, the transition probability
from state |ii to state |ni is
|Hni |2 exp(2 η t)
Pi→n (t) = |c(1) 2
n | = . (6.279)
h̄2 η2 + ωni2
The transition rate is given by
dPi→n 2 |Hni |2 η exp(2 η t)
wi→n (t) = = . (6.280)
dt h̄2 η2 + ωni2
Consider the limit η → 0. In this limit, exp(η t) → 1, but
η
lim 2 2 = π δ(ωni ) = π h̄ δ(En − Ei ). (6.281)
η→0 η + ωni
165
6.17 Energy-shifts and decay-widths 6 APPROXIMATION METHODS
Thus, Eq. (6.280) yields the standard Fermi golden rule result
2π
wi→n = |Hni |2 δ(En − Ei ). (6.282)
h̄
It is clear that the delta-function in the above formula actually represents a func-
tion which is highly peaked at some particular energy. The width of the peak is
determined by how fast the perturbation is switched on.
Let us now consider the ratio ċi /ci , where ċi ≡ dci /dt. Using Eq. (6.286), we
can evaluate this ratio in the limit η → 0. We obtain
!2 2 !X 2
ċi −i −i |H | −i |H |
!
ii mi
' Hii + +
h̄ m6=i Ei − Em + i h̄ η
ci h̄ h̄ η
i Hii
, !
1−
h̄ η
−i −i X |Hmi |2
! !
' Hii + lim . (6.287)
h̄ η→0 h̄
m6=i
E i − E m + i h̄ η
166
6.17 Energy-shifts and decay-widths 6 APPROXIMATION METHODS
167
6.17 Energy-shifts and decay-widths 6 APPROXIMATION METHODS
and
Γi 2 Im(∆i ) 2π X
=− = |Hmi |2 δ(Ei − Em ). (6.297)
h̄ h̄ h̄ m6=i
Note that the energy-shift ∆Ei is the same as that predicted by standard time-
independent perturbation theory.
The probability of observing the system in state |ii at time t > 0, given that it
is definately in state |ii at time t = 0, is given by
where
Γi X
= wi→m . (6.299)
h̄ m6=i
Here, use has been made of Eq. (6.222). Clearly, the rate of decay of the initial
state is a simple function of the transition rates to the other states. Note that the
system conserves probability up to second-order in perturbed quantities, since
X X
|ci |2 + |cm |2 ' (1 − Γi t/h̄) + wi→m t = 1. (6.300)
m6=i m6=i
The quantity ∆i is called the decay-width of state |ii. It is closely related to the
mean lifetime of this state,
h̄
τi = , (6.301)
Γi
where
Pi→i = exp(−t/τi ). (6.302)
According to Eq. (6.294), the amplitude of state |ii both oscillates and decays as
time progresses. Clearly, state |ii is not a stationary state in the presence of the
time-dependent perturbation. However, we can still represent it as a superposi-
tion of stationary states (whose amplitudes simply oscillate in time). Thus,
Z
exp[−i (Ei + ∆Ei ) t/h̄] exp(−Γi t/2h̄) = f(E) exp(−i E t/h̄) dE, (6.303)
168
6.17 Energy-shifts and decay-widths 6 APPROXIMATION METHODS
where f(E) is the weight of the stationary state with energy E in the superposition.
The Fourier inversion theorem yields
1
|f(E)|2 ∝ . (6.304)
(E − [Ei + Re(∆i )])2 + Γi 2 /4
In the absence of the perturbation, |f(E)|2 is basically a delta-function centred on
the unperturbed energy Ei of state |ii. In other words, state |ii is a stationary state
whose energy is completely determined. In the presence of the perturbation, the
energy of state |ii is shifted by Re(∆i ). The fact that the state is no longer station-
ary (i.e., it decays in time) implies that its energy cannot be exactly determined.
Indeed, the energy of the state is smeared over some region of width (in energy)
Γi centred around the shifted energy Ei + Re(∆i ). The faster the decay of the state
(i.e., the larger Γi ), the more its energy is spread out. This effect is clearly a man-
ifestation of the energy-time uncertainty relation ∆E ∆t ∼ h̄. One consequence of
this effect is the existence of a natural width of spectral lines associated with the
decay of some excited state to the ground state (or any other lower energy state).
The uncertainty in energy of the excited state, due to its propensity to decay, gives
rise to a slight smearing (in wave-length) of the spectral line associated with the
transition. Strong lines, which correspond to fast transitions, are smeared out
more that weak lines. For this reason, spectroscopists generally favour forbid-
den lines for Doppler shift measurements. Such lines are not as bright as those
corresponding to allowed transitions, but they are a lot sharper.
169
7 SCATTERING THEORY
7 Scattering theory
7.1 Introduction
Historically, data regarding quantum phenomena has been obtained from two
main sources—the study of spectroscopic lines, and scattering experiments. We
have already developed theories which account for some aspects of the spectra of
hydrogen-like atoms. Let us now examine the quantum theory of scattering.
170
7.2 The Lipmann-Schwinger equation 7 SCATTERING THEORY
171
7.2 The Lipmann-Schwinger equation 7 SCATTERING THEORY
where
(∇2 + k2 ) G(r, r 0 ) = δ(r − r 0 ). (7.11)
Note that the solution (7.10) satisfies the boundary condition |ψi → |φi as H1 →
0. As is well-known, the Green’s function for the Helmholtz problem is given by
exp(±i k |r − r 0 | )
G(r, r 0 ) = − . (7.12)
4π |r − r 0 |
Thus, Eq. (7.10) becomes
Z
2 m exp(±i k |r − r 0 | ) 0
ψ± (r) = φ(r) − 2 0
hr |H1 |ψi d3 r 0 . (7.13)
h̄ 4π |r − r |
A comparison of Eqs. (7.7) and (7.13) suggests that the kernel to Eq. (7.7) takes
the form
1 2 m exp(±i k |r − r 0 | )
* +
0
r
r =− 2 . (7.14)
E − H0 ± i h̄ 4π |r − r 0 |
It is not entirely clear that the ± signs correspond on both sides of this equation.
In fact, they do, as is easily proved by a more rigorous derivation of this result.
= V(r 0 ) ψ± (r 0 ). (7.16)
Thus, the integral equation (7.13) simplifies to
Z
± 2 m exp(±i k |r − r 0 |)
ψ (r) = φ(r) − 2 0
V(r 0 ) ψ± (r 0 ) d3 r 0 . (7.17)
h̄ 4π |r − r |
Suppose that the initial state |φi is a plane-wave with wave-vector k (i.e., a
stream of particles of definite momentum p = h̄ k). The ket corresponding to
this state is denoted |ki. The associated wave-function takes the form
exp( i k·r)
hr|ki = . (7.18)
(2π)3/2
172
7.2 The Lipmann-Schwinger equation 7 SCATTERING THEORY
Suppose that the scattering potential V(r) is only non-zero in some relatively
localized region centred on the origin (r = 0). Let us calculate the wave-function
ψ(r) a long way from the scattering region. In other words, let us adopt the
ordering r r 0 . It is easily demonstrated that
|r − r 0 | ' r − ^r ·r 0 (7.20)
173
7.2 The Lipmann-Schwinger equation 7 SCATTERING THEORY
It is obvious that the former represents the physical solution. Thus, the wave-
function a long way from the scattering region can be written
1 exp( i kr)
ψ(r) = 3/2
exp( i k·r) + f(k 0 , k) , (7.25)
(2π) r
where
Z
0 (2π)2 m exp(−i k 0 ·r 0 )
f(k , k) = − V(r 0 ) ψ(r 0 ) d3 r 0
h̄2 (2π) 3/2
(2π)2 m 0
= − hk |H1 |ψi. (7.26)
h̄2
giving
dσ
= |f(k 0 , k)|2 . (7.33)
dΩ
Thus, |f(k 0 , k)|2 gives the differential cross-section for particles with incident mo-
mentum h̄ k to be scattered into states whose momentum vectors are directed in
a range of solid angles dΩ about h̄ k 0 . Note that the scattered particles possess
the same energy as the incoming particles (i.e., k 0 = k). This is always the case
for scattering Hamiltonians of the form shown in Eq. (7.15).
Suppose that the scattering is not particularly strong. In this case, it is reason-
able to suppose that the total wave-function, ψ(r), does not differ substantially
from the incident wave-function, φ(r). Thus, we can obtain an expression for
f(k 0 , k) by making the substitution
exp( i k·r)
ψ(r) → φ(r) = . (7.35)
(2π)3/2
This is called the Born approximation.
176
7.3 The Born approximation 7 SCATTERING THEORY
The Born approximation is valid provided that ψ(r) is not too different from
φ(r) in the scattering region. It follows, from Eq. (7.17), that the condition for
ψ(r) ' φ(r) in the vicinity of r = 0 is
Z 0
m
exp( i k r ) 0
3 0
V(r ) d r 1. (7.47)
2π h̄2 r0
Consider the special case of the Yukawa potential. At low energies, (i.e., k µ)
we can replace exp( i k r 0 ) by unity, giving
2 m |V0 |
1 (7.48)
h̄2 µ2
as the condition for the validity of the Born approximation. The condition for the
Yukawa potential to develop a bound state is
2 m |V0 |
≥ 2.7, (7.49)
h̄2 µ2
where V0 is negative. Thus, if the potential is strong enough to form a bound
state then the Born approximation is likely to break down. In the high-k limit,
Eq. (7.47) yields
2 m |V0 |
1. (7.50)
h̄2 µ k
This inequality becomes progressively easier to satisfy as k increases, implying
that the Born approximation is more accurate at high incident particle energies.
177
7.4 Partial waves 7 SCATTERING THEORY
Outside the range of the scattering potential, both φ(r) and ψ(r) satisfy the
free space Schrödinger equation
(∇2 + k2 ) ψ = 0. (7.54)
What is the most general solution to this equation in spherical polar coordinates
which does not depend on the azimuthal angle ϕ? Separation of variables yields
X
ψ(r, θ) = Rl (r) Pl (cos θ), (7.55)
l
since the Legendre functions Pl (cos θ) form a complete set in θ-space. The Leg-
endre functions are related to the spherical harmonics introduced in Sect. 5 via
v
4π
u
Yl0 (θ, ϕ).
u
Pl (cos θ) = t
(7.56)
2l + 1
178
7.4 Partial waves 7 SCATTERING THEORY
We can write
X
exp( i k r cos θ) = al jl (k r) Pl (cos θ), (7.62)
l
where the al are constants. Note there are no Neumann functions in this expan-
sion, because they are not well-behaved as r → 0. The Legendre functions are
orthonormal, Z1
δnm
Pn (µ) Pm (µ) dµ = , (7.63)
−1 n + 1/2
so we can invert the above expansion to give
Z1
al jl (k r) = (l + 1/2) exp( i k r µ) Pl (µ) dµ. (7.64)
−1
It is well-known that
Z1
(−i)l
jl (y) = exp( i y µ) Pl (µ) dµ, (7.65)
2 −1
179
7.4 Partial waves 7 SCATTERING THEORY
where l = 0, 1, 2, · · · [see Abramowitz and Stegun (Dover, New York NY, 1965),
Eq. 10.1.14]. Thus,
al = il (2 l + 1), (7.66)
giving X
exp( i k r cos θ) = il (2 l + 1) jl (k r) Pl (cos θ). (7.67)
l
The above expression tells us how to decompose a plane-wave into a series of
spherical-waves (or “partial waves”).
The most general solution for the total wave-function outside the scattering
region is
1 X
ψ(r) = [Al jl (k r) + Bl ηl (k r)] Pl (cos θ), (7.68)
(2π)3/2 l
where the Al and Bl are constants. Note that the Neumann functions are allowed
to appear in this expansion, because its region of validity does not include the
origin. In the large-r limit, the total wave-function reduces to
1 X sin(k r − l π/2) cos(k r − l π/2)
ψ(r) ' A l − B l Pl (cos θ), (7.69)
(2π)3/2 l kr kr
where use has been made of Eqs. (7.60)–(7.61). The above expression can also
be written
1 X sin(k r − l π/2 + δl )
ψ(r) ' Cl Pl (cos θ), (7.70)
(2π)3/2 l kr
where the sine and cosine functions have been combined to give a sine function
which is phase-shifted by δl .
180
7.5 The optical theorem 7 SCATTERING THEORY
Clearly, determining the scattering amplitude f(θ) via a decomposition into par-
tial waves (i.e., spherical-waves) is equivalent to determining the phase-shifts δ l .
181
7.6 Determination of phase-shifts 7 SCATTERING THEORY
where use has been made of Eq. (7.63). A comparison of this result with Eq. (7.75)
yields
4π
σtotal = Im [f(0)] , (7.78)
k
since Pl (1) = 1. This result is known as the optical theorem. It is a reflection
of the fact that the very existence of scattering requires scattering in the forward
(θ = 0) direction in order to interfere with the incident wave, and thereby reduce
the probability current in this direction.
It is usual to write ∞
X
σtotal = σl , (7.79)
l=0
where
4π
σl = 2
(2 l + 1) sin2 δl (7.80)
k
is the lth partial cross-section: i.e., the contribution to the total cross-section from
the lth partial wave. Note that the maximum value for the lth partial cross-section
occurs when the phase-shift δl takes the value π/2.
Let us now consider how the phase-shifts δl can be evaluated. Consider a spher-
ically symmetric potential V(r) which vanishes for r > a, where a is termed
the range of the potential. In the region r > a, the wave-function ψ(r) satisfies
the free-space Schrödinger equation (7.54). The most general solution which is
consistent with no incoming spherical-waves is
∞
1 X l
ψ(r) = i (2 l + 1) Al (r) Pl (cos θ), (7.81)
(2π)3/2 l=0
182
7.6 Determination of phase-shifts 7 SCATTERING THEORY
where
Al (r) = exp( i δl ) [ cos δl jl (k r) − sin δl ηl (k r) ] . (7.82)
Note that Neumann functions are allowed to appear in the above expression,
because its region of validity does not include the origin (where V 6= 0). The
logarithmic derivative of the lth radial wave-function Al (r) just outside the range
of the potential is given by
cos δl jl0 (k a) − sin δl ηl0 (k a)
βl+ = ka , (7.83)
cos δl jl (k a) − sin δl ηl (k a)
where jl0 (x) denotes djl (x)/dx, etc. The above equation can be inverted to give
k a jl0 (k a) − βl+ jl (k a)
tan δl = . (7.84)
k a ηl0 (k a) − βl+ ηl (k a)
Thus, the problem of determining the phase-shift δl is equivalent to that of ob-
taining βl+ .
The most general solution to Schrödinger’s equation inside the range of the
potential (r < a) which does not depend on the azimuthal angle ϕ is
∞
1 X l
ψ(r) = i (2 l + 1) Rl (r) Pl (cos θ), (7.85)
(2π)3/2 l=0
where
ul (r)
Rl (r) = , (7.86)
r
and
d2 ul 2 2m
l (l + 1)
+ k − V − ul = 0. (7.87)
dr2 h̄2 r2
The boundary condition
ul (0) = 0 (7.88)
ensures that the radial wave-function is well-behaved at the origin. We can
launch a well-behaved solution of the above equation from r = 0, integrate out
to r = a, and form the logarithmic derivative
1 d(ul /r)
βl− = . (7.89)
(ul /r) dr r=a
183
7.7 Hard sphere scattering 7 SCATTERING THEORY
Since ψ(r) and its first derivatives are necessarily continuous for physically ac-
ceptible wave-functions, it follows that
βl+ = βl− . (7.90)
The phase-shift δl is obtainable from Eq. (7.84).
Let us test out this scheme using a particularly simple example. Consider scatter-
ing by a hard sphere, for which the potential is infinite for r < a, and zero for
r > a. It follows that ψ(r) is zero in the region r < a, which implies that u l = 0
for all l. Thus,
βl− = βl+ = ∞, (7.91)
for all l. It follows from Eq. (7.84) that
jl (k a)
tan δl = . (7.92)
ηl (k a)
Let us consider the low and high energy asymptotic limits of tan δl . Low energy
means k a 1. In this regime, the spherical Bessel functions and Neumann
functions reduce to:
(k r)l
jl (k r) ' , (7.97)
(2 l + 1)!!
(2 l − 1)!!
ηl (k r) ' − , (7.98)
(k r)l+1
where n!! = n (n − 2) (n − 4) · · · 1. It follows that
−(k a)2 l+1
tan δl = . (7.99)
(2 l + 1) [(2 l − 1)!!]2
It is clear that we can neglect δl , with l > 0, with respect to δ0 . In other words,
at low energy only s-wave scattering (i.e., spherically symmetric scattering) is
important. It follows from Eqs. (7.33), (7.75), and (7.94) that
dσ sin2 k a
= 2
' a2 (7.100)
dΩ k
for k a 1. Note that the total cross-section
Z
dσ
σtotal = dΩ = 4π a2 (7.101)
dΩ
is four times the geometric cross-section π a2 (i.e., the cross-section for classical
particles bouncing off a hard sphere of radius a). However, low energy scattering
implies relatively long wave-lengths, so we do not expect to obtain the classical
result in this limit.
Consider the high energy limit k a 1. At high energies, all partial waves
up to lmax = k a contribute significantly to the scattering cross-section. It follows
from Eq. (7.77) that
lmax
4π X
σtotal = 2 (2 l + 1) sin2 δl . (7.102)
k l=0
185
7.8 Low energy scattering 7 SCATTERING THEORY
At low energies (i.e., when 1/k is much larger than the range of the potential)
partial waves with l > 0, in general, make a negligible contribution to the scatter-
ing cross-section. It follows that, at these energies, with a finite range potential,
only s-wave scattering is important.
186
7.8 Low energy scattering 7 SCATTERING THEORY
δ0 ' k a − 1 . (7.113)
k 0a
187
7.9 Resonances 7 SCATTERING THEORY
Note that there are values of k 0 a (e.g., k 0 a ' 4.49) at which δ0 → π, and the
scattering cross-section (7.114) vanishes, despite the very strong attraction of the
potential. In reality, the cross-section is not exactly zero, because of contributions
from l > 0 partial waves. But, at low incident energies, these contributions are
small. It follows that there are certain values of V0 and k which give rise to almost
perfect transmission of the incident wave. This is called the Ramsauer-Townsend
effect, and has been observed experimentally.
7.9 Resonances
Note that the cross-section now depends on the energy. Furthermore, the mag-
nitude of the cross-section is much larger than that given in Eq. (7.114) for
k 0 a 6= π/2 (since k a 1).
The origin of this rather strange behaviour is quite simple. The condition
v
u 2 m |V0 | a2 π
u
t
2 = (7.118)
h̄ 2
is equivalent to the condition that a spherical well of depth V0 possesses a bound
state at zero energy. Thus, for a potential well which satisfies the above equation,
the energy of the scattering system is essentially the same as the energy of the
bound state. In this situation, an incident particle would like to form a bound
state in the potential well. However, the bound state is not stable, since the
system has a small positive energy. Nevertheless, this sort of resonance scattering
is best understood as the capture of an incident particle to form a metastable
bound state, and the subsequent decay of the bound state and release of the
particle. The cross-section for resonance scattering is generally far higher than
that for non-resonance scattering.
We have seen that there is a resonant effect when the phase-shift of the s-wave
takes the value π/2. There is nothing special about the l = 0 partial wave, so it
is reasonable to assume that there is a similar resonance when the phase-shift of
the lth partial wave is π/2. Suppose that δl attains the value π/2 at the incident
energy E0 , so that
π
δl (E0 ) = . (7.119)
2
Let us expand cot δl in the vicinity of the resonant energy:
d cot δl
!
cot δl (E) = cot δl (E0 ) + (E − E0 ) + · · · (7.120)
dE E=E0
1 dδl
= − 2 (E − E0 ) + · · · . (7.121)
sin δl dE E=E0
Defining
dδl (E) 2
= , (7.122)
dE E=E0 Γ
189
7.9 Resonances 7 SCATTERING THEORY
we obtain
2
cot δl (E) = − (E − E0 ) + · · · . (7.123)
Γ
Recall, from Eq. (7.80), that the contribution of the lth partial wave to the scat-
tering cross-section is
4π 2 4π 1
σl = (2 l + 1) sin δ l = (2 l + 1) . (7.124)
k2 k2 1 + cot2 δl
Thus,
4π Γ 2 /4
σl ' 2 (2 l + 1) . (7.125)
k (E − E0 )2 + Γ 2 /4
This is the famous Breit-Wigner formula. The variation of the partial cross-section
σl with the incident energy has the form of a classical resonance curve. The
quantity Γ is the width of the resonance (in energy). We can interpret the Breit-
Wigner formula as describing the absorption of an incident particle to form a
metastable state, of energy E0 , and lifetime τ = h̄/Γ (see Sect. 6.17).
190