Genrel
Genrel
General Relativity
Benjamin Crowell
www.lightandmatter.com
Fullerton, California
www.lightandmatter.com
5
6
Contents
3 Differential Geometry 87
3.1 Tangent vectors . . . . . . . . . . . . . . . . . . 88
3.2 Affine notions and parallel transport . . . . . . . . . 89
The affine parameter in curved spacetime: a rough sketch, 89.—
The affine parameter in more detail, 90.—Parallel transport, 90.
3.3 Models . . . . . . . . . . . . . . . . . . . . . 92
3.4 Intrinsic quantities . . . . . . . . . . . . . . . . . 96
Coordinate independence, 97.
3.5 The metric . . . . . . . . . . . . . . . . . . . . 99
The Euclidean metric, 101.—The Lorentz metric, 106.—Isometry,
inner products, and the Erlangen Program, 107.—Einstein’s carousel,
109.
3.6 The metric in general relativity. . . . . . . . . . . . 115
The hole argument, 115.—A Machian paradox, 116.
7
3.7 Interpretation of coordinate independence. . . . . . . 117
Is coordinate independence obvious?, 117.—Is coordinate indepen-
dence trivial?, 118.—Coordinate independence as a choice of gauge,
119.
Problems . . . . . . . . . . . . . . . . . . . . . . 120
4 Tensors 123
4.1 Lorentz scalars . . . . . . . . . . . . . . . . . . 123
4.2 Four-vectors . . . . . . . . . . . . . . . . . . . 124
The velocity and acceleration four-vectors, 124.—The momentum
four-vector, 126.—The frequency vector and the relativistic Doppler
shift, 133.—A non-example: electric and magnetic fields, 136.—
The electromagnetic potential four-vector, 137.
4.3 The tensor transformation laws . . . . . . . . . . . 138
4.4 Experimental tests . . . . . . . . . . . . . . . . 142
Universality of tensor behavior, 142.—Speed of light differing from
c, 142.—Degenerate matter, 143.
4.5 Conservation laws. . . . . . . . . . . . . . . . . 148
No general conservation laws, 148.—Conservation of angular mo-
mentum and frame dragging, 149.
4.6 Things that aren’t quite tensors . . . . . . . . . . . 151
Area, volume, and tensor densities, 151.—The Levi-Civita symbol,
153.—Spacetime volume, 155.—Angular momentum, 155.
Problems . . . . . . . . . . . . . . . . . . . . . . 156
5 Curvature 159
5.1 Tidal curvature versus curvature caused by local sources 160
5.2 The stress-energy tensor . . . . . . . . . . . . . . 161
5.3 Curvature in two spacelike dimensions . . . . . . . . 162
5.4 Curvature tensors . . . . . . . . . . . . . . . . . 168
5.5 Some order-of-magnitude estimates . . . . . . . . . 170
The geodetic effect, 170.—Deflection of light rays, 171.
5.6 The covariant derivative . . . . . . . . . . . . . . 172
The covariant derivative in electromagnetism, 173.—The covariant
derivative in general relativity, 174.
5.7 The geodesic equation . . . . . . . . . . . . . . . 179
Characterization of the geodesic, 179.—Covariant derivative with
respect to a parameter, 179.—The geodesic equation, 180.—Uniqueness,
180.
5.8 Torsion . . . . . . . . . . . . . . . . . . . . . 181
Are scalars path-dependent?, 181.—The torsion tensor, 184.—Experimental
searches for torsion, 185.
5.9 From metric to curvature . . . . . . . . . . . . . . 188
Finding the Christoffel symbol from the metric, 188.—Numerical
solution of the geodesic equation, 189.—The Riemann tensor in
terms of the Christoffel symbols, 191.—Some general ideas about
gauge, 191.
5.10 Manifolds . . . . . . . . . . . . . . . . . . . . 194
Why we need manifolds, 194.—Topological definition of a manifold,
195.—Hausdorff property, 197.—Local-coordinate definition of a
8
manifold, 198.—Differentiable manifolds, 200.—The tangent space,
201.
5.11 Units in general relativity . . . . . . . . . . . . . 202
Problems . . . . . . . . . . . . . . . . . . . . . . 209
7 Symmetries 261
7.1 Killing vectors. . . . . . . . . . . . . . . . . . . 261
Killing vectors, 261.—Inappropriate mixing of notational systems,
265.—Conservation laws, 266.
7.2 Spherical symmetry . . . . . . . . . . . . . . . . 269
7.3 Penrose diagrams and causality . . . . . . . . . . . 271
Flat spacetime, 271.—Schwarzschild spacetime, 272.—Astrophysical
black hole, 273.—Penrose diagrams in general, 274.—Global hyper-
bolicity, 275.
7.4 Static and stationary spacetimes. . . . . . . . . . . 278
Stationary spacetimes, 278.—Isolated systems, 278.—A stationary
field with no other symmetries, 279.—A stationary field with addi-
tional symmetries, 280.—Static spacetimes, 281.—Birkhoff’s the-
orem, 281.—No-hair theorems, 282.—The gravitational potential,
284.
7.5 The uniform gravitational field revisited . . . . . . . . 285
Closed timelike curves, 288.
Problems . . . . . . . . . . . . . . . . . . . . . . 290
8 Sources 293
8.1 Sources in general relativity . . . . . . . . . . . . . 293
Point sources in a background-independent theory, 293.—The Ein-
stein field equation, 294.—Energy conditions, 307.—The cosmolog-
ical constant, 318.
8.2 Cosmological solutions. . . . . . . . . . . . . . . 322
Evidence for the finite age of the universe, 322.—Evidence for
expansion of the universe, 323.—Evidence for homogeneity and
isotropy, 324.—The FRW cosmologies, 325.—A singularity at the
9
Big Bang, 331.—Observability of expansion, 333.—The vacuum-
dominated solution, 341.—The matter-dominated solution, 346.—
The radiation-dominated solution, 350.—Local effects of expan-
sion, 350.—Observation, 354.
8.3 Mach’s principle revisited . . . . . . . . . . . . . . 357
The Brans-Dicke theory, 357.—Predictions of the Brans-Dicke the-
ory, 361.—Hints of empirical support, 361.—Mach’s principle is
false., 362.
8.4 Historical note: the steady-state model . . . . . . . . 363
Problems . . . . . . . . . . . . . . . . . . . . . . 367
10
Chapter 1
A Geometrical Theory of
Spacetime
“I always get a slight brain-shiver, now [that] space and time appear
conglomerated together in a gray, miserable chaos.” – Sommerfeld
This is a book about general relativity, at a level that is meant
to be accessible to advanced undergraduates.
This is mainly a book about general relativity, not special rel-
ativity. I’ve heard the sentiment expressed that books on special
relativity generally do a lousy job on special relativity, compared to
books on general relativity. This is undoubtedly true, for someone
who already has already learned special relativity — but wants to
unlearn the parts that are completely wrong in the broader context
of general relativity. For someone who has not already learned spe-
cial relativity, I strongly recommend mastering it first, from a book
such as Taylor and Wheeler’s Spacetime Physics.
In the back of this book I’ve included excerpts from three papers
by Einstein — two on special relativity and one on general relativity.
They can be read before, after, or along with this book. There are
footnotes in the papers and in the main text linking their content
with each other.
I should reveal at the outset that I am not a professional rela-
tivist. My field of research was nonrelativistic nuclear physics until
I became a community college physics instructor. I can only hope
that my pedagogical experience will compensate to some extent for
my shallow background, and that readers who find mistakes will be
kind enough to let me know about them using the contact informa-
tion provided at http://www.lightandmatter.com/area4author.
html.
11
1.1 Time and causality
Updating Plato’s allegory of the cave, imagine two super-intelligent
twins, Alice and Betty. They’re raised entirely by a robotic tutor
on a sealed space station, with no access to the outside world. The
robot, in accord with the latest fad in education, is programmed to
encourage them to build up a picture of all the laws of physics based
on their own experiments, without a textbook to tell them the right
answers. Putting yourself in the twins’ shoes, imagine giving up
all your preconceived ideas about space and time, which may turn
out according to relativity to be completely wrong, or perhaps only
approximations that are valid under certain circumstances.
Causality is one thing the twins will notice. Certain events re-
sult in other events, forming a network of cause and effect. One
general rule they infer from their observations is that there is an
unambiguously defined notion of betweenness: if Alice observes that
event 1 causes event 2, and then 2 causes 3, Betty always agrees that
2 lies between 1 and 3 in the chain of causality. They find that this
agreement holds regardless of whether one twin is standing on her
head (i.e., it’s invariant under rotation), and regardless of whether
one twin is sitting on the couch while the other is zooming around
the living room in circles on her nuclear fusion scooter (i.e., it’s also
invariant with respect to different states of motion).
You may have heard that relativity is a theory that can be inter-
preted using non-Euclidean geometry. The invariance of between-
ness is a basic geometrical property that is shared by both Euclidean
and non-Euclidean geometry. We say that they are both ordered
geometries. With this geometrical interpretation in mind, it will
be useful to think of events not as actual notable occurrences but
merely as an ambient sprinkling of points at which things could hap-
pen. For example, if Alice and Betty are eating dinner, Alice could
choose to throw her mashed potatoes at Betty. Even if she refrains,
there was the potential for a causal linkage between her dinner and
Betty’s forehead.
Betweenness is very weak. Alice and Betty may also make a
number of conjectures that would say much more about causality.
For example: (i) that the universe’s entire network of causality is
connected, rather than being broken up into separate parts; (ii) that
the events are globally ordered, so that for any two events 1 and 2,
either 1 could cause 2 or 2 could cause 1, but not both; (iii) not only
are the events ordered, but the ordering can be modeled by sorting
the events out along a line, the time axis, and assigning a number t,
time, to each event. To see what these conjectures would entail, let’s
discuss a few examples that may draw on knowledge from outside
Alice and Betty’s experiences.
Example: According to the Big Bang theory, it seems likely that
the network is connected, since all events would presumably connect
1
The possibility of having time come back again to the same point is often
referred to by physicists as a closed timelike curve (CTC). Kip Thorne, in his
popularization Black Holes and Time Warps, recalls experiencing some anxiety
after publishing a paper with “Time Machines” in the title, and later being
embarrassed when a later paper on the topic was picked up by the National
Enquirer with the headline PHYSICISTS PROVE TIME MACHINES EXIST.
“CTC” is safer because nobody but physicists know what it means.
2
This point is revisited in section 6.1.
3
Hafele and Keating, Science, 177 (1972), 168
4
These differences in velocity are not simply something that can be eliminated
by choosing a different frame of reference, because the clocks’ motion isn’t in
a straight line. The clocks back in Washington, for example, have a certain
acceleration toward the earth’s axis, which is different from the accelerations
experienced by the traveling clocks.
1.2.2 Muons
Although the Hafele-Keating experiment is impressively direct,
it was not the first verification of relativistic effects on time, it did
not completely separate the kinematic and gravitational effects, and
the effect was small. An early experiment demonstrating a large and
purely kinematic effect was performed in 1941 by Rossi and Hall,
who detected cosmic-ray muons at the summit and base of Mount
Washington in New Hampshire. The muon has a mean lifetime of
2.2 µs, and the time of flight between the top and bottom of the
mountain (about 2 km for muons arriving along a vertical path)
at nearly the speed of light was about 7 µs, so in the absence of
relativistic effects, the flux at the bottom of the mountain should
have been smaller than the flux at the top by about an order of
magnitude. The observed ratio was much smaller, indicating that
the “clock” constituted by nuclear decay processes was dramatically
slowed down by the motion of the muons.
gen maser clock which was used to control the frequency of a radio
signal. The radio signal was received on the ground, the nonrela-
tivistic Doppler shift was subtracted out, and the residual blueshift
was interpreted as the gravitational effect effect on time, matching
the relativistic prediction to an accuracy of 0.01%.
Section 1.3 Non-simultaneity and the maximum speed of cause and effect 17
and that all inertial frames of reference are equally valid. The best
that they can do is to compare clocks once Betty returns, and verify
that the net result of the trip was to make Betty’s clock run more
slowly on the average.
Alice and Betty can never satisfy their curiosity about exactly
when during Betty’s voyage the discrepancies accumulated or at
what rate. This is information that they can never obtain, but
they could obtain it if they had a system for communicating in-
stantaneously. We conclude that instantaneous communication is
impossible. There must be some maximum speed at which signals
can propagate — or, more generally, a maximum speed at which
cause and effect can propagate — and this speed must for example
be greater than or equal to the speed at which radio waves propa-
gate. It is also evident from these considerations that simultaneity
itself cannot be a meaningful concept in relativity.
O1-O2 express the same ideas as Euclid’s E1-E2. Not all lines
in the system will correspond physically to chains of causality; we
could have a line segment that describes a snapshot of a steel chain,
and O3-O4 then say that the order of the links is well defined. But
O3 and O4 also have clear physical significance for lines describing
causality. O3 forbids time travel paradoxes, like going back in time
and killing our own grandmother as a child; figure a illustrates why a
violation of O3 is referred to as a closed timelike curve. O4 says that
events are guaranteed to have a well-defined cause-and-effect order
only if they lie on the same line. This is completely different from
the attitude expressed in Newton’s famous statement: “Absolute,
true and mathematical time, of itself, and from its own nature flows
equably without regard to anything external . . . ”
If you’re dismayed by the austerity of a system of geometry with-
out any notion of measurement, you may be more appalled to learn
that even a system as weak as ordered geometry makes some state-
ments that are too strong to be completely correct as a foundation
for relativity. For example, if an observer falls into a black hole, at
some point he will reach a central point of infinite density, called a
singularity. At this point, his chain of cause and effect terminates,
violating O2. It is also an open question whether O3’s prohibition
on time-loops actually holds in general relativity; this is Stephen
Hawking’s playfully named chronology protection conjecture. We’ll
also see that in general relativity O1 is almost always true, but there
are exceptions.
find a much more lengthy list of axioms than the ones presented here. The
axioms I’m omitting take care of details like making sure that there are more
than two points in the universe, and that curves can’t cut through one another
without intersecting. The classic, beautifully written book on these topics is
H.S.M. Coxeter’s Introduction to Geometry, which is “introductory” in the sense
that it’s the kind of book a college math major might use in a first upper-division
course in geometry.
13
V.B. Braginskii and V.I. Panov, Soviet Physics JETP 34, 463 (1972).
14
Carusotto et al., “Limits on the violation of g-universality with a Galileo-
type experiment,” Phys Lett A183 (1993) 355. Freely available online at re-
searchgate.net.
15
Touboul et al., “The MICROSCOPE mission: first results of a space test of
the Equivalence Principle,” arxiv.org/abs/1712.01176
f / Wouldn’t it be nice if we could define the meaning of a Newtonian inertial frame of reference? New-
ton makes it sound easy: to define an inertial frame, just find some object that is not accelerating because it is
not being acted on by any external forces. But what object would we use? The earth? The “fixed stars?” Our
galaxy? Our supercluster of galaxies? All of these are accelerating — relative to something.
Lorentz frames
The conclusion is that we need to abandon the entire distinction
between Newton-style inertial and noninertial frames of reference.
The best that we can do is to single out certain frames of reference
defined by the motion of objects that are not subject to any non-
gravitational forces. A falling rock defines such a frame of reference.
In this frame, the rock is at rest, and the ground is accelerating. The
rock’s world-line is a straight line of constant x = 0 and varying t.
Such a free-falling frame of reference is called a Lorentz frame. The
frame of reference defined by a rock sitting on a table is an inertial
frame of reference according to the Newtonian view, but it is not a
Lorentz frame.
In Newtonian physics, inertial frames are preferable because they
make motion simple: objects with no forces acting on them move
along straight world-lines. Similarly, Lorentz frames occupy a privi-
leged position in general relativity because they make motion simple:
objects move along “straight” world-lines if they have no nongravi- g / An artificial horizon.
tational forces acting on them.
j / Two local Lorentz frames. A second way of stating the equivalence principle is that it is
always possible to define a local Lorentz frame in a particular neigh-
borhood of spacetime.18 It is not possible to do so on a universal
basis.
The locality of Lorentz frames can be understood in the anal-
ogy of the string stretched across the globe. We don’t notice the
curvature of the Earth’s surface in everyday life because the radius
of curvature is thousands of kilometers. On a map of LA, we don’t
notice any curvature, nor do we detect it on a map of Mumbai, but
it is not possible to make a flat map that includes both LA and
Mumbai without seeing severe distortions.
Terminology
The meanings of words evolve over time, and since relativity is
now a century old, there has been some confusing semantic drift
in its nomenclature. This applies both to “inertial frame” and to
“special relativity.”
18
This statement of the equivalence principle is summarized, along with some
other forms of it, in the back of the book on page 413.
Chiao’s paradox
The remainder of this subsection deals with the subtle ques-
tion of whether and how the equivalence principle can be applied to
charged particles. You may wish to skip it on a first reading. The
short answer is that using the equivalence principle to make con-
clusions about charged particles is like the attempts by slaveholders
and abolitionists in the 19th century U.S. to support their positions
based on the Bible: you can probably prove whichever conclusion
was the one you set out to prove.
The equivalence principle is not a single, simple, mathemati-
cally well defined statement.26 As an example of an ambiguity that
is still somewhat controversial, 90 years after Einstein first proposed
the principle, consider the question of whether or not it applies to
charged particles. Raymond Chiao27 proposes the following thought
experiment, which I’ll refer to as Chiao’s paradox. Let a neutral par-
ticle and a charged particle be set, side by side, in orbit around the
23
Eddington, op. cit.
24
Misner, Thorne, and Wheeler, op. cit., pp.163-164. Penrose, The Road to
Reality, 2004, p. 422. Taylor and Wheeler, Spacetime Physics, 1992, p. 132.
Schutz, A First Course in General Relativity, 2009, pp. 3, 141. Hobson, General
Relativity: An Introduction for Physicists, 2005, sec. 1.14.
25
arxiv.org/abs/0905.1929
26
A good recent discussion of this is “Theory of gravitation theories: a no-
progress report,” Sotiriou, Faraoni, and Liberati, http://arxiv.org/abs/0707.
2748
27
arxiv.org/abs/quant-ph/0601193v7
• What we have found agrees with Niels Bohr’s correspondence o / A simplified drawing of
principle, which states that when a new physical theory, such the 1903 experiment by Nichols
as relativity, replaces an older one, such as Newtonian physics, and Hull that verified the pre-
the new theory must agree with the old one under the experi- dicted momentum of light waves.
Two circular mirrors were hung
mental conditions in which the old theory had been verified by
from a fine quartz fiber, inside
experiments. The gravitational mass of a beam of light with an evacuated bell jar. A 150
energy E is E/c2 , and since c is a big number, it is not sur- mW beam of light was shone
prising that the weight of light rays had never been detected on one of the mirrors for 6 s,
before Einstein trying to detect it. producing a tiny rotation, which
was measurable by an optical
• This book describes one particular theory of gravity, Einstein’s lever (not shown). The force was
theory of general relativity. There are other theories of grav- within 0.6% of the theoretically
ity, and some of these, such as the Brans-Dicke theory, do predicted value of 0.001 µN.
just as well as general relativity in agreeing with the presently For comparison, a short clipping
available experimental data. Our prediction of gravitational of a single human hair weighs
∼ 1 µN.
Doppler shifts of light only depended on the equivalence princi-
ple, which is one ingredient of general relativity. Experimental
tests of this prediction only test the equivalence principle; they
do not allow us to distinguish between one theory of gravity
and another if both theories incorporate the equivalence prin-
ciple.
31
Phys. Rev. Lett. 4 (1960) 337
Problems 39
40 Chapter 1 A Geometrical Theory of Spacetime
Chapter 2
Geometry of Flat
Spacetime
The geometrical treatment of space, time, and gravity only requires
as its basis the equivalence of inertial and gravitational mass. Given
this assumption, we can describe the trajectory of any free-falling
test particle as a geodesic. Equivalence of inertial and gravitational
mass holds for Newtonian gravity, so it is indeed possible to redo
Newtonian gravity as a theory of curved spacetime. This project was
carried out by the French mathematician Cartan. The geometry of
the local reference frames is very simple. The three space dimensions
have an approximately Euclidean geometry, and the time dimension
is entirely separate from them. This is referred to as a Euclidean
spacetime with 3+1 dimensions. Although the outlook is radically
different from Newton’s, all of the predictions of experimental results
are the same.
The experiments in section 1.2 show, however, that there are
real, experimentally verifiable violations of Newton’s laws. In New-
tonian physics, time is supposed to flow at the same rate everywhere,
which we have found to be false. The flow of time is actually depen-
dent on the observer’s state of motion through space, which shows
that the space and time dimensions are intertwined somehow. The
geometry of the local frames in relativity therefore must not be as
simple as Euclidean 3+1. Their actual geometry was implicit in
Einstein’s 1905 paper on special relativity, and had already been
developed mathematically, without the full physical interpretation,
by Hendrik Lorentz. Lorentz’s and Einstein’s work were explicitly
connected by Minkowski in 1907, so a Lorentz frame is often referred
to as a Minkowski frame.
To describe this Lorentz geometry, we need to add more struc-
ture on top of the axioms O1-O4 of ordered geometry, but it will not
be the additional Euclidean structure of E3-E4, it will be something
different. To see how to proceed, let’s start by thinking about what
bare minimum of geometrical machinery is needed in order to set
up frames of reference.
41
2.1 Affine properties of Lorentz geometry
2.1.1 Parallelism and measurement
We think of a frame of reference as a body of measurements
or possible measurements to be made by some observer. Ordered
geometry lacks measure. The following argument shows that merely
by adding a notion of parallelism to our geometry, we automatically
gain a system of measurement.
We only expect Lorentz frames to be local, but we do need them
to be big enough to cover at least some amount of spacetime. If
a / Objects are released at Betty does an Eötvös experiment by releasing a pencil and a lead
rest at spacetime events P and ball side by side, she is essentially trying to release them at the
Q. They remain at rest, and their same event A, so that she can observe them later and determine
world-lines define a notion of
whether their world-lines stay right on top of one another at point
parallelism.
B. That was all that was required for the Eötvös experiment, but
in order to set up a Lorentz frame we need to start dealing with
objects that are not right on top of one another. Suppose we re-
lease two lead balls in two different locations, at rest relative to one
another. This could be the first step toward adding measurement
to our geometry, since the balls mark two points in space that are
separated by a certain distance, like two marks on a ruler, or the
goals at the ends of a soccer field. Although the balls are separated
by some finite distance, they are still close enough together so that
if there is a gravitational field in the area, it is very nearly the same
in both locations, and we expect the distance defined by the gap
between them to stay the same. Since they are both subject only to
b / There is no well-defined gravitational forces, their world-lines are by definition straight lines
angular measure in this ge-
(geodesics). The goal here is to end up with some kind of coordi-
ometry. In a different frame of
reference, the angles are not nate grid defining a (t, x) plane, and on such a grid, the two balls’
right angles. world-lines are vertical lines. If we release them at events P and
Q, then observe them again later at R and S, PQRS should form a
rectangle on such a plot. In the figure, the irregularly spaced tick
marks along the edges of the rectangle are meant to suggest that
although ordered geometry provides us with a well-defined ordering
along these lines, we have not yet constructed a complete system of
measurement.
The depiction of PQSR as a rectangle, with right angles at its
vertices, might lead us to believe that our geometry would have
something like the concept of angular measure referred to in Euclid’s
E4, equality of right angles. But this is too naive even for the
Euclidean 3+1 spacetime of Newton and Galileo. Suppose we switch
c / Simultaneity is not well to a frame that is moving relative to the first one, so that the balls
defined. The constant-time lines are not at rest. In the Euclidean spacetime, time is absolute, so
PQ and RS from figure a are events P and Q would remain simultaneous, and so would R and
not constant-time lines when S; the top and bottom edges PQ and RS would remain horizontal
observed in a different frame of on the plot, but the balls’ world-lines PR and QS would become
reference.
slanted. The result would be a parallelogram. Since observers in
prove that this line is unique, we argue by contradiction. Suppose some other
parallel m to exist. If m crosses the infinite line BQ at some point Z, then both
[ABPQ] and [ABPZ], so by A1, Q=Z, so the ` and m are the same. The only
other possibility is that m is parallel to BQ, but then the following chain of
parallelisms holds: PQ k AB k m k BQ. By A3, lines parallel to another line are
parallel to each other, so PQ k BQ, but this is a contradiction, since they have
Q in common.
2.1.2 Vectors
Vectors distinguished from scalars
We’ve been discussing subjects like the center of mass that in
freshman mechanics would be described in terms of vectors and
scalars, the distinction being that vectors have a direction in space
and scalars don’t. As we make the transition to relativity, we are
forced to refine this distinction. For example, we used to consider
time as a scalar, but the Hafele-Keating experiment shows that time
is different in different frames of reference, which isn’t something
that’s supposed to happen with scalars such as mass or temperature.
In affine geometry, it doesn’t make much sense to say that a vector
has a magnitude and direction, since non-parallel magnitudes aren’t
comparable, and there is no system of angular measurement in which
to describe a direction.
A better way of defining vectors and scalars is that scalars are
absolute, vectors relative. If I have three apples in a bowl, then all
observers in all frames of reference agree with me on the number
three. But if my terrier pup pulls on the leash with a certain force
vector, that vector has to be defined in relation to other things. It
might be three times the strength of some force that we define as
one newton, and in the same direction as the earth’s magnetic field.
In general, measurement means comparing one thing to another.
The number of apples in the bowl isn’t a measurement, it’s a count.
c e = 24
Here c represents the cuckoo clock and e the rotation of the earth.
Although the measurement relationship is nearly symmetric, the
arrow has a direction, because, for example, the measurement of
the earth’s rotational period in terms of the clock’s frequency is
c e = (24 hr)(1 hr−1 ) = 24, but the clock’s period in terms of the
earth’s frequency is e c = 1/24. We say that the relationship is
not symmetric but “dual.” By the way, it doesn’t matter how we
arrange these diagrams on the page. The notations c e and e c
mean exactly the same thing, and expressions like this can even be
drawn vertically.
Suppose that e is a displacement along some one-dimensional
line of time, and we want to think of it as the thing being measured.
Then we expect that the measurement process represented by c pro-
duces a real-valued result and is a linear function of e. Since the
relationship between c and e is dual, we expect that c also belongs
to some vector space. For example, vector spaces allow multiplica-
tion by a scalar: we could double the frequency of the cuckoo clock
by making the bird come out on the half hour as well as on the
hour, forming 2c. Measurement should be a linear function of both
vectors; we say it is “bilinear.”
Duality
The two vectors c and e have different units, hr−1 and hr, and
inhabit two different one-dimensional vector spaces. The “flavor” of
the vector is represented by whether the arrow goes into it or comes
out. Just as we used notation like → −v in freshman physics to tell
vectors apart from scalars, we can employ arrows in the birdtracks
notation as part of the notation for the vector, so that instead of
writing the two vectors as c and e, we can notate them as c and
e . Performing a measurement is like plumbing. We join the two
“pipes” in c e and simplify to c e .
A confusing and nonstandardized jungle of notation and termi-
nology has grown up around these concepts. For now, let’s refer to a
vector such as e , with the arrow coming in, simply as a “vector,”
and the type like c as a “dual vector.” In the one-dimensional
example of the earth and the cuckoo clock, the roles played by the
two vectors were completely equivalent, and it didn’t matter which
one we expressed as a vector and which as a dual vector. Example
5 shows that it is sometimes more natural to take one quantity as
3
The system used in this book follows the one defined by Cvitanović, which
was based closely on a graphical notation due to Penrose. For a more com-
plete exposition, see the Wikipedia article “Penrose graphical notation” and
Cvitanović’s online book at birdtracks.eu.
Scaling
In birdtracks notation, a scalar is a quantity that has no external
arrows at all. Since the expression c e = 24 has no external arrows,
only internal ones, it represents a scalar. This makes sense because
it’s a count, and a count is a scalar.
A convenient way of summarizing all of our categories of vari-
ables is by their behavior when we convert units, i.e., when we rescale
our space. If we switch our time unit from hours to minutes, the
number of apples in a bowl is unchanged, the earth’s period of ro-
tation gets 60 times bigger, and the frequency of the cuckoo clock
changes by a factor of 1/60. In other words, a quantity u under
rescaling of coordinates by a factor α becomes αp u, where the ex-
ponents −1, 0, and +1 correspond to dual vectors, scalars, and vec-
tors, respectively. We can therefore see that these distinctions are of
interest even in one dimension, contrary to what one would have ex-
pected from the freshman-physics concept of a vector as something
transforming in a certain way under rotations.
Geometrical visualization
In two dimensions, there are natural ways of visualizing the dif-
ferent vector spaces inhabited by vectors and dual vectors. We’ve
already been describing a vector like e as a displacement. Its
vector space is the space of such displacements.4 A vector in the
dual space such as c can be visualized as a set of parallel, evenly
h / 1. A displacement vector.
spaced lines on a topographic map, h/2, with an arrowhead to show
2. A vector from the space dual
to the space of displacements.
which way is “uphill.” The act of measurement consists of counting
3. Measurement is reduced how many of these lines are crossed by a certain vector, h/3.
to counting. The cuckoo clock Given a scalar field f , its gradient grad f at any given point
chimes 24 times in one rotation
is a dual vector. In birdtracks notation, we have to indicate this
of the earth.
by writing it with an outward-pointing arrow, (grad f ) . Because
gradients occur so frequently, we have a special shorthand for them,
which is simply a circle:
f
In the context of spacetime with a metric and curvature, we’ll see
that the usual definition of the gradient in terms of partial deriva-
tives should be modified with correction terms to form something
called a covariant derivative. When we get to that point on p. 178,
we’ll commandeer the circle notation for that operation.
i / Constant-temperature curves Force is a dual vector Example: 5
for January in North America, at
intervals of 4 ◦ C. The tempera-
The dot product dW = F · dx for computing mechanical work
ture gradient at a given point is a 4
In terms of the primitive notions used in the axiomatization in section 2.1,
dual vector. a displacement could be described as an equivalence class of segments such that
for any two segments in the class AB and CD, AB and CD form a parallelogram.
dW = F dx.
C s
I There is no flow through the top and bottom. This case cor-
responds to Galilean relativity, in which the rectangle shears
horizontally under a boost, and simultaneity is preserved, vi-
olating L5.
II Area flows downward at both the top and the bottom. The
flow is clockwise at both the positive t axis and the positive
x axis. This makes it plausible that the flow is clockwise ev-
erywhere in the (t, x) plane, and the proof is straightforward.7
7
Proof: By linearity of L, the flow is clockwise at the negative axes as well.
III Area flows upward at both the top and the bottom.
Only case III is possible, and given case III, there must be at least
one point P in the first quadrant where area flows neither clockwise
nor counterclockwise.8 The boost simply increases P’s distance from
the origin by some factor. By the linearity of the transformation,
the entire line running through O and P is simply rescaled. This
special line’s inverse slope, which has units of velocity, apparently
has some special significance, so we give it a name, c. We’ll see later
that c is the maximum speed of cause and effect whose existence
we inferred in section 1.3. Any world-line with a velocity equal to
c retains the same velocity as judged by moving observers, and by
isotropy the same must be true for −c.
For convenience, let’s adopt time and space units in which c = 1,
and let the original rectangle be a unit square. The upper right
tip of the parallelogram must slide along the line through the origin
with slope +1, and similarly the parallelogram’s other diagonal must
have a slope of −1. Since these diagonals bisected one another on
the original square, and since bisection is an affine property that
is preserved when we change frames of reference, the parallelogram
must be equilateral.
We can now determine the complete form of the Lorentz transfor-
mation. Let unit square PQRS, as described above, be transformed
to parallelogram P0 Q0 R0 S0 in the new coordinate system (x0 , t0 ). Let
the t0 coordinate of R0 be γ, interpreted as the ratio between the
time elapsed on a clock moving from P0 to R0 and the corresponding
time as measured by a clock that is at rest in the (x0 , t0 ) frame. By
the definition of v, R0 has coordinates (vγ, γ), and the other geo-
metrical facts established above place Q0 symmetrically on the other
side of the diagonal, at (γ, vγ). Computing the cross product of vec-
c / Unit square PQRS is Lorentz-
tors P0 R0 and P0 Q0 , we find the area of P0 Q0 R0 S0 to be γ 2 (1 − v 2 ),
boosted to the parallelogram
P0 Q0 R0 S0 . and setting this equal to 1 gives
1
γ=√ .
1 − v2
Also by linearity, the handedness of the flow is the same at all points on a ray
extending out from the origin in the direction θ. If the flow were counterclockwise
somewhere, then it would have to switch handedness twice in that quadrant, at
θ1 and θ2 . But by writing out the vector cross product r × dr, where dr is the
displacement caused by L(dv), we find that it depends on sin(2θ +δ), which does
not oscillate rapidly enough to have two zeroes in the same quadrant.
8
This follows from the fact that, as shown in the preceding footnote, the
handedness of the flow depends only on θ.
∆t 1
≈ v 2 − ∆Φ,
t 2
v2
1
+∆Φ − = 5.2 × 10−10 − 0.9 × 10−10 ,
c2 2
Time dilation in the Pound-Rebka experiment Example: 13 h / The change in the frequency
In the description of the Pound-Rebka experiment on page 34, I of x-ray photons emitted by 57 Fe
postponed the quantitative estimation of the frequency shift due as a function of temperature,
to temperature. Classically, one expects only a broadening of drawn after Pound And Rebka
(1960). Dots are experimental
the line, since the Doppler shift is proportional to vk /c, where
measurements. The solid curve
vk , the component of the emitting atom’s velocity along the line is Pound and Rebka’s theoretical
of sight, averages to zero. But relativity tells us to expect that if calculation using the Debye the-
the emitting atom is moving, its time will flow more slowly, so the ory of the lattice vibrations with
frequency of the light it emits will also be systematically shifted a Debye temperature of 420 de-
downward. This frequency shift should increase with tempera- grees C. The dashed line is one
ture. In other words, the Pound-Rebka experiment was designed with the slope calculated in the
as a test of general relativity (the equivalence principle), but this text using a simplified treatment
of the thermodynamics. There is
special-relativistic effect is just as strong as the relativistic one,
an arbitrary vertical offset in the
and needed to be accounted for carefully. experimental data, as well as the
theoretical curves.
11
Bailey at al., Nucl. Phys. B150(1979) 1
12
Phys. Rev. Lett. 4 (1960) 337
13
Phys. Rev. Lett. 4 (1960) 274
14
livingreviews.org/lrr-2005-5
2.4.2 Observer-independence of c
The constancy of the speed of light for observers in all frames of
reference was originally detected in 1887 when Michelson and Morley
set up a clever apparatus to measure any difference in the speed of
light beams traveling east-west and north-south. The motion of
the earth around the sun at 110,000 km/hour (about 0.01% of the
speed of light) is to our west during the day. Michelson and Morley
believed that light was a vibration of a physical medium, the ether,
so they expected that the speed of light would be a fixed value
relative to the ether. As the earth moved through the ether, they
15
http://arxiv.org/abs/0908.1832
where we assume that the square has negligible size, so that all four
Lorentz boosts act in a way that preserves the origin of the coordi-
nate systems. (We have no convenient way in our notation L(. . .) to
describe a transformation that does not preserve the origin.) The
first transformation, L(−vŷ), changes coordinates measured by the
original gyroscope-defined frame to new coordinates measured by
the new gyroscope-defined frame, after the box has been acceler-
ated in the positive y direction.
19 [ 0 + . . . ]
20
I’ve omitted some output generated automatically from the earlier steps in
the computation. The (%o9) indicates that this is Maxima’s output from the
ninth and final step.
In other words,
0 0
T 1 = 1 + ...,
0 −v 2
21
Although we will not need any more than this for the purposes of our present
analysis, a longer and more detailed discussion by Rhodes and Semon, www.
bates.edu/~msemon/RhodesSemonFinal.pdf, Am. J. Phys. 72(7)2004, shows
that this type of inertially guided, constant-thrust motion is always represented
on the velocity disk by an arc of a circle that is perpendicular to the disk at its
edge. (We consider a diameter of the disk to be the limiting case of a circle with
infinite radius.)
1 Suppose that we don’t yet know the exact form of the Lorentz
transformation, but we know based on the Michelson-Morley exper-
iment that the speed of light is the same in all inertial frames, and
we’ve already determined, e.g., by arguments like those on p. 71,
that there can be no length contraction in the direction perpendic-
ular to the motion. We construct a “light clock,” consisting simply
of two mirrors facing each other, with a light pulse bouncing back
and forth between them.
(a) Suppose this light clock is moving at a constant velocity v in the
direction perpendicular to its own optical arm, which is of length L.
Use the Pythagorean theorem√to prove that the clock experiences a
time dilation given by γ = 1/ 1 − v 2 , thereby fixing the time-time
portion of the Lorentz transformation.
(b) Why is it significant for the interpretation of special relativity
that the result from part a is independent of L?
(c) Carry out a similar calculation in the case where the clock moves
with constant acceleration a as measured in some inertial frame. Al-
though the result depends on L, prove that in the limit of small L,
we recover the earlier constant-velocity result, with no explicit de-
pendence on a.
Remark: Some authors state a “clock postulate” for special relativity, which
says that for a clock that is sufficiently small, the rate at which it runs de-
pends only on v, not a (except in the trivial sense that v and a are related
by calculus). The result of part c shows that the clock “postulate” is really a
theorem, not a statement that is logically independent of the other postulates
of special relativity. Although this argument only applies to a particular fam-
ily of light clocks of various sizes, one can also make any small clock into an
acceleration-insensitive clock, by attaching an accelerometer to it and apply-
ing an appropriate correction to compensate for the clock’s observed sensitivity
to acceleration. (It’s still necessary for the clock to be small, since otherwise
the lack of simultaneity in relativity makes it impossible to describe the whole
clock as having a certain acceleration at a certain instant.) Farley at al.22 have
verified the “clock postulate” to within 2% for the radioactive decay of muons
with γ ∼ 12 being accelerated by magnetic fields at 5 × 1018 m/s2 . Some peo-
ple get confused by this acceleration-independent property of small clocks and
think that it contradicts the equivalence principle. For a good explanation, see
http://math.ucr.edu/home/baez/physics/Relativity/SR/clock.html.
. Solution, p. 388
22
Nuovo Cimento 45 (1966) 281
Problems 83
2 Some of the most conceptually direct tests of relativistic time
dilation were carried out by comparing the rates of twin atomic
clocks, one left on a mountaintop for a certain amount of time, the
other in a nearby valley below.23 Unlike the clocks in the Hafele-
Keating experiment, these are stationary for almost the entire dura-
tion of the experiment, so any time dilation is purely gravitational,
not kinematic. One could object, however, that the clocks are not
A graph from the paper by really at rest relative to one another, due to the earth’s rotation.
Iijima, showing the time differ- This is an example of how the distinction between gravitational
ence between the two clocks. and kinematic time dilations is frame-dependent, since the effect is
One clock was kept at Mitaka purely gravitational in the rotating frame, where the gravitational
Observatory, at 58 m above sea
field is reduced by the fictitious centrifugal force. Show that, in the
level. The other was moved
back and forth between a second
non-rotating frame, the ratio of the kinematic effect to the gravi-
observatory, Norikura Corona tational one comes out to be 2.8 × 10−3 at the latitude of Tokyo.
Station, and the peak of the This small value indicates that the experiment can be interpreted
Norikura volcano, 2876 m above as a very pure test of the gravitational time dilation effect. To cal-
sea level. The plateaus on the culate the effect, you will need to use the fact that, as discussed
graph are data from the periods on p. 33, gravitational redshifts can be interpreted as gravitational
when the clocks were compared time dilations. . Solution, p. 388
side by side at Mitaka. The
difference between one plateau 3 (a) On p. 81 (see figure j), we showed that the Thomas pre-
and the next is the gravitational cession is proportional to area on the velocity disk. Use a similar
time dilation accumulated during argument to show that the Sagnac effect (p. 73) is proportional to
the period when the mobile clock
the area enclosed by the loop.
was at the top of Norikura.
(b) Verify this more directly in the special case of a circular loop.
(c) Show that a light clock of the type described in problem 1 is
insensitive to rotation with constant angular velocity.
(d) Connect these results to the commutativity and transitivity as-
sumptions in the Einstein clock synchronization procedure described
on p. ??. . Solution, p. 388
4 Example 14 on page 64 discusses relativistic bounds on the
properties of matter, using the example of pulling a bucket out of a
black hole. Derive a similar bound by considering the possibility of
sending signals out of the black hole using longitudinal vibrations of
a cable, as in the child’s telephone made of two tin cans connected
by a piece of string.
Remark: Surprisingly subtle issues can arise in such calculations; see A.Y.
Shiekh, Can. J. Phys. 70, 458 (1992). For a quantitative treatment of a dangling
rope in relativity, see Greg Egan, “The Rindler Horizon,” http://gregegan.
customer.netspace.net.au/SCIENCE/Rindler/RindlerHorizon.html.
23
L. Briatore and S. Leschiutta, “Evidence for the earth gravitational shift by
direct atomic-time-scale comparison,” Il Nuovo Cimento B, 37B (2): 219 (1977).
Iijima et al., “An experiment for the potential blue shift at the Norikura Corona
Station,” Annals of the Tokyo Astronomical Observatory, Second Series, Vol.
XVII, 2 (1978) 68.
Problems 85
86 Chapter 2 Geometry of Flat Spacetime
Chapter 3
Differential Geometry
General relativity is described mathematically in the language of
differential geometry. Let’s take those two terms in reverse order.
The geometry of spacetime is non-Euclidean, not just in the
sense that the 3+1-dimensional geometry of Lorentz frames is dif-
ferent than that of 4 interchangeable Euclidean dimensions, but also
in the sense that parallels do not behave in the way described by
E5 or A1-A3. In a Lorentz frame, which describes space without
any gravitational fields, particles whose world-lines are initially par-
allel will continue along their parallel world-lines forever. But in
the presence of gravitational fields, initially parallel world-lines of
free-falling particles will in general diverge, approach, or even cross.
Thus, neither the existence nor the uniqueness of parallels can be
assumed. We can’t describe this lack of parallelism as arising from
the curvature of the world-lines, because we’re using the world-lines
of free-falling particles as our definition of a “straight” line. Instead,
we describe the effect as coming from the curvature of spacetime it-
self. The Lorentzian geometry is a description of the case in which
this curvature is negligible.
What about the word differential ? The equivalence principle
states that even in the presence of gravitational fields, local Lorentz
frames exist. How local is “local?” If we use a microscope to zoom in
on smaller and smaller regions of spacetime, the Lorentzian approx-
imation becomes better and better. Suppose we want to do experi-
ments in a laboratory, and we want to ensure that when we compare
some physically observable quantity against predictions made based
on the Lorentz geometry, the resulting discrepancy will not be too
large. If the acceptable error is , then we should be able to get the
error down that low if we’re willing to make the size of our labora-
tory no bigger than δ. This is clearly very similar to the Weierstrass
style of defining limits and derivatives in calculus. In calculus, the
idea expressed by differentiation is that every smooth curve can be
approximated locally by a line; in general relativity, the equivalence
principle tells us that curved spacetime can be approximated locally
by flat spacetime. But consider that no practitioner of calculus ha-
bitually solves problems by filling sheets of scratch paper with ep-
silons and deltas. Instead, she uses the Leibniz notation, in which dy
and dx are interpreted as infinitesimally small numbers. You may
be inclined, based on your previous training, to dismiss infinitesi-
87
mals as neither rigorous nor necessary. In 1966, Abraham Robinson
demonstrated that concerns about rigor had been unfounded; we’ll
come back to this point in section 3.3. Although it is true that any
calculation written using infinitesimals can also be carried out using
limits, the following example shows how much more well suited the
infinitesimal language is to differential geometry.
Areas on a sphere Example: 1
TheR area of a region S in the Cartesian plane can be calculated
as S dA, where dA = dx dy is the area of an infinitesimal rectan-
gle of width dx and height dy. A curved surface such as a sphere
does not admit a global Cartesian coordinate system in which the
constant coordinate curves are both uniformly spaced and per-
pendicular to one another. For example, lines of longitude on the
earth’s surface grow closer together as one moves away from the
equator. Letting θ be the angle with respect to the pole, and φ the
azimuthal angle, the approximately rectangular patch bounded by
θ, θ+dθ, φ, and φ+dφ has width r sin θ dθ and height r dφ, giving
dA = r 2 sin θ dθ dφ. If you look at the corresponding derivation in
an elementary calculus textbook that strictly eschews infinitesi-
mals, the technique is to start from scratch with Riemann sums.
This is extremely laborious, and moreover must be carried out
again for every new case. In differential geometry, the curvature
of the space varies from one point to the next, and clearly we
don’t want to reinvent the wheel with Riemann sums an infinite
number of times, once at each point in space.
3.3 Models
A typical first reaction to the phrase “curved spacetime” — or even
“curved space,” for that matter — is that it sounds like nonsense.
How can featureless, empty space itself be curved or distorted? The
concept of a distortion would seem to imply taking all the points
and shoving them around in various directions as in a Picasso paint-
ing, so that distances between points are altered. But if space has
no identifiable dents or scratches, it would seem impossible to deter-
mine which old points had been sent to which new points, and the
distortion would have no observable effect at all. Why should we
expect to be able to build differential geometry on such a logically
dubious foundation? Indeed, historically, various mathematicians
have had strong doubts about the logical self-consistency of both
non-Euclidean geometry and infinitesimals. And even if an authori-
tative source assures you that the resulting system is self-consistent,
its mysterious and abstract nature would seem to make it difficult
for you to develop any working picture of the theory that could
play the role that mental sketches of graphs play in organizing your
knowledge of calculus.
Models provide a way of dealing with both the logical issues and
the conceptual ones. Figure a on page 90 “pops” off of the page,
presenting a strong psychological impression of a curved surface ren-
dered in perspective. This suggests finding an actual mathematical
object, such as a curved surface, that satisfies all the axioms of a
certain logical system, such as non-Euclidean geometry. Note that
the model may contain extrinsic elements, such as the existence of
a third dimension, that are not connected to the system being mod-
eled.
Let’s focus first on consistency. In general, what can we say
about the self-consistency of a mathematical system? To start with,
we can never prove anything about the consistency or lack of consis-
a / Tullio Levi-Civita (1873- tency of something that is not a well-defined formal system, e.g., the
1941) worked on models of Bible. Even Euclid’s Elements, which was a model of formal rigor for
number systems possessing thousands of years, is loose enough to allow considerable ambiguity.
infinitesimals and on differential If you’re inclined to scoff at the silly Renaissance mathematicians
geometry. He invented the
who kept trying to prove the parallel postulate E5 from postulates
tensor notation, which Einstein
learned from his textbook. He E1-E4, consider the following argument. Suppose that we replace
was appointed to prestigious E5 with E50 , which states that parallels don’t exist: given a line and
endowed chairs at Padua and the a point not on the line, no line can ever be drawn through the point
University of Rome, but was fired and parallel to the given line. In the new system of plane geometry
in 1938 because he was a Jew E0 consisting of E1-E4 plus E50 , we can prove a variety of theorems,
and an anti-fascist. and one of them is that there is an upper limit on the area of any fig-
ure. This imposes a limit on the size of circles, and that appears to
contradict E3, which says we can construct a circle with any radius.
a / Example 5.
Frames moving at c?
A good application of these ideas is to the question of what the
world would look like in a frame of reference moving at the speed
of light. This question has a long and honorable history. As a
young student, Einstein tried to imagine what an electromagnetic
wave would look like from the point of view of a motorcyclist riding
alongside it. We now know, thanks to Einstein himself, that it really
doesn’t make sense to talk about such observers.
The most straightforward argument is based on the positivist
idea that concepts only mean something if you can define how to
measure them operationally. If we accept this philosophical stance
(which is by no means compatible with every concept we ever discuss
in physics), then we need to be able to physically realize this frame
in terms of an observer and measuring devices. But we can’t. It
would take an infinite amount of energy to accelerate Einstein and
his motorcycle to the speed of light.
Since arguments from positivism can often kill off perfectly in-
teresting and reasonable concepts, we might ask whether there are
other reasons not to allow such frames. There are. Recall that
we placed two technical conditions on coordinate transformations:
they are supposed to be smooth and one-to-one. The smoothness
condition is related to the inability to boost Einstein’s motorcycle
into the speed-of-light frame by any continuous, classical process.
(Relativity is a classical theory.) But independent of that, we have
a problem with the one-to-one requirement. Figure b shows what
happens if we do a series of Lorentz boosts to higher and higher b / A series of Lorentz boosts
velocities. It should be clear that if we could do a boost up to a ve- acts on a square.
locity of c, we would have effected a coordinate transformation that
was not one-to-one. Every point in the plane would be mapped onto
a single lightlike line.
1
T(ab) = (Tab + Tba )
2
1
T[ab] = (Tab − Tba )
2
Any Tab can be split into symmetric and antisymmetric parts. This
is similar to writing an arbitrary function as a sum of and odd
function and an even function. The metric has only a symmetric
part: g(ab) = gab , and g[ab] = 0. This notation is generalized to
ranks greater than 2 on page 184.
Self-check: Characterize an antisymmetric rank-2 tensor in two
dimensions.
A change of scale Example: 7
. Suppose we start by describing the Euclidean plane with a cer-
tain set of Cartesian coordinates, but then want to change to a
new set of coordinates that are rescaled compared to the original
ones. How is the effect of this rescaling represented in g?
. If we change our units of measurement so that x µ → αx µ , while
demanding that ds2 come out the same, then we need gµν →
α−2 gµν .
Comparing with p. 48, we deduce the general rule that a tensor
of rank (m, n) transforms under scaling by picking up a factor of
αm−n .
This whole notion of scaling and units in general relativity turns
out to be nontrivial and interesting. See section 5.11, p. 202, for
a more detailed discussion.
Polar coordinates Example: 8
Consider polar coordinates (r , θ) in a Euclidean plane. The const-
ant-coordinate curves happen to be orthogonal everywhere, so
the off-diagonal elements of the metric gr θ and gθr vanish. In-
finitesimal coordinate changes dr and dθ correspond to infinitesi-
mal displacements dr and r dθ in orthogonal directions, so by the
Pythagorean theorem, ds2 = dr 2 + r 2 dθ2 , and we read off the
elements of the metric gr r = 1 and gθθ = r 2 .
Notice how in example 8 we started from the generally valid
relation ds2 = gµν dxµ dxν , but soon began writing down facts like
gθθ = r2 that were only valid in this particular coordinate system.
To make it clear when this is happening, we maintain the distinction
between abtract Latin indices and concrete Greek indices introduced
on p. 50. For example, we can write the general expression for
squared differential arc length with Latin indices,
ds2 = gij dx i dx j
= ds2 (cot2 φ + csc2 φ − 2g12 cot φ csc φ)
g12 = cos φ.
c / Example 9.
Area Example: 10
In one dimension, g is a single number, and lengths are given by
√
ds = g dx. The square root can also be understood through
example 7 on page 103, in which we saw that a uniform rescaling
x → αx is reflected in gµν → α−2 gµν .
In two-dimensional Cartesian coordinates, multiplication of the
width and height of a rectangle gives the element of area dA =
√
g11 g22 dx 1 dx 2 . Because the coordinates are orthogonal, g is
√
diagonal, and the factor of p g11 g22 is identified as the square root
of its determinant, so dA = |g| dx 1 dx 2 . Note that the scales on
the two axes are not necessarily the same, g11 6= g22 .
The same expression for the element of area holds even if the co-
ordinates
p pare not orthogonal. In example 9, for instance, we have
|g| = 1 − cos2 φ = sin φ, which is the right correction factor
corresponding to the fact that dx 1 and dx 2 form a parallelepiped
rather than a rectangle.
Area of a sphere Example: 11
For coordinates (θ, φ) on the surface of a sphere of radius r , we
have, by an argument similar to that of example 8 on page 103,
= 4πr 2
xa = gab xb
xa = g ab xb
Aab = g ac Acb ,
g ab = g ac gcb .
But we already know that g ... is simply the inverse matrix of g...
(example 12, p. 105), which means that g ab is simply the identity
matrix. That is, whereas a quantity like gab or g ab carries all the
information about our system of measurement at a given point,
g ab carries no information at all. Where gab or g ab can have both
positive and negative elements, elements that have units, and
off-diagonal elements, g ab is just a generic symbol carrying no
information other than the dimensionality of the space.
The metric tensor is so commonly used that it is simply left out of
birdtrack diagrams. Consistency is maintained because because
g ab is the identity matrix, so → g → is the same as →→.
The reason this quantity always comes out positive is that for two
vectors of fixed magnitude, the greatest dot product is always
achieved in the case where they lie along the same direction.
In Lorentzian geometry, the situation is different. Let b and c be
timelike vectors, so that they represent possible world-lines. Then
the relation a = b+c suggests the existence of two observers who
take two different paths from one event to another. A goes by a
direct route while B takes a detour. The magnitude of each time-
like vector represents the time elapsed on a clock carried by the
5
In mathematics, a group is defined as a binary operation that has an identity,
inverses, and associativity. For example, addition of integers is a group. In the
present context, the members of the group are not numbers but the transforma-
tions applied to the Euclidean plane. The group operation on transformations
T1 and T2 consists of finding the transformation that results from doing one and
then the other, i.e., composition of functions.
6
The discontinuous transformations of spatial reflection and time reversal are
not included in the definition of the Poincaré group, although they do preserve
inner products. General relativity has symmetry under spatial reflection (called
P for parity), time reversal (T), and charge inversion (C), but the standard
model of particle physics is only invariant under the composition of all three,
CPT, not under any of these symmetries individually.
7
Proof: Let b and c be parallel and timelike, and directed forward in time.
Adopt a frame of reference in which every spatial component of each vector
vanishes. This entails no loss of generality, since inner products are invariant
under such a transformation. Since the time-ordering is also preserved under
transformations in the Poincaré group, each is still directed forward in time, not
backward. Now let b and c be pulled away from parallelism, like opening a pair
of scissors in the x − t plane. This reduces bt ct , while causing bx cx to become
negative. Both effects increase the inner product.
8
The example is described in Einstein’s paper “The Foundation of the General
Theory of Relativity.” An excerpt, which includes the example, is given on p. ??.
Ehrenfest’s paradox
Ehrenfest11 described the following paradox. Suppose that ob-
server B, in the lab frame, measures the radius of the disk to be r
when the disk is at rest, and r0 when the disk is spinning. B can
also measure the corresponding circumferences C and C 0 . Because
B is in an inertial frame, the spatial geometry does not appear non-
Euclidean according to measurements carried out with his meter
sticks, and therefore the Euclidean relations C = 2πr and C 0 = 2πr0
both hold. The radial lines are perpendicular to their own motion,
and they therefore have no length contraction, r = r0 , implying
C = C 0 . The outer edge of the disk, however, is everywhere tan-
gent to its own direction of motion, so it is Lorentz contracted, and
therefore C 0 < C. The resolution of the paradox is that it rests on
the incorrect assumption that a rigid disk can be made to rotate.
If a perfectly rigid disk was initially not rotating, one would have
to distort it in order to set it into rotation, because once it was
9
Relativistic description of a rotating disk, Am. J. Phys. 43 (1975) 869
10
Space, Time, and Coordinates in a Rotating World, http://www.phys.uu.
nl/igg/dieks
11
P. Ehrenfest, Gleichförmige Rotation starrer Körper und Relativitätstheorie,
Z. Phys. 10 (1909) 918, available in English translation at en.wikisource.org.
ωr2
dt = dθ0 .
1 − ω2 r2
Substituting this into the metric, we are left with the purely spatial
metric
r2
[3] ds2 = − dr2 − dθ02 .
1 − ω2 r2
12
The paper is reproduced in the back of the book, and the relevant part is
on p. ??.
Problems 121
11 In the early decades of relativity, many physicists were in the
habit of speaking as if the Lorentz transformation described what an
observer would actually “see” optically, e.g., with an eye or a camera.
This is not the case, because there is an additional effect due to opti-
cal aberration: observers in different states of motion disagree about
the direction from which a light ray originated. This is analogous
to the situation in which a person driving in a convertible observes
raindrops falling from the sky at an angle, even if an observer on the
sidewalk sees them as falling vertically. In 1959, Terrell and Penrose
independently provided correct analyses,17 showing that in reality
an object may appear contracted, expanded, or rotated, depending
on whether it is approaching the observer, passing by, or receding.
The case of a sphere is especially interesting. Consider the following
four cases:
A The sphere is not rotating. The sphere’s center is at rest. The
observer is moving in a straight line.
17
James Terrell, “Invisibility of the Lorentz Contraction,” Physical Review 116
(1959) 1045. Roger Penrose, “The Apparent Shape of a Relativistically Moving
Sphere,” Proceedings of the Cambridge Philosophical Society 55 (1959) 139.
123
but so is 2τ +7. Less trivially, a photon’s proper time is always zero,
but one can still define an affine parameter along its trajectory. We
will need such an affine parameter, for example, in section 6.2.8,
page 233, when we calculate the deflection of light rays by the sun,
one of the early classic experimental tests of general relativity.
Another example of a Lorentz scalar is the pressure of a perfect
fluid, which is often assumed as a description of matter in cosmo-
logical models.
Infinitesimals and the clock “postulate” Example: 1
At the beginning of chapter 3, I motivated the use of infinitesimals
as useful tools for doing differential geometry in curved space-
time. Even in the context of special relativity, however, infinitesi-
mals can be useful. One way of expressing the proper time accu-
mulated on a moving clock is
Z
s = ds
Z q
= gij dx i dx j
s
Z 2 2 2
dx dy dz
= 1− − − dt,
dt dt dt
4.2 Four-vectors
4.2.1 The velocity and acceleration four-vectors
Our basic Lorentz vector is the spacetime displacement dxi . Any
other quantity that has the same behavior as dxi under rotations
and boosts is also a valid Lorentz vector. Consider a particle moving
through space, as described in a Lorentz frame. Since the particle
may be subject to nongravitational forces, the Lorentz frame can-
not be made to coincide (except perhaps momentarily) with the
particle’s rest frame. If dxi is not lightlike, then the corresponding
infinitesimal proper time interval dτ is nonzero. As with Newtonian
three-vectors, dividing a four-vector by a Lorentz scalar produces
another quantity that transforms as a four-vector, so dividing the
infinitesimal displacement by a nonzero infinitesimal proper time
interval, we have the four-velocity vector v i = dxi / dτ , whose com-
ponents in a Lorentz coordinate system are (γ, γu1 , γu2 , γu3 ), where
(u1 , u2 , u3 ) is the ordinary three-component velocity vector as de-
fined in classical mechanics. The four-velocity’s squared magnitude
ẗ 2 − ẍ 2 = −a2 .
1
The solution of these differential equations is t = a sinh aτ,
x = a1 cosh aτ, and eliminating τ gives
1p
x= 1 + a2 t 2 .
a
As t approaches infinity, dx/ dt approaches the speed of light.
Particles traveling at c
The definition of four-momentum as pi = mv i only works for
particles that move at less than c. For those that move at c, the
four-velocity is undefined. As we’ll see in example 6 on p. 129, this
class of particles is exactly those that are massless. As shown on
p. 32, the three-momentum of a light wave is given by p = E. The
fact that this momentum is nonzero implies that for light pi = mv i
represents an indeterminate form. The fact that this momentum
equals E is consistent with our definition of mass as m2 = E 2 − p2 .
6
Goldhaber and Nieto, ”Terrestrial and Extraterrestrial Limits on The Pho-
ton Mass,” Rev. Mod. Phys. 43 (1971) 277
7
See, e.g., Hasselkamp, Mondry, and Scharmann, Zeitschrift für Physik A:
Hadrons and Nuclei 289 (1979) 151.
8
G. Saathoff et al., “Improved Test of Time Dilation in Relativity,” Phys.
Rev. Lett. 91 (2003) 190403. A publicly available description of the experiment
is given in Saathoff’s PhD thesis, www.mpi-hd.mpg.de/ato/homes/saathoff/
diss-saathoff.pdf.
∂A
E = −∇Φ −
∂t
B = ∇A.
∂x0µ
[1] v 0µ = v κ
∂xκ
∂xκ
[2] vµ0 = vκ 0µ .
∂x
Note the inversion of the partial derivative in one equation compared
to the other. Because these equations describe a change from one
coordinate system to another, they clearly depend on the coordinate
system, so we use Greek indices rather than the Latin ones that
would indicate a coordinate-independent equation. Note that the
letter µ in these equations always appears as an index referring to
the new coordinates, κ to the old ones. For this reason, we can get
away with dropping the primes and writing, e.g., v µ = v κ ∂x0µ /∂xκ
rather than v 0 , counting on context to show that v µ is the vector
expressed in the new coordinates, v κ in the old ones. This becomes
especially natural if we start working in a specific coordinate system
10
The Feynman Lectures on Physics, Feynman, Leighton, and Sands, Addison
Wesley Longman, 1970
but this would give infinite results for the mixed terms! Only in the
case of functions of a single variable is it possible to flip deriva-
tives in this way; it doesn’t work for partial derivatives. To evalu-
ate these partial derivatives, we have to invert the transformation
(which in this example is trivial to accomplish) and then take the
partial derivatives.
The metric is a rank-2 tensor, and transforms analogously:
∂xκ ∂xλ
gµν = gκλ
∂x0µ ∂x0ν
(writing g rather than g 0 on the left, because context makes the
distinction clear).
12
This statement of the equivalence principle, along with the others we have
encountered, is summarized in the back of the book on page 413.
13
arxiv.org/abs/hep-ph/9703240
n → p + e− + ν̄
p + e− → n + ν,
which happen due to the weak nuclear force. The first of these re-
leases 0.8 MeV, and has a half-life of 14 minutes. This explains
why free neutrons are not observed in significant numbers in our
universe, e.g., in cosmic rays. The second reaction requires an input
of 0.8 MeV of energy, so a free hydrogen atom is stable. The white
dwarf contains fairly heavy nuclei, not individual protons, but sim-
ilar considerations would seem to apply. A nucleus can absorb an
electron and convert a proton into a neutron, and in this context the
process is called electron capture. Ordinarily this process will only
occur if the nucleus is neutron-deficient; once it reaches a neutron-
to-proton ratio that optimizes its binding energy, neutron capture
cannot proceed without a source of energy to make the reaction go.
In the environment of a white dwarf, however, there is such a source.
The annihilation of an electron opens up a hole in the “Fermi sea.”
There is now an state into which another electron is allowed to drop
without violating the exclusion principle, and the effect cascades
upward. In a star with a mass above the Chandrasekhar limit, this
process runs to completion, with every proton being converted into a
neutron. The result is a neutron star, which is essentially an atomic
nucleus (with Z = 0) with the mass of a star!
Observational evidence for the existence of neutron stars came
in 1967 with the detection by Bell and Hewish at Cambridge of a
mysterious radio signal with a period of 1.3373011 seconds. The sig-
nal’s observability was synchronized with the rotation of the earth
relative to the stars, rather than with legal clock time or the earth’s
rotation relative to the sun. This led to the conclusion that its origin
was in space rather than on earth, and Bell and Hewish originally
dubbed it LGM-1 for “little green men.” The discovery of a second
signal, from a different direction in the sky, convinced them that it
was not actually an artificial signal being generated by aliens. Bell
published the observation as an appendix to her PhD thesis, and
it was soon interpreted as a signal from a neutron star. Neutron
stars can be highly magnetized, and because of this magnetization
they may emit a directional beam of electromagnetic radiation that
sweeps across the sky once per rotational period — the “lighthouse
effect.” If the earth lies in the plane of the beam, a periodic signal
can be detected, and the star is referred to as a pulsar. It is fairly
easy to see that the short period of rotation makes it difficult to
explain a pulsar as any kind of less exotic rotating object. In the
approximation of Newtonian mechanics,
p a spherical body of density
ρ, rotating with a period T = 3π/Gρ, has zero apparent gravity
at its equator, since gravity is just strong enough to accelerate an
object so that it follows a circular trajectory above a fixed point on
We’ll see in chapter 6 that for a non-rotating black hole, the metric
is of the form
2. Similarly, it has terms that are odd under reversal of the dif-
ferential dφ of the azimuthal coordinate.
Tensorial
One is to let have the values 0 and ±1 at some arbitrarily
chosen point, in some arbitrarily chosen coordinate system, but to
let it transform like a tensor. Then Aµ = µκλ uκ v λ needs to be
modified, since the right-hand side is a tensor, and that would make
A a tensor, but if A is an area we don’t want it to transform like
a 1-tensor. We therefore need to revise the definition of area to
be Aµ = g −1/2 µκλ uκ v λ , where g is the determinant of the lower-
index form of the metric. The following two examples justify this
procedure in a locally Euclidean three-space.
Scaling coordinates with tensorial Example: 24
Then scaling of coordinates by k scales all the elements of the
metric by k −2 , g by k −6 , g −1/2 by k 3 , µκλ by k −3 , and u κ v λ by
k 2 . The result is to scale Aµ by k +3−3+2 = k 2 , which makes sense
if A is an area.
Oblique coordinates with tensorial Example: 25
In oblique coordinates (example 9, p. 104), the two basis vectors
have unit length but are at an angle φ 6= π/2 to one another. The
√
determinant of the metric is g = sin2 φ, so g = sin φ, which is
exactly the correction factor needed in order to get the right area
when u and v are the two basis vectors.
This procedure works more generally, the sole modification being
that in a space such as a locally Lorentzian one where g < 0 we need
√ √
to use −g as the correction factor rather than g.
Tensor-density
The other option is to let have the same 0 and ±1 values at
all points. Then is clearly not a tensor, because it doesn’t scale
by a factor of k n when the coordinates are scaled by k; is a tensor
density with weight −1 for the upper-index version and +1 for the
lower-index one. The relation Aµ = µκλ uκ v λ gives an area that is
a tensor density, not a tensor, because A is not written in terms of
purely tensorial quantities. Scaling the coordinates by k leaves µκλ
unchanged, scales up uκ v λ by k 2 , and scales up the area by k 2 , as
expected.
Unfortunately, there is no consistency in the literature as to
whether should be a tensor or a tensor density. Some authors
define both a tensor and a nontensor version, with notations like
and ˜, or19 0123 and [0123]. Others avoid writing the letter
completely.20 The tensor-density version is convenient because we
always know that its value is 0 or ±1. The tensor version has the
19
Misner, Thorne, and Wheeler
20
Hawking and Ellis
p
14 Derive the equation T = 3π/Gρ given on page 146 for the
period of a rotating, spherical object that results in zero apparent
gravity at its surface.
15 Section 4.4.3 presented an estimate of the upper limit on the
mass of a white dwarf. Check the self-consistency of the solution
in the following respects: (1) Why is it valid to ignore the contri-
bution of the nuclei to the degeneracy pressure? (2) Although the
electrons are ultrarelativistic, spacetime is approximated as being
flat. As suggested in example 14 on page 64, a reasonable order-of-
magnitude check on this result is that we should have M/r c2 /G.
16 The laws of physics in our universe imply that for bodies with
a certain range of masses, a neutron star is the unique equilibrium
state. Suppose we knew of the existence of neutron stars, but didn’t
know the mass of the neutron. Infer upper and lower bounds on the
mass of the neutron.
17 Example 20 on p. 141 briefly introduced the electromagnetic
potential four-vector Fij , and this implicitly defines the transforma-
tion properties of the electric and magnetic fields under a Lorentz
boost v. To lowest order in v, this transformation is given by
E0 ≈ E + v × B and
0
B ≈ B − v × E.
I’m not a historian of science, but apparently ca. 1905 people like
Hertz believed that these were the exact transformations of the
field.21 Show that this can’t be the case, because performing two
such transformations in a row does not in general result in a trans-
formation of the same form. . Solution, p. 392
21
Montigny and Rousseaux, arxiv.org/abs/physics/0512200.
Problems 157
18 We know of massive particles, whose velocity vectors always
lie inside the future light cone, and massless particles, whose veloc-
ities lie on it. In principle, we could have a third class of particles,
called tachyons, with spacelike velocity vectors. Tachyons would
have m2 < 0, i.e., their masses would have to be imaginary. Show
that it is possible to pick momentum four-vectors p1 and p2 for
a pair of tachyons such that p1 + p2 = 0. This implies that the
vacuum would be unstable with respect to spontaneous creation of
tachyon-antitachyon pairs.
159
intrinsic curvature. It arises only from the choice of the coordinates
(t0 , x0 ) defined by a frame tied to the accelerating rocket ship.
The fact that the above metric has nonvanishing derivatives, un-
like a constant Lorentz metric, does indicate the presence of a grav-
itational field. However, a gravitational field is not the same thing
as intrinsic curvature. The gravitational field seen by an observer
aboard the ship is, by the equivalence principle, indistinguishable
from an acceleration, and indeed the Lorentzian observer in the
earth’s frame does describe it as arising from the ship’s accelera-
tion, not from a gravitational field permeating all of space. Both
observers must agree that “I got plenty of nothin’ ” — that the
region of the universe to which they have access lacks any stars,
neutrinos, or clouds of dust. The observer aboard the ship must de-
scribe the gravitational field he detects as arising from some source
very far away, perhaps a hypothetical vast sheet of lead lying billions
of light-years aft of the ship’s deckplates. Such a hypothesis is fine,
but it is unrelated to the structure of our hoped-for field equation,
which is to be local in nature.
Not only does the metric tensor not represent the gravitational
field, but no tensor can represent it. By the equivalence princi-
ple, any gravitational field seen by observer A can be eliminated by
switching to the frame of a free-falling observer B who is instanta-
neously at rest with respect to A at a certain time. The structure of
the tensor transformation law guarantees that A and B will agree on
whether a given tensor is zero at the point in spacetime where they
pass by one another. Since they agree on all tensors, and disagree
on the gravitational field, the gravitational field cannot be a tensor.
We therefore conclude that a nonzero intrinsic curvature of the
type that is to be included in the Einstein field equations is not
encoded in any simple way in the metric or its first derivatives.
Since neither the metric nor its first derivatives indicate curvature,
we can reasonably conjecture that the curvature might be encoded
in its second derivatives.
Let’s now generalize beyond elliptic geometry. Consider a space i / 2. Gaussian curvature as
modeled by a surface embedded in three dimensions, with geodesics L 6= r θ.
defined as curves of extremal length, i.e., the curves made by a piece
of string stretched taut across the surface. At a particular point
P, we can always pick a coordinate system (x, y, z) such that the
surface z = 12 k1 x2 + 12 k2 y 2 locally approximates the surface to the
level of precision needed in order to discuss curvature. The surface
is either paraboloidal or hyperboloidal (a saddle), depending on the
signs of k1 and k2 . We might naively think that k1 and k2 could be
independently determined by intrinsic measurements, but as we’ve
seen in example 5 on page 96, a cylinder is locally indistinguishable
from a Euclidean plane, so if one k is zero, the other k clearly cannot
be determined. In fact all that can be measured is the Gaussian
curvature, which equals the product k1 k2 . To see why this should
be true, first consider that any measure of curvature has units of
inverse distance squared, and the k’s have units of inverse distance.
The only possible intrinsic measures of curvature based on the k’s
are therefore k12 + k22 and k1 k2 . (We can’t have, for example, just k12 ,
because that would change under an extrinsic rotation about the z
axis.) Only k1 k2 vanishes on a cylinder, so it is the only possible
intrinsic curvature.
1
Proof: Since any two lines cross in elliptic geometry, ` crosses the x axis. The
corollary then follows by application of the definition of the Gaussian curvature
to the right triangles formed by `, the x axis, and the lines at x = 0 and x = dx,
so that K = d/ dA = d2 α/ dx dy, where third powers of infinitesimals have
been discarded.
2
In the spherical model, L = ρθ sin u, where u is the angle subtended at the
center of the sphere by an arc of length r. We then have L/LE = sin u/u, whose
second derivative with respect to u is −1/3. Since r = ρu, the second derivative
of the same quantity with respect to r equals −1/3ρ2 = −K/3.
3
http://math.stackexchange.com/questions/112662/
gaussian-curvature-of-an-ellipsoid-proportional-to-fourth-power-of-the-distance
4
I W McAllister 1990 J. Phys. D: Appl. Phys. 23 359
5
Moore et al., Journal of Applied Meteorology 39 (1999) 593
Let’s estimate the size of the effect. The first derivative of the
metric is, roughly, the gravitational field, whereas the second deriva-
tive has to do with curvature. The curvature of spacetime around
the earth should therefore vary as GM r−3 , where M is the earth’s
mass and G is the gravitational constant. The area enclosed by a
circular orbit is proportional to r2 , so we expect the geodetic effect
to vary as nGM/r, where n is the number of orbits. The angle of
precession is unitless, and the only way to make this result unitless
is to put in a factor of 1/c2 . In units with c = 1, this factor is un-
necessary. In ordinary metric units, the 1/c2 makes sense, because
it causes the purely relativistic effect to come out to be small. The
result, up to unitless factors that we didn’t pretend to find, is
nGM
∆θ ∼ 2 .
c r
a / The geodetic effect as 7
This statement is itself only a rough estimate. Anyone who has taught
measured by Gravity Probe B. physics knows that students will often calculate an effect exactly while not un-
derstanding the underlying physics at all.
b / Precession angle as a function of time as measured by the four gyroscopes aboard Gravity Probe B.
∂b Ψ → ∂b eiα Ψ
= ∂b + A0b − Ab Ψ0
∇b = ∂b + ieAb
d 1 dG
∇X = − G−1 .
dX 2 dX
and should have an opposite sign for contravariant vectors.
Generalizing the correction term to derivatives of vectors in more
than one dimension, we should have something of this form:
∇a v b = ∂a v b + Γbac v c
∇a vb = ∂a vb − Γcba vc ,
where Γbac , called the Christoffel symbol, does not transform like
a tensor, and involves derivatives of the metric. (“Christoffel” is
pronounced “Krist-AWful,” with the accent on the middle syllable.)
The explicit computation of the Christoffel symbols from the metric
is deferred until section 5.9, but the intervening sections 5.7 and 5.8
can be omitted on a first reading without loss of continuity.
An important gotcha is that when we evaluate a particular com-
ponent of a covariant derivative such as ∇2 v 3 , it is possible for the
result to be nonzero even if the component v 3 vanishes identically.
This can be seen in example 5 on p. 305 and example 21 on p. 345.
or
With the partial derivative ∂µ , it does not make sense to use the
metric to raise the index and form ∂ µ . It does make sense to do so
with covariant derivatives, so ∇a = g ab ∇b is a correct identity.
5.7.4 Uniqueness
The geodesic equation is useful in establishing one of the neces-
sary theoretical foundations of relativity, which is the uniqueness of
geodesics for a given set of initial conditions. This is related to ax-
iom O1 of ordered geometry, that two points determine a line, and
is necessary physically for the reasons discussed on page 22; briefly,
if the geodesic were not uniquely determined, then particles would
have no way of deciding how to move. The form of the geodesic
equation guarantees uniqueness. To see this, consider the following
algorithm for determining a numerical approximation to a geodesic:
3. Add (d2 xµ / dλ2 )∆λ to the currently stored value of dxµ / dλ.
5. Add ∆λ to λ.
6. Repeat steps 2-5 until the geodesic has been extended to the
desired affine distance.
5.8 Torsion
This section describes the concept of gravitational torsion. It can
be skipped without loss of continuity, provided that you accept the
symmetry property Γa[bc] = 0 without worrying about what it means
physically or what empirical evidence supports it.
Self-check: Interpret the mathematical meaning of the equation
Γa[bc] = 0, which is expressed in the notation introduced on page
103.
(∇a ∇b − ∇b ∇a )f = −τ cab ∇c f ,
11
http://arxiv.org/abs/hep-ph/0606218
12
Carroll and Field, http://arxiv.org/abs/gr-qc/9403058
1
Γdba = g cd (∂? g?? ) ,
2
where inversion of the one-component matrix G has been replaced
by matrix inversion, and, more importantly, the question marks indi-
cate that there would be more than one way to place the subscripts
so that the result would be a grammatical tensor equation. The
most general form for the Christoffel symbol would be
1
Γbac = g db (L∂c gab + M ∂a gcb + N ∂b gca ) ,
2
where L, M , and N are constants. Consistency with the one-
dimensional expression requires L + M + N = 1, and vanishing
torsion gives L = M . The L and M terms have a different physical
significance than the N term.
Suppose an observer uses coordinates such that all objects are
described as lengthening over time, and the change of scale accu-
mulated over one day is a factor of k > 1. This is described by the
derivative ∂t gxx < 1, which affects the M term. Since the metric is
used to calculate
√ squared distances, the gxx matrix element scales
down by 1/ k. To compensate for ∂t v x < 0, so we need to add a
positive correction term, M > 0, to the covariant derivative. When
the same observer measures the rate of change of a vector v t with
respect to space, the rate of change comes out to be too small, be-
cause the variable she differentiates with respect to is too big. This
requires N < 0, and the correction is of the same size as the M
correction, so |M | = |N |. We find L = M = −N = 1.
Self-check: Does the above argument depend on the use of space
for one coordinate and time for the other?
The resulting general expression for the Christoffel symbol in
terms of the metric is
1
Γcab = g cd (∂a gbd + ∂b gad − ∂d gab ) .
2
One can readily go back and check that this gives ∇c gab = 0. In fact,
the calculation is a bit tedious. For that matter, tensor calculations
in general can be infamously time-consuming and error-prone. Any
reasonable person living in the 21st century will therefore resort to
a computer algebra system. The most widely used computer alge-
bra system is Mathematica, but it’s expensive and proprietary, and
it doesn’t have extensive built-in facilities for handling tensors. It
1 import math
2
3 l = 0 # affine parameter lambda
4 dl = .001 # change in l with each iteration
5 l_max = 100.
6
7 # initial position:
13
http://arxiv.org/abs/0903.2085
14
http://arxiv.org/abs/0903.2085
. . . and differentia-
tion of this gives the
gauge field. . . Ab Γcab
A second differen-
tiation gives the
directly observable
field(s) . . . E and B Rcdab
16
For those with knowledge of topology, these can be formalized a little more:
we want a completely normal, second-countable, locally connected topological
space that has Lebesgue covering dimension n, is a homogeneous space under
its own homeomorphism group, and is a complete uniform space. I don’t know
whether this is sufficient to characterize a manifold completely, but it suffices to
rule out all the counterexamples of which I know.
Lines Example: 13
The set of all real numbers is a 1-manifold. Similarly, any line with
the properties specified in Euclid’s Elements is a 1-manifold. All
such lines are homeomorphic to one another, and we can there-
fore speak of “the line.”
A circle Example: 14
A circle (not including its interior) is a 1-manifold, and it is not
homeomorphic to the line. To see this, note that deleting a point
from a circle leaves it in one connected piece, but deleting a point
from a line makes two. Here we use the fact that a homeomor-
phism is guaranteed to preserve “rubber-sheet” properties like the
number of pieces.
No changes of dimension Example: 15
A “lollipop” formed by gluing an open 2-circle (i.e., a circle not
including its boundary) to an open line segment is not a manifold,
because there is no n for which it satisfies M1.
It also violates M2, because points in this set fall into three distinct
classes: classes that live in 2-dimensional neighborhoods, those
that live in 1-dimensional neighborhoods, and the point where the
line segment intersects the boundary of the circle.
No manifolds made from the rational numbers Example: 16
The rational numbers are not a manifold,
√ because specifying an
arbitrarily small neighborhood around 2 excludes every rational
number, violating M3.
Similarly, the rational plane defined by rational-number coordinate
pairs (x, y ) is not a 2-manifold. It’s good that we’ve excluded
this space, because it has the unphysical property that curves
can cross without having a point in common. For example, the
curve y = x 2 crosses from one side of the line y = 2 to the other,
but never intersects it. This is physically undesirable because it
doesn’t match up with what we have in mind when we talk about
collisions between particles as intersections of their world-lines,
or when we say that electric field lines aren’t supposed to inter-
sect.
No boundary Example: 17
The open half-plane y > 0 in the Cartesian plane is a 2-manifold.
The closed half-plane y ≥ 0 is not, because it violates M2; the
19
“Differential geometry via infinitesimal displacements,” arxiv.org/abs/
1405.0984
20
It is possible to define a different and larger enhancement, called ∗ M, that
would include points with infinitely large coordinates. For example, suppose we
have a coordinate patch with bounds on the coordinates that can be written
down using inequalities, t > 0, 0 ≤ θ ≤ π/4, . . . Then ∗ M would contain any
finite, infinitesimal, and infinite values of (t, θ, . . .) satisfying these inequalities,
and this would include infinite values of t. We will not do this here, because
the inclusion of idealized points at infinity is more useful in relativity if we do it
using a different approach, discussed in section 7.3.4, p. 274.
21
As usual in this type of “big O” notation, we abuse the equals sign somewhat.
In particular, the equals sign here is not symmetric. For more detail, see the
Wikipedia article “Big O notation.”
22
An equivalent and manifestly coordinate-independent definition is that for
every smooth real function in a neighborhood of the points, the function differs
at these points by an amount that is O().
M =1·M
27
If we multiplied g by a negative constant, then we would change the signa-
ture, e.g., from +−−− to −+++. Changing the signature would be particularly
goofy in the context of Riemannian geometry, where it is customary to have a
positive-definite metric.
Problems 209
another for z near 0.
(b) When a test particle is released from rest in either of these met-
rics, its initial proper acceleration is g.
(c) The two metrics are not exactly equivalent to one another under
any change of coordinates.
(d) Both spacetimes are uniform in the sense that the curvature is
constant. (In both cases, this can be proved without an explicit
computation of the Riemann tensor.)
Remark: The incompatibility between [1] and [2] can be interpreted as showing
that general relativity does not admit any spacetime that has all the global
properties we would like for a uniform gravitational field. This is related to Bell’s
spaceship paradox (example 15, p. 65). Some further properties of the metric [1]
are analyzed in subsection 7.5 on page 285. . Solution, p. 393
8 In a topological space T, the complement of a subset U is
defined as the set of all points in T that are not members of U. A
set whose complement is open is referred to as closed. On the real
line, give (a) one example of a closed set and (b) one example of
a set that is neither open nor closed. (c) Give an example of an
inequality that defines an open set on the rational number line, but
a closed set on the real line.
9 Prove that a double cone (e.g., the surface r = z in cylindrical
coordinates) is not a manifold. . Solution, p. 393
10 Prove that a torus is a manifold. . Solution, p. 393
11 Prove that a sphere is not homeomorphic to a torus.
. Solution, p. 394
12 Curvature on a Riemannian space in 2 dimensions is a
topic that goes back to Gauss and has a simple interpretation: the
only intrinsic measure of curvature is a single number, the Gaussian
curvature. What about 1+1 dimensions? The simplest metrics I
can think of are of the form ds2 = dt2 − f (t)dx2 . (Something like
ds2 = f (t)dt2 −dx2 is obviously equivalent to Minkowski space under
a change of coordinates, while ds2 = f (x)dt2 − dx2 is the same as
the original example except that we’ve swapped x and t.) Playing
around with simple examples, one stumbles across the seemingly
mysterious fact that the metric ds2 = dt2 − t2 dx2 is flat, while ds2 =
dt2 − tdx2 is not. This seems to require some simple explanation.
Consider the metric ds2 = dt2 − tp dx2 .
(a) Calculate the Christoffel symbols by hand.
(b) Use a computer algebra system such as Maxima to show that
the Ricci tensor vanishes only when p = 2.
Remark: The explanation is that in the case p = 2, the x coordinate is expanding
in proportion to the t coordinate. This can be interpreted as a situation in which
our length scale is defined by a lattice of test particles that expands inertially.
Since their motion is inertial, no gravitational fields are required in order to
explain the observed change in the length scale; cf. the Milne universe, p. 332.
. Solution, p. 394
29
Jackson, Classical Electrodynamics
Problems 211
212 Chapter 5 Curvature
Chapter 6
Vacuum Solutions
In this chapter we investigate general relativity in regions of space
that have no matter to act as sources of the gravitational field.
We will not, however, limit ourselves to calculating spacetimes in
cases in which the entire universe has no matter. For example,
we will be able to calculate general-relativistic effects in the region
surrounding the earth, including a full calculation of the geodetic
effect, which was estimated in section 5.5.1 only to within an order
of magnitude. We can have sources, but we just won’t describe the
metric in the regions where the sources exist, e.g., inside the earth.
The advantage of accepting this limitation is that in regions of empty
space, we don’t have to worry about the details of the stress-energy
tensor or how it relates to curvature. As should be plausible based a/A Swiss commemorative
on the physical motivation given in section 5.1, page 160, the field coin shows the vacuum field
equations in a vacuum are simply Rab = 0. equation.
1 p
x= 1 + a2 t2 − 1 ,
a
gt0 0 t0 = (1 + ax0 )2
gx0 0 x0 = −1.
213
expect that this one also has zero Ricci curvature. This is straight-
forward to verify. The nonvanishing Christoffel symbols are
0 a 0
Γt x0 t0 = 0
and Γx t0 t0 = a(1 + ax0 ).
1 + ax
The only elements of the Riemann tensor that look like they might
0 0
be nonzero are Rt t0 x0 x0 and Rx t0 x0 t0 , but both of these in fact vanish.
Self-check: Verify these facts.
This seemingly routine exercise now leads us into some very in-
teresting territory. Way back on page 12, we conjectured that not all
events could be time-ordered: that is, that there might exists events
in spacetime 1 and 2 such that 1 cannot cause 2, but neither can 2
cause 1. We now have enough mathematical tools at our disposal
to see that this is indeed the case.
We observe that x(t) approaches the asymptote x = t − 1/a.
This asymptote has a slope of 1, so it can be interpreted as the
world-line of a photon that chases the ship but never quite catches
up to it. Any event to the left of this line can never have a causal
relationship with any event on the ship’s world-line. Spacetime, as
seen by an observer on the ship, has been divided by a curtain into
two causally disconnected parts. This boundary is called an event
horizon. Its existence is relative to the world-line of a particular
observer. An observer who is not accelerating along with the ship
does not consider an event horizon to exist. Although this particular
a / A spaceship (curved world- example of the indefinitely accelerating spaceship has some physi-
line) moves with an acceleration
cally implausible features (e.g., the ship would have to run out of
perceived as constant by its
passengers. The photon (straight
fuel someday), event horizons are real things. In particular, we will
world-line) comes closer and see in section 6.3.2 that black holes have event horizons.
closer to the ship, but will never Interpreting everything in the (t0 , x0 ) coordinates tied to the ship,
quite catch up. the metric’s component gt0 0 t0 vanishes at x0 = −1/a. An observer
aboard the ship reasons as follows. If I start out with a head-start
of 1/a relative to some event, then the timelike part of the metric at
that event vanishes. If the event marks the emission of a material
particle, then there is no possible way for that particle’s world-line
to have ds2 > 0. If I were to detect a particle emitted at that event,
it would violate the laws of physics, since material particles must
have ds2 > 0, so I conclude that I will never observe such a particle.
Since all of this applies to any material particle, regardless of its
mass m, it must also apply in the limit m → 0, i.e., to photons and
other massless particles. Therefore I can never receive a particle
emitted from this event, and in fact it appears that there is no way
for that event, or any other event behind the event horizon, to have
any effect on me. In my frame of reference, it appears that light
cones near the horizon are tipped over so far that their future light-
cones lie entirely in the direction away from me.
We’ve already seen in example 14 on page 64 that a naive New-
tonian argument suggests the existence of black holes; if a body is
2
http://xxx.lanl.gov/abs/gr-qc/9605032
3
“On the gravitational field of a point mass according to Einstein’s the-
a / The field equations of general
ory,” Sitzungsberichte der K oniglich Preussischen Akademie der Wissenschaften
1 (1916) 189. An English translation is available at http://arxiv.org/abs/
relativity are nonlinear.
physics/9905030v1.
4
See p. 254 for a different but closely related use of the same term.
Use of ctensor
In fact, when I calculated the Christoffel symbols above by hand,
I got one of them wrong, and missed calculating one other because I
thought it was zero. I only found my mistake by comparing against
a result in a textbook. The computation of the Riemann tensor is
an even bigger mess. It’s clearly a good idea to resort to a com-
puter algebra system here. Cadabra, which was discussed earlier, is
specifically designed for coordinate-independent calculations, so it
won’t help us here. A good free and open-source choice is ctensor,
which is one of the standard packages distributed along with the
computer algebra system Maxima, introduced on page 75.
The following Maxima program calculates the Christoffel sym-
bols found in section 6.2.1.
1 load(ctensor);
2 ct_coords:[t,r,theta,phi];
3 lg:matrix([1,0,0,0],
4 [0,-1,0,0],
5 [0,0,-r^2,0],
6 [0,0,0,-r^2*sin(theta)^2]);
7 cmetric();
8 christof(mcs);
Line 1 loads the ctensor package. Line 2 sets up the names of the
coordinates. Line 3 defines the gab , with lg meaning “the version of
g with lower indices.” Line 7 tells Maxima to do some setup work
with gab , including the calculation of the inverse matrix g ab , which
is stored in ug. Line 8 says to calculate the Christoffel symbols.
The notation mcs refers to the tensor Γ0 bca with the indices swapped
around a little compared to the convention Γabc followed in this
1 1
2 (%t6) mcs = -
3 2, 3, 3 r
4
5 1
6 (%t7) mcs = -
7 2, 4, 4 r
8
9 (%t8) mcs = - r
10 3, 3, 2
11
12 cos(theta)
13 (%t9) mcs = ----------
14 3, 4, 4 sin(theta)
15
16 2
17 (%t10) mcs = - r sin (theta)
18 4, 4, 2
19
20 (%t11) mcs = - cos(theta) sin(theta)
21 4, 4, 3
∇t v r = ∂t v r + Γrtc v c .
1 load(ctensor);
2 ct_coords:[t,r,theta,phi];
3 lg:matrix([(1-2/r),0,0,0],
4 [0,-(1+b1/r),0,0],
5 [0,0,-r^2,0],
6 [0,0,0,-r^2*sin(theta)^2]);
7 cmetric();
8 ricci(true);
9 limit(r^4*ric[1,1],r,inf);
Time-reversal symmetry
The Schwarzschild metric is invariant under time reversal, since
time occurs only in the form of dt2 , which stays the same under
dt → − dt. This is the same time-reversal symmetry that occurs in
Newtonian gravity, where the field is described by the gravitational
acceleration g, and accelerations are time-reversal invariant.
Fundamentally, this is an example of general relativity’s coordi-
nate independence. The laws of physics provided by general rela-
tivity, such as the vacuum field equation, are invariant under any
smooth coordinate transformation, and t → −t is such a coordinate
transformation, so general relativity has time-reversal symmetry.
Since the Schwarzschild metric was found by imposing time-reversal-
symmetric boundary conditions on a time-reversal-symmetric differ-
ential equation, it is an equally valid solution when we time-reverse
it. Furthermore, we expect the metric to be invariant under time
reversal, unless spontaneous symmetry breaking occurs (see p. 348).
This suggests that we ask the more fundamental question of what
global symmetries general relativity has. Does it have symmetry
under parity inversion, for example? Or can we take any solution
such as the Schwarzschild spacetime and transform it into a frame
of reference in which the source of the field is moving uniformly in a
certain direction? Because general relativity is locally equivalent to
special relativity, we know that these symmetries are locally valid.
But it may not even be possible to define the corresponding global
Flat space
As a first warmup, consider two spatial dimensions, represented
by Euclidean polar coordinates (r, φ). Parallel-transport of a gyro-
scope’s angular momentum around a circle of constant r gives
∇φ Lφ = 0
∇φ Lr = 0.
Computing the covariant derivatives, we have
0 = ∂φ Lφ + Γφφr Lr
0 = ∂φ Lr + Γrφφ Lφ .
P 0 = −Q
Q0 = P ,
P 0 = −Q
Q0 = (1 − )P ,
√
where = 2m. The solutions rotate with frequency ω 0 = 1 − .
The result is that when the basis vectors rotate by 2π, the compo-
nents
√ no longer return to their original values; they lag by a factor
of 1 − ≈ 1 − m. Putting the factors of r back in, this is 1 − m/r.
The deviation from unity shows that after one full revolution, the L
vector no longer has quite the same components expressed in terms
of the (r, φ) basis vectors.
To understand the sign of the effect, let’s imagine a counter-
clockwise rotation. The (r, φ) rotate counterclockwise, so relative
to them, the L vector rotates clockwise. After one revolution, it has
not rotated clockwise by a full 2π, so its orientation is now slightly
counterclockwise compared to what it was. Thus the contribution
to the geodetic effect arising from spatial curvature is in the same
direction as the orbit.
2+1 dimensions
To reproduce the experimental results correctly, we need to in-
clude the time dimension. The angular momentum vector now has
components (Lφ , Lr , Lt ). The physical interpretation of the Lt com-
ponent is obscure at this point; we’ll return to this question later.
Writing down the total derivatives of the three components, and
notating dt/ dφ as ω −1 , we have
dLφ
= ∂φ Lφ + ω −1 ∂t Lφ
dφ
dLr
= ∂φ Lr + ω −1 ∂t Lr
dφ
dLt
= ∂φ Lt + ω −1 ∂t Lt
dφ
Setting the covariant derivatives equal to zero gives
0 = ∂φ Lφ + Γφφr Lr
0 = ∂φ Lr + Γrφφ Lφ
0 = ∂t Lr + Γrtt Lt
0 = ∂t Lt + Γttr Lr .
Self-check: There are not just four but six covariant derivatives
that could in principle have occurred, and in these six covariant
derivatives we could have had a total of 18 Christoffel symbols. Of
these 18, only four are nonvanishing. Explain based on symmetry
arguments why the following Christoffel symbols must vanish: Γφφt ,
Γttt .
Putting all this together in matrix form, we have L0 = M L,
where
0 −1 0
M = 1− 0 −(1 − )/2ω .
0 −/2ω(1 − ) 0
6
Misner, Thorne, and Wheeler, Gravitation, p. 1118
7
Rindler, Essential Relativity, 1969, p. 141
Conserved quantities
If Einstein had had a computer on his desk, he probably would
simply have integrated the motion numerically using the geodesic
equation. But it is possible to simplify the problem enough to at-
tack it with pencil and paper, if we can find the relevant conserved
quantities of the motion. Nonrelativistically, these are energy and
angular momentum.
Consider a rock falling directly toward the sun. The Schwarzschild
metric is of the special form
s = s1 + s2
q q
= h1 t1 − k1 r1 + h2 t22 − k2 r22
2 2
q q
= h1 t1 − k1 r1 + h2 (T − t1 )2 − k2 r22 ,
2 2
and
dφ
L = r2
ds
for the conserved energy per unit mass and angular momentum per
unit mass.
In interpreting the energy per unit mass E, it is important to
understand that in the general-relativistic context, there is no use-
ful way of separating the rest mass, kinetic energy, and potential
energy into separate terms, as we could in Newtonian mechanics.
E includes contributions from all of these, and turns out to be less
Perihelion advance
For convenience, let the mass of the orbiting rock be 1, while m
stands for the mass of the gravitating body.
The unit mass of the rock is a third conserved quantity, and
since the magnitude of the momentum vector equals the square of
the mass, we have for an orbit in the plane θ = π/2,
or
ṙ2 = E 2 − U 2
U 2 = (1 − 2m/r)(1 + L2 /r2 ).
L2 p
r= 1 + 1 − 12m2 /L2
2m
L2
≈ (1 − ),
m
where = 3(m/L)2 . A planet in a nearly circular orbit oscillates
between perihelion and aphelion with a period that depends on the
curvature of U 2 at its minimum. We have
d2 (U 2 )
k=
dr2
d2 2m L2 2mL2
= 2 1− + 2 −
dr r r r3
4m 6L2 24mL2
=− 3 + 4 −
r r r5
= 2L−6 m4 (1 + 2)
∆saz = 2πr2 /L
= 2πL3 m−2 (1 − 2).
10
For a review article on this topic, see Clifford Will, “The Confrontation
between General Relativity and Experiment,” arxiv.org/abs/1403.7377.
1 import math
2
3 # constants, in SI units:
4 G = 6.67e-11 # gravitational constant
5 c = 3.00e8 # speed of light
6 m_kg = 1.99e30 # mass of sun
7 r_m = 6.96e8 # radius of sun
8
9 # From now on, all calculations are in units of the
10 # radius of the sun.
11
12 # mass of sun, in units of the radius of the sun:
13 m_sun = (G/c**2)*(m_kg/r_m)
14 m = 1000.*m_sun
15 print "m/r=",m
16
17 # Start at point of closest approach.
18 # initial position:
19 t=0
20 r=1 # closest approach, grazing the sun’s surface
21 phi=-math.pi/2
22 # initial derivatives of coordinates w.r.t. lambda
23 vr = 0
24 vt = 1
25 vphi = math.sqrt((1.-2.*m/r)/r**2)*vt # gives ds=0, lightlike
At line 14, we take the mass to be 1000 times greater than the
mass of the sun. This helps to make the deflection easier to calcu-
late accurately without running into problems with rounding errors.
Lines 17-25 set up the initial conditions to be at the point of closest
approach, as the photon is grazing the sun. This is easier to set
up than initial conditions in which the photon approaches from far
away. Because of this, the deflection angle calculated by the pro-
gram is cut in half. Combining the factors of 1000 and one half, the
final result from the program is to be interpreted as 500 times the
actual deflection angle.
The result is that the deflection angle is predicted to be 870
seconds of arc. As a check, we can run the program again with
m = 0; the result is a deflection of −8 seconds, which is a measure
13
See, e.g., http://arxiv.org/abs/1402.5675
14
arxiv.org/abs/0903.1105
15
arxiv.org/abs/0906.4040
Formal definitions
The remainder of this subsection provides a more formal expo-
sition of the definitions relating to singularities. It can be skipped
without loss of continuity.
The reason we care about singularities is that they indicate an
incompleteness of the theory, and the theory’s inability to make
predictions. One of the simplest things we could ask any theory
to do would be to predict the trajectories of test particles. For
example, Maxwell’s equations correctly predict the motion of an
electron in a uniform magnetic field, but they fail to predict the
motion of an electron that collides head-on with a positron. It might
have been natural for someone in Maxwell’s era (assuming they were
informed about the existence of positrons and told to assume that
both particles were pointlike) to guess that the two particles would
scatter through one another at θ = 0, their velocities momentarily
becoming infinite. But it would have been equally natural for this
person to refuse to make a prediction.
Similarly, if a particle hits a black hole singularity, we should not
expect general relativity to make a definite prediction. It doesn’t,
because the geodesic equation breaks down.
We would therefore like to define a singularity as a situation in
16
Phys. Rev. Lett. 70, p. 9
ds2 = A(dt 2 − dx 2 )
A = 1/(1 + et ),
1 load(ctensor);
2 dim:2;
3 ct_coords:[t,x];
4 u:1/(1+exp(t));
5 lg:matrix([u,0],
6 [0,-u]);
7 cmetric();
8 ricci(true);
9 lriemann(true);
10 uriemann(true);
11 scurvature();/* scalar curvature */
12 rinvariant(); /* Kretchmann */
ṫ 2 t
ẗ = − θ̇2
2t 2
ṫ
θ̈ = − t,
θ̇
where dots represent differentiation with respect to the affine pa-
rameter λ. Implicit differentiation of the equation θ = ln t gives
θ̇ = ṫ/t, and plugging this in to the first geodesic equation re-
sults in ẗ = 0. We can therefore take t = λ. (We could also
take t = aλ + b, which would result in a different and equally valid
affine parameter.) If λ had gone to −∞ as t went to zero, then
we would have demonstrated that the geodesic was complete. It
approaches a finite limit instead, which suggests, but does not
prove, that it is incomplete.
The change of coordinates θ → θ − ln t allows the counterclock-
wise lightlike geodesics to be continued through t = 0. (Because
this transformation is not a diffeomorphism, it is not just a renam-
ing of points but an actual physical change in the structure of the
19
Penrose, Gravitational radiation and gravitational collapse; Proceedings of
the Symposium, Warsaw, 1973. Dordrecht, D. Reidel Publishing Co. pp. 82-91,
free online at adsabs.harvard.edu/full/1974IAUS...64...82P
23
More rigorously, we expect them to satisfy suitable energy conditions, section
8.1.3, p. 307.
24
“Gravitational Collapse and Cosmic Censorship,” arxiv.org/abs/gr-qc/
9710068
Particle physics
Hawking radiation has some intriguing properties from the point
of view of particle physics. In a particle accelerator, the list of
particles one can create in appreciable quantities is determined by
coupling constants. In Hawking radiation, however, we expect to
see a representative sampling of all types of particles, biased only
by the fact that massless or low-mass particles are more likely to
be produced than massive ones. For example, it has been specu-
lated that some of the universe’s dark matter exists in the form of
“sterile” particles that do not couple to any force except for gravity.
Such particles would never be produced in particle accelerators, but
would be seen in Hawking radiation. Based on present knowledge
of particle physics, the main components of Hawking radiation, for
all but the most microscopic black holes, are expected to be pho-
tons and gravitons, which would compete on roughly equal terms,
depending on the angular momentum of the black hole.25
Hawking radiation would violate many cherished conservation
laws of particle physics. Let a hydrogen atom fall into a black hole.
We’ve lost a lepton and a baryon, but if we want to preserve con-
servation of lepton number and baryon number, we cover this up
with a fig leaf by saying that the black hole has simply increased
its lepton number and baryon number by +1 each. But eventually
the black hole evaporates, and the evaporation is probably mostly
into zero-mass particles such as photons. Once the hole has evapo-
rated completely, our fig leaf has evaporated as well. There is now
no physical object to which we can attribute the +1 units of lepton
and baryon number.
Black-hole complementarity
A very difficult question about the relationship between quan-
tum mechanics and general relativity occurs as follows. In our ex-
25
Dong, arxiv.org/abs/1511.05642
26
Kanti, arxiv.org/abs/hep-ph/0402168
This looks like the Schwarzschild form with no other change than
a generalization of the exponent, and in fact Tangherlini showed
in 1963 that for d > 4, one obtains the exact solution simply by
applying the same change of exponent to grr as well.27
If large extra dimensions do exist, then this is the actual form
of any black-hole spacetime for r ρ, where the background curva-
ture of the extra dimensions is negligible. Since the exponents are
all changed, gravitational forces become stronger than otherwise ex-
pected at small distances, and it becomes easier to make black holes.
It has been proposed that if large extra dimensions exist, microscopic
black holes would be observed at the Large Hadron Collider. They
would immediately evaporate into Hawking radiation (p. 250), with
an experimental signature of violating the standard conservation
laws of particle physics. As of 2010, the empirical results seem to
be negative.28
The reasoning given above fails in the case of d = 3, i.e., 2+1-
dimensional spacetime, both because the integral of r−1 is not r0
and because the Tangherlini-Schwarzschild metric is not a vacuum
solution. As shown in problem 12 on p. 259, there is no counter-
part of the Schwarzschild metric in 2+1 dimensions. This is essen-
tially because for d = 3 mass is unitless, so given a source having
a certain mass, there is no way to set the distance scale at which
Newtonian weak-field behavior gives way to the relativistic strong
field. Whereas for d ≥ 4, Newtonian gravity is the limiting case
of relativity, for d = 3 they are unrelated theories. In fact, the
relativistic theory of gravity for d = 3 is somewhat trivial. Space-
time does not admit curvature in vacuum solutions,29 so that the
only nontrivial way to make non-Minkowski 2+1-dimensional space-
times is by gluing together Minkowski pieces in various topologies,
like gluing pieces of paper to make things like cones and Möbius
strips. 2+1-dimensional gravity has conical singularities, but not
Schwarzschild-style ones that are surrounded by curved spacetime.
If black-hole solutions exist in d dimensions, then one can extend
such a solution to d+1 dimensions with cylindrical symmetry, form-
ing a “black string.” The nonexistence of d = 3 black holes implies
27
Emparan and Reall, “Black Holes in Higher Dimensions,” relativity.
livingreviews.org/Articles/lrr-2008-6/
28
http://arxiv.org/abs/1012.3375
29
arxiv.org/abs/gr-qc/0503022v4
2
∂u
gtt = guu
∂t
2
∂u
−t =
∂t
Problems 259
13 On p. 223 I argued that there is no way to define a time-
reversal operation in general relativity so that it applies to all space-
times. Why can’t we define it by picking some arbitrary space-
like surface that covers the whole universe, flipping the velocity of
every particle on that surface, and evolving a new version of the
spacetime backward and forward from that surface using the field
equations? . Solution, p. 397
14 In Newtonian gravity, a body in a hyperbolic orbit has
a radius that decreases, reaches a minimum, and then goes back
out to infinity. Show that if this is to happen in the Schwarzschild
spacetime, for a particle with zero or nonzero mass, the distance of
closest approach much be greater than or equal to the Schwarzschild
radius 2m. (Note that it is possible to have trajectories that pass
out through the horizon, although we don’t expect to observe such
trajectories in the case of an astrophysical black hole.)
. Solution, p. 397
15 An astronaut in a spacesuit falls into a black hole. Estimate
an order-of-magnitude inequality for the mass of the black hole if the
astronaut is to survive past the event horizon without being killed
by tidal forces.
ξ dt = (1, 0, 0, 0) dt.
261
The Euclidean plane Example: 1
The Euclidean plane has two Killing vectors corresponding to
translation in two linearly independent directions, plus a third Killing
vector for rotation about some arbitrarily chosen origin O. In Carte-
sian coordinates, one way of writing a complete set of these is is
ξ1 = (1, 0)
ξ2 = (0, 1)
ξ3 = (−y , x).
ξ = ∂t .
A sphere Example: 4
A sphere is like a plane or a cylinder in that it is a two-dimensional
space in which no point has any properties that are intrinsically
different than any other. We might expect, then, that it would
have two Killing vectors. Actually it has three, ξx , ξy , and ξz , cor-
responding to infinitesimal rotations about the x, y , and z axes.
To show that these are all independent Killing vectors, we need
d / Example 3: A cylinder to demonstrate that we can’t, for example, have ξx = c1 ξy + c2 ξz
has three local symmetries, but for some constants c1 and c2 . To see this, consider the actions of
only two that can be extended ξy and ξz on the point P where the x axis intersects the sphere.
globally to make Killing vectors. (References to the axes and their intersection with the sphere are
extrinsic, but this is only for convenience of description and vi-
sualization.) Both ξy and ξz move P around a little, and these
motions are in orthogonal directions, wherease ξx leaves P fixed.
This proves that we can’t have ξx = c1 ξy + c2 ξz . All three Killing
vectors are linearly independent.
This example shows that linear independence of Killing vectors
can’t be visualized simply by thinking about the vectors in the
tangent plane at one point. If that were the case, then we could
have at most two linearly independent Killing vectors in this two-
dimensional space. When we say “Killing vector” we’re really re-
ferring to the Killing vector field, which is defined everywhere on
the space.
Proving nonexistence of Killing vectors Example: 5
. Find all Killing vectors of these two metrics:
ds2 = e−x dx 2 + ex dy 2
ds2 = dx 2 + x 2 dy 2 .
1 load(ctensor);
2 dim:2;
3 ct_coords:[x,y];
4 lg:matrix([exp(-x),0],[0,exp(x)]);
5 cmetric();
6 R:scurvature(); /* scalar curvature */
v = v µ ∂µ
ω = ωµ dxµ .
2
Hawking and Ellis, The Large Scale Structure of Space-Time, p. 62, give
a succinct treatment that describes the flux densities and proves that Gauss’s
theorem, which ordinarily fails in curved spacetime for a non-scalar flux, holds in
the case where the appropriate Killing vectors exist. For an explicit description
of how one can integrate to find a scalar mass-energy, see Winitzki, Topics in
General Relativity, section 3.1.5, available for free online.
ω0 ua0 v 0a
= ,
ω ub v b
which is coordinate-independent and also independent of the choice
of affine parameter for v . This relation is a purely kinematical fact,
but a quick and dirty way to see that it must be true is that the
energy-momentum vector of a light ray is proportional to its four-
velocity, and therefore the numerator and denominator of this ex-
pression each represent the respective observer’s measurement
of the ray’s energy.
Specializing now to the specific physical situation being analyzed,
we know that the emitter and observer are both at rest relative to
to the black hole, so that in Schwarzschild coordinates u and u 0
have only t components. Because these vectors are normalized,
and the metric has gtt = A, we have u t = A−1/2 and u 0t = A0−1/2 .
The ray has a conserved energy Av t , so that Av t = A0 v 0t . There is
also a nonzero v r , but we don’t need to calculate it for our present
purposes.
The Doppler shift comes out to be (A0 /A)−1/2 , which is consistent
with the result found previously by more elementary methods.
Doppler shifts across the horizon Example: 8
As remarked in section 6.2.7, p. 232, we cannot extend the kind
4
Ashley, “Singularity theorems and the abstract boundary construc-
tion,” https://digitalcollections.anu.edu.au/handle/1885/46055. Garcia-
Parrado and Senovilla, “Causal structures and causal boundaries,” http://
arxiv.org/abs/gr-qc/0501069.
We have already solved the field equations for a metric of this form
and found as a solution the Schwarzschild spacetime.10 Since the
metric’s components are all independent of t, ∂t is a Killing vec-
tor, and it is timelike for large r, so the Schwarzschild spacetime is
asymptotically static.
12
For a more formal statement of this, see Hawking and Ellis, “The Large Scale
Structure of Space-Time,” p. 315. Essentially, the region must be a connected
region on a spacelike three-surface, and there must be no lightlike world-lines
that connect points in that region to null infinity. Null infinity was introduced
briefly on p. 272 is defined formally using conformal techniques, but basically
refers to points that are infinitely far away in both space and time, and have
the two infinities equal in a certain sense, so that a free light ray could end up
there. The definition is based on the assumption that the surrounding spacetime
is asymptotically flat, since otherwise null infinity can’t be defined. It is not
actually necessary to assume a singularity as part of the definition; the no-hair
theorems guarantee that one exists.
1 load(ctensor);
2 ct_coords:[t,x,y,z];
3 lg:matrix([exp(2*z),0,0,0],
4 [0,-exp(-2*j*z),0,0],
5 [0,0,-exp(-2*k*z),0],
6 [0,0,0,-1]
7 );
8 cmetric();
9 scurvature();
10 leinstein(true);
The output from line 9 shows that the scalar curvature is constant,
which is a necessary condition for any spacetime that we want to
14
Max Born, Einstein’s Theory of Relativity, 1920. In the 1962 Dover edition,
the relevant passage is on p. 320
15
A metric of this general form is referred to as a Kasner metric. One usually
sees it written with a logarithmic change of variables, so that z appears in the
base rather than in the exponent.
Note that it has many features in common with the complex oscilla-
tory solution we found above. There are transverse length contrac-
tions that decay and oscillate in exactly the same way. The presence
of the dφ dt term tells us that this is a non-static, rotating solution
— exactly like the one that Einstein and Born had in mind in their
prototypical example! We typically obtain this type of effect due
to frame dragging by some rotating massive body (see p. 149), and
the Petrov solution can indeed be interpreted as the spacetime that
exists in the vacuum on the exterior of an infinite, rigidly rotating
cylinder of “dust” (see p. 132).
The complicated Petrov metric might seem like the furthest pos-
sible thing from a uniform gravitational field, but in fact it is about
the closest thing general relativity provides to such a field. We
first note that the metric has Killing vectors ∂z , ∂φ , and ∂r , so it
has at least three out of the four translation symmetries we ex-
pect from a uniform field. By analogy with electromagnetism, we
would expect this symmetry to be absent in the radial direction,
since by Gauss’s law the electric field of a line of charge falls off
like 1/r. But surprisingly, the Petrov metric is also uniform ra-
dially. It is possible√to give the fourth killing
√ vector explicitly (it
is ∂r + z∂z + (1/2)( 3t − φ)∂φ − (1/2)( 3φ + t)∂t ), but it is per-
haps more transparent to check that it represents a field of constant
strength (problem 5, p. 290).
For insight into this surprising result, recall that in our attempt
at constructing the Cartesian version of this metric, we ran into the
16
Petrov, in Recent Developments in General Relativity, 1962, Pergamon, p.
383. For a presentation that is freely accessible online, see Gibbons and Gielen,
“The Petrov and Kaigorodov-Ozsváth Solutions: Spacetime as a Group Mani-
fold,” arxiv.org/abs/0802.4082.
17
http://golem.ph.utexas.edu/string/archives/000550.html
from page 285 has constant values of R = 1/2 and k = 1/4. Note
that Maxima’s ctensor package has built-in functions for these; you
have to call the lriemann and uriemann before calling them.
(b) Similarly, show that the Petrov metric
√ √
ds2 = − dr2 − e−2r dz 2 + er [2 sin 3r dφ dt − cos 3r(dφ2 − dt2 )]
Remark: Surprisingly, one can have a spacetime on which every possible curva-
ture invariant vanishes identically, and yet which is not flat. See Coley, Hervik,
and Pelavas, “Spacetimes characterized by their scalar curvature invariants,”
arxiv.org/abs/0901.0791v2.
Problems 291
292 Chapter 7 Symmetries
Chapter 8
Sources
8.1 Sources in general relativity
8.1.1 Point sources in a background-independent theory
The Schrödinger equation and Maxwell’s equations treat space-
time as a stage on which particles and fields act out their roles.
General relativity, however, is essentially a theory of spacetime it-
self. The role played by atoms or rays of light is so peripheral
that by the time Einstein had derived an approximate version of
the Schwarzschild metric, and used it to find the precession of Mer-
cury’s perihelion, he still had only vague ideas of how light and mat-
ter would fit into the picture. In his calculation, Mercury played the
role of a test particle: a lump of mass so tiny that it can be tossed
into spacetime in order to measure spacetime’s curvature, without
worrying about its effect on the spacetime, which is assumed to be
negligible. Likewise the sun was treated as in one of those orches-
tral pieces in which some of the brass play from off-stage, so as to
produce the effect of a second band heard from a distance. Its mass
appears simply as an adjustable parameter m in the metric, and if
we had never heard of the Newtonian theory we would have had no
way of knowing how to interpret m.
When Schwarzschild published his exact solution to the vacuum
field equations, Einstein suffered from philosophical indigestion. His
strong belief in Mach’s principle led him to believe that there was a
paradox implicit in an exact spacetime with only one mass in it. If
Einstein’s field equations were to mean anything, he believed that
they had to be interpreted in terms of the motion of one body rela-
tive to another. In a universe with only one massive particle, there
would be no relative motion, and so, it seemed to him, no motion
of any kind, and no meaningful interpretation for the surrounding
spacetime.
Not only that, but Schwarzschild’s solution had a singularity
at its center. When a classical field theory contains singularities,
Einstein believed, it contains the seeds of its own destruction. As
we’ve seen on page 242, this issue is still far from being resolved, a
century later.
However much he might have liked to disown it, Einstein was
now in possession of a solution to his field equations for a point
source. In a linear, background-dependent theory like electromag-
293
netism, knowledge of such a solution leads directly to the ability to
write down the field equations with sources included. If Coulomb’s
law tells us the 1/r2 variation of the electric field of a point charge,
then we can infer Gauss’s law. The situation in general relativity
is not this simple. The field equations of general relativity, unlike
the Gauss’s law, are nonlinear, so we can’t simply say that a planet
or a star is a solution to be found by adding up a large number of
point-source solutions. It’s also not clear how one could represent a
moving source, since the singularity is a point that isn’t even part
of the continuous structure of spacetime (and its location is also
hidden behind an event horizon, so it can’t be observed from the
outside).
γ2 ρ γ2 v ρ
0 0
0 0 γ2 v ρ γ2 v 2 ρ 0 0
Tµ ν =
0
.
0 0 0
0 0 0 0
Experimental tests
But how do we know that this prediction is even correct? Can
it be verified in the laboratory? The classic laboratory test of the
strength of a gravitational source is the 1797 Cavendish experiment,
in which a torsion balance was used to measure the very weak grav-
c / The Kreuzer experiment. 1. There are two passive masses, P, and an active mass A consisting of
a single 23-cm diameter teflon cylinder immersed in a fluid. The teflon cylinder is driven back and forth
with a period of 400 s. The resulting deflection of the torsion beam is monitored by an optical lever and
canceled actively by electrostatic forces from capacitor plates (not shown). The voltage required for this active
cancellation is a measure of the torque exerted by A on the torsion beam. 2. Active mass as a function of
temperature. 3. Passive mass as a function of temperature. In both 2 and 3, temperature is measured in units
of ohms, i.e., the uncalibrated units of a thermistor that was immersed in the liquid.
4
Phys. Rev. Lett. 57 (1986) 21. The result is summarized in section 3.7.3 of
the review by Will.
Some examples
We conclude this introduction to the stress-energy tensor with
some illustrative examples.
A perfect fluid Example: 4
For a perfect fluid, we have
Tab = (ρ + P)va vb − sPgab ,
where s = 1 for our + − −− signature or −1 for the signature
− + ++, and v represents the coordinate velocity of the fluid’s rest
frame.
Suppose that the metric is diagonal, but its components are vary-
ing, gαβ = diag(A2 , −B 2 , . . .). The properly normalized velocity
vector of an observer at (coordinate-)rest is v α = (A−1 , 0, 0, 0).
Lowering the index gives vα = (sA, 0, 0, 0). The various forms of
the stress-energy tensor then look like the following:
T00 = A2 ρ T11 = B 2 P
T 00 = sρ T 11 = −sP
T 00 = A−2 ρ T 11 = B −2 P.
ds2 = f 2 dt 2 − f −2 dr 2 + . . . ,
Γ ttr = f 0 /f
Γ θθr = Γ φφr = r −1
1 load(ctensor);
2 ct_coords:[t,r,theta,phi];
3 depends(f,r);
4 depends(ten,r); /* tension depends on r */
5 depends(mu,r); /* mass/length depends on r */
6 lg:matrix([f^2,0,0,0],
7 [0,-f^-2,0,0],
8 [0,0,-r^2,0],
9 [0,0,0,-r^2*sin(theta)^2]);
10 cmetric();
11 christof(mcs);
12 /* stress-energy tensor, T^mu_nu */
13 t:r^-2*matrix(
14 [mu,0,0,0],
15 [0,ten,0,0],
16 [0,0,0,0],
17 [0,0,0,0]
18 );
19 /*
20 Compute covariant derivative of the stress-energy
21 tensor with respect to its first index. The
22 function checkdiv is defined so that the first
23 index has to be covariant (lower); the T I’m
24 putting in is T^mu_nu, and since it’s symmetric,
25 that’s the same as T_mu^nu.
26 */
27 checkdiv(t);
6
“Theory of gravitation theories: a no-progress report,” Sotiriou, Faraoni,
and Liberati, http://arxiv.org/abs/0707.2748
Singularity theorems
An important example of the use of the energy conditions is that
Hawking and Ellis have proved that under the assumption of the
strong energy condition, any body that becomes sufficiently com-
pact will end up forming a singularity. We might imagine that
Current status
The current status of the energy conditions is shaky. Although
it is clear that all of them hold in a variety of situations, there are
strong reasons to believe that they are violated at both microscopic
and cosmological scales, for reasons both classical and quantum-
mechanical.12 We will see such a violation in the following section.
However, there are general reasons to believe that such violations
cannot be too extreme, or else they would result in instability of the
form of matter in question.13
12
Barcelo and Visser, “Twilight for the energy conditions?,” http://arxiv.
org/abs/gr-qc/0205066v1.
13
Buniy and Hsu, “Instabilities and the null energy condition,” http://arxiv.
org/abs/hep-th/0502203.
3. It should be coordinate-independent.
1 load(ctensor);
2 dim:3;
3 ct_coords:[r,theta,phi];
4 depends(f,t);
5 lg:matrix([f,0,0],
6 [0,r^2,0],
7 [0,0,r^2*sin(theta)^2]);
8 cmetric();
9 einstein(true);
Line 2 tells Maxima that we’re working in a space with three di-
mensions rather than its default of four. Line 4 tells it that f is a
function of time. Line 9 uses its built-in function for computing the
Einstein tensor Gab . The result has only one nonvanishing compo-
nent, Gtt = (1 − 1/f )/r2 . This has to be constant, and since scaling
can be absorbed in the factor a(t) in the 3+1-dimensional metric,
we can just set the value of Gtt more or less arbitrarily, except for
its sign. The result is f = 1/(1 − kr2 ), where k = −1, 0, or 1.
1 load(ctensor);
2 ct_coords:[t,r,theta,phi];
3 depends(a,t);
4 lg:matrix([1,0,0,0],
5 [0,-a^2/(1-k*r^2),0,0],
6 [0,0,-a^2*r^2,0],
7 [0,0,0,-a^2*r^2*sin(theta)^2]);
8 cmetric();
9 einstein(true);
The result is
2
ȧ
Gtt = 3 + 3ka−2
a
2
ä ȧ
Grr = Gθθ = Gφφ =2 + + ka−2 ,
a a
2
ȧ
3 + 3ka−2 − Λ = 8πρ
a
2
ä ȧ
2 + + ka−2 − Λ = −8πP .
a a
16
People sometimes incorrectly overstate this conclusion about the gravity
inside a hole according to general relativity. In the case of a spherical shell
of mass in an otherwise empty universe, it is true that the spacetime inside
is flat, but there is time dilation inside the shell compared to time at infinity,
and the Schwarzschild coordinates cannot be used inside the shell if they are to
match up with Schwarzschild coordinates outside the shell. See Zhang and Yi,
arxiv.org/abs/1203.4428.
17
Steigman, Ann. Rev. Nucl. Part. Sci. 57 (2007) 463. These tests are
stated in terms of the Hubble “constant” H = ȧ/a, which is actually varying
over cosmological time-scales. The nuclear helium-deuterium ratio is sensitive
to Ḣ/H.
18
Hawking and Ellis, “The Cosmic Black-Body Radiation and the Existence of
Singularities in Our Universe,” Astrophysical Journal, 152 (1968) 25. Available
online at articles.adsabs.harvard.edu.
3. The size of the solar system increases at this rate as well (i.e.,
gravitationally bound systems get bigger, including the earth
and the Milky Way).
• All the above hypotheses are false, and in fact none of these
sizes increases at all.
General-relativistic predictions
Does general relativity correctly reproduce these observations?
General relativity is mainly a theory of gravity, so it should be well
within its domain to explain why the solar system does not ex-
pand detectably while intergalactic distances do. It is impractical
to solve the Einstein field equations exactly so as to describe the
internal structure of all the bodies that occupy the universe: galax-
ies, superclusters, etc. We can, however, handle simple cases, as
in example 20 on page 345, where we display an exact solution for
the case of a universe containing only two things: an isolated black
hole, and an energy density described by a cosmological constant.
We find that the characteristic scale of the black hole, i.e., the radius
of its event horizon, does not increase with time. A fuller treatment
of these issues is given on p. 350, after some facts about realis-
tic cosmologies have been established. The result is that although
19
http://arxiv.org/abs/0912.2947v1
20
private communication, Jan. 4, 2010
strategy for maximizing its proper time is to make sure that its dr
is extremely small when a is extremely large. The geodesic must
therefore have nearly constant r at the end. This makes it sound as
though the particle was decelerating, but in fact the opposite is true.
If r is constant, then the particle’s spacelike distance from the origin
is just ra(t), which blows up exponentially. The near-constancy of
the coordinate r at large t actually means that the particle’s motion
at large t isn’t really due to the particle’s inertial memory of its
original motion, as in Newton’s first law. What happens instead
is that the particle’s initial motion allows it to move some distance
23
A computation of the Einstein tensor with ds2 = dt2 − a2 (1 − kr2 )−1 dr2
shows that k enters only via a factor the form (. . .)e(...)t + (. . .)k. For large t, the
k term becomes negligible, and the Einstein tensor becomes Gab = g ab Λ, This is
consistent with the approximation we used in deriving the solution, which was
to ignore both the source terms and the k term in the Friedmann equations.
The exact solutions with Λ > 0 and k = −1, 0, and 1 turn out in fact to be
equivalent except for a change of coordinates.
d2 x i j
i dx dx
k
= Γ jk .
dλ2 dλ dλ
from page 179. The nonvanishing Christoffel symbols for the 1+1-
dimensional metric ds2 = dt 2 −a2 dr 2 are Γ rtr = ȧ/a and Γ tr r = ȧa.
Setting T = 1 for convenience, we have Γ rtr = 1 and Γ tr r = e−2t .
We conjecture that the particle remains at the Rsame value of r .
Given this conjecture, the particle’s proper time ds is simply the
same as its time coordinate t, and we can therefore use t as an
affine coordinate. Letting λ = t, we have
2
d2 t
dr
− Γ tr r =0
dt 2 dt
0 − Γ tr r ṙ 2 = 0
ṙ = 0
r = constant
The metric
dr 2
2m 1 2
2
ds = 1 − − Λr dt 2 − −r 2 dθ2 −r 2 sin2 θ dφ2
r 3 1 − 2m
r − 1
3 Λr 2
ä = −ca−2
√
ȧ = 2ca−1/2 .
∇j T bc = ∂j T bc + Γjd
b dc c bd
T + Γjd T .
ȧ
∇µ T tµ = ∂t T tt + 3 T tt ,
a
or ρ̇/ρ = −3ȧ/a, which can be rewritten as
d d
ln ρ = −3 ln a,
dt dt
producing the proportionality originally claimed.
28
Komatsu et al., 2010, arxiv.org/abs/1001.4538
Dark matter
Another constraint comes from models of nucleosynthesis dur-
ing the era shortly after the Big Bang (before the formation of the
first stars). The observed relative abundances of hydrogen, helium,
and deuterium cannot be reconciled with the density of “dust” (i.e.,
nonrelativistic matter) inferred from the observational data. If the
inferred mass density were entirely due to normal “baryonic” matter
(i.e., matter whose mass consisted mostly of protons and neutrons),
then nuclear reactions in the dense early universe should have pro-
ceeded relatively efficiently, leading to a much higher ratio of helium
to hydrogen, and a much lower abundance of deuterium. The con-
clusion is that most of the matter in the universe must be made of
an unknown type of exotic non-baryonic matter, known generically
as “dark matter.”
The existence of nonbaryonic matter is also required in order to
reconcile the observed density of galaxies with the observed strength
32
See Carroll, “The Cosmological Constant,” http://www.livingreviews.
org/lrr-2001-1 for a full mathematical treatment of such models.
Current discrepancies
Even with the inclusion of dark matter, there is a problem
with the abundance of lithium-7 relative to hydrogen, which models
greatly overpredict.33
As of 2019, there is also tension between the values of the Hub-
ble constant found from distance-ladder techniques and analysis of
the CMB and BAO. The former34 give about 74.2 ± 1.8, in units of
km/s/Mpc, while the latter give about 67.5 ± 0.5. This may simply
be a case where people always underestimate their systematic errors,
or it may be a sign of new physics causing the universe to accelerate
its expansion more rapidly than predicted by ΛCDM models. Pro-
posed solutions involve physical ingredients such as sterile neutrinos,
axions, and phantom energy (example 26, p. 352).
39
Another good technical reasons for thinking of φ as relating to the gravita-
tional constant is that general relativity has a standard prescription for describ-
ing fields on a background of curved spacetime. The vacuum field equations of
general relativity can be derived from the principle of least action, and although
the details are beyond the scope of this book (see, e.g., Wald, General Relativ-
ity, appendix E), the general idea is that we define a Lagrangian density LG
that depends on the Ricci scalar curvature, and then extremize its integral over
all possible histories of the evolution of the gravitational field. If we want to
describe some other field, such as matter, light, or φ, we simply take the special-
relativistic Lagrangian LM for that field, change all the derivatives to covariant
derivatives, and form the sum (1/G)LG + LM . In the Brans-Dicke theory, we
have three pieces, (1/G)LG + LM + Lφ , where LM is for matter and Lφ for φ.
If we were to interpret φ as a rescaling of inertia, then we would have to have φ
appearing as a fudge factor modifying all the inner workings of LM . If, on the
other hand, we think of φ as changing the value of the gravitational constant G,
then the necessary modification is extremely simple. Brans and Dicke introduce
one further modification to Lφ so that the coupling constant ω between matter
and φ can be unitless. This modification has no effect on the wave equation of
4 + 3ω 2 t 2/(4+3ω)
φ = 8π ρo t ,
6 + 4ω o to
where ρo is the density of matter in the universe at time t = to .
When the density of matter is small, G is large, which has the same
observational consequences as the disappearance of inertia; this is
exactly what one expects according to Mach’s principle. For ω → ∞,
the gravitational “constant” G = 1/φ really is constant.
Returning to the thought experiment involving the 22-caliber ri-
fle fired out the window, we find that in this imaginary universe, with
a very small density of matter, G should be very large. This causes
a frame-dragging effect from the laboratory on the gyroscope, one
much stronger than we would see in our universe. Brans and Dicke
calculated this effect for a laboratory consisting of a spherical shell,
and although technical difficulties prevented the reliable extrapo-
lation of their result to ρo → 0, the trend was that as ρo became
small, the frame-dragging effect would get stronger and stronger,
presumably eventually forcing the gyroscope to precess in lock-step
with the laboratory. There would thus be no way to determine, once
the bullet was far away, that the laboratory was rotating at all —
in perfect agreement with Mach’s principle.
41
Bertotti, Iess, and Tortora, “A test of general relativity using radio links
with the Cassini spacecraft,” Nature 425 (2003) 374
Problems 367
8 In problem 7 on page 209, we analyzed the properties of the
metric
ds2 = e2gz dt2 − dz 2 .
(a) In that problem we found that this metric had the same proper-
ties at all points in space. Verify in particular that it has the same
scalar curvature R at all points in space.
(b) Show that this is a vacuum solution in the two-dimensional (t, z)
space.
(c) Suppose we try to generalize this metric to four dimensions as
Problems 369
370 Chapter 8 Sources
Chapter 9
Gravitational Waves
9.1 The speed of gravity
In Newtonian gravity, gravitational effects are assumed to propagate
at infinite speed, so that for example the lunar tides correspond at
any time to the position of the moon at the same instant. This
clearly can’t be true in relativity, since simultaneity isn’t something
that different observers even agree on. Not only should the “speed of
gravity” be finite, but it seems implausible that it would be greater
than c; in section 2.2 (p. 51), we argued based on empirically well
established principles that there must be a maximum speed of cause
and effect. Although the argument was only applicable to special
relativity, i.e., to a flat spacetime, it seems likely to apply to general
relativity as well, at least for low-amplitude waves on a flat back-
ground. As early as 1913, before Einstein had even developed the
full theory of general relativity, he had carried out calculations in the
weak-field limit showing that gravitational effects should propagate
at c. We will work out an argument to this effect (using a different
technique than Einstein’s) in section 9.2.3. This seems eminently
reasonable, since (a) it is likely to be consistent with causality, and
(b) G and c are the only constants with units that appear in the
field equations (obscured by our choice of units, in which G = 1 and
c = 1), and the only velocity-scale that can be constructed from
these two constants is c itself.1
As shown by the following timeline, Einstein’s prediction was
surprisingly difficult to verify.
1913 Einstein predicts gravitational waves traveling
at c.
1982 Hulse-Taylor pulsar (pp. 232, 372) seen to lose
energy at the rate predicted by general relativ-
ity’s prediction of gravitational radiation.
2016-2017 Direct detection of gravitational waves and ver-
ification that they propagate at c.
Why did this process take over a century? Naive arguments
suggest that it should have been much easier. Workers as early as
1
High-amplitude waves need not propagate at c. For example, general rela-
tivity predicts that a gravitational-wave pulse propagating on a background of
curved spacetime develops a trailing edge that propagates at less than c (Misner,
Thorne, and Wheeler, p. 957). This effect is weak when the amplitude is small
or the wavelength is short compared to the scale of the background curvature.
371
Newton and Laplace had investigated the consequences of a grav-
itational force that propagated at some finite speed. It is easy to
show that, if nonrelativistic ideas about spacetime are retained, the
predicted results are dramatic and not consistent with observation.
For example, the earth and moon orbit about their common center
of mass, which is inside the earth but offset from the earth’s center.
Suppose that we retain Newton’s ideas about spacetime, but modify
Newton’s law of gravity to incorporate a time delay, with changes
in the gravitational field propagating at some speed u. The force
acting on the moon would then point toward the earth’s location at
a slightly earlier time, and this force would therefore have a com-
ponent parallel to the moon’s direction of motion. The force would
do positive work on the moon and also exert a positive torque, the
result being that the moon would spiral away. This is not consistent
with the fact that the earth-moon system has remained fairly stable
for billions of years, unless we take u to be very large. From the sta-
bility of orbits in the solar system, Laplace estimated u & 1015 m/s,
many orders of magnitude greater than c. This seemed to sup-
port the Newtonian picture, in which gravity acts instantaneously
at a distance. A time delay in Newtonian spacetime would also have
been easily detected by twentieth-century measurements using space
probes and radio astronomy.2
The trouble with such arguments is that when we substitute rel-
ativistic spacetime for Newtonian spacetime, it is no longer expected
that a time-delayed field will point toward the retarded position of
the source. For example, if an electric charge moves inertially, and
is observed in a frame in which it is moving, then Lorentz invari-
ance requires that its electric field lines be straight, and converge on
the charge’s present position in that frame.3 The speed of gravity
therefore turns out to be much harder to measure than Laplace had
believed.
2 2 1
ds = dt − 1 + sin x dx2 − dy 2 − dz 2
10
It is transverse, it propagates at c(= 1), and the fact that gxx is the
reciprocal of gyy makes it volume-conserving. The following Maxima
program calculates its Einstein tensor:
1 load(ctensor);
2 ct_coords:[t,x,y,z];
3 lg:matrix([1,0,0,0],
4 [0,-(1+A*sin(z-t)),0,0],
5 [0,0,-1/(1+A*sin(z-t)),0],
6 [0,0,0,-1]);
7 cmetric();
8 einstein(true);
A2 cos2 (z − t)
Gtt = −
2 + 4A sin(z − t) + 2A2 sin2 (z − t)
dy 2
ds2 = dt 2 − (1 + f ) dx 2 − − dz 2 ,
1+f
where f = A sin(z − t). This doesn’t seem likely to be an exact
solution for large amplitudes, since the x and y coordinates are
treated asymmetrically. In the extreme case of |A| ≥ 1, there
would be singularities in gy y , but not in gxx . Clearly the metric will
have to have some kind of nonlinear dependence on f , but we just
haven’t found quite the right nonlinear dependence. Suppose we
try something of this form:
ds2 = dt 2 − 1 + f + cf 2 dx 2 − 1 − f + df 2 dy 2 − dz 2
11
Some of the history is related at http://en.wikipedia.org/wiki/Sticky_
bead_argument.
1 load(ctensor);
2 ct_coords:[t,x,y,z];
3 f : A*exp(%i*k*(z-t));
4 lg:matrix([1,0,0,0],
5 [0,-(1+f+c*f^2),0,0],
6 [0,0,-(1-f+d*f^2),0],
7 [0,0,0,-1]);
8 cmetric();
9 einstein(true);
1 load(ctensor);
2 ct_coords:[t,x,y,z];
3 depends(p,[z,t]);
4 depends(q,[z,t]);
ds2 = (1 − h) dt 2 − dx 2 − dy 2 − (1 + h) dz 2 + 2h dz dt,
Problems 385
Appendix 1: Hints and solutions
Hints
Hints for Chapter 1
Page 38, problem 5: Apply the equivalence principle.
Problems 387
Page 83, problem 1:
(a) Let t be the time taken in the lab frame for the light to go from one mirror to the other,
and t0 the corresponding interval in the clock’s frame. Then t0 = L, and (vt)2 +L2 = t2 , where the
use of the same L in both equations makes use of our prior knowledge that there is no transverse
length contraction. Eliminating L, we find the expected expression for γ, which is independent
of L (b) If the result of a were independent of L, then the relativistic time dilation would depend
on the details of the construction of the clock measuring the time dilation. We would be forced
to abandon the geometrical interpretation of special relativity. (c) The effect is to replace vt with
vt+at2 /2 as the quantity inside the parentheses in the expression (. . .)2 +L2 = t2 . The resulting
correction terms are of higher order in t than the ones appearing in the original expression, and
can therefore be made as small in relative size as desired by shortening the time t. But this is
exactly what happens when we make the clock sufficiently small.
Page 84, problem 2:
Since gravitational redshifts can be interpreted as gravitational time dilations, the gravi-
tational time dilation is given by the difference in gravitational potential g dr (in units where
c = 1). The kinematic effect is given by dγ = d(v 2 )/2 = ω 2 r dr. The ratio of the two effects is
ω 2 R cos λ/g, where R is the radius of the Earth and λ is the latitude. Tokyo is at 36 degrees
latitude, and plugging this in gives the claimed result.
Page 84, problem 3:
(a) Reinterpret figure j on p. 81 as a picture of a Sagnac ring interferometer. Let light waves
1 and 2 move around the loop in opposite senses. Wave 1 takes time t1i to move inward along
the crack, and time t1o to come back out. Wave 2 takes times t2i and t2o . But t1i = t2i (since
the two world-lines are identical), and similarly t1o = t2o . Therefore creating the crack has no
effect on the interference between 1 and 2, and splitting the big loop into two smaller loops
merely splits the total phase shift between them. (b) For a circular loop of radius r, the time
of flight of each wave is proportional to r, and in this time, each point on the circumference
of the rotating interferometer travels a distance v(time) = (ωr)(time) ∝ r2 . (c) The effect is
proportional to area, and the area is zero. (d) The light clock in c has its two ends synchronized
according to the Einstein prescription, and the success of this synchronization verifies Einstein’s
assumption of commutativity in this particular case. If we make a Sagnac interferometer in the
shape of a triangle, then the Sagnac effect measures the failure of Einstein’s assumption that all
three corners can be synchronized with one another.
Page 84, problem 5:
Here is the program:
1 L1:matrix([cosh(h1),sinh(h1)],[sinh(h1),cosh(h1)]);
2 L2:matrix([cosh(h2),sinh(h2)],[sinh(h2),cosh(h2)]);
3 T:L1.L2;
4 taylor(taylor(T,h1,0,2),h2,0,2);
The diagonal components of the result are both 1 + η12 /2 + η22 /2 + η1 η2 + . . . Everything after
the 1 is nonclassical. The off-diagonal components are η1 + η2 + η1 η22 /2 + η2 η12 /2 + . . ., with the
third-order terms being nonclassical.
Problems 389
lower the index on dsa , we find dsa = (dst , c−2 dsx ).
Page 121, problem 7:
According to the Einstein summation convention, the repeated index implies a sum, so the
result is a scalar. As shown in example 15 on p. 107, each term in the sum equals 1, so the
result is unitless and simply equals the number of dimensions.
Page 120, problem 3:
(a) The first two violate the rule that summation only occurs over up-down pairs of indices.
The third expression would result in a quantity that couldn’t be classified as either contravariant
or covariant. (b) In differential geometry, different elements of the same tensor can have different
units. Since, as remarked in the problem, Uaa were to be interpreted as a sum, this mean
adding things that had different units. In the expression pa − qa , even if we suppose that
p and q both represent the same type of physical quantity, e.g., force, their covariant and
contravariant versions would not necessarily have the same units unless we happened to be
working in coordinates such that the metric was unitless.
Page 120, problem 4:
Assuming the mountaineer uses radians and the metric system, the coordinates have units
1, 1, and m (where 1 means a unitless quantity and m means meters — radians are not really
units). Therefore the units of an infinitesimal difference in coordinates dsa are also (1, 1, m).
Because the coordinates are orthogonal, the metric is diagonal. If we want gab dsa dsb to have
units of m2 , then its diagonal elements must have units of (m2 , m2 , 1). The upper-index metric
g ab is the inverse of its lower-index version gab , so its units are (m−2 , m−2 , 1). Mechanical work
has units of N · m, so given dW = Fa dsa , the units of Fa must be (N · m, N · m, N). Raising the
index on the force using g ab gives (N/m, N/m, N).
Page 120, problem 5:
The only aspect of the geometrical representation that needs to be changed is that instead of
representing an upper-index vector using a pair of parallel lines, we should use a pair of parallel
planes.
Page 121, problem 8:
The coordinate T would have a discontinuity of 2πωr2 /(1 − ω 2 r2 ). Reinserting factors of c
to make it work out in SI units, we have 2πωr2 c−2 /(1 − ω 2 r2 c−2 ) ≈ 207 ns. The exact error in
position that would result is dependent on the geometry of the current position of the satellites,
but it would be on the order of c∆T , which is ∼ 100 m. This is considerably worse than civilian
GPS’s 20-meter error bars.
Page 121, problem 9:
The process that led from the Euclidean metric of example 8 on page 103 to the non-
Euclidean one of equation [3] on page 112 was not just a series of coordinate transformations.
At the final step, we got rid of the variable t, reducing the number of dimensions by one.
Similarly, we could take a Euclidean three-dimensional space and eliminate all the points except
for the ones on the surface of the unit sphere; the geometry of the embedded sphere is non-
Euclidean, because we’ve redefined geodesics to be lines that are “as straight as they can be”
(i.e., have minimum length) while restricted to the sphere. In the example of the carousel, the
final step effectively redefines geodesics so that they have minimal length as determined by a
chain of radar measurements.
Problems 391
of reference.
Page 156, problem 10:
Such a transformation would take an energy-momentum four-vector (E, p), with E > 0, to
a different four-vector (E 0 , p0 ), with E 0 < 0. That transformation would also have the effect of
transforming a timelike displacement vector from the future light cone to the past light cone. But
the Lorentz transformations were specifically constructed so as to preserve causality (property
L5 on p. 51), so this can’t happen.
Page 156, problem 11:
A spatial plane is determined by the light’s direction of propagation and the relative ve-
locity of the source and observer, so the 3+1 case reduces without loss of generality to 2+1
dimensions. The frequency four-vector must be lightlike, so its most general possible form
is (f , f cos θ, f sin θ), where θ is interpreted as the angle between the direction of propaga-
tion and the relative velocity. Putting this through a Lorentz boost along the x axis, we find
f 0 = γf (1 + v cos θ), which agrees with Einstein’s equation on page ??, except for the arbitrary
convention involved in defining the sign of v.
Page 157, problem 12:
The exact result depends on how one assumes the charge is distributed, so this can’t be any
more than a rough estimate. R −4 The energy density is (1/8πk)E 2 ∼ ke2 /r4 , so the total energy
R −2
is an integral of the form r dV ∼ r dr, which diverges like 1/r as the lower limit of
integration approaches zero. This tells us that most of the energy is at small values of r, so to
a rough approximation we can just take the volume of integration to be r3 and multiply by a
fixed energy density of ke2 /r4 . This gives an energy of ∼ ke2 /r. Setting this equal to mc2 and
solving for r, we find r ∼ ke2 /mc2 ∼ 10−15 m.
Remark: Since experiments have shown that electrons do not have internal structure on this
scale, we conclude that quantum-mechanical effects must prevent the energy from blowing up
as r → 0.
Page 157, problem 17:
Doing a transformation first by u and then by v results in E00 = E−v×(u×E)+(u+v)×B.
This is not of the same form, because if B = 0, we can have E00 6= E.
Page 209, problem 1: The equation for the Christoffel symbols in terms of the metric was
1
Γcab = g cd (∂a gbd + ∂b gad − ∂d gab ) .
2
Because both the metric matrix and its inverse appear, we get factors of α and 1/α that cancel
out. Therefore there is no effect on the Christoffel symbols or on the geodesics. This certainly
makes sense in the case of α = −1, because this is just a change in the choice of signature, which
is an arbitrary convention. It also makes sense that rescaling the metric by a nonzero positive
factor has no effect on the geodesics — we would expect this to change the measurement of
geodesics, but we would not expect it to make different curves be geodesics.
Page 209, problem 3:
Problems 393
assume the interpretation given above, since otherwise real-number pairs like (φ1 , φ2 ) wouldn’t
have the same topology as points on the rational-number torus.
Page 210, problem 11: In the torus, we can construct a closed curve C that encircles the
hole. If we have a homeomorphism, C must have an image C0 under that homeomorphism
that is a closed curve in the sphere. C0 can then be contracted continuously to a point, and
since the inverse of the homeomorphism is also continuous, it would be possible to contract C
continuously to a point. But this is impossible because C encircles the hole.
Page 210, problem 12: (a) The Christoffel symbols are (assuming I didn’t make a mistake
in calculating them by hand) Γtxx = (1/2)ptp−1 and Γxxt = Γxtx = (1/2)pt−1 . (b) After that, I
resorted to a computer algebra system (Maxima), which told me that, for example, the Ricci
tensor has Rtt = (p/2 − p2 /4)t−2 .
Page 211, problem 13:
The answer to this is a little subtle, since it depends on how we take the limit. Suppose we
join two planes with a section of a cylinder having radius ρ, and let ρ go to zero. The Gaussian
curvature of a cylinder is zero, so in this limit we fail to reproduce the correct result. On the
other hand, suppose we take a discus of radius ρ1 whose edge has a curve of radius ρ2 . in the
limit ρ1 → +∞, ρ2 → 0+ , we can get either K = 1/(ρ1 ρ2 ) → 0 or K → +∞, depending on how
quickly ρ1 and ρ2 approach their limits.
Page 211, problem 14:
The definition of the proper time is dτ 2 = dxµ dxµ . Dividing by dλ2 on both sides and using
dots for differentiation with respect to λ, we have
2
dτ
= ẋµ ẋµ .
dλ
This allows us to determine dτ / dλ up to a sign, and the sign can be easily determined by
inspection of the solution. This determines the relation between τ and λ up to an additive
constant. Alternatively, one could just normalize the velocity vector when setting the initial
conditions.
Page 258, problem 3: (a) In the center of mass frame, symmetry guarantees that the test
particle exits with a speed equal to the speed with which it entered, and the entry and exit
velocities are v and −v. Now let’s switch to the sun’s frame. This involves adding u to all
velocities, so the entry and exit velocities become v + u and −v + u. The difference in speed is
2u.
(b) The derivation assumed that velocities add linearly when you change frames of reference,
which is a nonrelativistic approximation. Relativistically, velocities combine not like u + v but
like (u + v)/(1 + uv). If you put in v = 1, the result for the combined velocity is always 1.
This is a funny case where we can get the answer to a gravitational problem purely through
special relativity. We might worry that the SR-based answer is wrong, because we really need
GR for gravity. But we can get the same answer from GR, since GR says that a test particle
always follows a geodesic, and a lightlike geodesic always remains lightlike. The reason SR
worked is that an observer could watch a patch of flat space far away from the black hole,
1 load(ctensor);
2 dim:2;
3 ct_coords:[r,theta];
4 lg:matrix([-1,0],
5 [0,-r^2/(1-w^2*r^2)]);
6 cmetric();
7 ricci(true);
8 scurvature();
The result is R = 6ω 2 /(1 − 2ω 2 r2 + ω 4 r4 ). This blows up at r = 1/ω, which shows that this is
not a coordinate singularity. The fact that R does not blow up at r = 0 is consistent with our
earlier conclusion that r = 0 is a coordinate singularity, but would not have been sufficient to
prove that conclusion.
(c) The argument is incorrect. The Gaussian curvature is not just proportional to the angular
Problems 395
deficit , it is proportional to the limit of /A, where A is the area of the triangle. The area of
the triangle can be small, so there is no upper bound on the ratio /A. Debunking the argument
restores consistency with the answer to part b.
Page 259, problem 10: The only nonvanishing Christoffel symbol is Γttt = −1/2t. The
antisymmetric treatment of the indices in Rabcd = ∂c Γadb −∂d Γacb +Γace Γedb −Γade Γecb guarantees
that the Riemann tensor must vanish when there is only one nonvanishing Christoffel symbol.
Page 259, problem 11: The first thing one notices is that the equation Rab = k isn’t written
according to the usual rules of grammar for tensor equations. The left-hand side has two lower
indices, but the right-hand side has none. In the language of freshman physics, this is like
setting a vector equal to a scalar. Suppose we interpret it as meaning that each of R’s 16
components should equal k in a vacuum. But this still isn’t satisfactory, because it violates
coordinate-independence. For example, suppose we are initially working with some coordinates
0
xµ , and we then rescale all four of them according to xµ = 2xµ . Then the components of Rab
all scale down by a factor of 4. But this would violate the proposed field equation.
Page 259, problem 12: The following Maxima code calculates the Ricci tensor for a metric
with gtt = h and grr = k.
1 load(ctensor);
2 dim:3;
3 ct_coords:[t,r,phi];
4 depends(h,r);
5 depends(k,r);
6 lg:matrix([h,0,0],
7 [0,-k,0],
8 [0,0,-r^2]);
9 cmetric();
10 ricci(true);
Inspecting the output (not reproduced here), we see that Rφφ = 0 requires k 0 /k = h0 /h. Since
the logarithmic derivatives of h and k are the same, the two functions can differ by at most a
constant factor c. So now we do a second iteration of the calculation:
1 load(ctensor);
2 dim:3;
3 ct_coords:[t,r,phi];
4 depends(h,r);
5 lg:matrix([h,0,0],
6 [0,-c*h,0],
7 [0,0,-r^2]);
8 cmetric();
9 ricci(true);
The result for Rrr is independent of c. Since h is essentially the gravitational potential, we have
the requirements h0 > 0 (because gravity is attractive) and h00 < 0 (because gravity weakens
with distance). Therefore we find that Rrr is positive, and we do not obtain a vacuum solution.
Page 290, problem 2: (a) If she makes herself stationary relative to the sun, she will still
experience local geometrical changes because of the planets. (b) If it was to be impossible
for her to prove the universe’s nonstationarity, then any world-line she picked would have to
experience constant local geometrical conditions. A counterexample is any world-line extending
back to the Big Bang, which is a singularity with drastically different conditions than any other
region of spacetime. (c) To maintain a constant local geometry, she would have to “surf” the
wave, but she can’t do that, because it propagates at the speed of light. (d) There are places
where the local mass-energy density is increasing, and the field equations link this to a change
in the local geometry.
Page 290, problem 5:
Under these special conditions, the geodesic equations become r̈ = Γrtt ṫ2 , φ̈ = 0, ẗ = 0,
where the dots can in principle represent differentation with respect to any affine parameter we
like, but we intend to use the proper time s. By symmetry,√ √there will
√ be no motion in the z
direction. The Christoffel symbol equals −(1/2)er (cos 3r − 3 sin 3r). At a location where
√
the cosine equals 1, this is simply −er /2. For ṫ, we have dt/ ds = 1/ gtt = e−r/2 . The result of
the calculation is simply r̈ = −1/2, which is independent of r.
Page 291, problem 6:
The Petrov metric is one example. The metric has no singularities anywhere, so the r
coordinate can be extended from −∞ to +∞, and there is no point that can be considered the
center. The existence of a dφ dt term in the metric shows that it is not static.
A simpler example is a spacetime made by taking a flat Lorentzian space and making it wrap
around topologically into a cylinder, as in problem 1 on p. 120. As discussed in the solution
to that problem, this spacetime has a preferred state of rest in the azimuthal direction. In
a frame that is moving azimuthally relative to this state of rest, the Lorentz transformation
Problems 397
requires that the phase of clocks be adjusted linearly as a function of the azimuthal coordinate
φ. As described in section 3.5.4, this will cause a discontinuity once we wrap around by 2π, and
therefore clock synchronization fails, and this frame is not static.
Page 291, problem 7:
For an observer in a circular orbit at radius r, we can trivially tell that when r is large, the
result is Newtonian, so the Doppler shifts will be small and will be both redshifts and blueshifts.
I don’t know of any simple way to prove, without a calculation, that even at small radii there
will be both redshifts and blueshifts.
Let the units be such that the Schwarzschild radius is 1 (which means that the mass of the
black hole is 1/2). Vectors are expressed in Schwarzschild coordinates (t, r, θ, φ) The orbiting
observer is in the plane θ = π/2. The ± signs refer to the extreme cases of the orbiting observer
detecting a ray of light from the forward direction and the backward direction. Solving the
geodesic equation for a circular orbit, we find that the normalized velocity vector of the orbiting
observer is
3 −1/2
0
u = 1− (1, 0, 0, 2−1/2 r−3/2 ).
2r
This expression misbehaves for r < 3/2; for radii that small, there are no circular orbits.
(Circular orbits are also unstable for r < 3.) Let the velocity vector of the ray at detection,
with an arbitrary choice of affine parameter, be
v 0 = (1, 0, 0, ±(1 − 1/r)1/2 r−1 ).
The velocity vector of the distant observer emitting the ray is
u = (1, 0, 0, 0).
We would also like to extrapolate backward in time to find v, the velocity vector of the ray upon
emission by the distant source. The complete vector probably can’t be found in closed form,
but because there is a conserved energy, we can get the only component we need in closed form
as
v = (1 − 1/r, . . .).
The Doppler shift is
ω0 ua v 0a 3 −1/2 h
i
= = 1− 1 ∓ [2r(1 − 1/r)]−1/2 .
ω ub v b 2r
Graphing shows that all the way down to r = 3/2, one solution always has ω 0 /ω > 1 and one
ω 0 /ω < 1.
For large values of r, we can understand the leading-order behavior of this result in semi-
Newtonian terms. The Newtonian orbital velocity is v = (2r)−1/2 , which gives a special-
relativistic longitudinal Doppler shift 1 ∓ (2r)−1/2 + 4r 1
+ . . ., the gravitational time dilation
1
is 1 + 2r + . . ., and the product of these is 1 ∓ (2r)−1/2 3
+ 4r + . . ., in agreement with the exact
expression up to order 1/r.
It is also interesting to compare the maximum redshift Dc for our observer in a circular
orbit with the result of example 8, p. 267 for an observer infalling radially from rest at infinity,
which is a maximum redshift for that observer. Call the latter Dr . At large radii, Dr is a bigger
redshift, because the effect is
√ semi-Newtonian, and the radially infalling observer has velocity
that is higher by a factor of 2. But Dc blows up at r = 3/2, while Dr blows up only at r = 0.
Therefore there is a point where the two curves cross, which turns out to be at r = 2.
Page 367, problem 2: No. General relativity only allows coordinate transformations that are
smooth and one-to-one (see p. 98). This transformation is not smooth at t = 0.
Page 367, problem 5: (a) The Friedmann equations are
ä 1 4π
= Λ− (ρ + 3P )
a 3 3
and
2
ȧ 1 8π
= Λ+ ρ − ka−2 .
a 3 3
The first equation is time-reversal invariant because the second derivative stays the same under
time reversal. The second equation is also time-reversal invariant, because although the first
derivative flips its sign under time reversal, it is squared.
(b) We typically do not think of a singularity as being a point belonging to a manifold at all. If
we want to create this type of connected, symmetric back-to-back solution, then we need the Big
Bang singularity to be a point in the manifold. But this violates the definition of a manifold,
because then the Big Bang point would have topological characteristics different from those of
other points: deleting it separates the spacetime into two pieces.
Page 367, problem 4: Example 16 on page 337, the cosmic girdle, showed that a rope that
stretches over cosmological distances does expand significantly, unlike Brooklyn, nuclei, and
solar systems. Since the Milne universe is nothing but a flat spacetime described in funny
coordinates, something about that argument must fail. The argument used in that example
relied on the use of a closed cosmology, but the Milne universe is not closed. This is not a
completely satisfying resolution, however, because we expect that a rope in an open universe
will also expand, except in the special case of the Milne universe.
In a nontrivial open universe, every galaxy is accelerating relative to every other galaxy.
By the equivalence principle, these accelerations can also be seen as gravitational fields, and
tidal forces are what stretch the rope. In the special case of the Milne universe, there is no
acceleration of test particles relative to other test particles, so the rope doesn’t stretch.
Example 18 on page 340, the cosmic whip, resulted in the conclusion that the velocity of
the rope-end passing by cannot be interpreted as a measure of the velocity of the distant galaxy
to which the rope’s other end is hitched, which makes sense because cosmological solutions are
nonstationary, so there is no uniquely defined notion of the relative velocity of distant objects.
The Milne universe, however, is stationary, so such velocities are well defined. The key here is
that nothing is accelerating, so the time delays in the propagation of information do not lead to
ambiguities in extrapolating to a distant object’s velocity “now.”
The Milne case also avoids the paradox in which we could imagine that if the rope is suf-
ficiently long, its end would be moving at more than the speed of light. Although there is no
limit to the length of a rope in the Milne universe (there being no tidal forces), the Hubble
law cannot be extrapolated arbitrarily, since the expanding cloud of test particles has an edge,
beyond which there is only vacuum.
Page 367, problem 6: The cosmological constant is a scalar, so it doesn’t change under
reflection. The metric is also invariant under reflection of any coordinate. This follows because
Problems 399
we have assumed that the coordinates are locally Lorentzian, so that the metric is diagonal.
It can therefore be written as a line element in which the differentials are all squared. This
establishes that the Λgab is invariant under any spatial or temporal reflection.
The specialized form of the energy-momentum tensor diag(−ρ, P , P , P ) is also clearly in-
variant under any reflection, since both pressure and mass-energy density are scalars.
The form of the tensor transformation law for a rank-2 tensor guarantees that the diagonal
elements of such a tensor stay the same under a reflection. The off-diagonal elements will flip
sign, but since only the G and T terms in the field equation have off-diagonal terms, the field
equations remain valid under reflection.
In summary, the Einstein field equations retain the same form under reflection in any co-
ordinate. This important symmetry property, which is part of the Poincaré group in special
relativity, is retained when we make the transition to general relativity. It’s a discrete sym-
metry, so it wasn’t guaranteed to exist simply because of general covariance, which relates to
continuous coordinate transformations.
Page 367, problem 7: (a) The Einstein field equations are Gab = 8πTab + Λgab . That means
that in a vacuum, where T = 0, a cosmological constant is equivalent to ρ = (1/8π)Λ and
P = −(1/8π)Λ. This gives ρ + 3P = (1/8π)(−2Λ), which violates the SEC for Λ > 0, since part
of the SEC is ρ + 3P ≥ 0.
(a) Since our universe appears to have a positive cosmological constant, and the paper by
Hawking and Ellis assumes the strong energy condition, doubts are raised about the conclusion
of the paper as applied to our universe. However, the theorem is being applied to the early
universe, which was not a vacuum. Both P and ρ were large and positive in the early, radiation-
dominated universe, and therefore the SEC was not violated.
Page 368, problem 8:
(a) The Ricci tensor is Rtt = g 2 e2gz , Rzz = −g 2 . The scalar curvature is 2g 2 , which is
constant, as expected.
(b) Both Gtt and Gzz vanish by a straightforward computation.
(c) The Einstein tensor is Gtt = 0, Gxx = Gyy = g 2 , Gzz = 0. It is unphysical because it has
a zero mass-energy density, but a nonvanishing pressure.
Page 368, problem 9:
This proposal is an ingenious attempt to propose a concrete method for getting around the
fact that in relativity, there is no unique way of defining the relative velocities of objects that
lie at cosmological distances from one another.
Because the Milne universe is a flat spacetime, there is nothing to prevent us from laying
out a chain of arbitrary length. The chain will not, for example, be subject to the kind of tidal
forces that would inevitably break a chain that was lowered through the event horizon of a
black hole. But this only guarantees us that we can have a chain of a certain length as measured
in the chain’s frame. An observer at rest with respect to the chain describes all the links of
the chain as existing simultaneously at a certain set of locations. But this is a description in
(T , R) coordinates. To an observer who prefers the FRW coordinates, the links do not exist
simultaneously at these locations. This observer says that the supposed locations of distant
points on the chain occurred far in the past, and suspects that the chain has broken since then.
The paradox can also be resolved from the point of view of the (T , R) coordinates. The
ä 4π
= − (1 + 3w)ρ
a 3
2
ȧ 8π
= ρ.
a 3
Eliminating ρ, we find
ä
= −β,
ȧ2
where β = (1 + 3w)/2. For a solution of the form a ∝ tδ , calculation of the derivatives results in
δ = 1/(1 + β) = (2/3)/(1 + w). For dust, δ = 2/3, which checks out against the result on p. 347.
For radiation, δ = 1/2. For a cosmological constant, w = −1 gives δ = ∞, so the solution has a
different form.
Problems 401
Page 368, problem 12: The integral is exactly the same as the one in example 22 on p. 348
for the dust case, except that the exponent 2/3 is generalized to δ = (2/3)/(1 + w), as shown
in the solution to problem 11. The result is L/t = 1/(1 − δ) = (w + 1)/(w + 1/3). In the
radiation-dominated case, we have L/t = 2.
Page 368, problem 13: The following Maxima code accomplishes the necessary calculations.
1 /* Kantowski-Sachs spacetime */
2 load(ctensor);
3 ct_coords:[t,theta,phi,z];
4 lg:matrix([1,0,0,0],
5 [0,-1/Lambda,0,0],
6 [0,0,-(1/Lambda)*sin(theta)^2,0],
7 [0,0,0,-exp(2*sqrt(Lambda)*t)])$
8 cmetric();
9 cgeodesic(true);
10 leinstein(true);
11 scurvature();
(a) The geodesic equations output by cgeodesic verify that a world-line of the given form is a
geodesic. Direct application of the metric shows that λ is the proper time.
(b) This follows from the form of the spatial terms of the metric.
(c) The lower-index Einstein tensor calculated by the code above equals Λ multiplied by the
lower-index metric.
(d) The Ricci scalar comes out as claimed.
(e) Our earlier treatment was based on the assumptions of anisotropy and homogeneity. This
spacetime is clearly anisotropic. (The result of part d suggests, as turns out to be the case, that
it is homogeneous.)
Page 384, problem 1: (a) The radiated power is on the order of (G/c5 )(mr2 )2 ω 6 . Taking
the mass to be 10 tons, r = 10 m, we find that the frequency required is on the order of 106
revolutions per minute.
(b) Using the same estimate for the radiated power as in part a, we get about 10−32 W. For the
given excitation energy, this implies a rate of decay by gravitational wave emission of something
like 10−21 s−1 . In competition with a gamma decay having a rate on the order of 1 yr−1 , this
gives a probability of about 10−14 for gravitational decay. This actually doesn’t sound so low
that its detection would be impossible, but we would have to have a case where the extremely
severe selection rule for gamma decay was not matched by an equally strong hindrance of the
gravitational decay.
Page 384, problem 2: (a) The members of the Hulse-Taylor system are spiraling toward one
another as they lose energy to gravitational radiation. If one of them were replaced with a
low-mass test particle, there would be negligible radiation, and the motion would no longer be
a spiral. This is similar to the issues encountered on pp. 39ff because the neutron stars in the
Hulse-Taylor system suffer a back-reaction from their own gravitational radiation.
(b) If this occurred, then the particle’s world-line would be displaced in space relative to a
geodesic of the spacetime that would have existed without the presence of the particle. What
Problems 403
Photo Credits
Cover Galactic center: NASA, ESA, SSC, CXC, and STScI 15 Atomic clock on plane:
Copyright 1971, Associated press, used under U.S. fair use exception to copyright law. 17
Gravity Probe A: I believe this diagram to be public domain, due to its age and the improba-
bility of its copyright having been renewed. 20 Stephen Hawking: unknown NASA photog-
rapher, 1999, public-domain product of NASA. 22 Eotvos: Unknown source. Since Eötvös
died in 1919, the painting itself would be public domain if done from life. Under U.S. law,
this makes photographic reproductions of the painting public domain. 25 Earth: NASA,
Apollo 17. Public domain. 25 Orion: Wikipedia user Mouser, GFDL. 25 M100: European
Southern Observatory, CC-BY-SA. 25 Supercluster: Wikipedia user Azcolvin429, CC-BY-SA.
25 Artificial horizon: NASA, public domain. 26 Upsidasium: Copyright Jay Ward Produc-
tions, used under U.S. fair use exception to copyright law.. 37 Pound and Rebka photo:
Harvard University. I presume this photo to be in the public domain, since it is unlikely to
have had its copyright renewed. 41 Lorentz: Jan Veth (1864-1925), public domain. 60
Muon storage ring at CERN: (c) 1974 by CERN; used here under the U.S. fair use doctrine.
64 Galaxies: Hubble Space Telescope. Hubble material is copyright-free and may be freely
used as in the public domain without fee, on the condition that NASA and ESA is credited
as the source of the material. The material was created for NASA by STScI under Contract
NAS5-26555 and for ESA by the Hubble European Space Agency Information Centre. 68
Gamma-Ray burst: NASA/Swift/Mary Pat Hrybyk-Keith and John Jones. 84 Graph from
Iijima paper: Used here under the U.S. fair use doctrine. 92 Levi-Civita: Believed to be public
domain. Source: http://www-history.mcs.st-and.ac.uk/PictDisplay/Levi-Civita.html.
73 Ring laser gyroscope: Wikimedia commons user Nockson, CC-BY-SA licensed. 95 Ein-
stein’s ring: I have lost the information about the source of the bitmapped image. I would
be grateful to anyone who could put me in touch with the copyright owners. 48 Map of
isotherms: J. Hanns, 1910, public domain. 49 Human arm: Gray’s Anatomy, 1918, public
domain. 119 SU Aurigae’s field lines: P. Petit, GFDL 1.2. 116 Galaxies: Hubble Space
Telescope. Hubble material is copyright-free and may be freely used as in the public domain
without fee, on the condition that NASA and ESA is credited as the source of the material.
The material was created for NASA by STScI under Contract NAS5-26555 and for ESA by
the Hubble European Space Agency Information Centre. 144 Chandrasekhar: University
of Chicago. I believe the use of this photo in this book falls under the fair use exception to
copyright in the U.S. 149 Relativistic jet: Biretta et al., NASA/ESA, public domain. 159
Rocks: Siim Sepp, CC-BY-SA 3.0. 160 Jupiter and comet: Hubble Space Telescope, NASA,
public domain. 161 Earth: NASA, Apollo 17. Public domain. 161 Moon: Luc Vi-
atour, CC-BY-SA 3.0. 162 Heliotrope: ca. 1878, public domain. 162 Triangulation
survey: Otto Lueger, 1904, public domain. 166 Triangle in a space with negative curva-
ture: Wikipedia user Kieff, public domain. 172 Eclipse: Eddington’s original 1919 photo,
public domain. 186 Torsion pendulum: University of Washington Eot-Wash group, http:
//www.npl.washington.edu/eotwash/publications/pdf/lowfrontier2.pdf. 194 Aster-
oids: I believe the use of this photo in this book falls under the fair use exception to copyright
in the U.S. 194 Coffee cup to doughnut: Wikipedia user Kieff, public domain. 213 Coin:
Kurt Wirth, public-domain product of the Swiss government. 216 Bill Unruh: Wikipedia user
Childrenofthedragon, public domain. 239 Accretion disk: Public-domain product of NASA
and ESA. 261 Wilhelm Killing: I believe this to be public domain the US, since Killing died in
early 1923.. 261 Surface of revolution: Shaded rendering by Oleg Alexandrov, public domain.
299 Cavendish experiment: Based on a public-domain drawing by Wikimedia commons user
Problems 405
Index
aberration, 122 singularity, 236
absolute geometry, 19 black string, 253, 283
abstract index notation, 50, 104 Bohr model, 81
equivalent to birdtracks, 50 Bondi, Hermann, 363
acceleration four-vector, 126 boost, 52
action, 62 boundary constructions, 275
Adams, W.S., 16 Brans-Dicke theory, 28, 33, 358
ADM mass, 377 Brown-Bethe scenario, 146
affine geometry, 43 BTZ black hole, 254
affine parameter, 44
Aharonov-Bohm effect, 192 cadabra, 189, 191
angular defect, 163 Cartan, 186
angular momentum, 153, 155 curved-spacetime theory of Newtonian grav-
antigravity, 26, 315 ity, 41, 117, 161
antisymmetrization, 103 Casimir effect, 322
Aristotelian logic, 67 Cauchy horizon, 250
Ashtekar formulation of relativity, 257 Cauchy surface, 276
asymptotic flatness, 149, 279 causal diagram, 271
atlas, 200 causal future, 276
atomic clocks, 15, 73 center of mass-energy, 296, 312
Cerenkov radiation, 143
background independence, 119 Chandrasekhar limit, 144
baryon acoustic oscillations, 355 charge inversion, 108
Bell, John, 65 chart, 200
spaceship paradox, 65, 210 Chiao’s paradox, 30, 33
Big Bang, 331 Christoffel symbol, 176
Big Crunch, 96 chronological future, 276
Big Rip, 350, 352 chronology protection conjecture, 20, 289
birdtracks, 47 clock “postulate”, 83, 124
birdtracks notation, 47 cloning of particles, 215
covariant derivative, 178 closed cosmology, 327
equivalent to abstract index notation, 50 closed set, 210
metric omitted in, 107 closed timelike curves, 13, 20, 288
rank-2 tensor, 106 violate no-cloning theorem, 215
Birkhoff’s theorem, 281, 284, 358 comoving cosmological coordinates, 326
black body spectrum, 216 completeness
black hole, 236 geodesic, 395
definition, 283 Compton scattering, 157
event horizon, 237 conformal cosmological coordinates, 326
formation, 147, 238 conformal flatness, 275
Newtonian, 64 conformal geometry, 336
no-hair theorem, 282 conformal transformation, 207, 274
observational evidence, 239 congruence, 317
radiation from, 250 conical singularity, 246
connection, 91, 205 dark matter, 356
conservation laws, 148 de Sitter spacetime, 342
from Killing vectors, 266 de Sitter, Willem, 81, 326
continuous function, 195 deflection of light, 171, 233
contravariant vector, 101 degeneracy, 254
summarized, 413 Dehn twist, 98
coordinate independence, 98 derivative
coordinate singularity, 236 covariant, 173, 174
coordinate transformation, 98 in electromagnetism, 173
correspondence principle, 33, 35, 56, 218 in relativity, 174
cosmic censorship, 242 deuterium
evidence against, 248 evidence for finite age of universe, 322
cosmic microwave background, 341 test of cosmological models, 332
discovery of, 323 diffeomorphism, 98
isotropy of, 324 Dirac sea, 130, 156
cosmic rays, 16 dominant energy condition, 308
cosmological constant, 63, 320 Doppler shift, 133
no variation of, 321 dual vector, 47
observation, 354 summarized, 413
cosmological coordinates dust, 132
comoving, 326
conformal, 326 Eötvös experiments, 22
standard, 326 Eddington, 171
covariant derivative, 173, 174 Ehrenfest’s paradox, 110
in electromagnetism, 173, 174 Einstein field equation, 295, 320
in relativity, 174 Einstein summation convention, 51
covariant vector, 101 Einstein synchronization, 280
summarized, 413 Einstein tensor, 295
ctensor, 219 Einstein-Cartan theory, 186
curvature, 159 electromagnetic fields
Gaussian, 164 transformation properties of, 157
in two spacelike dimensions, 162 electromagnetic potential four-vector, 137, 141,
intrinsic versus extrinsic, 96 157
Kretschmann invariant, 236 electromagnetic tensor, 141
none in one dimension, 162 electron capture, 145
of spacetime, 87 elliptic geometry, 93
Ricci, 161 energy, see also conservation laws
Ricci scalar, 236 of gravitational fields, 302
Riemann tensor, 168 energy conditions, 307
scalar, 236 violated by cosmological constant, 321
sectional, 161 equiconsistency, 93
tensors, 168 equivalence principle
tidal versus local sources, 160 accelerations and fields equivalent, 24
curvature scalars, 206 application to charged particles, 30, 33
Cvitanović, Predrag, 47 no preferred field, 142
Cygnus X-1, 239 not mathematically well defined, 30, 308
spacetime locally Lorentzian, 28
dark energy, 25 Erlangen program, 109
Index 407
ether, 69 Goudsmit, 81
event horizon, 213 GPS
expansion scalar, 317 frames of reference used in, 113
extra dimensions, 252 timing signals, 141
extrinsic quantity, 96 gravitational constant, 220
gravitational field
Fermat’s principle, 137 uniform, 209, 285, 368
field equation, Einstein, 295 gravitational mass, 21, 299
fine structure constant, 81 active, 299
foliation, 317 passive, 299
force gravitational potential, see potential
is a dual vector, 48 gravitational red-shift, see red-shift
four-vector, 124 gravitational shielding, 315
acceleration, 126 gravitational waves
momentum, 126 empirical evidence for, 372
velocity, 124 energy content, 375
frame dragging, 151, 193, 287 propagation at c, 371
frame of reference propagation at less than c, for high ampli-
inertial, 24 tudes, 371
ambiguity in definition, 29 rate of radiation, 381
frequency vector, 133 transverse nature, 378
Friedmann equations, 329 Gravity Probe A, 17
Friedmann-Robertson-Walker cosmology, 329 Gravity Probe B, 73, 142
Gödel metric, 325 frame dragging, 151
Gödel’s theorem, 93 geodetic effect calculated, 224
Gödel, Kurt, 93 geodetic effect estimated, 170
gauge transformation, 119, 173 group, 108
Gaussian curvature, 164 definition, 108
Gaussian normal coordinates, 164 Hafele-Keating experiment, 15, 73
Gell-Mann, Murray, 130 Hausdorff space, 197
general covariance, 98 Hawking radiation, 250
general relativity Hawking, Stephen, 20
defined, 30 hole
geodesic, 22 scooped out of a cosmological spacetime,
as world-line of a test particle, 22, 384 330
differential equation for, 179 hole argument, 115
stationary action, 62 homeomorphism, 195
geodesic completeness, 395 Hoyle, Fred, 324, 363
geodesic equation, 179 Hubble constant, 318, 329, 347
geodesic incompleteness, 243 Hubble flow, 348
geodetic effect, 170, 224 Hubble, Edwin, 323
geometrized units, 220 Hulse, R.A., 232
geometry Hulse-Taylor pulsar, 232, 383
elliptic, 93 hyperbolic geometry, 166
hyperbolic, 166
spherical, 94 index gymnastics notation, 105
global hyperbolicity, 275 indices
Gold, Thomas, 363 raising and lowering, 105
408 Index
inertial frame, see frame, inertial passive gravitational, 299
inertial mass, 21, 299 mass-energy, 128
information paradox, 215 ADM, 377
inner product, 108 Maxima, 75, 219
intrinsic quantity, 96 Mercury, orbit of, 223, 228
isometry, 108 metric, 100
Ives-Stilwell experiments, 134 none in Galilean spacetime, 101
Michelson-Morley experiment, 69
Jacobian matrix, 152 Milne universe, 332
Minkowski, 41
Kantowski-Sachs metric, 324, 368
model
Kasner metric, 286 mathematical, 93
Killing equation, 263 momentum four-vector, 126
Killing vector, 261 Mossbauer effect, 36
orbit, 261 muon, 16
Kretchmann invariant, 244, 278, 290
Kretschmann invariant, 206, 236 neighborhood, 195
Kreuzer experiment, 299 neutrino, 130
neutron star, 145, 232
large extra dimensions, 252 no-cloning theorem, 215
Lemaı̂tre, Georges, 331 no-hair theorems, 282
Lemaı̂tre-Tolman-Bondi metrics, 248 normal coordinates, 164
length contraction, 55 null energy condition, 308
Lense-Thirring effect, 151, 287 null infinity, 283
Levi-Civita symbol, 153, 185
Levi-Civita, Tullio, 92, 118, 153 observable universe, 348
light size and age, 348
deflection by sun, 171, 233 open cosmology, 327
light clock, 83 open set, 195
light cone, 63 optical effects, 122
lightlike, 63 orbit
logic Killing vector, 261
Aristotelian, 67 orientability, 152
loop quantum gravity, 68 orientable
Lorentz boost, 52 in time, 224
lowering an index, 105 orthogonality, 125
lune, 95
parallel postulate, 18
Mach’s principle, 117, 293, 357 parallel transport, 90, 91
manifold, 194 parity, 108
differentiable, 200 Pasch, Moritz, 19
geodesically complete, 395 patch, 199
smooth, 200 Penrose
mass graphical notation for tensors, 47
active gravitational, 299 Penrose diagram, 271
ADM, 377 Penrose, Roger, 122, 242
defined, 128 Penrose-Hawking singularity theorems, 316, 332
gravitational, 21, 299 Penzias, Arno, 323
inertial, 21, 299 Petrov classification, 378
Index 409
Petrov metric, 287, 290 proportional to area, 84
photon scalar
mass, 131 defined, 46
Pioneer anomaly, 338 scalar curvature, 236
Planck mass, 186 Schwarzschild metric, 223
Planck scale, 186 in d dimensions, 252
Playfair’s axiom, 18 Schwarzschild, Karl, 217
Poincaré group, 108, 400 shielding
polarization gravitational, 315
of gravitational waves, 378 signature
of light, 129 change of, 254
potential, 32 defined as a list of signs, 218
relativistic vs. Newtonian, 284 defined as an integer, 254
Pound-Rebka experiment, 16, 34 singularity, 20, 241
Poynting vector, 311 conical, 246
principal group, 109 coordinate, 236
prior geometry, 119 formal definition, 243
projective geometry, 98 naked, 248
proper distance, 326 timelike, 247
proper time, 123 singularity theorems, 316
pulsar, 145, 232 Sirius B, 16
spacelike, 63
raising an idex, 105
spaceship paradox, 65, 210
rank of a tensor, 102
special relativity
rapidity, 65
defined, 30
red-shift
spherical geometry, 94
cosmological
spherical symmetry, 269
kinematic versus gravitational, 285, 340
spontaneous symmetry breaking, 348
gravitational, 16, 34
standard cosmological coordinates, 326
Ricci curvature, 161
static spacetime, 281
defined, 170
stationary, 278
Ricci scalar, 236
asymptotically, 279
Riemann curvature tensor, 168
stationary action, 62
Riemann tensor
steady-state cosmology, 324, 363
defined, 168
stress-energy tensor, 162, 295
rigid-body rotation, 110
divergence-free, 294
Rindler coordinates, 209
interpretation of, 298
ring laser, 73
of an electromagnetic wave, 309
Robinson
symmetry of, 298
Abraham, 95
Robinson, Abraham, 88 string theory, 186
rotating frame of reference, 109, 286 strong energy condition, 308
rotation surface of last scattering, 323
rigid, 110 Susskind, Leonard, 252
Sylvester’s law of inertia, 255
Sagittarius A*, 239, 282 symmetrization, 103
Sagnac effect, 112, 280 symmetry
defined, 73 spherical, 269
in GPS, 59 symmetry breaking
410 Index
spontaneous, 348 units, 202
synchronization geometrized, 220
Einstein convention, 280 universe
observable, 348
tachyon, 158 size and age, 348
tangent space, see tangent vector upsidasium, 26
tangent vector, 88, 201, 262
Tarski, Alfred, 93 vector
Taub-NUT spacetimes, 197, 246 defined, 46
Taylor, J.H., 232 dual, 47
tensor, 102, 139 Penrose graphical notation, 47
antisymmetric, 103 summarized, 413
Penrose graphical notation, 47 vectors and dual vectors, 101
rank, 102, 203 summarized, 413
symmetric, 103 velocity addition, 65
transformation law, 139 velocity four-vector, 124
tensor density, 152 velocity vector, 124
tensor transformation laws, 138 volume
Terrell, James, 122 spacetime, 155
Thomas precession, 72, 171, 225 volume expansion, 317
Thomas, Llewellyn, 82
time dilation Waage, Harold, 26
gravitational, 15, 33 wavenumber, 133
nonuniform field, 59 waves
kinematic, 15, 55 gravitational, see gravitational waves
time reversal, 108 weak energy condition, 308
of the Schwarzschild metric, 223 weight of a tensor density, 152
symmetry of general relativity, 223 Wheeler, John, 26
time-orientable, 224 white dwarf, 144
timelike, 63 Wilson, Robert, 323
Tolman-Oppenheimer-Volkoff limit, 146 world-line, 21
topology, 194
topology change, 198
torsion, 181
tensor, 184
trace energy condition, 308
transformation laws, 138
transition map, 199
transverse polarization
of gravitational waves, 378
of light, 129
trapped surface, 317
triangle inequality, 108
Type III solution, 378
Type N solution, 378
Uhlenbeck, 81
uniform gravitational field, 209, 285, 368
unitarity, 215
Index 411
Euclidean geometry (page 18):
E3 A unique circle can be constructed given any point as its center and any line segment as
its radius.
E5 Parallel postulate: Given a line and a point not on the line, exactly one line can be drawn
through the point and parallel to the given line.15
O2 Line segments can be extended: given A and B, there is at least one event such that [ABC]
is true.
O4 Betweenness: For any three distinct events A, B, and C lying on the same line, we can
determine whether or not B is between A and C (and by statement 3, this ordering is
unique except for a possible over-all reversal to form [CBA]).
A1 Constructibility of parallelograms: Given any P, Q, and R, there exists S such that [PQRS],
and if P, Q, and R are distinct then S is unique.
A3 Lines parallel to the same line are parallel to one another: If [ABCD] and [ABEF], then
[CDEF].
L1 Spacetime is homogeneous and isotropic. No point has special properties that make it
distinguishable from other points, nor is one direction distinguishable from another.
L2 Inertial frames of reference exist. These are frames in which particles move at constant
velocity if not subject to any forces. We can construct such a frame by using a particular
particle, which is not subject to any forces, as a reference point.
15
This is a form known as Playfair’s axiom, rather than the version of the postulate originally given by Euclid.
412 Index
L3 Equivalence of inertial frames: If a frame is in constant-velocity translational motion
relative to an inertial frame, then it is also an inertial frame. No experiment can distinguish
one inertial frame from another.
L4 Causality: There exist events 1 and 2 such that t1 < t2 in all frames.
L5 Relativity of time: There exist events 1 and 2 and frames of reference (t, x) and (t0 , x0 )
such that t1 < t2 , but t01 > t02 .
Accelerations and gravitational fields are equivalent. There is no experiment that can
distinguish one from the other (page 24).
There is no way to associate a preferred tensor field with spacetime (page 142).
Vectors
Coordinates cannot in general be added on a manifold, so they don’t form a vector space,
but infinitesimal coordinate differences can and do. The vector space in which the coordinate
differences exist is a different space at every point, referred to as the tangent space at that point
(see p. 262).
Vectors are written in abstract index notation with upper indices, xa , and are represented
by column vectors, arrows, or birdtracks with incoming arrows, → x.
Dual vectors, also known as covectors or 1-forms, are written in abstract index notation with
lower indices, xa , and are represented by row vectors, ordered pairs of parallel lines (see p. 48),
or birdtracks with outgoing arrows, ← x.
In concrete-index notation, the xµ are a list of numbers, referred to as the vector’s con-
travariant components, while xµ would be the covariant components of a dual vector.
Fundamentally the distinction between the two types of vectors is defined by the tensor
transformation laws, p. 138. For example, an odometer reading is contravariant because con-
verting it from kilometers to meters increases it. A temperature gradient is covariant because
converting it from degrees/km to degrees/m decreases it.
In the absence of a metric, every physical quantity has a definite vector or dual vector
character. Infinitesimal coordinate differences dxa and velocities dxa / dτ are vectors, while
momentum pa and force Fa are dual (see p. 141). Many ordinary and interesting real-world
systems lack a metric (see p. 49). When a metric is present, we can raise and lower indices at
will. There is a perfect duality symmetry between the two types of vectors, but this symmetry
is broken by the convention that a measurement with a ruler is a ∆xa , not a ∆xa .
For consistency with the transformation laws, differentiation with respect to a quantity flips
the index, e.g., ∂µ = ∂/∂xµ . The operators ∂µ are often used as basis vectors for the tangent
plane. In general, expressing vectors in a basis using the Einstein notation convention results
in an ugly notational clash described on p. 265.
Index 413