Chap 01
Chap 01
Here we use the convention ⌘µ⌫ = ( 1, 1, 1, 1), but the other convention ⌘µ⌫ = (1, 1, 1, 1) is also popular in litera-
ture.
Maxwell’s equations says that the E&M phenomena are described by the speed c of light, and all the known waves al-
ways propagate through a medium. Aether was proposed to be such medium for light propagation, and the light propagates
at c in Aether. However, according to the Galilean transformation, the speed of light will be different, if the laboratory
or the source is moving. The Michelson-Morley experiment in 1895 was devised to detect the Earth motion relative to
Aether by measuring the change in the speed of light.
In 1895, Lorentz, Fitzgerald, Poincaré explained the null results of the Michelson-Morley experiment by adopting
two ad hoc hypotheses: contraction of rigid bodies and time dilation relative to Aether, and Lorentz provided a concise
formula (or the Lorentz transformation) that captures it in a mathematical form. In fact, all the essence of special relativity
is contained in the Lorentz transformation. Einstein in 1905 re-evaluated the idea of spacetime and re-derived the Lorentz
transformation based on two principles, providing its correct physical interpretations and highlighting the limitation of
classical mechanics in terms of simultaneity.
Einstein’s special relativity is based on two principles:
1 laws of physics are the same in any inertial frames (same from old days),
2 the speed of light is the same to all observers in any inertial frames.
t̃ = (t z) , z̃ = (z t) . (1.8)
While in Newtonian mechanics two simultaneous events can be specified only in terms of its spatial separation, the
simultaneity is now a relative concept, and the events are now characterized in terms of its spacetime interval:
s2 := t̃2 + z̃ 2 = t2 + z 2 , (1.9)
invariant under Lorentz transformations. The Lorentz transformation can also be derived by starting with the invariance
of the spacetime interval.
Now consider three inertial frames with two relative velocities 12 and 23 . The relation between one and three frames
should be described by the Lorentz transformation, which leads to
12 + 23
13 = . (1.10)
1+ 12 23
Note that this relation to the lowest order is just an addition of two velocities. Furthermore, the Lorentz transformation
allows only three possibilities, strictly separated by the speed of light = 1: sub-luminal, luminal or super-luminal in all
inertial frames.
• Manipulations of the Lorentz transformation.—: The Minkowski metric is invariant under the Lorentz transformation
(by construction):
where the Lorentz matrix is symmetric. Using the transformation of the inverse Minkowski metric and the identity matrix
⇣ ⌘
µ
⌘ ↵ = ⇤↵ µ ⇤ ⌫ ⌘ µ⌫ , ↵
⌘ ⌘ ⌘ ↵ = ⇤↵ µ ⌘ ⇤ ⌫ ⌘ ⌫µ ⌘ ⇤↵ µ ⇤ 1 , (1.12)
3
AST511 General Relativity JAIYUL YOO
Furthermore, we can also define two variants of the Lorentz matrix, satisfying
⇤ µ := ⌘ ⌫ ⇤⌫ µ , ⇤ ⇢
:= ⇤ µ ⌘ µ⇢ , ⇤ ⇢⇤ ⌫ ⌘ ⇢
⌫ , (1.15)
to find that
) (⇤ 1 µ
) ⌫ ⌘ ⇤⌫ µ . (1.16)
Note that ⇤µ⌫ is not symmetric, while ⇤µ ⌫ is symmetric; the Lorentz matrix is not orthogonal or unitary:
⇤ = ⇤t , orthogonal : ⇤ 1
6= ⇤ , unitary : ⇤ 1
6= ⇤† . (1.17)
Unitary matrices preserve the amplitude of complex vectors, and real unitary matrices are orthogonal.
Physical phenomena are described by tensors (vectors and scalars), and the relation between two frames with relative
velocity is described by the Lorentz transformation ⇤:
p̃µ = ⇤µ ⌫ p⌫ . (1.18)
A co-vector pµ can be obtained by lowering the index with ⌘µ⌫ , and hence it transforms with the inverse Lorentz matrix:
The notation convention is constructed to facilitate the manipulation. Note that only the four vectors and tensors transform
with the Lorentz matrix; one cannot apply the Lorentz transformation to three spatial vectors.
The length of the stick is smaller than that in the rest frame. This is not an illusory effect, but a real effect due to different
simultaneity.
• Time dilation.— Consider instead a clock at rest in a tilde coordinate, and emits signals at t̃1 and t̃2 to indicate that the
clock is ticking:
t1 = (t̃1 + z̃0 ) , t2 = (t̃2 + z̃0 ) . (1.22)
The proper time interval dtp := t̃2 t̃1 is the time interval measured by the observer in the rest frame with the clock. The
observer in a un-tilde coordinate would then see the clock tick slower as
Again, the time dilation is not an illusory effect. For example, muons decay with half-life ⌧ ⇡ 2.2 µ sec. When accelerated
to 1 in particle accelerators, the change in their half-life can be measured. Also, muons are produced at the
atmosphere from a collision with high-energy cosmic rays. With 2.2 µ sec, muons can only travel 0.6 km, even at
the speed of light, but those muons created in the atmosphere hit the Earth surface all the time, due to time dilation.
4
AST511 General Relativity JAIYUL YOO
In the same way, we consider an acceleration ai = dv i /dt of a particle, measured by two observers with relative
velocity . In the same way we derived the transformation of the velocity, we first compute
fz = 1
dv
dvz
, fx =
dv
dvx
+
vx dvz
, (1.30)
2 (1 vz )2 (1 vz ) (1 vz )2
and then obtain the transformation of three-acceleration
az ax v x az
ãz = 3 (1
, ãx = + . (1.31)
vz )3 2 (1 vz )2 2 (1 vz )3
Similarly, the relative velocity between two observers affects not only the parallel component az , but also the perpendicular
component ax , ay . One can also compute the four acceleration: aµ := duµ /d⌧ .
In the Newtonian dynamics (or Galilean transformation), the position and the velocity are relative, while the time and
the acceleration are absolute (or identical) in all inertial frames. In special relativity, the time (or simultaneity) is now
relative, but the acceleration is still absolute in all inertial frames, while its value is not invariant for all inertial observers,
i.e., non-vanishing accelerations are observed by all inertial observers, which is not true in general relativity.
5
AST511 General Relativity JAIYUL YOO
At a moment later, the rocket moves ahead of the observer, so we have to consider another observer, who is initially
moving faster and ahead of the object. The object will soon catch up this observer, when their velocities are identical and
this observer will again measure the acceleration. Note that with ã constant, the acceleration a in an un-tilde coordinate
continuously decreases as the speed of the object increases, and the object will never reach the speed of light. Note that
the acceleration is not bounded, while the speed is. The acceleration is a dimension of inverse length, such that when
the acceleration is stronger (or the length of interest is larger than the scale of acceleration), (general) relativistic effects
become more important (like curvature). For an earth gravity, the length scale of the acceleration is a light year. In
comparison, a typical acceleration in nucleus is a ⇠ v 2 /r ⇠ 1028 earth gravity. Gravity is very weak on Earth!
Now we derive the trajectory x(t) of the uniformly accelerating object. First, consider the identities
!
d d 1 d
= p = 3 , v= , (1.33)
dt dt 1 2 (t) dt
hence we derive
d( ) d d d 2 2 d
= + = +1 = 3 = ã . (1.34)
dt dt dt dt dt
Given the constant acceleration ã along the trajectory, we can integrate the above equation to obtain the solution for the
velocity (t) as
ã t ã t
= , ) (t) = p 1, (1.35)
( ) 1 + ã2 t2
and one more integration of the solution yields the trajectory of the uniformly accelerating object with boundary condition
= 0 at t = 0:
1
x2 t2 = 2 := L2 . (1.36)
ã
The uniformly accelerating object follows a hyperbolic trajectory i.e., it starts at x = 1 in the past t = 1, moving
almost at the speed of light toward x = 0, but with constant acceleration ã toward positive direction (or slowing down);
it reaches the closest point x = L at t = 0 and the velocity at the moment is zero; it then turns around and moves to
x = 1 again with increasing velocity. The trajectories are bounded by two light cones that pass through x = t = 0, such
that anything in the upper quadrant cannot be seen for those objects, acting as a horizon. This trajectory of a uniformly
accelerating object is used to describe Rindler coordinates.
⇤! := ⇤0 0 ⇤ 0 i ni = (1 + · n) . (1.39)
6
AST511 General Relativity JAIYUL YOO
where two frames are moving with relative velocity . The observed angle n in one frame is then related to the observed
angle ñ in the other frame. Note that some formulas are expressed in terms of the propagation direction n, instead of
the observed direction.
As an example, consider a source moving approaching the observer along z-direction with photons emitted with an
angle ✓. For convenience, the emission direction is n = (sin ✓, 0, cos ✓) confined in x-z plane ( ⌘ 0). In the tilde
coordinate, the observer frame is moving with = ẑ against the source and measures
cos ✓ + sin ✓
⇤! = (1 + cos ✓) , cos ✓˜ = , sin ✓˜ = , (1.42)
1 cos ✓ (1 cos ✓)
where in our notation ✓˜ is the angle the observer measures (not the observed propagation direction). For ✓ = 0,
s
1+
⇤! = ⇡ 1 + + O( 2 ) , ✓˜ = ⇡ , (1.43)
1
which reduces to the classical Doppler effect in the non-relativistic limit. The frequency is blueshift (✓˜ is the observed
direction). For ✓ = ⇡/2, we obtain
1
⇤! = , cos ✓˜ = , sin ✓˜ = . (1.44)
If the source is moving relativistically 1, the observed angle becomes narrow, called relativistic beaming:
˜ ' 1
|✓| , (1.45)
beyond which the object is hardly visible, while highly boosted along the direction. If the source is moving away from the
observer, the only difference is that we have to flip the emission direction. Now consider a source moving perpendicular
to the light propagation direction:
1
⇤! = = p ⇡ 1 + O( 2 ) , (1.46)
1 2
and this purely relativistic effect is called the transverse Doppler effect.
Further details in astrophysical applications of the relativistic effects can be found in Rybicki and Lightman (1979).
• Perpendicular components.— Now we define a projection tensor P ij = ij ni nj with ni and consider how the
perpendicular components of a vector Aµ orthogonal to a null vector k µ transform. Given the constraint 0 = Aµ k µ , we
can write
Aµ =: Ak , Ai? + ni Ak , Ai? = P ij Aj , Ai? ni = 0 , (1.47)
and in the tilde coordinate the expression takes the same form. From the transformation law
⌘µ⌫ = ⌘⇢ ⇤⇢ µ ⇤ ⌫ = ⇤0 µ ⇤0 ⌫ + ⇤ i µ ⇤i ⌫ , (1.48)
we can derive useful relations
2
⇤i 0 ⇤i 0 = 1 + ⇤0 0 = 1+ 2 2
, ⇤ j 0 ⇤ j i = ⇤0 0 ⇤ 0 i = 2
i , (1.49)
k k 0 0 2
⇤ i⇤ j = ij + ⇤ i⇤ j = ij + i j . (1.50)
Under the transformation, the perpendicular components transform as
Ãi? = (⇤? )i j Aj? , (1.51)
with
⇤i 0 ⇤ i k nk 0 ⇣ ⌘
(⇤? )i j := ⇤i j ⇤ j = ⇤i j ⇤k 1 ⇤i 0 ⇤ i k nk ⇤ 0 j , (1.52)
⇤0 0 ⇤ 0 l nl
and we can show that
i
X̃? Ỹ?i = (⇤? )i k X?
k
(⇤? )i l Y?l = X?
i
Y?i , (1.53)
where we need to replace all the summed ⇤’s, e.g., µ ⌫ by using the above relations and use the orthogonality 0 =
⇤i ⇤i
i n = Y i n . The derivation is relatively straightforward. The inner product of X is invariant under the transformation.
X? i ? i ?
7
AST511 General Relativity JAIYUL YOO
• Bell’s spaceship paradox.— Originally by Dewan & Beran, and later modified by Bell: In the inertial frame S, there
are two spaceships with separation d at rest, but they are connected by a string. They both accelerate equally at the same
time to some speed in S, hence they maintain the same speed and the same separation at each moment with respect to
the frame S. Therefore, the string should remain unaffected, which is the ordinary point of view. However, due to length
contraction in special relativity, all lengths contract equally, i.e., two spaceships are smaller, and the string is smaller. That
is the paradox. In this lab frame, the string is torn apart due to length contraction. In the rest frame of first spaceship with
the final speed, two spaceships did not accelerate at the same time. The first one accelerates first, and then the second one
later. Hence the distance between two spaceships increased, breaking apart the string. In this frame, the string is broken
by external forces.
8
AST511 General Relativity JAIYUL YOO
of the relativistic mass and the momentum, we demand the relativistic mass terms satisfy
which yields
pi
= i, p0 = m=E, m2 = ⌘µ⌫ pµ p⌫ . (1.63)
E
The Newton’s force-law should then be
d2 xµ
m = fµ , (1.64)
d⌧ 2
which becomes in the rest frame
d2 ⇠ µ µ 0 i
m = frest , frest =0, frest , (1.65)
dt2
with f~rest being the ordinary forces (three vector) in Newtonian dynamics, where we use ⇠ µ to represent a coordinate
where the particle is at rest. Given a force in the rest frame such as E&M force, the four force f µ can then be obtained by
Lorentz transformation:
· frest
f µ = ⇤µ ⌫ frest
⌫
, f i = frest
i
+( 1) i
2
, f0 = · frest = i i
f . (1.66)
9
AST511 General Relativity JAIYUL YOO
T 00 = ⇢ , T 0i = 0 , T ij = p ij
, (1.67)
where no summation in the second equality over µ, ⌫. Mind that T µ ⌫ is not symmetric over indices.
Using the energy-momentum tensor, we can define the angular momentum tensor J µ⌫ and the spin four-vector Sµ :
Z
⇢µ⌫ µ ⌫⇢ ⌫ µ⇢ 1
M := x T x T , J := d3 x M 0µ⌫ = J ⌫µ ,
µ⌫
Sµ := "µ⌫⇢ J ⌫⇢ u , (1.72)
2
and the conservation equation guarantees that the angular momentum tensor is constant in time:
@⇢ M ⇢µ⌫ = @⌫ T µ⌫ = 0 . (1.73)
The spin vector is the intrinsic angular momentum that does not vanish in the rest frame:
1.4 Electromagnetism
Here we provide a concise review of E&M in view of special relativity.
qq 0 q
Coulomb : F1 =: k1 , E := k1 2 , (1.75)
r2 r
dF2 II 0 dq k1
Ampere : =: 2k2 , I := , = c2 , (1.76)
dl r dt k2
@B 2k2 I
Faraday : r ⇥ E + k3 =0, B := , (1.77)
@t k3 r
10
AST511 General Relativity JAIYUL YOO
where three dimensionful proportionality constants k1 , k2 , k3 are left unspecified, q, q 0 are electric charges, and I, I 0 are
electric currents (I = dq/dt). The standard units are
1 1 10
cgs units : k1 = 1 , k2 = , k3 = , q = 4.803 ⇥ 10 esu , (1.78)
c2 c
1 7 2 µ0 7
MKS units : k1 = = 10 c , k2 = = 10 , k3 = 1 , q = 1 coulomb ,
4⇡✏0 4⇡
where cgs units are sometimes called Gaussian cgs units. The Maxwell’s equations are
1 @E 4⇡k2 @B
r · E = 4⇡k1 ⇢ , r·B = 0, r⇥B = J, r ⇥ E + k3 = 0 . (1.79)
k3 c2 @t k3 @t
In this note, we use the cgs units, and the Maxwell’s equations in the cgs units become dynamical equations with
source
1 @E 4⇡
r · E = 4⇡⇢ , r⇥B = J, (1.80)
c @t c
and the source-free equations
1 @B
r·B=0, r⇥E+ =0, (1.81)
c @t
in conjunction to the continuity equation and the Lorentz force
@⇢ ⇣ v ⌘
+r·J=0, F=q E+ ⇥B . (1.82)
@t c
The macroscopic field variables are
Using the above relations of Aµ to the electric and the magnetic fields, we derive
1 1
E= F0i = F̃0i , B = ✏ijk F jk = ✏ijk F̃ jk , (1.88)
2 2
where for E and B we do not distinguish their components with upper index or lower index, as it is defined in the
Euclidean space, and note that the Levi-Civita symbols in 3D satisfy
11
AST511 General Relativity JAIYUL YOO
• Action.— The Lagrangian for the E&M is and the Noether current (under spacetime translation invariance) is
✓ ◆
1 @L 1
L= Fµ⌫ F µ⌫ + J µ Aµ , tµ⌫ = ⌘µ @⌫ A⇢ + ⌘µ⌫ L = Fµ⇢ @⌫ A⇢ ⌘µ⌫ F⇢ F ⇢ . (1.99)
4 @(@ A⇢ ) 4
However, the Noether current is anti-symmetric, nor gauge-invariant, such that the correct energy-momentum tensor is2
0 1 2 2
1
2 E +B (E ⇥ B)i
i 1
Tµ⌫ = tµ⌫ ( 2iFµ⇢ @ ⇢ A⌫ ) = Fµ⇢ F⌫ ⇢ ⌘µ⌫ F⇢ F ⇢ = @ A , (1.101)
2 4
(E ⇥ B)i ij
2
The correct energy-momentum tensor is obtained by the Belinfante-Rosenfeld procedure:
i
Tµ⌫ = tµ⌫ @ (Sµ⌫ + S µ⌫ S⌫ µ ) , (1.100)
2
where Sµ⌫ is the contribution of the intrinsic angular momentum (see EnM.2016.pdf or Maggiore).
12
AST511 General Relativity JAIYUL YOO
where T00 is the energy density, T0i is the Poynting vector, the spatial components ij are the momentum density, and we
used
1 1 1
⌘µ⌫ F⇢ F ⇢ = ⌘µ⌫ B 2 E 2 , ij ⌘
2
ij E + B
2
Ei Ej B i B j . (1.102)
4 2 2
Note that the energy-momentum tensor of E&M is not conserved
µ⌫
@⌫ TEM = F µ⌫ J ⌫ , (1.103)
but in the presence of E&M (or external force other than gravity) the energy-momentum tensor of (charged) particles is
subject to
@⌫ T µ⌫ = F µ ⌫ J ⌫ , (1.104)
so that the total energy-momentum tensor is conserved.
• Gauge choice.— Finally, in terms of the field strength tensor, the Lorentz force becomes
F = q (E + v ⇥ B) 7 ! F µ = qF µ⌫ u⌫ . (1.105)
The electric and the magnetic fields are invariant under the gauge transformation of the four vector
@
7! , A 7! A + r . (1.106)
@t
Popular choices of gauge condition are as follows:
@
Lorenz : 0 = @µ Aµ = +r·A ! r2 @t2 = @ µ @µ = ⇤ = 4⇡⇢ , (1.107)
@t
Coulomb : 0 = r · A ! r2 = 4⇡⇢ , (1.108)
13