The Spin-Vector Calculus of Polarization
The Spin-Vector Calculus of Polarization
2.1 Motivation
The purpose of this calculus is to build a geometric interpretation of polar-
ization transformations. The geometric interpretation of polarization states
was already developed in §1.4. The Jones matrix, while a direct consequence
of Maxwell’s equations when light travels through a medium, is a complex-
valued 2 × 2 matrix. This is hard to visualize. The Mueller matrix, however,
can be visualized as rotations and length-changes in Stokes space. The spin-
vector formalism makes a bilateral connection between the Jones and Mueller
matrices.
Of all the possible Jones matrices, two classes predominate in polarization
optics: the unitary matrix and the Hermitian matrix. The unitary matrix pre-
serves lengths and imparts a rotation in Stokes space. A retardation plate is
described as a unitary matrix. The Hermitian matrix comes from a measure-
ment, such as that of a polarization state. Since all measured values must be
38 2 The Spin-Vector Calculus of Polarization
real quantities, the eigenvalues of a Hermitian matrix are real. The projection
induced by a polarizer is described as a Hermitian matrix.
Based on the characteristic form (1.4.27) on page 19 of the Mueller matrix
for a unitary matrix, defined by U U † = I, one can write
1 0 0 0
0
JU −→ MU = 0
(2.1.1)
R
0
This is indeed the case. Moreover, the Mueller matrix representing passage
of light through any number of retardation plates always keeps the form
of (2.1.1). Rotation matrix R is therefore a group closed under rotation.
Taking the abstraction one step further, any rotation has an axis of ro-
tation and an angle through which the system rotates. Instead of describing
the rotation R as a 3 × 3 matrix, it is more general to describe the rotation
as a vector quantity: R = f (r̂, ϕ), where r̂ is the rotation axis in Stokes space
and ϕ is the angle of rotation. The vector r̂ need not be resolved onto an
orthonormal basis to give r̂ = x̂ rx + ŷ ry + ẑ rz ; this operation may be post-
poned indefinitely. This is in contrast to writing R as a 3 × 3 matrix where
the underlying orthonormal basis is explicit. Accordingly, r̂ exists as a vector
in vector space and can undergo operations such as rotation, inner product,
and cross product with respect to other vectors.
In parallel to the unitary-matrix case, the Mueller matrix that corresponds
to a Hermitian matrix, defined by H = H † , one can write
JH −→ MH =
H̃
(2.1.3)
where ax and ay are the components along an orthogonal basis. The entries
are complex and accordingly there are four independent parameters contained
in (2.2.1). Since the entries are complex, they have magnitude and phase:
|ax |ejφx |a |
|a = = ejθ
x
(2.2.2)
|ay |ejφy |ay |ejφ
where θ is a common phase and φ is the phase difference of the second row.
In the following the explicit magnitude symbols | · | will be dropped and the
intent of magnitude or complex number should be clear from the context. Bra
vector a | is said to be the dual of |a because they are not equal but they
describe the same state:
dual
|a ←−−−→ a |
The bra vector a | corresponding to |a is
a | = a∗x a∗y (2.2.3)
for every |a. The bra vector is the adjoint (†), or complex-conjugate transpose,
of the corresponding ket vector:
†
a | = (|a) (2.2.4)
Bra and ket vectors obey algebraic additive properties of identity, addition,
commutation, and associativity. Identity and addition rules for kets are
Physically, the multiplication of a state vector by a scalar does not change the
state and therefore the two commute. Operations that have no meaning are
2.2 Vectors, Length, and Direction 41
the multiplication of multiple ket vectors or bra vectors. For example, |b |a
is meaningless.
Finally, it should be understood that state vectors a | and |a are a more
general representation than column and row vectors (2.2.1) and (2.2.3). A
state vector is a coordinate-free abstraction that has the properties of length
and direction; a row or column vector is a representation of a state vector
given a choice of an underlying coordinate system.
Bra and ket vectors have properties of length, phase, and pointing direction.
The length of a real-valued vector is a scalar quantity and is determined by
the dot product: |a|2 = a · a. For complex-valued bra-ket vectors, the inner
product is used to find length of a vector and is determined by multiplying its
bra representation a | with its ket representation |a: a2 = a |a, where ·
is the norm of the vector.
More generally, one wants to measure the length of one vector as projected
onto another. The inner product of two different vectors is the product of the
bra form of one vector and the ket for of the other: b |a. For real-valued vec-
tors it is clear that b · a = a · b. However, for bra-ket vectors, having complex
entries, the order of multiplication dictates the sign of the resulting phase.
That is,
so that
ã |ã = 1 (2.2.9)
In the following the tilde over the vectors will be dropped.
42 2 The Spin-Vector Calculus of Polarization
Two vectors are defined as orthogonal to one another when the inner
product vanishes:
b |a = 0 (2.2.10)
This is an essential inner product used regularly.
When two polarization vectors are resolved onto a common coordinate
system,
b |a = b∗x ax + b∗y ay (2.2.11)
Finally, the inner product in matrix representation of a normalized vector is
the sum of the component magnitudes squared:
The inner product measures the length of a vector or the projection of one
vector onto another. The result is a complex scalar quantity. In contrast,
the outer product retains a vector nature while also producing length by
projection. There are two outer product types to study: the projector, having
the form |pp|; and the outer product |pq|. The form |pq| is called a dyadic
pair because the vector pair has neither a dot nor cross product between them.
In quantum mechanics the projector |pp| is called the density operator for
the state.
Consider a projector that operates on ket |a:
The quantity c = p |a is just a complex scalar and commutes with the ket.
Operating on |a the projector measures the length of |a on |p and produces
a new vector |p.
The effect of the projector is to point along the |p direction where the
length of |p is scaled by p |a. Projectors work equally well on bras, e.g.
a |p p | = c∗ p | (2.2.14)
whereas acting on a bra of the same vector, the outer product yields
The resultant pointing direction and projected length depends on whether the
outer product operates on a bra or ket vector.
In the study of polarization, the outer product is a 2 × 2 matrix with
complex entries:
bx a∗x bx a∗y
|ba| = (2.2.18)
by a∗x by a∗y
The determinant is
det (|ba|) = 0 (2.2.19)
and therefore the projector is non-invertible. The determinant of an outer
product of any dimension is likewise zero. That means the action of |ba|
on a ket is irreversible, which is reasonable because the original direction of
the ket is lost. So, while all outer products are operators not all operators
are outer products. Operators that are linear combinations of projectors are
reversible under the right construction.
In summary, the outer product follows these rules:
†
equivalence (|ba|) = |ab|
associative (|ba|) |γ = b | (a |γ)
trace Tr (|ba|) = a |b
irreversible det (|ba|) = 0
where Tr stands for the trace operation. The trace connects the outer product
to the inner product.
When a basis set, or group, is closed, any operation to a member of the group
results in another member within the group. Together, (2.2.20–2.2.21) are the
two conditions that define an orthonormal basis.
Given an orthonormal basis, any arbitrary vector can be resolved onto the
basis using (2.2.21). An arbitrary ket |s is resolved as
where the complex coefficients are given by cn = an |s. The inner prod-
uct s |s is the sum of the absolute-value squares of the coefficients cn :
X |a ←−−−→ a | X †
dual
(2.3.1)
X † is said to be the adjoint operator of X. Care should be taken because the
action of X |a is not the same as a | X; these two results are different.
Operators always act on kets from the left and bras from the right, e.g. X |a
or a | X. The expressions |a X and Xa | are undefined. An operator multi-
plying a ket produces a new ket, and an operator multiplying a bra produces
a new bra. In general, an operator changes the state of the system,
X |a = c |b (2.3.2)
where c is a scaling factor induced solely by X. Operators are said to be equal
if
X |a = Y |a ⇒ X = Y (2.3.3)
Operators obey the following arithmetic properties of addition:
2.3 General Vector Transformations 45
commutative X +Y =Y +X
associative X + (Y + Z) = (X + Y ) + Z
distributive X (|a + |b) = X |a + X |b
Operators in general do not commute under multiplication. That is
XY = Y X (2.3.4)
In matrix form, only when X and Y are diagonal matrices does XY = Y X.
Other multiplicative properties are
The indexing symmetry of (2.3.9) looks like a matrix with am |X| an as
the (m, n) entry. For polarization, the matrix is 2 × 2 and looks like
det(X) = a1 a2 · · · aN (2.4.3a)
Tr(X) = a1 + a2 + · · · + aN (2.4.3b)
Since the eigenvalues of a Hermitian matrix are real, its determinant and trace
are real.
H† = H (2.4.4)
The associated Hermitian matrix in polarization studies has only four inde-
pendent variables: three amplitudes and one phase. This contrasts with the
general Jones matrix (2.3.8) which has eight.
The eigenvectors of H form a complete orthonormal basis and the eigen-
values are real. That the eigenvalues are real is proved from the following
difference:
an H † − H am = (an ∗ − am ) an |am
= 0 (2.4.5)
Non-trivial solutions are found when neither vector is null. The eigenvectors
may be the same or different. Consider first when the eigenvectors are the
same. Since an |an = 0, (a∗n − an ) = 0 and the eigenvalue is real. Consider
when the eigenvectors are different. Unless am = an , in which case the eigen-
vectors are not linearly independent, it must be the case that an |am = 0.
All eigenvalues are therefore real. Hermitian operators H scale its own basis
set:
am H † H an = a2m δm,n (2.4.6)
When det(H) = 0, H is invertible and the action of H on the state of a system
is reversible.
The expansion of H onto its own basis generates a diagonal eigenvalue
matrix. Under construction (2.3.9) the expansion yields
T †T = I (2.4.9)
Acting on its orthogonal eigenvectors |an , the unitary operator preserves the
unity basis length:
am T † T an = δm,n (2.4.10)
Taking the determinant of both sides of (2.4.9) gives det(T † T ) = 1. Since the
determinant of a product is the product of the determinants and the adjoint
operator preserves the norm, the determinant of T must be
U expands on its own basis set in the same way H expands (2.4.7):
Hy = H UyU = +1 =m
+1
<e <e
eig(H) eig(U)
Fig. 2.1. Eigenvalue loci of H and U . Left: eigenvalues of H lie on the real number
line. Right: eigenvalues of U lie on the unit circle in the complex plane.
generates a matrix that is not diagonal. However, the expansion matrix can
be diagonalized by rotating basis |pn into |an . The unitary matrix does this
operation. Taking advantage of U † U = 1, one can write
p |H| p = p U † U HU † U p
= a U HU † a
= a |HT | a (2.4.19)
Since (2.4.19) holds for any choice of initial basis |pn , the operators
HT = U HU † (2.4.20)
and
Tr(HT ) = Tr(U HU † ) (2.4.22)
The trace is always preserved under a similarity transform.
ej(γ+β) = −ej(α+η)
ejα = e−jη (2.4.25)
ejγ = −e−jβ
There are only two independent phases. Combining all of the above restric-
tions, the general matrix form of U is written
ejα cos κ −ejβ sin κ
U = (2.4.26)
e−jβ sin κ e−jα cos κ
There are three independent variables in U : one amplitude and two phases.
The fourth independent variable has been suppressed because of the arbitrary
selection det(U ) = +1. The unitary matrix T includes the common phase:
e cos κ −e sin κ
jα jβ
T = ejφ (2.4.27)
e−jβ sin κ e−jα cos κ
where there are now four independent variables: one amplitude and three
phases.
The Cayley-Klein form of U , using complex entries a and b, is
a b
U = (2.4.28)
−b∗ a∗
Identity UI = U
Closure U1 U2 = U3
Inverse U −1 U = I
Associativity (U1 U2 )U3 = U1 (U2 U3 )
where in all cases U1,2,3 ∈ SU(2). SU(2) is closed under these four operations.
52 2 The Spin-Vector Calculus of Polarization
where Eo is real. There are two polar angles in (2.5.1): χ and φ. The common
phase exp(jθ) is lost on conversion to Stokes space.
There are seven measurements necessary to determine the polarization
ellipse uniquely. The first measurement is for the overall intensity and the
remaining measurements project the ellipse onto six different reference axes.
The formal construction of a projection matrix is necessary at this point.
Consider points along two orthogonal axes and their projection onto a
line L inclined by angle θ that passes through the origin. As illustrated in
Fig. 2.2, the coordinate (1, 0) is projected to point a on line L. The coordinates
of a as measured along the two orthogonal axes are (cos2 θ, sin θ cos θ). After
a similar analysis for the coordinate (0, 1), one can construct the projection
matrix P:
cos2 θ sin θ cos θ
P= (2.5.2)
sin θ cos θ sin2 θ
It is clear that det(P) = 0; P is non-invertable and its action is irreversible.
There is loss of information after projection. Moreover, P 2 = P, so once
the projection is taken, subsequent projections along the same line L do not
change the result.
L
(0, 1) h i
a = cos u cos u
a sin u
u
h i
b
b = sin u cos u
sin u
(1, 0)
Fig. 2.2. Projection of unit coordinates (1, 0) and (0, 1) onto line L, which is inclined
by angle θ and passes through the origin. The projected coordinates are tabulated
on the right. A second projection of a and b onto L does not change the coordinates
of a and b. The projection operator is non-invertable.
s1 = P0 − Pπ/2 (2.5.5)
which makes
0 1
s2 = s | |s (2.5.9)
1 0
The last projection requires the measurement of the ellipse circularity. By
convention, right-hand circular polarization rotates in the counter-clockwise
(ccw) direction when observed along the −ẑ direction (looking into the light).
The right-hand circular polarization vector is
1
|s R = (2.5.10)
j
54 2 The Spin-Vector Calculus of Polarization
The ccw vector needs mapping to the θ = 0 axis; a unitary transform does
the rotation. The right- and left-hand projections are calculated via
PR = s | U † P0 U |s (2.5.11a)
PL = s | U † Pπ/2 U |s (2.5.11b)
s3 = PR − PL (2.5.15)
which makes
0 −j
s3 = s | |s (2.5.16)
j 0
From these seven measurements one can transform from a ket in Jones
space to three Stokes coordinates that lie on the unit sphere:
|s =⇒ ŝ (2.5.17)
The Pauli spin matrices connect Jones to Stokes spaces through the projection
measurements of the preceding section. The identity Pauli matrix is
1 0
σ0 = (2.5.18)
0 1
2.5 Vectors Cast in Jones and Stokes Spaces 55
σk † = σk and σk † σk = I (2.5.20)
The determinants of the spin matrices are −1 and the traces zero:
σk σk = I (2.5.22)
where the indices of the multiplication table (i, j, k) are cyclic permutations
of (1, 2, 3).
Each Stokes coordinate of a polarization state |s is calculated by inserting
the associated Pauli matrix into the inner product s | · | s. The individual
Stokes coordinates are
sk = s |σk | s (2.5.24)
This is shorthand for the projection-difference measurements of (2.5.6, 2.5.9,
2.5.16). Since the spin matrices are Hermitian, the Stokes coordinates sk are
real, signed quantities. Moreover, since det(σk ) = −1 and the Jones vector |s
is assumed to be normalized, sk is bounded by −1 ≤ sk ≤ +1. The proof that
the norm of ŝ is unity, |ŝ| = 1, is shown below.
The Pauli spin vector condenses further the notation of (2.5.24). The spin
vector is defined as
σ1
σ = σ2 (2.5.25)
σ3
1
In physics texts the z direction is denoted by the σ1 spin matrix while here it is
denoted by σ3 . Historically, the Pauli spin matrices describe electron spin, which
is either up or down in the “z” direction. In polarization optics, one usually thinks
of a horizontal polarization state aligned to the “x” axis.
56 2 The Spin-Vector Calculus of Polarization
More concisely,
ŝ = s |σ | s (2.5.27)
This is the most compact way to map Jones vectors to Stokes vectors.
The reciprocal connection is made through an eigenvalue equation whose
parameters are the Stokes vector ŝ and the spin vector. First, observe that the
spin vector behaves both as a 3 × 1 vector and as a 2 × 2 matrix, depending on
the context. Above shows the spin vector acting as a 3×1 vector. Alternatively,
the dot product of ŝ with the spin vector yields
ŝ · σ = s1 σ1 + s2 σ2 + s3 σ3
s1 s2 − js3
= (2.5.28)
s2 + js3 −s1
ŝ · σ in this case is a 2 × 2 Jones matrix and, since the coefficients sk are real,
ŝ · σ is Hermitian: (ŝ · σ ) † = (ŝ · σ ).
Next, recall from §2.2.3 that the trace operation connects the projector
with its inner product: Tr(|ss|) = s |s. Since the trace of each Pauli matrix
is zero it is also true that Tr (ŝ · σ ) = 0. For a normalized state vector such
that s |s = 1, one can construct the projector for ket |s in terms of the spin
vector:
1
|ss| = (I + ŝ · σ ) (2.5.29)
2
Subsequent multiplication on the right by |s generates the eigenvalue equation
This is the most compact way to map Stokes vectors to Jones vectors. The
eigenvector of ŝ · σ associated with eigenvalue +1 generates the Jones vector
|s from Stokes vector ŝ.
Vector operations that include spin-vectors do not yield to the same intu-
ition one is accustomed to with “normal” vectors. For example, while one is
quite familiar with a · (b × a) = 0, since a is orthogonal to b × a, the spin-
vector analogue produces σ · (a × σ ) = −2j(a · σ ). The difference comes from
2.5 Vectors Cast in Jones and Stokes Spaces 57
the cyclic multiplication table for spin-vectors (2.5.22–2.5.23), where the sign
of a product is determined by the order in which the spin-vectors appear.
The purpose of the following identity tabulation is to provide reductions
in the order k of (σ )k . For the following identities, a and b are real-valued 3×1
vectors and σ is the spin vector. Real vectors a and b are not interchangeable
with the spin vector σ .
Identities of order (σ )0 and (σ ):
a · a = a2 (2.5.31)
a · σ = σ · a (2.5.32)
a(a · σ ) = (a · σ )a (2.5.33)
2
Identities of order (σ ) :
σ · σ = 3I (2.5.34)
σ (a · σ ) = aI + ja × σ (2.5.35)
(a · σ )σ = aI − ja × σ (2.5.36)
(a · σ )(a · σ ) = a2 I (2.5.37)
(a · σ )(b · σ ) = (a · b)I + (ja × b) · σ (2.5.38)
[(a · σ ), σ ] = −2ja × σ (2.5.39)
{(a · σ ), σ } = 2a I (2.5.40)
(a · σ ), (b · σ ) = 2(ja × b) · σ (2.5.41)
(a · σ ), (b · σ ) = 2(a · b) I (2.5.42)
Finally, there are identities that relate to inner products taken with various
forms of the spin vector. These identities are as follows:
Substitution of the projector |ss|, (2.5.29), for the innermost term gives
1 1
s2k = s |s + s |σk (ŝ · σ )σk | s (2.5.55)
2 2
The sum of all three terms gives
3 1
s21 + s22 + s23 = s |s + s |σ · ((ŝ · σ )σ )| s (2.5.56)
2 2
The spin-vector identity (2.5.48) simplifies (2.5.56):
3 1
s21 + s22 + s23 = s |s − s |ŝ · σ | s = s |s (2.5.57)
2 2
2.5 Vectors Cast in Jones and Stokes Spaces 59
For every polarization state |s+ there is a unique polarization state |s− such
that s− |s+ = 0. These states |s+ and |s− are orthogonal. Given |s+ how
does one can construct the orthogonal state |s− and its Stokes equivalent?
From (2.5.30) on page 56 one writes
s− |s+ = s− | (ŝ− · σ )† (ŝ+ · σ ) |s+ = 0 (2.5.59)
s− |s+ = (ŝ− · ŝ+ )s− |s+ + js− |(ŝ− × ŝ+ ) · σ | s+ (2.5.60)
As s− |s+ = 0, (2.5.60) requires that (ŝ− × ŝ+ ) = 0. There are two ori-
entations that produce (ŝ− × ŝ+ ) = 0: ŝ− · ŝ+ = ±1. If ŝ− · ŝ+ = +1, then
s− |s+ = 1, contradicting the orthogonality of the two states. Therefore it
must be the case that
ŝ− · ŝ+ = −1 (2.5.61)
The Stokes coordinates for any two orthogonal polarization states are on op-
posite sides of the Poincaré sphere: ŝ− = −ŝ+ . Specifically, a chord that con-
nects any two orthogonal states crosses through the origin of the sphere. The
polarimetric parameters χ and φ are related through the Stokes vectors as
cos 2χ− cos 2χ+
sin 2χ− cos φ− = − sin 2χ+ cos φ+ (2.5.62)
sin 2χ− sin φ− sin 2χ+ sin φ+
a) b) S3
o ^
90 s+
jp- i
S2
a
2a
S1
jp+i
^
s-
Fig. 2.3. Orthogonal polarization states in Jones and Stokes space. a) The hand-
edness of the polarization ellipse is reversed and the major axis is rotated by π/2.
b) Points on opposite sides of the Poincaré sphere are orthogonal.
Equations (2.5.61) and (2.5.63) show that orthogonal polarization states have
opposite handedness and perpendicular orientations of the respective elliptical
major axes. The Jones and Stokes representation of orthogonal polarization
pairs is illustrated in Fig. 2.3.
The inner product magnitude between two polarization states may be calcu-
lated either in Jones or Stokes space. Consider two Jones vectors |p and |q
that are not normalized, and recall that Tr (|p q |) = p |q. In a manner sim-
ilar to (2.5.29), the inner product between the two Jones vectors is written
1
|pp| = (I + p · σ ) p |p (2.5.64)
2
Multiplication on the right by |q and on the left by q |, and some rearrange-
ment, makes
2
|p |q| 1
= (1 + p · q) (2.5.65)
p |p q |q 2
When |p and |q are normalized, the identity reduces to
2 1
|p |q| = (1 + p̂ · q̂) (2.5.66)
2
The magnitude of the inner product in Jones space is derived directly from
the Stokes vectors using this equation. What cannot be discerned, however, is
2.5 Vectors Cast in Jones and Stokes Spaces 61
the phase of the inner product. To recover the phase, p |q must be calculated
explicitly in Jones space. To construct the Jones vectors, one must either solve
the eigenvalue equation (2.5.30) or make the Jones vector (1.4.19) on page 18
for both |p and |q
A general operator can be constructed from the identity matrix and spin-
vector by the form
A = a0 I + a · σ
= a0 I + a1 σ1 + a2 σ2 + a3 σ3
a0 + a1 a2 − ja3
= (2.5.67)
a2 + ja3 a0 − a1
where all ak are complex numbers. This matrix has the eight requisite inde-
pendent variables necessary for a general Jones matrix. The entries in A are
isolated by the trace:
1 1 (2.5.68)
a0 = 2 Tr(A) and a = 2 Tr (Aσ )
H = a0 I + a · σ , ak real (2.5.69)
The determinant is det(H) = a20 − a21 + a22 + a23 . Moreover, when the trace
of H is zero, the Hermitian matrix equals a spin-vector form
HTr=0 = a · σ (2.5.70)
Throughout much of this text, Hermitian operators with zero trace, and op-
erators that preserve that trace, are associated with the spin-vector form.
The general operator A can be decomposed into Hermitian and skew-
Hermitian matrices
A = Hr + jHi (2.5.71)
where the operator K = (jHi ) is skew-Hermitian: K † = −K. The eigenvalues
of skew-Hermitian matrices are purely imaginary. The matrices Hr and Hi
contain the real and imaginary parts of A, respectively. The decomposition
is taken further by separating the finite-trace component from the traceless
components. Writing the complex number a0 as a0 = a0 + ja0 and identifying
each traceless Hermitian matrix with a spin-vector form, one has
One can interpret this operator as that for a partial polarizer: the common
loss is α0 /2 (which is a negative quantity for loss), the maximum and mini-
mum differential losses are 1 ± tanh α/2, and the Stokes direction of partial
polarization is α̂.
The Pauli spin operator for a unitary matrix is constructed by recalling
the connection between Hermitian and unitary operators (2.4.18) on page 49.
For coefficients βk real, the unitary form of M is M = −j(βo I + (β · σ )). The
equivalent to (2.5.76) is
exp −j β · σ /2 = I cos (β/2) − j(β̂ · σ ) sin (α/2) (2.5.78)
matrix, (1.4.22) on page 18 and figuratively (2.1.3) on page 38. Mueller ma-
trices operate on 4 × 1 Stokes vectors to create new, transformed 4 × 1 Stokes
vectors.
The connection between a unitary matrix and an equivalent Stokes matrix
is also made with the Mueller matrix, but as indicated by (2.1.1) on page 38,
only the lower right 3 × 3 sub-matrix R is relevant. Sub-matrix R maps
spherical coordinates (S1 , S2 , S3 ) into new spherical coordinates (S1 , S2 , S3 )
without change of length. Therefore one expects the existence of a rotation
operator R corresponding to matrix R that performs rotations on the Poincaré
sphere. The operator R does indeed exist and its derivation and properties are
so central to the description of retardance that this entire section is devoted
to its understanding.
Operators U and R are equivalent representation of the same transforma-
tion cast in two different vector spaces. The operators are called isomorphic
because they have similar, but not equal, effects. The isomorphism is two-to-
one since, as well be seen, there are two Jones operations that have the same
effect as every one Stokes operation.
Consider equivalent vectors |s and ŝ at the input of a system and equiv-
alent vectors |t and t̂ at the output. In Jones space a unitary transforma-
tion T , corresponding to the underlying Maxwell’s equations in anisotropic
media, links the input and output. In Stokes space the rotation operator R
links the input and output. The parallel transformations are
dual
|t = T |s ←−−−→ t̂ = R ŝ (2.6.1)
Expansion of the Stokes vectors on the right side of (2.6.1) into their corre-
sponding inner products gives the relation between R and T
Since (2.6.3) holds for any |s, the embedded operators must be equal. There-
fore
R σ = U †σ U (2.6.4)
where the common phase of T commutes with σ and is eliminated. Equa-
tion (2.6.4) has an unusual form; the interpretation is
σ1 U † σ1 U
3 × 3 σ2 = U † σ2 U (2.6.5)
†
σ3 U σ3 U
Identity UI = U RI = R
Closure U1 U2 = U3 R1 R2 = R3
−1
Inverse U U =I R−1 R = I
Associativity (U1 U2 )U3 = U1 (U2 U3 ) (R1 R2 )R3 = R1 (R2 R3 )
tj = t |σj | t
= Tr (|tt| σj )
= Tr J |ss| J † σj (2.6.10)
Next, the outer product |ss| is replaced with something close to the spin-
vector form (2.5.29) on page 56. Since the vector is not necessarily normalized,
the expression |ss| = 12 s |s (I + ŝ · σ ) will be used. Thus,
Tr J |ss| J † σj = 1
2 s |s Tr J (I + ŝ · σ ) J † σj
= 1
2 s |s Tr J(ŝ · σ )J † σj
1
3
= Tr Jσk J † σj sk (2.6.11)
2
k=0
where σ has been loosely indexed here to include σ0 , sk = s |s ŝk , and the
common phase in T commutes with (ŝ·σ ) and is eliminated. The last equation
has the matrix multiplication form
3
tj = Mj+1,k+1 sk (2.6.12)
k=0
cos 2κ sin 2κ
α=0 cos κ −j sin κ
U = R2 =
1
β = π/2 −j sin κ cos κ
− sin 2κ cos 2κ
cos 2κ − sin 2κ
cos κ − sin κ
α=β=0 U = R3 = sin 2κ cos 2κ
sin κ cos κ
1
(2.6.15)
det(R) = 1 (2.6.16)
The vector form of R abstracts away any notion of an underlying, fixed coor-
dinate system. Rather, each operation R has its own local coordinate system
based on the eigenvectors and spin direction of R. The vectorial form of R
gives the highest level of geometric interpretation to transformation mechanics
in Stokes space.
68 2 The Spin-Vector Calculus of Polarization
The vector expression for R is derived from the vector form of U . The
operator U is resolved into its eigenvector-based projectors using (2.4.15) on
page 48; the resolution for a two-dimensional system gives
Equation (2.6.19) is now in familiar form and can be mapped to the exponen-
tial equivalent as
U = e−j(ϕ/2)(r̂·
σ) (2.6.20)
where (ϕ/2)(r̂ · σ ) is the Hermitian operator associated with U .
Substitution of (2.6.19) into the equivalence relation (2.6.4), and applying
the spin-vector identities (2.5.35), (2.5.36), and (2.5.49) produces
Since each term on the left- and right-hand sides of (2.6.21) operates on σ ,
one can extract the embedded relation for R
Recalling the vector identity a × a × c = b(a · a) − c(a · b), the last term on
the right-hand side is identified as
Equation (2.6.25) is a beautifully compact expression for the action any uni-
tary operator has on a polarization vector. The vector r̂ points in the direction
of the positive eigenvector of U . The vector operators {(r̂r̂·), (r̂×), (r̂×)(r̂×)}
form a local orthonormal basis. The local basis requires a vector about which
2.6 Equivalent Unitary Transformations 69
a) b)
^
r
^
^ t
r
^^
rr. ^ ^ ^
(rx)(rx) s
^
^ (rx)
s w
Fig. 2.4. Vector components of rotation operator R. a) The local orthonormal basis
{(r̂r̂·), (r̂×), (r̂×)(r̂×)} as resolved on ŝ. b) Transformation to t̂ from ŝ via precession
about r̂, travelling through precession angle ϕ.
the basis can be fully resolved; for instance, operation on state ŝ generates
the basis (r̂, r̂ × ŝ, r̂ × r̂ × ŝ). In the absence of being fully resolved, the local
basis has immutable properties that are independent of the resolving vector.
Figure 2.4(a) illustrates the local basis resolved by ŝ. Vector ŝ in relation
to r̂ defines a precession circle, the circle about which ŝ travels. Local axis (r̂r̂·)
always points parallel to r̂. The local axes (r̂×), (r̂×)(r̂×) define the plane
of the precession circle and are perpendicular to r̂. The local axis (r̂×) is
tangent to the precession circle and (r̂×)(r̂×) points to the origin of the
precession circle. The particular pointing directions of (r̂×) and (r̂×)(r̂×),
while always in the precession plane, are determined only after determination
of ŝ. Figure 2.4(b) illustrates transformation to state t̂ from ŝ about r̂. The
precession angle is ϕ and the precession direction follows the right-hand rule.
Since the motion of precession is so central in the description of polar-
ization transformation mechanics, Fig. 2.5 is included to describe precession
in a local coordinate system. Consider the input state ŝ and the precession
axis r̂. The precession axis can be the birefringent axis of a dielectric medium
or the principal-state-of-polarization axis used to describe polarization-mode
dispersion. In any case, the angle γ separates the two vectors. The motion of
precession is to turn ŝ about r̂ in a circle while keeping the angle γ fixed. This
is the same motion a gyroscope exhibits under gravitational influence. The
angle subtended by projections of states ŝ and t̂ onto the base circle is the
precession angle. The differential equation of motion can be deduced from R
in local-coordinate form. Consider state ŝ that undergoes a small change in
angle δϕ. The motion is
ŝ + δŝ = Rδϕ ŝ (2.6.26)
Taking R in the form (2.6.22) and simplifying for small angles,
ŝ + δŝ = I ŝ + δϕ r̂ × ŝ (2.6.27)
70 2 The Spin-Vector Calculus of Polarization
^
r
^
^
t
s
g g ^
d s = ^r x ^s
dw
ws-t
Fig. 2.5. Precessional motion of ŝ about r̂, passing through state t̂. Angle γ remains
fixed, while angle ϕ, as projected onto the base, is the degree of precession.
g g b
d d
change the state, only a phase is contributed. The state is invariant under U .
For every |r± there are corresponding r̂± vectors in Stokes space. The be-
havior of R† E is to rotate r̂± to ±s1 ; R1 (ϕ) then pirouettes the state about
the s1 axis, and RE returns the state back to r̂± .
The decomposition of U as in (2.6.34) has much significance in relation to
propagation through birefringent media. For example, the propagation con-
stants for ordinary and extraordinary waves in a birefringent medium are
βo = ωno /c and βe = ωne /c. The eigenvalues of the propagation matrix are
exp(∓j(βe − βo )z/2). The polarization transformation in Stokes space is ac-
cordingly
R∆β = RE Rx (∆βz)R† E (2.6.36)
The inner matrix Rx creates precession about the s1 axis in Stokes space while
the Euler rotation and its adjoint transforms the eigenstates of the system
onto the s1 axis and then restores the pointing direction of the eigenstate.
The precession about s1 is transformed to precession about r̂.
a) S3 b) S3
^
r
c
S2 ^ S2
s
a
^
r2 S1 S1
^
r1 ws-t
^
t
w1 b
w2
dŝ
= Ω × ŝ (2.6.39)
dω
The differential precession rule for a single homogeneous birefringent section
is the same whether the position or frequency changes. This simplicity is
quickly broken when two or more homogeneous sections are concatenated.
Birefringent concatenation is in the category of polarization-mode dispersion.
The polarization state evolution through two misaligned birefringent sec-
tions as a function of length can be evaluated using (2.6.25) in the following
way. Since the media are misaligned, their birefringent axes are not parallel;
that is, r̂1 = r̂2 . Figure 2.7(a) illustrates the polarization evolution through
the sections. The input state, arbitrarily selected, is located at position (a).
That state precesses about r̂1 through angle ϕ1 , dictated by the length and
birefringence of the section, as well as the input frequency. The output polar-
ization from the first section is located at position (b). That state then enters
the second section which transforms it about r̂2 . The polarization state now
traces a second circle that is different from the first. The output state is even-
tually located at position (c). The aggregate polarization transformation is
calculated by the concatenation of R2 and R1 . The compounded polarization
transformation is
R2 R1 = (r̂2 r̂2 ·) + sin ϕ2 (r̂2 ×) − cos ϕ2 (r̂2 ×)(r̂2 ×) (2.6.40)
(r̂1 r̂1 ·) + sin ϕ1 (r̂1 ×) − cos ϕ1 (r̂1 ×)(r̂1 ×) (2.6.41)
a) S3 b) ^
so S3
S2 S2
S1 S1
Fig. 2.8. Uniform and biased scattering through operator R. a) Uniform scat-
tering. r̂ points in any direction with equal likelihood. ϕ is uniformly distributed.
b) Biased scattering, a = 0.05. R̃ is constructed along s3 and oriented toward ŝo .
Figure 2.7(b) illustrates the motion. r̂ is derived from the cross of ŝ and t̂.
Angle ϕs−t rotates ŝ through to t̂.
Uniform and biased polarization scattering is useful in connection with
polarization-mode dispersion fiber-modelling calculations. The scattering pro-
cess occurs between any two adjacent birefringent sections and it intended to
model the relative alignments of the respective birefringent axes. A uniform
scattering process sends the polarization state at the output of one section, ŝo ,
to any point on the Poincaré sphere with equal probability. That state, ŝi , is
then input to the next birefringent section. The biased scattering process
weights the scattering along a predetermined direction, often the direction
of ŝo . For either uniform or biased scattering an operator R needs to be con-
structed.
There are two variables contained in R, (2.6.25): pointing direction r̂ and
precession angle ϕ. Direction r̂ itself has two independent variables, the po-
lar angles of declination and azimuth. Combined, R has three independent
variables. The random process is derived using the unit deviate ũ. To have r̂
point in any direction on the unit sphere with equal likelihood, the azimuth
angle φ and position along the s3 axis are both uniformly distributed. Also, the
precession angle is uniformly distributed to generate precessions with equal
2.6 Equivalent Unitary Transformations 75
Relating r̃3 to the polar angle as r̃3 = cos θ, the remaining coordinates are
For a ≤ 1 the bias is toward +s3 and for a ≥ 1 the bias is toward −s3 . The
scattering operator R is now biased toward s3 and is denoted R3 . Before R3
can be applied to ŝo the former needs to be rotated into the latter. Following
the previous example of the shortest distance between two points in Stokes
space, a deterministic operator R3−so is constructed to perform the required
rotation. Operator R3−so needs to be calculated only once. Figure 2.8(b) illus-
trates an output state scattered on the Poincaré sphere and biased toward ŝo .
76 2 The Spin-Vector Calculus of Polarization
cos 2κ sin 2κ
cos κ −j sin κ
U2 = R2 = 1
−j sin κ cos κ
− sin 2κ cos 2κ
cos 2κ − sin 2κ
cos κ − sin κ
U3 = R3 = sin 2κ cos 2κ
sin κ cos κ
1
d |s dŝ
= −j/2(r̂ · σ ) |s = r̂ × ŝ
dϕ dϕ
2.6 Equivalent Unitary Transformations 77
α · σ /2) = eα0 /2
H = exp (α0 /2) exp ( I cosh (α/2) + (α̂ · σ ) sinh (α/2)
· σ /2 = e−j β0 /2 I cos (β/2) − j(β̂ · σ ) sin (β/2)
T = exp (−j β0 /2) exp −j β
78 2 The Spin-Vector Calculus of Polarization
References
1. O. Aso, I. Ohshima, and H. Ogoshi, “Unitary-conserving construction of the Jones
matrix and its applications to polarization-mode dispersion analysis,” Journal of
the Optical Society of America A, vol. 14, no. 8, pp. 1988–2005, Aug. 1997.
2. D. M. Brink and G. R. Satchler, Angular Momentum, 3rd ed. Oxford: Oxford
Science Publications, 1999.
3. N. Frigo, “A generalized geometric representation of coupled mode theory,” IEEE
Journal of Quantum Electronics, vol. QE-22, no. 11, pp. 2131–2140, 1986.
4. N. Gisin and B. Huttner, “Combined effects of polarization mode dispersion and
polarization dependent losses in optical fibers,” Optics Communications, vol. 142,
pp. 119–125, Oct. 1997.
5. J. P. Gordon and H. Kogelnik, “PMD fundamentals: Polarization mode dispersion
in optical fibers,” Proceedings of National Academy of Sciences, vol. 97, no. 9,
pp. 4541–4550, Apr. 2000. [Online]. Available: http://www.pnas.org
6. M. Rose, Elementary Theory of Angular Momentum. New York: Dover Publi-
cations, 1995.
7. J. J. Sakurai, Modern Quantum Mechanics. New York: Addison–Wesley, 1985.
8. G. Strang, Linear Algebra and its Applications, 3rd ed. New York: Harcourt
Brace Jovanovich College Publishers, 1988.
http://www.springer.com/978-0-387-22493-0