ABSTALG
ABSTALG
Y OTSANAN M EEMARK
Meemark, Yotsanan
Abstract Algebra / Yotsanan Meemark – 1st ed. Bangkok:
Danex Intercorporation Co., Ltd., 2014. 195pp.
ISBN 978-616-361-389-9
This book is written based on two graduate abstract algebra courses offered at Department of
Mathematics and Computer Science, Faculty of Science, Chulalongkorn University. It grows out
of my lecture notes that I used while I was teaching those courses many times. My intention is
to develop essential topics in algebra that can be used in research as illustrated some in the final
chapter. Also, it can be served as a standard reference for preparing for a qualifying examination
in Algebra. I have tried to make it self-contained as much as possible. However, it may not be
suitable for reading it for the first course in abstract algebra. It hits and goes through many basic
points quickly. A typically mathematical book style that begins with some motivation, definitions,
examples and theorems, is used throughout. I try to pause with remarks to make readers have
some thoughts before moving on.
The book also requires some background in undergraduate level linear algebra and elementary
number theory. For example, I assume the readers to have known matrix theory over a field in
which treatment can be found in most linear algebra books. My number theory lecture note is
available on the web-page as well. I give many examples to demonstrate new definitions and
theorems. In addition, when the converse of a theorem may not hold, counter examples are
provided. The major points are divided into six chapters as follows.
1 Groups A group is a basic algebraic structure but it is a core in this course. I choose the
approach via group actions. Although it is not quite elementary, it is an important aspect in
dealing with groups. I also cover Sylow theorems with some applications on finite groups.
The structure theorem of finite abelian groups is also presented.
2 Rings and Fields The abstract treatments of rings and fields using groups are presented
in the first section. Rings discussed throughout this book always contain the identity. Ideals
and factorizations are discussed in detail. In addition, I talk about polynomials over a ring
and which will be used in a construction of field extensions.
3 Advanced Group Theory In this chapter, I give deeper theory of groups. Various kinds
of series of a group are studied in the first three sections. I also have results on a linear
group. Finally, I show how to construct a group from a set of objects and presentations.
4 Modules and Noetherian Rings Modules can be considered as a generalization of vector
spaces. I cover basic concepts of modules and work on free modules. Projective and injective
modules are introduced. Moreover, I present the proof of the structure theorems for modules
over a PID. In the end, Noetherian and Artinion rings are explored.
5 Field Theory I give more details on a construction of extension fields. Also, I prepare the
readers to Galois theory. Applications of Galois theory are provided in proving fundamental
theorem of algebra, finite fields, and cyclotomic fields. For the sake of completeness, I
discuss some results on a transcendental extension in the final section.
6 Applications I demonstrate some aspects in doing research in algebra. In the first section,
the readers will see some applications of module theory, especially a free R-module over
commutative rings, to obtain a structure theorem for finite dimensional symplectic spaces
over a local ring. The symplectic graphs over a commutative ring is defined and studied.
Although the problems seem combinatorial, it requires many algebraic topics presented
throughout this book.
i
The whole book is designed for a year course. Chapters 1 and 2 are appropriate for a first
course and Chapters 3, 4 and 5 can be served as a more advanced course. Chapter 6 can be
assigned as optional reading.
The book would not have been possible without great lectures from my abstract algebra
teachers—Ajchara Harnchoowong and Yupaporn Kemprasit at Chulalongkorn University, and Ed-
ward Formanek at the Pennsylvania State University. They initiate wonderful resources to com-
pose each section in this book. I express my gratitude to them all.
I take full responsibility for typos/mistakes that may be found in the manuscript. If you catch
ones or have any other suggestions, please write to me. I shall include and correct them in the
more up-to-date version once a year on the website. Finally, I hope that the textbook will benefit
many students, teachers and researchers in Algebra and Number Theory.
Yotsanan Meemark
Bangkok, Thailand
ii
Contents
Foreword i
Contents iii
1 Groups 1
1.1 Definitions and Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1.1 Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1.2 Subgroups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.1.3 Homomorphisms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.2 Group Actions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.3 Quotient Groups and Cyclic Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.3.1 Quotient Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.3.2 Cyclic Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.4 The Symmetric Group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
1.5 Sylow Theorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
1.5.1 Sylow p-subgroups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
1.5.2 Applications of Sylow Theorems . . . . . . . . . . . . . . . . . . . . . . . . 26
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
1.6 Finite Abelian Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
iii
2.5.2 Factorizations in Polynomial Rings . . . . . . . . . . . . . . . . . . . . . . . 63
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
2.6 Field Extensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
2.6.1 Algebraic and Transcendental Extensions . . . . . . . . . . . . . . . . . . . . 68
2.6.2 More on Roots of Polynomials . . . . . . . . . . . . . . . . . . . . . . . . . . 71
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
iv
6 Applications 169
6.1 Symplectic Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169
6.2 Symplectic Graphs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
6.3 Vertex and Arc Transitivities of Symplectic Graphs . . . . . . . . . . . . . . . . . . . 176
6.4 Chromatic Number of Symplectic Graphs . . . . . . . . . . . . . . . . . . . . . . . . 178
Bibliography 183
Index 185
v
This page intentionally left blank
1 | Groups
We write N for the set of positive integers, Z for the set of integers, Q for the set of rational
numbers, R for the set of real numbers and C for the set of complex numbers.
In this first chapter, we talk about a group which is a basic algebraic structure. However, it
is a core in this course. Our approach here relies on group actions. Although it is not quite
elementary, it is an important aspect in dealing with groups. We also discuss Sylow theorems
with some applications and the structure of finite abelian groups.
We may write S for (S, ·) and ab for a · b where a, b ∈ S if there is no ambiguity. Let S be a
groupoid. For nonempty subsets A and B of S and x ∈ S, let
Definition. If S satisfies the associative law, i.e., ∀a, b, c ∈ S, (a · b) · c = a · (b · c), we say that
S is a semigroup.
Notice that if S is a semigroup, then any bracketing of x1 , . . . , xn gives the same product, so
we can write x1 · · · xn for this product. In addition, for a ∈ S and m ∈ N, we may let am = a · · · a
(m copies).
Clearly, S contains at most one identity (if e and e′ are identity, then e = ee′ = e).
Definition. A monoid is a semigroup with (unique) identity. Let S be a monoid with identity e.
If a and b in S are such that ab = e = ba, then b is called a two-sided inverse or inverse of a.
We have that every element of S has at most one inverse. For, if b and b′ are inverses of a, then
ab = e = ba and ab′ = e = b′ a, so b = be = b(ab′ ) = (ba)b′ = eb′ = b′ .
1
2 1. Groups
Definition. A group is a monoid G such that every element of G has an inverse, and for a ∈ G,
let a−1 denote the (unique) inverse of a.
Remark. For a nonempty set G with binary operation on G is a group if the following axioms are
all satisfied:
(G1) [associativity] ∀a, b, c ∈ G, (a · b) · c = a · (b · c)
(G2) [identity] ∃e ∈ G, ∀a ∈ S, ae = a = ea
(G3) [inverse] ∀a ∈ G, ∃b ∈ G, ab = e = ba.
Let G be a group with identity e. For a ∈ G and m ∈ N, let a0 = e and a−m = (a−1 )m .
Remarks. 1. For a group G, we have:
(a) e−1 = e and ∀a ∈ G, (a−1 )−1 = a,
(b) ∀a ∈ G, ∀m, n ∈ Z, am an = am+n and (am )n = amn , and
(c) ∀a, b ∈ G, (ab)−1 = b−1 a−1 because (ab)(b−1 a−1 ) = e.
2. In case G is abelian, we may choose to write G additively. This means:
(a) The binary operation is denoted by +.
(b) 0 denotes the identity element and −a denotes the inverse of a.
(c) ∀a ∈ G, ∀m ∈ N, ma = a + · · · + a (m copies).
3. A group G satisfies the cancellation law: ∀a, b, c ∈ G, ab = ac (or ba = ca) ⇒ b = c.
Examples 1.1.1. 1. (Z, −) is a groupoid which is not a semigroup; (N, +) is a semigroup
which is not a monoid; (N, ·) is a monoid which is not a group.
2. (Z, +), (Q, +), (R, +), (C, +), (Q∗ , ·), (R∗ , ·) and (C∗ , ·) are infinite abelian groups. Here, A∗
denotes the set of nonzero elements of A.
3. Let X be a set and P (X) the power set of X. For subsets A and B of X, we define A△B =
(A r B) ∪ (B r A). Then (P (X), △) is an abelian group having the empty set as its identity
and A−1 = A for all A ∈ P (X). Also, (P (X), ∩) is a commutative monoid with identity X.
4. For n ∈ N, let Zn = {0, 1, . . . , n − 1} called the set of integers modulo n, where a =
{kn + a : k ∈ Z} for all a ∈ Z. Define + and · on Zn by
7. For a nonempty set X, a function on X which is 1-1 and onto (a bijection on X) is said to be
a permutation of X. Let S(X) be the set of all permutations of X. Then under composition,
(S(X), ◦) is a group called the symmetric group on X; in case X = {1, 2, . . . , n}, we write
Sn and call Sn the symmetric group on n letters. It is a group of order n!.
Theorem 1.1.1. Let G be a semigroup. Then the following statements are equivalent.
(i) G is a group.
(ii) (a) ∃e ∈ G ∀a ∈ G, ea = a and (b) ∀a ∈ G ∃b ∈ G, ba = e.
(iii) (a) ∃e ∈ G ∀a ∈ G, ae = a and (b) ∀a ∈ G ∃b ∈ G, ab = e.
(iv) ∀a, b ∈ G ∃x, y ∈ G, ax = b and ya = b.
(v) ∀a ∈ G, aG = G = Ga.
Proof. If (i) holds, (ii)–(v) are clearly true. (iv) ⇔ (v) is obvious.
(ii) ⇒ (i). Assume (ii). Let a ∈ G. Then ∃b ∈ G, ba = e, and so ∃c ∈ G, cb = e. Thus,
Proof. We shall show that ∀a ∈ G, aG = G = Ga. Let a ∈ G. Since G is cancellative, |aG| = |G| =
|Ga|. Clearly, aG ⊆ G and Ga ⊆ G. Since G is finite, aG = G = Ga.
1.1.2 Subgroups
Sometimes a group contains a nonempty subset that is closed under its operation. In this subsec-
tion, we discuss a small group in a bigger one with the same operation.
Corollary 1.1.3. If H is a finite nonempty subset of a group G which is closed under the operation
of G, then H is a subgroup of G.
Next, we investigate the group of symmetries. We begin with the following groups.
4 1. Groups
Examples 1.1.2. 1. The set of rotations about a point 0 in the plane; composition as usual. If 0
is taken to be the origin, the rotation through an angle θ can be represented analytically as
the map
cos θ sin θ
(x, y) 7→ (x cos θ − y sin θ, x sin θ + y cos θ) = x y .
− sin θ cos θ
For θ = 0, we get the identity map and the inverse of the rotation through the angle θ is the
rotation through −θ. It is called the rotation group.
2. The set of rotations together with the set of reflections in the lines which passes through 0 with
slope tan α. The latter are given analytically by
cos 2α sin 2α
(x, y) 7→ (x cos 2α + y sin 2α, x sin 2α − y cos 2α) = x y .
sin 2α − cos 2α
The product of two reflections is a rotation and the product in either order of a reflection
and a rotation is a reflection.
3. Consider the regular n-gon (that is, the n-sided polygon in which the sides are all the same
length and are symmetrically placed about a common center) inscribed in the unit circle
in the plane, so that one of the vertices is (1, 0). The vertices subtend angles of 0, 2π/n,
4π/n, . . . , 2(n − 1)π/n radians with the positive x-axis. The subset of rotation maps which
maps our figure into itself consists of the n rotations through angles of 0, 2π/n, 4π/n, . . . , 2(n−
1)π/n radians, respectively. These elements form a subgroup Rn of the rotation group de-
fined in (1).
4. We now consider the set Dn of rotations and reflections which map the regular n-gon, as in
(3), into itself. These form a subgroup of the group defined in (2). We shall call the elements
of this group the symmetries of the regular n-gon. The reflection in the x-axis is one of our
symmetries. Multiplying this on the right by the n rotational symmetries we obtain n distinct
reflectional symmetries. These give them all, for if we let σ denote the reflection in the x-
axis and τ denote any reflectional symmetry then στ is one of the n-rotational symmetries
ρ1 , . . . , ρn , say ρi . Since σ 2 = 1, στ = ρj gives τ = σρj which is one of those we counted.
Thus, Dn consists of n rotations and n reflections and its order is 2n. The group Dn is called
the dihedral group. Note that Dn = {σ i ρj : i ∈ {0, 1} and j ∈ {0, 1, 2, . . . , n − 1}}.
Remark. It is easy to see that the intersection of a family of subgroups of a group G is a subgroup
of G. If H and K are subgroups of a group G, then, in general, H ∪ K is not a subgroup of G.
However, if H and K are subgroups of a group G with G = H ∪ K, then H = G or K = G.
Proof. Assume that there is an x ∈ G r H and a y ∈ G r K. Since G = H ∪ K, we have x ∈ K and y ∈ H.
Thus, xy ∈
/ H and xy ∈/ K, a contradiction.
Definition. Let G be a group and A a subset of G. Define hAi to be the intersection of all
subgroups of G containing A. It is the smallest subgroup of G containing A and is called the
subgroup of G generated by A. The elements of A are called generators. Moreover, we have
h∅i = {e} and
hAi = {an1 1 . . . ank k : ai ∈ A and ni ∈ Z} if A 6= ∅.
For a1 , . . . , am ∈ G, we write ha1 , . . . , am i for h{a1 , . . . , am }i. Then ∀a ∈ G, hai = {an : n ∈
Z} = ha−1 i is called the cyclic subgroup of G generated by a and the order of a is |hai| (finite
or infinite) and denoted by |a| or o(a). If G = hai for some a ∈ G, then G is said to be the cyclic
group generated by a.
Examples 1.1.3. 1. {e} and G are normal subgroups of G. They are called the trivial normal
subgroups.
2. Every subgroup of an abelian group is normal.
3. Let SLn (F ) be the set of matrices over F with determinant one. Then SLn (F ) is a normal
subgroup of GLn (F ) because det(P QP −1 ) = det Q for all P, Q ∈ GLn (F ).
4. Rn = hρ2π/n i and Rn E Dn because σρj σ = ρ−1 j for all j.
Definition. Let G be a group and X a nonempty subset of G. The centralizer of X is the set
CG (X) = {g ∈ G : ∀x ∈ X, gx = xg}
NG (X) = {g ∈ G : gX = Xg}.
Proof. By (1), Z(G) is a subgroup of G. To see it is normal, let g ∈ G and z ∈ Z(G). Let x ∈ G. Then
zg = gz and zx = xz, so
1.1.3 Homomorphisms
We now study a function between two groups that is required to preserve group operations.
6 1. Groups
ker ϕ = {g ∈ G : ϕ(g) = eH }.
Proof. Since ϕ(eG ) = eH , eG ∈ ker ϕ and eH ∈ im ϕ. Let x, y ∈ ker ϕ. Then ϕ(x) = eH = ϕ(y), so
Thus, xy −1 ∈ ker ϕ. Hence, ker ϕ is a subgroup of G. Next, let g ∈ G and x ∈ ker ϕ. Then
ϕ(x) = eH and
Thus, gxg −1 ∈ ker ϕ, and so ker ϕ is normal. Finally, let y, z ∈ im ϕ. Then ∃x1 , x2 ∈ G, ϕ(x1 ) = y
and ϕ(x2 ) = z. Thus,
Since x1 x−1
2 ∈ G, yz
−1 ∈ im ϕ. Hence, im ϕ is a subgroup of H.
G1 × G2 = {(x, y) : x ∈ G1 , y ∈ G2 }
Exercises 1.1. 1. Let G be the set of pairs of real numbers (a, b) with a 6= 0 and define:
(a, b)(c, d) = (ac, ad + b) and 1 = (1, 0).
Verify that this defines a group.
2. Consider the eight objects ±1, ±i, ±j and ±k with multiplication rules:
ij = k jk = i ki = j
ji = −k kj = −i ik = −j
2 2 2
i = j = k = −1
where the minus signs behave as expected and 1 and −1 multiply as expected. (For example, (−1)j =
−j and (−i)(−j) = ij = k.) Show that these objects form a group containing only one element of
order 2. This group is called the quaternion group and is denoted by Q8 .
3. Let H = {σ ∈ S4 : {σ(1), σ(2)} = {1, 2} or {σ(1), σ(2)} = {3, 4}}. Prove that H is a subgroup of S4
and find |H|. Is H normal in S4 ? Justify your answer.
4. Let G be a semigroup such that ∀a ∈ G, ∃b ∈ G, a = aba and ∃!e ∈ G, e2 = e. Prove that G is a group.
5. Let G be a semigroup such that ∀a, b ∈ G, a2 b = b = ba2 . Prove that G is an abelian group.
6. A certain multiplicative operation on a nonempty set G is associative and allows cancellations on
the left, and there exists a ∈ G such that x3 = axa for all x ∈ G. Prove that G endowed with this
operation is an abelian group.
7. Let G be a group with the following properties:
(i) G has no element of order 2 and (ii) (xy)2 = (yx)2 , for all x, y ∈ G.
Prove that G is abelian. If (i) fails, give an example to support that “G may not be abelian”.
8. If H and K are subgroups of a group G, prove that HK ≤ G if and only if HK = KH.
9. Let ϕ : G → Ḡ be a group homomorphism, and let N be a normal subgroup of G. Show that ϕ[N ] is
a normal subgroup of im ϕ.
10. Let ϕ : G → Ḡ be a group homomorphism, and let N̄ be a normal subgroup of Ḡ. Show that ϕ−1 [N̄ ]
is a normal subgroup of G.
11. Let G be a group with identity e and ϕ : G → G a function such that
ϕ(g1 )ϕ(g2 )ϕ(g3 ) = ϕ(h1 )ϕ(h2 )ϕ(h3 )
whenever g1 g2 g3 = e = h1 h2 h3 . Prove that there exists an element a ∈ G such that ψ(x) = aϕ(x)
for all x ∈ G is a homomorphism.
12. Let Dn be the dihedral group of order 2n where n > 2. Show that the center of Dn has one or two
elements according as n is odd or even.
Definition. Let G be a group with identity element e and X a nonempty set. We say that G acts
on X or X is a G-set if there is a mapping G × X → X (denoted by g · x or gx) which satisfies:
1. ∀ x ∈ X, e · x = x and
2. ∀g, h ∈ G, ∀x ∈ X, g · (h · x) = (gh) · x.
Remark. Let a group G act on a set X. Then each g ∈ G determines a set map φg : X → X by
φg (x) = gx.
Moreover, ∀g ∈ G, φg is a bijection (1-1 and onto). Hence, φg ∈ S(X), the symmetric group on X.
θ
The map g 7→ φg defines a group homomorphism from G to S(X) (i.e., φgh = φg ◦ φh for all
g, h ∈ G). Its kernel is the set
{g ∈ G : gx = x for all x ∈ X}.
8 1. Groups
Examples 1.2.1. 1. If X is a set, S(X) acts naturally on X by f · x = f (x) for all f ∈ S(X)
and x ∈ X. This action is faithfully if |X| > 1. In particular, Sn acts on {1, 2, . . . , n}. The
orbit of each i ∈ {1, 2, . . . , n} is all of {1, 2, . . . , n}, thus Sn acts transitively on {1, 2, . . . , n}.
If Y ⊆ {1, 2, . . . , n}, the stabilizer of Y is isomorphic to S(Y ) × S(Z) ∼ = Sk × Sn−k where
Z = {1, 2, . . . , n} r Y . Hence, the stabilizer of {n} is isomorphic to Sn−1 .
2. Let G be any group and let X = G, considered as a set. Let G act on X by left multiplication
g · x = gx.
This action is called the left regular representation. It is faithful and transitive.
3. GLn (F ) acts faithfully on F n , the set of n × 1 column vectors by left multiplication. The
orbit of ~0 is itself and GLn (F ) acts transitively on the nonzero vectors.
4. Let G be any group and let X be any set. Let G act on X by g ·x = x for all g ∈ G and x ∈ X.
This is called the trivial G action. Assuming g 6= e and X has more than one element, this
action is not faithful and not transitive. All orbits are singleton and G is the stabilizer of
every subset of X.
5. Let G be a group and let X = G, considered as a set. Let G act on X by conjugation
g · x = gxg −1 .
This action may not be faithful. The center of G acts trivially. The orbit of x ∈ G is the set
of conjugates of x, that is
g · X = {gxg −1 : g ∈ G},
called the conjugacy class of x. If |G| > 1, then this action is not transitive. The number
of orbits of the number of conjugacy classes. If Y is a subset of G, the stabilizer of Y under
the action is its normalizer, i.e., StabG Y = NG (Y ).
6. Let G be a group and let H be a subgroup of G. Let H act on G by left multiplication. This
action is faithful and the orbit of x ∈ G is
H · x = {hx : h ∈ H} = Hx.
The action is not transitive unless H = G. Moreover, we can let H act on G by h · g = gh−1
for all h ∈ H and g ∈ G. This action is also faithful and the orbit of x ∈ G is
7. Let X = C ∪ {∞}, a set that becomes the Riemann sphere in complex analysis. The group
GL2 (C) acts on X by the linear fractional transformation
a b az + b
·z = ,
c d cz + d
the understanding being the image of ∞ is a/c and the image of −d/c is ∞, just as if we
were to pass to a limit in each case.
1.2. Group Actions 9
8. Let SL2 (R) be the subgroup of real matrices in GL2 (R) of determinant one, and let H be the
subset of C ∪ {∞} in which Im z > 0, called the Poincaré upper half plane. Then SL2 (R)
acts on H by linear fractional transformations.
Theorem 1.2.1. [Cayley] Every group G is isomorphic to a subgroups of S(X) for some set X.
Theorem 1.2.2. Let G be a group and suppose G acts on a nonempty set X. Define a relation ∼
on X by
x ∼ y ⇔ ∃g ∈ G, y = g · x.
Then
1. ∼ is an equivalence relation on X.
2. The equivalence class of x ∈ X under ∼ is Gx = {gx : g ∈ G}, the orbit of x. Thus, X is a
disjoint union of orbits under the action of G.
From Example 1.2.1 (6), we can let H act on G in two ways. Then Hx [xH] is an orbit of x,
and so G is a disjoint union of left [right] cosets of H in G. If we choose a subset {xα } of G
such that G is the disjoint union of the left cosets Hxα , then {xα } is called a right transversal or
system of left coset representatives of H in G and if we choose a subset {xα } of G such that G
is the disjoint union of the right cosets yα H, then {yα } is called a left transversal or system of
right coset representatives of H in G.
Remarks. 1.SBy Theorem 1.2.2,
S
(a) G = x∈G Hx [G = x∈G xH],
(b) ∀x, y ∈ G, Hx = Hy or Hx ∩ Hy = ∅ [∀x, y ∈ G, xH = yH or xH ∩ yH = ∅],
(c) ∀x, y ∈ G, Hx = Hy ⇔ xy −1 ∈ H [∀x, y ∈ G, xH = yH ⇔ y −1 x ∈ H].
2. ∀a ∈ G, |H| = |Ha| = |aH| by cancellation on H.
3. The map aH 7→ Ha−1 for all a ∈ G is a 1-1 correspondence between the sets {xH : x ∈ G}
and {Hx : x ∈ G}.
Proof. For a, b ∈ G, aH = bH ⇔ b−1 a ∈ H ⇔ a−1 b ∈ H ⇔ Ha−1 = Hb−1 . Then this map is 1-1,
well defined and clearly onto.
Definition. The index of H in G, denoted by [G : H], is the cardinal number of distinct right
(or left) cosets of H in G, that is,
Proof. Since [G : H] = 2, G has exactly two right (or left) cosets. Then Hg = G r H and
gH = G r H for all g ∈ G not in H. Hence, ∀g ∈ G, Hg = gH, so H is normal in G.
Remark. Let I and A be sets. Define Ai = {(i, a) : a ∈ A} for all i ∈ I. Then |Ai | = |A| for all
i ∈ I, Ai ∩ Aj = ∅ if i 6= j, and
[ X X
Ai = |Ai | = |A| = |I||A|.
i∈I i∈I i∈I
Proof. Since G is a disjoint union of distinct left cosets Hxα , α ∈ Λ, and |Λ| = [G : H],
[ X X
|G| = Hxα = |Hxα | = |H| = |Λ||H| = [G : H]|H|.
α∈Λ α∈Λ α∈Λ
If G is finite, then |H| divides |G|. In addition, ∀a ∈ G, hai ≤ G, so |a| = |hai| divides |G| for all
a ∈ G.
Corollary 1.2.5. If G is a group of prime order, then {e} and G are the only two subgroups of G
and G must be cyclic.
Proof. Let H ≤ G. Then |H| divides |G| = p, so |H| = 1 or |H| = p. Thus, H = {e} and H = G.
Also, if a 6= e, then hai =
6 {e}. Hence, hai = G and so G is cyclic.
A relationship between the stabilizer of x in a group G and the number of elements in the
orbit G · x is recorded in the next theorem.
Theorem 1.2.6. [Orbit-Stabilizer Theorem] Let a group G act on a set X and suppose x ∈ X.
Then [G : StabG x] = |G · x|, that is, the index of the stabilizer of x in G is the number of elements
in the orbit of x.
This theorem is most useful when this index is finite but it is true in general. We see some
applications of this theorem in the following results.
Theorem 1.2.7. Let G be a group and x ∈ G. Then the following statements hold.
1. |{gxg −1 : g ∈ G}| = [G : CG (x)], i.e., the number of conjugates of x is [G : CG (x)].
2. If G is finite, then the number of conjugates of x is a divisor of |G|.
Proof. It follows directly from the Orbit-Stabilizer Theorem if we consider the action of G on G
by conjugation.
Corollary 1.2.8. [Class Equation] Let G be a finite group and let x1 , . . . , xs represent the conju-
gacy classes of G which contains more than one element. Then
s
X
|G| = |Z(G)| + [G : CG (xi )].
i=1
Now, let G act on the set of all subsets of G by conjugation, i.e., if Y ⊂ G, then g · Y = gY g −1 .
Under this action the stabilizer of Y is {g ∈ G : gY g −1 = Y } = NG (Y ), the normalizer of Y , and
the orbit of Y is {gY g −1 : g ∈ G}, the set of conjugates of Y . Thus, the number of conjugates
of Y is the index of the normalizer of Y . Hence, we have shown:
Theorem 1.2.9. Let G be a finite group and Y a subset of G. Then the number of conjugates of Y
is [G : NG (Y )] where NG (Y ) is the normalizer of Y . In particular, the number of conjugates of Y
divides the group order.
Remark. If H is a subgroup of G, then H ⊳ NG (H) < G. Hence, if G is finite, then the number of
conjugates of H is
[G : H]
[G : NG (H)] = ≤ [G : H].
[NG (H) : H]
Burnside’s theorem gives the number of orbits in X under the action of a finite group G.
Theorem 1.2.10. [Burnside] Let a finite group G act on a finite set X. For each g ∈ G, let
Xg = {x ∈ X : gx = x}, the set of points in X fixed by g. Then the number of orbits in X is
1 X
N= |Xg |.
|G|
g∈G
U = {(g, x) ∈ G × X : gx = x}.
For h ∈ G, let
U (h) = U ∩ ({h} × X) = {(h, x) : x ∈ X and hx = x}.
For y ∈ X, let
X X X X |G| X 1
|Xg | = |U (g)| = |U | = |U [x]| = = |G| = |G|N
|G · x| |G · x|
g∈G g∈G x∈X x∈X x∈X
as desired.
P
Corollary 1.2.11. Let a finite group G act transitively on a finite set X. Then |G| = g∈G |Xg |.
Moreover, if |X| > 1, then there exists a g ∈ G fixing no point of X.
12 1. Groups
P
Proof. Since G acts transitively on X, N = 1, and so |G| = g∈G |Xg |. Assume that |X| > 1 and
no g ∈ G fixing no point of X. Then ∀g ∈ G, ∃x ∈ X, gx = x which implies that |Xg | ≥ 1 for all
g ∈ G. Thus, X
|G| ≤ |Xg | = |G|.
g∈G
This forces that |Xg | = 1 for all g ∈ G. But |Xe | = |X| > 1, a contradiction. Hence, there exists a
g ∈ G fixing no point of X.
We have known from Lagrange’s theorem that the order of any subgroups of a group G is a
divisor of |G|. The next theorem implies that if |G| has a prime divisor p, then G has a subgroup
of order p. Its proof is another application of group actions.
Theorem 1.2.12. [Cauchy] Suppose G is a finite group and a prime p divides |G|. Then the
number of solutions of g p = e in G is a multiple of p. Hence, G contains an element of order p.
Corollary 1.2.13. If G is a finite group and a prime p divides |G|, then G has a subgroup of
order p.
Exercises 1.2. 1. Let G act on S, H act on T and assume S ∩ T = ∅. Let U = S ∪ T and define
(g, h)s = gs and (g, h)t = ht for all g ∈ G, h ∈ H, s ∈ S, t ∈ T . Show that this gives an action of the
group G × H on U .
2. Let H and K be subgroups of a group G.
|H||K|
(a) If H and K are finite, then HK is a finite set and |HK| = .
|H ∩ K|
(b) For x and y in G, prove that xH ∩ yK is empty or is a coset of H ∩ K.
(c) Deduce from (b) that if H and K have finite index in G, then so does H ∩ K.
(d) If [G : H] and [G : K] are finite and relatively prime, prove that G = HK.
3. Let α be an automorphism of a finite group G which leaves only the identity fixed. Prove that
G = {x−1 α(x)|x ∈ G}.
4. Let a group G act on a set X transitively. Prove that
(a) ∀x, y ∈ X, ∃g ∈ G, gx = y, and
(b) ∀x, y ∈ X, ∃g ∈ G, gGx g −1 = Gy , i.e., allT
stabilizers are conjugate.
5. Let H be a subgroup of a group G and N = x∈G xHx−1 . Prove that
(a) N is a normal subgroup of G, and (b) if [G : H] is finite, then [G : N ] is finite.
6. Determine the number of conjugacy classes in a non-abelian group G of order p3 where p is a prime.
7. Let S and T be sets and let M (S, T ) denote the set of all functions of S into T . Let G be a finite group
acting on S. For each map f : S → T and x ∈ G define the map πx f : S → T by (πx f )(s) = f (x−1 s).
(a) Prove that x 7→ πx is an action of G on M (S, T ).
(b) Assume that S and T are finite. Let n(x) denote the number of orbits of the cyclic group hxi
on S. Prove that the number of orbits of G in M (S, T ) is equal to
1 X
|T |n(x) .
|G|
x∈G
1.3. Quotient Groups and Cyclic Groups 13
and define 1 = (1, 1) – the units of K and H, respectively. Verify that this defines a group such
that h 7→ (1, h) is a monomorphism of H into K × H and k 7→ (k, 1) is a monomorphism of K into
K × H whose image is a normal subgroup. This group is called a semi-direct product of K by H and
is denoted by K ⋊ H.
N = gN g −1 or N g = gN.
In other words, every left coset of N in G is also a right coset of N in G. If we have two left cosets
of N in G;
N x = {ax : a ∈ N } and N y = {by : b ∈ N },
then N xN y = {axby : a, b ∈ N } = N (xN )y = N (N x)y = N xy is again a left coset of N in G.
Thus
N xN y = N xy
defines a binary operation on the set G/N = {N x : x ∈ G} of left cosets of N in G.
S
Theorem 1.3.1. Suppose G is a group and N is a normal subgroup of G. Let G = N xα be a
decomposition of G as a disjoint union of left (or right) cosets. Then the binary operation
N xα N xβ = N xα xβ
makes the set of left cosets of N into a group, called the quotient or factor group of G by the
normal subgroup N . This group is denoted by G/N . The map π : G → G/N defined by
π(x) = N x
Remarks. 1. If H is a subgroup of G which is not normal, then the set of left cosets of H
in G does not form a group in any natural way. For example, if G = S3 and H = h(12)i =
{(1), (12)}, then H is not normal in G and {H, H(13), H(23)} is not a group because
G/N = {N + x : x ∈ G}
(N + x) + (N + y) = N + (x + y)
for all x, y ∈ G.
ϕ(x(ker ϕ)) = ϕ(y(ker ϕ)) ⇔ ϕ(x) = ϕ(y) ⇔ xy −1 ∈ ker ϕ ⇔ x(ker ϕ) = y(ker ϕ),
ϕ(x(ker ϕ)y(ker ϕ)) = ϕ(xy(ker ϕ)) = ϕ(xy) = ϕ(x)ϕ(y) = ϕ(x(ker ϕ))ϕ(y(ker ϕ)).
θ / ϕ
Definition. Let G H / K be a sequence of group homomorphisms. We say that it is
exact at H if im θ = ker ϕ. A short exact sequence of groups is a sequence of groups and
homomorphisms
θ ϕ
1 /G /H /K /1
which is exact at G, H and K. That is, θ is 1-1, ϕ is onto and im θ = ker ϕ. Here, 1 stands for
the smallest group of order one.
ι π
1 /N /G / G/N /1
is exact. Here ι denotes the inclusion map. On the other hand, if N ≤ G and
ι
1 /N /G /H /1
Theorem 1.3.3. [Second Isomorphism Theorem] Suppose G is a group and H and N are sub-
groups of G, with N normal in G. Then HN = N H is a subgroup of G in which N is normal,
H ∩ N is normal in H and H/(H ∩ N ) ∼
= HN/N .
1.3. Quotient Groups and Cyclic Groups 15
ι ϕ
1 /H ∩N /H / HN/N /1,
Remark. If H and N are not normal in G, then HN may not be a subgroup of G. E.g.,
in S3 , the subgroups H = {(1), (12)} and N = {(1), (13)} are not normal in S3 and HN =
{(1), (12), (13), (132)} is not a subgroup of S3 .
Theorem 1.3.4. [Third Isomorphism Theorem] Suppose G is a group and N is a normal sub-
group of G. Then the map
θ : H 7→ H/N
gives a 1-1 correspondence
ι ϕ
1 / H/N / G/N / G/H /1,
where the homomorphism ϕ : gN 7→ gH for all g ∈ G which is well defined because N ⊆ H, and
ker ϕ = {gN : g ∈ G and gH = H} = {gN : g ∈ H} = H/N .
Remark. The above theorem is useful for obtaining subgroups and normal subgroups of a quo-
tient group.
Proof. (1) The given set contains a2 + n2 , so it is not empty and d exists by the well-ordering
principle. Then d = ax + ny > 0 for some x, y ∈ Z. We shall prove that d = gcd(a, n). By Division
Algorithm, ∃q, r ∈ Z, a = dq + r with 0 ≤ r ≤ d. If r > 0, then
Now we study cyclic subgroups of a group G. Recall that if G is a group and a ∈ G, then
hai = {am : m ∈ Z} and the order of a is |a| = |hai|.
Theorem 1.3.9. Any two cyclic groups of the same orders (finite or infinite) are isomorphic.
Proof. Assume that G is cyclic. Then G = hai for some a ∈ G. By Theorem 1.3.7, if G is infinite,
then G ∼
= Z, and if G is finite, then |G| = |a| and G = {e, a, . . . , a|a|−1 } ∼
= Z|a| .
1.3. Quotient Groups and Cyclic Groups 17
Theorem 1.3.10. Let G be a cyclic group generated by a, and let H be a subgroup of G. Then H is
also a cyclic group generated by ak where k = min{m ∈ N : am ∈ H} or H = {e}. Consequently,
every subgroup of a cyclic group is cyclic.
Theorem 1.3.12. Let G be a finite cyclic group of order n. Then G has exactly one subgroup H of
order d for each divisor d of n, and no other subgroups. Moreover, if G is generated by a, then H is
generated by an/d .
Proof. Let d | n. Since (an/d )d = e, |an/d | ≤ d. If |an/d | = r < d, then anr/d = e and nr/d < n
which contradicts |a| = n. Thus, |an/d | = d. Let H be a subgroup of G of order d. If d = 1, then
H = {e}. Assume that d > 1. By Theorem 1.3.10, H = hak i, where k = min{m ∈ N : am ∈ H}.
Since |H| = d, (ak )d = e, so n | kd which implies nd | k. Thus, k = nd q for some q ∈ Z. Hence,
ak = (an/d )q ∈ han/d i. It follows that H ⊆ han/d i. However, |H| = d = |han/d i|, so H = han/d i.
Example 1.3.2. All subgroups of the cyclic group G = hai of order 12 are shown in the following
diagram.
hai
② ❉❉
②② ❉❉
②② ❉❉
② ❉❉
②②
ha2 i ha3 i
❊❊
②② ❊❊ ③③
②②② ❊❊ ③③③
② ❊❊ ③
②② ③③
ha4 i ha6 i
❊❊
❊❊ ②②
❊❊ ②②②
❊❊ ②
②②
ha12 i
The order of an element in a cyclic group and its generators are studied in the next theorem.
Theorem 1.3.13. Let G be a finite cyclic group of order n generated by a and m ∈ Z. Then
1. ham i = had i, where d = gcd(m, n).
n
2. |am | = .
gcd(m, n)
3. am is a generator of G ⇔ gcd(m, n) = 1, and so G contains precisely φ(n) elements of
order n.
Proof. (1) Since d | m, ham i ⊆ had i. Since d = gcd(m, n), d = mx + ny for some x, y ∈ Z, so
Definition. The set of all automorphisms of a group G is denoted by Aut G and is called the
automorphism group of G.
and φg is called an inner automorphism. The subgroup of Aut G consisting of the {φg : g ∈
G} is called the inner automorphism group of G and is denoted by Inn G.
3. The map θ : g 7→ φg is a group homomorphism from G into Aut G.
4. The kernel of θ is Z(G), the center of G, and the image of θ is Inn G. Consequently,
G/Z(G) ∼
= Inn G ≤ Aut G.
= Z2 and Aut Zn ∼
Example 1.3.3. Aut Z ∼ = Z×
n.
Proof. Let ϕ ∈ Aut Z. Note that for each k ∈ N, ϕ(k) = ϕ(k · 1) = ϕ(1 · · + 1}) = k · ϕ(1)
| + ·{z
k
and ϕ(−k) = −ϕ(k) = −(k · ϕ(1)), so ϕ is completely determined by ϕ(1). Since ϕ is onto,
im ϕ = ϕ(1)Z = Z. Thus, ϕ(1) | 1, so ϕ(1) = ±1. Hence, Aut Z = {±id} ∼
= Z2 .
Let ϕ ∈ Aut Zn . Similarly, ϕ is completely determined by ϕ(1). Since ϕ is onto, im ϕ =
hϕ(1)i = Zn . By Remark after Theorem 1.3.13, ϕ(1) ∈ Z× ∼ ×
n . Hence, Aut Zn = Zn with isomor-
phism ϕ 7→ ϕ(1).
Exercises 1.3. 1. Prove that if G is a group for which G/Z(G) is cyclic, then G is abelian.
2. Let G be a group of order 2k where k is odd. Show that G contains S a subgroup of index 2.
3. Let H be a proper subgroup of a finite group G. Show that G 6= g∈G gHg −1 .
4. Let G be a group and a ∈ G. If hai ⊳ G and H < hai, prove that H is normal in G.
5. Let G be a group and N a subgroup contained in the center of G. Suppose that G/N is cyclic. Prove
that G is necessarily abelian.
6. Let G be a group. If a, b ∈ G are of finite order and ab = ba, prove that |ab| = lcm(|a|, |b|).
7. Let m and n be integers. Prove the following statements.
(a) mZ + nZ = dZ and mZ ∩ nZ = lZ where d = gcd(m, n) and l = lcm(m, n).
(b) If gcd(m, n) = 1, then Zmn ∼ = Zm × Zn . This is called the “Chinese remainder theorem”. Is the
converse true?
8. Let G be a group, K a normal subgroup of G of index r, and let g ∈ G be an element of order n.
Prove that if r and n are relatively prime, then g ∈ K.
9. Prove the following statements.
(a) If gcd(m, n) = 1, then Aut(Zm × Zn ) ∼ = Z×m × Zn .
×
∼
(b) Aut(Zp × Zp ) = GL2 (Zp ).
10. Prove Theorem 1.3.15.
11. Let H < G. Prove that CG (H) ⊳ NG (H) and NG (H)/CG (H) is isomorphic to a subgroup of Aut H.
12. If G is a group for which Aut(G) = {1}, prove that |G| ≤ 2.
1.4. The Symmetric Group 19
and fixed (that is, leaves unchanged) the other numbers in {1, 2, . . . , n} is called a cycle or an
r-cycle. We denote this as
γ = (i1 i2 . . . ir ).
It is clear that we can equally well write
Definition. Two cycles γ and γ ′ are said to be disjoint if their symbols contain no common
letters.
In this case, it is clear that any number moved by one of these transformations is fixed by the
other, i.e., ∀i, γ(i) 6= i ⇒ γ ′ (i) = i. Hence, if i is any number such that γ(i) 6= i, then γγ ′ (i) = γ(i),
and since also γ 2 (i) 6= γ ′ (i), γ ′ γ(i). Similarly, if γ ′ (i) 6= i, then γ ′ γ(i) = γ ′ (i) = γγ ′ (i). Also if
γ(i) = i = γ ′ (i), then γγ ′ (i) = γ ′ γ(i). Thus γγ ′ = γ ′ γ, that is, we have proved (1) of the following
theorem.
Proof. For (2), clearly, γ r = (1). Let 1 ≤ s < r. Then γ s (i1 ) = is+1 6= i1 , so γ s 6= (1). (3) follows
from (2) and the fact that |ab| = lcm(|a|, |b|) for all a, b ∈ G with ab = ba (see Exercises 1.3).
It is convenient to extend the definition of cycles and the cycle notation to 1-cycles where we
adopt the convention that for any i, (i) is the identity mapping. With this convention, we can see
that:
Theorem 1.4.2. Every permutation is a product of disjoint cycles. Moreover, the product is unique
up to rearranging its cycles and cyclically permuting the numbers within each cycle.
Proof. Let σ ∈ Sn . If σ = (1), we are done. Assume that σ 6= 1. Let G = hσi act on {1, 2, . . . , n
naturally as in Example 1.2.1 (1). Let B1 , B2 , . . . , Br be distinct orbits of {1, 2, . . . , n} under this
action. For each j ∈ {1, 2, . . . , r}, we define the cycle µi by
(
σ(x), if x ∈ Bi ;
µi (x) =
x, if x ∈ {1, 2, . . . , n} r Bi .
Remark. The above two theorems tell us how to find the order of an element in Sn .
Next, we shall discuss the cycle structure and the conjugacy class of a permutation.
20 1. Groups
Theorem 1.4.4. Two elements of Sn are conjugate if and only if they have the same cycle structure.
The number of conjugacy classes of Sn equals the number of partitions of n.
Proof. Assume that σ and τ are conjugate. Then τ = ασα−1 for some τ ∈ Sn . Write
τ = ασα−1 = α(i1 i2 . . . ir1 )α−1 α(j1 j2 . . . jr2 )α−1 α(k1 k2 . . . krs )α−1
= (α(i1 )α(i2 ) . . . α(ir1 ))(α(j1 )α(j2 ) . . . α(jr2 ))(α(k1 )α(k2 ) . . . α(krs )).
σ = (a1 a2 . . . ar1 )(ar1 +1 ar1 +2 . . . ar1 +r2 ) . . . (ar1 +r2 +···+rs−1 +1 . . . an−1 an )
and
τ = (b1 b2 . . . br1 )(br1 +1 br1 +2 . . . br1 +r2 ) . . . (br1 +r2 +···+rs−1 +1 . . . bn−1 bn ).
Define α ∈ Sn by α(ai ) = bi for all i ∈ {1, 2, . . . , n}. Then ασα−1 = τ .
To define an important subgroup of Sn , namely the alternating group, we shall need some
results on 2-cycles.
(1) = (x1 y1 )(x2 y2 ) . . . (xk yk ) = (1x1 )(1y1 )(1x1 )(1x2 )(1y2 )(1x2 ) . . . (1xk )(1yk )(1xk )
with xi < yi for all i ∈ {1, 2, . . . , k}. Consider any (1u), u > 1, in the right hand side. Since
the opposite side is (1), (1u) must occur twice (or even number of times) in the right hand side.
Note that (1 7→ u and u 7→ 1) will give u 7→ u. Thus each transposition in the right hand side
occurs even numbers of times, which implies that the right hand side should have even number
of transpositions. Hence, k is even.
(2) Assume
α = (x1 y1 )(x2 y2 ) . . . (xk yk ) = (w1 z1 )(w2 z2 ) . . . (wl zl )
for some xi 6= yi , wj 6= zj and k, l ∈ N. Since |(wi zi )| = 2 for all i,
(x1 y1 )(x2 y2 ) . . . (xk yk )(wl zl )−1 (wl−1 zl−1 )−1 . . . (w1 z1 )−1 = (1)
(x1 y1 )(x2 y2 ) . . . (xk yk )(wl zl )(wl−1 zl−1 ) . . . (w1 z1 ) = (1),
Remarks. Let α, β ∈ Sn .
1. αβ is even ⇔ α and β have the same parity.
2. Since αα−1 = (1) which is even, α and α−1 have the same parity.
Theorem 1.4.7. Let n > 1. The set An of all even permutations forms a normal subgroup of Sn of
index two. It is called the alternating group of degree n and |An | = n!/2.
22 1. Groups
Proof. By Theorem 1.4.6, (1) is even. It is clear that the product of even permutations is even.
Since a transposition has order two, the inverse of an even permutation is even. Hence, An is a
subgroup of Sn . Since n > 1, let (ab) be a transposition in Sn . Clearly, (ab) is an odd permutation.
We will show that Sn = An ∪ (ab)An . Let α ∈ Sn . If α is even, then α ∈ An . On the other hand,
assume that α is odd. Then (ab)α is even, so (ab)α ∈ An , i.e., α ∈ (ab)An . Thus, [Sn : An ] = 2. In
addition, since α and α−1 have the same parity, αAn α−1 ⊆ An . Hence, An is normal in Sn .
Corollary 1.4.8. Let a group G act on a finite set X, and assume that some element h ∈ G induces
an odd permutation on X. Then there exists a normal subgroup N of G with [G : N ] = 2 and
h∈/ N.
θ π
G / S(X) / S(X)/A(X) ,
where A(X) is the alternating group of even permutations on X, θ : g → φg and π is the canonical
map. Since φh is an odd permutation, π ◦ θ is onto. Choose N = ker π ◦ π. Then N ⊳ G and
G/N ∼= S(X)/A(X). Thus, [G : N ] = [S(X) : A(X)] = 2. Since (π ◦ θ)(h) = φh A(X) 6= A(X), we
have g ∈
/ N.
Corollary 1.4.9. Let |G| = 2m, where m is odd. Then G has a normal subgroup of order m. In
particular, if m > 1, then G is not simple.
Proof. Since |G| is even, let g be an element of order two in G. Let G act on G by left multiplication
θ
and onsider G → S(G). Since the action is faithful, θ is 1-1, so |θ(g)| = 2. Thus, θ(g) = φg is
an odd permutation. By the previous corollary, there exists a normal subgroup N of G such that
[G : N ] = 2. Hence, G is not simple.
Example 1.4.3. Since the Klein group V4 = {(1), (12)(34), (13)(24), (14)(23)} is normal in A4 , it
follows that A4 is not simple.
Proof. Clearly, A2 and A3 are simple. For n ≥ 5, we give a step-by-step guideline in the last
question of Exercises 1.4.
Corollary 1.4.11. If n 6= 4, then the only normal subgroups of Sn are {(1)}, An and Sn .
5. The exponent of a finite group G is the smallest positive integer n such that g n = 1 for all g ∈ G.
Find the exponent of S30 , the symmetric group on 30 letters.
6. Show that if H is any subgroup of Sn , n ≥ 2, then either all permutations in H are even or exactly
half are even.
7. Let G be a group of order 360 having a maximal subgroup isomorphic to A5 . Prove that G ∼ = A6 .
8. Prove that An is simple for n ≥ 5, following the steps and hints given.
(a) Show An contains every 3-cycle if n ≥ 3.
(b) Show An is generated by the 3-cycles for n ≥ 3. [Hint. Note that (a, c)(a, b) = (a, b, c) and
(a, b)(c, d) = (a, c, b)(a, c, d).]
(c) Let r and s be fixed elements of {1, 2, . . . , n} for n ≥ 3. Show that An is generated by the n
“special” 3-cycles of the form (r, s, i) for 1 ≤ i ≤ n. [Hint. Show every 3-cycle is the product of
“special” 3-cycles by computing
(r, s, i)2 , (r, s, j)(r, s, i)2 , (r, s, j)2 (r, s, i) and (r, s, i)2 (r, s, k)(r, s, j)2 (r, s, i).
(e) Let N be a nontrivial normal subgroup of An for n ≥ 5. Show that one of the following cases
must hold, and conclude in each case that N = An .
Case 1. N contains a 3-cycle.
Case 2. N contains a product of disjoint cycles, at least one of which has length greater
than 3. [Hint. Suppose N contains the disjoint product σ = µ(a1 , a2 , . . . , ar ). Show that
σ −1 (a1 , a2 , a3 )σ(a1 , a2 , a3 )−1 is in N , and compute it.]
Case 3. N contains a disjoint product of the form σ = µ(a4 , a5 , a6 )(a1 , a2 , a3 ). [Hint. Show that
σ −1 (a1 , a2 , a4 )σ(a1 , a2 , a4 )−1 is in N , and compute it.]
Case 4. N contains a disjoint product of the form σ = µ(a1 , a2 , a3 ) where µ is a product of
disjoint 2−cycles. [Hint. Show σ 2 ∈ N and compute it.]
Case 5. N contains a disjoint product σ of the form σ = µ(a3 , a4 )(a1 , a2 ), where µ is a product
of an even number of disjoint 2−cycles.
[Hint. Show that σ −1 (a1 , a2 , a3 )σ(a1 , a2 , a3 )−1 is in N , and compute it to deduce that α =
(a2 , a4 )(a1 , a3 ) is in N . Using n ≥ 5 for the first time, find i ∈ {1, 2, . . . , n}, where i 6=
a1 , a2 , a3 , a4 . Let β = (a1 , a3 , i). Show that β −1 αβα ∈ N , and compute it.]
Remark. Let P and Q be subgroups of G. If P is a p-group and Q is a q-group, where p and q are
distinct primes, then P ∩ Q = {e}.
Theorem 1.5.2. Let G be a finite p-group and |G| > 1. Then the following statements hold.
1. |Z(G)| > 1.
2. If |G| = p2 , then G is abelian.
Proof. By Corollary 1.5.1, |G| = pl for some l ∈ N. Recall from Corollary 1.2.8 that
s
X s
X
−1
|G| = |Z(G)| + |{gxi g : g ∈ G}| = |Z(G)| + [G : CG (xi )],
i=1 i=1
where x1 , . . . , xs represent the conjugacy classes of G which contains more than one element.
Since [G : CG (xi )] = |G|/|CG (xi )| > 1 for all i and |G| = pl , p divides |{gxi g −1 : g ∈ G}| for all
i ∈ {1, 2, . . . , s}. Hence, p | |Z(G)|, so |Z(G)| > 1. This proves (1). For the second part, assume
that |G| = p2 . We know that Z(G) is a normal subgroup of G and |Z(G)| > 1. By Lagrange
Theorem, |Z(G)| = p or |Z(G)| = p2 . If |Z(G)| = p2 , we have Z(G) = G and so G is abelian.
Suppose that |Z(G)| = p. Then G/Z(G) is of order p and so a cyclic group. This implies that G is
abelian.
Proof. Let X = {xP : x ∈ G} and let H act on X by h · xP = hxP for all x ∈ G and h ∈ H.
Clearly, |X| = [G : P ] and
|X0 | = |{xH : x ∈ G and x−1 Hx = H}| = |{xH : x ∈ NG (H)}| = [NG (H) : H],
so [NG (H) : H] ≡ [G : H] mod p . The final result clearly follows from (2).
1.5. Sylow Theorems 25
Theorem 1.5.5. [First Sylow Theorem] Let G be a group of order pn m where n ≥ 1 and p does
not divide m. Then the following statements hold.
1. G contains a subgroup of order pi for all 1 ≤ i ≤ n.
2. For each i, where 1 ≤ i < n, every subgroup H of G of order pi is a normal subgroup of a
subgroup of order pi+1 .
Proof. Since p divides |G|, by Cauchy theorem, G has a subgroup H1 of order p. Assume that
k ∈ {1, 2, . . . , n − 1} and G has a subgroup Hk of order pk . Then the index [G : Hk ] = pn−k m and
n − k ≥ 1. By Lemma 1.5.4, p divides [NG (Hk ) : Hk ] = |NG (Hk )/Hk |. Again, by Cauchy theorem,
NG (Hk )/Hk has a subgroup H of order p. By the third isomorphism theorem, H = Hk+1 /Hk for
some subgroup Hk+1 of NG (Hk ) containing Hk . Moreover, Hk ⊳ Hk+1 and |Hk+1 | = |H||Hk | =
p pk = pk+1 . Hence, there are subgroups H1 , H2 , . . . , Hn of G such that |Hi | = pi for i = 1, 2, . . . , n
and H1 ⊳ H2 ⊳ · · · ⊳ Hn .
Corollary 1.5.6. Let G be a group of order pn m where n ≥ 1 and p does not divide m.
1. G has a Sylow p-subgroup of order pn .
2. For H < G, H is a Sylow p-subgroup of G ⇔ |H| = pn .
3. Every conjugate of a Sylow p-subgroup of G is a Sylow p-subgroup of G.
4. If P is the only one Sylow p-subgroup of G, then P is normal in G.
Proof. (1) and (2) follow from the definition and the above discussion. Since a conjugate of a
subgroup of G is of the same order as the subgroup, (2) implies (3). Finally, (4) follows from
(3).
The second and third Sylow theorems determine all Sylow p-subgroups and possible numbers
of Sylow p-subgroups. Also, they give the converse of the above results.
For a finite group G and a prime p divides |G|, we write np (G) for the number of Sylow
p-subgroups of G.
Theorem 1.5.9. [Third Sylow Theorem] If G is a finite group and a prime p divides |G|, then
np (G) ≡ 1 mod p.
Proof. Let P be a Sylow p-subgroup of G. Then the set X = {xP x−1 : x ∈ G} consists of all
Sylow p-subgroups of G. Let P act on X by conjugation, namely, (g, xP x−1 ) 7→ gxP x−1 g −1 for
all g ∈ P and x ∈ G. Since gP g −1 = P for all g ∈ P , P ∈ X0 . Let Q ∈ X0 . Then gQg −1 = Q
for all g ∈ P , so P ⊆ NG (Q). Since P and Q are Sylow p-subgroups of NG (Q) and Q is normal
in NG (Q), P = Q by the uniqueness of normal Sylow p-subgroup. This proves X0 = {P }. By
Lemma 1.5.3, we have np (G) = |X| ≡ |X0 | = 1 mod p as desired.
NG (NG (P )) = NG (P ).
Proof. Since P ⊳ NG (P ), P is the only Sylow p-subgroup of NG (P ). Let x ∈ NG (NG (P )). Then
xNG (P )x−1 = NG (P ). Since P ⊆ NG (P ), xP x−1 ⊆ NG (P ). Thus, xP x−1 = P since xP x−1 is a
Sylow p-subgroup of G. Hence, x ∈ NG (P ).
Theorem 1.5.11. Let G be a group of order pq where p and q are primes and p < q. Then G is a
cyclic group, or G has q Sylow p-subgroups and p | (q − 1).
Proof. Since the number of Sylow p-subgroups divides |G| = pq, it is 1, p, q or pq. But this number
is ≡ 1 mod p, so it is 1 or q. If G has q Sylow p-subgroups, then we are done. Assume that G
has only one Sylow p-subgroup, say P . Then P is normal in G. Consider the number of Sylow
q-subgroups of G. It is again 1, p, q or pq, and ≡ 1 mod q, so the only possibility is 1 since p < q.
Thus, G also has a unique Sylow q-subgroup, say Q, and so Q is normal in G. Since the orders of
P and Q are prime, both P and Q are cyclic. Let a and b be generators of P and Q, respectively.
Note that aba−1 b−1 ∈ P ∩ Q = {e}. Thus, ab = ba, so |ab| = pq = |G|. Hence, G = habi.
Example 1.5.1. There can be no simple groups of order 200 and of order 280.
Proof. Let H be a group of order 200. Let P be a Sylow 5-subgroup of H. Then n5 (H) divides
[H : P ] = 8 and n5 (H) ≡ 1 (mod 5), so n5 (H) = 1. Hence, P is normal in H.
Next, let G be a group of order 280. By Corollary 1.5.8 and Theorem 1.5.9, we have n2 (G) =
1, 5, 7 or 35, n5 (G) = 1 or 56 and n7 (G) = 1 or 8. If n5 (G) = 1 or n7 (G) = 1, we are done. Assume
that n5 (G) = 56 and n7 (G) = 8. Then we have 56 · 4 = 224 elements of order 5, and 8 · 6 = 48
elements of order 7. Hence, G has a unique Sylow 2-subgroup.
Example 1.5.2. Every group G of order 12 that is not isomorphic with A4 contains an element of
order 6.
1.5. Sylow Theorems 27
Proof. If A is a Sylow 3-subgroup, then A = hai and |a| = 3. Let G act on {A, x2 A, x3 A, x4 A} by
(g, xA) 7→ gxA. This action induces a homomorphism θ : G → S4 whose kernel K is a subgroup
of A. Then K = {e} or K = A. If K = {e}, then G is isomorphic to a subgroup of S4 of order 12,
so G ∼= A4 which is excluded by hypothesis. Thus, A = K is normal in G which implies that A is
a unique Sylow 3-subgroup of G. Hence, a and a2 are only two elements of order 3 in G. Since
[G : CG (a)] is the number of conjugates of a which is 1 or 2, |CG (a)| = 12 or 6, so there is a b ∈ G
of order 2. Since ab = ba, |ab| = 6.
Example 1.5.3. Recall that V4 = {(1), (12)(34), (13)(24), (14)(23)} is a normal subgroup of A4 .
Since |A4 | = 12 = 22 · 3, V4 is the unique Sylow 2-subgroup of A4 . Moreover V4 has three
subgroups of order two, namely h(12)(34)i, h(13)(24)i and h(14)(23)i. Next, we analyze the Sylow
3-subgroups of A4 . They are cyclic subgroups of order three generated by a 3-cycle. Note that
there are eight 3-cycles in A4 , so we have four subgroups of order three, which are h(123)i, h(124)i,
h(134)i and h(234)i. By Exercises 1.4, A4 has no subgroup of order six. Hence, the diagram below
shows all subgroups of A4 .
❤❤❤ A4 ❳ ❯
❑❑❯❳❯❳❯❳❯❳❯❳❳❳❳
❤ ❤❤❤❤❤❤ ❑❑ ❯❯❯ ❳❳❳❳❳
❑❑ ❯
❤❤❤❤❤❤❤❤ ❑❑ ❯❯❯❯❯❳❯❳❳❳❳❳❳❳❳❳
❤❤ ❑❑ ❯❯❯❯ ❳❳❳❳❳
❤❤❤ ❯ ❳❳❳❳
❤❤❤❤
V 4 ◆◆◆◆ h(123)i h(124)i h(134)i h(234)i
♣♣♣♣♣ ◆ ◆ ✝✝ tt t
♠♠♠♠
♣♣♣
♣ ◆◆◆ ✝ tt
t ♠♠♠
♣♣♣
◆◆◆
✝✝✝ ttt ♠♠♠♠♠
✝ tt ♠♠
h(12)(34)i ❨❨❨❨ h(13)(24)i ❱ h(14)(23)i ✝✝✝ ttt ♠♠♠♠♠
❨❨❨❨❨❨ ▲▲▲ t ♠
❨❨❨❨❨❨ ❱❱❱❱❱❱❱❱ ✝ tt
✝✝ tttt ♠♠♠♠♠
♠♠
❨❨❨❨❨❨ ❱❱❱❱ ▲▲▲▲ ✝ ♠
❨❨❨❨❨❨ ❱❱❱❱ ▲▲ ✝ tt ♠♠
❨❨❨❨❨❨❱❱❱❱ ▲
❨❨❨❨❱ ✝✝t♠t♠♠♠
{(1)}
We shall see more applications of Sylow theorems in Section 3.3. It turns out that any finite
nilpotent group is the direct product of its Sylow p-subgroups.
Exercises 1.5. 1. If G is a finite p-group where p is a prime, N is normal in G and N 6= {e}, prove that
N ∩ Z(G) 6= {e}.
2. Prove that if |G| = pn with p > n, p is a prime, and H is a subgroup of G of order p, then H ⊳ G.
3. Let p be the smallest prime dividing the order of a finite group G. Show that any subgroup H of G
of index p is normal.
4. Let G be a group of order pn where p is a prime and n ∈ N. Prove that there exist normal subgroups
N1 , . . . , Nn of G such that N1 < N2 < · · · < Nn with |Ni | = pi for all i ∈ {1, 2, . . . , n}.
5. Let G be a group, M ⊳ G and N ⊳ G. Prove the following statements.
(a) If M ∩ N = {e}, then xy = yx for all x ∈ M and y ∈ N .
(b) If M and N are finite cyclic subgroups of G and gcd(|M |, |N |) = 1, then M N is a cyclic subgroup
of G of order |M ||N |.
6. Let P be a Sylow p-subgroup of a finite group G and N a normal subgroup of G. Show that:
(a) P ∩ N is a Sylow p-subgroup of N ,
(b) P N/N is a Sylow p-subgroup of G/N .
7. Show that there are no simple groups of order 148 or of order 56.
8. How many elements of order 7 are there in a simple group of order 168?
9. Let G be a group of order 153. Prove that G is abelian.
10. Let G be a group of order 231. Show that n11 (G) = 1 and the Sylow 11-subgroup of G is contained
in Z(G).
11. Show that there is no non-abelian finite simple group of order less than 60. (Hint. We may focus on
groups of the following orders: 24, 30, 40, 48, 54 and 56.)
12. Let G be a group of order 385. Show that a Sylow 11-subgroup of G is normal and a Sylow 7-subgroup
of G is contained in Z(G).
13. Let p be a prime and P a Sylow p-subgroup of a finite group G. Suppose that, for all g ∈ G, if
P 6= gP g −1 , then P ∩ gP g −1 = {e}. Show that np (G) ≡ 1 mod |P |.
28 1. Groups
14. Let G be a group of order 2013. Prove that G has a proper normal subgroup N such that G/N is
cyclic.
15. (a) Let G be a finite group and N a normal subgroup of G. If N contains a Sylow p-subgroup of
G, prove that the number of Sylow p-subgroups of N is the same as that of G (i.e., np (N ) =
np (G)).
(b) Show that if G is a group of order 130, then G has a normal subgroup of order 5.
Definition. Let A and B be groups. The direct product of A and B is defined as follows:
1. as a set A × B = {(a, b) : a ∈ A and b ∈ B} is the Cartesian product of A and B
2. multiplication is coordinatewise, namely, (a, b)(c, d) Q= (ac, bd).
More generally, if {Ai : i ∈ I} is a family of groups, then i∈I Ai is a group with coordinatewise
multiplication. It is called the direct product of the groups Ai . The subgroup
Yw n Y o
Ai = (ai ) ∈ Ai : ai = e for all but finitely many i
i∈I i∈I
Q
of Qi∈I Ai is called the external weak direct product of the groups A Pi . Note thatQ
it is normal
in A
Pw i∈I i Qw . In case A i are all additive abelian groups, we may write A
i∈I i for i∈I Ai and
i∈I Ai for i∈I Ai .
N1 N2 · · · Nm = hN1 ∪ N2 ∪ · · · ∪ Nm i.
Theorem 1.6.3. Let N1 , N2 , . . . , Nm be normal subgroups of G. Then the following statements are
equivalent.
(i) G is the internal weak direct product of N1 , . . . , Nm .
(ii) ∀x ∈ G, ∃!a1 ∈ N1 , . . . , am ∈ Nm , x = a1 . . . am .
Corollary 1.6.4. Let G be a group. Suppose that A and B are normal subgroups of G such that
1. A ∩ B = {e} 2. AB = G and 3. ∀a ∈ A, b ∈ B, ab = ba.
Then G ∼= A × B. In this case, we say that G is the internal direct product of A and B.
Theorem 1.6.5. Let {Ni : i ∈ I} be a family of normal subgroups of a group G. Then the following
statements are equivalent.
(i) G is the internal weak direct product of {Ni : i ∈ I}.
(ii) ∀x ∈ G r {e}, ∃!i1 , . . . , im ∈ I, ∃!ai1 ∈ Ni1 r {e}, . . . , aim ∈ Nim r {e}, x = ai1 . . . aim .
Next, we study the structure of a finite abelian group. Results on elements of finite order are
presented in the next theorem and we recall the Chinese Remainder Theorem in a group theoretic
language. Their proof are routine and left as exercises.
Theorem 1.6.8. Let A be an abelian group and n ∈ N. Then the following statements hold.
1. The mapping ϕn : A → A defined by ϕn (a) = an is a group homomorphism.
2. An = {an : a ∈ A} = im ϕn is a subgroup of A.
3. A(n) = {a ∈ A : an = e} = ker ϕn is a [
subgroup of A.
4. τ (A) = {a ∈ A : ∃k ∈ N, ak = e} = A(n) is a subgroup of A. It is called the torsion
n∈N
subgroup of A.
v ≡ ni mod mi
for all i = 1, . . . , k.
Remark. The Chinese Remainder Theorem may be restated as: If m1 , . . . , mk are pairwise rela-
tively primes and m = m1 . . . mk , then
Zm ∼
= Z m1 ⊕ · · · ⊕ Z mk .
Corollary 1.6.10. If m = pn1 1 . . . pnk k where n1 , . . . , nk ∈ N and p1 , . . . , pk are distinct primes, then
Zm ∼
= Zpn1 1 ⊕ · · · ⊕ Zpnk .
k
30 1. Groups
Proof. First we show existence, then uniqueness. The gi are in fact powers of g.
Existence: By the Chinese Remainder Theorem, choose v1 , . . . , vk satisfying
For each i, let vi = λi (m/mi ) for some λi ∈ Z and set gi = g vi . Then we have
(i) gimi = g vi mi = g λi (m/mi )mi = g λi m and
(ii) g1 , . . . , gk are powers of g and hence commute pairwise.
(iii) Note that v1 + · · · + vk − 1 ≡ 0 mod mi for i = 1, 2, . . . , k, that is, mi |(v1 + · · · + vk − 1). Since
m1 , . . . , mk are pairwise relatively prime, v1 + · · · + vk ≡ 1 mod m1 . . . mk , so
g1 . . . gk = g v1 . . . g vk = g v1 +···+vk = g.
In case g has order m = pa11 . . . pakk where mi = pai i and p1 , . . . , pk are distinct primes, gi is
called the pi -primary part of g.
Here, the pij are distinct primes and uij ≥ 1. Since every element of Ai satisfies g mi = e, |Ai |
involves only those primes occuring in mi by Cauchy theorem. This forces |Ai | = mi for all i.
Remark. If m = pa11 . . . pakk , then Ai in Theorem 1.6.12 is just the Sylow pi -subgroup of A.
Theorem 1.6.13. Let A be an abelian group with |A| = pu where p is a prime. Suppose A has
exponent p, (that is, ap = e for all a ∈ A). Then
A∼
= Zp × · · · × Zp = (Zp )u .
| {z }
u copies
which implies b = a1−λi1 . . . ak−λik ∈ im θ. Therefore, θ is onto as claimed, and we have an isomor-
phism
θ
ha1 i × · · · × hak i ∼
/A.
=
Thus, pk = |ha1 i| . . . |hak i| = |ha1 i×· · ·×hak i| = |A| = pu , so k = u and the theorem is proved.
Remark. If we write A in Theorem 1.6.13 additively, we see that it is just a vector space over the
field Zp . Since A is finite, it is a finite dimensional vector space over Zp . All we were doing in
Theorem 1.6.13 is finding a basis for A as a vector space over Zp .
Theorem 1.6.14. [Burnside basis theorem for abelian p-groups] Suppose A is an abelian group
of exponent pk where p is a prime. Let Ap = {ap : a ∈ A}. If H is a subgroup of A and HAp = A,
then H = A. Equivalently, if the cosets Ap a1 , . . . , Ap ak of A/Ap generate A/Ap , then a1 , . . . , ak
generate A.
2
Proof. Observe that HAp = A implies H p Ap = Ap , so
2 2
A = HAp = H(H p Ap ) = HAp .
2 3
Also, HAp = A implies H p Ap = Ap , so
3 3
A = HAp = H(H p Ap ) = HAp .
r k k
Continue inductively, we have A = HAp for all r. But Ap = {e}, so A = HAp = H. This
completes the proof.
32 1. Groups
Theorem 1.6.15. Let A be a finite abelian group of exponent p where p is a prime, and let H be
a subgroup of A. Then there exists a subgroup K of A such that H ∩ K = {e} and HK = A. In
other words, A is the internal direct product of H and K.
Proof. Let K be a subgroup of A satisfying H ∩K = {e} and among all subgroups K of A satisfying
H ∩ K = {e}, K is as large as possible. We claim that HK = A which proves the theorem.
For, suppose conversely that a ∈ A and a ∈ / HK. Then H ∩ hK, ai = 6 {e} by the maximal
choice of K, so there is a nontrivial element
Remark. As with Theorem 1.6.13, the above theorem can be regarded as a statement about
vector spaces over Zp as follows: If V is a finite dimensional vector space over Zp and U is a
subspace, then there is a subspace W such that V = U ⊕ W .
Theorem 1.6.16. Let A be a finite abelian p-group. Then A is (isomorphic to) a direct product of
cyclic groups.
Proof. We use induction on |A|. If |A| = 1, the result is clear. Now suppose |A| = pu > 1.
We assume inductively that any p-group where order is less than pu is a direct product of cyclic
groups. Consider the group
Ap = {ap : a ∈ A}.
Claim A 6= Ap . For, suppose A = Ap . Then
2 u
A = Ap = Ap = · · · = Ap = {e}
Ap = hap1 i × · · · × hapk i.
θ
More precisely, the map (a1pi1 , . . . , api 0 pi1 pik
k ) 7−→ a1 . . . ak is an isomorphism.
k
θ
ha1 i × · · · × hak i → H
(ai11 , . . . , aikk ) 7→ ai11 . . . aikk
api 1 pik
1 = · · · = ak = e
1.6. Finite Abelian Groups 33
since the map θ0 above is 1-1. Consider the integers i1 , . . . , ik . If p ∤ it for some of these integers,
then api t
t
= e implies apt = e. For, (api t λ
t )t = (at ) = apt , where λt it ≡ 1 mod |at |. Thus,
λ t it p
so apj 1 pjk
1 = · · · = ak = e since θ0 is 1-1. Hence, θ is 1-1.
A ■■
✇✇ ■■
✇✇✇ ■■
■■
✇
✇✇ ■■
✇✇
H Ap
✈ ❋❋❋ ✈ ❆❆
✈✈✈ ❋❋ ✈✈ ❆❆
✈✈
✈ ❋❋ ✈✈✈ ❆❆
✈ ❋❋ ✈ ❆❆
✈✈ ✈✈
Ap = H p H ∩ Ap K
❍❍ ⑥⑥
❍❍ ⑥⑥
❍❍ ⑥⑥
❍❍
⑥⑥
{e}
Next, let Ap = {a ∈ A : ap = e}. Then Ap is a finite group of exponent p and contains H ∩ Ap
as a subgroup. Therefore, H ∩ Ap has a component in Ap by Theorem 1.6.15. More precisely,
there is a subgroup K of Ap such that
(a) (H ∩ Ap ) ∩ K = {e} and (b) (H ∩ Ap )K = Ap .
Note that since K is a group of exponent p, K is a direct product of copies of Zp by Theorem
1.6.13. Finally, we claim that
(I) H ∩ K = {e} and (II) HK = A.
They implies that A is a direct product of H and K which are both direct products of cyclic groups.
(I) Suppose H ∩ K 6= {e}. Thus, there is some x ∈ H ∩ K with x 6= e and xp = e. But then
x ∈ Ap , so (H ∩ Ap ) ∩ K = (H ∩ K) ∩ Ap 6= {e}, contradicting (i) above.
(II) Suppose a ∈ A. Then ap ∈ Ap = hap1 i × · · · × hapk i, so ap = api 1 pik i1 ik p
1 . . . ak = (a1 . . . ak ) = b
p
i1 ik −1
where b = a1 . . . ak ∈ H = ha1 , . . . , ak i. Thus, b a ∈ Ap = (H ∩ Ap )K ⊆ HK by (ii)
above. Hence, a = b(b−1 a) ∈ HK and A = HK as required.
This completes the proof.
A = Zp × . . . Zp × Zp 2 × · · · × Zp 2 × · · · × Zp m × · · · × Zp m
| {z } | {z } | {z }
u1 copies u2 copies um copies
is isomorphic to
B = Zp × . . . Zp × Zp 2 × · · · × Zp 2 × · · · × Zp m × · · · × Zp m
| {z } | {z } | {z }
v1 copies v2 copies vm copies
where ui , vi ≥ 1. Then ui = vi for all i = 1, . . . , m. In other words, the orders and multiplicities
of the factors in a decomposition of a finite abelian p-group uniquely determine the group up to
isomorphism.
Proof. Since A ∼
= B, it follows that for any positive integer n,
# of solutions of xn = e in A = # of solutions of xn = e in B.
n # of solutions of xn = e in A # of solutions of xn = e in B
p pu1 +u2 +u3 +···+um pv1 +v2 +v3 +···+vm
p2 pu1 +2u2 +2u3 +···+2um pv1 +2v2 +2v3 +···+2vm
p3 pu1 +2u2 +3u3 +···+3um pv1 +2v2 +3v3 +···+3vm
.. .. ..
. . .
pm−1 pu1 +2u2 +3u3 +···+(m−1)um−1 +(m−1)um pv1 +v2 +···+(m−1)vm−1 +(m−1)vm
pm pu1 +2u2 +3u3 +···+(m−1)um−1 +mum pv1 +v2 +···+(m−1)vm−1 +mvm
Then we have
u1 + u2 + u3 + · · · + um = v 1 + v 2 + v 3 + · · · + v m
u1 + 2u2 + 2u3 + · · · + 2um = v1 + 2v2 + 2v3 + · · · + 2vm
u1 + 2u2 + 3u3 + · · · + 3um = v1 + 2v2 + 3v3 + · · · + 3vm
..
.
u1 + 2u2 + 3u3 + · · · + (m − 1)um−1 +(m − 1)um =
v1 + 2v2 + 3v3 + · · · + (m − 1)vm−1 + (m − 1)vm
u1 + 2u2 + 3u3 + · · · + (m − 1)um−1 +mum =
v1 + 2v2 + 3v3 + · · · + (m − 1)vm−1 + mvm .
defines a 1-1 correspondence between partitions of n and isomorphism classes of abelian groups
of order pn . In particular, the number of isomorphism classes of abelian groups of order pn is the
number of partitions of n.
Theorem 1.6.19. A finite abelian group is (isomorphic to) a direct product of cyclic groups.
Corollary 1.6.20. If m is a square free integer, then every abelian group of order m is cyclic.
Proof. Assume that an abelian group A is of order m = p1 . . . pr where pi are distinct primes.
Then
A∼= Zp 1 × · · · × Zp r ∼
= Zm
by Theorem 1.6.12 and the Chinese remainder theorem, respectively.
= B1 ∧ . . . ∧ Ak ∼
= B ⇔ (A1 ∼
A∼ = Bk ).
Z p u1 × · · · × Z p uk ∼
= Zn . (1.6.2)
1 k
This gives rise to a second way of writing a finite abelian group A as a direct product of cyclic
groups. Namely, let p1 , . . . , pk be the primes dividing |A|, and let
A = A1 × · · · × Ak ,
where Ai is the pi -primary part of A. Express each Ai as a direct product of cyclic factors and
assume that t is the largest number of factors occurring in any Ai . Write
where
0 ≤ vi1 ≤ vi2 ≤ · · · ≤ vit (1.6.3)
and we have allowed (for notational convenience) some vij to be zero. Let
A = A1 × · · · × Ak
= (Zpv11 × · · · × Zpv1t ) × (Zpv12 × · · · × Zpvk2 ) × · · · × (Zpv1t × · · · × Zpvkt )
1 1 1 k 1 k
∼
= Zn1 × Zn2 × · · · × Znt .
36 1. Groups
The integers nj are completely determined by the decomposition of A into a direct of cyclic
pi -groups. Conversely, given that
Theorem 1.6.21. Let A be a finite abelian group. Then there exist integers n1 , . . . , nt > 1 such
that n1 | n2 , n2 | n3 , . . . , nt−1 | nt and
A∼
= Zn1 × Zn2 × · · · × Znt ,
where these integers are uniquely defined by A. More precisely, if m1 , . . . , ms are positive integers
greater than 1 such that m1 | m2 , m2 | m3 , . . . , ms−1 | ms , and
then t = s, and n1 = m1 , . . . , nt = mt .
Solution. By Example 1.6.2 and Theorem 1.6.21. We have the following answers.
6 = 2 · 3: Z2 × Z3 ∼= Z6 ,
12 = 22 · 3: Z2 × Z2 × Z3 ∼= Z2 × Z6 and Z22 × Z3 ∼
= Z12 ,
3
27 = 3 : Z3 × Z3 × Z3 , Z3 × Z32 and Z33 ,
500 = 22 · 53 : Z2 × Z2 × Z5 × Z5 × Z5 ∼ = Z2 × Z10 × Z10 , Z2 × Z2 × Z5 × Z52 ∼= Z10 × Z50 ,
∼ ∼ ∼
Z2 × Z2 × Z53 = Z2 × Z250 , Z22 × Z5 × Z5 × Z5 = Z5 × Z5 × Z20 , Z22 × Z5 × Z52 = Z5 × Z100 and
Z22 × Z53 ∼= Z500 .
Corollary 1.6.22. Let A be a finite abelian group. If m divides the order of A, then A has a
subgroup of order m.
Proof. Write A = Zn1 × Zn2 × · · · × Znt as in the above theorem. Then |A| = n1 n2 . . . nt . Since m
divides |A|, m = l1 l2 . . . lt with li | ni for all i ∈ {1, 2, . . . , t}. Then Zni /li is a subgroup of Zni of
order li for all i. Thus,
Zn1 /l1 × Zn2 /l2 × · · · × Znt /lt
is a subgroup of A of order l1 l2 . . . lt = m as desired.
Exercises 1.6. 1. Suppose G1 and G2 are finite groups of relatively prime orders. Show that every
subgroup of G1 ×G2 is of the form H1 ×H2 for some subgroups H1 and H2 of G1 and G2 , respectively.
2. Let G1 and G2 be simple groups. Show that every nontrivial normal subgroup of G = G1 × G2 is
isomorphic to either G1 or G2 .
3. Proof Theorem 1.6.8.
4. Find the order of torsion subgroup of Z4 × Z × Z and of Z12 × Z × Z12 .
5. Find the torsion subgroup of the multiplicative group R∗ .
1.6. Finite Abelian Groups 37
This concludes the basic theory of groups. More advanced group theory will be studied in
Chapter 3.
38 1. Groups
2 | Rings and Fields
Rings and fields are the most common algebraic structures for students. They have learned addi-
tion together with multiplication since elementary schools. The abstract treatments using groups
are presented in the first section. Ideals and factorizations are discussed in details. Finally, we
talk about polynomials over a ring and which will be used in a construction of field extensions.
Unless the contrary is explicitly stated “ring” will mean “ring with identity”.
Definition. A subset S of a ring R is a subring if S is a subgroup of the additive group and also
a submonoid of the multiplicative monoid of R.
Clearly the intersection of any set of subrings of R is a subring. Hence, we may define:
Definition. If A is a subset of R one can define the subring generated by A to be the intersec-
tion of all subrings of R which contain A.
Examples 2.1.1. 1. Z, Q, R and C are commutative rings under usual addition and multipli-
cation.
2. For n ∈ N, (Zn , +, ·) is a commutative ring.
3. Recall that for a nonempty set X and P (X) the power set of X, we define A△B = (A r
B) ∪ (B r A) for all subsets A and B of X. Then (P (X), △, ∩) is a commutative ring with
identity X.
39
40 2. Rings and Fields
4. If A is an abelian group, then End(A), the set of all homomorphisms on A, is a ring with
the addition and multiplication are given by
hold. There are also a number of simple consequences of the distributive laws which we now
note. In the first place, induction on m and n give the generalization
m
! n m Xn
X X X
ai bj = a i bj .
i=1 j=1 i=1 j=1
0 = 0b = (a + (−a))b = ab + (−a)b,
If a and b commute, that is, ab = ba, then am bn = bn am . Also, by induction we can prove the
binomial theorem
n n n n−1 n n−2 2 n
(a + b) = a + a b+ a b + ··· + abn−1 + bn
1 2 n−1
for all n ∈ N.
Remark. In the case 1 = 0 in a ring R, we have that
Definition. A ring D is a division ring or skew field if every nonzero element of D is invertible.
A commutative division ring is called a field.
Example 2.1.3. Let n ∈ N, R a ring and Mn (R) the set of all n × n matrices over R. Then
(Mn (R), +, ·) is a ring under the usual addition and multiplication of matrices with unity In , the
identity matrix. If n > 1, then Mn (R) is not commutative. The group of invertible elements of
Mn (R) is called the general linear group and denoted by GLn (R). For the case R is commutative,
we can derive the determinant criterion for a matrix A to be invertible. We have the following
results.
Some rings do not have the property that the product of two nonzero elements is always
nonzero. If so, it leads to the cancellation property in the rings.
Proof. Let D be a finite integral domain. Then (D r {0}, ·) is a finite cancellative semigroup. By
Theorem 1.1.2, (D r {0}, ·) is a group. Hence, if a ∈ D and a 6= 0, then a has an inverse under ·.
Since D is commutative, D is a field.
42 2. Rings and Fields
2.1.2 Quaternions
In 1843, W. R. Hamilton constructed the first example of a division ring in which the commutative
law of multiplication does not hold. This was an extension of the field of complex numbers,
whose elements were quadruples of real numbers (α, β, γ, δ) for which the usual addition and a
multiplication were defined so that 1 = (1, 0, 0, 0) is the unit and i = (0, 1, 0, 0), j = (0, 0, 1, 0),
and k = (0, 0, 0, 1) satisfy i2 = j 2 = k 2 = −1 = ijk. Hamilton called his quadruples, quaternions.
Previously, he had defined complex numbers as pairs of real numbers (α, β) with the product
(α, β)(γ, δ) = (αγ − βδ, αδ + βγ). Hamilton’s discovery of quaternions led to a good deal of
experimentation with other such “hypercomplex” number systems and eventually to a structure
theory whose goal was to classify such systems. A good deal of important algebra thus evolved
from the discovery of quaternions.
We shall not follow Hamilton’s way of introducing quaternions. Instead we shall define this
system as a certain subring of the ring M2 (C) of 2 × 2 matrices with complex number entries. This
will have the advantage of reducing the calculations to a single simple verification.
We consider the subset H of the ring M2 (C) of complex 2 × 2 matrices that have the form
√ √
a b α0 + α1 √−1 α2 + α3 √−1
x= = , αi real. (2.1.1)
−b̄ ā −α2 + α3 −1 α0 − α1 −1
We claim that H is a subring of M2 (C). Since a1 − a2 = ā1 − ā2 for complex numbers, it is
clear that H is closed under subtraction; hence H is a subgroup of the additive group of M2 (C).
We obtain the unit matrix by taking a = 1, b = 0 in (2.1.1). Hence, 1 ∈ H. Since
a b c d ac − bd¯ ad + bc̄
=
−b̄ ā −d¯ c̄ −b̄c − ād¯ −b̄d + āc̄
and a1 a2 = ā1 ā2 , the right-hand side has the form
u v
−v̄ ū
¯ v = ad + bc̄. Therefore, H is closed under multiplication and so H is a subring
where u = ac − bd,
of M2 (C).
We now show that H is a division ring. We note first that
√ √
α0 + α1 √−1 α2 + α3 √−1
∆ := det = α02 + α12 + α22 + α32 .
−α2 + α3 −1 α0 − α1 −1
Since the αi are real numbers, this is real, and is 0 only if every αi = 0, that is, if the matrix
is 0. Hence, every non-zero element of H has an inverse in M2 (C). Moreover, we have, by the
definition of the adjoint, that
a b ā −b
adj = .
−b̄ ā b̄ a
¯ = a, this is obtained from the x in (2.1.1) by replacing a by ā and b by −b, and so it is
Since ā
contained in H. Thus, if the matrix x is 6= 0 then its inverse is
−1
ā∆ −b∆−1
b̄∆−1 a∆−1
and this is contained in H. Hence, H is a division ring.
H contains
The ring in its center the field R of real numbers identified with the set of diagonal
α 0
matrices , α ∈ R. H also contains the matrices
0 α
√ √
−1 √0 0 1 √0 −1
i= ,j = ,k = .
0 − −1 −1 0 −1 0
2.1. Basic Concepts 43
We verify that
x = α0 + α1 i + α2 j + α3 k (2.1.2)
and if α0 + α1 i + α2 j + α3 k = β0 + β1 i + β2 j + β3 k, βi ∈ R, then
√ √ √ √
α0 + α1 √−1 α2 + α3 √−1 β0 + β1 √−1 β2 + β3 √−1
=
−α2 + α3 −1 α0 − α1 −1 −β2 + β3 −1 β0 − β1 −1
so αl = βl , 0 ≤ l ≤ 3. Thus, any x in H can be written in one and only one way in the from
(2.1.2). The product of two elements in H
(α0 + α1 i + α2 j + α3 k)(β0 + β1 i + β2 j + β3 k)
is determined by the product and sum in R, the distributive laws and the multiplication table
i2 = j 2 = k 2 = −1
Definition. Let R be a ring. If there is a smallest positive integer n such that na = 0 for all a ∈ R,
then R is said to have characteristic n. If no such n exists, R is said to have characteristic
zero. We denote the characteristic of R by char R.
Remark. It is easy to see that char R = n if and only if n is the smallest positive integer such that
n1R = 0.
Example 2.1.5. The rings Z, Q, R, C and H are of characteristic zero, char Zn = n and char(Zm ×
Zn ) = lcm(m, n).
Proof. Let R be an integral domain of characteristic n > 0. Assume that n = ab for some a, b ∈ N.
It follows that 0 = n1R = (ab)1R = (a1R )(b1R ). Since R has no zero divisor, a1R = 0 or b1R = 0.
Then a = n or b = n. Hence, n is a prime.
Theorem 2.1.6. Let R be a ring of characteristic a prime p and a, b ∈ R. If a and b commute, then
k k k
(a + b)p = ap + bp and (a + b)p = ap + bp for all k ∈ N.
44 2. Rings and Fields
Proof. Note that if 1 ≤ r ≤ p − 1, then the binomial coefficient pr is a multiple of p, so it is 0 in
R. Hence,
p p−1 p
(a + b)p = ap + a b + ··· + abp−1 + bp = ap + bp .
1 p−1
A simple induction on k gives the second equation. The inductive step is
k k−1 k−1 k−1 k k
(a + b)p = ((a + b)p )p = (ap − bp ) p = a p + bp
Like in groups, a ring homomorphism is a function between two rings that preserves both
addition and multiplication.
Proof. (1) Observe that ϕ(1) = ϕ(1 · 1) = ϕ(1)ϕ(1), so ϕ(1) = 0 or ϕ(1) = 1. Moreover, ϕ(n) =
nϕ(1) for all n ∈ Z. Thus, ϕ(n) = 0 for all n ∈ Z or ϕ(n) = n for all n ∈ Z as desired.
(2) Similar to Z, ϕ(1) = 0 or ϕ(1) = 1 and ϕ(n) = nϕ(1) for all n ∈ Z. For m ∈ Z and n ∈ N,
we have ϕ(m(1/n)) = mϕ(1/n) and ϕ(1) = ϕ(n(1/n)) = nϕ(n). If ϕ(1) = 0, then ϕ(1/n) = 0 for
all n ∈ N, so ϕ is the zero map. On the other hand, if ϕ(1) = 1, then ϕ(1/n) = 1/n for all n ∈ N
which implies ϕ(m/n) = m/n for all m ∈ Z and n ∈ N.
(3) Assume that ϕ(x) is not the zero map. We can show that ϕ(x) = x for all x ∈ Q. Moreover, for
√ √
x ∈ R+ , ϕ(x) = ϕ(( x)2 ) = (ϕ( x))2 > 0. This implies ∀a, b ∈ R, a < b ⇒ ϕ(a) < ϕ(b). Now, let
x ∈ R. Suppose that ϕ(x) 6= x. Then ϕ(x) < x or x < ϕ(x). By the density theorem, ∃q1 , q2 ∈ Q
such that ϕ(x) < q1 < x or x < q2 < ϕ(x). Thus, ϕ(x) < q1 < ϕ(x) or ϕ(x) < q2 < ϕ(x) yields a
contradiction. Hence, ϕ(x) = x for all x ∈ R.
(4) is proved in the next section.
2.1. Basic Concepts 45
Let G = {gi : i ∈ I} be any multiplicative group, and let R be any commutative ring. Let RG
be the set of all formal sums X
ai gi
i∈I
for ai ∈ R and gi ∈ G, where all but finite number of the ai are 0. Define the sum of two elements
of RG by
X X X
ai gi + bi gi = (ai + bi )gi .
i∈I i∈I i∈I
Observe that (ai + bi ) = 0 except for a finite number of indices i, so the above
P sum is again in RG.
It is immediate that (RG, +) is an abelian group with additive identity i∈I 0gi .
Multiplication of two elements of RG is defined by the use of the multiplications in G and R
as follows:
X X X X
ai gi bi gi = a j bk g i .
i∈I i∈I i∈I gj gk =gi
P P
Naively, we formally distribute the sum i∈I ai gi over the sum i∈I bi gi and rename a term
a j g j bk g k P
by aj bk gk where gj gk = gi in G. Since ai and bi are 0 for all but a finite number of i,
the sum gj gk =gi aj bk contains only a finite number of nonzero summands aj bk ∈ R and may
P
thus be viewed as an element of R. Again at most a finite number of such sums gj gk =gi aj bk are
nonzero. Thus, multiplication is closed on RG. We can proceed to show that
Definition. The ring RG defined above is the group ring of G over R. If F is a field, then F G
is the group algebra of G over F .
5. A ring R is called a Boolean ring is x2 = x for all x ∈ R. Prove that every Boolean ring is commuta-
tive.
6. (a) Show that ϕ : Z12 → Z30 given by ϕ([a]12 ) = [10a]30 is a ring homomorphism.
(b) Show that ϕ : Z12 → Z30 given by ϕ([a]12 ) = [5a]30 is a additive group homomorphism. Is it a
ring homomorphism?
7. Consider (S, +, ·), where S is a set and + and · are binary operations on S which satisfy the distribu-
tive laws such that (S, +) and (S r {0}, ·) are groups. Show that (S, +, ·) is a division ring.
46 2. Rings and Fields
Prove that (R × Z, +, ·) is a ring with unity (0, 1) and of characteristic zero. Ditto the set R × Zn and
prove that it is a ring of characteristic n.
13. A ring R is simple if R and {0} are the only ideals in R. Show that the characteristic of a simple ring
is either 0 or a prime p.
14. If R is a finite integral domain, prove that |R| is a prime power.
ker ϕ = {x ∈ R : ϕ(x) = 0S }
is an ideal of R.
Proof. It is immediate that ker ϕ is a subgroup of (R, +). If a ∈ R and x ∈ ker ϕ, then ϕ(ax) =
ϕ(a)ϕ(x) = ϕ(a)0 = 0 and ϕ(xa) = ϕ(x)ϕ(a) = 0ϕ(a) = 0. Hence, ax and xa are in ker ϕ.
Examples 2.2.1. 1. All distinct ideals of Zn are dZn , where d = 0 or (d ∈ N and d | n).
2. All distinct ideals of Z are mZ, where m ∈ N ∪ {0}.
Definition. A ring R is a principal ideal ring if every ideal of R is principal. A principal ideal
domain (PID) is a principal ideal ring which is an integral domain.
Remark. Let F be a field, R a ring and ϕ : F → R a ring homomorphism. Then ker ϕ is either
{0} or F which implies ϕ is 1-1 or is the zero map, respectively. Hence, every nonzero ring
homomorphism of fields must be 1-1. In particular, one can readily verify that the only ring
endomorphisms of Zp are the zero map and the identity map. This finishes the proof of Example
2.1.6.
Theorem 2.2.2. Let R be a commutative ring whose only ideals are {0} and R itself. Then R is a
field.
Proof. Let a ∈ R r {0}. Then (a) = R, so 1 ∈ (a). Since R is commutative, there is a b ∈ R such
that ab = 1 = ba.
48 2. Rings and Fields
r + I = {r + a : a ∈ I}.
(r + I)(s + I) = rs + I.
rs − r′ s′ = (r − r′ )s + r′ (s − s′ ) ∈ I.
Thus, the above multiplication is well-defined, it is easy to see that R/I is a ring. Hence, we have
the next theorem.
(r + I) + (s + I) = (r + s) + I and (r + I)(s + I) = rs + I
make R/I into a ring with unity 1 + I, called the factor or quotient ring of R by I. The map
ϕ : R → R/I defined by ϕ(r) = r + I is an onto ring homomorphism which has kernel I. It is
called the canonical projection of R onto R/I.
There also are three isomorphism theorems for rings. Their proofs are similar to isomorphism
theorems for groups. Hence, we shall just sketch them.
R/ ker ϕ ∼
= im S.
Proof. Define ϕ̄ : R/ ker ϕ → S by ϕ̄(r + ker ϕ)ϕ(r) for all r ∈ R. Clearly, ϕ̄ is onto and it is easy
to check that ϕ̄ is a ring homomorphism. Moreover, for r, s ∈ R, we have
Hence, ϕ̄ is an isomorphism.
S/(S ∩ I) ∼
= (S + I)/I.
Theorem 2.2.6. [Third Isomorphism Theorem] If I and J are ideals of a ring R such that I ⊆ J,
then J/I is an ideal of R/I and
(R/I)/(J/I) ∼= R/J.
Proof. Define ϕ : R/I → R/J by ϕ(r + I) = r + J for all r ∈ R. It can be verified that ϕ is a ring
homomorphism with kernel J/I and the theorem follows from the first isomorphism theorem.
2.2. Ideals, Quotient Rings and the Field of Fractions 49
Remark. As for groups, the third isomorphism theorem gives a 1-1 correspondence between the
set of ideals of R containing I and the set of ideals of R/I.
Definition. We say that a ring R can be embedded in a ring R′ if there exists a monomorphism
(i.e., 1-1 homomorphism) of R into R′ .
Example 2.2.4. A ring R can be embedded in the ring Mn (R) by the diagonal map a 7→ aIn .
Theorem 2.2.7. Any ring R without identity can be embedded in a ring R′ with identity. Moreover,
R′ can be chosen to be either of characteristic zero or of same characteristic as R.
Proof. Consider the rings R×Z and R×Zn defined in the last question of Ring Exercise Set I. They
are rings with unity (0, 1) and (0, 1̄), and of characteristic 0 and n, respectively. If char R = 0, we
define ϕ : R → R × Z by ϕ(x) = (x, 0) and if char R = n, we define ϕ : R → R × Zn by
ϕ(x) = (x, 0̄). It is easy to show that both functions are monomorphisms. This finishes the
proof.
We now wish to show that every integral domain can be embedded in a field, called its field
of fractions such that every element of the field is a fraction a/b where a and b lie in the integral
domain and b 6= 0. There is only one problem to overcome: we might wish to define the field
to be the set of all “fraction” a/b, with b 6= 0. But this is not quite right because two different
fractions may be the same number. E.g., 1/2 = 2/4 = 3/6. We overcome this problem by defining
an equivalence relation on certain pairs of elements in the integral domain. The results are
presented in the next theorem. Its proof is routine and omitted.
Theorem 2.2.8. Suppose D is an integral domain, and let S be the set of pairs
Remark. If R is an entire ring which is not commutative, the construction Q(R) above does not
exist in general.
Example 2.2.5. Let D be an integral domain and a, b ∈ D. If am = bm and an = bn , for m and n
relatively prime positive integers, prove that a = b.
Proof. If a = 0, then b = 0 since D has no zero divisor. Assume that a 6= 0. Then b 6= 0. Let F be
the field of fraction of D. Since (m, n) = 1, ∃x, y ∈ Z, mx + ny = 1. Thus, in F , we have
a = a1 = amx+ny = (am )x (an )y = (bm )x (bn )y = bmx+ny = b1 = b,
so a = b in D.
50 2. Rings and Fields
Exercises 2.2. 1. An element a of a ring R is nilpotent if an = 0 for some n ∈ N. Show that the set of
all nilpotent elements N in a commutative ring R is an ideal, called the nilradical of R. Moreover,
prove that R/N has no nonzero nilpotent.
2. Show that a ring R has no nonzero nilpotent element if and only if 0 is the only solution of x2 = 0
in R.
3. Let ϕ : R → S be a homomorphisms of rings. Prove the following statements.
(a) If I is an ideal of R and ϕ is onto, then ϕ(I) is an ideal of S.
(b) If J is an ideal of S, then ϕ−1 (J) is an ideal of R containing ker ϕ.
4. Let R be a commutative ring and I an ideal of R. Show that
√
I = {x ∈ R : ∃n ∈ N, xn ∈ I}
M ⊆ J ⊆ R ⇒ J = M or J = R.
Proof. Let n be a prime and let J be an ideal of Z such that nZ ⊆ J ⊆ Z. Then J = dZ for some
d ∈ N and d | n, so d = 1 or d = n. Hence, J = nZ or J = Z. On the other hand, assume
that n = ab for some 1 < a, b < n. Then nZ ⊆ aZ ⊆ Z, aZ 6= nZ and aZ 6= Z, so nZ is not
maximal.
Remarks. 1. Every ideal I 6= R is contained in some maximal ideal M .
Proof. Let I = {J : J 6= R and J is an ideal of R containing I}. Let C = {Jα }α∈Λ be a chain in I .
Then ∪C is an ideal of R. If ∪C = R, then 1 ∈ Jα for some α ∈ Λ, so Jα = R, a contradiction.
Hence, ∪C is an upper bound of C in I . By, Zorn’s lemma, we have I has a maximal element
which turns out to be our desired maximal ideal containing I.
2. If M is a maximal ideal and I is an ideal of R such that I * M , then M + I = R.
Proof. Let x ∈ I, ∈
/ M . Consider the ideal J = M + Rx which is larger than M . Since M is maximal,
J = R. Thus, R = M + Rx ⊆ M + I.
2.3. Maximal Ideals and Prime Ideals 51
Proof. Clearly, R/M is a commutative ring with unity 1 + M . Assume that M is a maximal ideal.
Let a ∈
/ M . Then M + Ra = R, so ∃b ∈ R, 1 = m + ba. Thus, 1 + M = ba + M = (b + M )(a + M ),
and hence R/M is a field. Conversely, suppose that R/M is a field. Let M ⊆ J ⊆ R and J 6= M .
Then ∃a ∈ J r M . Since R/M is a field and a ∈
/ M , ∃b ∈ R, 1 + M = (a + M )(b + M ) = ab + M ,
so 1 − ab ∈ M ⊆ J. Since a ∈ J, ab ∈ J which implies 1 ∈ J. Hence, J = R.
AB ⊆ P ⇒ A ⊆ P or B ⊆ P.
P is prime ⇔ ∀a, b ∈ R, ab ∈ P ⇒ a ∈ P or b ∈ P
⇔ ∀a, b ∈ R, (a + P )(b + P ) = 0 + P ⇒ a + P = 0 + P or b + P = 0 + P
⇔ R/P is an integral domain
as desired.
Theorems 2.3.1 and 2.3.3 are the most useful for characterizing maximal ideals and prime
ideals in commutative rings.
Definition. The set of all prime ideals of a commutative ring R is denoted by Spec R, called the
spectrum of R.
Definition. A local ring is a commutative ring which has a unique maximal ideal.
Proof. Assume R is a local ring with the maximal ideal M . Let a ∈ R r M . If aR 6= R, then aR is
contained in some maximal ideal, so aR ⊆ M which yields a contradiction. Thus, aR = R, so a
is a unit. Hence, M is the set of nonunits of R. Conversely, suppose that the nonunits of R form
an ideal M of R. Clearly, M is maximal. Let M ′ be another maximal ideal of R. If ∃a ∈ M ′ r M ,
then a is a unit, so M ′ = R, a contradiction. Thus, M ′ ⊆ M . Since M ′ is maximal, M ′ = M .
Corollary 2.3.6. In a finite local ring R, every element is either a unit or a nilpotent element.
Moreover, |R| is a prime power.
Then Rp is a subring of Q and is local. Its unique maximal ideal is {pk/n : k, n ∈ Z and p ∤ n}.
Exercises 2.3. 1. Let R be a ring and I an ideal of R. Prove that the map J 7→ J/I gives a 1-1
correspondence
{ideals of R containing I} ←→ {ideals of R/I}.
Moreover, this correspondence carries maximal ideals to maximal ideals.
2. Prove Corollary 2.3.6 and Example 2.3.4.
3. Find all ideals, all prime ideals and all maximal ideals of
(a) Z12 (b) Z2 × Z4 (c) Q × Q (d) Q × Z (e) Z × Z4 × Z5 .
4. Let R be a commutative ring. If every ideal proper of R is prime, show that R is a field.
5. Show that in a Boolean ring R, every prime ideal P 6= R is maximal.
2.4 Factorizations
From elementary number theory, we know that every positive integer can be decomposed uniquely
into a product of prime numbers. It is a unique factorization property of the ring Z. In this section,
we shall learn about factorizations in any other integral domains.
Definition. Let R be a commutative ring and suppose that a, b ∈ R. We say that a divides b
and write a | b, if there is an r ∈ R such that ra = b.
Definition. Let R be an integral domain and suppose a, b ∈ R. We say that a and b are asso-
ciates if a | b and b | a.
Theorem 2.4.1. Let R be an integral domain, a, b ∈ R. The following statements are equivalent.
(i) a and b are associates. (ii) Ra = Rb. (iii) a = ub for some unit u ∈ R.
Proof. (i) ⇒ (iii) If a = 0, then b = 0 and (3) is clear. Suppose then that a 6= 0. Since a | b and
b | a, we can write a = ub and b = va. Thus, a = ub = uva, so (uv − 1)a = 0, so uv = 1. Hence,
a = ub and u is a unit of R.
(iii) ⇒ (ii) If a = ub where u is a unit, then Ra = Rub = (Ru)b = Rb.
(ii) ⇒ (i) If Ra = Rb, then a = rb, b = sa, so b | a and a | b. Hence, a and b are associates.
∀a, b ∈ R, p | ab ⇒ p | a or p | b.
Proof. (1) It follows directly from the definition and Theorem 2.3.2.
(2) Let p be a prime element. Assume that p = ab for some a, b ∈ R. Then Rab = Rp, so Ra ⊆ Rp
or Rb ⊆ Rp. Since Rp = Rab ⊆ (Ra ∩ Rb), Ra = Rp or Rb = Rp, so au = p or bv = p for some
units u and v in R. Hence, b = u or a = v is a unit in R. Finally, suppose that R is a PID and p is
irreducible. Then Rp is maximal, so it is a prime ideal. Hence, p is prime.
54 2. Rings and Fields
Examples 2.4.1. 1. The ring of rational integers Z is a UFD by the fundamental theorem of
arithmetic. Since U (Z) = {±1}, the atoms of Z are ±p where p is a prime. Note that p and
−p are associates (e.g., 12 = 2 · 2 · 3 = (−2)(−3) · 2).
2. Let F be a field. Every element of F except 0 is a unit. Hence, every nonzero nonunit of F
is uniquely a product of atoms (vacuously!). That is, F has no nonzero nonunits.
Proof. Suppose R is a UFD. Then (1) holds, by the definition of a UFD. It remains to show that if
x is irreducible, then x is prime. Suppose x | bc, and let ax = bc. Write a, b and c as products of
atoms, so that
a . . . a x = b1 . . . bl c1 . . . cm .
| 1 {z k} | {z } | {z }
a b c
Since these are two factorizations of ax = bc into products of atoms and x is an atom, x must be
an associate of some bi or some cj . Hence, x | b or x | c. Thus, x is prime.
Conversely, suppose (1) and (2) are given. Then to show R is a UFD, it suffices to show that if
a1 . . . ar = b1 . . . bs
where the ai and bi are atoms, then r = s and the bi may be arranged so the ai and bi are
associates for i = 1, . . . , r. The proof proceeds by induction on r.
When r = 1, a1 = b1 . . . bs . Since a1 is prime, a1 divides bi for some i. Assume that a1 | b1 ,
and let b1 = ua1 . Since b1 is an atom, u must be a unit, so a1 and b1 are associates. Furthermore,
a1 = b1 . . . bs = ua1 b2 . . . bs , so 1 = (ub2 ) . . . bs . That is, s = 1 and a1 = b1 . For the inductive step,
write a1 . . . ar = b1 . . . bs . Since a1 is prime, a1 divides bi for some i. As above, let b1 = ua1 where
u is a unit and a1 and b1 are associates. Then a1 . . . ar = b1 . . . bs = ua1 b2 . . . bs , so a2 . . . ar =
ub2 . . . bs . Now the inductive hypothesis applies since we have r − 1 factors on the left. It follows
that r = s and after reordering the bi , ai and bi are associates for i = 2, . . . , r. This completes the
induction.
To obtain more examples of a UFD and an integral domain which is not a UFD, we introduce:
√ √
Note that if x1 , x2 , y1 , y2 ∈ Z are such that x1 + y1 d = x2 + y2 d, then x1 = x2 and y1 = y2
because d is non-square.
2.4. Factorizations 55
√
Definition. Define a function N : Z[ d] → Z by
√
N (x + y d) = x2 − dy 2 for all x, y ∈ Z.
√
It is called the norm map on Z[ d].
√ √
Theorem 2.4.5.√ 1. If α ∈ Z[ d] and N (α) = 0, then α = 0 = 0 + 0 d.
Z[ d], N (αβ) = N (α)N (β) and (α | β ⇒ N (α) | N (β)).
2. ∀α, β ∈ √
3. ∀α ∈ Z[ √d], α is a unit ⇔ N (α) = ±1. √
4. If α ∈ Z[ d] and N (α) = p is a prime number, then α is irreducible in Z[ d].
Proof.
√ Let x, y ∈ Z be such that x2 − dy 2 = 0. Then x2 = dy 2 . If y 6= 0, then d = x2 /y 2 , so
d = |x/y| ∈ Q, which is a contradiction. Thus, we must have y√= 0 which also forces x = 0.
This proves (1). A direct calculation
√ gives (2). For (3), let α ∈ Z[ d]. Suppose that α is a unit.
Then αβ = 1 for some β ∈ Z[ d]. Thus, 1 = N (1) = N (αβ) = N (α)N (β), so N (α)√divides 1
in Z. This gives √ N (1) 2= ±1. Conversely,
√ assume √ that N (α) = ±1. Write α√= x + y d. Then
2
±1 = N (x + y d) = x − y d = (x + y d)(x − y d) which implies that x + y d is a unit. Finally,
(4) follows from (3).
√
Example 2.4.2. The unit group of the ring Z[i] is {1, −1, i, −i} where i denotes −1.
√
Remark. The equation x2 − dy 2 = 1 is called the Pell’s equation. Every unit in Z[ d] is a
solution of Pell’s equation, or else of x2 − dy 2 = −1, the negative Pell’s equation. If d < 0, then
x2 − dy 2 ≥ 0. In this case the negative Pell’s equation has no solutions. In fact, Pell’s equation only
has very few solutions in this case, namely two, unless d = −1 when there are four solutions. If
d > 0, there are infinitely many solutions to Pell’s equation. The negative Pell’s equation may or
may not have solutions.
√
Example √ 2.4.3. Consider
√ the ring Z[ −5].
1. 1 − √−5, 1 + −5, 2 and 3 are irreducible elements. √
2. 1 + −5 and 2 are not prime elements. Hence, Z[ −5] is not a UFD by Theorem 2.4.4.
√ √ √
Solution. (1) Assume that 1 − −5 = (a + b −5)(c + d −5) for some a, b, c, d ∈ Z. By taking
norms, we have
6 = (a2 + 5b2 )(c2 + 5d2 ),
√
which implies that a2 + 5b2 = 1, 2, 3 or 6. Observe that b = 0 implies a2 = 1, so a + b 5 is√ a unit.
2
If b 6= 0, then a + 5b 2 2 2 2 2
√ ≥ 5, so a + 5b = 6. This forces that c + 5d √= 1 and thus √ c + d −5 is
a unit. Hence, 1 − −5 is irreducible. Next, assume that 2 = (a + b −5)(c + d −5) for some
a, b, c, d ∈ Z. By taking norms, we have
which implies that a2 + 5b2 = 1, 2 or 4. If a2 + 5b2 = 2, then 2√is a square √ modulo 5 which is a
contradiction. Thus, a2 + 5b2 = √ 1 or c 2 + 5d2 = 1. Hence, a + b −5 or c + d −5 is a unit and so
Next, we talk about common factors, gcd and lcm of elements in an integral domain.
56 2. Rings and Fields
Remark. +3 and −3 are greatest common divisors of 12 and 15. 60 and −60 are least common
multiples of 12 and 15. Thus, the gcd or lcm of two elements is not unique, (however we adopt
the above notation anyway, e.g., gcd(12, 15) = 3 and gcd(12, 15) = −3 are both correct!). By their
definitions, they are unique up to associates as recorded in the next theorem.
Let R be an integral domain and let Q(R) be the set of atoms of R. Define an equivalence
relation on Q(R) by a ∼ b if a and b are associates. Then a set of representative atoms for R is a
set P = P(R) which contains exactly one atom from each equivalence class.
Example 2.4.4. Q(Z) = {±p | p is a prime} is the set of all atoms in Z.
P(Z) = {p | p is a positive prime} is a set of representative atoms.
P(Z) = {+2, −3, +5, −7, . . .} is another set of representative atoms.
Theorem 2.4.7. Let R be an integral domain and let P be a set of representative atoms for R.
Then the following statements are equivalent.
(i) R is a UFD.
(ii) Every nonzero element of R can be expressed uniquely (up to order of factors) as a =
ubi11 · · · bikk , where u is a unit of R, k ≥ 0, i1 , . . . , ik > 0 and b1 , . . . , bk are distinct elements
of P.
Proof. Let P be a set of representative atoms of R, and let b1 , . . . , bk ∈ P be all the atoms of P
which occur in either a or b when they are factored as in Theorem 2.4.7. Write
a = ubi11 · · · bikk and b = vbj11 · · · bjkk
where u and v are units and is , js ≥ 0. Then we derive:
min(i ,j ) min(i ,j )
(a) r = b1 1 1 · · · br r r is a gcd for a and b.
max(i1 ,j1 ) max(ir ,jr )
(b) s = b1 · · · br is a lcm for a and b.
(c) r is the only gcd of a and b which is a product of elements of P, and s is the only lcm of a and
b which is a product of elements of P.
(d) Since i + j = min(i, j) + max(i, j) for any integers i and j,
ab = uvb1i1 +j1 · · · bikk +jk = uvrs.
Hence, ab and rs are associates.
2.4. Factorizations 57
Remark. Suppose R is an integral domain and Ra + Rb = Rc. Then c = gcd(a, b). The converse
does not hold. E.g., Q[s, t], where s and t are indeterminates. Then gcd(s, t) = 1 and Q[s, t] 6=
Qs + Qt.
Lemma 2.4.9. [Ascending Chain Condition (ACC) for a PIR] Let R be a principal ideal ring. If
I1 ⊆ I2 ⊆ . . . is a chain of ideals in R, then ∃m ∈ N, In = Im for all n ≥ m.
S
Proof.
S∞ Let I = ∞ n=1 In . Then I is an ideal of R. Since R is a PIR, ∃a ∈ R, (a) = I. Then
a ∈ n=1 In , so ∃m ∈ N, a ∈ Im . Thus, I = (a) ⊆ Im ⊆ I which implies that Im = I. Hence,
∀n ≥ m, In = Im .
Lemma 2.4.10. If R is a PID and a is nonzero nonunit, then ∃ an atom p ∈ R such that p | a.
Proof. Let R be a PID. By Theorems 2.4.3 and 2.4.4, it suffices to show that every nonzero nonunit
of R is a product of atoms. Let a ∈ R be nonzero nonunit. By Lemma 2.4.10, there exists an atom
p1 dividing a. Write a = p1 b1 for some b1 ∈ R. If b1 is a unit, then a is an atom. If b1 is nonunit,
then there exists an atom p2 dividing b1 , so we write a = p1 b1 = p1 p2 b2 . Continuing, we get a
strictly ascending chain of ideals
Since R is a PID, this chain must terminate, by the ACC in Lemma 2.4.9, with some br = pr ur
where ur is a unit and pr is an atom. Hence, a = p1 p2 . . . pr ur , and so R is a UFD as desired.
Finally, we study a generalization of the division algorithm which leads to an algorithm to find
a gcd of two integers, called the Euclidean algorithm.
d : D r {0} → N ∪ {0},
a = b[(u + ε) + (v + η)i] = bq + r
Proof. Let I be an ideal in a Euclidean domain D. If I = {0}, we have I = (0). Otherwise, let
b 6= 0 be an element of I for which d(b) is minimal for the nonzero elements of I. Let a be any
element of I. Then a = bq + r for some q, r ∈ D with r = 0 or d(r) < d(b). Since r = a − bq ∈ I
and d(r) < d(b), we must have r = 0 by the choice of b in I. Hence, a = bq, so I = (b).
√
Example 2.4.6. Let θ = 21 (1 + −19) and Z[θ] = {a + bθ : a, b ∈ Z}. Assume that u = a + bθ is a
unit in Z[θ]. Then (a + bθ)(c + dθ) = 1 for some c, d ∈ Z. The squares of absolute value on both
sides give
((2a + b)2 + 19b2 )((2c + d)2 + 19d2 ) = 16
which implies b = d = 0 and so ac = 1. Hence, the unit group of Z[θ] = {±1}. By a similar
technique, we can show that 2 and 3 are irreducible in Z[θ]. Now, suppose that d is a valuation
map on Z[θ]. Choose m ∈ Z[θ] which is nonzero nonunit such that d(m) is minimal. First, we
divide 2 by m and get q, r ∈ Z[θ] and
√
Remark. In conclusion, recall that Z is an integral domain which
√ is not a field and Z[ −5] is not
a UFD. Besides, Z[θ] = {a + bθ : a, b ∈ Z}, where θ = (1 + −19)/2, is a PID which is not a
Euclidean domain as shown above. Finally, Z[x] (in the next section) is a UFD which is not a PID.
Exercises 2.4. 1. If p and q are prime elements in an integral domain R such that p | q, prove that p
and q are associates.
2. Let R be a UFD and c a nonzero element in R. Prove that R/Rc contains a nonzero nilpotent element
if and only if there is a prime element p ∈ R with p2 | c.
3. Let R be a UFD. If a ∈ R is a nonzero nonunit element, prove that Ra is the product of a finite
number of prime ideals.
4. If R is a PID and gcd(a, b) = 1, show that Ra + Rb = R, so 1 = ax + by for some x, y ∈ R.
5. Let R be a PID and suppose that a, b and c are nonzero elements of R such that Ra + Rb = Rc. Show
that there exist√u, v ∈ R such that ua + vb = c and Ru + Rv = R. √ √
6. Prove that 4 + 10 is irreducible but not prime in the ring {a + b 10 : a, b ∈ Z}. Deduce that Z[ 10]
is not a UFD. √
7. Show that the ring Z[ 2] has infinitely many units. (Hint. If u is a unit, so is un for all n ∈ Z.)
8. (a) Let D be a Euclidean domain. Prove that u is a unit in D if and only if d(u) = d(1).
(b) Show that ±1 and ±i are units in Z[i] and prove that if a+bi is not a unit in Z[i], then a2 +b2 > 1.
9. Let R be a Euclidean ring and a, b ∈ R, b 6= 0. Prove that there exist q0 , q1 . . . , qn and r1 , . . . , rn in R
such that
a = q0 b + r1 , d(r1 ) < d(b),
b = q1 r1 + r2 , d(r2 ) < d(r1 ),
r1 = q2 r2 + r3 , d(r3 ) < d(r2 ),
··· ··· ···
rn−2 = qn−1 rn−1 + rn , d(rn ) < d(rn−1 ),
rn−1 = qn rn
and if a and b satisfy the above conditions, then rn is a gcd of a and b. This algorithm is called the
Euclidean algorithm. Find a gcd of 8 + 6i and 5 − 15i in Z[i] by using the Euclidean algorithm.
10. Let D be a UFD with field of fractions F and suppose α ∈ F . Show that it is possible to write α = a/b
with a, b ∈ D and gcd(a, b) = 1.
11. Let R be a PID with field of fractions F , and let S be a ring with R ⊆ S ⊆ F .
(a) If α ∈ S, show that α = a/b with a, b ∈ R and 1/b ∈ S. (b) Prove that S is a PID.
12. Let R = {m/2n : m, n ∈ Z and n ≥ 0}.
(a) Prove that R is a subring of Q and determine all units of R.
(b) Show that 3 is an irreducible element in R.
(c) Prove that R is a PID.
Definition. Let R be a ring with identity 1 and let x be a symbol called an indeterminate, not
representing any element in R. Let R[x] denote the set of all symbols a0 + a1 x + · · · + an xn
where n ∈ N ∪ {0}, ai ∈ R, x0 = 1, x1 = x. For i ∈ N, let xi denote 1 · xi . In the symbol
a0 + a1 x + · · · + an xn , we may drop ai xi if ai = 0. Each element a0 + a1 x + · · · + an xn is called a
polynomial and ai is called the coefficient of xi for i ∈ {1, . . . , n} and a0 is called the constant
term.
60 2. Rings and Fields
f (x) 7→ f (c) := a0 + a1 c + · · · + an cn
gives a homomorphism from R[x] to R, called the evaluation at c. In addition, if f (c) = 0, then
c is called a root of f (x).
Theorem 2.5.2. [Division Algorithm] Let R be a ring, f (x), g(x) ∈ R[x] and g(x) 6= 0. Assume
that the leading coefficient of g(x) is a unit in R. Then ∃ unique q(x), r(x) ∈ R[x] such that
f (x) = q(x)g(x) + r(x) where r(x) = 0 or deg r(x) < deg g(x).
Proof. If there exists an h(x) ∈ R[x] such that f (x) = h(x)g(x), let q(x) = h(x) and r(x) = 0.
Assume that f (x) 6= h(x)g(x) for all h(x) ∈ R[x]. Let
Then S 6= ∅. By the Well-Ordering Principle, there exists a polynomial q(x) in R[x] such that
deg(f (x) − q(x)g(x)) has the least degree and we may write r(x) for f (x) − q(x)g(x). Then
r(x) 6= 0. Assume that deg r(x) ≥ deg g(x). Write r(x) = a0 + a1 x + · · · + an xn , an 6= 0, and
2.5. Polynomial Rings 61
Since deg(r2 (x) − r(x)) < deg g(x), this relation can hold only if q(x) − q2 (x) is zero, i.e., q(x) =
q2 (x), and hence finally r(x) = r2 (x).
Corollary 2.5.3. If F is a field, then F [x] is a Euclidean domain with valuation d(p(x)) = deg p(x)
for all p(x) ∈ F [x] r {0}. Moreover, F [x] is a PID and a UFD.
Theorem 2.5.4. [Remainder Theorem] Let R be a ring and f (x) ∈ R[x]. Then for all c ∈ R, the
remainder when x − c divides f (x) is f (c).
Proof. Let c ∈ R. By Theorem 2.5.2, there exist unique q(x) ∈ R[x] and r ∈ R such that f (x) =
q(x)(x − c) + r. Then f (c) = q(c)(c − c) + r = r.
Proof. (1) and (2) are clear. For (3), assume that c is a root of f (x). Then f (x) = q(x)(x − c)
for some q(x) ∈ R[x]. Since deg f (x) is 2 or 3, deg q(x) is 1 or 2, so f (x) is reducible. Conversely,
suppose that f (x) = g(x)h(x), where g(x), h(x) ∈ R[x] of degree ≥ 1. Since deg f (x) = 2 or 3,
deg g(x) = 1 or deg h(x) = 1. Hence, f (x) has a root in R.
Proof. We shall prove (1) by induction on k = deg f (x). It is clear when f (x) is linear. Assume
that k > 1 and any polynomials of degree k have at most k roots in F . Suppose that f (x)
is of degree k + 1. The statement is true when f (x) has no root in F . Otherwise, let α be a
root of f (x) in F . Then f (x) = (x − α)q(x) for some polynomial q(x) ∈ F [x] of degree k. By the
inductive hypothesis, q(x) has at most k roots. Hence, f (x) has at most k +1 roots. The remaining
statements follow from the first one.
Remarks. 1. f (x) = x2 − 1 has four roots in Z12 , namely 1, −1, 5, −5.
2. Corollary 2.5.6 says that two polynomials over an infinite field F which defined the same
function on F are identical. This is NOT true if F is finite. Let F = Zp , f (x) = x and
g(x) = xp . Then f (α) = g(α) for all α ∈ Zp but f (x) 6= g(x).
Let F be a field and f (x) a polynomial over F of degree n ∈ N. Then
F [x]/(f (x)) = {g(x) + (f (x)) : g(x) ∈ F [x]}
= {g(x) + (f (x)) : g(x) ∈ F [x] and g(x) = 0 or deg g(x) < n}
= {a0 + a1 x + · · · + an−1 xn−1 + (f (x)) : ai ∈ F }
by the division algorithm. Thus, if F is finite, then F [x]/(f (x)) is a commutative ring of |F |n
elements. In addition, if f (x) is irreducible, then (f (x)) is a maximal ideal, so F [x]/(f (x)) is a
field. Note that F is isomorphic to {c + (f (x)) : c ∈ F }, so we may embed F into F [x]/(f (x)) by
using the inclusion map.
Examples 2.5.2. 1. R[x]/(x2 + 1) is a field isomorphic to C (with the map f (x) 7→ f (i)).
2
2. Z11 [x]/(x + 3) is a field of 121 elements.
Theorem 2.5.7. Let F be a field and F [x] the polynomial ring over F . Then linear polynomials
are the only atoms in F [x] if and only if each polynomial f (x) ∈ F [x] of positive degree has a root
in F .
Proof. Suppose that linear polynomials are the only atoms in F [x]. Let f (x) be a polynomial of
positive degree over F . Since F [x] is a UFD, f (x) = α1 (x) · · · αk (x), a product of atoms. Each
αi (x) is linear, so αi (x) = bi (x − ci ) (bi , ci ∈ F with bi 6= 0). Then (x − ci ) | f (x), so c1 , . . . , ck are
roots of f (x) in F . Conversely, assume that every f (x) ∈ F [x] of positive degree has a root in F .
Let α(x) be an atom in F [x]. We claim that α(x) is linear. For, let b ∈ F be a root of α(x). Then
(x − b) | α(x) so α(x) = (x − b)β(x) for some β(x) ∈ F [x]. Since α(x) is an atom, β(x) must be a
unit. That is, β(x) is a constant lying in F r {0}. Thus, α(x) is a linear polynomial.
Theorem 2.5.8. Let R be an integral domain and f (x) ∈ R[x] a nonzero polynomial. If α1 , . . . , αk
are distinct roots of f (x), then (x − α1 ) . . . (x − αk ) divides f (x).
Proof. We shall prove this result by induction of k. Corollary 2.5.5 (1) gives the basis step.
Assume k > 1. By the inductive hypothesis (x − α1 ) . . . (x − αk−1 ) divides f (x), so let f (x) =
(x − α1 ) . . . (x − αk−1 )g(x) for some g(x) ∈ R[x]. Then
0 = f (αk ) = (αk − α1 ) . . . (αk − αk−1 )g(αk ).
Thus, g(αk ) = 0 since R is an integral domain, so (x−αk ) | g(x). It follows that (x−α1 ) . . . (x−αk )
divides f (x).
2.5. Polynomial Rings 63
Theorem 2.5.9. [Gauss’ lemma] Let R be a UFD and f (x), g(x) ∈ R[x]. If f (x) and g(x) are
primitive, so is f (x)g(x).
Proof. Let
f (x) = a0 + a1 x + · · · + am xm
g(x) = b0 + b1 x + · · · + bn xn
f (x)g(x) = c0 + c1 x + · · · + cm+n xm+n .
We shall suppose that f (x)g(x) is not primitive and obtain a contradiction. Let a ∈ R be an atom
of R which divides all of c0 , . . . , cm+n . Since R is a UFD, every atom is a prime, so Ra is a prime
ideal. Then (R/Ra)[x] is an integral domain. Since R[x]/R[x]a ∼ = (R/Ra)[x], R[x]a is a prime
ideal. Let
−
: R[x] → R[x]/R[x]a
be the canonical map. Since a divides c0 , . . . , cm+n , f¯(x)ḡ(x) = 0. But a does not divide all of
a0 , . . . , am or all of b0 , . . . , bn , since f and g are primitive. Thus, f¯(x) 6= 0, ḡ(x) 6= 0. This is a
contradiction since f¯(x)ḡ(x) = 0 and f¯(x), ḡ(x) lie in R[x]/P which is an integral domain. Hence,
f (x)g(x) is primitive, as claimed.
Theorem 2.5.10. Let R be a UFD and f (x), g(x) nonzero polynomials of R[x]. Then:
1. f (x) is primitive ⇔ the content of f (x) is 1.
2. If a is the content of f , then f (x) = af1 (x) where f1 (x) is primitive.
3. If f (x) = af1 (x) and f1 (x) is primitive, then a is the content of f (x).
4. If a and b are the contents of f (x) and g(x), respectively, then ab is the content of f (x)g(x).
Proof. (1), (2) and (3) are immediate from the definition of gcd. For the last statement, by (2),
we write f (x) = af1 (x) and g(x) = bg1 (x) where f1 (x) and g1 (x) are primitive. By Gauss’ lemma,
f1 (x)g1 (x) is primitive, and
Theorem 2.5.11. Let R be a UFD and let F = Q(R) = {r/s : r, s ∈ R, s 6= 0} be its field
of quotients. Suppose f (x) is an irreducible polynomial in R[x]. Then f (x), considered as a
polynomial in F [x], is irreducible in F [x]. In particular, if f (x) ∈ Z[x] is irreducible over Z, it is
irreducible over Q.
Proof. Suppose f (x) = g(x)h(x) where g(x) and h(x) are polynomials of positive degree in F [x].
Let g(x) = a0 /b0 + (a1 /b1 )x + · · · + (am /bm )xm and h(x) = c0 /d0 + (c1 /d1 )x + · · · + (cn /dn )xn .
Let b be a least common multiple of the bi and d a least common multiple of the dj so that
Since g2 (x) and h2 (x) are primitive, so is g2 (x)h2 (x) and hence the equation above implies that
bd | uv in R. Canceling, we obtain
uv
f (x) = wg2 (x)h2 (x) where w = ∈ R.
bd
Therefore, f (x) is reducible in R[x], which proves the theorem.
where a0 /b0 , a1 /b1 , . . . , an /bn are in “lowest terms”. That is, ai and bi have no common factor. Let
b = lcm(b0 , . . . , bn ). Then
is in R[x]. Let a be the content of bh(x). It happens that a = gcd(a0 , . . . , an ), although knowing
this is not essential. The main point is that
h1 (x) = (b/a)h(x)
is a primitive polynomial in R[x]. Moreover, the proof of Theorem 2.5.11 shows that if f (x) ∈
R[x], then h(x) | f (x) in F [x] ⇔ h1 (x) | f (x) in R[x]. In particular, suppose f (x) ∈ R[x], and
r/s ∈ F is a root of f (x) where r and s are relatively prime. Then h(x) = x − (r/s) divides f (x)
in F [x], so h1 (x) = sx − r divides f (x) in R[x]. Thus, we have:
Theorem 2.5.12. Let R be a UFD and F its field of quotients. Suppose f (x) ∈ R[x] where
f (x) = a0 + a1 x + · · · + an xn and r/s ∈ F is a root of f (x) where r and s are relatively prime.
Then s | an and r | a0 if r 6= 0.
Proof. The remarks above show that (sx − r) | (a0 + a1 x + · · · + an xn ) in R[x]. It is easy to see
that this implies our results.
Proof. We shall suppose that f (x) is reducible in Q[x] and obtain a contradiction. By dividing f (x)
by its content, we may assume that f (x) is primitive, this does not affect either the hypothesis
or the reducibility of f (x) in Q[x]. By Theorem 2.5.11, f (x) is reducible in Z[x], so let f (x) =
g(x)h(x) where g(x) = b0 + b1 x + · · · + bm xm and h(x) = c0 + c1 x + · · · + cn−m xn−m are in Z[x].
Note that since f (x) is primitive, neither g(x) nor h(x) is constant. That is, m ≥ 1 and n − m ≥ 1.
Let − : Z[x] → Zp [x] be the canonical projection. Then f¯(x) = ān xn where ān 6= 0̄ since p ∤ an ,
so ḡ(x)h̄(x) = f¯(x) = ān xn . Since Zp [x] is a UFD, this forms ḡ(x) = b̄m xm , h̄(x) = c̄n−m xn−m , so
that b̄0 = c̄0 = 0̄ (i.e., p divides b0 and c0 ). But then p2 | a0 since a0 = b0 c0 , which contradicts part
(3) of the hypotheses. Hence, f (x) is irreducible in Q[x] as claimed.
Example 2.5.4. f (x) = 2x5 − 6x3 + 9x2 − 15 is irreducible in Q[x] and in Z[x].
satisfies the Eisenstein criterion for the prime p and is thus irreducible in Q[x]. But clearly if
Φp (x) = h(x)r(x) were a nontrivial factorization of Φp (x) in Z[x], then
would give a nontrivial factorization of g(x) in Z[x]. Thus, Φp (x) must also be irreducible in
Q[x].
We next wish to prove a famous theorem of Gauss: If R is a UFD, so is R[x]. Recall the criteria
given in Theorem 2.4.4:
Suppose R is a UFD. We first observe that R[x] is an integral domain, so this presents no
problem. We shall establish the criteria above for R[x] (and these show that R[x] is a UFD) by
doing three things:
(a) We determine all atoms of R[x] (Theorem 2.5.15).
(b) We show that they are primes (Theorem 2.5.16).
(c) We show that every nonzero nonunit of R[x] is a product of atoms and conclude that R[x]
is a UFD (Theorem 2.5.17).
Theorem 2.5.15. Let R be a UFD, F its field of quotients and f (x) ∈ F [x]. Then f (x) is an atom
of R[x] ⇔ either
(1) f (x) ∈ R and f (x) is an atom of R or
(2) f (x) is a primitive polynomial of degree n ≥ 1 and f (x) is irreducible in F [x].
66 2. Rings and Fields
Proof. Assume that f (x) is an atom of R[x]. If deg f (x) = 0, then f (x) ∈ R, and clearly f (x) must
be an atoms of R. Otherwise, suppose that deg f (x) = n ≥ 1, and let a be the content of f (x).
Then f (x) = af1 (x) where f1 (x) is primitive. Since f (x) is irreducible in R[x], a must be a unit
in R, so f (x) is primitive. Again, since f (x) is irreducible in R[x], it is also irreducible in F [x] by
Theorem 2.5.11.
Conversely, assume that (1) and (2) hold. If f (x) is an atom of R, it is clearly an atom of R[x]
(Theorem 2.5.1). Suppose f (x) is a primitive polynomial of degree n ≥ 1 and f (x) is irreducible
in F [x]. We claim that f (x) is an atom of R[x]. For, suppose not, and let
f (x) = g(x)h(x),
Theorem 2.5.16. Let R be a UFD and f (x) an atom of R[x]. Then R[x]f (x) is a prime ideal of
R[x]. That is, f (x) is a prime element.
Proof. We consider separately the two types of atoms in R[x] given in Theorem 2.5.15.
Case 1. a ∈ R is an atom of R. Since R is a UFD, every atom is a prime, so Ra is a prime ideal.
Then (R/Ra)[x] is an integral domain. Since R[x]/R[x]a ∼ = (R/Ra)[x], R[x]a is a prime ideal, so
a is prime.
Case 2. f (x) is a primitive polynomial of degree n ≥ 1 and f (x) is irreducible in F [x] where F is
the quotient field of R. First we claim that F [x]f (x) ∩ R[x] = R[x]f (x). Clearly, f (x) ∈ F [x]f (x) ∩
R[x]. Conversely, we suppose g(x)f (x) ∈ R[x] with g(x) = a0 /b0 + (a1 /b1 )x + · · · + (an /bn )xn ∈
F [x]. We can find relatively prime a, b ∈ R such that (b/a)g(x) = g1 (x) where g1 (x) is a primitive
polynomial in R[x]. (In fact, a = gcd(a0 , a1 , . . . , an ) and b = lcm(b0 , b1 , . . . , bn ) will do, provided
each ai and bi are relatively prime.) Thus, (b/a)g(x)f (x) = g1 (x)f (x) ∈ R[x]. By Gauss’ lemma,
g1 (x)f (x) is a primitive polynomial. In connection with the above equation, this forces b to be a
unit of R, so g(x) = (a/b)g1 (x) ∈ R[x]. Hence, g(x)f (x) ∈ R[x]f (x) which proves our claim.
By the second isomorphism theorem, we have
Since (R[x] + F [x]f (x))/F [x]f (x) ⊆ F [x]/F [x]f (x) which is a field because f (x) is irreducible in
F [x], R[x]+F [x]f (x))/F [x]f (x) is an integral domain. Thus, R[x]/R[x]f (x) is an integral domain,
so R[x]f (x) is a prime ideal. Therefore, f (x) is prime and this proves the theorem.
Proof. We know all the atoms of R[x] by Theorem 2.5.15 and Theorem 2.5.16 tells us that each
atom of R[x] is prime. Hence, (by Theorem 2.4.4) to verify that R[x] is a UFD, it remains to show
that each nonzero nonunit f (x) ∈ R[x] is a product of atoms.
Case 1. deg f (x) = 0, i.e., f (x) ∈ R. Since R is a UFD and every atom of R is an atom of R[x], we
can express f (x) as a product of atoms in R, and so in R[x].
Case 2. deg f (x) = n ≥ 1. Let f (x) = f1 (x) . . . fk (x) where (a) each fi (x) has degree ≥ 1 and
(b) k is as large as possible. Such a factorization exists because any factorization which satisfies
(a) has at most n terms since n = deg f (x) = deg f1 (x) + · · · + deg fk (x) ≥ k. Now, let ai be the
content of fi (x), and let fi (x) = ai gi (x) where gi (x) is a primitive polynomial.
2.6. Field Extensions 67
We claim that gi (x) is an atom in R[x] because if gi (x) = r(x)s(x) where r(x) and s(x) are
nonunits, then r(x) and s(x) cannot lie in R, since gi (x) is primitive. In addition, r(x) and
s(x) cannot both have positive degree because then we could write f (x) as a product of k + 1
polynomials of positive degree, which violates (b). Thus, each gi (x) is an atom as desired. Hence,
By Case 1, a can be written as a product of atoms in R[x] and therefore shows that f (x) is a
product of atoms in R[x], which proves R[x] is a UFD.
p(x) = c(x − r1 ) . . . (x − rk )(x2 − (2a1 )x + (a21 + b21 )) . . . (x2 − (2am )x + (a2m + b2m )).
In addition, if p(x) ∈ R[x] is irreducible over R, then deg p(x) = 1 or 2, namely, p(x) = bx + c or
p(x) = ax2 + bx + c with b2 − 4ac < 0.
9. Let R be a UFD and F its field of quotients. Let f (x) = a0 + a1 x + · · · + an xn lies in R[x], and suppose
that there is an irreducible element p ∈ R such that
(i) p ∤ an , (ii) p | a0 , . . . , an−1 , and (iii) p2 ∤ a0 .
Prove that f (x) is irreducible in F [x]. Moreover, if f (x) is primitive, then f (x) is irreducible in R[x].
10. Let P (x) = an xn + an−1 xn−1 + · · · + a1 x + a0 be a polynomial with integer coefficients. If there exist
a prime number p and an integer k ∈ {0, 1, . . . , n − 1} such that p | a0 , a1 , . . . , ak , p ∤ ak+1 and p2 ∤ a0 ,
then P (x) has an irreducible factor in Z[x] of degree greater than k.
11. If R is an intergral domain for which every ideal of R[x] is principal, show that R must be a field.
12. Let D be an integral domain. If ϕ : D[x] → D[x] is an automorphism such that ϕ(a) = a for all
a ∈ D, prove that there exist c, d ∈ D with c a unit in D such that ϕ(x) = cx + d. Here x stands for
the indeterminate of D[x].
Definition. Let F be a field. The intersection of all subfields of F is the smallest subfield of F ,
called the prime field of F .
Theorem 2.6.1. Let F be a field with the prime subfield P and 1F denote the identity of F .
1. If char F = p, a prime, then P = {n · 1F : n = 0, 1, . . . , p − 1} ∼
= Z/pZ.
2. If char F = 0, then P = {(m · 1F )(n · 1F )−1 : m, n ∈ Z, n 6= 0} ∼ = Q.
Remark. By Theorem 2.6.1, any field can be considered as an extension field of the field Q or Zp
for some prime p.
[L : F ] = [L : K][K : F ].
Proof. With F ⊆ K ⊆ L, let {βj }j∈J be a basis of K over F and {αi }i∈I a basis of L over K.
Every element of L can be written uniquely as a finite linear combination of the elements of
{αi }i∈I with coefficients in K, and every such coefficient can be written uniquely as a finite linear
combination of the elements of {βj }j∈J with coefficients in F . Hence, every element of L can be
written uniquely as a finite linear combination of the elements of {αi βj }i∈I,j∈J with coefficients
in K: {αi βj }i∈I,j∈J is a basis of L over F , and [L : F ] = |I × J| = [L : K][K : F ].
Theorem 2.6.3. Let K be a field extension of a field F and let u ∈ K. Then EITHER
(a) [F (u) : F ] = ∞ and F [u] ∼ = F [t], so F (u) ∼
= F (t) where t is an indeterminate OR
(b) [F (u) : F ] is finite and F [u] = F (u).
defined by ϕ(t) = u (or ϕ(f (t)) = f (u)). Note that the kernel of ϕ is a prime ideal, since the
image of ϕ has no zero divisors. There are two possibilities.
(1) ker ϕ = {0}. Then we have (a).
(2) ker ϕ 6= {0}. Then ker ϕ = F [t]g(t) where g(t) is a monic prime (i.e., irreducible) polynomial.
Since F [t] is a PID, F [t]g(t) is a maximal ideal. Thus,
F [u] ∼
= F [t]/F [t]g(t)
Corollary 2.6.4. Let K be an extension field of a field F and u ∈ K. The following conditions
on u are equivalent:
(i) u is transcendental over F (if f (t) ∈ F [t] and f (u) = 0, then f = 0);
(ii) F (u) ∼
= F (t);
(iii) [F (u) : F ] is infinite.
70 2. Rings and Fields
Corollary 2.6.5. Let K be an extension field of a field F and u ∈ K. The following conditions
on u are equivalent:
(i) u is algebraic over F (there exists a polynomial 0 6= f (t) ∈ F [t] such that f (u) = 0);
(ii) there exists a monic irreducible polynomial g(t) ∈ F [t] such that g(u) = 0;
(iii) [F (u) : F ] is finite.
Moreover, in part (ii), we have g(t) is unique; f (u) = 0 if and only if g(t) | f (t); F (u) ∼=
F [t]/(g(t)); and [F (u) : F ] = deg g(t).
Definition. When u is algebraic over F , the unique monic irreducible polynomial g(t) ∈ F [t] in
part (ii) is the minimal polynomial of u. The degree of u over F is deg g(t).
For example, C is an algebraic extension of R, but R is not algebraic over Q. Note that if
[K : F ] is finite, then K is an algebraic extension because [F (u) : F ] ≤ [K : F ] < ∞ for all u ∈ K.
a0 is algebraic over F and ai is algebraic over F (a0 , . . . , ai−1 ) for all i ∈ {1, . . . , n}. Since f (x) ∈
E[x], u is algebraic over E, so [E(u) : E] is finite by Corollary 2.6.5. Thus,
Corollary 2.6.7. For a, b ∈ K, if a and b are algebraic over F of degree m and n, respectively, then
a ± b, ab and a/b (if b 6= 0) are all algebraic over F of degree ≤ mn. Hence, the set of all algebraic
elements of K over F is a subfield of K and is an algebraic extension over F .
2.6. Field Extensions 71
Theorem 2.6.8. If F is a field and G is a finite subgroup of the multiplicative group of nonzero
elements of F , then G is a cyclic group. In particular, the multiplicative group of all nonzero
elements of a finite field is cyclic.
Proof. If G = {1}, then G is cyclic. Assume that G 6= {1}. Since G is a finite abelian group,
G∼
= Z/(m1 ) ⊕ · · · ⊕ Z/(mk )
k
X
where k ≥ 1, m1 > 1 and m1 | · · · | mk . Since mk Z/(mi ) = 0, u is a root of the polynomial
i=1
xmk − 1 ∈ F [x] for all u ∈ G. By Corollary 2.5.6, this polynomial has at most mk distinct roots
in F , so |G| ≤ mk . Hence, we must have k = 1 and G ∼= Z/(m1 ) which is a cyclic group.
Remark. The finite multiplicative subgroup of a division ring may not be cyclic. E.g., Q8 =
{±1, ±i, ±j, ±k} is a subgroup of the ring of real quaternions H and Q8 is not cyclic.
Definition. Let R be an integral domain and f (x) ∈ R[x]. If α is a root of f (x), then there
exist m ∈ N and g(x) ∈ R[x] such that f (x) = (x − α)m g(x) and g(α) 6= 0. m is called the
multiplicity of the root α of f (x) and if m > 1, α is called a multiple root of f (x).
We record the immediate properties of the derivative of polynomials in the next lemma
Lemma 2.6.9. If f (x) and g(x) are polynomials over an integral domain R and c ∈ R, then
1. (cf (x))′ = cf ′ (x),
2. (f (x) + g(x))′ = f ′ (x) + g ′ (x),
3. (f (x)g(x))′ = f (x)g ′ (x) + f ′ (x)g(x),
4. ((f (x))n )′ = n(f (x))n−1 f ′ (x) where n ∈ N.
Theorem 2.6.11. If f (x) ∈ F [x] and deg f (x) = n 6= 1, then f (x) can have at most n roots
counting multiplicities in any extension field of F .
Proof. We shall prove the theorem by induction on the degree of f (x). If deg f (x) = 1, then
f (x) = ax + b for some a, b ∈ F and a 6= 0. Then −b/a is the unique root of f (x) and −b/a ∈ F ,
so we are done.
Let deg f (x) = n > 1 and assume that the result is true for all polynomials of degree < n. Let
E be an extension field of F . If f (x) has no roots in E, then we are done. Let r ∈ E be a root
of f (x) of multiplicity m ≥ 1. Then there exists q(x) ∈ E[x] such that f (x) = (x − r)m q(x) and
q(r) 6= 0. Then deg q(x) = n − m. By the inductive hypothesis q(x) has at most n − m roots in E
counting multiplicities. Hence, f (x) has at most m+(n−m) roots in E counting multiplicities.
Theorem 2.6.12. [Kronocker] If p(t) ∈ F [t] is irreducible in F [t], then there exists an extension
field E of F such that [E : F ] = deg p(t) and p(t) has a root in E.
Corollary 2.6.13. If p(t) ∈ F [t] is a nonconstant polynomial, then there exists a finite extension
field E of F containing a root of p(t) and [E : F ] ≤ deg p(t).
Proof. Since F [t] is a UFD, p(t) has an irreducible factor in F [t], say p1 (t). By Theorem 2.6.12,
there exists an extension field E of F such that E contains a root of p1 (t) and [E : F ] = deg p1 (t).
Hence, [E : F ] ≤ deg p(t) and E contains a root of p(t).
Exercises 2.6. 1. If u ∈ K is algebraic of odd degree over F , prove that F (u2 ) = F (u).
2. Let a, b ∈ K be algebraic over F of degree m and n, respectively. Prove that if m and n are relatively
prime, then [F (a, b) : F ] =
√ mn.√ √ √
3. Show that the degree of 2 + 3 over Q is 4 and the degree of 2 + 3 5 is 6.
4. Let p be a prime and let v ∈ C satisfy v 6= 1, v p = 1 (e.g., v = cos(2π/p) + i sin(2π/p)).
Show that [Q(v) : Q] = p − 1.
5. Let E = Q(u) where u3 − u2 + u + 2 = 0. Express (u2 + u + 1)(u2 − u) and (u − 1)−1 in the form
au2 + bu + c where a, b, c ∈ Q.
2.6. Field Extensions 73
6. Let E be an algebraic extension of a field F . Show that any subring of E/F is a subfield. Hence
prove that any subring of a finite dimensional extension field E/F is a subfield.
7. Let E = F (u), u transcendental and let K 6= F be a subfield of E/F . Show that u is algebraic
over K.
8. Let u and v be positive irrational numbers such that u is algebraic over Q and v is transcendental
over Q.
(a) Show that v is transcendental over Q[u].
(b) Classify whether the following elements are algebraic or transcendental over Q.
1 √ √
(i) (ii) u (iii) v
u+v
9. (a) Show that there are countably many irreducible polynomials in Q[x].
(b) Let A be the set of all real numbers that are algebraic over Q. Show that A is countable, so that
R r A is uncountable.
10. Let R be an integral domain and f (x) a nonconstant polynomial. Prove that:
(a) If char R = 0, then f ′ (x) 6= 0.
(b) If char R = p, a prime, then f ′ (x) = 0 ⇔ ∃a0 , a1 , . . . , an ∈ R, f (x) = a0 + a1 xp + · · · + an xnp .
11. Suppose that F is a finite field and f (x) ∈ F [x] a nonconstant. If f ′ (x) = 0, prove that f (x) is
reducible over F .
12. Let F be a finite field with q elements. Prove that if K is an extension field of F and b ∈ K is
m
algebraic over F , then bq = b for some m ∈ N.
13. A complex number α is called an algebraic integer if it is a root of a monic polynomial
More ring theory will be in terms of modules in Chapter 4. We shall classify extension fields
and talk about the fundamental theorem of Galois theory in Chapter 5.
74 2. Rings and Fields
3 | Advanced Group Theory
Deeper results of groups are presented in this chapter. Various kinds of series of a group are
studied in the first three sections. A solvable group gets its name from the Galois group of a
polynomial p(x) and solvability by radicals of the equation p(x) = 0. A nilpotent group can be
considered as a generalization of an abelian group. A linear group gives an example of an infinite
simple group. Finally, we discuss how to construct a group from a set of objects and presentations.
Examples 3.1.1. 1. {0} < 8Z < 4Z < Z and {0} < 9Z < Z are normal series of Z.
2. {(1)} < A3 < S3 is a normal series of S3 .
3. {(1)} < A4 < S4 , {(1)} < V4 < S4 and {(1)} < V4 < A4 < S4 are normal series of S4 . Here
V4 = {(1), (12)(34), (13)(24), (14)(23)}.
4. {(1)} < {(1), (12)(34)} < V4 < A4 < S4 is a subnormal series of S4 which is not a normal
series.
Example 3.1.2. The series {0} < 72Z < 9Z < 3Z < Z is a refinement of the series {0} < 9Z < Z.
Definition. Two subnormal [normal] series {Hi } and {Kj } of the same group G are isomorphic
if there is a one-to-one correspondence between the collections of factor groups {Hi+1 /Hi } and
{Kj+1 /Kj } such that corresponding factor groups are isomorphic.
Clearly, two isomorphic subnormal [normal] series must have the same number of groups.
Example 3.1.3. The two series of Z15 , {0} < h5i < Z15 and {0} < h3i < Z15 are isomorphic.
The following theorem is fundamental to the theory of series.
Theorem 3.1.1. [Schreier] Two subnormal [normal] series of a group G have isomorphic refine-
ments.
75
76 3. Advanced Group Theory
of {0} < 9Z < Z. In both cases the refinements have four factor groups isomorphic to Z4 , Z2 , Z9 ,
and 72Z or Z. The order in which the factor groups occurs is different to be sure.
Recall the following fact.
To prove Schreier’s theorem, we shall need the following lemma developed by Zassenhaus.
This lemma is also called the butterfly lemma since the diagram which accompanies the lemma
has a butterfly shape.
Lemma 3.1.3. [Zassenhaus] Let H and K be subgroups of a group G and let H ∗ and K ∗ be
normal subgroups of H and K respectively. Then
1. H ∗ (H ∩ K ∗ ) is a normal subgroup of H ∗ (H ∩ K).
2. K ∗ (H ∗ ∩ K) is a normal subgroup of K ∗ (H ∩ K).
3. H ∗ (H ∩ K)/H ∗ (H ∩ K ∗ ) ∼= K ∗ (H ∩ K)/K ∗ (H ∗ ∩ K) ∼
= (H ∩ K)/[(H ∗ ∩ K)(H ∩ K ∗ )].
H ◆◆ ♣ K✭
◆◆◆ ♣♣♣ ✭✭
✖✖✖ ◆◆◆ ♣ ♣
◆◆◆ ♣♣♣ ✭✭
✖✖✖ ◆ ♣♣♣ ✭✭
✖✖✖ H ∗ (H ∩ K) K ∗ (H ∩ K) ✭✭
❘❘❘ ❧❧ ✭✭
✖✖ ❘❘❘ ❧❧
❘❘❘
❘ ❧ ❧ ❧❧❧ ✭✭
✖ ❘❘❘ ❧
✖✖✖ ❧❧❧ ✭✭
H ∩ K ✭✭
✖✖✖ ✭✭
✖✖✖ ✭✭
✖ ✭✭
H ∗ (H ∩ K ∗ ) K ∗ (H ∗ ∩ K) ✭✭
✖✖✖ s ☛ ❘ ❘❘❘ ❧❧ ✸ ❑
✸✸ ❑❑❑ ✭✭
s s ☛ ❘❘ ❧ ❧
✖✖✖ s ss ☛☛ ❘❘❘
❘ ❧ ❧❧❧❧ ✸✸ ❑❑❑ ✭✭
ss ❘❘❘ ❧ ❑❑
✖✖✖ s ss ☛☛☛ ❧❧❧ ✸✸
✸
❑❑
❑
✭✭
s ∗ ∗ ❑❑
ss ☛☛ (H ∩ K)(H ∩ K ) ✸✸ ❑ ✭✭
✖✖✖ s ss ☛☛ ♠♠ ♠ ◗◗◗
◗ ✸✸ ❑❑
❑ ✭✭
ss ☛ ♠♠♠ ◗◗◗ ✸✸ ❑❑
✭
✖✖✖ ssss ☛☛☛
♠♠♠ ♠♠ ◗◗ ◗◗◗ ✸✸ ❑ ❑ ❑❑ ✭
s ♠ ◗ ✸✸
H∗ ■ ☛☛
☛ ♠♠♠ ◗◗◗
K∗
■■ ☛ ♠♠ ♠♠♠ ◗◗◗
◗ ◗
✸✸
✉ ✉
■■ ☛ ♠♠ ◗◗◗ ✸ ✉
■■ ☛☛ ♠♠♠♠♠ ◗◗◗ ✸✸ ✉✉
■■ ☛ ♠ ◗◗ ✸ ✉ ✉✉
☛ ♠ ✉
H∗ ∩ K H ∩ K∗
H ∗ (H ∩ K), H ∗ (H ∩ K ∗ ), K ∗ (H ∩ K) and K ∗ (H ∗ ∩ K)
are groups. It is easy to show that H ∗ ∩ K are H ∩ K ∗ are normal subgroups of H ∩ K. Apply
Theorem 3.1.2 to H ∗ ∩K and H∩K ∗ as normal subgroups of H∩K, we have L = (H ∗ ∩K)(H∩K ∗ )
is a normal subgroup of H ∩ K. Thus we have the lattice of subgroups shown above.
3.1. Jordan-Hölder Theorem 77
Definition. A group G is simple if G and {e} are the only normal subgroups of G.
For example, Zp , p prime, and An , n 6= 4, are simple. We also have an obvious fact.
Theorem 3.1.4. G is a simple abelian group if and only if G is cyclic of prime order.
Theorem 3.1.5. [Third Isomorphism Theorem] Suppose G is a group and N is a normal sub-
group of G. Then the map
H 7→ H/N
gives a 1-1 correspondence
Theorem 3.1.6. M is a maximal normal subgroup of a group G if and only if G/M is simple.
Definition. A subnormal series {Hi } of a group G is a composition series if all the factor
groups Hi+1 /Hi are simple. A normal series {Hi } of G is a principal or chief series if all the
factor groups Hi+1 /Hi are simple.
Observe that by Theorem 3.1.6 Hi+1 /Hi is simple if and only if Hi is a maximal normal
subgroup of Hi+1 . Thus for a composition series, each Hi must be a maximal normal subgroup of
Hi+1 . To form a composition series of a group G, we just look for a maximal normal subgroup Hn−1
of G, then for a maximal normal subgroup of Hn−1 , and so on. If this process terminates in finite
number of steps, we have a composition series. Hence, we have first shown:
Note that by Theorem 3.1.6 a composition series cannot have any further refinement. To form
a principal series, we have to hunt for a maximal normal subgroup Hn−1 of G, then for a maximal
normal subgroup of Hn−1 that is also normal in G, and so on. The main theorem is as follows.
Theorem 3.1.8. [Jordan-Hölder] Any two composition [principal] series of a group G are iso-
morphic.
Proof. Let {Hi } and {Ki } be two composition [principal] series of G. By Schreier’s theorem,
they have isomorphic refinements. But since all factor groups are already simple, Theorem 3.1.6
shows that neither series has any further refinement. Hence, {Hi } and {Ki } must already be
isomorphic.
Examples 3.1.5. 1. If G is simple, then {e} ⊳ G is the only normal series of G. It is a compo-
sition series for G and its associated factor is G = G/{e}.
2. If n 6= 4, then {(1)} < An < Sn is a composition series of Sn .
3. Z has many normal series. For example, let m1 , . . . , mn be positive integers. Then
is a normal series for Z whose associated factors are Zm1 , Zm2 , . . . , Zmn , Z. Note that since
any nontrivial subgroup of Z is isomorphic to Z, any normal series for Z must have one
associated factor isomorphic to Z. Hence, Z has no composition series.
4. Let p be prime and G = Zp × Zp . If (x, y) 6= (0, 0) in G, then h(x, y)i ∼ = Zp and {(0, 0)} <
h(x, y)i < G is a composition series for G. The composition factors are G/h(x, y)i ∼ = Zp and
∼ 2
h(x, y)i/(0, 0) = Zp , i.e., Zp with multiplicity 2. Note that G has (p − 1)/(p − 1) = p + 1
subgroups of order p, so G has p + 1 distinct composition series. But in all cases they have
the same composition factors: Zp with multiplicity 2.
5. Let p and q be prime and G = Zp × Zq = hai × hbi. Then the only proper subgroup of G are
hai = Zp and hbi = Zq . Thus G has two composition series
In both cases, the associated composition factors are Zp and Zq both with multiplicity one.
6. Consider Zp3 , Zp2 × Zp and Zp × Zp × Zp . In any composition series for these groups the
same composition factors, namely Zp with multiplicity 3, occur.
3.2. Solvable Groups 79
Exercises 3.1. 1. Suppose G has precisely two subgroups. Show that G has prime order.
2. A proper subgroup M of G is maximal if whenever M ⊆ H ⊆ G, we have H = M or H = G.
Suppose G is finite and has only one maximal subgroup. Show that |G| is a power of prime.
3. Let G = Z36 . Consider two normal series {0} < h12i < h3i < Z36 and {0} < h18i < Z36 . Find two
isomorphic chains and exhibit the isomorphic factor groups as described in the proof of Schreier’s
Theorem.
4. Find a composition series for the dihedral group D4 = {σ, ρ : σ 4 = ρ2 = e and ρσρ−1 = σ −1 } and
for the quaternion group Q = {±1, ±i, ±j, ±k}. Determine the composition factor in each case.
5. Prove that if G has a composition [resp. principal] series and if N is a proper normal subgroup of G,
then there exists a composition [resp. principal] series containing N . Hence, show that N and G/N
have composition [principal] series.
6. Show that if H0 = {e} < H1 < H2 < · · · < Hn = G is a subnormal [normal] series of G, and if
Hi+1 /Hi is of finite order si+1 , then G is of finite order s1 s2 . . . sn .
7. Show that an infinite abelian group can have no composition series.
Definition. Let G be a group. For g, h ∈ G, [g, h] = ghg −1 h−1 is called a commutator of G. The
derived subgroup of G, denoted by G′ , is the group generated by all commutators of elements
of G, i.e.,
G′ = hghg −1 h−1 : g, h ∈ Gi.
The n-th derived subgroup of G, denoted by G(n) is defined inductively by G(0) = G and
G(n) = (G(n−1) )′ for all n ≥ 1.
Proof. (1) Assume that N is normal and G/N is abelian. Let x, y ∈ G. Then xyN = yxN , so
xyx−1 y −1 ∈ N . Thus G′ ⊆ N . Conversely, suppose that G′ ⊆ N . Let x, y ∈ G and n ∈ N . Then
xnx−1 n−1 ∈ G′ ⊆ N which implies that xnx−1 ∈ N n = N . Hence, N ⊳ G. Since (xy)(yx)−1 =
xyx−1 y −1 ∈ G′ ⊆ N , xyN = yxN , so G/N is abelian.
(2) follows from 1 by taking N = G′ .
(3) Define θ̄(xG′ ) = θ(x) for all x ∈ G. Clearly, θ = θ̄ ◦ π and is a homomorphism. Since
θ(G′ ) = {e}, θ̄ is well-defined.
A group G is said to be solvable (of derived length ≤ n) if G(n) = {e} for some n.
80 3. Advanced Group Theory
Definition. A subgroup H of a group G which is invariant under all automorphisms, that is,
ϕ(H) ≤ H for all ϕ ∈ Aut G, is called a characteristic subgroup of G.
Using the inner automorphisms ϕa (x) = axa−1 for all a ∈ G, we deduce that every character-
istic subgroup is normal in G.
Lemma 3.2.2. Let ϕ : G → H be a surjective homomorphism. Then ϕ(G(i) ) = H (i) for every
i ≥ 0. Also, G(i) is a characteristic subgroup for all i, and is thus normal in G.
Proof. We have ϕ([x, y]) = [ϕ(x), ϕ(y)], and since ϕ is onto, we see that ϕ maps the set of commu-
tators in G onto those in H. It follows that ϕ(G′ ) = H ′ , and repeated application of this argument
yields that ϕ(G(i) ) = H (i) , as required. That the terms of the derived series of G are characteristic
follows from the first part of the lemma when we take H = G and ϕ ∈ Aut G.
Theorem 3.2.3. Let G be a group. Then G is solvable if and only if G has a subnormal series with
abelian factors.
Proof. If G is solvable, G = G(0) > G(1) > · · · > G(n) = {e} is a subnormal series with abelian
factors. Conversely, suppose G = G0 > G1 > · · · > Gm = {e} is a subnormal series for G with
abelian factors. Since Gi /Gi+1 is abelian, Gi+1 ≥ G′i . We claim Gi ≥ G(i) for i = 0, 1, . . . , m by
induction on i. For i = 0, G0 = G = G(0) . Assume Gi ≥ G(i) . Then Gi+1 ≥ G′i ≥ (G(i) )′ = G(i+1) ,
which completes the induction. Hence, {e} = Gm ≥ G(m) , so G(m) = 1 and G is solvable.
Remark. From Lemma 3.2.2, we know that G(i) ⊳ G for all i. Then the above derived series
is indeed a normal series with abelian factors for G. Also, if G is solvable, its derived length,
dl(G), is the smallest positive integer n such that G(n) = {e}.
Examples 3.2.1. 1. Abelian groups are solvable of derived length 1 because G′ = {e}. In
addition, the groups with derived length 1 are exactly the abelian groups. Therefore, a
group G is abelian if and only if G is solvable of derived length 1.
2. Let Dn be the dihedral group of order 2n, i.e.,
Here, σ is the 2π/n rotation and ρ is the reflection of the regular n-gon. For example,
(2)
D1 = Z2 , D2 = Z2 × Z2 and D3 = S3 . Then Dn′ = hσ 2 i, an abelian group. Thus, Dn = {e}.
For n = 1 or 2, Dn is abelian and hence has derived length one. For n ≥ 3, Dn is solvable of
derived length two.
Proof. Observe that Dn = {e, σ, σ 2 , . . . , σ n−1 , ρ, ρσ, ρσ 2 , . . . , ρσ n−1 }. For x, y ∈ Dn , we distinguish
four cases
σ k σ l σ −k σ −l = e
(ρσ k )(ρσ l )(σ −k ρ−1 )(σ −l ρ−1 ) = ρσ k σ k σ −l σ −l ρ−1 = σ −2k σ 2l
xyx−1 y −1 =
(ρσ k )σ l (σ −k ρ−1 )σ −l = σ −l σ −l = σ −2l
σ k (ρσ l )σ −k (σ −l ρ−1 ) = σ k σ k = σ 2k .
This implies that Dn′ ⊆ hσ 2 i. On the other hand, we have σ 2 = ρσ −1 ρ−1 σ. Thus, Dn′ = hσ 2 i.
3.2. Solvable Groups 81
3. The groups S1 = {(1)} and S2 = Z2 are abelian groups. The group S3 = D3 is solvable of
(3)
derived length two. Since S4′ = A4 , A′4 = V4 and V4′ = {(1)} = S4 , we can conclude that
(2)
the group S4 is solvable of derived length 3. For n ≥ 5, Sn′ = An and A′n = Sn = An since
An is simple and non-abelian. Therefore Sn ≥ An ≥ An ≥ . . . is the derived series of Sn
and Sn is not solvable.
Proof. It is easy to see that any group of order two in A4 are not normal. Since A4 has more than one
Sylow 3-subgroup, any subgroups of A4 of order three are not normal. Moreover, A4 has no subgroup
of order six (see Exercises 1.4). Hence, the normal subgroups of A4 are A4 , V4 and {(1)}. Note that
S4′ is a subgroup of A4 . Moreover, it is normal in A4 . Since S4 and S4 /V4 are not abelian, S4′ must
be A4 . Since A4 is not abelian and A4 /V4 is abelian, we have A′4 = V4 . Hence, S4 ⊲ A4 ⊲ V4 ⊲ {(1)} is
the derived series of S4 . Next, let n ≥ 5 and K = Sn′ ⊳ Sn . Then K ∩ An ⊳ Sn , so K ∩ An ⊳ An . Since
An is simple, K ∩ An = {(1)} or K ∩ An = An . But K ⊆ An and K 6= {(1)} (since Sn is non-abelian),
we get K = An . Hence, Sn′ = An .
Proof. (1) Since H (i) ≤ G(i) for all i, H (n) = {e} if G(n) = {e}.
(2) The application of Lemma 3.2.2 to the canonical homomorphism π : G → G/N yields that
(G/N )(i) = π(G(i) ) for all i, and hence if G(n) = {e}, we have (G/N )(n) = {N }.
(3) follows from (2).
(4) Let dl(N ) = n and dl(G/N ) = m. Since the canonical homomorphism ϕ : G → G/N maps
G(m) to (G/N )(m) = {N }, we see that G(m) ⊆ N . Thus G(m+n) = (G(m) )(n) ⊆ N (n) = {e}, and
hence G is solvable.
(5) follows from (4).
Some additional conditions under which finite groups are solvable are the following.
Burnside’s and Philip Hall’s Theorems were proved by using Character Theory. The proof of
the Feit-Thompson Theorem takes 255 pages of hard mathematics (Solvability of groups of odd
order, Pacific Journal of Mathematics, 13 (1963), pp. 775–1029). Thompson’s Theorem was first
proved as a consequence of 475-page paper (that in turn relies ultimately on the Feit-Thompson
Theorem).
Exercises 3.2. 1. (a) Give an example of a normal subgroup of G which is not characteristic.
(b) Prove that Z(G) is a characteristic subgroup of G.
(c) If H is a characteristic subgroup of N and N ⊳ G, show that H ⊳ G.
2. Show that if G is a solvable simple group, then G is abelian.
82 3. Advanced Group Theory
3. Let {e} = H0 < H1 < H2 < · · · < Hn−1 < Hn = G be a composition for G. Prove that G is solvable
if and only if the composition factors Hi+1 /Hi all have prime order. Deduce that if G is solvable with
a composition series, then G is finite.
4. Find a composition series of S3 × S3 . Is S3 × S3 solvable?
5. Show that a group of order 1995 is solvable.
6. Let p < q < r be primes and let G1 be a group of order pq and let G2 be a group of order pqr. Prove
that both of them are solvable. [Hint: G1 has a unique subgroup of order q.]
7. Let G be a group of order 495 = 32 · 5 · 11.
(a) Prove that a Sylow 5-subgroup or a Sylow 11-subgroup of G is normal in G.
(b) Let P be a Sylow 5-subgroup and Q a Sylow 11-subgroup of G. Prove that P Q is normal in G.
(c) Prove that G is solvable.
8. Prove (without using the Feit-Thompson Theorem) that the following statements are equivalent:
(i) every group of odd order is solvable
(ii) the only simple groups of odd order are those of prime order.
The lower central series of a group G is defined inductively by Γ1 (G) = G and Γn+1 (G) =
[G, Γn (G)] for all n ≥ 1, so we get
G = Γ1 (G) ≥ Γ2 (G) ≥ . . .
and Γn (G) is called the n-th term of the lower central series of G.
A group G is said to be nilpotent of class ≤ n if Γn+1 = {e}.
G = G(0) ≥ G(1) ≥ . . .
G = Γ1 (G) ≥ Γ2 (G) ≥ . . . .
Examples 3.3.2. 1. S3 has the derived series S3 > A3 > {(1)} and has the lower central series
S3 > A3 ≥ A3 ≥ . . . , so S3 is solvable (of length 2) but not nilpotent.
2. S4 has the derived series S4 > A4 > V4 > {(1)} and has the lower central series S4 > A4 ≥
A4 ≥ . . . , so S4 is solvable (of length 3) but not nilpotent.
3. Dn = hρ, τ : ρn = τ 2 = e and τ ρτ −1 = ρ−1 i has the derived series Dn > hρ2 i > {e} and has
a lower central series Dn ≥ hρ2 i ≥ hρ4 i ≥ hρ8 i ≥ . . . . Hence, Dn is solvable (of length 2)
r
unless D1 or D2 which is abelian. But Dn is nilpotent if and only if ρ2 = e for some r if and
only if n is a power of 2.
3.3. Nilpotent Groups 83
Theorem 3.3.1. Let G be a group. Then Γn+1 (G) ≥ G(n) for all n ≥ 0. Hence, a nilpotent group
is solvable. Therefore, Sn is not nilpotent for all n ≥ 5.
Proof. We shall use induction on n. For n = 0, Γ1 (G) = G = G(0) . For the inductive step, we
suppose Γn+1 (G) ≥ G(n) . Thus
Finally, assume that G is nilpotent. Then Γn+1 (G) = {e} for some n, so G(n) = {e}. Hence, G is
solvable.
Remark. In fact, we have Γ1 (G) ≥ G(0) , Γ2 (G) ≥ G(1) , Γ4 (G) ≥ G(2) , Γ8 (G) ≥ G(3) , . . . ,
Γ2n (G) ≥ G(n) , . . . but this is more difficult to prove.
Z(G/Z(G)) = Z2 (G)/Z(G)
where Z2 (G) is a normal subgroup of G containing Z(G). We generalize this construction to make
the following definition.
Definition. The upper central series of a group G is defined inductively by Z0 (G) = {e} and
Zn+1 (G)/Zn (G) = Z(G/Zn (G)) for all n ≥ 1, so we get
Remarks. 1. Z1 (G) is the center of G and Zi+1 (G)/Zi (G) is the center of G/Zi (G).
2. Zi+1 (G)/Zi (G) = Z(G/Zi (G)) is equivalent to Zi+1 (G) = {g ∈ G : [G, g] ≤ Zi (G)} because
Remarks. 1. Since [G, Γi (G)] = Γi+1 (G), the lower central series is a central series for G.
2. If Zn (G) = G for some n, then the upper central series (in reverse order) is a central series
for G:
G = Zn (G) ≥ Zn−1 (G) ≥ · · · ≥ Z1 (G) ≥ Z0 (G) = {e}.
For, the condition Zi+1 (G)/Zi (G) = Z(G/Zi (G)) implies that [G, Zi+1 (G)] ≤ Zi (G).
84 3. Advanced Group Theory
Proof. (1) We shall use induction on n. For n = 1, G1 = G = Γ1 (G). For the inductive step, we
suppose Gn ≥ Γn (G). Then
(2) If G = G1 ≤ G2 ≤ . . . ≤ Gn+1 = {e} is a central series for G, then {e} = Gn+1 ≥ Γn+1 (G),
so G is nilpotent of class ≤ n. Conversely, if G is nilpotent of class ≤ n, then G = Γ1 (G) ≥ . . . ≥
Γn+1 (G) = {e} is a central series of the required length.
Proof. (1) We shall show that Zk (G) ≥ Gn−k+1 by induction on k. For k = 0, Z0 (G) = {e} =
Gn+1 . Suppose Zk (G) ≥ Gn−k+1 . Let g ∈ Gn−(k+1)+1 = Gn−k , then [G, g] ≤ Gn−k+1 ≤ Zk (G), so
g ∈ Zk+1 (G). Hence, Zk+1 (G) ≥ Gn−(k+1)+1 .
(2) Since Zi+1 (G)/Zi (G) = Z(G/Zi (G)), [G, Zi+1 (G)] ≤ Zi (G). Hence, the given series is a
central series.
(3) follows from (1) and (2) using Theorem 3.3.2.
In other words, of all central series for G, the upper central series has the largest groups
and the lower central series has the smallest groups. We can restate some of the conclusions of
Theorems 3.3.2 and 3.3.3 as follows.
Theorem 3.3.4. Let G be a group. Then the following statements are equivalent.
(i) G is nilpotent of class ≤ n.
(ii) Γn+1 (G) = {e}.
(iii) G has a central series G = G1 ≥ G2 ≥ . . . ≥ Gn+1 = {e}.
(iv) Zn (G) = G.
Next, we shall see that a finite nilpotent group behaves like a finite abelian group (Remark
after Theorem 1.6.12). We show that it is a direct product of its Sylow p-subgroups. We recall
Theorem 1.5.2.
Theorem 3.3.5. Let p be a prime. If G 6= {e} is a finite p-group, then Z(G) 6= {e}.
Theorem 3.3.6. Let G be a finite p-group. Then G is nilpotent, and hence G is solvable.
Proof. Consider the upper central series {e} = Z0 (G) ≤ Z1 (G) ≤ Z2 (G) ≤ Z3 (G) ≤ . . . . If
Zi (G) 6= G, then G/Zi (G) is a p-group, so Zi+1 (G)/Zi (G) = Z(G/Zi (G)) 6= {Zi (G)}. That is,
Zi+1 (G) Zi (G). Since G is finite, the central series cannot increase for all i. Hence, Zn (G) = G
for some n, so G is nilpotent.
Theorem 3.3.7. Let G be a nilpotent group and let {e} < Z1 (G) < · · · < Zn (G) = G be the
upper central series of G. Suppose H is a subgroup of G and define inductively N0 (H) = H,
N1 (H) = N (H) = {g ∈ G : gHg −1 ⊆ H}, the normalizer of H and Nk+1 = N (Nk (H)) for all
k ≥ 0. Then Nn (H) = G.
Proof. We shall prove by induction on i that Ni (H) ≥ Zi (G). For i = 0, N0 (H) = H ≥ {e} =
Z0 (G). Suppose Ni (H) ≥ Zi (G). Let g ∈ Zi+1 (G). Then [g, G] ⊆ Zi (G). To show that g ∈
N (Ni (H)), let x ∈ Ni (H). Then gxg −1 x−1 ∈ Zi (G) ≤ Ni (H), so gxg −1 ∈ Ni (H)x = Ni (H).
Hence, g ∈ Ni+1 (H).
Proof. (1) and (2) are analogous to the proofs of 3.2.4 for G is solvable.
(3) Suppose that G and H are nilpotent. Then there exist r, s > 0 so that Γr (G) = {eG } and
Γs (H) = {eH }. Thus Γk (G × H) = Γk (G) × Γk (H) = {(eG , eH )} where k = max{r, s}. Hence,
G × H is nilpotent.
Theorem 3.3.10. Let G be a finite group. Then the following statements are equivalent.
(i) G is nilpotent.
(ii) All Sylow p-subgroups of G are normal in G.
(iii) G is the direct product of its Sylow p-subgroups.
Proof. (i) ⇒ (ii). Assume that G is nilpotent. Recall Theorem 1.5.10 that if P is a Sylow p-
subgroup, then N (N (P )) = N (P ). But Theorem 3.3.8 asserts that if H is a proper subgroup of
G, then N (H) H. Thus we must have N (P ) = G, that is, P is normal in G since P ⊳ N (P ).
(ii) ⇒ (iii). Note that if a Sylow p-subgroup P of G is normal in G, then it is the unique Sylow
p-subgroup of G. Let p1 , p2 , . . . , pk be the distinct prime divisors of |G| and let Pi be the Sylow
pi -subgroup of G. Suppose x ∈ Pi and y ∈ Pj where i 6= j. Then xyx−1 y −1 ∈ Pi ∩ Pj = {e}, so
x and y commute. It follows that φ : P1 × · · · × Pk → G defined by φ(x1 , . . . , xk ) = x1 . . . xk is a
homomorphism. It is easy to show that φ is 1-1 and onto. Hence, G is the direct product of its
Sylow p-subgroups.
(iii) ⇒ (i). A finite p-group is nilpotent (Theorem 3.3.6) and a finite direct product of nilpotent
groups is nilpotent (Theorem 3.3.9). Hence, if G is the direct product of its Sylow p-subgroups,
then G is nilpotent.
Corollary 3.3.11. A finite abelian group is the direct product of its Sylow subgroups.
86 3. Advanced Group Theory
Exercises 3.3. 1. (a) [P. Hall] Let G be a group and x, y, z ∈ G. Write [x, y, z] for [[x, y], z]. Prove that
[x, y −1 , z]y [y, z −1 , x]z [z, x−1 , y]x = e.
(b) Let X, Y, Z ⊆ G and assume [X, Y, Z] = {e} = [Y, Z, X]. Prove that [Z, X, Y ] = {e}.
2. Prove that if N ≤ Z(G) and N and G/N are nilpotent, then G is nilpotent. Give an example of a
group G with a normal subgroup N such that N and G/N are nilpotent but G is not nilpotent.
3. Let G be nilpotent of class 3. Show that if v ∈ G′ and x ∈ G, then v −1 xv = cx where c ∈ Z(G).
Deduce that G′ is abelian.
4. Show that if G is a nilpotent group and N is a normal subgroup of G where N 6= {e}, then N ∩Z(G) 6=
{e}.
5. Prove that if M is a maximal subgroup of a nilpotent group G, then M is normal and |G/M | = p
where p is a prime. (A maximal subgroup is a proper subgroup which is not contained in any other
proper subgroup. Infinite groups needs not possess maximal subgroups.)
6. Prove that if G is a nilpotent group and N is a minimal normal subgroup of G ({e} = 6 N is normal
and simple), then N ≤ Z(G) and |N | = p for some p.
Definition. Let K be a field and Mn (K) be the set of n × n matrices with entries in K. Then
Mn (K) is a ring. Let GLn (K) denote the set of multiplicatively invertible elements in Mn (K),
called the general linear group of degree n, that is,
Since det(AB) = det A det B, det : GLn (K) → K × is a homomorphism (of two groups).
Its kernel consists of determinant one matrices, denoted by SLn (K) and called the special
linear group of degree n. It is a normal subgroup of GLn (K) with quotient GLn (K)/SLn (K)
isomorphic to K × = K r {0}.
Proof. For M to be in the center of G = GLn (K), it must commute with every N in G. In
particular, M commutes with the elementary matrices. Recall that multiplying M on the left by
an elementary matrix corresponds to performing an elementary row operation; multiplying M on
the right by an elementary matrix corresponds to performing an elementary column operation.
Thus, multiplying the ith row of M by a nonzero a gives you the same matrix as multiplying the
ith column of M by a. This implies that the matrix is diagonal. Then, since interchanging the ith
and jth row of M gives us the same matrix as swapping the ith and jth column of M , the ith
entry along the diagonal must equal the jth entry along the diagonal, for all i and j. Therefore,
M must be a multiple of In . Finally, it is easy to see that all nonzero multiples of In do commute
with all N ∈ G. Hence, the theorem is proved for GLn (K).
3.4. Linear Groups 87
For the center of SLn (K), we need to use the elementary matrices Xij (a), i 6= j, whose entries
are the same as that of the identity matrix In except for an a ∈ K in the (i, j) location. It is
obtained by performing the row operation Ri + aRj , i 6= j or the column operation Cj + aCi on
In . Clearly, Xij (a) ∈ SLn (K) for all a ∈ K and i 6= j.
If M is in the center of SLn (K), then M must commute with Xij (1) for all i 6= j, so the ith
and jth columns and rows must be all zeros except for the (i, i) and (j, j) entries which must be
equal. Moreover, the product of the diagonal entries is the determinant which is equal to 1.
From the above theorem, the center of GLn (K) consists of scalar matrices λIn with λ ∈ K ×
and the center of SLn (K) consists of scalar matrices λIn with λ ∈ K and λn = 1. They are normal
and lead to the next definitions.
Definition. The quotient group GLn (K)/Z(GLn (K)) = PGLn (K), called the projective linear
group of degree n. The quotient SLn (K)/Z(SLn (K)) = PSLn (K) is called the projective
special linear group of degree n.
Proof. Let A ∈ GLn (K). Then the columns of A are linearly independent vectors in K n . Thus
the first column of A can be any nonzero vectors in K n . The second column must not be multiple
of the first column, and the jth column must not be a linear combination of the previous j − 1
columns for all j = 2, . . . , n. By the product rule, we obtain the theorem.
and |PSL2 (K)| = |SL2 (K)| if charK = 2 and |PSL2 (K)| = |SL2 (K)|/2 if charK 6= 2.
Lemma3.4.4. Let
K bea field. The groupSL2 (K) is generated by the union of the two subgroups
1 λ 1 0
: λ ∈ K and : µ ∈ K . Hence, every matrix, in SL2 (K) is a finite product of
0 1 µ 1
matrices which either upper triangular or lower triangular and which have 1’s along the diagonal.
These matrices are called unipotent matrices or transvections.
a b
Proof. Let ∈ SL2 (K). Assume that c 6= 0. Perform the following row/column transforma-
c d
tions:
d−1
d−1
a b R1 + 1−a C2 + 1−d
/ 1 0 .
R2 C1
c / 1 c R2 −cR1
/ 1 c c
c d because ad − bc = 1 c d 0 1 0 1
Thus,
1−a 1−d
1 0 1 c a b 1 c 1 0
= .
−c 1 0 1 c d 0 1 0 1
a b
Hence, is a product of transvections.
c d
88 3. Advanced Group Theory
a+b b
If c = 0, then d 6= 0 and the matrix ∈ SL2 (K) can be treated as in the first case.
d d
However,
a b a+b b 1 0
=
0 d d d −1 1
and the result follows.
Theorem 3.4.5. Let K be a field. The elementary matrices Xij (a), defined in the proof of Theorem
3.4.1, generate SLn (K).
Proof. If n = 1, then SL1 (K) = {1} is trivial. Lemma 3.4.4 gives the case n = 2. For n > 2, the
theorem follows from the mathematical induction in a similar manner.
Lemma 3.4.6. The elementary matrices Xij (a), defined in the proof of Theorem 3.4.1, are com-
mutators in SLn (K) except in the case n = 2 and (|K| = 2 or 3).
Proof. If n ≥ 3, this is easy since there is a third index k and [Xik (a), Xkj (a)] = Xij (a). If n = 2,
we use the commutator relation
α 0 1 β 1 (α2 − 1)β
, = .
0 α−1 0 1 0 1
However, given any λ ∈ K, the equation λ = (α2 − 1)β can be solved for β if and only if there
exists a nonzero α ∈ K so that α2 6= 1 (i.e., α 6= ±1). This works as long as K × has at least three
elements.
Corollary 3.4.7. Let K be a field. If n ≥ 2, then SLn (K) is not solvable except in the cases SL2 (F2 )
and SL2 (F3 ).
Theorem 3.4.8. [Jordan-Moore] Let K be a field with |K| ≥ 4. Then PSL2 (K) is a simple group.
Proof. Using the third isomorphism theorem, it suffices to prove that a normal subgroup N of
SL2 (K) containing a matrix other than ±I2 must be all of SL2 (K). Let A 6= ±I2 be a matrix in N .
Then there is a vector ~v in K 2 so that ~v and A~v are linearly independent over K. This means
that
0 b
{~v , A~v } is a basis of K 2 . The matrix representation of A with respect to this basis is (since
1 d
A~v = 0 · ~v + 1 · A~v and A(A~v ) = b · ~v + d · A~vfor some b, d ∈ K). Since
det
A = 1, we actually
0 −1 0 −1
have b = −1. That is, A is conjugate to . Since N is normal, is also in N .
1 d 1 d
Our strategy is to show that N contains all unipotent elements in SL2 (K) by repeatedly using
−1 −1 CB ∈ N for all C ∈ SL (K) and B ∈ N ”. First, apply this trick with B = A
the fact that “C B 2
α 0
and C = (α ∈ K × ) to get
0 α−1
−1 −1 α−2 d(α−2 − 1)
C A CA = ∈ N.
0 α2
3.4. Linear Groups 89
α−2 d(α−2 − 1) 1 µ
Next, repeat the fact with B′ = ′
and C = (µ ∈ K), we get
0 α2 0 1
′−1 ′−1 ′ ′ 1 µ(α4 − 1)
C B CB = ∈ N.
0 1
We get all upper triangular unipotent elements in N as long as there exists an α ∈ K × such that
α4 6= 1. This happens if |K| ≥ 6 since the polynomial x4 − 1 has at most four distinct roots in K ×
or if |K| = 4 since F× 4 ×
4 is cyclic of order 3 and α = α for all α ∈ F4 . Observe that
−1
0 −1 1 µ 0 −1 1 0
=
1 0 0 1 1 0 µ 1
for all µ ∈ K × . This proves that N = SL2 (K) if |K| ≥ 4 and −2|K| 6= 5.
α d(α−2 − 1)
It remains to deal with the case K = F5 . We still have ∈ N for all α ∈ K × .
0 α2
2
−1 −2d −1 −2d 1 −d
Take α = 2 to get ∈ N , and hence = ∈ N . Two cases are
0 −1 0 −1 0 1
possible:
1 −d
(a) d 6= 0. The powers of give all upper triangular unipotent elements. By conjugating
0 1
0 −1
with , the lower triangular ones appear. Thus, N = SL2 (K).
1 0
0 −1 ′′ δ 1
(b) d = 0, so A = . We then perform the standard trick with B = A and C =
1 0 −1 0
×
(δ ∈ F5 ), so that
′′−1 −1 1 −δ
Aδ = C A CA = ∈ N.
−δ δ 2 + 1
Since δ 6= 0, this element is not in the center. Note that itstrace is δ 2 + 2 is never zero. Choose
0 −1
δ = 1, say. Then A1 ∈ N and A is conjugate to A′ = (as at the beginning of the
1 3
proof and the trace remains the same under conjugation). Apply Case (a), to A′ , and the proof is
complete.
Exercises 3.4. 1. (a) Prove that for any integer N , the map SL2 (Z) → SL2 (Z/N Z) obtained by
reducing the matrix entries modulo N is a surjective group homomorphism.
(b) Prove that for positive integers M and N , the maps (“reduction modulo N ”) from SL2 (Z/M N Z)
to SL2 (Z/N Z) and from GL2 (Z/M N Z) to GL2 (Z/N Z) are surjective group homomorphisms.
(c) What is the kernel of the homomorphism GL2 (Z/pe Z) → GL2 (Z/pZ)?
(d) What are the order of the groups GL2 (Z/pe Z) and SL2 (Z/pe Z)?
(e) Let N = pe11 . . . prer be the prime factorization of the positive integer N . Show that the reduc-
e
tions modulo pj j , j = 1, . . . , r, give isomorphisms
Y e
Y e
GL2 (Z/N Z) ∼
= GL2 (Z/pj j ) and SL2 (Z/N Z) ∼
= SL2 (Z/pj j ).
j j
(f) What are the order of the groups GL2 (Z/N Z) and SL2 (Z/N Z)?
2. Show that there is no non-abelian finite simple group of order less than 60. (Hint. We may focus on
groups of the following orders: 24, 30, 40, 48, 54 and 56.)
3. Suppose G is a simple group of order 60. Show that:
(a) G has a subgroup A of order 12
(b) A has exactly five different conjugates
(c) there is an injective homomorphism from G to S5
90 3. Advanced Group Theory
(d) both A5 and H contain every element of S5 of the form g 2 and therefore every 5-cycle and
every 3-cycle
(e) H = A5 .
Deduce that any simple group of order 60 must be isomorphic to A5 and hence PSL2 (F4 ) and
PSL2 (F5 ) are isomorphic to A5 .
In each of the above examples the data inside the brackets h i is sufficient to describe a group,
that is, it gives the multiplication table for a groups. We call such an expression a presentation for
the group. It turns out that every group has a presentation and every presentation defines a group.
However, it is generally difficult to decide if a group defined by a presentation is isomorphic to an
explicitly given group.
Definition. Let A be any (not necessarily finite) set of elements ai for i ∈ I. We think of A as
an alphabet set and of the ai as letters in the alphabet set. Any symbol of the form ani with
n ∈ Z is a syllable and a finite string w of syllables written in juxtaposition is a word. We also
introduce the empty word 1, which has no syllables. A word on A is reduced if w = 1 or the
string ai a−i or a−i ai does not appear in w for all a ∈ A and i ∈ N.
Definition. Let A be a set. Write F [A] for the set of all reduced words formed from our alpha-
bet A. For convenience, we may let F [∅] = {1}. We make F [A] into a group by the juxtaposition
w1 w2 of two words w1 and w2 with reduction of strings ai a−i or a−i ai (if any) for all a ∈ A and
i ∈ N. It is called the free group generated by A.
3.5. Free Groups and Presentations 91
Example 3.5.2. The only example of a free group that has occurred before is Z, which is free on
one generators. Clearly, every free group is infinite.
Example 3.5.3. F2 = hx, yi. The element of F2 are all words in x and y. More precisely, F2 is the
disjoint union of the following seven sets.
1. {1} 4. {xi1 y j1 . . . xik y jk : k > 0, ir , jr ∈ Z r {0}}
i
2. {x : i ∈ Z r {0}} 5. {xi1 y j1 . . . xik y jk xik+1 : k > 0, ir , jr ∈ Z r {0}}
3. {y i : i ∈ Z r {0}} 6. {y j1 xi1 . . . y jk xik : k > 0, ir , jr ∈ Z r {0}}
7. {y j1 xi1 . . . y jk xik y jk+1 : k > 0, ir , jr ∈ Z r {0}}
Let G be a group and let A be a subset of G such that hAi = G. If G is isomorphic to F [A]
under the map ϕ : G → F [A] such that ϕ(a) = a for all a ∈ A, then G is said to be free on A. A
group is free if it is free on some nonempty set A.
Theorem 3.5.1. [Universal Mapping Property of a Free Group] Let A be a nonempty set. Suppose
H is any group and there is a function φ : A → H.
1. There is a unique homomorphism Φ : F [A] → H extending φ.
2. If im φ generates H, then Φ : F [A] → H is a surjection.
3. If G is a group and θ : G → F [A] is an onto homomorphism, then there is a homomorphism
Φ : F [A] → G such that θ ◦ Φ = id F [A] , the identity map on F [A].
Proof. Let G1 and G2 be free groups on S. Then S is a subset of both G1 and G2 . Consider the
inclusion maps ι1 : S → G1 and ι2 : S → G2 and the result follows from the uniqueness of the
universal mapping property.
Proof. Let A be a set for which there exists a bijection φ : A → H (e.g., take A = H and φ = id H ),
and let G = F [A]. By the universal mapping property, there is an onto homomorphism Φ : G → H
extending φ. Therefore, G/(ker Φ) ∼ = H.
We refer the reader to reference textbooks for proofs of the next three theorems. They are
stated simply to inform us of these interesting facts.
Theorem 3.5.4. If a group G is free on A and also on B (not necessarily finite), then the sets A
and B have the same number of elements; that is, any two sets of generators of a free group have
the same cardinality.
We shall prove this theorem for the finite basis case with some result on finitely generated free
abelian group (Corollary 4.2.7) in the next chapter.
Theorem 3.5.5. Two free groups are isomorphic if and only if they have the same rank.
This is not trivial to prove. There is a nice proof of this result using covering spaces (cf. J.-P.
Serre, Trees, Springer-Verlag, 1980).
Example 3.5.4. Let yl = xl yx−l for l ≥ 0. Then yl , l ≥ 0, are free generators for the subgroup of
F2 = hx, yi that they generate. This illustrates that although a subgroup of a free group is free,
the rank of the subgroup may be much greater than the rank of the whole group!
θ / φ
Definition. Let G H / K be a sequence of groups homomorphisms. We say that it
is exact at H if im θ = ker φ. A short exact sequence of groups is a sequence of groups and
homomorphisms
θ φ
1 /G /H /K /1
G = hg1 , . . . , gr : w1 = · · · = wt = 1i
where w1 , . . . , wt are words in g1 , . . . , gr such that the following two properties are satisfied: (1)
g1 , . . . , gr generate G and (2) the conditions that w1 = w2 = · · · = wt = 1 are sufficient to define
the multiplication table of G. Here, g1 , . . . , gr are called generators of G in the presentation
and w1 , w2 , . . . , wt are called defining relations.
Note that the free group of rank n is the group Fn = hx1 , . . . , xn : i given by a presentation
with n generators and zero defining relation.
ι φ
1 / ker φ / Fn /G / 1.
What is the kernel of φ? ker φ is a normal subgroup of Fn and contains wi (x1 , . . . , xn ) for i =
1, . . . , t. In fact, ker φ is the smallest normal subgroup of Fn which contains wi (x1 , . . . , xn ) for
i = 1, . . . , t.
and so
Example 3.5.5. Consider the free group F2 = hx, yi. Let G = hx, y : xyx−1 y −1 = 1i ∼= F2 /N .
Since G is abelian, F2′ ⊆ N . But xyx−1 y −1 ∈ N and N is the smallest, so N = F2′ .
Theorem 3.5.8. [von Dyck’s Theorem /fon dike/] Let G be given by a presentation
G = hx1 , . . . , xn : w1 = · · · = wt = 1i.
Proof. By Theorem 3.5.7, G ∼ = F/N , where F is a free group on {x1 , . . . , xn } and N is the
normal closure of {w1 , . . . , wt }. By the assumption N ⊆ ker φ, so φ induces a (well-defined)
homomorphism xi = xi N 7→ hi for all i ∈ {1, . . . , n}.
Solution. Since 6 = 2 · 3, G contains elements a and b such that |a| = 2 and |b| = 3 and G = ha, bi.
Since hbi is normal in G, aba−1 ∈ hbi. Thus, aba−1 = b or aba−1 = b−1 . If aba−1 = b, then G is
abelian, so G ∼= Z6 . Assume that aba−1 = b−1 . Then G = ha, b : a2 , b3 , aba−1 = b−1 i. Note that
S3 = h(12), (123)i and (12)(123)(12)−1 = (132) = (123)−1 . By von Dyck’s theorem, there is an
onto homomorphism from G to S3 . But |G| = 6 = |S3 |, G ∼ = S3 .
Exercises 3.5. 1. (a) Prove that the derived group of a free group consists of those words in which the
sum of the exponents for each generator is equal to zero (e.g., x1 x−1 −2
2 x1 x2 x1 ).
(b) Let F be a free group generated by x1 , x2 , . . . , xr . Show that each element of F/F ′ is of the form
′ ∼
(xm 1 m2 mr ′ r
1 x2 . . . xr )F . Now use (a) to show that F/F = Z , i.e., F/F is the free abelian group of
′
rank r.
2. Determine the structure of G/G′ , when G is given by
(i) a6 = b2 = (ab)2 = 1; (ii) a6 = 1, b2 = (ab)2 = a3 .
3. Show that if G is generated by a and b subject to the relations a−1 ba = b2 and ab = ba2 , then
G = {1}.
4. Let G be a group. For a, b ∈ G, let [a, b] = aba−1 b−1 and ab = bab−1 .
(a) Prove that [a, bc] = [a, b][a, c]b for all a, b, c ∈ G.
(b) If H = hx, y, z ∈ G : [x, y] = y, [y, z] = z and [z, x] = xi, show that H = {e}.
5. If G is a non-abelian group of order eight, show that G is isomorphic to D4 or Q8 .
94 3. Advanced Group Theory
4 | Modules and Noetherian Rings
Modules can be considered as a generalization of vector spaces. It is like we study linear algebra
over a ring. In this chapter, we first cover basic concepts of modules. Next, we work on free
modules. Projective and injective modules are introduced. We also present the proof of the
structure theorems for modules over a PID. Finally, we talk about Noetherian and Artinion rings.
Noetherian rings have a lot of applications in algebraic geometry and algebraic number theory.
Each ring R that we consider will be assumed to contain a multiplicative identity element,
which will be denoted by 1. We shall therefore regard the possession of such an identity as one of
the defining conditions of the ring concept and also assume 1 6= 0.
4.1 Modules
The definition of a module is similar to a vector space. However, now our scalars are in a ring.
Remark. Note that we abuse notations by not distinguishing between the addition in M or in R
and the multiplication in R or the multiplication R × M → M . A right R-module can be defined
analogously.
Examples 4.1.1. 1. If R = F , a field, an F -module is just a vector space over F .
2. Any abelian group A is a Z-module, where the action of Z is given by for a ∈ A,
0 · a = 0A , n · a = |a + a +
{z· · · + a} if n > 0 and
n
n · a = (−a) + (−a) + · · · + (−a) if n < 0.
| {z }
−n
95
96 4. Modules and Noetherian Rings
Definition. Let R be a ring. The direct sum of R-modules M and N is the abelian group direct
sum of M and N
M ⊕ N = {(m, n) : m ∈ M, n ∈ N }
with the action of R on M ⊕ N given by
f g
M /N /P
f1 f2
··· / M1 / M2 / M3 / ···
that may be finite or run to infinity in either direction is called exact if for any three con-
secutive terms the subsequence Mi / Mi+1 / Mi+2 is exact. An exact sequence of
the form
f g
0 / M′ /M / M ′′ /0
ker f = {m ∈ M : f (m) = 0N }
and the cokernel of f is N/im f . They are clear that ker f and im f are R-submodules of
M and N , respectively. Evidently, f is surjective if and only if coker f = 0. In any case,
we have
f
0 / ker f /M /N / coker f /0
is exact.
Remark. The isomorphism theorems also hold for R-modules and their homomorphisms. Note
however that the first isomorphism theorem will say a bit more, because coker f = N/im f is an
R-module. This is not the case with homomorphisms of groups or rings: If f : G → H is a group
homomorphism, then f (G) = im f is not in general a normal subgroup of H, hence H/im f is not
4.1. Modules 97
Theorem 4.1.1. [First Isomorphism Theorem] Let M and N be R-modules. Then the following
diagram of R-modules has an exact row and a commutative square.
f
0 / ker f /M /N
O
/ coker f /0
π i
∼
=
M/ ker f / im f
f¯
0 / N1 ∩ N2 / N2 / N2 /(N1 ∩ N2 ) /0
∼
=
0 / N1 / N1 + N2 / (N1 + N2 )/N1 /0
Theorem 4.1.3. [Third Isomorphism Theorem] If N2 ≤ N1 ≤ N are R-modules, then the fol-
lowing diagram is commutative and has exact rows:
0 / N1 /N / N/N1 /0
id
0 / N1 /N2 / N/N2 / N/N1 /0
Theorem 4.1.4. Let N1 and N2 be submodules of an R-module N . Then the following diagram is
commutative and has exact rows and columns.
0 0 0
0 / N1 ∩ N2 / N2 / N2 /(N1 ∩ N2 ) /0
0 / N1 /N / N/N1 /0
0 / N1 /(N1 ∩ N2 ) / N/N2 / N/(N1 + N2 ) /0
0 0 0
98 4. Modules and Noetherian Rings
Proof. The commutativity and the exactness of the top two rows and the two left columns are
clear. The exactness of the third row and right column come, respectively, from the isomorphisms
(N1 + N2 )/N2 ∼= N1 /(N1 ∩ N2 ) and (N1 + N2 )/N1 ∼ = N2 /(N1 ∩ N2 ).
The next theorem has wide use in mathematics. It is proved by the technique called “diagram
chasing”.
Theorem 4.1.5. [5-Lemma] Suppose the following diagram is commutative and has exact rows.
α1 α2 α3 α4
A1 / A2 / A3 / A4 / A5
f1 f2 f3 f4 f5
B1 / B2 / B3 / B4 / B5
β1 β2 β3 β4
Proof. (1) Assume f1 is onto and f2 and f4 are 1-1. Suppose x ∈ A3 and f3 (x) = 0. We shall
show that x = 0. Since f4 (α3 (x) = β3 (f3 (x)) = β3 (0) = 0 and f4 is 1-1, α3 (x) = 0, so x ∈
ker α3 = im α2 from the exactness of the top row. Thus, x = α2 (y) for some y ∈ A2 . Then
0 = f3 (x) = f3 (α2 (y)) = β2 (f2 (y)), so f2 (y) ∈ ker β2 = im β1 from the exactness of the bottom
row. Thus, f2 (y) = β1 (z) for some z ∈ B1 . Since f1 is onto, there is a u ∈ A1 with f1 (u) = z.
Then f2 (y) = β1 (z) = β1 (f1 (u)) = f2 (α1 (u)), so y = α1 (u) since f2 is 1-1. Hence, x = α2 (y) =
α2 (α1 (u)) = 0 since α2 α1 = 0 by the exactness of the top row.
(2) Assume f5 is 1-1 and f2 and f4 are onto. Let x ∈ B3 . We must find w ∈ A3 with f3 (w) = x.
Since f4 is onto, we can choose y ∈ A4 with f4 (y) = β3 (x). Then f5 (α4 (y)) = β4 (f4 (y)) =
β4 (β3 (x)) = 0 from the bottom row is exact. But f5 is 1-1, so α4 (y) = 0. Since the top row is exact,
y = α3 (z) for some z ∈ A3 . Then β3 (x) = f4 (y) = f4 (α3 (z)) = β3 (f3 (z)), so β3 (x − f3 (z)) = 0.
Thus, there is a u ∈ B2 with β2 (u) = x − f3 (z) from the bottom row is exact. Since f2 is onto,
there is a v ∈ A2 with f2 (v) = u. Hence, x − f3 (z) = β2 (u) = β2 (f2 (v)) = f3 (α2 (v)), so
x = f3 (z + α2 (v)) = f3 (w) where w = z + α2 (v). That is, f3 is onto.
α β
Theorem 4.1.6. Let 0 /L /M /N / 0 be a short exact sequence of R-modules.
Then the following three conditions are equivalent.
(i) There exists an isomorphism M ∼ = L ⊕ N in which α : l 7→ (l, 0) and β : (l, n) 7→ n.
(ii) There exists a section of β, that is, a homomorphism s : N → M such that β ◦ s = id N .
(iiii) There exists a retraction of α, that is, a homomorphism r : M → L such that r ◦ α = id L .
If this happens, the sequence is a split exact sequence.
m = (m − s(β(m))) + s(β(m)),
where the second term is obviously in s(N ); since β ◦ s = id N , the first term is clearly in ker β,
and by exactness this is α(L). Furthermore, α(L) ∩ s(N ) = {0}, since if n ∈ N is such that
s(n) ∈ α(L) = ker β then n = β(s(n)) = 0.
(iii) ⇒ (i) is similar to (ii) ⇒ (i) and left as an exercise.
4.2. Free Modules and Matrices 99
For finite dimensional vector spaces over a field, every subspace has complement, so every
short exact sequence splits. Whether an exact sequence splits or not depends on what ring it is
considered over. For example,
Since ϕ is 1-1, a0 6= 0. Since ϕ is onto, a2 , . . . , an = 0 and a1 6= 0. Thus, we reduce the above map to
ϕ
(b0 + b1 x + · · · + bm xm ) 7→ (b0 a0 , a1 (b0 x + b1 x2 + · · · + bm xm+1 )).
{(m1 , . . . , mk ) : mi ∈ Mi }
k
M
The direct sum of M1 , . . . , Mk is denoted by M1 ⊕ · · · ⊕ Mk or Mi .
i=1
100 4. Modules and Noetherian Rings
M = Rx1 + · · · + Rxk .
Now let M = Rx1 + · · · + Rxn be the free R-module on the set X = {x1 , . . . , xn }. Suppose N
is any left R-module and y1 , . . . , yn are any elements of N . Let us define a map φ : M → N by
φ(r1 x1 + · · · + rn xn ) = r1 y1 + · · · + rn yn .
4.2. Free Modules and Matrices 101
Then φ is a homomorphism of left R-modules such that φ(xi ) = yi for all i. In fact, we could also
define a homomorphism even if X were infinite. The point is that any set map X → N gives rise
to an R-module homomorphism M → N . More precisely,
Theorem 4.2.2. [Universal Mapping Property of a Free Module] Let R be a ring, X a set and
M = M (X) the free R-module on the set X. Let i : X → M be defined by i(x) = 1 · x for all
x ∈ X. (i may be thought of as an inclusion map.) Suppose N is an R-module and α : X → N is
a set map. Then there exists a unique R-module homomorphism θ : M → N such that θ ◦ i = α.
i /
X❇ M
❇❇
❇❇
α ❇❇❇ θ
N
Next, let us consider homomorphism of finitely generated free R-modules. Suppose M and N
are free R-modules with bases X = {x1 , . . . , xm } and Y = {y1 , . . . , yn } where
2. If R is commutative, then we can make homR (M, N ) into a left R-module by defining
(rφ)(m) = rφ(m). Note that rφ : M → N is really an R-module homomorphism, for if
m ∈ M, s ∈ R, then
homR (M, N ) −→ N ⊕ · · · ⊕ N
φ 7−→ (φ(x1 ), . . . , φ(xm ))
For k ≥ 1 and a ring R, let Rk denote the R-module of k × 1 column vectors over R. Now
let us return to free R-modules M = Rx1 + · · · + Rxm and N = Ry1 + · · · + Ryn . As noted
earlier, if φ : M → N is an R-module homomorphism, then φ is completely determined by
φ(x1 ), . . . , φ(xm ), and φ 7→ (φ(x1 ), . . . , φ(xm )) is an isomorphism of abelian groups, and it is an
isomorphism of R-modules if R is commutative. Since N = Ry1 + · · · + Ryn is free on y1 , . . . , yn
every element of N can be uniquely expressed in the form
y = r 1 y1 + . . . r n y n .
Write Ruv for the R-module of u × v matrices over R. For each R-module homomorphism φ :
M → N , we define
a11 a12 . . . a1m
a21 a22 . . . a2m
[φ] = .
. .. ∈ Rnm
. .
an1 an2 . . . anm
implicitly from the equations
Theorem 4.2.4. Let R be a commutative ring. Under the above set-up we have:
1. Each matrix [φ] ∈ Rnm defines a homomorphism [φ] : Rm → Rn by left multiplication of an
n × m matrix by an m × 1 matrix. The same is true for [θ] : Rn → Rp and [τ ] : Rm → Rp .
2. The following diagram is commutative
φ θ
M /N /P
α ∼
= β ∼
= γ ∼
=
[φ] [θ]
Rm / Rn / Rp
:
[θφ]
Recall the following fact about matrices: Let R be a commutative ring and suppose A ∈
Mn (R). Then
Theorem 4.2.5. Let R be a commutative ring and let [φ] ∈ Rmn and [θ] ∈ Rnm . Suppose [φ][θ] =
Im and [θ][φ] = In are identity matrices of sizes m × m and n × n, respectively. Then m = n.
Theorem 4.2.6. Let R be a commutative ring and suppose that M = Rx1 + · · · + Rxm and
N = Ry1 + · · · + Ryn are free R-modules with indicated generators. If M and N are isomorphic
R-modules, then m = n.
In other words, [φ] is an n × m matrix and [θ] is an m × n matrix with [φ][θ] = In and [θ][φ] = Im .
Hence, m = n by Theorem 4.2.5.
Corollary 4.2.7. If F is a finitely generated free abelian group, then any two bases of F have the
same number of elements.
Using this corollary, we can verify that if a group G is free on A and also on B, which are finite
sets, then the sets A and B have the same number of elements. It is Theorem 3.5.4 for the finite
basis case.
Proof of Theorem 3.5.4 for the finite basis case. Assume that G is a free group on A and also on B,
where A and B are finite sets. By Exercise 1, G/G′ is a free abelian group of rank |A| and |B|,
respectively. By Corollary 4.2.7, |A| = |B|.
Remarks. 1. As we have seen that subgroups of a free (abelian) group are free. This is not
true for general R-modules. For example, let R = Z6 . Then R R is a free R-module generated
by {1}. N = {0, 2, 4} is an R-submodule of R R. Since ∅ does not span N , ∅ is not a basis.
If B 6= ∅ is a basis of N , then 0 ∈
/ B, so 2 or 4 are in B. Since 3 · 2 = 0 and 3 · 4 = 0 where
3 6= 0, where B is not linearly independent. Hence, submodules of a free module may not be
free.
2. In the case of free abelian groups and vector spaces, it is true that any two bases of have the
same cardinality. This is not true in general as shown in the following example.
Example 4.2.1. Let S be a ring and F a free S-module with infinite denumerable basis {e1 ,
e2 , e3 , . . . }. Let R = homS (F, F ). Then R is a ring with identity 1R , so {1R } is a basis for R R.
Next, we define f1 , f2 ∈ R as follows: f1 (e2n ) = en , f1 (e2n−1 ) = 0 and f2 (e2n ) = 0, f2 (e2n−1 ) =
en . To show that {f1 , f2 } spans R R, let g ∈ R. Define g1 , g2 ∈ R by g1 (en ) = g(e2n ) and
g2 (en ) = g(e2n−1 ). Then (g1 f1 + g2 f2 )(e2n−1 ) = g1 f1 (e2n−1 ) + g2 f2 (e2n−1 ) = g2 (en ) = g(e2n−1 )
and (g1 f1 + g2 f2 )(e2n ) = g1 f1 (e2n ) + g2 f2 (e2n ) = g1 (en ) = g(e2n ). Thus, g = g1 f1 + g2 f2 . Next we
shall prove that {f1 , f2 } is linearly independent over R. Let h1 , h2 ∈ R such that h1 f1 + h2 f2 = 0.
Then for any n ≥ 1, h1 (en ) = h1 (en ) + 0 = h1 f1 (e2n ) + h2 f2 (e2n ) = (h1 f1 + h2 f2 )(e2n ) = 0 and
h2 (en ) = 0 + h2 (en ) = h1 f1 (e2n−1 ) + h2 f2 (e2n−1 ) = (h1 f1 + h2 f2 )(e2n−1 ) = 0, so h1 = h2 = 0.
Hence, {f1 , f2 } is linearly independent and so it is a basis of R R.
is exact.
f g
In a similar manner, one can prove that exactness of N → N ′ → N ′′ → 0 implies exactness of
hom(−,M )(g) hom(−,M )(f )
0 / homR (N ′′ , M ) / homR (N ′ , M ) / homR (N, M )
where hom(−, M )(f )(h) = h ◦ f for all h ∈ homR (N ′ , M ) and hom(−, M )(g)(h) = h ◦ g for all
h ∈ homR (N ′′ , M ).
4.3. Projective and Injective Modules 105
6. Let R be a ring, I a proper ideal of R and F a free R-module with a basis X. Then F/IF is a free
R/I-module with a basis of cardinality |X|.
P
f
M p
//N
P
g
f
~
M p
//N
i p
We recall that for any module M , if 0 / N′ /N / N ′′ / 0 is exact, then
hom(M,i) hom(M,p)
0 / homR (M, N ′ ) / homR (M, N ) / homR (M, N ′′ )
hom(M,i) hom(M,p)
0 / homR (M, N ′ ) / homR (M, N ) / homR (M, N ′′ ) /0
i p
as a consequence of the exactness of 0 / N′ /N / N ′′ /0.
p
The converse holds also. Suppose hom(P, −) is exact and suppose M / / N . Let K = ker p.
i p
Then we have the exact sequence 0 /K /M /N / 0 where i is the inclusion
map. Applying the exactness of hom(P, −), we obtain the property of a projective module. There-
fore,
Theorem 4.3.1. Let P be an R-module. Then P is projective if and only if for any R-modules
i p
N, N ′ and N ′′ , if 0 / N′ /N / N ′′ / 0 is a short exact sequence, then
hom(P,i) hom(P,p)
0 / homR (P, N ′ ) / homR (P, N ) / homR (P, N ′′ ) /0
1
g : xn 7→ for all n ∈ N.
n
m
Then g induces a Z-module homomorphism from F to Q. Since g(mxn ) = n for all m ∈ Z and
n ∈ N, g is onto. Assume that Q is projective.
Q
h
id Q
F //Q
g
X
Then there exists an h : Q → F such that gh = id Q . Suppose h(1) = ai xi (with all but finite
Y X i
ai = 0). Let k = 1 + |ai | and assume that h(k −1 ) = bi xi (again, with all but finite ai = 0).
i,ai 6=0 i
Then X X X
kbi xi = k bi xi = kh(k −1 ) = h(1) = ai xi ,
i i i
X
so (ai − kbi )xi = 0. Since X is linearly independent, ai = kbi for all i which implies k | ai for
i
all i. This forces k = 1 and ai = 0 for all i. Thus, h is the zero map which contradicts gh = id Q .
Hence, Q is not projective.
How close are projective modules to being free? We shall give two important characterizations
of projective modules as follows.
f g
Proof. (i) ⇒ (ii). Let 0 /M /N /P / 0 be exact and consider the diagram
P
id P
N g
//P
i p
(iii) ⇒ (i). We are given that there exists a sequence 0 / P′ /F /P / 0 with F
is free. Now suppose we have a diagram
P
f
M q
//N
p
/ P′ i /
0 ❆❆ ′ P
/F o /0
❆❆i
❆❆ f
f p ❆❆
M //N
q
where pi′ = id P (since the top line splits). Since F is free, hence projective, we can fill in
g : F → M to obtain f p = qg. Then f = f id P = f pi′ = qgi′ and gi′ : P → M make
P
gi′
⑤
⑤⑤ f
⑤⑤⑤
~⑤
⑤
M q //N
Of particular interest are the modules that are finitely generated and projective. The theorem
gives the following characterization of these modules.
Corollary 4.3.4. A module P is finitely generated and projective if and only if P is a direct sum-
mand of a free module with a finite base.
Proof. If P is a direct summand of a free module F with finite base, then P is projective. Moreover,
P is a homomorphic image of F , so P has a finite set of generators (the images of the base under
an epimorphism of F onto P ). Conversely, suppose P is finitely generated and projective. Then
the first condition implies that we have an exact sequence 0 → P ′ → F → P → 0 where F is free
with finite base. The proof of the theorem shows that if P is projective, then F ∼= P ⊕ P ′ , so P is
a direct summand of a free module with finite base.
The concept of a projective module has a dual obtained by reversing the arrows in the defini-
tion as follows.
i
0 /N /M
f
Q
With a slight change of notation, the definition amounts to this: Given an exact sequence
i
0 → N ′ → N , the sequence
hom(i,Q)
homR (N, Q) / homR (N ′ , Q) /0
i p
is exact. Since we know that exactness of 0 → N ′ → N → N ′′ → 0 implies exactness of
hom(p,M ) hom(i,M )
0 / homR (N ′′ , M ) / homR (N, M ) / homR (N ′ , M ) ,
it is clear that Q is injective if and only if hom(−, Q) is exact in the sense that it maps any short
exact sequence 0 → N ′ → N → N ′′ → 0 into a short exact sequence of Z-module
0 → homR (N ′′ , Q) → homR (N, Q) → homR (N ′ , Q) → 0.
It is easily seen also that the definition of injective is equivalent to the following: If N is a
submodule of a module M , then any homomorphism of N into Q can be extended to a homo-
morphism of M into Q. Another result, which is easily established by dualizing the proof of the
analogous result on projective (Theorem 4.3.3), is that if Q is injective, then any short exact se-
quence 0 → Q → M → N → 0 splits. The converse of this holds also. However, the proof requires
the dual of the easy result that any module is a homomorphic image of a projective module (in
fact, a free module). The dual statement is that any module can be embedded in an injective one.
We shall see that this is the case, but the proof will turn out to be fairly difficult.
Theorem 4.3.5. [Baer] A right module Q is injective if and only if any homomorphism of a right
ideal I of R into Q can be extended to a homomorphism of R into Q.
Proof. Obviously, the condition is necessary. Now suppose it holds and suppose M is a module
and f is a homomorphism of a submodule N of M into Q. Consider the set {(g, M ′ )} where M ′
is a submodule of M containing N and g is a homomorphism of M ′ into Q such that g|N = f .
We define a partial order in the set {(g, M ′ )} by declaring that (g1 , M1′ ) ≥ (g2 , M2′ ) if M1′ ⊃ M2′
and g1 |M2′ = g2 . It is clear that any totally ordered subset has an upper bound in this set.
Hence, by Zorn’s lemma, there exists a maximal (g, M ′ ); that is, we have an extension of f to a
homomorphism g of M ′ ⊃ N which is maximal in the sense that if g1 is a homomorphism of an
M1′ ⊃ M ′ such that g1 |M ′ = g, then necessarily M1′ = M ′ . We claim that M ′ = M . Otherwise,
/ M ′ and so xR + M ′ is a submodule of M properly containing M ′ . Now let
there is an x ∈ M, ∈
I = {s ∈ R : xs ∈ M ′ }.
Then I = ann(x + M ′ ) in M/M ′ , so I is a right ideal of R. If s ∈ I, then xs ∈ M ′ , so g(xs) ∈ Q.
It is immediate that the map h : s 7→ g(xs) is a module homomorphism of I into Q. Hence, by
hypothesis, h can be extended to a homomorphism k of R into Q. We shall use this to obtain an
extension of g to a homomorphism of xR + M ′ to Q. The elements of xR + M ′ have the form
xr + y, r ∈ R, y ∈ M ′ . If we have a relation xs + y ′ = 0, s ∈ R, y ′ ∈ M ′ , then s ∈ I. Then
k(s) = h(s) = g(xs) = −g(y ′ ).
Thus, xs + y ′ = 0 for s ∈ R, y ′ ∈ M ′ , implies that k(s) + g(y ′ ) = 0. It follows that
xr + y 7→ k(r) + g(y),
r ∈ R, y ∈ M ′ , is a well-defined map. For, if xr1 + y1 = xr2 + y2 , ri ∈ R, yi ∈ M ′ , then xs + y ′ = 0
for s = r1 − r2 , y ′ = y1 − y2 . Then k(s) + g(y ′ ) = 0 and k(r1 − r2 ) + g(y1 − y2 ) = 0. Since k and g
are homomorphisms, this implies that k(r1 ) + g(y1 ) = k(r2 ) + g(y2 ). It is immediate that the map
rx+y 7→ k(r)+g(y) is a module homomorphism of xR+M ′ into Q extending the homomorphism
g of M ′ . This contradicts the maximality of (g, M ′ ). Hence, M ′ = M and we have proved that if f
is a homomorphism of a submodule N of M into Q, then f can be extended to a homomorphism
of M into Q. Hence, Q is injective.
4.3. Projective and Injective Modules 109
For certain “nice” rings, the concept of injectivity of modules is closely related to the simpler
notion of divisibility, which we proceed to define.
Theorem 4.3.6. 1. If R has no zero divisors 6= 0, then any injective R-module is divisible.
2. If R is a ring such that every right ideal of R is principal (= aR for some a ∈ R), then any
divisible R-module is injective.
Proof. (1) Suppose R has no zero-divisors 6= 0 and let Q be an injective R-module. Let x ∈ Q, r ∈
R, r 6= 0. If a, b ∈ R and ra = rb, then a = b. Hence, we have a well-defined map ra 7→ xa, a ∈ R,
of the right ideal rR into Q. Clearly this is a module homomorphism. Since Q is injective, the
map ra 7→ xa can be extended to a homomorphism of R into Q. If 1 7→ y under this extension,
then r = 1r 7→ yr. Since r = r1 7→ x1 = x, we have x = yr. Since x was arbitrary in Q and r was
any non-zero element of R, this shows that Q is divisible.
(2) Suppose R is a ring in which every right ideal is principal. Let M be a divisible R-module
and let f be a homomorphism of the right ideal rR into M . If r = 0, then f is the zero map
and this can be extended to the zero map of R. If r 6= 0 and f (r) = x ∈ M , then there exists
a y in M such that x = yr. Then a 7→ ya is a module homomorphism of R into M and since
rb 7→ yrb = xb = f (r)b = f (rb), a 7→ ya is an extension of f . Thus, any module homomorphism
of a right ideal of R into M can be extended to a homomorphism of R. Hence, M is injective by
Baer’s criterion.
If R satisfies both conditions stated in the theorem, then an R-module is injective if and only
if it is divisible. In particular, this holds if R is a PID. We can use this to construct some examples
of injective modules.
Examples 4.3.2. 1. Let R be a subring of a field F and regard F as an R-module in the natural
way. Evidently F is a divisible R-module. Hence, if K is any R-submodule of F , then F/K
is a divisible R-module. In particular, Q is an injective Z-module which is not projective.
2. Let D be a PID, F its field of fractions. If r ∈ D, then F/(r)((r) = rD) is divisible and hence
is injective by Theorem 4.3.6.
Our next objective is to prove that any module can be embedded in an injective module, that
i
is, given any M there exists an exact sequence 0 → M → Q with Q is injective. The first step in
the proof we shall give is as follows.
Lemma 4.3.7. Any abelian group can be embedded in a divisible group (= a divisible Z-module).
Proof. First let F be a free abelian group with base {xα } and F ′ the vector space over Q with {xα }
as base. Then F is embedded in F ′ and it is clear that F ′ is divisible. Now let M be an arbitrary
abelian group. Then M is isomorphic to a factor group F/K of a free abelian group F . Hence,
F ′ /K is a divisible group and F/K ∼ = M is a subgroup.
Now for an arbitrary R-module M , we have the isomorphism of M onto homR (R, M ) which
maps an element x ∈ M into the homomorphism fx such that 1 7→ x. This is an R-isomorphism
if we make homR (R, M ) into a right R-module by defining f a, a ∈ R, by (f a)(b) = f (ab). Also
homZ (R, M ) is a right R-module using this definition of f a. Clearly homR (R, M ) is a submod-
ule of homZ (R, M ). Since M is isomorphic to homR (R, M ), we have an embedding of M in
homZ (R, M ). Now embed M in an injective Z-module Q, which can be done by the foregoing
corollary. Then we have an embedding of homZ (R, Q) as R-modules. This gives an embedding of
M in an injective R-module, since we have the following lemma.
f
Proof. We must show that if 0 → N ′ → N is an exact sequence of R-modules, then
f∗
homR (N, homZ (R, Q)) → homR (N ′ , homZ (R, Q)) → 0
and the definition shows that this is “natural” in N . Since the isomorphism of N ⊗R R onto N
such that y ⊗ 1 7→ y is natural in N , we have an isomorphism
ψN
homZ (N, Q) / homR (N, homZ (R, Q))
f¯ f∗
homZ (N ′ , Q) / homR (N ′ , homZ (R, Q))
ψN ′
where f¯ = hom(f, Q). Now f¯ is surjective since Q is Z-injective. Since ψN and ψN ′ are isomor-
phisms, this implies that f ∗ is surjective.
Proof. We leave the proof of (i) ⇒ (ii) as an exercise. Conversely, suppose any short exact se-
quence 0 → Q → M → N → 0 splits. By the embedding theorem we have an exact sequence
i i p
0 → Q → M where M is injective. Then we have the short exact sequence 0 → Q → M →
4.4. Modules over a PID 111
M/Q → 0 where p is the canonical homomorphism of M onto M/Q. By hypothesis, we can find
a p′ : M → Q such that p′ i = id Q . Now suppose we have a diagram
j
0 / N′ /N
f
Q
j
0 / N′ /N
p′ g
f
x
QO g
i p′
M
This means that by the injectivity of M we have g : N → M such that if = gj. Then f = id Q f =
p′ if = (p′ g)j. Hence, Q is injective.
It is possible to prove a sharper result than Theorem 4.3.10, namely that there is a minimal
injective R-module H containing M in the sense that any injective map of M into an injective
R-module Q factor through H. More precisely, if M ⊆ Q for an injective R-module Q then there
is an injection i : H → Q that restricts to the identity map on M ; using i to identify H as a subset
of Q we have M ⊆ H ⊆ Q. This module H is called the injective hull or injective envelope
of M . For example, the injective hull of Z is Q, and the injective hull of any field is itself.
Theorem 4.4.1. Let R be a PID and suppose that M is a finitely generated R-module. Then there
is an integer r ≥ 0 and nonzero elements d1 , . . . , dk ∈ R with d1 |d2 , . . . , dk−1 |dk such that
M∼
=R| ⊕ ·{z
· · ⊕ R} ⊕R/Rd1 ⊕ · · · ⊕ R/Rdk .
r copies
N∼
=R · · ⊕ R} ⊕R/Rd¯1 ⊕ · · · ⊕ R/Rd¯k̄ ,
| ⊕ ·{z
r̄ copies
where d¯i |d¯i+1 , then M and N are isomorphic as R-modules if and only if r = r̄, k = k̄ and di and
d¯i are associates for i = 1, . . . , k.
Note that we cannot assert more than that di and d¯i are associates, for if d and d¯ are associates,
then R/Rd ∼ = R/Rd. ¯
Since abelian groups are equivalent to Z-modules, this theorem can be stated as “A finitely
generated Z-module is a direct sum of cyclic modules”. Actually, the theorem was more precise in
that it actually classified all finitely generated Z-modules up to isomorphism. That is, one has
Theorem 4.4.2. Let M be a finitely generated Z-module. Then there are nonnegative integers
r ≥ 0, d1 , . . . , dk > 0 where d1 |d2 , . . . , dk−1 |dk such that
M∼
=Z · · ⊕ Z} ⊕Z/d1 Z ⊕ · · · ⊕ Z/dk Z.
| ⊕ ·{z
r copies
N∼
=Z · · ⊕ Z} ⊕Z/d¯1 Z ⊕ · · · ⊕ Z/d¯k̄ Z,
| ⊕ ·{z
r̄ copies
where d¯i |d¯i+1 , then M and N are isomorphic if and only if r = r̄, k = k̄ and di = d¯i for
i = 1, . . . , k. r is called the rank or torsion-free rank of M and d1 , . . . , dk are called the invariant
factors of M .
Corollary 4.4.3. A finitely generated abelian group is a direct product of cyclic groups.
The strategy of our proof is the following. First we observe that even with no hypothesis on
the ring R, the following statements are equivalent:
(i) M is a finitely generated R-module and can be generated by s elements.
(ii) Let F = Rx1 + · · · + Rxs be a free R-module with s free generators. Then there is an exact
φ
sequence of R-modules 0 /K /F /M / 0 where K = ker φ.
(iii) M ∼= F/K where F is a free R-module on s free generators and K is an R-submodule of F .
Now let us suppose that R is commutative and we have a free R-module F and a submodule
K where F = Rx1 + · · · + Rxr + Ry1 + · · · + Ryk and K = d1 Ry1 + · · · + dk Ryk . Then it is easy
to see that
F/K ∼ =R| ⊕ ·{z
· · ⊕ R} ⊕R/Rd1 ⊕ · · · ⊕ R/Rdk
r copies
since Rxi ∼
= R and Ryi /di Ryi ∼
= R/Rdi .
4.4. Modules over a PID 113
If we have an arbitrary commutative ring R, K may not have an appropriate form. Moreover,
no change of basis may be possible which changes K to the appropriate form. However, in case
R is a PID, it is always possible to choose a basis for F and a basis for K (which will also be
free) so that the above situation exists. In addition, it will be possible to choose d1 , . . . , dk so that
d1 |d2 , . . . , dk−1 |dk . This will yield the desired structure theorem for finitely generated modules
over R.
The proof will consist of two stages.
Stage I. We prove an appropriate theorem about m × n matrices over a PID R.
Stage II. We show that theorem about m × n matrices proved in Stage I can be translated into a
theorem about modules—namely the structure theorem for modules over a PID.
We shall now prove a theorem which says that any m × n matrix [A] over a PID R can be
transformed to a diagonal matrix by a transformation
[A] → [P ][A][Q]
Remark. Passing from A to EA as in (3), (4) and (5) of the above theorem are called elementary
row transformations of A. Elementary column transformations of A are defined similarly.
114 4. Modules and Noetherian Rings
a b
Theorem 4.4.5. Let R be a PID and A = ∈ M2 (R). Then there exist P, Q ∈ GL2 (R) such
c d
that
e 0
P AQ =
0 f
where e = gcd(a, b, c, d) and e | f .
Proof.We first
claim
p q
(∗) if ∈ M2 (R), either p = gcd(p, q, r, s), or there exist P1 , Q1 ∈ GL2 (R) such that Rp1 ⊃
r s
Rp and
p q p1 q1
P1 Q = .
r s 1 r1 s 1
If a1 = gcd(a1 , b1 , c1 , d1 ), take P̄ = P, Q̄ = Q and end the process. If not, use (∗) again to choose
P2 , Q2 ∈ GL2 (R) such that Ra2 ⊃ Ra1 and
a1 b1 a 2 b2
P2 Q2 = .
c 1 d1 c 2 d2
Theorem 4.4.6. Let R be a PID and A an m × n matrix over R. Then there exist P ∈ GLm (R)
and Q ∈ GLn (R) such that
d1
d2 0
..
.
P AQ = dr
0
. .
0 .
0
Then a succession of such right multiplications together with elementary transformations (or a
suitable induction) show that we can obtain
AQ = gcd(a1 , . . . , an ) 0 . . . 0 for some Q ∈ GLn (R)
116 4. Modules and Noetherian Rings
as follows:
a1 a2 . . . an ∼ gcd(a1 , a2 ) 0 a3 . . . an
∼ gcd(a1 , a2 ) a3 0 a4 . . . an
∼ gcd(a1 , a2 , a3 ) 0 0 a4 . . . an
∼ · · · ∼ gcd(a1 , . . . , an ) 0 0 . . . 0 .
where W ′ is an (m − 1) × (n − 1) matrix over R and e divides every entry of W ′ . Now we use the
inductive hypothesis again to choose P4 ∈ GLm−1 (R), Q4 ∈ GLn−1 (R) such that
d2
..
′
P4 W Q4 =
.
dr
where d2 | d3 , . . . , dr−1 | dr . We note that since e divides all the entries of W ′ , e | d2 . Hence,
setting e = d1 , we have
e d1
d2 d2
1 e 1 .. ..
= . = .
W′
P4 Q4
dr dr
Before we can use our result on matrices to show that a finitely generated module over a PID
R is a direct sum of cyclic modules, we need to show that every submodule of Rn = R | ⊕ ·{z
· · ⊕ R}
n
is finitely generated. In fact we shall show that every submodule of Rn is free of rank ≤ n.
Theorem 4.4.7. Let R be a ring and let M = Rx1 + · · · + Rxm and N = Ry1 + · · · + Ryn be free
j π
R-modules of rank m and n, respectively. Suppose 0 → M → P → N → 0 is an exact sequence of
R-modules. Then P is a free R-module of rank m + n.
where i is the inclusion map and π is the projection onto the last factor. Let M = (Rx1 ⊕ · · · ⊕
Rxn−1 ) ∩ P ⊆ Rx1 ⊕ · · · ⊕ Rxn−1 and N = π(P ) ⊆ Rxn . Then
i π|P
0→M →P → N →0
Theorem 4.4.9. Let R be a PID and A a finitely generated R-module. Then A is a direct sum of
cyclic R-modules. More precisely,
A∼
=R| ⊕ ·{z
· · ⊕ R} ⊕R/Rd1 ⊕ · · · ⊕ R/Rdk
r
Let M̄ = Rn be the space of n × 1 column vector over R and let N̄ be the R-submodule of M̄
generated by the columns of the n × m matrix
a11 a12 . . . a1m
a21 a22 . . . a2m
[N̄ ] = . .. .. .
.. . .
an1 an2 . . . anm
= M/N ∼
A∼ = α(M )/α(N ) = M̄ /N̄ .
By Theorem 4.4.6, there are matrices [P ] ∈ GLn (R) and [Q] ∈ GLm (R) such that
d1
..
. 0
dk
[P ][N̄ ][Q] =
0
. .
0 .
0 n×m
A∼
= M̄ /N̄ ∼
= [P ]M̄ /[P ]N̄ = M̄ /Ū .
M̄ /Ū ∼
= R/Rd1 ⊕ R/Rd2 ⊕ · · · ⊕ R/Rdk ⊕ R
| ⊕ ·{z
· · ⊕ R} .
n−k
Hence, A ∼
=R| ⊕ ·{z
· · ⊕ R} ⊕R/Rd1 ⊕ R/Rd2 ⊕ · · · ⊕ R/Rdk where d1 | d2 , . . . , dk−1 | dk .
n−k
Example 4.4.1. Let A be the Z-module generated by x, y and z with the relations
x + y = 0 and x − y + 2z = 0.
= {0} ⊕ Z2 ⊕ Zz ∼
= M/U = (Zx ⊕ Zy ⊕ Zz)/(Zx ⊕ Z(2y)) ∼
A∼ = Z2 ⊕ Z
as desired.
Our next goal is to show that the direct summands which occur in the above decomposition
are unique up to isomorphism. This does NOT mean that the actual summands which occur are
unique. For example, suppose R = Z and
A = Z ⊕ Z ⊕ Z3 ⊕ Z3 ∼
= Zx1 ⊕ Zx2 ⊕ Zy1 ⊕ Zy2
120 4. Modules and Noetherian Rings
where Zx1 and Zx2 are free summands and 3y1 = 3y2 = 0. Then we can also write
Zx1 ∼
= Z, Zx2 ∼
= Z, Zy1 ∼
= Z3 , Zy2 ∼
= Z3 .
Z(x1 + 2x2 + y2 ) ∼
= Z, Zx2 ∼
= Z, Z(2y1 + y2 ) ∼
= Z3 , Z(y1 + y2 ) ∼
= Z3 .
Then the summands which occurs are distinct submodules in the two cases, but the isomorphism
classes of summands are the same, namely Z, Z, Z3 and Z3 .
As a preparation for proving uniqueness, we shall need the concept of a torsion element.
Proof. (1) The only problem in showing that τ (M ) is a submodule of M is in showing that τ (M )
is closed under addition. Suppose x, y ∈ τ (M ). Then there exist nonzero elements r, s ∈ R such
that rx = 0 and sy = 0. Since R is an integral domain, rs 6= 0. But
Hence, x + y ∈ τ (M ).
(2) Suppose x + τ (M ) ∈ τ (M/τ (M )). Then there is a nonzero r ∈ R such that
r(x + τ (M )) = rx + τ (M ) = 0 + τ (M ),
Remarks. 1. If R is not an integral domain, then the torsion elements of an R-module M may
not form a submodule, even if R is commutative. For example, let R = F × F where F is a
field and let M = R = F × F . Then the torsion elements of M are all elements of the form
(a, 0) or (0, b). But if a, b 6= 0, (a, b) = (a, 0) + (0, b) is not a torsion element.
2. If R is not commutative, then the torsion elements of an R-module M may not form a
submodule, even if R has no zero divisors. For example, there exists a non-commutative do-
main R (such as the polynomial rings over the quaternion ring) such that for some nonzero
x, y ∈ R, Rx ∩ Ry = 0. In other words, x and y have no common left multiple except 0. For
such an R, x and y, let M = R/Rx as a left R-module. Then
(a) y +Rx is not a torsion element of M , for 0 = r(y +Rx) = ry +Rx, so ry ∈ Rx∩Ry = 0.
Thus, ry = 0, so r = 0.
(b) 1 + Rx is a torsion element of M since x(1 + Rx) = x + Rx = 0. Since 1 + Rx generates
M = R/Rx as a left R-module, it follows that the torsion elements of M do not form
a submodule.
4.4. Modules over a PID 121
Proof. (1) Suppose Rp is not a maximal ideal and let Rp ⊂ Rx ⊂ R. Then p = rx where neither r
nor x is a unit of R, which contradicts the hypothesis that p is irreducible.
(2) Since Rp is a maximal ideal and p ∤ d, Rp + Rd = R. Thus, we can choose r, s ∈ R with
rp + sd = 1. Then for any x ∈ R, x + Rd = (rp + sd)x + Rd = prx + Rd = p(rx + Rd). Hence,
p(R/Rd) = R/Rd.
(3) Since p | d, Rd ⊂ Rp. The multiplication by p defines an onto R-module homomorphism
ϕp : R → Rp/Rd where ϕp (x) = xp + Rd. It is easy to verify that ker ϕp = R(d/p). Hence, we
have the theorem.
A = R/Rd1 ⊕ · · · ⊕ R/Rdk ∼
= R/Re1 ⊕ · · · ⊕ R/Rem = B.
By symmetry, k ≥ m. Hence, m = k.
We now show that R/Rdi ∼ = R/Rei by induction on the number n of prime divisors of d1 · · · dk .
E.g., for d1 . . . dk = pα1 1 . . . pαr r , we have n = α1 + · · · + αr . If n = 1, then k = 1 and A = R/Rd1 ∼=
R/Re1 = B. For inductive step, let p be a prime divisor of d1 and hence of d2 , . . . , dk . Then
A/pA ∼ = R/Rp ⊕ · · · ⊕ R/Rp as above. Suppose p ∤ e1 . Then p(R/Re1 ) = R/Re1 , by Theorem
| {z }
k
4.4.11, so (R/Re1 )/p(R/Re1 ) = 0. Thus,
Now the number of prime factors of (d1 /p) · · · (dk /p) is strictly less than the number of prime
factors of d1 · · · dk . Hence, the inductive hypothesis applies to the isomorphism
Therefore, R/Rdi ∼
= R/Rei for i = 1, . . . , k and the theorem is proved.
A∼
=R| ⊕ ·{z
· · ⊕ R} ⊕R/Rd1 ⊕ · · · ⊕ R/Rdk
r
and
∼ R ⊕ · · · ⊕ R ⊕R/Re1 ⊕ · · · ⊕ R/Rem
B= | {z }
s
are isomorphic R-modules where the di and ei are nonzero nonunits, d1 | d2 , . . . , dk−1 | dk and
e1 | e2 , . . . , em−1 | em . Then r = s, k = m and R/Rdi ∼
= R/Rei for all i = 1, . . . , k.
Also,
| ⊕ ·{z
A/τ (A) = R | ⊕ ·{z
· · ⊕ R} and B/τ (B) = R · · ⊕ R} .
r s
Exercises 4.4. 1. Let R be a commutative ring such that every submodule of a free R-module is free.
Prove that R is a PID.
2. Prove that every finitely generated subgroup of the additive group (Q, +) is cyclic.
3. Let R = Z[x] and let M = (2, x) be the ideal generated by 2 and x, considered as a submodule of R.
Show that {2, x} is not a basis of M . Show that the rank of M is 1 but that M is not free of rank 1.
4. Let R be a PID. Prove that
(a) For any a, b ∈ R, if gcd(a, b) = 1, then R/Rab ∼ = R/Ra ⊕ R/Rb.
(b) If d = pn1 1 . . . pnk k where p1 , . . . , pk are distinct primes and n1 , . . . , nk > 0, then
R/Rd ∼ n
= R/Rpn1 1 ⊕ · · · ⊕ R/Rpk k .
4a + 3b + 3c = 0 and 2a − b + 3c = 0.
Express M as a direct sum of cyclic modules. What are the orders of these modules?
4.5. Noetherian Rings 123
6. Let D be the ring of Gaussian integers Z[i] and M = D3 the free D-module of rank 3. Take K to be
the submodule generated by (1, 2, 1), (0, 0, 5) and (1, −i, 6). Prove that M/K is finite and determine
its order.
7. Let D be the ring of Gaussian integers Z[i]. Determine the structure of D3 /K where K is generated
by f1 = (1, 3, 6), f2 = (2 + 3i, −3i, 12 − 18i) and f3 = (2 − 3i, 6 + 9i, −18i). Show that M = D3 /K is
finite (of order 352512). (The order of the ring Z[i]/(a + bi) is a2 + b2 .)
8. Let D = Q[x] be the polynomial ring in one variable over the field Q of rational numbers. Let K be
the submodule of D3 generated by (2x − 1, x, x2 + 3) and (x, x, x2 ). Find polynomials g1 , . . . , gr such
that D3 /K ∼ = D/(g1 ) ⊕ · · · ⊕ D/(gr ).
Definition. A partially ordered set Σ has the ascending chain condition (a.c.c.) if every chain
s1 ≤ s2 ≤ . . .
eventually breaks off, that is, sk = sk+1 = . . . for some k. This is a finiteness condition in logic
that allows arguments by induction, even when the partially ordered set Σ is infinite. It is easy
to see that a partially ordered set Σ has the a.c.c. if and only if every nonempty subset S ⊂ Σ
has a maximal element: If ∅ = 6 S ⊂ Σ does not have a maximal element, then choose s1 ∈ S,
and for each sk , an element sk+1 with sk < sk+1 , thus contradicting the a.c.c..
Theorem 4.5.1. Let R be a ring. The following three conditions are equivalent.
(i) The set Σ of left ideals of R has the a.c.c.; in other words, every increasing chain of left ideals
I1 ⊂ I2 ⊂ . . . eventually stops, that is Ik = Ik+1 = . . . for some k.
(ii) Every nonempty set S of left ideals has a maximal element.
(iii) Every left ideal I ⊂ R is finitely generated.
If one of these conditions hold, then R is Noetherian (named after E. Noether).
Proof. Here (i) ⇔ (ii) is the purely logical statement about partially ordered sets already dis-
cussed, whereas (i) or (ii) ⇔ (iii) is directly concerned with rings and ideals.
(i) ⇒ (iii). Pick f1 ∈ I, then if possible f2 ∈ I r(f1 ), and so on. At each step, if I 6= (f1 , . . . , fk ),
pick fk+1 ∈ I r (f1 , . . . , fk ). Then by the a.c.c. (i), the chain of ideals
must break off at some stage, and this can only happen if (f1 , . . . , fk ) = I for some k. This proof
involves an implicit appeal to the axiom of choice. It is perhaps cleaner to do (i) ⇒ (ii) purely in
set theory, then argue as follows.
(ii) ⇒ (iii). Let I be a left ideal of R and consider the set S of finitely generated left ideals
contained in I. Then {0} ∈ S , so that S has a maximal element J by (ii). But then J = I,
since any element f ∈ I r J would give rise to a strictly bigger finitely generated left ideal
J ⊂ (J, f ) ⊆ I. S
(iii) ⇒ (i). Let I1 ⊂ I2 ⊂ . . . be an increasing chain of left ideals. Then J = k Ik is again an
ideal. If J is finitely generated then J = (f1 , . . . , fn ) and each fi ∈ Iki , so that setting k = max ki
gives J = Ik and the chain stops.
124 4. Modules and Noetherian Rings
Remarks. 1. Every PID is Noetherian. Hence, we may consider a Noetherian ring as a gener-
alization of a PID.
2. Most rings of interest are Noetherian this is a very convenient condition to work with. At
first sight, more concrete conditions (such as R finitely generated over k or over Z) might
seem more attractive, but as a rule, the Noetherian condition is both more general and more
practical to work with.
3. The descending chain condition (d.c.c.) on a partially ordered set is defined in a similar
way. A ring whose ideals satisfy the d.c.c. is called an Artinian ring. This is also a very
important notion, but is more special: the d.c.c. for rings turns out to be very much stronger
than the a.c.c. (and implies it). We shall discuss this kind of rings in the next section.
Example 4.5.1. Z is Noetherian but not Artinian since Z ⊃ pZ ⊃ p2 Z ⊃ . . . , p prime, is a
decreasing chain which does not stop.
Examples 4.5.2. Here are three examples of non-Noetherian rings. Let k be a field.
1. The polynomial ring k[x1 , . . . , xn , . . . ] in an infinite number of indeterminates is obviously
non-Noetherian.
2. Consider the ring A1 of polynomials in x, y of the form f (x, y) = a + xg(x, y) with a a
constant and g ∈ k[x, y]; that is, f involves no pure power y j of y with j > 0. In other
words,
n X o
A1 = f (x, y) = aij xi y j : i, j ≥ 0 and i > 0 if j 6= 0
= k[x, xy, xy 2 , . . . , xy n , . . . ] ⊂ k[x, y].
It is clear that (x, xy, xy 2 , . . . ) is a maximal ideal of A1 , and is not finitely generated. (It
looks as if it should be generated by x, but, of course, y, y 2 , . . . are not elements of the ring
A1 .) Thus, A1 is not Noetherian.
3. A rather similar example is the ring A2 of polynomials in x, y, y −1 of the form g(x, y) +
xh(x, y, y −1 ); that is,
n X o
A2 = f (x, y) = aij xi y j : i ≥ 0, and j ≥ 0 if i = 0 = k[x, y, x/y, x/y 2 , . . . , x/y n , . . . ].
In this ring x = (x/y) · y, and x/y = (x/y 2 ) · y, etc., so that the element x does not have a
factorization into irreducibles and
(x) ⊂ (x/y) ⊂ (x/y 2 ) ⊂ · · ·
is an infinite ascending chain.
Just as before, it is equivalent to say that any nonempty set of submodules of M has a maximal
element, or that every submodule of M is finitely generated.
Proof. Indeed, if m ∈ M2 , then β(m) ∈ β(M2 ) = β(M1 ), so that there is an n ∈ M1 such that
β(m) = β(n). Then β(m − n) = 0, so that m − n ∈ M2 ∩ ker β = M1 ∩ ker β. Hence, m ∈ M1 .
r
M
Corollary 4.5.5. 1. If Mi are Noetherian modules, i = 1, . . . , r, then Mi is Noetherian.
i=1
2. If R is a Noetherian ring, then an R-module M is Noetherian if and only if it is finitely
generated over R.
3. If R is a Noetherian ring and M is a finitely generated R-module, then any submodule
N ⊂ M is again finitely generated.
4. If R is a Noetherian ring and ϕ : R → B is a ring homomorphism such that B is a finitely
generated R-module, then B is a Noetherian ring. In particular, a homomorphic image of a
Noetherian ring is a Noetherian ring.
Proof. (1) A direct sum M1 ⊕ M2 is a particular case of an exact sequence, so that the previous
proves (1) when r = 2. The case r > 2 follows by an easy induction.
(2) If M is finitely generated then there is a surjective homomorphism Rr → M → 0 for some
r, so that M is a quotient M ∼ = Rr /N for some submodule N ⊂ Rr ; now Rr is a Noetherian
module by (1), so M Noetherian follows by the implication ⇒ of the above theorem. Conversely,
M Noetherian obviously implies M is finitely generated.
(3) This just uses the previous implication: M finitely generated and R Noetherian implies that
M is Noetherian, so that N is Noetherian, which implies that N is a finitely generated R-module.
(4) B is Noetherian as an R-module; but left ideals of B are submodules of B as an R-submodule,
so that B is a Noetherian ring.
The following result provides many examples of Noetherian rings, and is the main motivation
behind the use of the a.c.c. in commutative algebra. Note that in Hilbert’s day, a “basis” of a
module meant simply a family of generators.
Theorem 4.5.6. [Hilbert Basis Theorem] If R is a commutative Noetherian ring, then so is the
polynomial ring R[x].
Proof. We shall prove that any ideal I ⊂ R[x] is finitely generated. For this, define auxiliary sets
Jn ⊂ R by
J0 ⊆ J1 ⊆ J2 ⊆ . . .
is an increasing chain of ideals. Using the assumption that R is Noetherian, we deduce that
Jn = Jn+1 = . . . for some n.
126 4. Modules and Noetherian Rings
For each m ≤ n, the ideal Jm ⊂ R is finitely generated, say Jm = (am,1 , . . . , am,rm ); and by
definition of Jm , for each am,j with 1 ≤ j ≤ rm there is a polynomial fm,j ∈ I of degree m having
the leading coefficient am,j . This allows us to write down a finite set
S = {fm,j }0≤m≤n,1≤j≤rm
of elements of I.
We now claim that S generates I. Indeed, for any polynomial f (x) ∈ I, if f (x) has degree P m
then its leading coefficient
P a is in J m , hence if m ≥ n, then a ∈ J m = J n , so that a = b a
i n,i
withP bi ∈ R and f (x) − bi xm−n fn,i (x) P has degree < m; similarly, if m ≤ n, then a ∈ J m , so that
a= bi am,i with bi ∈ R and f (x) − bi fm,i (x) has degree < m. By induction on m, it follows
that f can be written as a linear combination of the finitely many elements in S. This proves that
any ideal of R[x] is finitely generated.
Proof. The assumption is that B is a quotient of a polynomial ring, B ∼ = R[x1 , . . . , xn ]/I for
some ideal I. Now by Hilbert Basis Theorem and an obvious induction, R Noetherian implies
that so is R[x1 , . . . , xn ], and by Corollary 4.5.5, (4), R[x1 , . . . , xn ] is Noetherian implies that so is
R[x1 , . . . , xn ]/I.
Definition. The Jacobson radical of a ring R is the intersection of all maximal ideals of R and
is denoted by Jac R.
Note that if R is a local ring with unique maximal ideal M , then Jac R = M .
The set of all nilpotent elements in a commutative ring R is an ideal, called the nilradical
of R. It is also clear that every prime ideal in a commutative ring contains the nilradical.
4.6. Artinian Rings 127
Proof. (1) If I is a proper ideal of R, then I is contained in some maximal ideal M of R. Since
J ⊆ M, I ∪ J ⊆ M.
(2) Let a ∈ R be nilpotent. Then an = 0 for some n ∈ N. Since maximal ideals are prime and
an ∈ M , so a ∈ M .
(3) Suppose 1 − rx is not a unit for some r ∈ R and let M be a maximal ideal containing 1 − rx.
Since 1 ∈/ M , rx ∈ / M , so x ∈ / M . But J ⊆ M , it follows that x ∈
/ J. Conversely, assume that
x∈/ J. Then there is a maximal ideal M such that x ∈ / M . Thus, R = (x, M ), so 1 = rx + m for
some r ∈ R and m ∈ M . Hence, 1 − rx = m ∈ M which implies that 1 − rx is not a unit in R.
(4) Assume that M 6= {0} and let n be the smallest positive integer such that M is generated by
n elements, say m1 , . . . , mn . Since M = JM , we have
mn = r1 m1 + · · · + rn mn for some r1 , . . . , rn ∈ J.
Remark. In the special case of a finitely generated module M over a local ring R with unique
maximal ideal J, the quotient M/JM is a vector space over the field R/J. Statement (6) implies
that a basis of M/JM lifts to a minimal set of generators of M . Conversely, every minimal set
of generators of M is obtained in this way, and any two such sets of generators are related by an
invertible matrix with entries in the ring.
Definition. A ring whose ideals satisfy the descending chain condition (d.c.c.), i.e., whenever
I1 ⊇ I2 ⊇ . . . is a decreasing chain of ideals of R, then there is a positive integer m such that
Ik = Im for all k ≥ m, is called an Artinian ring (named after E. Artin).
Clearly, every finite ring is Artinian. Also, it is immediate that every quotient ring of an Artinian
ring is Artinian. Similar to Theorem 4.5.1, we have the following theorem.
Theorem 4.6.2. R is an Artinian ring if and only if every nonempty set S of ideals has a minimal
element.
Lemma 4.6.4. Let M be a maximal ideal of the commutative ring R and suppose that M m = {0}
for some m ∈ N. Then R is Noetherian if and only if R is Artinian.
Lemma 4.6.5. Let R be a commutative ring and P a prime ideal of R. If I and J are ideals of R
such that P ⊇ I ∩ J, then I ⊆ P or J ⊆ P .
R/(Jac R) ∼
= k1 × · · · × kn ,
Proof. (1) Let S be the set of all ideals of R that are the intersection of a finite number of maximal
ideals. By Theorem 4.6.2, S has a minimal element, say M1 ∩ · · · ∩ Mn . Then for any maximal
ideal M , we have
M ∩ M 1 ∩ · · · ∩ Mn = M1 ∩ · · · ∩ Mn ,
so M ⊇ M1 ∩ · · · ∩ Mn . By Lemma 4.6.5, Mi ⊆ M for some i. Since Mi and M are maximal,
Mi = M and hence M1 , . . . , Mn are all maximal ideals of R.
(2) Since Mi + Mj = R for all i 6= j and Jac R = M1 ∩ · · · ∩ Mn , the statement follows from the
Chinese remainder theorem applied to M1 , . . . , Mn .
(3) We first show that J = Jac R is nilpotent. By d.c.c., there is some m ∈ N such that J m = J m+i
for all i ∈ N. Assume that J m 6= {0}. Let S be the set of proper ideals I such that IJ m 6= {0}.
Then J ∈ S . Let I0 be a minimal element of S . Thus, there is some x ∈ I0 such that xJ m 6= {0}.
By minimality of I0 , we have I0 = (x). Since ((x)J)J m = xJ m+1 = xJ m , it follows that (x) = (x)J
by minimality of (x). By Nakayama’s lemma, (x) = {0}, a contradiction. Hence, J m = {0}.
Since am ∈ J m = {0} for all a ∈ J, every element of J is nilpotent. But J contains the
nilradical of R, so these two ideals are equal.
Let P be a prime ideal P of R. Then P contains the nilradical of R, so it contains J. Thus,
P/J is a prime ideal of R/J. By (2), R/J ∼ = k1 × · · · × kn and thus a prime ideal of R/J consists
of the elements that are 0 in one of the components. In particular, such a prime ideal is also a
maximal ideal. Hence, P is maximal as desired.
(4) Let M1 , M2 , . . . , Mn be all the distinct maximal ideals of R and let J = Jac R and J m = {0}
as in (3). Then !m
\n \n
Mim ⊆ Mi ⊆ J m = {0}.
i=1 i=1
4.6. Artinian Rings 129
R∼
= R/M1m × R/M2m × · · · × R/Mnm ,
and each R/Mim is an Artinian ring (because R is) with unique maximal ideal Mi /Mim .
(5) From (4), it suffices to prove that an Artinian local ring is Noetherian. Assume that R is an
Artinian with unique maximal ideal M . Then Jac R = M and M m = {0} for some m ∈ N. Thus,
the desired result follows from Lemma 4.6.4.
Corollary 4.6.7. Any finite commutative ring is a direct product of a finite number of local rings.
Z/nZ ∼
= Z/pa11 Z × Z/pa22 Z × · · · × Z/par r Z.
Each Z/pai i Z is a local ring with unique maximal ideal pi Z/pai i Z for all i ∈ {1, 2, . . . , r}.
Exercises 4.6. 1. Prove that an Artinain integral domain is a field. Hence, Z is not Artinian.
2. Suppose R = F is a field. Prove that an R-module M is Artinian if and only if it is Noetherian if and
only if M is a finite dimensional vector space over F .
130 4. Modules and Noetherian Rings
5 | Field Theory
In Section 2.6, we learn about extensions of a field. Here, we give more details on a construction
of extension fields. We prepare the readers to Galois theory which yields a connection between
field theory and group theory. Applications of Galois theory are provided in proving fundamental
theorem of algebra, finite fields, and cyclotomic fields. We discuss some results on a transcenden-
tal extension in the final section.
n
Y
Definition. We say that f (x) splits in an extension field E if f (x) = c(x − ri ), that is, it is a
i=1
product of linear factors in E[x] and c ∈ F .
We shall first study some facts about the roots of f (x) ∈ F [x] as follows.
Theorem 5.1.1. If f (x) ∈ F [x] and deg f (x) = n ≥ 1, then f (x) can have at most n roots
counting multiplicities in any extension field of F .
Proof. We shall prove the theorem by induction on the degree of f (x). If deg f (x) = 1, then
f (x) = ax + b for some a, b ∈ F and a 6= 0. Then −b/a is the unique root of f (x) and −b/a ∈ F ,
so we are done.
Let deg f (x) = n > 1 and assume that the result is true for all polynomials of degree < n. Let
E be any extension field of F . If f (x) has no roots in E, then we are done. Let r ∈ E be a root
of f (x) of multiplicity m ≥ 1. Then there exists q(x) ∈ E[x] such that f (x) = (x − r)m q(x) and
q(r) 6= 0. Thus, deg q(x) = n − m. By the inductive hypothesis q(x) has at most n − m roots in E
counting multiplicities. Hence, f (x) has at most m+(n−m) roots in E counting multiplicities.
Theorem 5.1.2. [Kronocker] If p(t) ∈ F [t] is irreducible over F , then there exists an extension
field E of F such that [E : F ] = deg p(t) and p(t) has a root in E.
131
132 5. Field Theory
Corollary 5.1.3. If p(t) ∈ F [t] is a nonconstant polynomial, then there exists a finite extension
field E of F containing a root of p(t) and [E : F ] ≤ deg p(t).
Proof. Since F [t] is a UFD, p(t) has an irreducible factor in F [t] say p1 (t). By Theorem 5.1.2, there
exists an extension field E of F such that E contains a root of p1 (t) and [E : F ] = deg p1 (t).
Hence, [E : F ] ≤ deg p(t) and E contains a root of p(t).
Definition. Let F be a field and f (x) a monic polynomial in F [x]. An extension field E of F is
a splitting field of f (x) over F if
f (x) = (x − r1 ) . . . (x − rn )
in E[x] and
E = F (r1 , . . . , rn ),
that is, E is generated by the roots of f (x).
Theorem 5.1.4. Let f (x) be a nonconstant polynomial of degree n. Then there exists an extension
field E of F such that [E : F ] ≤ n! and E contains n roots of f (x) counting multiplicities.
Hence, in E[t], f (x) = c(x − r1 ) . . . (x − rn ) for some c ∈ F and r1 , . . . , rn ∈ E, so that r1 , . . . , rn
are n roots of f (x) in E.
Proof. We shall prove the theorem by induction on the degree of f (x). If deg f (x) = 1, then f (x)
has exactly one root in F and [F : F ] = 1 = 1!.
Let deg f (x) = n > 1 and assume that the theorem is true for the case of polynomials of
degree < n. By Corollary 5.1.3, there exists an extension field E0 of F such that f (x) has a root,
say r ∈ E0 and [E0 : F ] ≤ n. Since r is a root of f (x), f (x) = (x − r)q(x) for some q(x) ∈ E0 [x],
so deg q(x) = n − 1. By the inductive hypothesis, there exists an extension field E of E0 such
that [E : E0 ] ≤ (n − 1)! and E contains n − 1 roots of q(x). Then E is an extension field of F ,
[E : F ] = [E : E0 ][E0 : F ] ≤ n! and E contains n roots of f (x) counting multiplicities.
Corollary 5.1.5. Let F be a field and f (x) a nonconstant polynomial over F of degree n. Then
there exists a splitting field E of f (x) over F . Moreover, [E : F ] ≤ n!.
Proof. We have seen from Theorem 5.1.4 that there exists an extension field E of F such that
f (x) = c(x − r1 ) . . . (x − rn ), for some c ∈ F and r1 , . . . , rn ∈ E, is a product of linear factors in
E[x] and [E : F ] ≤ n!. Hence, E = F (r1 , . . . , rn ) is a desired field.
3. Let F = Q, f (x) = (x2 − 2)(x2 − 3). Since the rational roots of x2 − 2 and x2 − 3 must be
integral, it follows that x2 − 2 and x2 − 3 are irreducible in Q[x]. Form K = Q(r1 ), r1 =
x + (x2 − 2) in Q[x]/(x2 − 2). The elements of K have the form a + br1 , a, b ∈ Q. We
claim that x2 − 3 is irreducible in K[x]. Otherwise, we have rational numbers a, b such that
(a + br1 )2 = 3. Then (a2 + 2b2 ) + 2abr1 = 2 2
√ 3 so that ab = 0 and a + 2b =23. If b = 0 we
2
obtain a = 3 which√ is impossible since 3 is not rational, and if a = 0, b = 3/2. Then
(2b2 ) = 6 and since 6 is not rational, we again obtain an impossibility. Thus, x2 − 3 is
irreducible in K[x]. Now form E = K[x]/(x2 − 3). Then this is a splitting field over Q of
(x2 − 2)(x2 − 3) and [E : Q] = [E : K][K : Q] = 2 · 2 = 4.
4. Let F = Q, f (x) = xp − 1, p a prime. We have xp − 1 = (x − 1)(xp−1 + xp−2 + · · · + x + 1)
and we know that xp−1 + xp−2 + · · · + x + 1 is irreducible in Q[x]. Let E = Q(z) where
z = x + (xp−1 + xp−2 + · · · + x + 1) in Q[x]/(xp−1 + xp−2 + · · · + x + 1). We have 1, z, . . . , z p−1
are distinct. Also (z k )p = (z p )k = 1 so every z k is a root of xp − 1. It follows that xp − 1 =
Q p k
k=1 (x − z ) in E[x].√Thus, E √ is a splitting
√ field over Q of xp − 1 and [E : Q] =√p − 1.
5. Since x − 2 = (x − 2)(x − 2ω)(x − 2ω 2 ) where ω 6= 1 and ω 3 = 1, Q(
3 3 3 3
√
3
2) is not a
3 3
splitting field of f (x) = x − 2 over Q. √A splitting field of f (x) is√E = Q( 2, ω). Since
g(x) = √ x2 + x +√ 1 is irreducible over Q( 3 2) and g(ω) = 0, [E : Q( 3 2)] = 2, so [E : F ] =
3 3
[E : Q( 2)][Q( 2) : Q] = 2 · 3 = 6.
e
6. A splitting field over Z/(p) of xp − 1, e ∈ N, is Z/(p).
Proof. Let fˆ(x) be an irreducible factor of f (x) and let fˆ1 (x) = η(fˆ(x)). Let r ∈ E be a root
of fˆ(x) and let r1 ∈ E1 be a root of fˆ1 (x). Then we have a commutative diagram in which the
vertical arrows are isomorphisms and the horizontal arrows are inclusion maps
F / F [r] /E
O
i
F [x]/fˆ(x)F [x]
η η̂
F1 [x]/fˆ1 (x)F1 [x]
j
F1 / F1 [r1 ] / E1 .
The map j η̂i−1 = η̄ is an isomorphism of fields extending η. Also, η̄(f (x)/(x − r)) = f1 (x)/(x − r)
and E/F [r], E1 /F1 [r1 ] are splitting fields of f (x)/(x − r) and f1 (x)/(x − r1 ), respectively.
Now, by induction on deg f (x), η̄ : F [r] → F1 [r1 ] has an extension to η ∗ : E → E1 and this is
the required extension of η.
Theorem 5.1.7. Assume f (x) has no multiple factors as an element of F [x]. Under the hypothesis
of Theorem 5.1.6, the number of distinct extensions of η : F → F1 to η : E → E1 is at most [E : F ].
Moreover, the number of distinct extensions is equal to [E : F ] if and only if f (x) has distinct roots
in E.
134 5. Field Theory
Proof. Proceeding as in the proof of Theorem 5.1.6, let fˆ(x) be an irreducible factor of f (x), let d
be the degree of fˆ(x), let fˆ1 (x) = η(fˆ(x)), let r1 , . . . , re be the distinct roots of fˆ(x) in E and let
r1′ , . . . , re′ be the roots of fˆ1 (x) in E1 . (Note that e ≤ d and e = d if fˆ1 (x) has no multiple roots,
but this is not always the case.)
Next fix a root r = r1 of fˆ(x). The argument of Theorem 5.1.6 shows that for each root
r1 , . . . , re′ of fˆ1 (x) there is an isomorphism η̄j : F [r] → F1 [rj′ ] extending η, where η̂j (r) = rj′ .
′
F / F [r]
η
F1 / F1 [r ′ ] / E1
j
On the other hand, any isomorphism of F [r] into E1 must carry r into a root of fˆ1 (x), and so must
one of the η̄j . Furthermore, as noted above
the number of roots of fˆ(x) = e ≤ d = [F [r] : F ].
By induction, the number of ways each η̂j can be extended to an isomorphism E → E1 is at most
[E : F [r]]. Thus,
the number of extensions of η : F → F1 to η ∗ : E → E1
≤ e[E : F [r]] ≤ [F [r] : F ][E : F [r]] = [E : F ].
Now we want to answer the question: When is there equality – that is, the number of exten-
sions = [E : F ]?
Looking at the first step above we see that the number of roots of fˆ(x) = e = d = [F [r] : F ] if
and only if fˆ(x) has d = deg fˆ(x) roots – that is if and only if fˆ(x) has distinct roots.
To continue inductively, we now have the set up
F [r] /E
η̂j
F1 [rj′ ] / E1
The key point is that E is the splitting field over F [r] of the polynomial f (x)/(x − r). This
polynomial has no multiple factor so inductively the number of extensions of η̂j to an isomorphism
η ∗ : E → E1 is equal to [E : F [r]] if and only if f (x)/(x−r) has distinct roots. Combining this with
the result for fˆ(x) we get the number of extensions of η : F → F1 to an isomorphism η : E → E1
is equal to [E : F ] if and only if f (x) has distinct roots in E.
Remarks. (1) If f (x) is an irreducible polynomial over a field F and r is a root of f (x) in some
extension field of F , then
F [x]/f (x)F [x] ∼
= F [r].
However, if f (x) = g(x)h(x) where g(x) and h(x) are irreducible polynomials, then by Chinese
remainder theorem
F [x]/f (x)F [x] ∼
= F [x]/g(x)F [x] × F [x]/h(x)F [x]
a direct product of fields. If f (x) = g(x)2 , then F [x]/f (x)F [x] even has nilpotent elements.
In general, E/F arises from a succession of simple extensions
F ⊆ F1 ∼= F [x]/f1 (x)F [x],
∼
F1 ⊆ F2 = F1 [x]/f2 (x)F1 [x],
..
.
Fr−1 ⊆ Fr ∼
= Fr−1 [x]/fr (x)Fr−1 [x] = E.
5.2. Algebraic Closure of a Field 135
We shall see that in some important cases (but not all), the splitting field E/F of the polynomial
f (x) can be achieved as a simple extension F ⊆ F [x]/g(x)F [x] = E, but usually g(x) 6= f (x).
(2) If f (x) and g(x) have the same roots in some extension field E of F (f (x), g(x) ∈ F [x]),
then they have the same splitting field. However, one cannot guarantee that the roots of f (x) are
distinct (or simple, or one fold). The basic example is the polynomial
f (x) = xp − a ∈ F [a]
where F is a field of characteristic p > 0. If r is a root of f (x) in some extension field E of F [a],
then rp = a and the factorization of f (x) in E[x] is
f (x) = xp − a = xp − rp = (x − r)p .
Exercises 5.1. 1. Construct a splitting field over Q of x5 − 2. Find its dimension over Q.
2. Let f (x) = x + x2 + 1. Find the splitting field of f (x) over Q and determine its dimension.
4
3. Let E/F be a splitting field of over F of f (x) and let K be a subfield of E/F . Show that any
monomorphism of K/F into E/F can be extended to an automorphism of E.
4. If f (x) ∈ F [x] has degree n and K is a splitting field of f (x) over F , prove that [K : F ] divides n!.
5. Let F be a field of characteristic p > 0 and let b ∈ F . Show that either xp − b is irreducible in F [x] or
b = ap and xp − b = (x − a)p for some a ∈ F .
Definition. A field F is called algebraically closed if every monic polynomial f (x) of positive
degree with coefficients in F has a root in F .
Proof. Since r is a root, that is f (r) = 0, if and only if x − r is a factor of f (x) in F [x], we have
(i) ⇔ (ii). Next, we show (i) ⇔ (iii). If E is an extension field of F and a ∈ E is algebraic
over F , then [F (a) : F ] is the degree of the minimal polynomial f (x) of a over F , and f (x)
is monic and irreducible. Then a ∈ F if and only if deg f (x) = 1. Hence, E is algebraic over
F and E ⊃ F implies there exist irreducible monic polynomials in F [x] of degree ≥ 2; hence
F is not algebraically closed. Conversely, if F is not algebraically closed, then there exists a
monic irreducible f (x) ∈ F [x] with deg f (x) ≥ 2, Thus, the field F [x]/(f (x)) is a proper algebraic
extension of F .
We recall that (Corollary 2.6.7) if E is an extension field of the field F , then the set of elements
of E that are algebraic over F constitutes a subfield A of E/F (that is, a subfield of E containing
F ). Evidently E = A if and only if E is algebraic over F . At the other extreme, if A = F , then
F is said to be algebraically closed in E. In any case A is algebraically closed in E, since any
element of E that is algebraic over A is algebraic over F and so is contained in A. This result
shows that if a field F has an algebraically closed extension field, then it has one that is algebraic
over F .
136 5. Field Theory
For example, assuming the truth of the fundamental theorem of algebra (Theorem 5.5.6), that
C is algebraically closed, it follows that the field of A of algebraic numbers is an algebraic closure
of Q, and thus A is algebraically closed.
We proceed to prove the existence and uniqueness up to isomorphism of an algebraic closure
of any field F . For a countable F a straightforward argument is available to establish these results.
We begin by enumerating the monic polynomials of positive degree as f1 (x), f2 (x), . . . . Evidently
this can be done. We now define inductively a sequence of extension fields beginning with F0 = F
and letting Fi be a splitting field over Fi−1 of fi (x). The construction of such splitting fields was
given at the end of the previous section. It is clear that every Fi is countable,
S so we can realize all
of these constructions in some large set S. Then we can take E = Fi in the set. Alternatively
we can define E to be a direct limit of the fields Fi . It is easily seen that E is an algebraic closure
of F . We showed that (Theorem 5.1.6) there exists an isomorphism of K1 /F onto K2 /F . This
can be used to prove the isomorphism theorem for algebraic closures of a countable field by a
simple inductive argument.
The pattern of the proof sketched above can be carried over to the general case by using
“transfinite induction”. This is what was done by E. Steinitz, who first proved these results. There
are several alternative proofs available that are based on Zorn’s lemma. We shall give one that
makes use of the following lemma.
Lemma 5.2.2. If E is an algebraic extension of a field F , then the cardinality of E cannot exceed
the cardinality of F [x].
Proof. Let S be the set of all ordered pairs (f, α) where f (x) ∈ F [x] is nonzero and α ∈ E with
f (α) = 0. Since for each polynomial f (x), the number of α such that (f, α) lies in S is finite, we
have |S| ≤ |F [x]|ℵ0 = |F [x]|. On the other hand, E maps injectively into S via α 7→ (fα , α) where
fα is the minimal polynomial of α, and thus |E| ≤ |S|.
Recall that |F [x]| = |F |ℵ0 . If F is infinite, then |F [x]| = |F | and it follows that |E| = |F |.
When F is finite, F [x] is countable, and hence E is either finite or countably infinite.
Proof. There are only countably many polynomials in Q[x]. Since R is uncountable, the above
lemma guarantees that R is not algebraic over Q.
Proof. We first embed F in a set S in which we have a lot of elbow room. Precisely, we assume
that |S| > |F | if F is infinite and that S is uncountable if F is finite. We now define a set Λ whose
elements are (E, +, ·) where E is a subset of S containing F and +, · are binary compositions in
E such that (E, +, ·) is an algebraic extension field of F . We partially order Λ by declaring that
(E, +, ·) > (E ′ , +′ , ·′ ) if E is an extension field of E ′ . By Zorn’s lemma there exists a maximal
element (E, +, ·). Then E is an algebraic extension of F . We claim that E is algebraically closed.
Otherwise we have a proper algebraic extension E ′ = E(a) of E. Then |E ′ | < |S|, so we can
define an injective map of E ′ into S that is the identity on E and then we can transfer the
addition and multiplication on E ′ to its image. This gives an element of Λ bigger than (E, +, ·).
This contradiction shows that E is an algebraic closure of F .
5.2. Algebraic Closure of a Field 137
Next we take up the question of uniqueness of algebraic closures. It is useful to generalize the
concept of a splitting field of a polynomial to apply to sets of polynomials.
Definition. If Γ = {fα (x)} is a set of monic polynomials with coefficients in F , then an exten-
sion field E/F is called a splitting field over F of the set Γ if
1. every fα (x) ∈ Γ is a product of linear factors in E[x] and
2. E is generated over F by the roots of the fα (x) ∈ Γ.
It is clear that if E is a splitting field over F of Γ, then no proper subfield of E/F is a splitting
field of Γ and if K is any intermediate field, then E is a splitting field of Γ. Since an algebraic
closure E of F is algebraic, it is clear that E is a splitting field over F of the complete set of
monic polynomials of positive degree in F [x]. The isomorphism theorem for algebraic closures
will therefore be a consequence of a general result on isomorphisms of splitting fields that we
shall now prove. Our starting point is the following result, which is Theorem 5.1.6.
Let η : a 7→ ã be an isomorphism of a field F onto a field F̃ , f (x) ∈ F [x] be monic of positive
degree, f˜(x) the corresponding polynomial in F̃ [x] (under the isomorphism, which is η on F and
sends x 7→ x), and let E and Ẽ be splitting fields over F and F̃ of f (x) and f˜(x), respectively.
Then η can be extended to an isomorphism of E onto Ẽ.
We shall now extend this to a set of polynomials.
Proof. The proof is a straightforward application of Zorn’s lemma. We consider the set of exten-
sions of η to monomorphisms of subfields of E/F into Ẽ/F̃ and use Zorn’s lemma to obtain a
maximal one. This must be defined on the whole E, since otherwise we could get a larger one
by applying the result quoted to one of the polynomials fα (x) ∈ Γ. Now if ζ is a monomorphism
of E into Ẽ such that ζ|F = η, then it is clear that ζ(E) is a splitting field over F̃ of Γ̃. Hence,
ζ(E) = Ẽ and ζ is an isomorphism of E onto Ẽ.
Theorem 5.2.6. Any two algebraic closures of a field F are isomorphic over F .
From now on we shall appropriate the notation F̄ for any determination of an algebraic closure
of F . If A is any algebraic extension of F , its algebraic closure Ā is an algebraic extension of A,
hence of F , and so Ā is an algebraic closure of F . Consequently, we have an isomorphism of Ā/F
into F̄ /F . This maps A/F into a subfield of F̄ /F . Thus, we see that every algebraic extension
A/F can be realized as a subfield of the algebraic closure F̄ /F .
Exercises 5.2. 1. No finite field F is algebraically closed. [Hint. If F = {0, 1, a2 , . . . , an }, consider the
polynomial 1 + x(x − 1)(x − a2 ) . . . (x − an ) ∈ F [x].]
2. Let E be an algebraic extension of a field F and A an algebraic closure of F . Show that E/F is
isomorphic to a subfield of A/F . [Hint. Consider the algebraic closure Ā of A and note that this is
an algebraic closure of F .]
138 5. Field Theory
Definition. Let R be an integral domain and f (x) ∈ R[x]. If α is a root of f (x), then there
exist m ∈ N and g(x) ∈ R[x] such that f (x) = (x − α)m g(x) and g(α) 6= 0. m is called the
multiplicity of the root α of f (x) and if m > 1, α is called a multiple root of f (x).
We record the straightforward properties of the derivative of polynomials in the next lemma.
Lemma 5.3.1. If f (x) and g(x) are polynomials over an integral domain R and c ∈ R, then
1. (cf (x))′ = cf ′ (x),
2. (f (x) + g(x))′ = f ′ (x) + g ′ (x),
3. (f (x)g(x))′ = f (x)g ′ (x) + f ′ (x)g(x),
4. ((f (x))n )′ = n(f (x))n−1 f ′ (x) where n ∈ N.
Definition. Let F be a field. A polynomial f (x) ∈ F [x] is separable if every root (in some
splitting field over F ) of its irreducible factor is not a multiple root. If E is an extension of F
and α ∈ E is algebraic over F , then α is separable over F if its minimal polynomial over F is
separable.
Suppose that F is a field of characteristic zero and f (x) is a monic irreducible polynomial over
F , say f (x) = a0 + a1 x + · · · + an−1 xn−1 + xn . Then f ′ (x) = a1 + 2a2 x + · · · + nxn−1 . The key point
is that n 6= 0, so f ′ (x) 6= 0. Since deg f ′ (x) < deg f (x) and f (x) is irreducible, f (x) and f ′ (x) are
relatively prime, so all roots of f (x) are simple. Thus, we have shown:
Theorem 5.3.3. Let F be a field of characteristic zero. Then every polynomial f (x) ∈ F [x] is
separable.
Theorem 5.3.5. Let F be a field of characteristic p > 0. Then F is perfect if and only if F = F p .
We shall end this section by proving the “primitive element theorem” which is a classic of field
theory. We first recall that an extension field E of a field F is said to be a simple extension of F
if E = F (α) for some α ∈ E. Such an element α is called a primitive element.
Theorem 5.3.7. If F is a field and G is a finite subgroup of the multiplicative group of nonzero
elements of F , then G is a cyclic group. In particular, the multiplicative group of all nonzero
elements of a finite field is cyclic.
Proof. If G = {1}, then G is cyclic. Assume that G 6= {1}. Since G is a finite abelian group,
G∼
= Z/(m1 ) ⊕ · · · ⊕ Z/(mk )
k
X
where k ≥ 1, m1 > 1 and m1 | · · · | mk . Since mk ( Z/(mi )) = 0, u is a root of the polynomial
i=1
xmk − 1 ∈ F [x] for all u ∈ G. By Theorem 5.1.1, this polynomial has at most mk distinct roots in
F , we must have k = 1 and G ∼ = Z/(m1 ) which is a cyclic group.
Theorem 5.3.8. [Primitive Element Theorem] Let E be a finite separable extension of a field F .
Then there exists α ∈ E such that E = F (α). That is, a finite separable extension of a field is a
simple extension.
Proof. If F is a finite field, then E is also finite. Let α be a generator for the cyclic group of all
nonzero elements of E under multiplication. Clearly, E = F [α], so α is a primitive element in this
case.
We now assume that F is infinite and prove our theorem in the case that E = F (β, γ). The
induction argument from this to the general case is obvious. Let mβ,F (x) and mγ,F (x) be the
minimal polynomials over F of β and γ, respectively. Assume that mβ,F (x) has distinct roots β =
β1 , . . . , βn and mγ,F (x) has distinct roots γ = γ1 , . . . , γm in F̄ where all roots are of multiplicity 1,
since E is a separable extension of F . Since F is infinite, we can find a ∈ F such that
βi − β
a 6=
γ − γj
for all i and j, with j 6= 1. That is, a(γ − γj ) 6= βi − β. Letting α = β + aγ, we have α = β + aγ 6=
βi + aγj , so
α − aγj 6= βi
for all i and all j 6= 1. Consider h(x) = mβ,F (α − ax) ∈ F (α)[x]. Now, h(γ) = mβ,F (β) = 0.
However, h(γj ) 6= 0 for j 6= 1 by construction, since the βi were the only roots of mβ,F (x). Hence,
h(x) and mγ,F (x) have a common factor in F (α)[x], namely the minimal polynomial of γ over
F (α), which must be linear, since γ is the only common root of mγ,F (x) and h(x). Thus, γ ∈ F (α),
and therefore β = α − aγ is in F (α). Hence, F (β, γ) = F (α).
Exercises 5.3. 1. Suppose that F ⊆ K ⊆ E and that E is separable extension of F . Prove that E is
separable over K and K is separable over F .
2. Let F be of characteristic p and let a ∈ F . Show that f (x) = xp − x − a has no multiple roots and
f (x) is irreducible in F [x] if and only if a 6= cp − c for any c ∈ F .
5.4. Automorphisms of Fields and Galois Theory 141
√
3. Find a primitive element of Q(i, 3 2) over Q.
4. Let K = F25 be the field with 5 elements and let F = Z/(5) be the prime subfield of K. Determine
the cardinalities of the following two sets.
(a) The set of elements of K which generate K as a field over F .
(b) The set of elements of K which generate the group of nonzero elements of K as an abelian group
under multiplication.
5. Let F be a field and let F be its algebraic closure. If a monic polynomial p(x) ∈ F [x] is irreducible
over F and has distinct roots α1 , α2 , . . . , αk ∈ F , prove that the multiplicities of αj are equal, that is,
p(x) = (x − α1 )m (x − α2 )m . . . (x − αk )m
for some m ∈ N.
Aut F (t) ∼
= GL2 (F )/F × = PGL2 (F ),
where F × is the set of matrices aI, a 6= 0.
142 5. Field Theory
5. If F is a subfield of K, let
The group structure of Aut F F (x, y) is known, but very complicated. For n ≥ 3, almost
nothing is known about Aut F F (x1 , . . . , xn ).
The above examples show that Aut F is in general very complicated and probably impossible
to describe. Galois theory proceeds in a different direction. One takes a subgroup H of Aut F —we
shall be almost concerned with finite H—and looks the set
1⊆K⊆H
F ⊇ F K ⊇ F H.
The fundamental result of Galois theory is that of F is separable over F H , then there is a one-
to-one correspondence between subgroups of H and subfields of F which contain F H . Such
correspondences are inclusion reversing and are called “Galois correspondences”.
Definition. Let E be an extension field of a field F . The Galois group of E over F denoted by
Gal(E/F ) is the group
{ϕ ∈ Aut E : ϕ(a) = a for all a ∈ F }.
Let G be a subgroup of Aut E where E is a field. Then the field of G-invariant of E or the
fixed field of G acting on E is the field
It is denoted by E G or Inv G.
We shall now apply these ideas to splitting fields. Using the present terminology, Theorem
5.1.7 can be restated as follows. If E is a splitting field over F of a polynomial f (x), then
Gal(E/F ) is finite and we have the inequality |Gal(E/F )| ≤ [E : F ]. Moreover, |Gal(E : F )| =
[E : F ] if f (x) has distinct roots. We therefore have the following important preliminary result.
Lemma 5.4.2. Let E/F be a splitting field of a separable polynomial contained in F [x]. Then
|Gal(E/F )| = [E : F ].
Our next attack will be from the group side. We begin with an arbitrary field E and any finite
group of automorphisms G acting in E. Then we have the following
Lemma 5.4.3. [Artin] Let G be a finite subgroup of Aut E and let F = E G . Then
[E : F ] ≤ |G|.
5.4. Automorphisms of Fields and Galois Theory 143
u1 x1 + · · · + un+1 xn+1 = 0,
This matrix has rank ≤ n, so there is a nonzero (n + 1) × 1 vector ~v = (v1 , . . . , vn+1 )t with entries
in E such that M~v = ~0(n+1)×1 . We wish to find such a vector where entries lie in F . Among
all such vectors with entries in E, choose one in which the number of nonzero coordinates, r, is
minimal. By renaming the elements x1 , . . . , xn+1 , we may suppose that the nonzero coordinates
are the first r of them; by multiplying the vector by vr−1 we may suppose that the last nonzero
coordinate is equal to 1. Thus,
Since M (~v − h(~v )) = ~0 and ~v − h(~v ) has at most r − 1 nonzero entries, ~v − h(~v ) = ~0 by the minimal
choice of r. This means that for all h ∈ G and i = 1, . . . , r − 1, we have h(vi ) = vi . Thus, all the vi
lie in E G = F and (u1 , . . . , un+1 ) = (v1 , . . . , vr−1 , 0, . . . , 0) is a set of elements of F which satisfies
(∗).
Recall that an algebraic extension field E of a field F is a separable extension if the minimal
polynomial of every element of E is separable.
This is equivalent to saying that E contains a splitting field for the minimal polynomial of
every element of E.
144 5. Field Theory
Definition. Normality plus separability, called a Galois extension, mean that every irreducible
polynomial of F [x] which has a root in E is a product of distinct linear factors in E[x].
Also, by the results of the last section, if E is algebraic over F , then E is necessarily separable
over F if the characteristic is zero or if the characteristic is p > 0 and F p = F .
We are now ready to derive our main results, the first of which gives two abstract character-
izations of splitting fields of separable polynomials and some important additional information.
We state this as
Theorem 5.4.4. Let E be an extension field of a field F . Then the following conditions on E/F
are equivalent.
(i) E is a splitting field over F of a separable polynomial f (x).
(ii) F = E G for some finite group G of automorphisms of E.
(iii) E is finite dimensional Galois (normal and separable) over F .
Moreover, if E and F are as in (i) and G = Gal(E/F ), then F = E G and if G and F are as in (ii),
then G = Gal(E/F ).
Proof. (i) ⇒ (ii). Let G = Gal(E/F ). Then E G is a subfield of E containing F . Also it is clear that
E is a splitting field over E G of f (x) as well as over F and G = Gal(E/E G ). Hence, by Lemma
5.4.2, |G| = [E : F ] and |G| = [E : E G ]. Since E ⊇ E G ⊇ F , we have [E : F ] = [E : E G ][E G : F ].
Hence, [E G : F ] = 1, and so E G = F . We have prove also that F = E G for G = Gal(E/F ), which
is the first of the two supplementary statements.
(ii) ⇒ (iii). By Artin’s lemma, [E : F ] ≤ |G|, and so E is finite dimensional over F . Let f (x)
be an irreducible polynomial in F [x] having a root r in E. Let {r = r1 , r2 , . . . , rm } be the orbit of
r under the action of G. Thus, this is the set of distinct elements of the form σ(r), σ ∈ G. Hence,
if σ ∈ G, then the set {σ(r1 ), σ(r2 ), . . . , σ(rm )} is a permutation of {r1 , r2 , . . . , rm }. We have
f (r) = 0 which implies that f (ri ) = 0. Then Qmf (x) is divisible by x − ri , and since the ri , 1 ≤ i ≤ m,
are distinct, f (x) is divisible by g(x) = i=1 (x − ri ). We now apply to g(x) Q the automorphism
m
of
Qm E[x], which sends x → x and a → σ(a) for a ∈ E. This gives σg(x) = i=1 (x − σ(ri )) =
i=1 (x − ri ) = g(x). Since this holds for every σ ∈ G we see that the coefficients of g(x)
are G-invariant.QHence, g(x) ∈ F [x]. Since we assumed f (x) irreducible in F [x] we see that
f (x) = g(x) = (x − ri ), a product of distinct linear factors in E[x]. Thus, E is separable and
normal over F and (iii) holds.
(iii) ⇒ (i). Since we are given that [E : F ] < ∞ we can write E = F (r1 , r2 , . . . , rk ) and each
ri is algebraic over F . Let fi (x) be the minimal polynomial of ri over F . Then the hypothesis Q
implies that fi (x) is a product of distinct linear factors in E[x]. It follows that f (x) = ki=1 fi (x)
is separable and E = F (r1 , r2 , . . . , rk ) is a splitting field over F of f (x). Hence, we have (i).
It remains to prove the second supplementary statement. We have seen that under the hy-
pothesis of (ii) we have [E : F ] ≤ |G|, and that since (i) holds, we have |Gal(E/F )| = [E : F ].
Since G ⊆ Gal(E/F ) and |G| ≥ [E : F ] = |Gal(E/F )|, equivalently G = Gal(E/F ).
Corollary 5.4.5. If E/F is the splitting field of f (x) ∈ F [x] and r1 , . . . , rn are distinct roots of
f (x) in E, then G = Gal(E/F ) may be identified with a subgroup of Sn , the group of permutations
of {r1 , . . . , rn }. However, it is not always the case that Gal(E/F ) is the full group of permutations
of the roots of f (x).
Example 5.4.2. If K is a field, then the polynomial ring K[x1 , . . . , xn ] is an integral domain.
The quotient field of K[x1 , . . . , xn ] is denoted by K(x1 , . . . , xn ) and is called the field of rational
functions in x1 , . . . , xn over K. In the field extension
K ⊂ K(x1 , . . . , xn )
each xi is easily seen to be transcendental over K. In fact, every element of K(x1 , . . . , xn ) not in
K itself is transcendental over K (Prove!).
Let Sn be the symmetric group on n letters. A rational function ϕ ∈ K(x1 , . . . , xn ) is said to
be symmetric in x1 , . . . , xn over K if for every σ ∈ Sn ,
ϕ(x1 , x2 , . . . , xn ) = ϕ(xσ(1) , xσ(2) , . . . , xσ(n) ).
Trivially, every constant polynomial is a symmetric function. More generally, the elementary
symmetric functions in x1 , . . . , xn over K are defined to be the polynomials:
n
X
e1 = x 1 + x 2 + · · · + x n = xi ;
i=1
X
e2 = xi xj ;
1≤i<j≤n
..
.
X
ek = x i1 x i2 . . . x ik ;
1≤i1 <···<ik ≤n
..
.
en = x 1 x 2 . . . x n .
The verification that the ei are indeed symmetric follows from the fact that they are simply the
coefficients of t in the polynomial p(t) ∈ K[x1 , . . . , xn ][t], where
p(t) = (t − x1 )(t − x2 ) . . . (t − xn ) = tn − e1 tn−1 + e2 tn−2 − · · · + (−1)n−1 en−1 t + (−1)n en .
If σ ∈ Sn , then the assignments xi 7→ xσ(i) , i = 1, 2, . . . , n and
f (x1 , . . . , xn )/g(x1 , . . . , xn ) 7→ f (xσ(1) , . . . , xσ(n) )/g(xσ(1) , . . . , xσ(n) )
define a K-automorphism of the field E = K(x1 , . . . , xn ) which will also be denoted σ. The
map Sn → Gal(E/K) given by σ 7→ σ is clearly a monomorphism of groups, whence Sn may be
consider as a subgroup of the Galois group Gal(E/K). Clearly, the fixed field F = E Sn consists
precisely of symmetric functions; that is, the set of all symmetric functions is a subfield of E
containing K. Therefore, by Theorem 5.4.4, E is a Galois extension of F with Galois group
Gal(E/F ) = Sn and dimension |Sn | = n!.
Example 5.4.3. Let K be a field and x1 , x2 , x3 be indeterminates over K, set
e1 = x 1 + x 2 + x 3 , e2 = x 1 x 2 + x 2 x 3 + x 3 x 1 , e3 = x 1 x 2 x 3
and consider the fields
F = K(e1 , e2 , e3 ) ⊆ K(x1 , x2 , x3 ) = E.
The relevant subfields of E are indicated in the diagram
F (x1 ) P
♦♦ PPP
3♦♦♦♦ PPP2
♦♦♦ PPP
♦♦♦ PP
3 2
F = K(e1 , e2 , e3 ) F (x2 ) K(x1 , x2 , x3 ) = E
PPP ♥♥
PPP3 2♥♥♥♥
PPP
PPP ♥♥♥
♥♥♥
F (x3 )
146 5. Field Theory
The fields F (x1 ), F (x2 ) and F (x3 ) are all isomorphic (over F ), but they are distinct subfields of E.
Moreover, E is a splitting field for f (t) = t3 − e1 t2 + e2 t − e3 but F (x1 ), F (x2 ) and F (x3 ) are not.
We know that G = Gal(E/F ) = S3 where S3 is identified with the group of permutations on
3 letters. We next calculate E H when H is a subgroup of G = Gal(E/F ) = S3 . The following is a
diagram of the lattice of subgroups of S3 and there indices.
S3 ❙❍❙❙
❥❥ ❥❥❥❥✈❥✈ ❍❍ ❙❙❙❙
3❥❥❥ ✈✈ ❍❍ ❙❙❙2
❥❥❥ ✈ ✈ ❍❍ ❙❙❙
❥ ✈ 3 ❍❍ ❙❙❙
❥❥❥❥
❥ ✈✈ 3 ❙❙❙
h(12)i ❚❚ h(13)i h(23)i ❦ A3
❚❚❚❚ ❍❍
❍ ✈✈ ❦❦❦❦❦
❚❚❚❚ ❍ ❦
❚❚❚❚ ❍2❍❍❍ 2 ✈✈
✈ ❦❦❦
✈✈❦❦❦❦❦❦ 3
2 ❚❚❚❚❍ ✈
✈❦❦
{(1)}
We have already calculated that E S3 = E G = F and of course E {(1)} = E. It is not hard to verify
that
E h(12)i = F [x3 ], E h(13)i = F [x2 ], E h(23)i = F [x1 ].
and one can verify that [F [∆] : F ] = 2. Thus, we get the following diagrams of all (by Galois
Theory) subfields of E containing F
F = E SP3 ❳❳❳
❢❢ ❢❢ ❢❢❢❢♥♥♥ PPP ❳❳❳❳❳
PPP ❳❳
3 ❢❢❢❢ ♥
❢❢❢❢❢❢❢
♥♥ ♥♥
3
♥
3
PPP ❳❳❳2❳❳❳❳❳❳
❢ ♥♥♥ PP ❳❳❳❳❳
❢❢❢❢❢ ❳
F [x3 ] = E h(12)i h(13)i
F [x1 ] = E h(23)i A3
❳❳❳❳❳ F [x2 ] = E PPP ♥♥ ❣❣❣❣❣
F [∆] = E
❳❳❳❳❳ PPP2 ♥ ❣❣
❳❳❳❳❳ 2 ♥♥ ❣❣❣❣
❳❳❳
P
❳❳❳❳P❳PPP ♥♥♥ ❣❣❣❣❣3❣
2
❳ ♥♥❣♥❣❣❣❣❣❣
E = E {(1)}
The indices are the same as in the lattice diagram for S3 , but inclusions are reversed. Recall
that E is the splitting field of a separable polynomial
for any field in the above diagram. More generally, it is clear that if M/L is a splitting field for
f (t) ∈ L[t] and M ⊇ N ⊇ L, then M/N is a splitting field for f (t), regarded as a polynomial in
N [t].
Furthermore, for each field L in the above diagram, we have L = E H for some subgroup H of
G = S3 and Gal(E/L) = H. On the other hand, things are not so nice for the extensions L/F . For
example, Gal(F [xi ]/F ) = 1 for all i = 1, 2, 3 and Gal(F [∆]/F ) ∼
= Z/(2) = hϕi where the action of
ϕ is ϕ(∆) = −∆. Here ∆2 ∈ F and F [∆] is the splitting field of the polynomial t2 − ∆2 over F , so
it is Galois. However, we may conclude that the fields F [x1 ], F [x2 ] and F [x3 ] are not the splitting
fields of any polynomials over F .
5.4. Automorphisms of Fields and Galois Theory 147
= Z2 ∼
Gal(E A3 /F ) = Gal(F [∆]/F ) ∼ = S3 /A3 = Gal(E/F )/A3 .
Theorem 5.4.6. [Fundamental Theorem of Galois Theory] Let E be a finite dimensional Galois
extension of a field F (i.e., the conditions of Theorem 5.4.4 holds) and let G = Gal(E/F ). Let
Γ = {H}, the set of subgroups of G, and Σ, the set of intermediate fields between E and F (the
subfields of E/F ). Then the map H 7→ E H and K 7→ Gal(E/K), H ∈ Γ, K ∈ Σ, are inverses
to each other. In particular, they are one-to-one correspondences between Γ and Σ. Moreover, the
pairing Γ ↔ Σ has the following properties:
1. H1 ⊇ H2 if and only if E H1 ⊆ E H2 .
2. |H| = [E : E H ] and [G : H] = [E H : F ] = [E H : E G ].
3. H is normal in G if and only if E H is normal over F . In this case,
Gal(E H /F ) ∼
= G/H.
This is the main theorem. Most of our remaining field theory will be consequences of it.
√ √
Example 5.4.4. Let E = Q[ 2, 3] be a splitting field of f (x) = (x2 − √ 2)(x2 − 3). Then E is
Galois over Q. Let G = Gal(E/Q). Then |G| = [E : Q] = 4. Since Q( 2) is a splitting field
of x2√− 2, it is √ Galois over Q and its Galois group consists of 2 elements, namely σ1 = id and
σ
√2 : 2
√ →
7 − √ 2. Each √ automorphism extends to an automorphism of E √in two√different √ ways;
√
3 7→√ 3 or 3
√ √ →
7 − 3.
√ Then the√four elements
√ √ of G are
√ τ 1 = id E , τ 2 : 2 →
7 2, 3 →
7 − 3,
τ3 : 2 7→ − 2, 3 7→ 3 and τ4 : 2 7→ − 2, 3 7→ − 3. Each of these elements except τ1 has
order 2. Thus, G ∼ = Z2 × Z2 . Hence, the subgroup-intermediate subfield correspondence for the
fundamental theorem of Galois theory is shown in the lattice diagrams
{idE } ♠ E ◗◗◗◗
②②
❊❊
❊❊ ♠♠♠♠♠ ◗◗◗
②②② ❊❊ ♠♠ ♠♠ ◗◗◗
◗◗◗
②②
② ❊❊ ♠ ♠♠♠ ◗
√ √ √
hτ2 i hτ3 i hτ4 i Q[ 2] = E hτ2 i Q[ 3] = E hτ3 i Q[ 6] = E hτ4 i
❋❋ ◗◗◗
❋❋ ①① ◗◗◗ ♠♠♠
❋❋ ①①① ◗◗◗ ♠♠♠♠♠
❋❋ ①① ◗◗◗ ♠
❋ ①① ◗◗ ♠♠♠♠♠
G Q
Exercises 5.4. 1. Let E = F (t) where t is transcendental over F and write any non-zero element
of E as u = f (t)/g(t) where (f (t), g(t)) = 1. Call the maximum of degrees of f and g the degree
of u. Show that if x and y are indeterminates then f (x) − yg(x) is irreducible in F [x, y] and hence
is irreducible in F (y)[x]. Show that t is algebraic over F (u) with minimal polynomial the monic
polynomial which is a multiple in F (u) of f (x) − ug(x). Hence, conclude that [F (t) : F (u)] = 1, and
F (u) = F (t) if and only if deg u = 1. Note that this implies
at + b
u=
ct + d
where ad − bc 6= 0. Therefore, deduce that Gal(E/F ) is the set of maps h(t) 7→ h(u) where u is of
the form indicated.
2. Let F ⊆ K ⊆ E and E Galois over F . Prove that E is Galois over K.
3. Show that every element of K(x1 , . . . , xn ) which is not in K is transcendental over K.
4. Show that in the subgroup-intermediate subfield correspondence given in the fundamental theorem
of Galois theory, the subfield corresponding to the intersection of two subgroups H1 and H2 is the
subfield generated by the composite field E H1 E H2 , the smallest subfield of E generated by E H1
and E H2 , and the intersection of two intermediate fields K1 and K2 corresponds to the subgroup
generated by Gal(E/K1 ) ∪ Gal(E/K2 ).
5. Use the fact that any finite group G is isomorphic to a subgroup of Sn (Cayley’s theorem) to prove
that given any finite group G, there exist fields E and E/F such that Gal(E/F ) = G.
6. Let E = Q(r) where r3 + r2 − 2r − 1 = 0. Verify that r′ = r2 − 2 is also a root of x3 + x2 − 2x − 1 = 0.
Determinep Gal(E/Q). Show that E is normal over Q.
√
7. Let α = 2 + 2 in R, f (x) the minimal polynomial of α over Q and E is a splitting field of f (x)
over Q.
(a) Compute f (x) and [E : Q].
(b) Find G = Gal(E/Q) and draw a lattice diagram for the subgroup-intermediate subfield corre-
spondence for the fundamental theorem of Galois theory.
8. Let (Z/(p))(t) where t is transcendental over Z/(p). Let G be the group of automorphisms generated
by the automorphism of E such that t 7→ t + 1. Determine F = E G and [E : F ].
5.5. Some Consequences of Galois Theory 149
Theorem 5.5.1. Let K be a finite dimensional separable extension of a field F . Then there are
only finitely many fields L such that K ⊇ L ⊇ F .
Proof. Since K/F is finite separable, by primitive element theorem, K = F [α] for some α ∈ K.
Let E be the splitting field of mα,F (x). Then E is Galois over F and E ⊆ K ⊆ F . By fundamental
theorem of Galois theory, the number of intermediate fields between E and F is the number of
subgroups of Gal(E/F ). Hence, the number of intermediate fields between K and F is at most
the number of subgroups of Gal(E/F ).
Remark. If G = Gal(E/F ), then K = E H for some subgroup H of G and the fields L such that
K ⊇ L ⊇ F are in 1-1 correspondence with the subgroups J of G such that G ⊇ J ⊇ H.
The primitive element theorem and the previous theorem both fail for inseparable extensions
as shown in the following example.
Example 5.5.1. Let F be an infinite field of prime characteristic p and let u and v be indetermi-
nates over F . Consider
F (u, v) ⊇ F (up , v p )
It is easy to see that [F (u, v) : F (up , v p )] = p2 . On the other hand, if z ∈ F (u, v), then z p ∈
F (up , v p ), so
[F (up , v p )(z) : F (up , v p )] ≤ p.
Hence, there is no z such that F (u, v) = F (up , v p )(z), that is, no primitive element.
On the other hand, the nonexistence of a primitive element shows that the fields
for α ∈ F , are all distinct. To see this, assume that F (up , v p )(u + αv) = F (up , v p )(u + βv) = E for
some α 6= β in E. Then u + αv and u + βv in E, so
H = {g ∈ G : gs = s}
be the stabilizer of s. Then S can be identified with the set of left cosets
{gH : g ∈ G},
with G acting by left multiplication. Note that the subgroup H depends on the choice of s and
choosing a different s will give a conjugate of H. More precisely, if s ∈ S and x ∈ G, and
H = stabilizer of s = {g ∈ G : gs = s}
150 5. Field Theory
then
xHx−1 = stabilizer of xs = {g ∈ G : g(xs) = xs}.
(If gs = s, then (xgx−1 )(xs) = xs.)
A basic example of this phenomenon is the action of Sn on {1, 2, . . . , n}. The stabilizer of
i ∈ {1, 2, . . . , n} is Sym{1, . . . , i − 1, i + 1, . . . , n} which may be identified with Sn−1 , but Sn−1 has
n conjugates in Sn .
Theorem 5.5.2. Let E be the splitting field over F of a separable polynomial f (x) ∈ F [x] which is
irreducible over F . Then Gal(E/F ) acts transitively on the roots of f (x). Hence, Gal(E/F ) may
be identified with a subgroup of Sym{r1 , . . . , rn } which acts transitively on {r1 , . . . , rn }, the roots
of f (x) in E.
Proof. This is implicit in the proof of Theorem 5.1.6. For, if r and s are roots of f (x) in E, then
F (r) ∼
= F [x]/(f (x)) ∼
= F (s) with r 7→ x + (f (x)) 7→ s
Remarks. 1. The hypothesis that f (x) be irreducible over F is essential. For, example, if
f (x) = f1 (x) . . . fk (x) where f1 (x), . . . , fk (x) are distinct irreducible polynomials, then all
one can say is that Gal(E/F ) permutes the roots of each fi (x) among themselves. It is still
true that Gal(E/F ) can be identified with a subgroup of the group of permutations of the
roots, but not a transitive one.
2. Assume that f (x) is irreducible and separable over F of degree n, E/F is a splitting field
for f (x) over F and r is one root of f (x). Then the fundamental theorem of Galois theory
gives the following picture
E {id E }
F Gal(E/F ).
The basic Theorems 5.1.4 and 5.1.6 give the existence and uniqueness of splitting fields. That
is, if F is a field and f (x) is a monic polynomial in F [x], then
1. A splitting field E for f (x) exists. E is generated over F by the roots of f (x) and f (x) splits
into linear factors in E[x].
2. The splitting field E/F is unique up to isomorphism over F . In other words, if E ′ /F is
another splitting field for f (x) over F , then there is an isomorphism
ϕ : E → E′
which is identity on F .
What does this means if we are searching for the splitting field of some f (x) ∈ Q[x]?
It means that we can realize E as a subfield of C. More precisely, f (x) is a product of linear
factors in C[x], say f (x) = (x−α1 ) . . . (x−αk ) and we can take E to be the field Q(α1 , . . . , αk ) ⊆ C.
This could be very helpful because it allows us to work in a concrete and explicit field.
5.5. Some Consequences of Galois Theory 151
The fundamental theorem of algebra (every f (x) ∈ C[x] is a product of linear factors) is usually
proved in complex analysis and there is also a topological proof. Here we present a proof based
on Galois theory and the intermediate value theorem from real analysis or calculus. We shall start
with some basic results.
Theorem 5.5.3. Let f (x) ∈ R[x] be a polynomial of odd degree. Then f (x) has a root in R.
with ai ∈ R and n is odd. If a = |a0 | + · · · + |an−1 |, then it is easy to see that f (a) > 0 and
f (−a) < 0. By intermediate value theorem (because f (x) is continuous), there exists r ∈ R such
that f (r) = 0.
p
Consider α + βi with α, β ∈ R. If γ = α2 + β 2 , then
p p
( (γ + α)/2 + i (γ − α)/2)2 = α + βi.
Proof. Suppose conversely that [K : C] = 2 and let K = C + Cu for some u ∈ K. Then u satisfies
a polynomial
f (x) = x2 − bx + c
of degree two over C, since 1, u, u2 are linearly dependent over C. The roots of f (x) are
√
−b ± b2 − 4ac
2
which lie in C, since every element of C has a square root in C. Thus, u ∈ C, a contradiction.
Theorem 5.5.6. [Fundamental Theorem of Algebra] Let f (x) ∈ C[x]. Then f (x) is a product of
linear factors in C[x].
Proof. Let ¯: C → C denote the complex conjugation. Then g(x) = f (x)f (x) ∈ R[x]. Let E be a
splitting field for g(x)(x2 + 1) over R and identify C with the subfield of E generated by the roots
of x2 + 1. Since the characteristic is zero, all polynomials are separable, so E is the splitting field
of a separable polynomial. Hence, E is Galois over R by Theorem 5.4.4.
Let G = Gal(E/R), |G| = 2a m, where m is odd, and let P be a Sylow 2-subgroup of G.
Consider the diagram of fields
E❆
✈✈ ❆❆ a
❆❆2
✈✈✈ ❆❆
✈✈ ❆
✈✈
R[i] = C EP
❍❍ ⑥⑥
❍❍ ⑥⑥
❍❍ ⑥
2 ❍❍❍ ⑥⑥ m
⑥⑥
R
152 5. Field Theory
2 2
R C = EH EK E.
The fundamental theorem of algebra was first rigorously proved by Gauss in 1816 (his doctoral
dissertation in 1798 provides a proof using geometric considerations requiring some topological
justification). There was a proof due to Laplace in 1795. However, Laplace’s proof was deemed
unacceptable because he assumed the existence of a splitting field for polynomials (i.e., that the
roots existed somewhere in some field), which had not been established at that time. The elegant
above proof was given by Artin.
Corollary 5.6.2. The field k consists of the solutions to xq − x = 0 in an algebraic closure of Z/pZ
containing k.
Corollary 5.6.3. There is an element α ∈ k such that k = (Z/pZ)[α], that is, k is a simple
extension of the prime field Z/pZ.
Corollary 5.6.4. For each positive divisor r of q − 1(= |k × |) there are exactly φ(r) elements in k ×
of order r.
Corollary 5.6.5. Let p be a prime and d a positive integer. Then, up to isomorphism, there is
exactly one field of order q = pd .
d
Proof. Let E be a splitting field of f (t) = tp − t over Z/pZ in an algebraic closure of Z/pZ. By
d
Theorem 5.1.6, E is unique up to isomorphism. It consists of the roots of tp = t in the algebraic
d
closure of Z/pZ. Thus, |E| is the number of roots of tp − t. Since f ′ (t) = −1, f (t) is separable, so
|E| = pd . Thus, we have constructed a field of order q = pd , namely E, the splitting field of f (t)
over Z/pZ.
5.6. Finite Fields 153
For q = pd , we may write Fq for the (unique up to isomorphism) field of q elements. Also, we
may write Fp for Z/pZ.
Corollary 5.6.6. Given any positive integer d, there exists an irreducible polynomial of degree n
over Fp .
Proof. By Corollary 5.6.3, Fpd = Fp [α] for some α ∈ Fpd . Let f (t) be the minimal polynomial of α
over Fp . Then Fpd = Fp [α] ∼
= Fp [t]/(f (t)) shows deg f (t) = [Fpd : Fp ] = d.
Next, we shall study finite extensions of a finite field. For simplicity, k stands for the finite
field Fq .
Let kn be a degree n field extension of k. If km is an intermediate field of degree m over k,
then kn is a vector space over km , so m divides n. Conversely, any degree m extension of k within
an algebraic closure of k with m | n is a subfield of kn by Corollary 5.6.2 since m | n implies
(q m − 1) | (q n − 1).
Consider the map σ on kn which sends x to xq . From
implies r = n since kn× is cyclic of order q n − 1. Hence, Gal(kn /k) contains the cyclic group hσi of
order n. Since |Gal(kn /k)| ≤ [kn : k] = n, Gal(kn /k) = hσi and so the field kn is Galois over k.
We record this in
Theorem 5.6.7. The field kn is Galois over k with the Galois group Gal(kn /k) cyclic of order n,
generated by the Frobenius’ automorphism σ.
Note that an element x ∈ kn lies in k if and only if it satisfies xq = x, in other words, if and
only if it is fixed by the Frobenius’ automorphism, or equivalently, by the group Gal(kn /k). Using
G = Gal(kn /k), we define two important maps, called trace and norm, denoted by Trkn /k and
Nkn /k , respectively, from kn to k as follows:
X n
X
Trkn /k : x 7→ τ (x) = σ i (x),
τ ∈G i=1
Y n
Y
Nkn /k : x 7→ τ (x) = σ i (x).
τ ∈G i=1
One check easily that the images of trace and norm maps are in k. It is clear that Trkn /k is a
homomorphism from the additive group kn to the additive group k and Nkn /k is a homomorphism
from kn× to k × . Next we investigate their images. We shall first need
Lemma 5.6.8. If E is an extension field of a field F , then the automorphisms in Gal(E/F ) are
E-linearly independent F -linear transformations.
154 5. Field Theory
Q Q
Proof. (1) Since Nkn /k (σ(x)) = ni=1 σ i+1 (x) = ni=1 σ i (x) = Nkn /k (x), so x/σ(x) lies in the
kernel of the norm map for all x ∈ kn× . Further, x/σ(x) = y/σ(y) if and only if xy −1 ∈ k × , hence
the elements x/σ(x) with x ∈ kn× form a subgroup of kn× of order (q n − 1)/(q − 1). Thus, it is equal
to the whole kernel if and only if the norm map is surjective. To see Nkn /k is onto, observe that
n
Y 2 n−1 2 +···+q n−1 n −1)/(q−1)
Nkn /k (x) = σ i (x) = x · xq · xq · · · · · xq = x1+q+q = x(q
i=1
for all x ∈ kn× . Hence, any generator x of kn× has Nkn /k (x) of order q − 1.
(2) Since elements in Gal(kn /k) are k-linear maps, the image P of Trkn /k (kn ) is a vector space
over k, hence Trkn /k (kn ) = 0 or k. If Trkn /k = 0, then ni=1 σi = 0, which is a nontrivial
linear relation among elements of Gal(kn /k), so impossible by Lemma 5.6.8. Therefore, Trkn /k
is surjective. Then its kernel has order q n−1 . Clearly, Trkn /k (σ(x)) = Trkn /k (x) so that kernel
contains x − σ(x) for all x ∈ kn . Further, x − σ(x) = y − σ(y) if and only if x − y ∈ k, so the group
{x − σ(x) : x ∈ kn } has order q n /q, thus is equal to the kernel.
Remark. The Hilbert Theorem 90 for norm and trace maps is usually proved using first cohomol-
ogy group of the Galois group (à la Noether). When the base field is finite, we may use counting
argument, as shown above.
They are in fact given by Trkn /k and Nkn /k of z. More precisely, we have
Proof. We shall prove (1) and (2) under the assumption (2) and leave (1) for the case k(z) being
a proper subfield kn as an exercise. For each τ ∈ Gal(kn /k), 0 = τ (f (z)) = f (τ (z)), hence
τ (z) is also a root of f (x). Further, if τ and τ ′ are two different elements in Gal(kn /k), then
τ (z) 6= τ ′ (z) (otherwise they would agree on k(z) = kn ). This shows that z has n distinct images
under Gal(kn /k) and they are the roots of f (t). Therefore,
and
(−1)n an = the product of roots of f (t) = Nkn /k (z).
This proves (2). For (1), we know that Lz satisfies f (t) = 0. As f (t) is irreducible over k and
[kn : k] = n, f (t) is the characteristic polynomial of Lz . The companion matrix attached to Lz is
0 −an
1 0
−an−1
1 0
−an−2
.. .. ,
. .
..
0 .
1 −a1
which has trace = −a1 and determinant = (−1)n an . This proves (1).
Exercises 5.6. 1. Let k6 = F56 be the field with 15625 elements and let k = F5 be its prime subfield.
(a) Determine the cardinality of the set of elements of k6 which generate k6 as a field over k.
(b) Draw a lattice diagram for the subgroup-intermediate subfield correspondence for the funda-
mental theorem of Galois theory of k6 /k.
2. Let k be a finite field with finite extensions km and kmn of degrees m and mn, respectively. Show
that
Trkmn /k = Trkm /k ◦ Trkmn /km and Nkmn /k = Nkm /k ◦ Nkmn /km .
3. Let z ∈ kn . Suppose k(z) = km is a proper subfield of kn . Prove that
4. (a) (Normal Basis Theorem) There exists an element z ∈ kn such that the set {τ (z) : τ ∈ Gal(kn /k)}
is a basis of kn over k. [Hint: Consider the minimal polynomial of the Frobenius’ automorphism σ.]
(b) For z in (a), we have Trkn /k (z) 6= 0. [Hint: Express an element in kn as a k-linear combination
of {τ (z)}. Then show Trkn /k (kn ) = kTrkn /k (z).]
Theorem 5.7.1. Let K be a field of characteristic 0 and let E be a splitting field of xn − 1 over K.
Then Gal(E/K) is isomorphic to a subgroup of Aut Z/(n) ∼ = (Z/(n))× . In particular, Gal(E/K)
is abelian.
156 5. Field Theory
Proof. Since (xn − 1)′ = nxn−1 6= 0, the roots of xn − 1 (in E) are distinct, say
xn − 1 = (x − 1)(x − α2 ) . . . (x − αn ).
Definition. We call a Galois extension field E/F abelian [cyclic] over F if Gal(E/F ) is abelian
[cyclic].
In the complex numbers C, the nth roots of unity are the powers of
Thus, Q[ω] is the splitting field of xn − 1 over Q, so [Q[ω] : Q] is the degree of the minimal
polynomial of ω over Q. We know that the set U of the nth roots of unity is a cyclic group of
order n under multiplication. Hence, the number of primitive nth roots of 1, that is, the number
of generators of U , is φ(n).
Definition. For a positive integer d and x an indeterminate, the dth cyclotomic polynomial,
Φd (x) is the product
Y
Φd (x) = {(x − ε) : ε is a primitive dth root of unity}.
If η ∈ Gal(Q[ω]/Q) and z is primitive nth root of unity, then η(z) is primitive. Hence,
η(Φn (x)) = Φn (x) and so Φn (x) ∈ Q[x]. It is clear that Φn (x) | (xn − 1) and, in fact, since
any nth root of unity has an order d | n we see that
Y
xn − 1 = Φd (x). (5.7.1)
d|n
5.7. Cyclotomic Extensions 157
Remark. The formula (5.7.1) provides us with an algorithm for calculating the polynomial Φn (x).
To begin with we have
Φ1 (x) = x − 1
and assuming we already know the Φd (x) for proper divisors d of n then (5.7.1) gives us Φn (x).
For example, for a prime p, Φ1 (x)Φp (x) = xp − 1, so we get
Φp (x) = xp−1 + xp−2 + · · · + x + 1.
Then Φ2 (x) = x + 1 and Φ3 (x) = x2 + x + 1, so
x4 − 1
Φ4 (x) = = x2 + 1
Φ1 (x)Φ2 (x)
x6 − 1
Φ6 (x) = = x2 − x + 1
Φ1 (x)Φ2 (x)Φ3 (x)
x12 − 1
Φ12 (x) = = x4 − x2 + 1.
Φ1 (x)Φ2 (x)Φ3 (x)Φ4 (x)Φ6 (x)
Next, we observe that Φn (x) has integer coefficients. This holds for
Qn = 1 and assuming it
holds for every Φd (x), d < n, we have xn −1 = Φn (x)g(x) where g(x) = d|n;d<n Φd (x) is a monic
polynomial with integer coefficients. The division algorithm gives integral polynomials q(x) and
r(x) with deg r(x) < deg g(x) such that xn − 1 = q(x)g(x) + r(x). Since q(x) and r(x) are unique
in Z[x] and xn − 1 = Φn (x)g(x) in Q[x], we see that Φn (x) = q(x) ∈ Z[x].
We shall now prove
Theorem 5.7.2. The nth cyclotomic polynomial Φn (x) has integer coefficients and is an irreducible
polynomial in Q[x].
Proof. Suppose that Φn (x) = h(x)k(x), where h(x), k(x) ∈ Z[x] and h(x) is irreducible in Z[x],
hence, in Q[x] (Gauss’ lemma). We may also assume that h(x) and k(x) are monic and so
deg h(x) ≥ 1. Let p be a prime integer not dividing n and let δ be a root of h(x). Since (p, n) = 1, δ p
is a primitive nth root of unity. Assume that δ p is not a root of h(x). Then δ p is a root of k(x); con-
sequently δ is a root of k(xp ). Since h(x) is irreducible and has δ as a root also, (h(x), k(xp )) 6= 1
and thus h(x) | k(xp ). It follows (as mentioned earlier) that k(xp ) = h(x)l(x), where l(x) is monic
with integral coefficients. Since xn − 1 = Φn (x)g(x), we have xn − 1 = h(x)k(x)g(x). We now
pass to congruences modulo p or, which is the same thing, to equations in (Z/(p))[x]. This gives
xn − 1̄ = h̄(x)k̄(x)ḡ(x) (5.7.2)
where, in general, if f (x) = a0 xm + a1 xm−1 + · · · + am ∈ Z[x], then f¯(x) = ā0 xm + ā1 xm−1 + · · · +
ām , āi = ai + (p) in Z/(p). Similarly, we have k̄(xp ) = h̄(x)¯l(x). Now, using āp = ā for any a ∈ Z,
we see that
f¯(x)p = (ā0 xm + ā1 xm−1 + · · · + ām )p
= āp0 xpm + āp1 xp(m−1) + · · · + āpm
= ā0 xpm + ā1 xp(m−1) + · · · + ām
= f¯(xp )
for any f (x) ∈ Z[x]. Thus, k̄(x)p = k̄(xp ) = h̄(x)¯l(x) which implies that (h̄(x), k̄(x)) 6= 1. Then
(5.7.2) shows that xn − 1̄ has multiple roots in its splitting field over Z/(p). Since the derivative
(xn − 1̄)′ = n̄xn−1 and n̄ 6= 0, we have (xn − 1̄, (xn − 1̄)′ ) = 1̄, contrary to the derivative criterion for
multiple roots. This contradiction shows that δ p is a root of h(x) for every prime p 6 |n. A repetition
of this shows that δ r is a root of h(x) for every integer r prime to n. Since every primitive nth root
of 1 has the form δ r , (r, n) = 1, we see that h(x) is divisible by every x − δ ′ , δ ′ primitive. Hence,
h(x) = Φn (x) and Φn (x) is irreducible in Q[x].
158 5. Field Theory
Proof. (1), (2) and (3) are obvious. To prove (4), recall that by Theorem 5.7.1, Gal(Q[ω]/Q)
is isomorphic to a subgroup of (Z/(n))× . Since [Q[ω] : Q] = φ(n) = |(Z/(n))× |, it must be
isomorphic to all of (Z/(n))× .
Theorem 5.7.3 implies that Gal(Q[ω]/Q) is isomorphic to the multiplicative group Un of units
of the ring Z/(n). If n is a prime then we know that this is a cyclic group of order p−1. Moreover, if
n = pe11 pe22 . . . pess , pi distinct primes, then Un is isomorphic to the direct product of the groups Upei .
In addition, we know the structures of Upe from the knowledge of primitive roots in number theory
as follows.
Theorem 5.7.4. 1. U2 and U4 are cyclic and if e > 3, then U2e is a direct product of a cyclic
group of order 2 and one of order 2e−2 .
2. If p is an odd prime, the multiplicative group Upe of units of Z/(pe ) is cyclic.
= U72 ∼
Gal(Q[ω]/Q) ∼ = Z/(2) × Z/(2) × Z/(6).
Theorem 5.7.5. Let K be a cyclotomic field. Then K is Galois over Q and Gal(K/Q) is abelian.
Proof. Consider Q ⊂ K ⊂ Q[ω] for some nth root of unity ω. By the fundamental theorem of
Galois theory K = Q[ω]H for some subgroup H of G = Gal(Q[ω]/Q) ∼ = (Z/(n))× . Since G is
abelian, H is normal in G, so the fundamental theorem says that K is Galois over Q with Galois
group G/H, an abelian group.
Remark. A deep theorem of Kronecker and Weber says that the converse of Theorem 5.7.5 is
true, namely, “if K is Galois over Q and Gal(K/Q) is abelian, then K is a cyclotomic field, that is,
K ⊂ Q[ω] for some root of unity ω.”
= Z/(70) ∼
= U71 ∼
G = Gal(Q[ω]/Q) ∼ = Z/(2) × Z/(5) × Z/(7).
Let H = Z/(2) × Z/(5) be the subgroup of G of order 10. Then H is normal in G and conse-
quently we have Q[ω]H is a Galois extension over Q of degree [Q[ω]H : Q] = [G : H] = 7 and
= G/H ∼
Gal(Q[ω]H /Q) ∼ = Z/(7).
We now have enough tools to find the Galois groups of splitting fields of irreducible separable
polynomials xn − a. Note that (xn − a)′ = nxn−1 , so xn − a is separable over a field F if and only
if char F 6 |n. In particular, if F contains a primitive nth root of unity, then char F 6 |n.
5.7. Cyclotomic Extensions 159
Theorem 5.7.6. Let F be a field which contains a primitive nth root of unity ω, i.e., char F not
divide n. Let a ∈ F , f (x) = xn − a, E the splitting field for E over F and r a root of f (x) in E.
Then
(1) The factorization of f (x) in E[x] is
and E = F [r].
(2) Let d be the least positive integer such that rd = b ∈ F . Then d divides n and
is the minimal polynomial of r over F where ε = ω n/d , a primitive dth root of unity. In addition,
[E : F ] = d and Gal(E/F ) ∼ = Z/(d). The automorphism α : E → E defined by α(r) = εr
generates Gal(E/F ).
Proof. (1) Since r, ωr, . . . , ω n−1 r are all roots of xn − a, (x − r)(x − ωr) . . . (x − ω n−1 r) must
divide xn − a. Since both polynomials are monic of degree n, they must be equal. Also, ω ∈ F
by hypothesis, so F [r] contains all the roots of xn − a and is generated over F by them. Hence,
E = F [r] by the definition of splitting field.
(2) Since d is the generator of the group {m ∈ Z : rm ∈ F } and n is in this group, d divides n.
Certainly, r is a root of xd − b ∈ F [x]. If xd − b had a proper factor of degree c, 0 < c < d, looking
at its constant term would show that rc ∈ F , contradicting the minimality of d. Thus, xd − b is
irreducible. Hence, [E : F ] = [F [r] : F ] = d, so |Gal(E/F )| = d. On the other hand, one sees that
αi (r) = εi r, so α is an element of Gal(E/F ) of order d. Therefore, Gal(E/F ) = hαi ∼ = Z/(d).
For the sake of clarity, we reformulate Theorem 5.7.6 slightly to emphasize the case where
f (x) is irreducible, which is the important one.
Theorem 5.7.7. Let F be a field which contains a primitive nth root of unity ω and let a ∈ F .
Then xn − a is irreducible if and only if no divisor d of n, d 6= 1, such that a = bd for some b ∈ F .
If xn − a is irreducible and E/F is its splitting field, then [E : F ] = n and Gal(E/F ) ∼= Z/(n).
Example 5.7.3. Let f (x) = xn − p ∈ Q[x] where p is prime. (The essential point is not that p is
prime, but that it is not a proper power.) By Eisenstein’s criterion f (x) is irreducible over Q. If
√
we let r = n p denote the positive real nth root of p and ω = e2πi/n , a primitive nth root of unity,
then the factorization of f (x) in C[x] is
Now let E = Q[r, ωr, . . . , ω n−1 r] be a splitting field for f (x), and let ϕ ∈ Gal(E/Q). Then ϕ
permutes {r, ωr, . . . , wn−1 r} and ϕ is completely defined by its action on the set {r, ωr, . . . , ω n−1 r}.
This gives rise to an embedding
Thus, E is generated over Q by two elements ω and r. We also know that E can be generated over
Q by a primitive element. However, using such an element would not simplify the description of
Gal(E/Q).
160 5. Field Theory
for some 1 ≤ i ≤ n − 1 such that gcd(i, n) = 1 and 0 ≤ j ≤ n − 1. The choice of i and j completely
determines ϕ and it turns out that all of the above choices do determine automorphisms of E.
Thus,
|Gal(E/Q)| = n · φ(n).
To describe Gal(E/Q) more precisely, let Q[ω] = E H , and for 0 ≤ j ≤ n − 1, let Q[ω j r] =
E Kj . Since Q[ω] is Galois over Q, H is normal in Gal(E/Q). Moreover, by Theorem 5.7.7,
H = Gal(E/Q[ω]) = hτ i ∼= Z/(n) is cyclic of order n with generator τ defined by
The group Kj are more difficult to describe explicitly, but they are all conjugate in Gal(E/Q)
and isomorphic as abstract groups to Gal(Q[ω]/Q) ∼ = (Z/(n))× . We have the following diagram
of subgroups of Gal(E/Q) which does not include all subgroups.
Gal(E/Q)
❡❡❡❣❡❣ ■■
n ❡❡ ❡❡❡❣❡❣❡❣❡❣❡❣❡❣❡❣❣❣ rrrr ■■ φ(n)
■■
❡❡ ❣❣ r
❡❡ ❡ ❡ ❡ ❡❡❡❣❡❣❡❣❡❣❣❣❣❣ r rrrr ■■
■■
❡❡❡❡❡ ❣❣❣❣❣
K0 ❨❡❨❨❨❨❨K
❨❨1❨❨❲❨❲❨❲❲❲❲.❲.❲.❲ Kn−1 ▼ H
❨❨❨❨❨❨❲❲❲❲❲ ▼▼▼ tt
❨❨❨❨❨❨❲❲❲❲❲ ▼ ttt
φ(n)
❨ ❨❨❨❨❨❲❨❲❲❲ ▼▼▼▼
❲ tt
tt n
❨❨❨❲❨❲❨❲❨❲❲▼▼ tt
❨❨❲
1
❣❡❣❡❣ Q ❇❇
❡❡❣❡❡
❡❡❡❣❡❣❡❣❡❣❣ ✈✈✈✈
❡❣❡❣❡❣❣❣
n❡❡❡❡❡❡❣ ❇❇ φ(n)
❡❡❡ ❣❣ ✈✈ ❇❇
❡❡❡❡❡❡❡❡❡❣❡❣❡❣❣❣❣❣❣❣ ✈✈
✈ ❇❇
❡❡❡❡❡ ❣❣❣
Q[r] ❨❡❨❨❨❨❨Q[ωr]
❨❨❨❨❨❨❲❲❲❲❲❲.❲.❲. Q[ω n−1 r] Q[ω]
❍❍
❨❨❨❨❨❨ ❲❲❲❲ ❍❍ ④④
❨❨❨❨❨❨❲❲❲❲❲
❨❨❨❨❨❲❨❲❲❲ ❍❍
❍ ④④④
φ(n) ❨❨❨❨❲❨❲❨❲❨❲❲❍❍❍
❲ ④ n
❨❨❨❲❨❲ ④④
E
We conclude this section with the statement of the following theorem on the Galois group of
splitting fields of irreducible separable polynomials xn − a without proof.
Theorem 5.7.8. Let F [ω] be a splitting field for xn − 1 over F where ω is a primitive nth root of
unity. Suppose that a ∈ F and f (x) = xn − a is irreducible over F and let E be a splitting field
for f (x) over F . Let d be the largest divisor of n such that bd = a for some b ∈ F [ω] (possibly
d = 1). Let G = Gal(E/F ) and H = Gal(E/F [ω]). Then H is cyclic of order d and normal in G,
Gal(F [ω]/F ) ∼= G/H is isomorphic to a subgroup of (Z/(n))× and G is isomorphic to a semi-direct
product of H by G/H.
Using the cyclotomic polynomials, we now present the proof of Wedderburn’s theorem as
follows.
Proof. Let D be a finite division ring. Then the center of D, denoted by F , is a finite field (see
Exercises 2.1). Assume that |F | = q. Since D is a vector space over F , |D| = q n for some n ∈ N.
Also, for an element d ∈ D, the set C(d) = {r ∈ D : rd = dr} is a division ring containing F
and |C(d)| = q m for some m ≤ n, which is strictly less than if d ∈/ F . Thus, the class equation
(Corollary 1.2.8) for the multiplicative group D r {0} is
s s
X X qn − 1
q n − 1 = |F r {0}| + [D r {0} : C(di ) r {0}] = q − 1 + ,
q mi − 1
i=1 i=1
where d1 , d2 , . . . , ds represent the conjugacy classes of D r {0} which contains more than one
element and |C(di )| = q mi for some mi < n for all i. Because each (q n − 1)/(q mi − 1) = [D r {0} :
C(di ) r {0}] is an integer, mi is a proper divisor of n. Thus, the quotient
xn − 1
Φn (x)(xmi − 1)
is a polynomial in Z[x]. Substitute q for x, we see that Φn (q) divides (q n − 1)/(q mi − 1). It follows
from the class equation that Φn (q) divides q − 1 because it divides all the other terms. Then
|Φn (q)| ≤ q − 1. On the other hand, since 1 is the closest point, on the unit circle {z ∈ C : |z| = 1},
to the positive integer q, we have that for every primitive nth root of unity ω j ,
|q − ω j | ≥ q − 1 ≥ 1,
and the first inequality is strict unless ω j = 1, that is, unless 1 is a primitive nth root of unity which
means n = 1. So the product |Φn (q)| of the |q − ω j |’s is greater than or equal to q − 1, with equality
only if n = 1. Because |Φn (q)| is both at most q − 1 and at least q − 1, we get |Φn (q)| = q − 1
and hence n = 1. Therefore, |D| = q = |C(D)|, so D = C(D) which implies D is commutative as
desired.
Given a field F and a polynomial p(x) ∈ F [x], we say that p(x) is solvable by radicals over F
if we can find a finite sequence of fields F1 = F (ω1 ), F2 = F1 (ω2 ), . . . , Fk = Fk−1 (ωk ) such that
ω1r1 ∈ F , ω2r2 ∈ F1 , . . . , ωkrk ∈ Fk−1 and all roots of p(x) lie in Fk .
If K is the splitting field of p(x) over F , then p(x) is solvable by radicals over F if we can find
a finite sequence of fields as above such that K ⊆ Fk . An important remark, and one we shall use
later, in the proof of Theorem 5.7.10, is that if such an Fk can be found, we can, without loss of
generality, assume it to be a normal extension of F . We leave it proof as an exercise.
Theorem 5.7.10. [Galois] Let F be a field which contains a primitive nth root of unity for every
positive integer n. If a polynomial p(x) ∈ F [x] is solvable by radical over F , then the Galois group
over F of p(x) is solvable.
Proof. Let K be the splitting field of p(x) over F . Since p(x) is solvable by radicals, there exists a
finite sequence of fields
where ω1r1 ∈ F , ω2r2 ∈ F1 , . . . , ωkrk ∈ Fk−1 and K ⊆ Fk such that Fk is normal over F . As a normal
extension of F , Fk is also a normal of any intermediate fields, hence Fk is a normal extension of
each Fi . Theorem 5.7.6 implies that Fi is a normal extension of Fi−1 and Gal(Fi /Fi−1 ) is abelian
for all i. Thus, by the Galois correspondence, Gal(Fk /Fi ) is a normal subgroup in Gal(Fk /Fi−1 ).
Consider the normal series
Corollary 5.7.11. The general polynomial of degree n ≥ 5 over Q is not solvable by radical.
k−1
Exercises 5.7. 1. Prove the following statements. (a) If p is a prime number, then Φpk (x) = Φp (xp ).
(b) If n > 1 is odd, then Φ2n (x) = Φn (−x).
Φn (xp )
, if p 6 |n,
(c) If p is a prime number, then Φpn (x) = Φn (x)
p
Φn (x ), if p|n.
2πi/18
2. Let ω = e be a primitive 18th root of unity.
(a) Find the minimal polynomial of ω over Q.
(b) Draw a lattice diagram for the subgroup-intermediate subfield correspondence for the funda-
mental theorem of Galois theory of Q[ω]/Q.
3. Give an example of field E containing the field of rational numbers Q such that E is Galois over Q
and Gal(E/Q) is a cyclic group of order five.
4. Let K be a finite separable extension over F and E its normal closure (smallest normal extension
over F containing K).
(a) Prove that [E : F ] is finite. (b) If Gal(E/F ) is abelian, show that K is normal over F .
5. If p(x) is solvable by radicals over F , prove that we can find a finite sequence of fields
F ⊂ F1 = F (ω1 ) ⊂ F2 = F1 (ω2 ) ⊂ . . . ⊂ Fk = Fk−1 (ωk ),
where ω1r1 ∈ F , ω2r2 ∈ F1 , . . . , ωkrk ∈ Fk−1 containing all the roots of p(x) such that Fk is normal
over F .
6. Assume that xp − a, a ∈ Q, is irreducible in Q[x]. Show that the Galois group of xp − a over Q is
isomorphic to the group of transformations of Z/(p) of the form y 7→ ky + l where k, l ∈ Z/(p) and
k 6= 0.
Theorem 5.8.1. If E/F is a finite Galois extension with Galois group G = {1, σ2 , . . . , σn }. Then
{u1 , u2 , . . . , un } is a basis for E/F if and only if
u1 u2 ... un
σ2 (u1 ) σ2 (u2 ) . . . σ2 (un )
det . 6= 0.
..
σn (u1 ) σn (u2 ) . . . σn (un )
Proof. Call the above matrix M and suppose that det M = 0. Since M ∈ Mn (E), there are
α1 , α2 , . . . , αn ∈ E, not all zero, such that
α1 α2 . . . αn M = ~0.
θ = α1 1 + α2 σ2 + · · · + αn σn : E → E.
5.8. Normal Bases 163
Proof. We shall use induction on n. For n = 1, since f (x1 ) has only finitely many roots and F is
infinite, there is αi ∈ F such that f (α1 ) 6= 0. Assume that the statement holds for n, and let
Theorem 5.8.3. Let F be an infinite field and E/F Galois with Galois group G = Gal(E/F ) =
{1, σ2 , . . . , σn }. Suppose that 0 6= f (x1 , . . . , xn ) ∈ F [x1 , . . . , xn ] where x1 , . . . , xn are indetermi-
nates over F . Then there exists u ∈ E such that f (u, σ2 (u), . . . , σn (u)) 6= 0.
Proof. Let {u1 , . . . , un } be a basis for E/F . By Theorem 5.8.1, the matrix
u1 u2 ... un
σ2 (u1 ) σ2 (u2 ) . . . σ2 (un )
M = . ∈ Mn (E)
..
σn (u1 ) σn (u2 ) . . . σn (un )
is an isomorphism. Thus,
Consider E = Q[i] is a Galois extension over Q. Its Galois group is of order two and consists
of the identity map and the complex conjugation. A basis over Q for it is {1, i}. This basis is
not invariant under the Galois action, namely after acting by the complex conjugation, we obtain
{1, −i}. We are showing the existence of a basis for a finite Galois extension which forms a single
orbit under the action of the Galois group. For example, for Q[i], we may use {1 + i, 1 − i}. In the
case of finite fields, this means that each of the basis elements is related to any one of them by
applying the Frobenius’ automorphism repeatedly.
Definition. Let E/F be Galois with Galois group G = Gal(E/F ) = {σ1 , . . . , σn }. A normal
basis for E/F is a basis of the form {σ1 (u), . . . , σn (u)} for some u ∈ E.
Theorem 5.8.4. [Normal Basis Theorem] Let E/F be a Galois extension with Galois group G =
Gal(E/F ) = {σ1 , . . . , σn }. Then E/F has a normal basis.
Proof. We shall assume that F is infinite and leave the finite case as an exercise (see Exercise 5.6).
Let u ∈ E. By Theorem 5.8.1, {σ1 (u), σ2 (u), . . . , σn (u)} is a basis for E/F if and only if
2
σ1 (u) σ1 σ2 (u) . . . σ1 σn (u)
σ2 σ1 (u) σ 2 (u) . . . σ2 σn (u)
2
det .. 6= 0.
.
σn σ1 (u) σn σ2 (u) . . . σn2 (u)
Note that the entries in each row or column of the above matrix, call M , are a permutation of
the elements σ1 (u), . . . , σn (u). In other words, each σi (u) occurs exactly once in each row and
column of M . Thus,
M = σ1 (u)A1 + · · · + σn (u)An
where each Ai is a permutation matrix (a matrix with a single entry 1 in each row and column
and the remaining entries zero). Since det Ai = ±1, we see by inspection that if x1 , . . . , xn are
indeterminates over E
√ √
Exercises 5.8. 1. Determine a normal basis for the field Q( 2, 3) over Q by using the Galois group
in Example 5.4.4.
2. Determine a normal basis for the cyclotomic field Q(e2πi/p ) over Q where p is a prime number.
Remarks. 1. If z is transcendental over F , then F [z] ∼= F [x1 ], so F [z] is not a field and F [z] is
infinite dimensional over F .
2. If z is algebraic over F , then F [z] ∼
= F [x1 ]/(f (x1 )) where f (x1 ) is the minimal polynomial
of z over F . Thus, F [z] = F (z) is a field and F [z] is finite dimensional over F .
Example 5.9.1. Let F ⊂ F (y, z) ⊂ E where y and z are independent indeterminates over F . Then
{y 2 , z 2 } is an algebraically independent set but {y 2 , yz, z 2 } is not (for, if f (x1 , x2 , x3 ) = x1 x3 − x22 ,
then f (y 2 , yz, z 2 ) = 0).
F (X)
purely transcendental
Proof. (1) Let S = {X ⊆ E : X is algebraically independent}. Since the empty set S is alge-
braically independent, S is nonempty. Let {Xα }α∈Λ be a chain in S . Let {z1 , . . . , zn } ⊆ α∈Λ Xα .
Then ∀i, ∃αi ∈ Λ, zi ∈ Xαi . Since {Xα }α∈Λ is a chain, we may rearrange αi so that there exists
j ∈ Λ such that zi ∈ Xαj for all i. Since Xαi is algebraically independent, so is {z1 , . . . , zn }. Thus,
S
α∈Λ Xα is an upper bound of this chain in S . By Zorn’s Lemma, S has a maximal element, say
X. Hence, F (X) is purely transcendental over F . The maximality of X implies that E must be
algebraic over F .
(2) The definition of algebraically independent means that F (X) is purely transcendental over F .
Consider z ∈ E. If z ∈ X ⊂ F (X), then z is algebraic over F (X). If z ∈ / X, the set X ∪ {z} is alge-
braically dependent, so for some n there is a nonzero polynomial f (x1 , . . . , xn , xn+1 ) (x1 , . . . , xn+1
are indeterminates over F ) and a1 , . . . , an ∈ X such that f (a1 , . . . , an , z) = 0. The polynomial
f (x1 , . . . , xn , xn+1 ) cannot be a polynomial in only x1 , . . . , xn , since {a1 , . . . , an } is an algebraically
independent set. Write
Remark. There is no uniqueness for the field F (X). For example, if E = F (t) where t is an
indeterminate, then we can take X = {p(t)/q(t)} where p(t)/q(t) is any element of E which
is not in F . In this case [E : F (p(t)/q(t))] = n where n = max{deg p(t), deg q(t)} (Theorem
5.9.3). However, we shall see shortly that the number of elements in the set X is independent of
particular set X.
Remark. By Theorem 5.9.1, a transcendence basis for E/F exists. It may be empty, which hap-
pens precisely when E is algebraic over F . Also, E is purely transcendental over F if it has a
transcendence base B such that E = F (B).
Theorem 5.9.2. Let F be a subfield of E. Then any two transcendence bases for E/F have the
same cardinality.
For example, an algebraic extension has transcendence degree zero; F (x) has transcendence
degree one over F ; in general, F ((xα )α∈Λ ) has transcendence degree |Λ| over K.
The purely transcendental extension fields E/F , especially those having a finite transcendence
degree, appear to be the simplest type of extension fields. It is clear that such a field is isomor-
phic to the field of fractions F (x1 , . . . , xn ) of the polynomial ring F [x1 , . . . , xn ] in indeterminates
x1 , . . . , xn . Even though these fields look quite innocent, there are difficult and unsolved problems
particularly on the nature of the subfields of F (x1 , . . . , xn )/F . The one case where the situation
is quite simple is that in which E has transcendence degree one. We shall consider this case and
close this chapter.
Let E = F (t), t transcendental, and let u ∈ E, ∈ / F . We can write u = f (t)/g(t) where
f (t), g(t) ∈ F [t] and (f (t), g(t)) = 1. If n is the larger of the degrees of f (t) and g(t), then we can
write
f (t) = a0 + a1 t + · · · + an tn and g(t) = b0 + b1 t + · · · + bn tn ,
ai , bi ∈ F , and either an or bn 6= 0. We have f (t) − ug(t) = 0, so
Theorem 5.9.3. Let E = F (t), t transcendental over F , and let u ∈ F (t), ∈ / F . Write u =
f (t)/g(t) where (f (t), g(t)) = 1, and let n = max{deg f (t), deg g(t)}. Then u is transcendental
over F , t is algebraic over F (u), and [F (t) : F (u)] = n. Moreover, the minimal polynomial of t
over F (u) is a multiple in F (u) of f (x, u) = f (x) − ug(x).
Proof. Put f (x, y) = f (x) − yg(x) ∈ F [x, y], x, y indeterminates. This polynomial in x and y
is of first degree in y and it has no factor h(x) of positive degree since (f (x), g(x)) = 1. Thus,
it is irreducible in F [x, y]. Now t is algebraic over F (u) so if u were algebraic over F , then
t would be algebraic over F , contrary to the hypothesis. Hence, u is transcendental over F .
5.9. Transcendental Extensions 167
Then F [x, u] ∼ = F [x, y] under the isomorphism over F fixing x and mapping u into y and hence
f (x, u) is irreducible in F [x, u]. It turns out that f (x, u) is irreducible in F (u)[x]. Since f (t, u) =
f (t) − ug(t) = 0, it follows that f (x, u) is a multiple in F (u)[x] of the minimal polynomial of t
over F (u). Therefore, [F (t) : F (u)] is the degree in x of f (x, u). This degree is n, so the proof is
complete.
We can determine all of the subfields E/F for E = F (t), t transcendental: These have the
form F (u) for some u. This important result is called the Lüroth’s Theorem.
Theorem 5.9.4. [Lüroth] If E = F (t), t transcendental over F , then any subfield K of E/F ,
K 6= F , has the form F (u), u transcendental over F .
Proof. Let v ∈ K, ∈/ F . Then we have seen that t is algebraic over F (v). Thus, t is algebraic
over K. Let f (x) = xn + k1 xn−1 + · · · + kn be the minimal polynomial of t over K, so the
ki ∈ K and n = [F (t) : K]. Since t is not algebraic over F , some kj ∈ / F . We shall show
that K = F (u), u = kj . We can write u = g(t)/h(t) where g(t), h(t) ∈ F [t], (g(t), h(t)) = 1 and
m = max{deg g(t), deg h(t)} > 0. Then, by Theorem 5.9.3, [E : F (u)] = m. Since K ⊃ F (u) and
[E : K] = n, we evidently have m ≥ n and equality holds if and only if K = F (u). Now t is a root
of the polynomial g(x) − uh(x) ∈ K[x]. Hence, we have a q(x) ∈ K[x] such that
The coefficient ki of f (x) is in F (t), so there exists a nonzero polynomial c0 (t) of least degree such
that c0 (t)ki = ci (t) ∈ F [t] for 1 ≤ i ≤ n. Then c0 (t)f (x) = f (x, t) = c0 (t)xn + c1 (t)xn−1 + · · · +
cn (t) ∈ F [x, t], and f (x, t) is primitive as a polynomial in x, that is, the ci (t) are relatively prime.
The x-degree of f (x, t) is n. Since kj = g(t)/h(t) with (g(t), h(t)) = 1, the t-degree of f (x, t) is
≥ m. Now replace u in (5.9.2) by g(t)/h(t) and the coefficients of q(x) by their expressions in t.
There exist, therefore, ϕ(t) and q(x, t) ∈ F [x, t] such that
Since the coefficients c0 (t), c1 (t), . . . , cn (t) of f (x, t) have no common factor, we know that ϕ(t)
divides q(x, t). Hence, we may assume ϕ(t) = 1. It turns out that there exists a polynomial
q ′ (x, t) ∈ F [x, t] such that
g(x)h(t) − g(t)h(x) = f (x, t)q ′ (x, t).
Since the t-degree of the left-hand side is ≤ m and that of f (x, t) is ≥ m, it follows that this
degree is m and q ′ (x, t) = q ′ (x) ∈ F [x]. Then the right-hand side is primitive as a polynomial in x
and so is the left-hand side. By symmetry the left-hand side is primitive as a polynomial in t also.
Hence, q ′ (x) = q ′ ∈ F . Thus, f (x, t) has the same x-degree and t-degree so m = n, which implies
that K = F (u).
Exercises 5.9. 1. Prove that there is no intermediate field K with Q ⊆ K $ C with C purely transcen-
dental over K.
2. Prove that a purely transcendental proper extension of a field is never algebraically closed.
3. Let E = F (t, v), where t is transcendental over F and v 2 + t2 = 1. Show that E is purely transcen-
dental over F .
168 5. Field Theory
6 | Applications
This chapter demonstrates some aspects in doing research in algebra. In the first section, we see
some applications of module theory, especially a free R-module over commutative rings (§4.2),
to obtain a structure theorem for finite dimensional symplectic spaces over a local ring. In what
follows, we define the symplectic graphs over a commutative ring and study strong regularity,
vertex and arc transitivity and its chromatic number. Although the problems seem combinatorial,
it contains many algebraic topics discussed throughout this book such as group actions (§1.2),
local rings and the structure of a finite commutative ring (§4.6) and the finite fields (§5.6). Here,
we follow the work of the author published in Discrete Mathematics [30] and European Journal
of Combinatorics [31, 32].
β(a~x + b~y , ~z) = aβ(~x, ~z) + bβ(~y , ~z) and β(~x, a~y + b~z) = aβ(~x, ~y ) + bβ(~x, ~z)
0 = β(~x + ~y , ~x + ~y ) = β(~x, ~x) + β(~y , ~x) + β(~x, ~y ) + β(~y , ~y ) = β(~y , ~x) + β(~x, ~y )
for all ~x, ~y ∈ V . That is, any symplectic bilinear form is also skew-symmetric.
Example 6.1.1. Consider V = Z2 as a vector space over Z2 . The bilinear form β(x, y) = xy is
skew-symmetric but not symplectic because β(1, 1) = 1 6= 0.
169
170 6. Applications
Example 6.1.2. Let p be a prime number and let R be the ring of integers modulo pn , Zpn , or the
field of pn elements, Fpn , where n ∈ N. For ν ≥ 1, let V denote the set of 2ν-tuples (a1 , . . . , a2ν )
of elements in R. Define β : V × V → R by the product
β (a1 , . . . , a2ν ), (b1 , . . . , b2ν ) = (a1 , . . . , a2ν ) K (b1 , . . . , b2ν )t ,
0 Iν
where K = and Iν is the ν × ν identity matrix, for all vectors (a1 , . . . , a2ν ),
−Iν 0 2ν×2ν
(b1 , . . . , b2ν ) ∈ V . Then β is a non-degenerate symplectic bilinear form.
Definition. Let R be a commutative ring and V a free R-module of rank n where n ≥ 2. Let β
be a non-degenerate symplectic bilinear form. We call the pair (V, β) a symplectic space. An
R-module automorphism σ on V is an isometry on V if β(σ(~x), σ(~y )) = β (~x, ~y ) for all ~x, ~y ∈ V .
The group of isometries on V is called the symplectic group of (V, β) over R and denoted by
SpR (V ).
Definition. Let R be a commutative ring and (V, β) a symplectic space, where V is a free R-
module of rank n ≥ 2. A vector ~x in V is said to be unimodular if there is an f in HomR (V, R)
with f (~x) = 1; equivalently, if ~x = α1~b1 + · · · + αn~bn , where {~b1 , . . . , ~bn } is a basis for V , then
the ideal (α1 , . . . , αn ) = R. If ~x is unimodular, then the line Rx is a free R-direct summand of
rank one. A hyperbolic pair {~x, ~y } is a pair of unimodular vectors in V with the property that
β (~x, ~y ) = 1. The module H = R~x ⊕ R~y is called a hyperbolic plane.
Note that when R is a field, unimodular vectors coincide with nonzero vectors. When R is a
local ring, we have a criterion to determine whether a vector in V is unimodular as follows.
Theorem 6.1.1. Let R be a local ring and (V, β) a symplectic space, where V is a free R-module
of rank n ≥ 2 with basis {~e1 , . . . , ~en }. A vector ~x = a1~e1 + · · · + an~en in V is unimodular if and
only if ai is a unit of R for some i ∈ {1, . . . , n}.
In addition, if R is a local ring, we show that the rank of symplectic space (V, β) must be even.
Let {~x, ~y } be a hyperbolic pair of unimodular vectors in V and H = R~x ⊕ R~y the corresponding
hyperbolic plane. Then for ~z ∈ V , it is easy to see that the vector w
~ = ~z − β(~z, ~y )~x + β(~z, ~x)~y is in
⊥
H , and so ~z can be decomposed as the sum
Theorem 6.1.2. Let R be a local ring. Let (V, β) be a symplectic space over R of rank ≥ 2. Then
V splits as an orthogonal direct sum V = H⊥H ⊥ for some hyperbolic plane H. Moreover, H ⊥ is
a free R-module. Therefore, V is an orthogonal direct sum V = H1 ⊥H2 ⊥ . . . ⊥Hm of hyperbolic
planes H1 , H2 , . . . , Hm . In particular, the rank of V is even.
Note that H ⊥ is a direct summand of the free module V . By Corollary 4.3.4, it is finitely
generated and projective. Then this theorem follows directly from the next lemma.
Lemma 6.1.3. A finitely generated projective module V over a local ring R is free.
Proof. Let M be the unique maximal ideal in R. Choose ~v1 , . . . , ~vt ∈ V so that the cosets {~v1 +
M V, . . . , ~vt +M V } is a basis for the vector space V /M V over the field R/M . Here, the scalar
P action
is given by (c + M )(~x + M V ) = c~x + M V . Let ϕ : Rt → V be defined by ϕ(r1 , . . . , rt ) = ti=1 ri~vi .
By Remark after Theorem 4.6.1, ϕ is onto. Since V is projective, Rt = K ⊕ L where K = ker ϕ
and L ∼ = M . Then K is finitely generated. Since ϕ induces an isomorphism from Rt /M Rt to
V /M V , it follows that K/M K ⊕ L/M L ∼ = V /M V . These are finite dimensional vector spaces
over the field R/M . Comparing dimensions yields K/M K = 0. Thus, K = {~0} by Nakayama’s
lemma. Hence, ϕ is an isomorphism.
Definition. Let R be a commutative ring and (V, β) a symplectic space, where V is a free R-
module of rank 2ν, ν ≥ 1. Define the graph GSpR (V ) with vertex set is the set of lines {R~x : ~x is
a unimodular vector in V } and with adjacency given by
Here, R× denotes the group of invertible elements in R. We call GSpR (V ) , the symplectic graph
of (V, β) over R.
Let R be a local ring with unique maximal ideal M and let (V, β) be a symplectic space of
rank 2ν, where ν ≥ 1. By Theorem 6.1.2, V possesses a canonical basis {~e1 , . . . , ~e2ν } such
that {~ej , ~eν+j } is a hyperbolic pair for all 1 ≤ j ≤ ν and V is an orthogonal direct sum V =
H1 ⊥H2 ⊥ . . . ⊥Hν , where Hj = R~ej ⊕ R~eν+j is a hyperbolic plane for all 1 ≤ j ≤ ν.
Let R be a local ring. If R is finite, Theorem 6.1.1 gives the number of vertices of GSpR (V ) ,
namely,
|R|2ν − |M |2ν
|V (GSpR (V ) )| = |{R~x : ~x is a unimodular vector in V }| = .
|R× |
172 6. Applications
Write unimodular vectors ~a = a1~e1 +· · ·+a2ν ~e2ν and ~b = b1~e1 +· · ·+b2ν ~e2ν for some ai , bi ∈ R.
Then
because β(~ei , ~ei ) = 0, β(~ei , ~eν+i ) = 1 and β(~ei , ~ej ) = −β(~ej , ~ei ) for all i, j ∈ {1, . . . , 2ν}. Hence,
the adjacency condition becomes
ν
X
R~a is adjacent to R~b if and only if (ai bν+i − aν+i bi ) ∈ R× .
i=1
Lemma 6.2.1. Let R be a local ring with unique maximal ideal M and let (V, β) be a symplectic
space of R-dimension 2ν, where ν ≥ 1. Let ~a = a1~e1 + · · · + a2ν ~e2ν and ~b = b1~e1 + · · · + b2ν ~e2ν be
unimodular vectors in V and assume that ai ∈ R× for some i ∈ {1, . . . , 2ν}. If R~a is adjacent to
R~b, then ai bl − al bi is a unit for some l ∈ {1, . . . , 2ν} and l 6= i.
Theorem 6.2.2. (Theorem 2.4 of [31]) Let R be a finite local ring and let (V, β) be a symplectic
space of dimension 2ν, where ν ≥ 1.
1. The symplectic graph GSpR (V ) is |R|2ν−1 -regular.
2. Every two adjacent vertices of GSpR (V ) has |R|2ν−2 R× common neighbors.
3. Every two non-adjacent vertices of GSpR (V ) has |R|2ν−2 |R× | or |R|2ν−1 common neighbors.
Proof. Let ~a = a1~e1 + · · · + a2ν ~e2ν and ~b = b1~e1 + · · · + b2ν ~e2ν be unimodular vectors in V and
assume that R~a is adjacent to R~b. Since ~a is unimodular, there exists an i ∈ {1, . . . , 2ν} such that
ai ∈ R× . If i ≤ ν, then
bν+i = a−1
i r + (aν+1 b1 − a1 bν+1 ) + (aν+2 b2 − a2 bν+2 ) + · · ·
+ (aν+i−1 bi−1 − ai−1 bν+i−1 ) + aν+i bi + (aν+i+1 bi+1 − ai+1 bν+i+1 ) + · · ·
+ (a2ν bν − aν b2ν )
bi−ν = a−1
i (a1 bν+1 − aν+1 b1 ) + (a2 bν+2 − aν+2 b2 ) + · · ·
+ (ai−1−ν bi−1 − ai−1 bi−1−ν ) + ai−ν bi + (ai+1−ν bi+1 − ai+1 bi+1−ν ) + · · ·
+ (aν b2ν − a2ν bν ) − s
for some s ∈ R× . Therefore, there are |R|2ν−1 classes adjacent to the vertex R~a, and hence
GSpR (V ) is |R|2ν−1 -regular. This proves (1).
6.2. Symplectic Graphs 173
Next, we let ~x = x1~e1 + · · · + x2ν ~e2ν be a unimodular vector in V such that R~x is a common
neighbor of R~a and R~b. Then
(a1 xν+1 − aν+1 x1 ) + (a2 xν+2 − aν+2 x2 ) + · · · + (aν x2ν − a2ν xν ) = r′ (6.2.1)
and
(b1 xν+1 − bν+1 x1 ) + (b2 xν+2 − bν+2 x2 ) + · · · + (bν x2ν − b2ν xν ) = s′ (6.2.2)
for some r′ , s′ ∈ R× . Since ai ∈ R× and we may assume without loss of generality that i ≤ ν,
from Eq. (6.2.1) we have
xν+i = a−1
i r′ + (aν+1 x1 − a1 xν+1 ) + (aν+2 x2 − a2 xν+2 ) + · · ·
+ (aν+i−1 xi−1 − ai−1 xν+i−1 ) + aν+i xi
+ (aν+i+1 xi+1 − ai+1 xν+i+1 ) + · · · + (a2ν xν − aν x2ν )
Subtracting bi × (6.2.1) from ai ×(6.2.2) gives
ν
X ν
X
− (ai bν+j − aν+j bi )xj + (ai bj − aj bi )xν+j = ai s′ − bi r′ . (6.2.3)
j=1 j=1
j6=i
Assume that R~a is adjacent to R~b. By Lemma 6.2.1, we have ai bl − al bi is a unit for some l ∈
{1, . . . , 2ν} and l 6= i. If l ≤ ν, then
ν
X ν
X
xν+l = (ai bl − al bi )−1 ai s′ − bi r′ + (ai bν+j − aν+j bi )xj − (ai bj − aj bi )xν+j
j=1 j=1
j6=i,l
and if l ≥ ν + 1, then
ν
X ν
X
xl−ν = (ai bl − al bi )−1 ai s′ − bi r′ + (ai bν+j − aν+j bi )xj − (ai bj − aj bi )xν+j .
j=1 j=1
j6=l−ν j6=i
2ν−2 × ×
Hence, there are |R| |R|R×|
||R |
= |R|2ν−2 |R× | classes of common neighbors of adjacent vertices
R~a and R~b, and so we have (2).
Finally, suppose that R~a is not adjacent to R~b. If ai bl − al bi is a unit for some l ∈ {1, . . . , 2ν}
and l 6= i, then Eq. (6.2.3) implies that xν+l or xl depends on other 2ν − 2 variables similar to the
2ν−2 × ||R× |
previous paragraph, so that there are |R| |R|R ×| = |R|2ν−2 |R× | classes of common neighbors.
Assume that ai bl − al bi ∈ M for all l ∈ {1, . . . , 2ν} r {i}. Then bi is a unit, so
xν+i = b−1
i s′ + (bν+1 x1 − b1 xν+1 ) + (bν+2 x2 − b2 xν+2 ) + · · ·
+ (bν+i−1 xi−1 − bi−1 xν+i−1 ) + bν+i xi
+ (bν+i+1 xi+1 − bi+1 xν+i+1 ) + · · · + (b2ν xν − bν x2ν )
Clearly, if xk ∈ M for all k ∈ {1, . . . , 2ν} r {ν + i}, then xν+i ∈ R× . Hence, there are |R|2ν−1
classes of common neighbors. This completes the proof of (3).
Theorem 6.2.3. (Theorem 2.5 of [31]) Let ~a = a1~e1 + · · · + a2ν ~e2ν and ~b = b1~e1 + · · · + b2ν ~e2ν
be unimodular vectors in V and assume that ai ∈ R× for some i ∈ {1, . . . , 2ν}. If R~a and R~b are
non-adjacent vectices of GSpR (V ) , then the number of common neighbors are
(
|R|2ν−2 |R× |, if ai bl − al bi ∈ R× for some l ∈ {1, . . . , 2ν} r {i},
|R|2ν−1 , if ai bl − al bi ∈ M for all l ∈ {1, . . . , 2ν} r {i}.
174 6. Applications
As a generalization of strongly regular graphs, Erickson and Fernando [22] introduced Deza
graphs, which were firstly introduced in a slightly more restricted form by Deza and Deza [21].
Definition. A regular graph with degree k on v vertices is said to be a (v, k, λ, µ)-Deza graph
if any two distinct vertices x and y have λ or µ common adjacent vertices. A Deza graph of
diameter two is called a strictly Deza graph if it is not strongly regular. For d ≥ 2, a k-regular
graph G on v vertices is called a d-Deza graph with parameters (v, k, {c1 , . . . , cd }) if every two
distinct vertices of G has c1 , c2 , . . . , cd common adjacent vertices.
Theorem 6.2.4. (Theorem 2.6 of [31]) Let R be a finite local ring with unique maximal ideal M
and let (V, β) be a symplectic space of dimension 2ν.
1. If ν = 1, then GSpR (V ) is a strongly regular graph with parameters
3. If ν ≥ 2 and R is not a field, then GSpR (V ) is a strictly Deza graph with parameters
|R|2ν − |M |2ν 2ν−1 2ν−2 × 2ν−1
, |R| , |R| |R |, |R| .
|R× |
Example 6.2.1. Let p be a prime number and let R be the ring of integers modulo pn , Zpn , or the
field of pn elements, Fpn , where n ∈ N. For ν ≥ 1, let V denote the set of 2ν-tuples (a1 , . . . , a2ν ) of
elements in R. Consider the symplectic bilinear form β given in Example 6.1.2. The unimodular
vectors in V are those (a1 , . . . , a2ν ) of elements in R such that ai ∈ R× for some i ∈ {1, 2, . . . , 2ν}.
We write Sp(2ν) (R) for this symplectic graph. The above theorem gives the following results.
1. The symplectic graph Sp(2ν) (Fpn ) is a strongly regular graph with parameters
n 2ν
(p ) − 1 n 2ν−1 n 2ν−2 n n 2ν−2 n
, (p ) , (p ) (p − 1) , (p ) (p − 1) .
pn − 1
2. The symplectic graph Sp(2) (Zpn ) is a strongly regular graph with parameters
pn + pn−1 , pn , pn − pn−1 , pn .
R = R1 × R2 × · · · × Rt
6.2. Symplectic Graphs 175
This shows that the adjacency condition does not depend on the bilinear map β. Recall from the
statement before Lemma 6.2.1 that when Rj is a local ring, the adjacency condition becomes
ν
X
Rj~a is adjacent to Rj~b if and only if (ai bν+i − aν+i bi ) ∈ Rj× . (6.2.5)
i=1
for all j ∈ {1, 2, . . . , t}. Therefore, it follows from Eq. (6.2.5) that
GSpR (V ) ∼
= GSpR (V (1) ) ⊗ GSpanR (V (2) ) ⊗ · · · ⊗ GSpR (V (t) ) , (6.2.6)
1 2 t
as a graph isomorphism. Here, for two graphs G and H, we define their tensor product G ⊗ H to
be the graph with vertex set V(G) × V(H), where (u, v) is adjacent to (u′ , v ′ ) if and only if u is
adjacent to u′ and v is adjacent to v ′ .
From Theorem 6.2.2 (1) and the above discussion, we have the number of vertices of GSpR (V )
is equal to
t t
Y Y |Rj |2ν − |Mj |2ν
|V(GSpR (V ) )| = |V(GSpR (V (j) ) )| =
j=1
j
j=1
|Rj× |
and GSpR (V ) is regular of degree |R1 |2ν−1 |R2 |2ν−1 . . . |Rt |2ν−1 = |R|2ν−1 . Moreover, every two
adjacent vertices of GSpR (V ) has |R|2ν−2 |R× | common neighbors by Theorem 6.2.2 (2). We record
these results in the next theorem.
Theorem 6.2.5. Let R be a finite commutative ring and (V, β) be the induced symplectic space of
rank 2ν, ν ≥ 1, discussed above.
1. The symplectic graph GSpR (V ) is a |R|2ν−1 -regular and isomorphic to the graph
2. Every two adjacent vertices of GSpR (V ) has |R|2ν−2 |R× | common neighbors.
Example 6.2.2. If m > 1 and m = pn1 1 pn2 2 . . . pnt t , where ni ∈ N and pi are distinct primes for all
i ∈ {1, 2, . . . , t}, then by Chinese remainder theorem,
R = Zm ∼
= Z p 1 n1 × Z p 2 n2 × · · · × Z p t nt .
Consider V = R2ν , the induced symplectic space of dimension 2ν, ν ≥ 1. By Theorem 6.2.5, we
have:
176 6. Applications
1. the symplectic graph GSpR (V ) is a m2ν−1 -regular and isomorphic to the graph product
(j)
be unimodular vectors in V where (ai )tj=1 ∈ R× for some i ∈ {1, 2, . . . , 2ν}. Assume that
R~a and R~b are non-adjacent vertices of GSpR (V ) . Let {j1 , j2 , . . . , js } ⊆ {1, 2, . . . , t} be such that
(j ) (j ) (j ) (j ) (j ) (j )
Rjk a1 k , a2 k , . . . , a2νk and Rjk b1 k , b2 k , . . . , b2νk are non-adjacent vertices for all k ∈ {1, 2,
. . . , s}. Then the number of common neighbors of R~a and R~b are
Y Y
|Rj |2ν−2 |Rj× | Ck ,
j∈{1,2,...,t}r{j1 ,j2 ,...,js } k∈{1,2,...,s}
where
( (jk ) (jk ) (jk ) (jk )
|Rjk |2ν−2 |Rj×k |, if ai bl − al bi ∈ Rj×k for some l ∈ {1, . . . , 2ν} r {i},
Ck = (jk ) (jk ) (jk ) (jk )
|Rjk |2ν−1 , if a i bl − a l bi ∈ Mjk for all l ∈ {1, . . . , 2ν} r {i},
Proof. It follows directly from the isomorphism (6.2.6) and Theorem 6.2.3.
Definition. Let G and H be graphs. A function σ from V(G) to V(H) is a homomorphism from
G to H if σ(g1 ) and σ(g2 ) are adjacent in H whenever g1 and g2 are adjacent in G. It is called
an isomorphism if it is a bijection and σ −1 is a homomorphism from H onto G. Moreover, an
isomorphism on G is called an automorphism. The set of all automorphisms of a graph G is
denoted by Aut (G). It is a group under composition, called the automorphism group of G.
Let G and H be two graphs. Let σ and τ be automorphisms of G and H, respectively. It is easy
to see that the map
Theorem 6.3.1. For graphs G and H, Aut (G) × Aut (H) ⊆ Aut (G ⊗ H).
6.3. Vertex and Arc Transitivities of Symplectic Graphs 177
Definition. A graph G is vertex transitive if its automorphism group acts transitively on the
vertex set. That is, for any two vertices of G, there is an automorphism carrying one to the
other. A graph G is edge transitive if its automorphism group acts transitively on the edge set.
An arc in G is an ordered pair of adjacent vertices, and G is arc transitive if its automorphism
group acts transitively on its arcs.
Note that an arc transitive graph is necessarily vertex and edge transitive. More on transitive
graphs can be found in Chapter 3 of Godsil’s book [14].
We recall Proposition 2.3 of [26] as follows.
Theorem 6.3.2. [26] Let R be a local ring. If {~x, ~a} and {~x, ~b} are hyperbolic pairs of unimodular
vectors in V , then there exists an isometry σ in SpR (V ) which leaves ~x invariant and carries ~a to ~b.
Lemma 6.3.3. Let R be a finite local ring and let (V, β) be a symplectic space of dimension 2ν.
Then SpR (V ) acts transitively on unimodular vectors and on hyperbolic planes.
Proof. Let ~a and ~b be unimodular vectors in V such that R~a 6= R~b. By Theorem 6.2.2 (2) and (3),
for every two distinct vertices of GSpR (V ) , there exists a unimodular vector ~x such that {~x, ~a} and
{~x, ~b} are hyperbolic pairs. Then Theorem 6.3.2 gives an isometry σ in SpR (V ) which leaves ~x
invariant and carries ~a to ~b. Hence, SpR (V ) acts transitively on unimodular vectors.
Next, let {~a, ~b} and {~c, d}
~ be two distinct hyperbolic pairs of unimodular vectors in V . Then
there exists an isometry ρ in SpR (V ) carries ~a to ~c. Since {~a, ~b} is hyperbolic pair, so is the pair
{ρ(~x), ρ(~a)} = {~c, ρ(~b)}. Again, Theorem 6.3.2 implies an isometry τ in SpR (V ) which leaves ~c
invariant and carries ρ(~b) to d. ~ It follows that τ ◦ ρ ∈ Sp (V ) maps {~a, ~b} to {~c, d}
R
~ as desired.
Theorem 6.3.4. Let R be a finite local ring and let (V, β) be a symplectic space of dimension 2ν.
The symplectic graph GSpR (V ) is vertex transitive and arc transitive.
Proof. Observe that for any isometry σ of V , we have the induced automorphism Tσ on the vertex
set of the symplectic graph GSp(V ) given by
Tσ : R~a 7→ Rσ(~a)
for all unimodular vectors ~a ∈ V . Let ~a and ~b be unimodular vectors in V . By Lemma 6.3.3,
there is an isometry σ ∈ SpR (V ) such that σ(~a) = ~b. Thus, we have Tσ ∈ Aut GSpR (V ) and
Tσ : R~a 7→ Rσ(~a) = R~b.
For edge transitivity, we let ~a, ~b, ~c, d~ be unimodular vectors in V such that {~a, ~b} and {~c, d}
~ are
hyperbolic pairs. Again, by Lemma 6.3.3, there exists an isometry σ ∈ SpR (V ) such that σ(~a) = ~c
and σ(~b) = d.
~ Therefore, Tσ ∈ Aut GSp (V ) sends R~a to R~c and R~b to Rd.
R
~ This proof also shows
that the symplectic graph GSpR (V ) is arc transitive.
GSpR (V ) ∼
= GSpR (V (1) ) ⊗ GSpR (V (2) ) ⊗ · · · ⊗ GSpR (V (t) )
1 2 t
as we have seen before Theorem 6.2.5. From Theorem 6.3.4, for each i, we have GSpR (V (i) ) is
i
vertex transitive and arc transitive. By Theorem 6.3.1,
Aut (GSpR (V (1) ) ) × Aut (GSpR (V (2) ) ) × · · · × Aut (GSpR (V (t) ) ) ⊆ Aut (GSpR (V ) ),
1 2 t
it follows that GSpR (V ) is also vertex transitive and arc transitive. Hence, we have proved:
178 6. Applications
Theorem 6.3.5. If (V, β) is a symplectic space over a finite commutative ring R, then the symplectic
graph GSpR (V ) is vertex transitive and arc transitive.
Definition. The chromatic number of a graph G is the smallest number of colors needed to
color the vertices of G so that no two adjacent vertices share the same color. The chromatic
number of a graph G is commonly denoted by χ(G).
The chromatic number of symplectic graphs over a finite field is recalled in the next proposi-
tion.
Proposition 6.4.1. [33] If k is the field of q elements and V is the symplectic space of dimension 2ν,
ν ≥ 1, then χ(GSpk (V ) ) = q ν + 1.
Let R be a finite local ring with unique maximal ideal M and residue k = R/M . Let V be a
free R-module of rank 2ν, ν ≥ 1, and let V ′ be the 2ν-dimensional vector space over k induced
from V via the canonical map π : R → k given by
π : r 7→ r + M.
for all ~a, ~b ∈ V . Here, we write π(~a) = (π(a1 ), π(a2 ), . . . , π(a2ν )) for all ~a = (a1 , a2 , . . . , a2ν ) ∈ V .
Note that the relation
R~x ∼ R~y ⇔ kπ(~x) = kπ(~y ) (6.4.1)
is an equivalence relation on the vertex set of the graph GSpR (V ) . Since R is a local ring, it follows
that
β(~a, ~b) ∈ R× ⇔ π(β(~a, ~b)) 6= M ⇔ β ′ (π(~a), π(~b)) ∈ k × .
This gives (3) of the next theorem.
Theorem 6.4.2. Let R be a finite local ring with unique maximal ideal M and residue k = R/M .
2ν −1
Let κ = |k||k|−1 and ~x1 , ~x2 , . . . , ~xκ be unimodular vectors in V such that the vertex set
Proof. The first part of (1) follows from the relation (6.4.1) and (4) is an immediate consequence
of (3). Note that
β(~xi + m
~ 1 , ~xi + m
~ 2 ) = β(~xi , m
~ 1 ) + β(m
~ 2 , ~xi ) + β(m ~ 2) ∈ M
~ 1, m
for all i ∈ {1, 2, . . . , κ} and m ~ 2 ∈ M 2ν . This proves the second part of (1).
~ 1, m
Next, let m~ 1, m~ 2 ∈ M and assume that R(~xi + m ~ 1 ) = R(~xi + m
~ 2 ). Then ~xi + m~ 1 = λ(~xi + m
~ 2)
for some λ ∈ R× . Thus, (1 − λ)~xi = λm ~2−m ~ 1 ∈ M 2ν . Since ~xi is unimodular, 1 − λ ∈ M ,
so λ = 1 + µ for some µ ∈ M . Hence, ~xi + m ~ 1 = (1 + µ)(~xi + m ~ 2 ). Finally, we show that
R(1 + µ)(~x + m) ~ = R(~x + m) ~ for all µ ∈ M , ~x ∈ V unimodular, and m ~ ∈ M 2ν and we therefore
have (2). Clearly, R(1 + µ)(~x + m) ~ ⊆ R(~x + m).~ Since µ ∈ M , 1 + µ ∈ R× . Then r(~x + m) ~ =
−1
(r(1 + µ) )(1 + µ)(~x + m) ~ for all r ∈ R which gives another inclusion.
It follows from Proposition 2.3 of [33] that GSpk (V ′ ) is |k|ν + 1-partite with partite sets Y1 ,
Y2 , . . . , Y|k|ν +1 , where Yi ∩ Yj = ∅ for all i 6= j and there is no edge of GSpk (V ′ ) joining two vertices
of the same subset. Moreover, the subsets Y1 , Y2 , . . . , Y|k|ν +1 can be chosen so that for any distinct
indices i and j, every y ∈ Yi is adjacent to exactly |k|ν−1 vertices in Yj . In addition, the chromatic
number of GSpk (V ′ ) is |k|ν + 1 (Proposition 6.4.1). The canonical map π : R → k and Theorem
6.4.2 give the following theorem.
Theorem 6.4.3. Let R be a finite local ring with unique maximal ideal M and residue k =
R/M . The symplectic graph GSpR (V ) is |k|ν + 1-partite with partite sets π −1 (Y1 ), π −1 (Y2 ),
. . . , π −1 (Y|k|ν +1 ), where Yj , j = 1, 2, . . . , |k|ν +1, are subsets of GSpk (V ′ ) discussed above. Moreover,
for any distinct indices i and j, every a ∈ π −1 (Yi ) is adjacent to exactly |M |2ν−1 |k|ν−1 vertices in
π −1 (Yj ). As a result, the chromatic number χ(GSpR (V ) ) is |k|ν + 1.
Proof. From the above discussion, it remains to derive the chromatic number of GSpR (V ) . Since
our graph is |k|ν +1-partite, χ(GSpR (V ) ) ≤ |k|ν +1. To prove the reverse inequality, we consider the
induced subgraph of GSpR (V ) on the vertex set {Rx1 , Rx2 , . . . , Rxκ }. By Theorem 6.4.2 (3), this
subgraph is isomorphic to the symplectic graph GSpk (V ′ ) with chromatic number |k|ν + 1. Hence,
the chromatic number χ(GSpR (V ) ) is |k|ν + 1 as desired.
We shall conclude this section by studying the chromatic number of symplectic graphs over a
finite commutative ring. The key ingredient is vertex transitivity discovered above.
For example, if R is a local ring, Theorem 6.4.2 implies that the sets R(~xi + M 2ν ), i ∈
{1, 2, . . . , κ}, are independent sets in the symplectic graph GSpR (V ) . Since the symplectic graph is
regular, it follows from Theorem 6.4.3 that:
Theorem 6.4.4. Let R be a finite local ring with unique maximal ideal M and residue field k =
R/M and let (V, β) be a symplectic space of rank 2ν, ν ≥ 1. Then
|k|ν − 1
α(GSpR (V ) ) = |M |2ν−1 .
|k| − 1
The color classes of a proper l-coloring of G form a collection of l pairwise disjoint independent
sets I1 , I2 , . . . , Il whose union is V(G). The function f such that f (Ij ) = 1 for all j ∈ {1, 2, . . . , l}
and f (S) = 0 for all other independent sets S is a fractional coloring of weight l. Therefore,
χ∗ (G) ≤ χ(G). Moreover, when G is vertex transitive, we have the following proposition.
|V(G)|
χ∗ (G) = .
α(G)
Let R be a finite local ring with unique maximal ideal M and residue field k = R/M and let
(V, β) be a symplectic space of rank 2ν, ν ≥ 1. By Theorems 6.2.2 (1) and 6.4.4, we have
|R|2ν − |M |2ν |k|ν − 1
|V(GSpR (V ) )| = and α(G SpR (V ) ) = |M |2ν−1 ,
|R× | |k| − 1
respectively. Since the graph GSpR (V ) is vertex transitive, it follows from Proposition 6.4.5 that
|R|2ν − |M |2ν
|R× | |R|2ν − |M |2ν |k| − 1 |M |
χ∗ (GSpR (V ) ) = ν = 2ν ν
= |k|ν + 1,
|k| − 1 |M | |k| − 1 |R| − |M |
|M |2ν−1
|k| − 1
which is equal to the chromatic number of GSpR (V ) . We record this result in the next theorem.
Theorem 6.4.6. Let R be a finite local ring with unique maximal ideal M and residue field k =
R/M and let (V, β) be a symplectic space of rank 2ν, ν ≥ 1. Then
It is easy to see that if there is a homomorphism from a graph X to a graph Y , then χ(X) ≤
χ(Y ). Let G and H be graphs. Since both G and H are homomorphic images of G ⊗ H (using the
projection homomorphisms), we have that
χ(G ⊗ H) ≤ min{χ(G), χ(H)}.
Hedetniemi [25] has conjectured that for all graphs G and H equality occurs in the above bound.
This conjecture is still open. However, Zhu [34] showed that Hedetniemi’s conjecture is true for
fractional chromatic numbers.
Corollary 6.4.9. Let m > 1 and R = Zm ∼ = Zp1 n1 × Zp2 n2 × · · · × Zpt nt , where ni ∈ N and pi are
primes such that p1 < p2 < · · · < pt . For the symplectic space V over R of dimension 2ν, ν ≥ 1,
we have the chromatic number of the graph GSpR (V ) given by
General References
[1] D. S. Dummit and R. M. Foote, Abstract Algebra, 2nd edn, Prentice-Hall Inc., London, 1999.
[2] J. B. Fraleigh, A First Course in Abstract Algebra, 7th edn, Addison Wesley, New York, 2002.
[5] I. M. Isaacs, Algebra, a graduate course, Brooks/Cole Publishing Company, Pacific Grove,
1993.
[6] N. Jacobson, Basic Algebra I and II, W. H. Freeman & Co, 1996.
[10] W. K. Nicholson, Introduction to Abstract Algebra, John Wiley & Sons, Inc., New Jersey, 2007.
Technical References
[12] M. F. Atiyah and I. G. MacDonald, Introduction to Commutative Algebra, Addison-Wesley
Publishing Co, Reading, Mass.-London-Don Mills, Ontario, 1969.
[13] D. M. Burton, Elementary Number Theory, 7th edn, McGraw-Hill Higher Education,
Dubuque, 2010.
[14] C. Godsil and G. Royle, Algebraic Graph Theory, Spinger, New York, 2001.
[15] T. Head, Modules: A Primer of Structure Theorems, Brooks/Cole Publishing Company, Mon-
terey, 1974.
[16] W. C. Winnie Li, Number Theory with Applications, World Scientific, Singapore, 1996.
[17] B.R. McDonald, Finite Rings with Identity, Marcel Dekker, New York, 1974
[18] M. Reid, Undergraduate Commutative Algebra, Cambridge University Press, New York, 1995.
[19] I. Stewart, Galois Theory, 3rd edn, Chapman & Hall/CRC mathematics, Boca Raton, 2004.
183
184 BIBLIOGRAPHY
Research Articles
[21] A. Deza and M. Deza, The ridge graph of the metric polytope and some relatives, Polytopes:
Abstract, convex and computational, T. Bisztriczky et al. (Editors), NATO ASI Series, Kluwer
Academic, (1994) 359–373.
[23] Z. Gu, Subconstituents of symplectic graphs modulo pn , Linear Algebra Appl. 439 (2013)
1321–1329.
[25] S. Hedetniemi, Homomorphisms of graphs and automata, Technical Report 03105-44-T, Uni-
versity of Michigan, 1966.
[26] W. Klingenberg, Symplectic groups over local rings, Amer. J. Math. 85 (1963) 232–240.
[27] F. Li and Y. Wang, Subconstituents of symplectic graphs, Europ. J. Combin. 29 (2008) 1092–
1103.
[28] F. Li, K. Wang and J. Guo, More on symplectic graphs modulo pn , Linear Algebra Appl. 438
(2012) 2651–2660.
[29] F. Li, K. Wang and J. Guo, Symplectic graphs modulo pq, Discrete Math. 313 (2013), 650–
655.
[30] Y. Meemark and T. Prinyasart, On symplectic graphs modulo pn , Discrete Math. 311 (2011)
1874–1878.
[31] Y. Meemark and T. Puirod, Symplectic graphs over finite local rings, Europ. J. Combin. 34
(2013) 1114–1124.
[32] Y. Meemark and T. Puirod, Symplectic graphs over finite commutative rings, Europ. J. Com-
bin. (2014) http://dx.doi.org/10.1016/j.ejc.2014.05.004.
[33] Z. Tang and Z. Wan, Symplectic graphs and their automorphisms, Europ. J. Combin. 27
(2006) 38–50.
[34] X. Zhu, The fractional version of Hedetniemi’s conjecture is true, Europ. J. Combin. 32
(2011) 1168–1175.
Index
185
186 INDEX