Four Square Theorem - Quaternions
Four Square Theorem - Quaternions
.
The set of quarternions is called H, after its discoverer Hamiltion (1843). If
= a + di and = b + ci, where a, b, c, d R, then each quarternion is a linear
combination of the quarternion units 1,i,j,k.
a + di b + ci
b + ci a di
= a
1 0
0 1
+b
0 1
1 0
+c
0 i
i 0
+d
i 0
0 i
= a1+bi+cj+dk
1
2 JIA HONG RAY NG
Denition 1.2. The norm of a quarternion, |q| is dened to be the square root
of its determinant, hence |q|
2
is
det
= + = [[
2
+[[
2
det
a + di b + ci
b + ci a di
= a
2
+ b
2
+ c
2
+ d
2
The quarternions 1, i, j, k have norm 1 and satisfy the following relations:
i
2
= j
2
= k
2
= 1,
ij = k = ji,
jk = i = kj,
ki = j = ik
Non-zero quarternions form a group under multiplication, but the product of
quarternions is generally non-commutative: q
1
q
2
,= q
2
q
1
. Quarternions also form
an abelian group under addition, and obey the left and right distributive laws:
q
1
(q
2
+ q
3
) = q
1
q
2
+ q
1
q
3
, (q
2
+ q
3
) = q
2
q
1
+ q
3
q
1
.
Due to the multiplicative property of the determinants, det (q
1
) det (q
2
) = det (q
1
q
2
),
the norm also has a multiplicative property, i.e.
norm(q
1
) norm(q
2
) = norm(q
1
q
2
) .
It is interesting to note that n = 1, 2, 4, 8 are the only n for which R
n
has a
multiplication that distributes over vector addition, and a multiplicative norm.
They correspond to R, C, H and O (the octonions, n = 8) respectively.
Theorem 1.3. Four Square Identity: If two numbers can each be written as the
sum of four squares, then so can their product.
Proof. Let x
1
= a
2
1
+ b
2
1
+ c
2
1
+ d
2
1
= |a
1
1 + b
1
i + c
1
j + d
1
k|
2
= |q
1
|
2
and
x
2
= a
2
2
+ b
2
2
+ c
2
2
+ d
2
2
= |a
2
1 + b
2
i + c
2
j + d
2
k|
2
= |q
2
|
2
Then by the multiplicative property of norms,
x
1
x
2
= |q
1
|
2
|q
2
|
2
= |q
1
q
2
|
2
Therefore, x
1
x
2
= a
2
+ b
2
+ c
2
+ d
2
, where |q
1
q
2
|
2
= a
2
+ b
2
+ c
2
+ d
2
i+1
=
i
,
i+1
= remainder when
i
is divided by
i
.
(The remainder
i+1
is less than the divisor
i
by the division property described
in Denition 1.5.)
Now if and have a common right divisor , then
= , = for some , ,
Therefore = = = ( ) .
This means that a common right divisor of and is also a right divisor of the
remainder when is divided on the right by . Therefore
right gcd(
1
,
1
) = right gcd(
2
,
2
) = . . .
The algorithm stops when
k
divides
k
, such that g = right gcd(, ) =
right gcd(
k
,
k
) =
k
.
Theorem 1.8. Linear Representation of Greatest Common Divisor: If g is the
greatest common divisor of and , then g = gcd(, ) = + for some
integers and .
The theorem can be easily proved by induction on the number pairs obtained from
the Euclidean algorithm, starting with
1
= 1 + 0 ,
1
= 0 + 1 .
Theorem 1.9. Prime Divisor Property of Hurwitz Integers: if p is a real Hurwitz
prime and divides a Hurwitz integer product , then p divides or p divides .
4 JIA HONG RAY NG
Proof. Assume that the prime p divides but does not divide . Then according
to Theorem 1.8,
1 = right gcd(p, ) = p + .
Multiplying on the right by for both sides,
= p + .
Note that p is both a right and left divisor of whatever number it divides, since reals
commute with quarternions. Therefore p divides both p, and (by assumption).
Consequently, p divides , as required by the statement.
2. Four Square Theorem
Now, we can use the above properties of quarternions to prove the Four Square
Theorem, which says that every natural number is the sum of four squares. In fact,
the Four Square Identity (Theorem 1.3) implies that if every prime is the sum of
four squares, then all the other natural numbers except 1, which are products of
primes, can satisfy the Four Square Theorem. Clearly, 1 = 0
2
+ 0
2
+ 0
2
+ 1
2
and
2 = 0
2
+0
2
+1
2
+1
2
, therefore we can immediately simplify the problem to that of
representing every odd prime p as the sum of four integer squares. The following
proof is due to Hurwitz himself, and diers from Lagranges own proof, which argues
that the least integer m satisfying the equation mp = A
2
+ B
2
+ C
2
+ D
2
is 1.
Theorem 2.1. (Conditional Four Square Theorem): any ordinary prime p that is
not a Hurwitz prime is a sum of four integer squares.
Proof. Suppose p has a nontrivial Hurwitz integer factorization
p = (a + bi + cj + dk)
Taking conjugates on both sides,
p = p = (a bi cj dk)
Multiplying both expressions,
p
2
= (a + bi + cj + dk) (a bi cj dk)
= (a + bi + cj + dk) (a bi cj dk)
=
a
2
+ b
2
+ c
2
+ d
2
[[
2
where both a
2
+ b
2
+ c
2
+ d
2
and [[
2
are integers greater than 1. However, by
unique prime factorization, the only positve integer factorization is pp, therefore
p = a
2
+b
2
+c
2
+d
2
= (a + bi + cj + dk) (a bi cj dk). If the Hurwitz integer
a+bi +cj +dk is also in the set Z[i, j, k], then p is the sum of four integer squares.
But suppose = a +bi +cj +dk has half-integer co-ordinates, the factors of p
2
can be re-expressed to achieve integer values for the co-ecients of 1, i, j, k in .
Consider the integer =
1ijk
2
, which has norm 1, thus = 1. By choosing
appropriate signs in , can be written as = + a
+ b
i + c
j + d
k, where
a
, b
, c
, d
+ b
i + c
j + d
k) ( + a
i c
j d
k)
= ( + a
+ b
i + c
j + d
k) ( + a
i c
j d
k)
QUARTERNIONS AND THE FOUR SQUARE THEOREM 5
With respect to the rst factor, the product of and plus the even terms gives 1
plus integer terms, hence the rst factor is A+Bi+Cj+Dk for some A, B, C, D Z.
The second factor is its conjugate, thus
p = A
2
+ B
2
+ C
2
+ D
2
with A, B, C, D Z.
The following lemma is useful for showing that every odd prime is not a Hurwitz
prime, which combined with Theorem 2.1, implies that every odd prime can be
expressed as the sum of four squares.
Lemma 2.2. If an odd prime p = 2n + 1, then there are l, m Z such that p
divides 1 + l
2
+ m
2
.
Proof. The squares x
2
, y
2
of any two of the numbers 0,1,2,...,n are incongruent
mod p because
x
2
y
2
(mod p) x
2
y
2
0 (mod p)
(x y) (x + y) 0 (mod p)
x y or x + y 0 (mod p)
x + y , 0 (mod p) since 0<x + y<p. Therefore the n + 1 dierent numbers
l = 0, 1, 2, . . . , n give n + 1 incongruent values of l
2
mod p. Similarly, there
are n + 1 incongruent values of m
2
mod p and hence of 1 m
2
mod p, where
m = 0, 1, 2, . . . , n But there only exist 2n +1 incongruent values, mod p = 2n +1.
Therefore, by the pigeon-hole principle, for some l and m,
l
2
1 m
2
(mod p)
1 + l
2
+ m
2
0 (mod p)
Theorem 2.3. Four Square Theorem: Every natural number is the sum of four
squares.
Proof. Let p be an odd prime. Then we can nd l and m such that p divides
1 + l
2
+ m
2
(by Lemma 2.2)
Factorizing it,
1 + l
2
+ m
2
= (1 + li + mj) (1 li mj)
If p were a Hurwitz prime, then, according to the prime divisor property (Theorem
1.9), p divides either 1+li+mj or 1limj But neither case is possible, since neither
1
p
+
li
p
+
mj
p
nor
1
p
li
p
mj
p
is a Hurwitz integer. Hence the arbitrary odd prime p is not a Hurwitz prime, and
by Theorem 2.1,
p = A
2
+ B
2
+ C
2
+ D
2
with A, B, C, D Z
6 JIA HONG RAY NG
By the four square identity (Theorem 1.3), every natural number is the sum of four
squares.
3. Discussion
The Four Square Theorem was rst conjectured in Bachets 1621 edition of Dio-
phantus and the rst proof was given by Lagrange in 1770. Fermat claimed to have
come up with a proof, but did not publish it. In fact, there is a similar theorem
called Fermats Two Square Theorem, that is a corollary of the Four Squares The-
orem.
Theorem 3.1. Two Square Theorem: if p = 4n +1 is prime, then p = a
2
+b
2
for
some a, b Z.
A proof exists that uses the unique prime factorization of Gaussian integers
Z = a + bi : a, b Z and Lagranges lemma, a prime = 4n+1 divides m
2
+1 for
some m Z.
It has also been shown that only the numbers for which primes of the form 4k-1
do not appear to an odd power in their canonical decomposition can be represented
as the sum of two squares. Thus, not all natural numbers can be expressed as the
sum of two squares.
It is a similar situation for the sum of three squares, as it has been proved that
numbers of the form 4
n
(8k + 7) cannot be expressed as the sum of three squares.
This proof is more involved than those for sums of two or four squares and is due
to the fact that a three square identity does not exist, i.e. the product of two sums
of three squares is not always a sum of three squares.
Waring generalized the question about the expression of numbers as the sum of
squares to higher powers.
Theorem 3.2. Warings problem: There is a number g (k) for every positive inte-
ger k such that every integer can be written as at most g (k) k
th
powers.
For example, g(2) is 4, g(3) is 9, g(4) is 19, g(5) is 37 and g(6) is 73. Euler
conjectured that g (k) 2
k
+
3
2
k
| and Dickson, Pillai and Niven later conjectured
that the equality holds provided 2
k
3
2
3
2
k
| 2
k
, where x denotes the
fractional part of x. This equality has been proven to be correct for 6 n
471600000, but it is uncertain if it holds for all values of n.
References
[1] J Stillwell. Elements of Number Theory. Springer-Verlag New York Inc. 2003.
[2] P Erdos, J Suranyi. Topics in the Theory of Numbers (2nd Ed). Springer-Verlag New York
Inc. 2003.
[3] I Niven, H S Zuckerman. Introduction to the Theory of Numbers (2nd Ed). John Wiley Sons,
Inc. 1966.
[4] Weisstein, Eric W. Warings Problem. From MathWorldA Wolfram Web Resource.
http://mathworld.wolfram.com/WaringsProblem.html