Symmetric Polynomials
Symmetric Polynomials
DAVID SMYTH
Now let us assume for the time being that f (x) actually has n distinct roots, so that we can
factor
Yn
f (x) = (x − αi )
i=1
Then we can view the roots αi as variables, and the coefficients of the polynomial as giving
equations involving these variables. At least in the case n = 2, this idea should be familiar
from high school, i.e. if we write
(x − α1 )(x − α2 ) = x2 + a1 x + a2 ,
where a1 and a2 are given to begin with, then we see the the problem of finding the roots of
f is just the same as finding two numbers α1 , α2 such that
−(α1 + α2 ) = a1
α1 α2 = a2 .
2. Symmetric Functions
Definition 2.1 (Elementary Symmetric Functions). Let k[x1 , . . . , xn ] be a polynomial ring in
n variables. For i = 1, . . . , n, we define the following special polynomials si ∈ k[x1 , . . . , xn ]:
s1 = x1 + x2 + · · · + xn
s2 = x1 x2 + x1 x3 + . . . + xn−1 xn
..
.
X
sk = xi1 xi2 . . . xik
1≤i1 ≤i2 ≤...≤ik ≤n
..
.
sn = x1 x2 · · · xn
In words, we can say that the k th symmetric function is simply the sum of all degree k
monomials with no repeated variables.
The point of this definition is that the functions si precisely encode the relationship between
the roots of a polynomial and its coefficients. By some straight-forward high school algebra,
you can check:
n
Y
(x − αi ) = xn − s1 (α1 , . . . , αn )xn−1 + s2 (α1 , . . . , αn )xn−2 − . . . + (−1)n sn (α1 , . . . , αn ),
i=1
This means that finding the roots of a given polynomial f (x) = xn + a1 xn−1 + . . . + an (at
least under the assumption that f (x) has n distinct roots - later we’ll see this assumption is
ROOTS AND SYMMETRIC POLYNOMIALS 3
can work our way up the lexicographic ordering until there are monomials left! At that point,
we have expressed f as a polynomial in the symmetric functions.
Proof. First, we reduce to the case that f is homogenous. We claim that if we know the
fundamental theorem for homogenous symmetric functions, then we know the fundamental
theorem for all symmetric functions. To see this, let f be any symmetric function and write
f = f1 + . . . + fm , where each fi is homogenous of degree i. If f is symmetric, then each fi
must be as well (because the action of Sn preserves the degree of each monomial of f ). If we
know the fundamental theorem for homogenous symmetric functions, then we can write each
fi as a polynomial in elementary symmetric function. But then we clearly get a representation
of f as a polynomial in elementary symmetric functions as desired.
Now let f be a homogeneous symmetric function and let L(f ) = cxk1 . . . xkn . By Lemma
2.10, we know that k1 ≥ k2 ≥ . . . ≥ kn . We claim that there exists a monomial in the
symmetric functions, say csl11 sl22 . . . slnn such that
L(f ) = L(csl11 sl22 . . . slnn ).
To check this, we need to investigate the lowest monomials of the elementary symmetric
functions. By the definition of the elementary symmetric functions, one easily checks that:
L(si ) = x1 x2 . . . xi .
It follows that
l +l2 +...+ln−1
L(csl11 sl22 . . . slnn ) = cxl11 +l2 +...+ln x21 . . . xlnn .
Thus, in order to get L(f ) = L(csl11 sl22 . . . slnn ), we simply need to find non-negative integers
l1 , l2 , . . . , ln such that
l1 + . . . + ln = k1
l1 + . . . + ln−1 = k2
..
.
ln = kn .
Happily, the condition k1 ≥ k2 ≥ . . . ≥ kn guarantees that we can do this. Indeed, we
simply set ln = kn and li = ki − ki+1 for i = 1, . . . , n − 1. With this choice of li , we have
L(f ) = L(csl11 sl22 . . . slnn ) as desired.
Now we are basically done. If we let f 0 := f − csl11 sl22 . . . slnn , then f 0 is a symmetric function
with L(f 0 ) > L(f ). Thus, we can simply replace f by f 0 and repeat this procedure. As we do
this, we will subtract off multiples of monomials of the symmetric functions to get a sequence
of functions f, f 0 , f 00 , . . . with higher and higher lowest monomials. The only way this process
can terminate is to have f k = 0 for some k. At that point, we have an equation expressing f
as a sum of monomials of elementary symmetric functions, i.e. a polynomial in the elementary
symmetric functions.