1--arithmetic-geometry-topology
1--arithmetic-geometry-topology
Plane
In this chapter, we will study the basic properties of the field of complex numbers. We
will begin with a brief historic sketch of how the study of complex numbers came to be and
then proceed to develop tools needed to study calculus on the complex plane. We will also
give several applications of introducing complex numbers to solving classical problems from
geometry and trigonometry.
(i) L and P intersects at two points. This corresponds to (1.1) yielding two real solutions.
In this case D > 0.
(ii) L and P intersects at one point. In this case D = 0.
(iii) L and P do not intersect at all. In this case D < 0.
Thus, when D < 0, the fact that P and L do not intersect is reflected in the fact that (1.1)
are complex numbers. This shows that there is absolutely no reason to introduce complex
numbers in the study of quadratic equations. Cardano was perfectly justified in dismissing
1
1 Arithmetic of the complex plane
x3 = 3px + 2q (1.2)
is given by q q
3 3
p p
x = q + q − p + q − q2 − p3 .
2 3 (1.3)
Any arbitrary cubic x3 + ax2 + bx + c can be transformed by a linear change of coordinates
to an equation of the form (1.2).
Proof. It is an exercise for you to prove that an arbitrary cubic can indeed be transformed to
an equation of the form (1.2).
To solve (1.2), we first set x = u + v. Expanding the LHS of (1.2), we get
u3 + v3 + 3uv(u + v) = u3 + v3 + 3uvx.
Equating the above equation with the RHS of (1.2), we see that p = uv and that u3 + v3 = 2q.
Eliminating v, we end up with the equation
p3
u3 + − 2q = 0.
u3
This is a quadratic in u3 whose solutions are given by
p
u3 = q ± q2 − p3 .
By symmetry, v3 has the exact same solutions. As u3 + v3 = 2q, without loss of generality, we
can take as the solutions for u3 and v3 as
p
u3 = q + q2 − p3
p
v3 = q − q2 − p3
Geometrically, solving for the cubic (1.2) is equivalent finding the intersection of the cubic
with the line L given by the equation y = 3px + 2q. Note that a cubic equation always has at
least one real root (why?). This means that the formula (1.3) must always yield at least one
2
1 Arithmetic of the complex plane
real number. It was Bombelli who realized that there is something strange about the formula.
He considered the cubic x3 = 15x + 4 which has solutions
√3
√
3
x = 2 + 11i + 2 − 11i,
here we are freely using complex notation which I am assuming you are already familiar with.
The above does not seem to be a real number at all. But, Bombelli had a “wild thought”. By
√
guessing, he realized that x = 4 solves the cubic. So he assumed that 3 2 + 11i is an expression
√
of the type 2+ui and 3 2 − 11i is an expression of the form 2−ui, so that x = 2+ui+2−ui = 4.
Of course, for this to make sense Bombelli assumed that the ordinary laws of algebra are true
for the complex numbers, i.e.,
a + ib + a 0 + ib 0 = (a + a 0 ) + i(b + b 0 ).
where we are using i2 = −1. Expanding out (2 + ui)3 using the above rule we get
3
1 Arithmetic of the complex plane
z = x + iy
y = Im(z)
r = |z|
θ = arg(z)
0 x = Re(z)
z = x − iy
to prove inequalities.
X
n X
n
zi 6 |zi |,
i=1 i=1
||z| − |w|| 6 |z ± w|,
|z1 · · · zn | 6 |z| · |w|.
With the sum and product defined as in the previous section, C is a commutative field. The
inverse of the number z 6= 0 is given by z−1 = 1z = |z|z2 .
We now want rigorously define the the argument arg(z). To do this we will have to first
introduce trigonometric functions. The correct way to do this is to use power series which
we shall indulge in at a later time. For the time being, we recall the definitions in terms of
circular functions that you are no doubt familiar with from calculus.
First of all, we define 2π to be the length of the circumference of the circle of radius 1.
Denote the unit circle by T. We now define a map 1t : R → T as follows: for t > 0, 1t is the
point on T obtained by starting at the point 1 ∈ T and moving a distance t counter-clockwise;
for t < 0 one does the same thing but clockwise. Note that, by definition, 1t is a 2π-periodic
4
1 Arithmetic of the complex plane
function. We define
sin t := Im1t , cos t := Re1t .
All the familiar identities of trigonometry can easily by derived by using the above definition.
The fact that sin2 t + cos2 t = 1 is now obvious as (cos t, sin t) lies on the unit circle. Other
identities such as the following fundamental identities:
1s · 1t = 1t+s ,
which says that 1t is a group homomorphism from the additive group (R, +) to the multi-
plicative group (T, ·) with kernel 2πZ. We immediately get the de Moivre’s formula (cos t +
i sin t)n = cos nt + i sin nt. Classical trigonometric identities such as formulas for cos 2t, sin 2t,
etc., easily follow. One also gets the usual differentiation rules for sin and cos easily.
If 0 6= z ∈ C, then z/|z| ∈ T, which implies that we can find θ ∈ R such that 1θ = z/|z|.
Sometimes we will be sloppy with notation and let arg z denote the set {θ + 2πk, k ∈ Z}
the set of all possible arguments of z. The argument of z is uniquely determined modulo an
integer multiple of 2π. The definition of arg z combined with (1.4) easily implies that
1
arg zw = arg z + arg w, arg = arg z = − arg z.
z
We define the angle between z and w when z, w 6= 0 as the angle that goes from z to w.
i.e., arg w − arg z. This is the same as arg wz and arg wz . Note that these are oriented angles
and might be negative.
One can determine Arg z by using the inverse tangent function arctan as follows. Recall that
the arctan function is a bijective continuous function from R to (−π/2, π/2). If z = x + iy, x >
0, then Arg z = arctan yx . If z = iy, y > 0, then Arg z = π/2. In the second quadrant
z = x + iy, x < 0, y > 0, Arg z = π + arctan yx and in the third quadrant z = x + iy, x < 0, y <
0, Arg z = π − arctan yx . Finally, if z = iy, y < 0, then Arg z = −π/2.
We are now in a position to give a geometric interpretation of complex multiplication. We
represent the complex number z using the polar representation as rθ , where r = |z| and
θ = arg z. Then as |zw| = |z| · |w| and arg zw = arg z + arg w, it follows multiplying w z is same
as dilation by the real number |z| composed with a rotation by angle θ (the order in which
the dilation and rotation is performed is irrelevant). In particular, z itself can be written as
|z| · 1arg z . Very soon, we shall give a better way to represent complex numbers in terms of the
exp function.
5
1 Arithmetic of the complex plane
z = x + iy
z x y
|z| = |z| + |z|
θ = arctan yx
This means that |z| = r1/n and arg z = nθ + 2π nk . Note that two different values of k, say
k1 , k2 will give the same value for arg z iff k1 − k2 is an integer multiple of 2π. Thus there are
n-distinct solutions, corresponding to k = 0, 1, . . . , n − 1.
The n-th roots of unity form a cyclic group of order n generated by the element 12π/n which
√
is called the n-th primitive root. For x > 0, x will always denote the positive square root.
√
If 0 6= z ∈ C, then z := |z|1 Arg z . This is called the principal branch of the square root.
p
√ 2
Similarly, n z := n |z|1 Arg z .
p
n
6
1 Arithmetic of the complex plane
zn = z| ·{z
· · z}
n-times
−n −1
z = z| · · z−1}
·{z
n-times
1 n
Taking rational powers is more complicated. If q = m n
, then zq := z m . Note that this a
set and not a single number. The set does not depend on the particular representation of q.
It is easy to see that as sets, 1 n 1
zm = (zn ) m .
Furthermore, if q = m n
is in its reduced form and z = |z|1θ then in polar representation, zq is
the set of m distinct values
p
m
|z|n 1 n θ+k 2πn , k = 0, . . . , m − 1.
m m
simply because the LHS is a number whereas the RHS is a set. Think on how to rephrase the
law of exponents to make it true in our situations.
Definition 4 (Ordered Field). A field K with a total ordering 6 is said to be an ordered field
if it satisfies
• if a 6 b then a + c 6 a + c, ∀c ∈ K,
• if 0 6 a and 0 6 b then 0 6 a · b.
Proof. Assume to the contrary that 6 makes C into an ordered field. Then either i > 0 or
i < 0. Suppose i > 0. Then i2 = −1 > 0. Adding 1 to both sides, we see that 0 > 1. On
the other hand −1 > 0 implies that (−1)(−1) = 1 > 0. This is a contradiction. An analogous
argument works for i < 0.
7
2 Geometry and Topology of C
Another natural question is the following: view C as sitting in R3 as the first two coordinates;
Can we give R3 the structure of a commutative field so that C is a subfield?
Proof. R3 is obviously a vector space. Assume also that we have defined a multiplication · on
R3 that makes it a commutative field that extends C. Denote the basis vectors (1, 0, 0), (0, 1, 0)
and (0, 0, 1) by 1, i, j, respectively. With this notation, i2 = −1. Let us compute ij. Suppose
ij = a + bi + cj, a, b, c ∈ R. Observe first that
So,
−j = ai − b + c(ij) = ai − b + c(a + bi + cj).
Equating coefficients, we see that c2 = −1 which is absurd.
1. T is R-linear,
2. T commutes with multiplication by i, i.e., T (iz) = iT (z)∀z ∈ C.
It turns out that these two conditions are sufficient to guarantee C-linearity of T . Suppose
T is C-linear, then T (z) = T (1)z and therefore T is just multiplication by a scalar. Writing
T (1) = (a, c) and T (i) = (b, d), we see that the matrix of T under the standard basis is:
a b
c d
This means that T (x, y) = (ax + by, cx + dy), setting x = z+z 2 , y = 2i and performing a
z−z
8
2 Geometry and Topology of C
9
2 Geometry and Topology of C
2.3 Curves
A substantial portion of our course will involve path integrals. We recall here some basic
notions about curves and paths to set the stage for later.
We will often denote the image of the curve γ called the support of γ by γ∗ but often we
shall identify both γ and γ∗ . We imagine γ∗ to be a continuous thread in space but that
picture is incorrect as there are curves γ : [0, 1] → [0, 1] × [0, 1] such that γ∗ = [0, 1] × [0, 1]!
In this context, it is appropriate to state the Hahn–Mazurkiewicz theorem which characterizes
all possible continuous images of the unit interval.
We will often call a closed curve that is injective expect at the endpoints a Jordan curve.
Such a curve has no self-intersections except at the endpoints. Note that that a Jordan curve
is homemorphic to the unit circle S1 .
Our main focus in this course will be curves that are “sufficiently nice” to allow us to integrate
continuous functions along the curve. To this end, we make the following
We will visualize paths as being trajectories of particles moving in the plane. In this visualiza-
tion, the derivative γ 0 (t) which is geometrically the tangent to γ at the point γ(t) is nothing
but the velocity of the moving particle at time t.
Example 12. We have already encountered one example of a Jordan curve. The curve 1t |[−π,π]
is Jordan curve that is also regular. This curve is in fact of class C∞ . Such a curve is called a
smooth curve. The derivative of this curve
Notice that geometrically γ 0 (t) is the tangent to γ at the point γ(t). Thus (2.1) says that the
tangent to 1t at the point t is orthogonal to the vector 1t (t). This is because multiplication
by i is rotation by π2 .
10
2 Geometry and Topology of C
11
2 Geometry and Topology of C
2eiθ θ
θ
θ image of 1
θ
θ 1
Example 14. Another illustration off the power of complex notation is to find the equation of
a spiral. Let R(t) be an increasing continuous positivie function such that limt→−∞ R(t) = 0
and limt→∞ R(t) = ∞, then it is easy to see that
γ(t) = R(t)eit
We will now talk about reparametrization of curves. Given a curve γ, there could be many
other curves whose image coincides with γ∗ . For, instance the unit circle is the image of both
2
the curves γ1 (t) = e2πit and γ2 (t) = e−2πit . γ1 traverses the circle counter-clockwise at unit
speed whereas γ2 clockwise with speed 2.
1. Γ ∗ = γ∗ ;
2. There exists a continuous strictly monotone map Φ : [a, b] → [c, d] such that Γ ◦ Φ = γ.
12
2 Geometry and Topology of C
X
n−1
L(P) := |γ(ti+1 ) − γ(ti )|.
i=0
P should be thought of as an inscribed polygon on γ with length L(P). The length of the curve
If L(γ) < ∞ then γ is said to be rectifiable. Note that when if γ is piecewise C1 then it is
rectifiable and Zb
L(γ) = |γ 0 (t)|dt.
a
This integral is well-defined as is continuous on [a, b] except for finitely many points.
|γ 0 (t)|
Note also that the integral above is invariant under a change of parametrization.
If γ is regular then defining Zt
s(t) := |γ 0 (x)|dx,
a
we see by the fundamental theorem of calculus that s 0 (t) exists and is equal to |γ 0 (x)|. Fur-
thermore as |γ 0 | is a nowhere vanishing function, s(t) is a strictly increasing function. This
means that s : [a, b] → [0, L] is a strictly increasing C1 function and thus setting Ψ = s−1 , we
see that Γ = γ ◦ Ψ−1 is a reparametrization with the property that the length Γ |[0,t] = t. It
is also easy to see that Γ is a unit-speed curve. This parametrization is called the arc-length
parametrization.
2.4 Branches of the argument function and the index of a closed curve
As we have mentioned before, the function Arg : C \ {0} → (−π, π] is not continuous on any
point of the negative real axis. However, it is continuous at any other point in C \ R− .
Definition 18. Let E ⊂ C be a set that does not contain 0. Then any continuous mapping
g : E → R that satisfies g(z) ∈ arg z is called a continuous branch of the argument in E.
13
2 Geometry and Topology of C
Example 19. On any circle C centred at the origin, there are no continuous branches of the
argument. If not, then let g : C → R be a branch of the argument and let a ∈ R be the point
on the negative real axis that intersects C. Then on the connected set C \ {a}, we can find a
k ∈ Z such that Arg z + 2πk = g(z). As the value of Arg jumps at the point a, we see that g
is discontinuous at a, a contradiction.
Example 20. If L is any ray starting from the origin, say L := Arg z = α, all branches of arg z
are given by
arg z = Arg(z · ei (π − α) − π + α + 2πZ.
This is derived as follows. We first rotate by π − α so that the ray L coincides with the
negative real axis. Now we find the value of Arg and subtract π − α to compensate for our
initial rotation.
Example 21. Let D := D(a, r) be a disk that misses the origin, then we can find a continuous
branch of arg on D. To see this, consider a (possibly) larger disk D(a, r 0 ) such that 0 ∈
∂D(a, r 0 ). Then let L be ray starting at the origin tangent to ∂D(a, r 0 ) at 0. Then any branch
of arg on C \ L works.
Definition 22. Let X be a connected topological space and let f : X → C \ {0} be a continuous
function. A continuous branch of the argument of f is any continuous function h : X → R
such that h(x) ∈ arg f(x) ∀x ∈ X.
One important case is when X = E ⊂ C and f is the identity function in which case we
recover the definition of a continuous branch of arg. We will also consider branches of the n-th
root of f : E → C \ 0 : any continuous function h : E → C such that hn = f. If g : E → R
is branch of the argument, then h = |f|1/n eig/n is branch of the n-th root of f. The following
example illustrates that the converse is not, in general, true.
Example 23. Let E = C \ [−1, 1] and let f(z) = z2 − 1. Note that the image √ of z − 1 is
2
contained in (−∞, 0] precisely when z ∈ [−1, 1] or the imaginary axis. We take z2 − 1 using
the principal square root on the right half-plane and the negative of the principal square root
on the left half-plane.
γ(t) = R(t)eiφ(t) .
Of course, R(t) = |γ(t)| which is a continuous function as γ(t) 6= 0. For each t ∈ [a, b], we
must select φ(t) ∈ arg γ(t) in a continuous way. This is of course possible, if we can find a
branch of arg on γ∗ . But it actually suffices to find a branch of the argument of γ.
14
2 Geometry and Topology of C
Theorem 24. If γ : [a, b] → C is a curve that misses the origin, then we can find a
continuous branch of the argument of γ, say φ(t). Any other branch of the argument of
γ is of the form φ + 2πZ. Moreover, if γ is differentiable at the point t ∈ [a, b], then so
0 (t)
is φ and φ 0 (t) = Im γγ(t) .
Proof. Observe that on any disk in C that does not contain the origin, we can define a
continuous branch of the argument. Let r := dist(γ∗ , 0). As γ∗ is compact, r > 0. Finitely
many sets of the form γ−1 (D(t, r) cover [a, b] by the compactness of [a, b]. Hence, we can
find a partition a = t0 < t1 < · · · < tn = b such that each γ([ti−1 , ti ) is contained in a disk
of the form Di := D(t, r). Let gi be a branch of arg on Di . Then φi = gi ◦ γ is a branch of
arg(γ(t), t ∈ [ti−1 , ti ]. At the point ti , φi (ti ) might not be equal to φi+1 (ti ). Nevertheless,
their difference is an integer multiple of 2π which we add to φi+1 . By gluing lemma for
continuous functions, this allows to construct a global branch of the argument of γ.
Now, suppose γ is differentiable at the point t. We may, without loss of generality, assume
that γ(t) lies on the upper half-plane. This in a neighbourhood of t, φ(t) = Arg γ(t) + 2πk
for some k ∈ Z. Thus, φ 0 (t) exists and is equal to
Example 25. The curve γ(t) = eit , t ∈ [0, 2π] has a continuous branch of the argument given
by h(t) = t, but clearly no branch of arg can exist on the circle T.
Now, let γ : [a, b] → C \ {0} be a curve that misses 0 and let φ be a branch of the argument
of γ. Then φ(b) − φ(a) does not depend on the choice of φ as any two choices differ by an
element of 2πZ. This is called the variation of the argument and denoted ∆γ arg. If, in
addition, γ is a path, then
Zb 0
γ (t)
∆γ arg = Im dt,
a γ(t)
by the fundamental theorem of calculus and the previous theorem.
Definition 26. For a closed curve γ : [a, b] → C \ {0}, we define the index or the winding
∆ arg
number of γ with respect to 0, Ind(γ, 0) by the number γ2π .
Intuitively, the winding number measures the number of rotations around 0 a particle moving
along γ makes. One can easily define this notion for more general cases.
Definition 27. Let γ : [a, b] → C \ {z} be a closed curve. Then, we define the index of γ with
respect to z, Ind(γ, z) by
Ind(γ, z) = Ind(γ − z, 0).
15
2 Geometry and Topology of C
Now, suppose γ = R(t)eiφ(t) is a path that misses the origin. Then γ 0 (t) = (R 0 (t) +
iR(t)φ(t))eiφ(t) . Thus
γ 0 (t)
0
R (t) 0
= + iφ (t) .
γ(t) R(t)
Evaluating
Zb Zb Zb
γ 0 (t) γ 0 (t) γ 0 (t) |γ(b)|
dt = Re dt + i dt = log + i∆γ arg .
a γ(t) a γ(t) a γ(t) γ(a)
The above integral expressions clearly show that whenever γ is a path, the function Ind(z, γ)
as a function of z on C \ γ∗ is a continuous function.
Now, let γ : [a, b] → C be a closed curve. Then C \ γ∗ is an open set and hence has only
countably many connected components. On each of these connected components, the function
Ind(·, γ) is a constant function. Moreover, C \ γ∗ has only one connected component that
is unbounded. This is because γ∗ is compact and is hence contained in some large disk D.
Thus the unique unbounded component is the one that contains C \ D. We claim that on the
unbounded component, Ind(·, γ) is zero. We know that it is constant, so it suffices to pick
z ∈ C \ D and show that Ind(z, γ) = Ind(γ − z, 0) = 0. But γ − z ⊂ D − z and D − z is a disk
centred at z that misses the origin. This means that we can find a branch of arg on D − z and
hence ∆γ−z arg = 0 proving our claim.
2.6 Applications
Our first application will be to find an expression for cos4 t. The straightforward method is to
just compute:
which gives
1
cos4 t = (cos 4t + 4 cos 2t + 3).
8
16