ELE00038C 2023 24 Part I
ELE00038C 2023 24 Part I
Part I
2023-2024
[ELE00038C] J. J. Bissell
3
Preface
The purpose of this course is to introduce those mathematical tools that are
most relevant to solving practical problems in engineering and the physical
sciences. Where feasible, core concepts will be motivated using ‘real-life’ con-
texts; however, to develop competence and fluency we will often need to ‘drill’
technique using examples that are in themselves somewhat abstract.
Course delivery
Mathematics is (and always has been) a subject that is best learned by doing,
and our emphasis will be on active participation through problem solving. This
approach will require more effort, but it should also be more effective, and
ultimately more rewarding.
Lectures
Your timetable should include four slots each week. Although these are listed
as ‘lectures’, they will—as far as possible—be run more like workshops, with
lots of examples and exercises, with a basic structure as follows:
3. Repeat.
In this way I hope to maximise our overall contact time, and help to set the
pace of learning. Remember to bring pencil and paper with you.
Workshops
In addition to the lecture sessions, I will be holding a weekly workshop (as per
your timetable). These workshops will mainly involve practicing techniques by
solving problems on the course exercise sheets, but they can also be used as
an opportunity for you to ask me further questions about course material.
In previous years students have also set-up and used their own online forum to
discuss course content (e.g., via Discord). I encourage this kind of collaborative
engagement; however, remember that you are all coming from different back-
grounds, with different life experiences, so please be respectful of one-another.
Exercise sheets
Almost all of the techniques that we will be studying in this course can be mas-
tered by practice, and it is essential that you work on course exercise sheets
throughout term. Our ultimate goal is skilled creative work in applied con-
texts; however, to reach that point we must first hone our craft, and this will
often mean working on more abstract problems. As a rule-of-thumb, keep
mathematically fit by solving at least 10 exercise problems each week.
These notes have been written as an independent guide, and (in principle)
you should be able to learn the course content by studying each section, and
completing the worked examples, and course exercises. Reading mathematics
requires active participation, so keep pencil and paper to hand to check the
steps in an argument. For example, a mathematical statement like
x2 + 3x + 2
f (x) = , (x 6= −2) ⇒ f (x) = x + 1
x+2
might not seem obvious; however, if you have a pencil and paper to hand, then
you can confirm that it is correct by noting that
x2 + 3x + 2 (x + 1)(x + 2)
f (x) = = = x + 1. (0.1)
x+2 (x + 2)
Assessment
The course will be assessed by two components with the following weightings:
Further information about the format and timings of the multiple choice quizzes
is summarised on the Virtual Learning Environment.
Unless explicitly marked ‘(optional)’ any section in these notes could be ex-
amined. However, it would not be a very fair examination if I were to set very
difficult problems, or ask for long, and technical derivations. Clearly I cannot
tell you what will be on the exam., but I can tell you that I will try to be fair.
The best guide to the sorts of questions you might be asked in an examination
is to look at past papers (see the course wiki-pages). If you can solve most of
the course exercises, then you will do very well in the exam. - so practice!
Further reading
The simplest of these is probably Gilbert and Jordan, but the other two texts
will also be useful for second year mathematics and beyond.
1. The first rule of Maths Club is: you do to talk about Maths Club.
2. The second rule of Maths Club is: you do not talk about Maths Club.
3. Third rule of Maths Club: If someone yells “stop!”, goes limp, taps out,
or fails to draw a diagram, then the maths is over.
8. The eighth and final rule: If this is your first time at Maths Club, then
you have to math.
It has always taken a certain amount of skill to navigate university life, but
things have arguably become more difficult over recent years. Let us try to
remember, then, that fundamentally we are here as a community of scholars to
engage joyfully with the process of learning and discovery. So let’s treat each
other with goodwill, and give it our best shot.
J. J. Bissell
(Summer 2023)
1.1.1 Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.1.2 Intervals . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.4 Polynomials . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
1.4.2 Quadratics . . . . . . . . . . . . . . . . . . . . . . . . 21
1.6.1 Pythagoras . . . . . . . . . . . . . . . . . . . . . . . . 35
7
8 CONTENTS
1.6.3 Identities . . . . . . . . . . . . . . . . . . . . . . . . . 38
2 Differentiation 59
2.1.1 Limits . . . . . . . . . . . . . . . . . . . . . . . . . . 60
2.1.2 Continuity . . . . . . . . . . . . . . . . . . . . . . . . 63
2.5.3 Convergence . . . . . . . . . . . . . . . . . . . . . . . 93
3 Integration 101
13
14 Section 1 : Functions and Algebra
The most important functions in engineering and the physical sciences are func-
tions of numbers, and to discuss how they work we must first define the basic
terminology of numbers. Here we review sets of numbers (§1.1.1), especially
the real numbers, and subsets of real numbers defined by intervals (§1.1.2).
1.1.1 Sets
A = {a1 , a2 , . . . , an }. (1.1)
B = {b1 , b2 , . . . }, (1.2)
N = {1, 2, 3, . . . }. (1.3)
We can extend the natural numbers to include the number 0, and the negative
numbers −1, −2, −3 etc.; this new set is known as the set of integers, and
denoted Z, thus
For example, every natural number is also an integer; this means that
N ⊂ Z, (1.7)
Two other important sets are the rational numbers, and the real numbers:
• The set of rational numbers is denoted Q; these are the numbers formed
by quotients of integers and natural numbers, e.g., 12 , and − 53 .
• The set of real numbers is denoted R; these are the numbers that can
√
be approximated by rationals to arbitrary precision, e.g., π, e, and 2.
Real numbers R are essentially those numbers that can be expressed in decimal
form, irrespective of whether such representation requires an infinite number of
digits. Hence, the rational numbers Q, the integers Z, and the natural numbers
N are all subsets of the real numbers R. Example real numbers include:
(see figure 1.1). A real number R that is not a rational number Q is known as
√
an irrational number, e.g., 2, π, and e.
The set R is often referred to as the real line since any real number x ∈ R
can be represented as a point on a straight line. The reference point on the
real line is the number zero, with positive numbers (x > 0) to the right, and
negative numbers to the left (x < 0), as depicted in figure 1.1.
The relationship between the natural numbers N, the integers Z, the rational
numbers Q, and the real numbers R can be expressed using subset notation as
N ⊂ Z ⊂ Q ⊂ R. (1.9)
Historically speaking it has been necessary to introduce ever more general sets
of numbers to solve ever more general problems. To this end we introduce a
further set of numbers called the complex numbers C in Section 4.
Example 1.1 Let A be the set defined by A = {1, 2, −3, 8}. Which elements
of A are less than 2?
Example 1.2 Let A be the set defined by A = {1, 2, −3, 8}. Which of the
following statements are true?
1.1.2 Intervals
−1≤x≤6 (1.10)
is the set of all the real numbers x ∈ R that are greater than or equal to −1,
and less than or equal to 6, where −1 and 6 are the intervals endpoints.
An interval that includes its endpoints is known as a closed interval. In
contrast, an interval that excludes its endpoints is known as an open interval.
a ≤ x ≤ b. (1.11)
a < x ≤ b. (1.13)
a ≤ x < b. (1.14)
Intervals without endpoints can be denoted using the symbol ∞ for infinity;
e.g., the interval [0, ∞) means the set of numbers x that satisfy 0 ≤ x.
I Solution: The intervals are: (a) (−2, π]; (b) [−1, 9]; and (c) (−∞, −3). J
• The mass M of carbon dioxide emitted by the United Kingdom each year
Y since 1950 can be recorded in a table, where Y > 1. For each year Y
the mass M is well defined, and we say that M (Y ) is a function of Y
(even though there is no obvious formula for computing M given Y ).
Figure 1.2: Analogy between a function f , and a machine with inputs and outputs.
We are typically concerned with functions whose variables x and y are real
numbers, i.e., x, y ∈ R. Such a function f (x) returns a unique value y = f (x)
for each x in its domain, and can therefore be visualised as a set of points
(x, y) plotted on perpendicular x and y-axis known as the Cartesian plane.
As an example, consider the function f (x) defined by the rule
1
f (x) = , for x 6= ±1, (1.16)
x2 −1
which is plotted in figure 1.3. Here the domain of f (x) is not stated explicitly,
but by convention we can say that the domain is the set of numbers x ∈ R
for which the rule applies. Since division by 0 is not defined, the function f (x)
is undefined when (x2 − 1) = 0, i.e., when x = ±1. Indeed, because the
magnitude of f (x) becomes infinitely large as x approaches either x = 1 or
x = −1, we say that f (x) diverges at x = ±1.
4
3
2
1
0
-1
-2
-3
-4
-3 -2 -1 0 1 2 3
p √
Example 1.4 The functions f (x) = 2x − 1 and g(x) = f (x) = 2x − 1
are sketched in figure 1.4. Determine the domains of (i) f (x) and (ii) g(x).
I Solution: In each case we take the domain of the function to be the set of
values of x ∈ R for which the rule of the function applies.
2 3
1 2
0 1
-1 0
-2 -1
-1 0 1 2 3 -1 0 1 2 3
(a) (b)
p √
Figure 1.4: Two functions: (a) f (x) = 2x − 1; and (b) g(x) = f (x) = 2x − 1.
1.4 Polynomials
f (x) = c, (1.18)
y = f (x) = bx + c, (1.19)
where b, c ∈ R are both constants. The line intersects the y-axis when x = 0,
that is, when y = c; the constant c is therefore known as the y-intercept.
The constant b determines how ‘steep’ the line is, and is called the gradient.
the y-inctercept is thus (0, −1). The intersection with x-axis occurs when
y = 0, i.e., when x = 12 . The intersection with the x-axis is thus ( 12 , 0). J
1.4.2 Quadratics
10 10
8 8
6 6
4 4
2 2
0 0
-2 -2
-4 -4
-4 -3 -2 -1 0 1 2 3 4 -4 -3 -2 -1 0 1 2 3 4
(a) (b)
Figure 1.5: Two quadratics: (a) f (x) = x2 − x − 2; and (b) g(x) = 4x2 + 12x + 9.
The roots of a quadratic are those values of x for which f (x) = 0; in this way
the roots of a quadratic are the solutions to the quadratic equation
I Solution: The roots are those values of x for which f (x) = 0, and g(x) = 0.
In this case the roots may be found neatly by factorising.
(a) The roots of f (x) = x2 − x − 2 occur when
x2 − x − 2 = (x + 1)(x − 2) = 0. (1.24)
I Solution: The roots of f (x) are the values for x that satisfy f (x) = 0, i.e.,
2 2 b c
ax + bx + c = 0 ⇔ x + x+ =0 (1.29)
a a
2 2
b b c
⇔ x+ − + = 0, (1.30)
2a 2a a
2
b2 − 4ac
b
⇔ x+ = , (1.31)
2a 4a2
√
b ± b2 − 4ac
⇔ x+ = , (1.32)
2a √ 2a
−b ± b2 − 4ac
⇔ x= , (1.33)
2a
as required. J
Notice that the degree of the polynomial function is the highest power of x.
Thus, a polynomial of degree 2 is a quadratic function, viz.
P (x) = a2 x2 + a1 x + a0 . (1.35)
P (x) = a3 x3 + a2 x2 + a1 x + a0 . (1.36)
Example 1.8 State the degree of the following polynomials: (a) f (x) = a0 ,
and (b) f (x) = a1 x + a0 , where a0 , a1 ∈ R are constants.
Note: These polynomials are both examples of straight lines (see §1.4.1). J
The roots of a polynomial P (x) are those values of x for which P (x) = 0;
thus, if x = α is a root of P (x), then
P (α) = 0. (1.38)
The roots of a polynomial are closely related to what are called the linear
factors of the polynomial. To illustrate this idea consider the cubic polynomial
Here P (x) can be factorised into the product of a linear factor and a quadratic
factor, viz.
P (α) = 0. (1.41)
where Q(x) is a polynomial of degree one less than the degree of P (x).
This result is a consequence of the remainder theorem (see §1.4.6)
In our example above the quadratic factor can also be written as the product
of linear factors, i.e, (2x2 + 3x + 1) = (2x + 1)(x + 1), hence
The polynomial P (x) thus has three real roots x = 1, x = − 21 , and x = −1.
We shall not prove this theorem, but observe that the note in parenthesis is
important: a polynomial of degree n does not necessarily have n distinct
real roots. To illustrate this consider the cubic polynomial defined by
The linear factor is (x + 3), meaning that x = −3 is one of the roots of Q(x).
However, the quadratic factor is (x2 + 1), so the other roots occur when
x2 + 1 = 0. (1.45)
Since x2 > 0 for all x ∈ R, this equation does not have any solutions that
are real numbers. Indeed to solve this equation we must introduce complex
numbers (see Section 4). It follows that Q(x) has only one real root x = −3.
(a) Given that P (3) = 0, write down one of the linear factors of P (x).
(b) Factorise P (x) into its linear and quadratic factor.
(c) Determine all the real roots of P (x).
(c) The quadratic factor may itself be factorised as (x2 − 4) = (x − 2)(x + 2);
hence, P (x) may be factorised as
2x2 + 3x + 1. (1.48)
x−1 2x3 + x2 − 2x − 1
− 2x3 + 2x2
3x2 − 2x
− 3x2 + 3x
x−1
−x+1
0
x2 − 2x − 1. (1.52)
x−1 x3 − 3x2 + x − 1
− x 3 + x2
− 2x2 + x
2x2 − 2x
−x−1
x−1
−2
Informally, if we put x = α into equation (1.55), then we see that the value of
the remainder must be
P (α) = R. (1.56)
Result 1.1 Two polynomials P (x) and Q(x) are equal to one another for
all values of x ∈ R if and only if
P (x)
f (x) = (1.60)
Q(x)
x3 + 2x + 3
f (x) = (1.61)
x4 + 2x3 − x − 4
is a proper rational function. A rational function that is not proper is said to
be improper (see Example 1.13).
Example 1.13 Use the fact that (x + 1)(x − 2) + 3 = (x2 − x + 1) to write
x2 − x + 1
g(x) = (1.62)
x−2
as the sum of a polynomial and a proper rational function.
x2 − x + 1 (x + 1)(x − 2) + 3 3
g(x) = = = (x + 1) + , (1.63)
x−2 (x − 2) (x − 2)
5x + 1 5x + 1
f (x) = = , (1.64)
x2+x−2 (x + 2)(x − 1)
where the factors in the denominator are (x + 2) and (x − 1). The basic idea
of partial fractions is to find some constants A and B such that f (x) can be
written in terms of these factors as
A B
f (x) = + . (1.65)
(x + 2) (x − 1)
5x + 1 A B
= + . (1.66)
(x + 2)(x − 1) (x + 2) (x − 1)
With these values for A and B, it follows from equation (1.66) that f (x) may
be written as
5x + 1 3 2
f (x) = = + , (1.70)
(x + 2)(x − 1) (x + 2) (x − 1)
ax2 +bx+c A
+ B
+ C
(x+α)(x+β)(x+γ) (x+α) (x+β) (x+γ)
ax2 +bx+c A
+ Bx+C
(x+α)(x2 +βx+γc) (x+α) (x2 +βx+γc)
ax2 +bx+c A
+ B
+ C
(x+α)(x+β)2 (x+α) (x+β) (x+β)2
Table 1.1: Trial forms for partial fractions given various kinds of rational function. The
constants a, b, c, α, β, and γ are knowns; the coefficients A, B and C are to be determined.
This table is not exhaustive, but hints at suitable partial fractions for other rational functions.
3x2 − x − 5
f (x) = (1.71)
x3 − 3x + 2
3x2 − x − 5
f (x) = . (1.72)
(x + 2)(x − 1)2
Thus, f (x) is a rational function of the kind listed in the final row of Table
1.1, suggesting that we seek partial fractions of the form
3x2 − x − 5 A B C
f (x) = = + + , (1.73)
(x + 2)(x − 1)2 (x + 2) (x − 1) (x − 1)2
The coefficient B may be found by selecting any other value for x, and noting
that A = 1 and C = −1. For example, setting x = 0 in equation (1.74) gives
3x2 − x − 5 1 2 1
f (x) = 2
= + − , (1.78)
(x + 2)(x − 1) (x + 2) (x − 1) (x − 1)2
Figure 1.6 depicts a circle of radius r = 1, i.e., a unit circle, centred at the
origin O of the x-y plane; the circumference of this circle is 2πr = 2π.
Let P be an arbitrary point on the circumference of the unit circle, and let θ
denote the angle that the line OP makes with the positive x-axis, with s as
the distance along the circumference of the circle from the x-axis to P .
The angle θ is measured in radians, which are defined such that there are
exactly 2π radians in a circle. Hence, by the equality of proportions
θ s
= , that is (since r = 1), θ = s. (1.79)
2π 2πr
The trigonometric functions sine and cosine may then defined as follows. The
cosine of the angle θ, denoted cos θ, is the value of the x coordinate of P .
Likewise, the sine of the angle θ, denoted sin θ, is the y coordinate of P , viz.
in this way the trigonometric functions are said to be periodic. Such periodicity
is clear from the plots of sin θ and cos θ in figure 1.7.
Figure 1.6: Unit circle. The point P has coordinates (x, y) = (cos θ, sin θ).
2 2
1 1
0 0
-1 -1
-2 -2
-2 -1 0 1 2 -2 -1 0 1 2
(a) (b)
Figure 1.7: Trigonometric functions (a) cos θ and (b) sin θ. The horizontal (abscissa) axis
is normalised to units of 2π radians; thus, one full cycle (θ = 2π) corresponds to θ/2π = 1.
Other properties of sin θ and cos θ include the following (see figure 1.7).
Our definitions of sin θ and cos θ allow us to further define the tangent (tan θ)
of an angle θ as
sin θ
tan θ = . (1.86)
cos θ
We also define the secant (sec θ), cosecant (csc θ), and cotangent (cot θ):
1 1 1
sec θ = , csc θ = , cot θ = . (1.87)
cos θ sin θ tan θ
These functions are plotted in figure 1.8.
4 4
2 2
0 0
-2 -2
-4 -4
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1
(a) (b)
4 4
2 2
0 0
-2 -2
-4 -4
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1
(c) (d)
Figure 1.8: Trigonometric functions (a) tan θ, (b) sec θ, (c) csc θ, and (d) cot θ.
1.6.1 Pythagoras
Returning to our diagram of the unit circle in figure 1.6, consider the right-
angle triangle with hypotenuse OP ; this hypotenuse has length 1. The side
adjacent to the angle θ has length x = cos θ, while side opposite the angle θ
has length y = sin θ. Thus, according to Pythagoras’s theorem we have
sin2 θ 1
1+ = , that is, 1 + tan2 θ = sec2 θ. (1.89)
cos2 θ cos2 θ
Similarly, we may show that cot2 θ + 1 = csc2 θ (see Exercise 1.15).
Example 1.15 Show that cot2 θ + 1 = csc2 θ.
cos2 θ 1
2 +1= , that is, cot2 θ + 1 = csc2 θ (1.90)
sin θ sin2 θ
as required. J
these equations are known as the compound angle formulae and hold for all
α, β ∈ R. Indeed, since cos(−β) = cos(β) and sin(−β) = − sin(β), we may
use equations (1.91) and (1.92) to write
tan α + tan β
tan(α + β) = (1.95)
1 − tan α tan β
where α, β ∈ R.
sin(α + β)
tan(α + β) =
cos(α + β)
sin α cos β + cos α sin β
= (1.96)
cos α cos β − sin α sin β
as required. J
Example 1.17 Use the compound angle formulae to show that
where α, β ∈ R.
as required. J
Other formulae may be obtained by repeated application of the double angle
formulae. For example, suppose that we define
A+B A−B
α= and β= (1.102)
2 2
where A and B are any two real numbers.
(α + β) = A and (α − β) = B (1.103)
Thus, putting equations (1.103) and (1.102) into equation (1.100) we obtain.
A+B A−B
cos A + cos B = 2 cos cos . (1.104)
2 2
where A, B ∈ R.
1.6.3 Identities
If two mathematical expressions are equal for all values of the arguments,
then we say that they are equivalent, or identical. Equations which convey
equivalence are known as identities, and use an equivalence symbol ‘≡’ in
place of the equality symbol ‘=’. For example, equation (1.88) holds for all
values of θ ∈ R, and may be written as the identity.
Two commonly used identities known as the double angle formulae may be
derived by considering the identities above.
(ii) Similarly, equation (1.112) means that cos2 θ ≡ 1 − sin2 θ; hence, putting
this relation into cos(2θ) ≡ cos2 θ − sin2 θ gives
as required. J
The most important identities derived in this section are summarised below.
We can also use functions to operate on other functions, this leads us to the
idea of composite functions (§1.7.1), and inverse functions (§1.7.2).
Given two functions f (x) and g(x), we may define a third function
In practice the form of a composite function h(x) = g(f (x)) is usually found
by replacing each instance of the variable x with f (x) in the rule for g(x).
Example 1.20 Let f (x) = x2 and g(x) = cos(x). Write out following com-
posite functions in terms of x: (a) h(x) = g(f (x)); and (b) l(x) = f (g(x)).
I Solution: (a) Taking the output of f (x) and using it as the input argument
of g yields
h(x) = g(f (x)) = cos(f (x)) = cos(x2 ). (1.122)
(b) Similarly, taking the output of g(x) and using it as the input argument of
f yields
l(x) = f (g(x)) = (g(x))2 = cos2 (x). (1.123)
Figure 1.10: Composite function h(x) = g(f (x)): the output of f used as the input of g.
then we say that g(x) is the inverse function of f (x), and write
1 1
f −1 (f (x)) = f (x) = (2x) = x. (1.126)
2 2
Here f (x) = 2x is said to be a one-one function because each value of x is
paired with a distinct value of y = f (x); thus, x can be inferred from the value
of y = f (x). The domain of f is the codomain of f −1 , and vice versa.
A function f (x) that maps several values of x to the same value of y = f (x)
is known as a many-one function. An example of a many-one function is
for instance, there are many values of x (e.g., x = 0, 2π, 4π, . . . ) that map
to the value y = cos(x) = 1. Thus, even if we know y = 1, we cannot infer
whether x = 0, or x = 2π, or x = 4π (etc.). Indeed, in general many-one
functions do not have unique inverses because the value of x cannot always be
inferred from the value of y = f (x).
Many-one functions can, however, be ‘converted’ to one-one functions by re-
stricting the domain. For example, the function defined by
1 8
6
0.5
4
0 2
0
-0.5
-2
-1 -4
-1 -0.5 0 0.5 1 -4 -2 0 2 4 6 8
(a) (b)
1
Figure 1.11: Inverse functions: (a) cos(x) and arccos(x); (b) 27 (x − 1)3 and 3x1/3 + 1.
1
f (x) = (x − 1)3 , (1.129)
27
and sketch both f (x) and f −1 (x) on the same axis.
Many functions can be classified as to whether they are odd or even (see
figure 1.12); such characteristics are known as symmetry properties
It may be shown that the product of two odd or even functions is also
either odd or even according to the following rules:
(a) (b)
Figure 1.12: Symmetry about the y-axis: (a) an even (symmetric) function f (−x) = f (x);
and (b) an odd (anti-symmetric) function f (−x) = −f (x).
Example 1.22 Classify the following functions in terms of whether they are
odd or even: (a) f (x) = x4 −4; (b) g(x) = sin(x3 ); (c) h(x) = (x4 −4) sin(x3 ).
When 8 has been written in this way we say that the base is 2 and the power
(or exponent) is 3. An equivalent way of expressing the relationship above is
log2 8 = 3, (1.139)
which reads as “the logarithm to base 2 of 8 is 3”; indeed, we say that “log
to base 2 of 8 is 3”. To emphasise,
are equivalent, i.e., the logarithm of unity (1) in any base is zero.
I Solution: (a) log8 512 = 3; (b) log3 81 = 4; and (c) log2 1024 = 10. J
Our definition of the logarithm means that it is a function. To see this let
a, x, y ∈ R, and consider the equivalent statements
Hence, if we know x, then we can compute y, and vice versa; that is, f and g
are inverse functions of each other, viz.
Notice that we have expressed the rule for g using y as the independent variable;
to express the rule using x as the independent variable we simply write
g(x) = ax . (1.148)
The functions f and g are plotted in figure 1.13 using bases a = 10 and a = e,
where e ≈ 2.718281828459 is called Euler’s number (see §1.9.4).
2 2
1 1
0 0
-1 -1
-2 -2
-2 -1 0 1 2 -2 -1 0 1 2
(a) (b)
Figure 1.13: Functions f (x) = loga x and g(x) = ax for bases: (a) a = 10; (b) a = e.
Example 1.24 Solve the following equations: (a) log3 x = 2; (b) log2 x = 4.
The following properties of logarithms are often useful for solving equations.
These laws are deduced from the rules for exponents (e.g., Example 1.26).
x = an , and y = am (1.152)
Now notice that the rules for exponents mean that xy = an am = an+m , so in
logarithm form (Definition 1.7) we have
as required. This equation is the first law of logarithms listed in Result 1.5. J
x2 ∼ 10n x1 , (1.156)
where ∼ means similar to (or ‘about the same size’). For example, the average
weight of a female human in the United Kingdom is mH ≈ 70kg, whereas the
average weight of a male African elephant is mE ≈ 6000 kg; thus,
mE ∼ 102 mH , (1.157)
Logarithms are extremely useful for revealing trends in data over several orders
of magnitude. For example, consider some kind of exponential dependence
y = y 0 ax , (1.158)
Thus, plotting log10 (y/y0 ) against x yields a straight line whose gradient m
can be used to infer the value of a (see Example 1.27 and figure 1.14). The
same visual effect can also be achieved using logarithmic axis. On logarithmic
axis numbers are spaced evenly according to their magnitude, e.g., the sepa-
ration between 1 and 10 is the same as the separation between 10 and 100.
Logarithmic axis are best understood by plotting some example data-series.
where n is the number of years since 1960, and N0 = N (0) (see figure 1.14).
10 8
12 10 10 10
10 8 10 8
8
6 10 6
6
4 10 4
4
2 2 10 2
0 0 10 0
0 10 20 30 40 50 60 0 10 20 30 40 50 60 0 10 20 30 40 50 60
(a) (b) (c)
n
√
Figure 1.14: Moore’s law:
√ (a) on standard axis as N/N0 = ( 2) ; (b) on standard
√ axis
as log10 (N/N0 ) = n log10 2; (c) plotted using logarithmic y-axis as N/N0 = ( 2)n .
and so on. In general, therefore, we have the exponential form of Moore’s law
√
after n years: N (n) = N0 ( 2)n . (1.163)
as required. These expressions for Moore’s law are plotted in figure 1.14. J
Logarithms can also be used to transform a power law into a linear relationship.
For example, let
y = y 0 xn , (1.165)
10 5 6
10 10 6
8 3 10 3
6
0 10 0
4
-3 10 -3
2
0 -6 10 -6 -2
0 20 40 60 80 100 -2 -1 0 1 2 10 10 -1 10 0 10 1 10 2
(a) (b) (c)
Figure 1.15: Power law y = x3 : (a) plotted on standard axis as y = x3 ; (b) plotted on
standard axis as log10 y = 3 log10 x; (c) plotted using logarithmic axis as y = x3 .
Base 10 is said to be a standard base in the sense that it is used very com-
monly. The other standard base is base e, where
∞
X 1
e= = 2.718281828459 . . . , (1.168)
n=0
n!
Result 1.6 The standard bases are base 10 and base e (see figure 1.13).
Logarithms in these bases are often written using the following notation:
f (x) = ex , (1.170)
where e is Euler’s number. There are many equivalent ways of defining the
exponential function, but in this course we favour a power series representation
(see Definition 1.8). We discuss power series in further detail in §2.5.
Hyperbolic functions
1
cosh(x) ≡ (ex + e−x ),
2
1
sinh(x) ≡ (ex − e−x ), (1.172)
2
sinh(x)
tanh(x) ≡ (1.173)
cosh(x)
1
sech(x) ≡ ,
cosh(x)
1
csch(x) ≡ , (1.174)
sinh(x)
1
coth(x) ≡ ; (1.175)
tanh(x)
8 10
6 5
4 0
2 -5
0 -10
-3 -2 -1 0 1 2 3 -3 -2 -1 0 1 2 3
(a) (b)
10 2
5 1
0 0
-5 -1
-10 -2
-3 -2 -1 0 1 2 3 -3 -2 -1 0 1 2 3
(c) (d)
Figure 1.16: Hyperbolic functions (a) ex , (b) sinh x, (c) cosh x, and (d) tanh x.
Definition 1.10 It may be shown (see, e.g., Example 1.29) that the hy-
perbolic functions have inverse functions given by
√
sinh−1 (x) ≡ arcsinh(x) ≡ loge x + x2 + 1 ,
−1
√
cosh (x) ≡ arccosh(x) ≡ loge x + x − 1 , 2 (1.176)
r !
1+x
tanh−1 (x) ≡ arctanh(x) ≡ loge . (1.177)
1−x
1 1
cosh2 (x) − sinh2 (x) ≡ (ex + e−x )2 − (ex − e−x )2
4 4
1 2x 1
≡ (e + 2 + e−2x ) − (e2x − 2 + e−2x )
4 4
≡1 (1.178)
as required. J
1
x = (ey − e−y ) ⇔ e2y − 2xey − 1 = 0. (1.179)
2
Now let ey = z, then e2y = z 2 , and equation (1.179) can be expressed as
z 2 − 2xz − 1 = 0. (1.180)
This equation is the rule for the inverse function of sinh x in Definition 1.10.J
is an explicit function. Not all functions can be expressed in this way; indeed,
it is sometimes necessary (or desirable) to express the relationship between two
variables x and y in the form
g(x, y) = c (1.184)
2
xy = 2 (implicit form) ⇔ y(x) = (explicit form) (1.186)
x
both describe the same relationship between x and y (they are equivalent).
i.e., (x, y) are points on the circumference of a unit circle (see figure 1.17).
Figure 1.17: Unit semi-circle parameterised by x(t) = cos t, y(t) = sin t, with t ∈ [0, π].
where a, b, and ω are constants. Show that particle follows an elliptical path.
2π
x(t + T ) = x(t) and y(t + T ) = y(t), where T = . (1.193)
ω
Thus, the particle returns to its original position when t is incremented by T ,
i.e., T is the time period for the particle to complete an orbit of the ellipse. J
Consider the line from the origin O to P . Let r denote the length of this line,
and let θ denote the angle that the line makes with the positive x-axis. We call
r the radial coordinate of P , and θ the azimuthal coordinate of P ; in this
way P may be expressed in plane polar coordinates as (r, θ). By inspection
of figure 1.19 we see that polar coordinates are related to Cartesians by
By restricting r and θ to the set of values r ∈ [0, ∞), and θ ∈ [0, 2π), the
plane polar coordinates (r, θ) of a point are unique.
√
Example 1.32 A point P has plane polar coordinates (r, θ) = (2 2, 3π
4
);
what are the Cartesian coordinates of P ?
Example 1.33 Polar coordinates and Cartesians are related according to equa-
tions (1.194) by x = r cos θ and y = r sin θ. Show that r2 = x2 + y 2 .
as required. J
r = f (θ), (1.197)
then the function f (θ) may be depicted using plane plane polar coordinates by
the curve produced by the locus of points (r, θ), with r = f (θ). Curves of this
kind are sometimes called polar curves; classic examples include:
It is sometimes possible to convert between polar form and Cartesian form using
the fact that the two coordinate systems are related by equations (1.194).
2 2
1 1
0 0
-1 -1
-2 -2
-2 -1 0 1 2 -3 -2 -1 0 1
(a) (b)
Figure 1.20: Example polar curves: (a) a unit circle r = 1; (b) a Cardioid r = 2(1−cos θ).
2 2
1 1
0 0
-1 -1
-2 -2
-2 -1 0 1 2 -2 -1 0 1 2
(a) (b)
Figure 1.21: More example polar curves: (a) an Archimedean spiral; (b) a rhodonea curve.
r = 2 cos θ. (1.199)
r2 = 2r cos θ. (1.200)
x2 + y 2 = 2x. (1.201)
It is not obvious what curve this is; however, progress may be made by adding
unity (1) to both sides to give
(x − 1)2 + y 2 = 1. (1.203)
This is the Cartesian equation for a circle of radius 1 centred at (1, 0). J
Differentiation
Our goal in this section is to review the main ideas and rules of differentiation,
so that these can be applied with confidence to problems in engineering and
the physical sciences. As with our work on functions, much of what follows
is revision of school mathematics, although we shall go somewhat deeper by
considering foundational aspects, such as limits and continuity. By the end of
the section you should be able to. . .
Learning outcomes:
59
60 Section 2 : Differentiation
2.1.1 Limits
The function f (x) depicted in figure 2.1 is defined for x ∈ R by the rule
2x for x ≤ 1
f (x) = (2.1)
x for x > 1;
because this rule is different for different parts of the domain, f (x) is known
as a compound function. Notice that as we approach x = 1 from the left
(i.e., with x < 1) that f (x) approaches the value f (x) = 2. In mathematical
notation we write this as the left-hand-limit
and say that ‘the limit of f (x) as x approaches 1 from the left (−) is 2’. [The
minus-sign (−) superscript on 1− signifies that the limit is left-handed.]
Similarly, as we approach x = 1 from the right (wwith x > 1), the function
f (x) approaches the value f (x) = 1. In this case we have a right-hand-limit
and say that ‘the limit of f (x) as x approaches 1 from the right (+) is 2’.
[The plus-sign (+) superscript on 1+ signifies that the limit is right-handed.]
Observe here that the left and right-hand limits of f (x) at x = 1 are not
equal; this is because there is a ‘break’ in the function at x = 1, i.e., we are
concerned with the limiting behaviour of f (x) as x → 1 from either the left
or right, rather than the value of the function f (x) at x = 1. Indeed, this
distinction is most clear in equation (2.3): the limit of f (x) as x → 1+ is 1,
but the value of the function at x = 1 is f (1) = 2. To emphasise:
Now consider the behaviour of f (x) as x → 2, in this case we find that the
left and right-hand limits are identical, viz.
as the limit of f (x) as x → 2, where we do not use the ‘±’ notation, because
it is implied that the limit can be taken from either direction. More generally:
If the left and right-hand limits are equal (i.e., if L+ = L− ) then we say
that the limit of f (x) as x → a is L, and write
Note that when evaluating the limit of a function f (x), neither the limiting
value of x, nor the value of the limit itself need be finite (see Example 2.1).
1 1
(a) lim x2 ; (b) lim e−x ; (c) lim− ; (d) lim+ .
x→2 x→∞ x→0 x x→0 x
lim x2 = 4. (2.9)
x→2
1
(c) As x < 0 approaches 0 from the left, x
diverges negatively, i.e.,
1
lim− = −∞. (2.11)
x→0 x
1
(d) As x > 0 approaches 0 from the right, x
diverges positively, i.e.,
1
lim+ = +∞. (2.12)
x→0 x
Graphs of the functions associated with the above limits are depicted in fig-
ure 2.2. In formal analysis one cannot appeal to the graph of a function to
determine a limit, but graphs are nonetheless useful for illustrative purposes.J
4 6 4
3 2
4
2 0
2
1 -2
0 0 -4
-2 -1 0 1 2 -2 0 2 4 -4 -2 0 2 4
(a) (b) (c/d)
Figure 2.2: Graphs of the functions associated with limits in Example 2.1.
2.1.2 Continuity
I Solution: (a) With reference to our discussion at the start of 2.1.1 we have
Result 2.1 Let f (x) and g(x) be continuous functions whose limits at the
point x = a are
The standard rules for evaluating the limits of sums, products, and quo-
tients of these functions are then
sum rule: lim f (x) + g(x) = f (a) + g(a); (2.19a)
x→a
product rule: lim f (x)g(x) = f (a)g(a); (2.19b)
x→a
quotient rule: lim f (x)/g(x) = f (a)/g(a). (2.19c)
x→a
Note: If f (a) and g(a) are both zero or infinity, then f (a)/g(a) is not
well defined, and is said to be in indeterminate form; in these cases the
quotient rule fails, and other methods must be used (see §2.6).
We shall not prove these results here, but instead illustrate their application.
It may be shown that the functions f (x) = (x2 − sin x) and g(x) = xe−x are
continuous, so these results are consistent with f (π) = π 2 and g(0) = 0. J
Example 2.3 Let f (x) = (x2 − 9) and g(x) = (x − 3); evaluate the limit
I Solution: Notice here that both lim f (x) = 0 and lim g(x) = 0, meaning
x→3 x→3
that the quotient rule will not work (see Result 2.1). However, notice that
f (x) (x + 3)(x − 3)
= = (x + 3), for x 6= 3. (2.23)
g(x) (x − 3)
Often we are faced with algebraic expressions that cannot be solved analytically,
but which nevertheless yield useful and exact asymptotic limits. For example,
Bissell1 has investigated a convection system described by parameters C, CB ,
P1 , RS and Rc . The system exhibits ‘over-stability’ whenever C exceeds a
threshold CT (P1 ) (see figure 2.3), where CT is given by the implicit relation
s
2π 2 CT3 (P1 + 1)
CT2 − = . (2.25)
Rc 4Rc P12
1 2π 2
lim CT (P1 ) = √ = ∞, and lim CT (P1 ) = . (2.26)
P1 →0 2 Rc P1 P1 →∞ Rc
Thus, one can check the accuracy of any numerical solution for CT (P1 ) by
ensuring that the numerical solution approaches these limits as P → 0 and
P → ∞. A graph of numerical values for CT (P1 ) is plotted in figure 2.3
alongside dash-dotted curves for the asymptotic limits, or asymptotes.
1
See J. J. Bissell, Proceedings of the Royal Society, Series A, 471:20140845 (2015).
1
10
C T ( P 1)
C B ( P 1)
0
RS < Rc
10
C
−1
RS > Rc
10
−2
10 −2 −1 0 1 2
10 10 10 10 10
P1
Figure 2.3: Numerical values for CT (solid curve) with asymptotes (dash-dotted curves).
The basic idea behind differentiation is well illustrated by the problem of deter-
mining the instantaneous velocity v of an object moving in a straight line. Let
us suppose that the velocity varies with time t according to some function v(t),
then we can denote the position—or displacement—of the particle at time t as
s(t). Now let δt represent a short period of time; it follows that the distance
δs travelled by the particle during the time interval [t, t + δt] is
By definition, therefore, the average velocity hvi of the particle during the time
interval [t, t + δt] is
Since hvi is the average velocity of the particle during the time interval [t, t+δt],
to compute the instantaneous velocity v(t) of the particle at time t we simply
need to let the width δt of the interval vanish, i.e.,
δs s(t + δt) − s(t) ds
v(t) = lim = lim ≡ . (2.29)
δt→0 δt δt→0 δt dt
This limit, which we denote in shorthand by ds/dt, is called the first derivative
of s(t) with respect to t; evaluating ds/dt is known as differentiating s(t).
df δf f (x + δx) − f (x)
≡ lim ≡ lim , (2.30)
dx δx→0 δx δx→0 δx
provided this limit exists. [If the limit does not exist, then f (x) is not
differentiable at x.] Derivatives are also denoted using the ‘prime’ notation
df
f 0 (x) ≡ . (2.31)
dx
Furthermore, in operator notation the differential operator d/dx can be
used to write a derivative as
d df
(f ) ≡ , (2.32)
dx dx
where the left-hand-side means ‘the derivative of f with respect to x’.
df f (x + δx) − f (x)
= lim
dx δx→0 δx
k(x + δx) − kx
= lim
δx→0 δx
kδx
= lim
δx→0 δx
= lim k
δx→0
= k. (2.33)
= 2x. (2.34)
d 2
(x ) = 2x, (2.35)
dx
where the left-hand-side means ‘the derivative of x2 with respect to x’. J
sin(δx/2)
lim = 1. (2.36)
δx→0 δx/2
Use this result to find the first derivative of f (x) = sin x by first principles.
[Hint: You may wish to use the identity sin A−sin B ≡ 2 sin( A−B 2
) cos( A+B
2
).]
where we used the product rule for limits. Hence, the derivative of f (x) = sin x
is f 0 (x) = cos x.
Note: A similar argument may be used to show that
d
(cos x) = − sin x. (2.40)
dx
The derivatives of other trigonometric functions are described in §2.3. J
It will be apparent from our examples in the previous section that differentiating
functions from first principles is often a laborious task, and for this reason it is
more common to appeal to standard results, rather than Definition 2.3. Table
2.1 lists the derivatives of some commonly occurring functions, all of which
may (in principle) be determined by Definition 2.3. The results from this table
may then be used to evaluate derivatives in specific cases (see Example 2.7).
Example 2.7 Use table 2.1 to determine the first derivatives of:
df
f 0 (x) d2 f
f (x) f 0 (x) ≡ f 00 (x) ≡
dx dx2
a 0 eax aeax
1
xn nxn−1 loge (x + a)
x+a
1
sin(ax) a cos(ax) arcsin(x/a) √
a − x2
2
1
cos(ax) −a sin(ax) arccos(x/a) −√
a − x2
2
a
tan(ax) a sec2 (ax) arctan(x/a)
a 2 + x2
1
sinh(ax) a cosh(ax) arcsinh(x/a) √
a + x2
2
1
cosh(ax) a sinh(ax) arccosh(x/a) √
a − x2
2
a
tanh(ax) a sech2 (ax) arctanh(x/a)
a − x2
2
Table 2.1: Derivatives of some common functions, where a and n are taken as constants.
Now consider a second point Q further along the curve, and with coordinates
(x + δx, y + δy), where
The straight dashed line through P and Q is called a secant because it ‘cuts’
the curve. [In Latin the word secans means ‘cutting’ (thus secant), and the
word tangens means ‘touching’ (thus tangent).] The gradient of the secant is
δy f (x + δx) − f (x)
= . (2.43)
δx δx
Now, as δx is decreased, the point Q approaches the point P , such that the
secant (dashed line) gets closer-and-closer to coinciding with the tangent (solid
line). Indeed, in the limit that δx → 0, the secant and the tangent are identical.
It follows by equation (2.43), therefore, that the gradient of the tangent is
δy f (x + δx) − f (x) df
lim = lim ≡ , (2.44)
δx→0 δx δx→0 δx dx
i.e., the gradient of the curve y = f (x) at x is f 0 (x).
Figure 2.4: Tangent to f (x) at P (solid line), and a secant through P and Q (dashed).
Calculate the gradient of the function at the point P = (3, 9). Hence determine
the equation for the tangent to f (x) that passes through P .
y = mx + c, (2.46)
where m is the gradient, and c is the line’s intercept with the y-axis. By
definition, the gradient of the tangent to f (x) at the point P is the same as
the gradient of the function f (x) at P . Hence, we have
m = f 0 (3) = 6. (2.47)
The constant c is found by the fact that the tangent passes through P , where
x = 3 and y = 9. Thus, with m = 6, we require by equation (2.46) that
9 = 6 × 3 +c,
|{z} that is, c = −9. (2.48)
| {z }
y mx
24
21
18
15
12
0
-5 -4 -3 -2 -1 0 1 2 3 4 5
Figure 2.5: Tangent y = 6x − 9 to the function f (x) = x2 at the point P = (3, 9).
The ‘d’ notation in the derivative of a function refers explicitly to the idea
of a differential element, or infinitesimally small change (a vanishingly
small change). In particular, we say that df is the differential of f and dx
is the differential of x. Indeed, we can see how differentiation corresponds to
rates-of-change by noting that the infinitesimal change df in f caused by an
infinitesimal change dx is
df
df = f 0 (x)dx ≡ dx. (2.49)
dx
This notation suggests that derivatives work in similar way to fractions; how-
ever, this interpretation is not quite right because the notation df /dx is defined
as the limit of a ratio (see Definition 2.3), not as a fraction proper.
Let u(x) and v(x) be two functions, then any expression f (x) of the form
where a and b are constants, is called a linear combination of u(x) and v(x).
It may be shown that derivative of such a sum is equal to the sum of the
derivatives; that is, differentiation is a linear operation:
d du dv
[au(x) + bv(x)] = a +b . (2.51)
dx dx dx
Example 2.9 Determine the first derivative of f (x) = (3x2 − 2x + cos x).
df d 2
= 3x − 2x + cos x
dx dx
d d d
= (3x2 ) − (2x) + (cos x)
dx dx dx
= 6x − 2 − sin x, (2.52)
where we evaluated the derivatives using the standard results in table 2.1. J
For example, the function y(x) = cos(x2 ) may be written in the composite
form y = f (u) = cos(u), with u(x) = x2 . To differentiate a function of this
kind we proceed as follows. Suppose that x changes by δx, then u(x) changes
by δu, and f (u) changes by δf , where
df δf δf δu δf δu df du
≡ lim = lim = lim × lim = , (2.56)
dx δx→0 δx δx→0 δu δx δu→0 δu δx→0 δx du dx
where δu → 0 because δx → 0. This expression is known as the chain rule.
df df du
y 0 (x) = = = (cos u) × (4x3 ) = 4x3 cos(x4 ), (2.59)
dx du dx
where we used the fact that
d d 4
(sin u) = cos u and (x ) = 4x3 . (2.60)
du dx
Notice that it is essential to write the expression for f 0 (x) in terms of x only.J
Example 2.11 Determine the first derivative of v(t) = (2t + 1)3 .
df df dz
v 0 (t) = = = 3z 2 × 2 = 6(2t + 1)2 , (2.62)
dt dz dt
As ever, we have written the expression for v 0 (t) in terms of t only. J
Example 2.12 Determine the first derivative of y(x) = sin(exp(x2 )).
To handle a problem of this kind we must apply the chain rule repeatedly, i.e.,
df df dg df dg dh
y 0 (x) = = = . (2.64)
dx dg dx dg dh dx
df
y 0 (x) = = 2xeh cos(g) = 2x exp(x2 ) cos(exp(x2 )), (2.65)
dx
where we used the fact that f 0 (g) = cos(g), g 0 (h) = eh , and h0 (x) = 2x. J
Consider the function f (x) formed by the product of two functions u(x) and
v(x), that is,
f (x) = u(x)v(x). (2.66)
δf = f (x + δx) − f (x)
= u(x + δx)v(x + δx) − u(x)v(x)
= u(x + δx)v(x + δx) − u(x + δx)v(x) + u(x + δx)v(x) − u(x)v(x).
= u(x + δx) [v(x + δx) − v(x)] + v(x) [u(x + δx) − u(x)] . (2.67)
df δf
= lim
dx δx→0
δx
v(x + δx) − v(x) u(x + δx) − u(x)
= lim u(x + δx) + lim v(x)
δx→0 δx δx→0 δx
v(x + δx) − v(x) u(x + δx) − u(x)
= u(x) lim + v(x) lim
δx→0 δx δx→0 δx
dv du
= u(x) + v(x) . (2.69)
dx dx
This result is known as the product rule for differentiation.
Result 2.4 Let f (x) = u(x)v(x), then the product rule states that
df d dv du
= (uv) = u +v . (2.70)
dx dx dx dx
This rule may be used in combination with the chain rule to derive the
quotient rule
d u vu0 − uv 0
= , (2.71)
dx v v2
where u0 = du/dx and v 0 = dv/dx (see Example 2.14).
d
f 0 (x) = (2 sin x cos x)
dx
d d
= 2 cos x (sin x) + sin x (2 cos x)
dx dx
= 2 cos x cos x − 2 sin x sin x
= 2(cos2 x − sin2 x). (2.72)
Note: By the double angle formula we have sin(2x) ≡ 2 sin x cos x, and
cos(2x) ≡ (cos2 x − sin2 x). As expected, therefore, this result is equivalent
to saying that f (x) = sin(2x) with f 0 (x) = 2 cos(2x). J
Example 2.14 We now illustrate how to derive the quotient rule. Let
u(x) 1
f (x) = with v(x) = , (2.73)
v(x) w(x)
d u d du dw
f 0 (x) = = (uw) = w +u . (2.74)
dx v dx dx dx
(b) Since w = v −1 , we have that
dw d −1 1
= (v ) = −v −2 = − 2 (2.75)
dv dv v
Thus, according to the chain rule
dw dw dv 1 dv v0
= =− 2 = − 2. (2.76)
dx dv dx v dx v
(c) Putting equation (2.76) into equation (2.74) we obtain
0 d u du v0 u0 v0 vu0 − uv 0
f (x) = =w −u 2 = −u 2 = , (2.77)
dx v dx v v v v2
where we substituted for w = 1/v. This result is called the quotient rule. J
x4 − 3xy + y 3 = 2. (2.79)
d d
x4 − 3xy + y 3 =
(2) = 0. (2.80)
dx dx
Evaluating the left-hand-side of this equation we find
d d 4 d d 3
x4 − 3xy + y 3 =
(x ) − (3xy) + (y )
dx dx dx dx
d d d 3
= 4x3 − 3x (y) − y (3x) + (y )
dx dx dx
dy dy
= 4x3 − 3x − 3y + 3y 2 , (2.81)
dx dx
where we used the chain rule to write
d 3 d 3 dy dy
(y ) = (y ) = 3y 2 . (2.82)
dx dy dx dx
dy dy
4x3 − 3x − 3y + 3y 2 = 0, (2.83)
dx dx
whereupon solving for y 0 (x) we have
dy 3y − 4x3
y 0 (x) = = 2 . (2.84)
dx 3y − 3x
d dg dy
g(y) = ; (2.86)
dx dy dx
d dx dy dx dy
(x) = that is, 1= . (2.87)
dx dy dx dy dx
Result 2.5 Let y = f (x) have inverse function f −1 such that x = f −1 (y);
then the inverse function rule states that
−1
dx dy dx dy
1= , i.e., = . (2.88)
dy dx dy dx
dx
q √
x(y) = sin y, with = cos y = 1 − sin2 y = 1 − x2 (2.89)
dy
where we used the identity cos2 y + sin2 y ≡ 1. Hence, by the inverse function
rule we obtain
−1
dy dx 1 d 1
= =√ , i.e., arcsin(x) = √ . (2.90)
dx dy 1 − x2 dx 1 − x2
dx
x(y) = ey , with = ey = x (2.91)
dy
as required. J
Suppose that we have two variables y and x related implicitly through some
parametric equations, that is,
where t is the parameter (see §1.12). By the chain rule and inverse function
rule we have that
−1
dy dy dt dy dx dy/dt
= = = . (2.94)
dx dt dx dt dt dx/dt
d2 f f 0 (x + δx) − f 0 (x)
d df
2
≡ ≡ lim . (2.97)
dx dx dx δx→0 δx
d2 f
f 00 (x) ≡ . (2.98)
dx2
In operator notation the differential operator d/dx can be used to write
the second derivative as
2
d2 f
d d d d df
(f ) ≡ (f ) ≡ ≡ 2, (2.99)
dx dx dx dx dx dx
df d d 2 x
= f (x) = (x e ) = 2xex + x2 ex = (x2 + 2x)ex . (2.100)
dx dx dx
Likewise the second derivative is
d2 f
d df d 2
2
= = (x + 2x)ex = 2ex + 2xex + 2xex + x2 ex . (2.101)
dx dx dx dx
dx d2 x
ẋ(t) ≡ , and ẍ(t) ≡ . (2.102)
dt dt2
where A, ω, and φ are constants. Show that the motion of the particle satisfies
d d
ẋ(t) = (x) = [A cos(ωt + φ)] = −ωA sin(ωt + φ). (2.105)
dt dt
Likewise, the second derivative is
d dx d
ẍ(t) = = [−ωA sin(ωt + φ)] = −ω 2 A cos(ωt + φ). (2.106)
dt dt dt
Definition 2.6 Let f (n−1) (x) denote the (n−1)th order derivative of f (x),
where n ∈ Z is an integer, then the nth derivative of f (x) is defined by
d2 f
f (2) (x) ≡ = (x2 + 4x + 2)ex . (2.111)
dx2
Thus, the third derivative is
d3 f d (2) d 2
f (3) (x) ≡ 3
= f (x) = (x + 4x + 2)ex
dx dx dx
d d
= ex (x2 + 4x + 2) + (x2 + 4x + 2) ex
dx dx
= (2x + 4)e + (x + 4x + 2)e = (x + 6x + 6)ex .
x 2 x 2
(2.112)
(4) d4 f d (3) d 2
f (x) ≡ 4 = f (x) = (x + 6x + 6)ex
dx dx dx
d d
= ex (x2 + 6x + 6) + (x2 + 6x + 6) ex
dx dx
= (2x + 6)e + (x + 6x + 6)e = (x + 8x + 12)ex .
x 2 x 2
(2.113)
It may be shown that the nth derivative is f (n) (x) = (x2 + 2nx + n2 − n)ex .J
Consider the function f (x) depicted in figure 2.6. The points A, B, and C on
the curve are those for which the gradient of f (x) is zero, that is, where the
tangents (dashed lines) to the curve are horizontal. We say that these points
are stationary (unchanging) because the rate-of-change is f 0 (x) = 0.
Let us examine the curve at the point A. Immediately to the left of A the
function is increasing with increasing x, i.e., the gradient is positive f 0 (x) > 0;
likewise, immediately to the right of A the function is decreasing with increasing
x, i.e., the gradient is negative f 0 (x) < 0. Thus, the point A is a local
maximum of f (x), with the gradient changing from positive to negative.
Now consider the point B. Although the gradient of the function at B is zero,
i.e., f 0 (x) = 0, the gradient either side of B is negative, i.e., f 0 (x) < 0. We
call such a point a stationary point of inflection.
(see figure 2.7). Show that the projectile reaches a maximum height hM = 1.
0.5
0
0 0.25 0.5 0.75
3
Figure 2.7: Height of a projectile h(t) = 4 + 2t − 4t2 as a function of time t.
I Solution: There are two stages to the solution of this problem: first (i) we
identify the stationary points of h(t); and second (ii) we classify the stationary
points as to whether they are maxima or minima (or other).
(i) The function h(t) is stationary whenever h0 (t) = 0, i.e., when
dh 1
= 2 − 8t = 0, that is, t= . (2.115)
dt 4
1
(ii) Evaluating the second derivative at t = 4
we have
d2 h
= −8 < 0; (2.116)
dt2 t=1/4
3 1 2
hM = h( 14 ) = + 2( 41 ) − 4
4
=1 (2.117)
4
as required. J
Example 2.23 The perimeter p of a rectangle is given by p = 2(a + b), where
a is the width, and b the length. Express the area A = ab of the rectangle
as a function of p and b. Hence, show that if the perimeter is fixed, i.e., if
p = constant, then A maximised when b = a.
dA
A0 (b) = = 12 p − 2b = ( 21 p − b) − b = a − b = 0. (2.119)
db
Thus, A takes a stationary value when a = b. Evaluating the second derivative
of A(b) at b = a we have
d2 A
= −2 < 0. (2.120)
db2 b=a
where the ar are constant coefficients (see §1.4.3), and the summation includes
a finite number of terms. Polynomials are attractive because they can be
differentiated easily, and evaluated using multiplication only.
Now consider the infinite series
∞
X
P (x) = a0 + a1 x + a2 x2 + a3 x3 + · · · = ar x r . (2.122)
r=0
This expression looks like a polynomial with an infinite number of terms, and is
known as a power series (a series of powers of x). Power series are important
because they can be used to represent functions. Indeed, we saw in §1.10 that
the exponential function is in fact defined by a power series, i.e.,
∞
x2 x3 X xr
ex ≡ exp(x) ≡ 1 + x + + + ··· = . (2.123)
2! 3! r=0
r!
x x2
e ≈1+x+ , for |x| 1. (2.124)
2!
Here the symbol ‘≈’ means ‘approximately equal to’, while ‘for |x| 1’ means
‘the approximation works provided |x| is much less than 1’ (so x3 is small
compared to 1). For example, putting x = 0.1 into this approximation gives
0.1 (0.1)2
e ≈ 1 + 0.1 + = 1.105, (2.125)
2!
which is accurate to within 1 part in 1000 (e0.1 = 1.105170918 . . . ).
In this section we explore how to find such power series representations of
functions using what are termed Maclaurin series and Taylor series.
df
≡ f (1) (x) = a1 + 2a2 x + 3a3 x2 + 4a4 x3 + 5a5 x4 + . . . ; (2.128)
dx
hence, setting x = 0 in this equation we have
f (1) (0)
a1 = . (2.129)
1
Differentiating a second time then gives
d2 f
= f (2) (x) = 2a2 + (3 × 2)a3 x + (4 × 3)a4 x2 + (5 × 4)a5 x3 + . . . ; (2.130)
dx2
hence, setting x = 0 yields
f (2) (0)
a2 = . (2.131)
2×1
Similarly, differentiating a third time we find
d3 f
= f (3) (x) = (3 × 2)a3 + (4 × 3 × 2)a4 x + (5 × 4 × 3)a5 x2 + . . . ; (2.132)
dx3
while setting x = 0 yields
f (3) (0)
a3 = . (2.133)
3×2×1
Continuing in this pattern, it may be shown that the nth coefficient is given by
f (n) (0)
an = , (2.134)
n!
where n! = n × (n − 1) × · · · × 2 × 1 is n factorial.
x2 (2) x3
f (x) = f (0) (0) + xf (1) (0) + f (0) + f (3) (0) + . . . , (2.136)
2! 3!
where the notation f (0) (x) means ‘differentiate f (x) zero times’ such that
dn x
f (n) (x) = (e ) = ex , and thus f (n) (0) = e0 = 1. (2.140)
dxn
Hence, putting this result into equation (2.139) we obtain the Maclaurin series
∞
x x2 x3 X xr
e =1+x+ + + ··· = . (2.141)
2! 3! r=0
r!
Example 2.25 Determine the first four non-zero terms of the Maclaurin series
for f (x) = sin x. Hence, by using a quinitic approximation, estimate sin(0.1)
to within 1 part in 1,000,000,000.
x2 (2) x3
sin x = f (0) (0) + xf (1) (0) + f (0) + f (3) (0) + . . . (2.142)
2! 3!
Evaluating the first few derivative of f (x) = sin x we have
x3 x5 x7
sin x = x − + − + ... (2.144)
3! 5! 7!
Retaining the terms up to quintic order only (i.e., ‘truncate after x5 ’) we obtain
the approximation
(0.1)3 (0.1)5
sin (0.1) ≈ 0.1 − + = 0.099833416̇, (2.145)
3! 5!
which is accurate to 1 part in 1,000,000,000 (sin 0.1 = 0.0998334166468 . . . ).
Note: It may be shown that the following series expansions are valid for all x
∞
x3 x5 x7 X (−1)n
sin(x) = x − + − + ··· = x(2n+1) , (2.146a)
3! 5! 7! n=0
(2n + 1)!
∞
x2 x4 x6 X (−1)n
cos(x) = 1 − + − + ··· = x(2n) , (2.146b)
2! 4! 6! n=0
2n!
The first few terms of these expansions are depicted in figure 2.8.
2 2
1 1
cos(x)
sin(x)
0 0
-1 -1
ñ = 1 ñ = 1
-2 -2
-2 -1 0 1 2 -2 -1 0 1 2
x/π x/π
2 2
1 1
cos(x)
sin(x)
0 0
-1 -1
ñ = 2 ñ = 2
-2 -2
-2 -1 0 1 2 -2 -1 0 1 2
x/π x/π
2 2
1 1
cos(x)
sin(x)
0 0
-1 -1
ñ = 3 ñ = 3
-2 -2
-2 -1 0 1 2 -2 -1 0 1 2
x/π x/π
2 2
1 1
cos(x)
sin(x)
0 0
-1 -1
ñ = 4 ñ = 4
-2 -2
-2 -1 0 1 2 -2 -1 0 1 2
x/π x/π
2 2
1 1
cos(x)
sin(x)
0 0
-1 -1
ñ = 5 ñ = 5
-2 -2
-2 -1 0 1 2 -2 -1 0 1 2
x/π x/π
Figure 2.8: Maclaurin series (solid curves) for sine (left column, dash-dotted curves) and
cosine (right column, dash-dotted curves) according to the expansions in equations (2.146),
with ñ as the number of terms retained in the series. Notice the symmetry of the expansions.
where O(1/500) means ‘of order (size) 1/500 compared to unity’. Hence,
substituting this expression for (1 + 18 )1/3 into equation (2.148) we have
1/3
√
3 1 25 25
9=2 1+ ≈2× = (2.152)
8 24 12
√
3
as an approximation for 9 to an accuracy of 1 part in 500. J
2.5.3 Convergence
Generally speaking the Maclaurin series for a function will have an infinite
number of terms. For example, the Maclaurin series for f (x) = ex is
∞
x x 2 x3 X xr
e =1+x+ + + ··· = . (2.153)
2! 3! r=0
r!
This means that for some given value of x, the series involves adding together
an infinite series of numbers; for example, when x = 2 the series for ex is
∞
2 22 23 X 2r
e =1+2+ + + ··· = . (2.154)
2! 3! r=0
r!
r(r − 1) 2
(1 + x)r = 1 + rx + x + ..., |x| < 1, (2.155)
2!
where the condition ‘|x| < 1’ means ‘(1 + x)r is identical to the power series
expansion provided |x| < 1’. [If the series converges for all values of x, then this
is sometimes written as x ∈ R.] Crucially, a function and its series are only
identical for those values of x that satisfy the convergence condition.
Determining convergence is beyond the scope of this course; however, the
convergence conditions for the series described so far are listed below.
x x2 x3
e =1+x+ + + ... x ∈ R, (2.156a)
2! 3!
x3 x5 x 7
sin x = x − + − + ... x ∈ R, (2.156b)
3! 5! 7!
x2 x4 x6
cos x = 1 − + − + ... x ∈ R, (2.156c)
2! 4! 6!
r(r − 1) 2
(1 + x)r = 1 + rx + x + ... |x| < 1. (2.156d)
2!
1
f (x) = . (2.157)
2−x
Sketch f (x) and P3 (x), and comment on the suitability of the approximation.
1 1 x −1 1
f (x) = = 1− = (1 + α)−1 . (2.158)
2−x 2 2 2
Now, by the binomial series with r = −1 we have
1 x −1 1 x x2 x3 x4
f (x) = 1− = + + + + +... for |x| < 2. (2.161)
2 2 2 4 8 16 32
To obtain a cubic approximation P3 (x) we neglect terms of order x4 or higher:
1 x x2 x3
f (x) ≈ P3 (x) = + + + . (2.162)
2 4 8 16
We expect the approximation to hold provided |x| 2. (see figure 2.9). J
2
P3 (x)
1.5 f (x)
f (x)
0.5
0
-2 -1.5 -1 -0.5 0 0.5 1 1.5 2
x
Figure 2.9: Cubic approximation P3 (x) (dashed curve) to f (x) = 1/(2 − x) (solid curve).
(x − x0 )2 (2)
f (x) = f (x0 ) + (x − x0 )f (1) (x0 ) + f (x0 ) + . . . ; (2.163)
2!
we can therefore obtain expansions for functions that are not well defined at
x = 0, because ‘good behaviour’ is required near x0 only.
∆x2 (2)
f (x0 + ∆x) = f (x0 ) + ∆xf (1) (x0 ) + f (x0 ) + . . . , (2.164)
2!
where ∆x is the difference between x and x0 , i.e.,
∆x = (x − x0 ). (2.165)
This alternative expression makes it clear that a Maclaurin series is simply the
special case of a Taylor series with x0 = 0, and thus ∆x = x.
∆x2 (2)
f (x) = f (x0 + ∆x) = f (x0 ) + ∆xf (1) (x0 ) + f (x0 ) + . . . , (2.166)
2!
where ∆x = (x − x0 ). In summation notation this series is written as
∞
X (∆x)r
f (x) = f (x0 + ∆x) = f (r) (x0 ). (2.167)
r=0
r!
Equation (2.166) is called the Taylor series for f (x) even if the series does
not converge on f (x). A Maclaurin series is a Taylor series with x0 = 0.
2
Broadly speaking, we say that f (x) is ‘well behaved’ if we can differentiate f (x) as often as
required at x = x0 .
It may be shown that the convergence condition is −1 < |x| ≤ −1. This series
may also be obtained from the Maclaurin series for f (x) = loge (1 + x). J
Like Maclaurin series, Taylor series are particularly useful for problems involving
approximation (see figure 2.10). To illustrate this idea, suppose that we know
the value of a function at x = x0 , and wish to estimate its value at some nearby
Taylor series may be used to derive L’Hôpital’s rule for the limits of quotients
in indeterminate form (cf. §2.1.3). Very informally, the rule is as follows:
Result 2.7 Let f (x) and g(x) have limits at x = a that satisfy either:
lim f (x) = 0 and lim g(x) = 0; or lim |f (x)| = ∞ and lim |g(x)| = ∞.
x→a x→a x→a x→a
If the derivatives f 0 (x) and g 0 (x) exist, then L’Hôpital’s rule states that
Note: This rule may be used repeatedly if lim f 0 (x) = lim g 0 (x) = 0 etc.
x→a x→a
Since the limit of the top and bottom of the quotient are both zero, we may
use the formulation of l’Hôpital’s rule in Result 2.7 to obtain
d
sin x dx
(sin x) cos x
L = lim = lim d
= lim = 1, (2.176)
x→0 x x→0 (x) x→0 1
dx
where the final step is obtained by the quotient rule for limits (see §2.1.3). J
Example 2.30 Evaluate the limit L = lim xe−x .
x→∞
Since the limit of the top and bottom of the quotient are both infinite, we may
use the formulation of l’Hôpital’s rule in Result 2.7 to obtain
d
x (x) 1
L = lim = lim dx
d
= lim = lim e−x = 0. (2.178)
x→∞ ex x→∞ (ex ) x→∞ ex x→∞
dx
as required. J
Clearly we cannot use the quotient rule to write L = f (a)/g(a) because the
quantity 0/0 is not defined (see §2.1.3).
∆x2 (2) 3
f (x) f (a) + ∆xf (1) (a) + 2!
f (a) + ∆x 3!
f (3) (a) + . . .
= ∆x2 (2) ∆x3 (3)
, (2.183)
g(x) g(a) + ∆xg (1) (a) + 2!
g (a) + 3!
g (a) + . . .
where
∆x = (x − a). (2.184)
if x → a, then ∆x → 0, (2.187)
meaning that the limits x → a and ∆x → 0 are equivalent. Taking this limit
we obtain
" #
f (x) f (1) (a) + ∆x
2!
f (2)
(a) + . . .
lim = lim
x→a g(x) ∆x→0 g (1) (a) + ∆x g (2) (a) + . . .
2!
f (1) (a)
= , (2.188)
g (1) (a)
Integration
101
102 Section 3 : Integration
(a) (b)
(c) (d)
(e) (f)
Figure 3.1: (a) Area A beneath a curve y = f (x) between xa and xb (light grey). This
area may be approximated by the combined area of N rectangles (dark grey). Here we show
such approximations with: (b) n = 1; (c) n = 2; (d) n = 4; (e) n = 8; and (f) n = 16.
The approximation becomes exact in the limit that n → ∞ and ∆x = ((xb − xa )/n) → 0.
We call the function f (x) the integrand. The interval bounds xa and xb
are known as the lower and upper limits of the integration respectively.
Note: Very informally, the limiting process of turning a Riemann sum into
P R
an integral means making the replacements → , and ∆x → dx.
Returning to equation (3.5), therefore, it follows from Definition 3.1 that the
area A under a curve y = f (x) between xa and xb is given by
n
X Z xb
A = lim f (xk )∆x = f (x)dx. (3.9)
n→∞ xa
k=1
Figure 3.2 depicts a rod of length l and cross-sectional area A, lying parallel
to the x-axis. One end of the rod is at x = 0, while the other is at x = l; the
density ρ(x) of the rod varies as a function of position x. Notice that we can
divide the rod into n segments, each of width ∆x = l/n, and volume
∆V = A∆x. (3.10)
Figure 3.2: Rectangular bar of length l, cross-sectional area A, and density ρ(x).
Now, the density ρ(x) varies over the length of each segment ∆x; however,
if n is made sufficiently large, such that each segment ∆x = l/n becomes
very thin, then the density does not vary much. In this way the mass of each
segment may be approximated by
Adding up all these little masses, the total mass m of the rod is then
n
X n
X
m= ∆mk ≈ ρ(xk )A∆x. (3.12)
k=1 k=1
Notice that we have (broadly speaking) got from the sum in equation (3.13) to
P R
the integral in equation (3.14) by setting → and ∆x → dx. This gives
us an informal way for thinking about integration (see §3.1.3).
the rod in figure 3.2 is divided into lots of infinitesimal elements, each of width
dx, and infinitesimal volume element
dV = Adx. (3.15)
Then the volume element dV corresponding to the interval [x, x + dx] has
infinitesimal mass dm given by
The total mass of the rod is then given by integrating up all these elements,
that is, Z Z Z l
m= dm = ρ(x)dV = ρ(x)Adx, (3.17)
rod rod 0
ds = v(t)dt (3.19)
(where we used the fact that distance = speed × time). The total distance sT
R
travelled by the object is therefore found by integrating up (‘ umming up’) all
these infinitesimal distances, i.e.,
Z Z tb
sT = ds = v(t)dt, (3.20)
interval ta
There are at least two methods for evaluating integrals. The first method is to
begin with Definition 3.1 by treating the integration as the limit of a Riemann
sum; this procedure is known as integration by first principles (§3.2.1). The
second method is to think of integration as a kind of anti-differentiation; this
approach relies on the fundamental theorem of calculus (§3.2.2).
Example 3.1 In this problem we explore how integration by first principles can
be used to evaluate the definite integral of
f (x) = x2 (3.21)
(a) Suppose that the interval [0, b] is divided into n subintervals of equal
width; what is the width ∆x of each of these subintervals?
(b) Given that xk = xk−1 + ∆x, with x0 = 0, demonstrate that xk = k∆x.
(c) Use the fact that
n
X n
k 2 = (2n + 1)(n + 1) (3.22)
k=1
6
(d) Now, the limit that ∆x = (b/n) → 0 is equivalent to the limit that n → ∞.
By Definition 3.1, therefore, we have
Z b
I= f (x)dx ≡ lim S = lim S
0 ∆x→0 n→∞
b3 b3
1 1
= lim 2+ 1+ = . (3.27)
n→∞ 6 n n 3
where ta and tb are the upper and lower limits of the integral respectively.
Now recall from our work on differentiation that the velocity of the object at
time t is given by the derivative of the displacement s(t), that is,
ds
v(t) = = s0 (t). (3.29)
dt
It follows that the definite integral in equation (3.28) can be written as
Z tb
ds(t)
s(tb ) − s(ta ) = s0 (t)dt where s0 (t) = . (3.30)
ta dt
Result 3.1 If f (x) is the derivative of F (x), then the definite integral
of f (x) on the interval [a, b] is
Z b
dF (x)
f (x)dx = F (b) − F (a), where f (x) = . (3.32)
a dx
where a and b are the limits of the integration. The integral is said to be
definite because it has definite limits.
Now consider equation (3.31), and suppose that we choose an arbitrary value
for the lower limit, say a = α, but allow that upper limit to vary by setting
b = x, where x is a variable. In this way, equation (3.31) may be written
Z x
dF (u)
F (x) + c = f (u)du, f (u) = , with c = −F (α) (3.34)
α du
Example 3.3 Let f (x) = eax , where a is a constant. Show that f (x) is the
derivative of F (x) = eax /a, and hence write down the integral of f (x) = eax .
d eax aeax
dF
= = = eax = f (x), (3.41)
dx dx a a
Example 3.4 Let f (x) = sin(ax), where a is a constant. Show that f (x) is
the derivative of
1
F (x) = − cos(ax), (3.43)
a
and hence write down the integral of f (x) = sin(ax).
i.e., f (x) = 1/x is the derivative of F (x) = loge x. Hence, by the fundamental
theorem of calculus we have
Z Z
1
f (x)dx = F (x) + c, that is, dx = loge (x + a) + c, (3.48)
x
1 1
cosh(ax) sinh(ax) + c √ arcsinh(x/a) + c
a a + x2
2
1 1
sinh(ax) cosh(ax) + c √ arccosh(x/a) + c
a a − x2
2
1 a
sech2 (ax) tanh(ax) + c arctanh(x/a) + c
a a2 − x2
Z π/4
1
Example 3.7 Show that cos(2x)dx = .
0 2
as required. J
Z 8
1
Example 3.8 Evaluate the definite integral I = dx.
2 x
In addition to the standard integrals listed in table 3.1, we also often need to
make use of the following rules.
Result 3.2 Integration is a linear operation, that is, given two functions
f (x) and g(x), and two constants a and b, it may be shown that
Z Z Z
af (x) + bg(x) dx = a f (x)dx + b g(x)dx. (3.52)
For three constants a, b and c satisfying a < b < c, it may be shown that
Z c Z b Z c
f (x)dx = f (x)dx + f (x)dx. (3.54)
a a b
These rules all follow from the results listed in Theorem 3.1.
Z 2
Example 3.9 Demonstrate that (4x3 − 8x + 1)dx = 2.
0
as required. J
Z
6
Example 3.10 Show that I = du = 3 arctan(u/2) + constant.
4 + u2
This approach is formalised using the chain rule for differentiation; however,
in practice we learn what types of substitutions are effective by experience.
Z
Example 3.11 Evaluate the indefinite integral I = 7(x + 1)6 dx.
du 1
cos(x2 ) = cos(u), and = 2x, i.e., dx = du (3.61)
dx 2x
Thence, putting these expressions into our integral, we have
Z Z Z
2 du
I = 4x cos(x )dx = 4x cos(u) = 2 cos(u)du, (3.62)
2x
where we re-substituted u = x2 . Z J
Example 3.14 Evaluate the integral I = 30x2 (x3 + 8)4 dx.
du 1
(x3 + 8)4 = u4 , and = 3x2 , i.e., dx = du (3.64)
dx 3x2
Thence, putting these expressions into our integral, we have
Z Z Z
2 3 4 2 4 1
30x (x + 8) dx = 30x u du = 10u4 du = 2u5 + c. (3.65)
3x2
I Solution: Notice that we may use the identity 1 ≡ (sin2 θ + cos2 θ) to write
Z Z Z
I = cos θdθ = cos θ × cos θdθ = (1 − sin2 θ) cos θdθ.
3 2
(3.68)
du
(1 − sin2 θ) = (1 − u2 ), and du = dθ = cos θdθ, (3.69)
dθ
Thence, putting these expressions into our integral, we have
Z Z
1
I = (1 − sin θ) cos θdθ = (1 − u2 )du = u − u3 + c.
2
(3.70)
3
1
I= sin(6φ) + φ + c, (3.72)
6
where c is an arbitrary constant (see table 3.1). J
When computing an indefinite integral, therefore, one can always check one’s
work by ensuring that F 0 (x) = f (x).
Example 3.17 In Example 3.15 we found that
Z
1
I(θ) = cos3 θdθ = sin θ − sin3 θ + c, (3.74)
3
where we differentiated 1
3
sin3 by the chain rule. J
u0 (x)
Z Z Z
1 du 1
dx ≡ dx = du = loge u(x) + c, (3.76)
u(x) u dx u
where the final step follows by inspection (see table 3.1). This method of
integration is sometimes called logarithmic integration.
u0 (x)
Z
dx = loge u(x) + c, (3.77)
u(x)
u0 (θ)
Z Z Z
1 du 1
I=− dθ = − dθ = − du = − loge |u| + c. (3.79)
u(θ) u dθ u
where we used the rule for logarithms that − loge a = loge a−1 = log(1/a). J
This final form for I may be evaluated using the method in Example 3.10, i.e,
Z
6
I= du = 3 arctan(u/2) + c. (3.84)
4 + u2
P (t) t−7
φ(t) = = 2 (3.85)
Q(t) t − 2t − 3
Example 3.20 Use the method of partial fractions to express the function
t−7
φ(t) = (3.86)
t2 − 2t − 3
R
as the sum of two fractions. Hence evaluate the integral I = φ(t)dt.
t−7 t−7
φ(t) = = ; (3.87)
t2 − 2t − 3 (t + 1)(t − 3)
this expression suggests that we look for partial fractions of the form (see §1.5)
t−7 A B
= + , (3.88)
(t + 1)(t − 3) (t + 1) (t − 3)
t−7 2 1
φ(t) = = − (3.90)
t2 − 2t − 3 (t + 1) (t − 3)
t−7
Z Z Z Z
2 1
I = φ(t)dt = 2
dt = dt − dt
t − 2t − 3 (t + 1) (t − 3)
= 2 loge |t + 1| − loge |t − 3| + c, (3.91)
Example 3.21 Figure 3.4 depicts a right angled triangle with an adjacent side
of length 1, and an opposite side of length t. If we denote the angle of such a
triangle as θ/2, then
t
t = = tan(θ/2). (3.92)
1
This expression can be used as a substitution to evaluate integrals.
(a) With reference to figure 3.4, use a geometric argument to show that
1 t
cos(θ/2) = √ , and sin(θ/2) = √ . (3.93)
1 + t2 1 + t2
demonstrate that
1 − t2 2t 2t
cos θ = , sin θ = , tan θ = . (3.95)
1 + t2 1 + t2 1 − t2
Figure 3.4: Triangle with adjacent side 1, and opposite side t, subtending an angle θ/2.
I Solution: (a) With reference to figure 3.4, if the adjacent side is length 1,
and the opposite side is length t, then by Pythogoras’s theorem the hypotenuse
√ √
has length 12 + t2 = 1 + t2 . It then follows by definition that
1 t
cos(θ/2) = √ , and sin(θ/2) = √ . (3.97)
1 + t2 1 + t2
(b) Using the double angle formulae, we have by equation (3.97) that
1 2 1 + t2 1 − t2
cos θ ≡ 2 cos2 (θ/2)−1 = 2× −1 = − = , (3.98)
1 + t2 1 + t2 1 + t2 1 + t2
and similarly
t 1 2t
sin θ ≡ 2 sin(θ/2) cos(θ/2) = 2 × √ ×√ = . (3.99)
1+t2 1+t2 1 + t2
sin θ 2t 1 + t2 2t
tan θ = = 2
× 2
= . (3.100)
cos θ 1+t 1−t 1 − t2
Equations (3.98), (3.99), and (3.98) are the required forms of equation (3.95).
(c) With t = tan(θ/2) we have by the chain-rule that
dt d d 1
= tan(θ/2) = sec2 (θ/2) (θ/2) = sec2 (θ/2). (3.101)
dθ dθ dθ 2
Appealing to the hint sec2 φ ≡ (tan2 φ + 1) this equation may be written
dt 1 1 dθ 2
= (tan2 (θ/2) + 1) = (t2 + 1), that is, = 2 . (3.102)
dθ 2 2 dt t +1
It therefore follows that
dθ 2
dθ = dt = dt (3.103)
dt 1 + t2
as required. J
2
t = tan(θ/2), with dθ = dt, (3.104)
1 + t2
and may be used to express trigonometric functions of θ in the form of
rational functions of t. In particular, the substitution yields
2t 1 − t2 2t
sin θ = , cos θ = , tan θ = , (3.105)
1 + t2 1 + t2 1 − t2
Z
3
Example 3.22 Let I = dθ, use the tangent half-angle
sin θ + 2 cos θ + 3
substitution (Result 3.4), and the solution from Example 3.19 to show that
as required. J
Suppose that f (x) is the product of two functions u(x) and v(x), that is,
df d dv du
= (uv) = u +v . (3.111)
dx dx dx dx
Integrating this expression, therefore we have
Z Z Z
df dv du
f (x) = dx = u dx + v dx. (3.112)
dx dx dx
where we used the fact that f (x) = u(x)v(x). This equation forms the basis
for the method of integration by parts.
Result 3.5 If an integrand can be written in the form u(x)v 0 (x), then it
may be shown that
Z Z
dv du
u dx = uv − v dx. (3.114)
dx dx
I Solution: To use the formula for integration by parts we must write the
dv
integrand 2x cos x in the form u dx . One possibility is to set
dv
u = 2x and = cos x; (3.116)
dx
du
=2 and v = sin x. (3.117)
dx
Hence, appealing to equation (3.115), we have
Z Z Z
dv du
I = 2x cos xdx = u dx = uv − v dx
dx dx
Z
= 2x sin x − 2 sin xdx
Example 3.25 Use the formula for integration by parts to show that
Z Z
˜ 2 2
I = x sin xdx = −x cos x + 2x cos xdx. (3.121)
dv
I Solution: Suppose that we choose to write x2 sin xdx as u dx by setting
dv
u = x2 and = sin x; (3.122)
dx
du
= 2x and v = − cos x, (3.123)
dx
as the area bounded by the curve y = f (x) and the x-axis on [a, b]. As
illustrated by the odd and even functions depicted in figure 3.5, portions of the
curve above the x-axis have y = f (x) > 0, and yield a positive integral; for this
reason we say that the area above the x-axis is positive (‘+’). Similarly,
portions of the curve below the x-axis have y = f (x) < 0, and yield a negative
integral; in this case we say that the area below the x-axis is negative (‘−’).
(a) (b)
Figure 3.5: An even function (a), and an odd function (b). The area bounded by the
curves (shaded) is positive (‘+’) above the x axis, and negative (‘−’) below the x axis.
is the area bounded by the curve y = f (x) and the x-axis on the interval
[a, b]. Above the x-axis the area is positive, whereas below the x-axis the
area is negative; in this way it may be shown that
Z +a Z +a
for f (x) even about x = 0: f (x)dx = 2 f (x)dx. (3.131)
−a 0
Z +a
for f (x) odd about x = 0: f (x)dx = 0. (3.132)
−a
Example 3.26 Show that area A bounded by the curve y = 8x3 − 6x2 and
the x-axis on the interval x ∈ [−1, 2] is A = 12.
= 24 − 4 = 12, (3.133)
as required. J
Z +9
2
Example 3.27 Evaluate the integral I = e−x sin(5x)dx.
−9
I Solution: Although this integral does not look easy to evaluate, notice that
2
e−x is an even function, and sin(5x) is an odd function. We thus have
Z +9
I= (even function) × (odd function)dx
−9
Z +9
= (odd function)dx = 0, (3.134)
−9
where we appealed to the symmetry result in equation (3.132), and used the
fact that (even func.) × (odd func.) = (odd func.), as in Definition §1.6. J
When one (or both) of the limits on an integral is infinite the integral is called
an improper integral. For example, the integral
Z ∞
I= f (x)dx, (3.135)
a
is improper because f (x) diverges at c = 0 ∈ [0, 4]. In these cases the integral
must be evaluated by excluding the unbounded point c ∈ [a, b], i.e., by writing
Z b Z c−ε Z b
f (x)dx = lim f (x)dx + lim f (x)dx, (3.141)
a ε→0 a ε→0 c+ε
hf i = f (c). (3.144)
This result is known as the mean value theorem for definite integrals.
Figure 3.6: Mean value of a function hf i = f (c) as the height of a rectangle with base
length (b − a) which has the same area as that bounded by the curve f (x) on x ∈ [a, b].
f (c) = 51 c2 + 2c = 48
5
= hf i, that is, c2 + 10c − 48 = 0. (3.147)
√
This is a quadratic equation with solutions c = (± 73 − 5). Thus, selecting
√
the value that lies in the interval [2, 5], we have c = ( 73 − 5) ≈ 3.54. J
Complex Numbers
Learning outcomes:
133
134 Section 4 : Complex Numbers
this equation does not have any solutions given by real numbers because z 2 > 0
for all z ∈ R. To circumvent this problem we introduce a mathematical object
j known as the imaginary unit, which is defined such that
j 2 = −1. (4.2)
The imaginary unit j behaves like any other algebraic quantity—it can be
added, subtracted, multiplied, etc.—we just have to remember that j 2 = −1.
In this way equation (4.1) can now be written in the form
The equation therefore has two solutions, namely z = −j and z = +j, just
like any other quadratic equation.
Definition 4.1 The imaginary unit is denoted i, and defined such that
i2 = −1. (4.4)
Engineers often denote the imaginary unit using the letter j, that is,
j 2 = −1, (4.5)
z 2 = −4 = j 2 22 = (2j)2 . (4.6)
z 2 − 2z + 2 = 0. (4.7)
since j 2 = −1. Thus, taking the square-root, the two solutions for z are
z = 1 ± j. (4.9)
Example 4.2 Determine the real and imaginary parts of the complex numbers:
√ √
(a) z1 = 4 + 52 j; (b) z2 = −2 2 + 2 2j; (c) z3 = −π − 32 j; (d) z4 = −2j.
I Solution: According to Definition 4.2, the real and imaginary parts are:
The fact that each complex number describes a unique point (x, y) in the
complex plane reflects the fact that two complex numbers are equal if and only
if they have identical real (x) and imaginary (y) parts. This property is known
as the law of equality, and is considered in more detail in §4.2.4.
Figure 4.1: Argand diagram of the complex numbers defined in Example 4.2.
4.2.3 Multiplication
z1 z2 = (x1 + y1 j)(x2 + y2 j)
= x1 x2 + x1 y2 j + x2 y1 j + y1 y2 j 2
= x1 x2 + (x1 y2 + x2 y1 )j − y1 y2
= (x1 x2 − y1 y2 ) + (x1 y2 + x2 y1 )j. (4.13)
We now show that two complex numbers are equal if and only if they have
identical real and imaginary parts. To this end let z = x + yj and w = u + vj
be complex numbers with x, y, u, v ∈ R, and suppose that z = w; then
x1 + y 1 j = x2 + y 2 j ⇔ x1 = x 2 and y1 = y2 . (4.16)
z = x − yj. (4.20)
Example 4.7 Write down the complex conjugates of the complex numbers:
√ √
(a) z1 = 4 + 52 j; (b) z2 = −2 2 + 2 2j; (c) z3 = −π − 32 j; (d) z4 = −2j.
Example 4.9 Let z be the complex conjugate of z = x + yj. Show that: (i)
z = z if and only if Im{z} = 0; and (ii) z = −z if and only if Re{z} = 0.
By equating real and imaginary parts we require y = −y, but this is true if and
only if y = Im{z} = 0. Likewise for (ii) we have
z = −z ⇔ x + yj = −x + yj. (4.23)
z1 + z2 = (x1 + x2 ) + (y1 + y2 )j
= (x1 + x2 ) − (y1 + y2 )j
= (x1 − y1 j) + (x2 − y2 j)
= z1 + z2. (4.24)
z1 z2 = (x1 x2 − y1 y2 ) + (x1 y2 + x2 y1 )j
= (x1 x2 − y1 y2 ) − (x1 y2 + x2 y1 )j
= x1 (x2 − y2 j) − y1 y2 − x2 y1 j
= x1 (x2 − y2 j) + y1 y2 j 2 − x2 y1 j
= x1 (x2 − y2 j) − y1 j(x2 − y2 j)
= (x1 − y1 j)(x2 − y2 j)
= z1z2. (4.25)
4.2.6 Modulus
Notice that the product zz is a real number given by the sum of the squares
of the real and imaginary parts of z, i.e.,
Example 4.12 Determine the absolute value of the following complex num-
√
bers: (a) z1 = 1 − j ; (b) z2 = − 3 + j; and (c) z3 = 5j.
p
I Solution: We compute |z| = x2 + y 2 for each z = x + yj, thus
p √
(a) |z1 | = 12 + (−1)2 = 2.
q √ √ √
(b) |z2 | = (− 3)2 + 12 = 1 + 3 = 4 = 2.
√
(c) |z3 | = 52 = 5.
√
We consider z1 = 1 − j and z2 = − 3 + j again in Example 4.19. J
Example 4.13 Let z = x + yj; show that |z| ≥ Re{z} and |z| ≥ Im{z}.
z1 x1 + y 1 j
= for z2 6= 0 (4.30)
z2 x2 + y 2 j
Rather than memorise this result, it is easier to follow the procedure in context.
Example 4.14 Express the following quotients in Cartesian form:
1 3 + 4j
(a) z1 = ; and (b) z2 = . (4.33)
1 + 2j 1−j
z1 x1 + y 1 j
= for z2 6= 0. (4.37)
z2 x2 + y 2 j
Example 4.16 Use the result |z| ≥ Re{z} (see Example 4.13) to show that
|z1 | + |z2 | ≥ |z1 + z2 | for any two complex numbers z1 , z2 ∈ C.
I Solution: The inequality |z1 | + |z2 | ≥ |z1 + z2 | is clearly true in the case
that |z1 + z2 | = 0. Otherwise, if |z1 + z2 | =
6 0, then we have
z 2 = −3 + 4j. (4.40)
2
x = ±1 and y = = ±2. (4.45)
x
Hence, the square roots of −3 + 4j are
When z is expressed in this way we say that it has been written in polar form.
Observe that adding integer multiples of 2π to θ does not change the value of
either sin θ or cos θ. Thus, we may also write z in polar form as
z = r cos(θ + 2πk) + j sin(θ + 2πk) , where k∈Z (4.50)
Figure 4.4: Polar form z = r(cos θ + j sin θ) of a complex number, with r = |z|.
6 0).
Definition 4.5 Let z = x + yj be a non-zero complex number (|z| =
Then an argument arg(z) of z is an angle θ ∈ R that satisfies
√
Example 4.19 Consider the complex numbers z1 = 1 − j and z2 = − 3 + j.
(d) Determine the principal arguments Arg (z1 ) and Arg (z2 ) of z1 and z2
√
Figure 4.5: Argand diagrams of the complex numbers z1 = 1 − j and z2 = − 3 + j.
(c) According to Definition 4.5, and our answers to parts (a) and (b), the
polar forms of z1 and z2 are
√
2 cos(− π4 ) + j sin(− π4 ) , z2 = 2 cos( 5π 5π
z1 = 6
) + j sin( 6
) . (4.58)
φ2 φ3 φ4 φ5
eφ = 1 + φ + + + + + ... (4.61a)
2! 3! 4! 5!
φ2 φ4 φ6 φ8 φ10
cos(φ) = 1 − + − + − + ... (4.61b)
2! 4! 6! 8! 10!
φ3 φ5 φ7 φ9 φ11
sin(φ) = φ − + − + − + ... (4.61c)
3! 5! 7! 9! 11!
These series are known to be valid for any real numbers φ ∈ R.
Let us suppose that the expansion for the exponential function eφ works for
imaginary numbers, i.e., for φ = jθ, where θ ∈ R is a real number, then
equation (4.61a) gives
j 2 θ2 j 3 θ3 j 4 θ4 j 5 θ5
ejθ = 1 + jθ + + + + + ... (4.62)
2! 3! 4! 5!
Hence, because j 2 = −1, by collecting real and imaginary parts we may write
θ2 θ4 θ3 θ5
jθ
e = 1− + + . . . +j θ − + + ... , (4.63)
2! 4! 3! 5!
| {z } | {z }
cos θ sin θ
where we observed that the bracketed terms on the right-hand-side are the
series expansions for sin θ and cos θ listed in equations (4.61b) and (4.61c).
Equation (4.63) suggests that if θ ∈ R is a real number, then one way to
interpret the meaning of the function ejθ is as
where we used Euler’s formula ejy = (cos y +j sin y). Equation (4.66) suggests
the following definition for the complex exponential function.
where r = |z| and θ = arg(z) are the modulus and arguments of z. Thus,
using Euler’s formula to write (cos θ + j sin θ) = ejθ we have
z = rejθ ; (4.71)
z = rejθ . (4.72)
√
Example 4.21 Express the complex numbers z1 = 1 − j and z2 = − 3 + j
in exponential form.
√
I Solution: In Example 4.19 we found that |z1 | = 2 and |z2 | = 2. We also
found that Arg (z1 ) = − π4 and Arg (z2 ) = 5π
6
. Thus, by Definition 4.8, z1 and
z2 may be written in exponential form as
√
z1 = 2e−(π/4)j and z2 = 2e(5π/6)j . (4.73)
We have used the principal arguments, but any valid arguments will suffice. J
where r = |z| and θ = Arg (z), and ρ = |w| and φ = Arg (w) (see figure 4.6).
(a) (b)
z rejθ
= jφ = (r/ρ)e(θ−φ)j . (4.76)
w ρe
Here the modulus and argument are |z/w| = (r/ρ) and arg(wz) = θ − φ. In
this case, therefore, the geometric effect of dividing z = rejθ by w is to scale
by the factor 1/ρ = 1/|w|, and to rotate by the angle −φ = −Arg (w).
wk = e(2πk/3)j . (4.84)
The cubed roots given in equations (4.83) are sketched in figure 4.7; observe
that they all lie on a circle of radius |wk | = 1 (the unit circle), and are dis-
tributed evenly by the angle 2π/3. The reason for this even distribution follows
from the relationship between two consecutive roots wk and wk+1 , that is,
Thus, each root wk+1 is obtained by rotating the previous root wk through
by an angle of 2π/3. After three such rotations, therefore, we return to our
original root, i.e.,
wk+3 = wk e(2π)j = wk . (4.86)
It follows, therefore, that only three of the cubed roots are distinct. In
particular, taking k equal to any value other than k = 0, 1, 2 will simply repeat
the roots already listed in equation (4.83). This aspect is an instance of the
more general rule that a complex number has only n distinct nth roots.
is the nth root of z. Now k ∈ Z can take any integer value, so the numbers
etc., are all nth roots of z, where we have written the root w corresponding
to a particular value of k as
Although it would seem that z has an indefinite number of nth roots, cor-
responding to the indefinite number of integers k, not all of these roots are
distinct. To see this, observe that
i.e., the root given by wn+k is identical to the root given by wk . Thus,
listing the roots wk in sequence, we find that only n of them are distinct, viz.
Other roots are simply repetitions of the roots listed above, for example,
wn+2 = w2 , wn+1 = w1 , wn = w0 , and w−2 = wn−2 , w−1 = wn−1 . In
summary, therefore, a complex number has only n distinct nth roots .
z = rejθ . (4.95)
where
wk = r1/n ej(θ+2πk)/n , with k ∈ Z. (4.97)
These roots are distributed evenly about a circle of radius |wk | = r1/n ,
where the regular angular separation between each of the roots is 2π/n.
I Solution: Since eπj = −1 we have that z = −32 = 32eπj . Thus, the general
exponential form for z is
w0 = 2e(π/5)j , (4.100a)
w1 = 2e(3π/5)j , (4.100b)
w2 = 2e(5π/5)j = −2, (4.100c)
w3 = 2e(7π/5)j , (4.100d)
w4 = 2e(9π/5)j . (4.100e)
(there is no need to convert all these numbers into Cartesian form). These
roots are sketched on an Argand diagram in figure 4.8. Notice that the roots
are distributed evenly about the circle with radius |wk | = 2, where the regular
angular separation between each of the roots is 2π/5. J
w5 + 32 = 0. (4.101)
P (z) = az 2 + bz + c (4.102)
P (z) = az 2 + bz + c = 0, (4.103)
that is, the roots of P (z) are solutions to a general quadratic equation. By
completing the square, therefore, we find that the second-order polynomial has
two roots, z = z+ and z = z− given by
−b ± (b2 − 4ac)1/2
z = z± = , (4.104)
2a
where evaluating these roots may require computing the square-roots of a
complex number (b2 − 4ac)1/2 . More generally, any nth order polynomial has
n roots, a property known as the fundamental theorem of algebra.
The quadratic equation is special because its general solution can be written
down in a concise form. Higher order polynomials are usually more difficult
to solve unless either: (i) factorisation is straightforward; or (ii) there exists a
simple reduction to a quadratic. In this section we illustrate how to determine
the n solutions to a polynomial of order n in these simple cases.
Example 4.24 Solve the quadratic equation w2 + 4w + 7 − 4j = 0.
(w + 2)2 + 3 − 4j = 0, (4.105)
w = w± = −2 ± z = −2 ± (1 + 2j). (4.107)
w+ = −1 + 2j or w− = −3 − 2j. (4.108)
(w − w+ )(w − w− ) = 0 (4.109)
z 2 = −3 ± 4j. (4.111)
Observe here that our quartic equation has four roots (as expected) in the form
of two complex conjugate pairs.
−b ± (b2 − 4ac)1/2
z = z± = . (4.114)
2a
Now if 4ac > b2 , then (4ac − b2 ) is a positive real number. Let us signify this
property by defining
√
β= 4ac − b2 , where β∈R (4.115)
−b ± (β 2 j 2 )1/2 b β
z = z± = = − ± j. (4.117)
2a 2a 2a
Hence the roots are complex conjugate pairs with z− = z + . J
Result 4.3 Let P (z) be polynomial of order n with real coefficients, i.e.,
n
X
P (z) = ar z r = an z 2 + an−1 z n−1 + · · · + a1 z + a0 , (4.118)
r=0
n
X n
X n
X n
X
r
P (w) = ar w = ar wr = ar wr = ar wr = P (w) = 0, (4.119)
r=0 r=0 r=0 r=0
Euler’s formula (see Definition 4.6) states that if θ ∈ R is a real number, then
It follows from these equations that the real sine and cosine functions can
be written
1 +jθ 1 +jθ
e + e−jθ e − e−jθ .
cos θ = and sin θ = (4.123)
2 2j
These expressions motivate the following general definitions for the complex
sine and cosine functions.
Definition 4.10 For all z ∈ C the sine and cosine functions are defined
1 +jz 1 +jz
e + e−jz e − e−jz . (4.124)
cos z = and sin z =
2 2j
If Im{z} = y = 0, then these expressions reduce to the real sine and cosine
functions of equation (4.123).
Other complex trigonometric functions follow from this definition, for example:
sin z
tan z ≡ , (4.125)
cos z
where the complex sine and cosine functions are given in Definition 4.10.
I Solution: By the definitions of the complex sine and cosine functions (Defi-
nition 4.10) we have
1 +j 2 2
1 −1 j
e − e−j = e − e1 = 2 e−1 − e1
sin(j) =
2j 2j 2j
j 1
= − e−1 − e = e − e−1 j.
(4.126)
2 2
Similarly,
1 +(π+j)j 1 πj j 2 2
cos(π + j) = e + e−(π+j)j = e e + e−πj e−j
2 2
1 πj −1 1
e e + e−πj e = − e + e−1 ,
= (4.127)
2 2
where we used the fact that eπj = e−πj = −1. J
According to Euler’s formula ejθ = (cos θ + j sin θ), and the laws for indices,
if θ ∈ R is a real number, and n ∈ Z is an integer, then
n
(cos θ + j sin θ)n = ejθ = ejnθ = (cos(nθ) + j sin(nθ)) . (4.128)
where we divided the top and bottom of the penultimate fraction by cos2 θ.J
4.7.3 Logarithms
For real numbers x, y ∈ R the natural logarithm (loge ) is defined as the inverse
function of y = ex , that is, x = loge y. Here we seek a comparable definition
for the complex logarithm (log) for complex numbers z, w ∈ C; in particular if
z = ew , then we seek a complex logarithm function such that w = log z.
To this end let us suppose that w = u + vj, where u, v ∈ R are real numbers,
in which case our complex logarithm function must satisfy
where loge |z| is the real natural logarithm of |z|, and k ∈ Z. Since any
integer k ∈ Z may be used, this logarithm is not unique. We thus define
π 1
Log z1 = Log (1 − j) = loge (21/2 ) − j = (2 loge 2 − πj) (4.138)
4 4
as the principal logarithm of z1 . J
4.7.4 Powers
z α = eαLog z . (4.140)
This definition is consistent with our discussion of integer powers and roots.
√
I Solution: In Example 4.30 we found that Log (1 − j) = loge 2 − π4 j. Thus,
by definition 4.13 we have
√ π π
√
(1 − j)j = ej(loge 2− 4 j) = e 4 ej(loge 2) (4.141)
π
√ √
= e 4 cos(loge 2) + j sin(loge 2) , (4.142)
dI Q(t) dQ
VL (t) = −L and VC (t) = , where I(t) = (4.143)
dt C dt
is the current in the circuit, and Q(t) is the charge on the capacitor at time t.
Since these voltages must be equal, we have
dI Q(t) dI 1
−L = , that is + Q(t) = 0. (4.144)
dt C dt LC
Thus, differentiating this equation with respect to time we obtain
d2 I(t) 1
+ ω 2 I(t) = 0, where ω2 = , (4.145)
dt2 LC
and we used the fact that the current is I(t) = dQ/dt.
where A and φ are constants known as the amplitude (A) and phase (φ).
The solutions therefore oscillate sinusoidally in time. In particular, notice that
2π
I(t + T ) = I(t) where T = ; (4.147)
ω
i.e., the system oscillates with time period T , and frequency ω = 2π/T .
The reason that complex numbers are useful for describing such oscillatory
solutions may be understood as follows. Observe that Euler’s formula allows
us to write I(t) as
that is,
I(t) = Re{IA ejωt }, where IA = Aejφ (4.149)
is a complex number. In this way information about the amplitude and phase
is contained within IA = Aejφ , while ejωt describes the frequency. Thus, the
complex form allows us to make a clear and compact distinction between the
constant part of the solution (IA = Aejφ ), and the time dependent part (ejωt ).
dI(t) d2 I(t)
= jωI(t) and = j 2 ω 2 I(t) = −ω 2 I(t) (4.150)
dt dt2
respectively. Thus,
d2 I(t)
2
+ ω 2 I(t) = −ω 2 I(t) + ω 2 I(t) = 0 (4.151)
dt
as required by equation (4.145). J