Chap14 Partial Diffn
Chap14 Partial Diffn
14 Partial Derivatives 2
14.1 Functions of Several Variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
14.1.1 Functions of one real variable . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
14.1.2 Functions of two real variables . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
14.1.3 Neighbourhoods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
14.1.4 Interior and boundary points in XY-plane . . . . . . . . . . . . . . . . . . . 4
14.1.5 Level Curves, and Contours of Functions of Two Variables . . . . . . . . . 5
14.2 Interior and boundary points in XYZ-space . . . . . . . . . . . . . . . . . . . . . . 5
14.3 Limits and Continuity in Higher Dimensions . . . . . . . . . . . . . . . . . . . . . . 6
14.3.1 Limit of a function of one real variable . . . . . . . . . . . . . . . . . . . . . 6
14.3.2 Limit of a function of two real variables . . . . . . . . . . . . . . . . . . . . . 8
14.4 Continuity of a function of one real variable . . . . . . . . . . . . . . . . . . . . . . 11
14.4.1 Continuity of a function of two real variables . . . . . . . . . . . . . . . . . 12
14.5 Partial Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
14.5.1 Differentiability of a function of one real variable . . . . . . . . . . . . . . . 12
14.5.2 Differentiability of a function of two real variables . . . . . . . . . . . . . . 14
14.6 The Chain Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
14.6.1 Implicit Differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
14.6.2 Higher Order Partial Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . 19
14.7 Directional Derivatives and Gradient Vectors . . . . . . . . . . . . . . . . . . . . . 21
14.8 Tangent Planes and Differentials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
14.8.1 Estimating the change in f in a given direction . . . . . . . . . . . . . . . . 23
14.8.2 Taylor’s formula and linearization of f (x) . . . . . . . . . . . . . . . . . . . . 23
14.8.3 Taylor’s formula and linearization of f (x, y) . . . . . . . . . . . . . . . . . . . 24
14.8.4 Differentials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
14.9 Extreme Values and Saddle Points . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
14.9.1 Maxima and minima of a function of one variable . . . . . . . . . . . . . . 26
14.9.2 First derivative test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
14.9.3 Second derivative test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
14.10Maxima and minima of a function of two variables . . . . . . . . . . . . . . . . . . 28
14.10.1 Test for maxima and minima of f (x, y) . . . . . . . . . . . . . . . . . . . . . . 29
14.11Lagrange Multipliers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
14.11.1 Constrained Maxima and Minima . . . . . . . . . . . . . . . . . . . . . . . . . 30
14.11.2 Method of Lagrange Multipliers . . . . . . . . . . . . . . . . . . . . . . . . . . 31
14.11.3 Lagrange Multipliers with Two Constraints . . . . . . . . . . . . . . . . . . 32
14.12Some useful tips . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
Note: Comments/suggestions on these lecture notes are welcome on the e-mail: [email protected] to
Dr. Suresh Kumar.
1
Chapter 14
Partial Derivatives
geometrically in XY-plane in the left panel of Figure 1. So domain of f in the interval [−1, 1] on the
X-axis, and geometrically its graph is the semicircle.
Z
1.0
0.5
Y
-1
1.0
-1.0 - 0.5
0.0 0 Y
0.8 0.0
0.5
1.0
0.6
1
0.4
X
0.2
√ p
Figure 14.1: Left panel: f (x) = 1 − x2 . Right panel: f (x, y) = 1 − x2 − y 2
2
14.1.2 Functions of two real variables
Let A, B and C be subsets of R, the set of real numbers. Then a rule f that assigns to each member
(x, y) ∈ A × B a unique member z ∈ C, is called a function from A × B to C, written as f : A × B → C.
The element z is called image of (x, y) under f . It is also called as the value of f at (x, y) and is denoted
by f (x, y), that is, z = f (x, y) while (x, y) is called pre-image of z under f . The set A × B (the set of
all pre-images) is called domain of f (denoted by D(f )) while the set of images of all the elements of
A × B under f , that is, R(f ) = {f (x, y) : (x, y) ∈ A × B} is called range of f . We call the set C as
co-domain of f . The set G(f ) = {(x, y, f (x, y)) : (x, y) ∈ A × B} is called graph of f . Notice that the
graph of f contains the triplets (x, y, f (x, y)) of real numbers, which can be shown geometrically as points
(x, y, f (x, y)) in the three dimensional XYZ-space. p
Ex. Consider the function f : {(x, y) : x2 + y 2 ≤ p 1} → R defined by f (x, y) = 1 − x2 − y 2 . Then,
2 2 2 2
we have p D(f ) = {(x, y) : x + y ≤ 1}, R(f ) = { 1 − x − y : (x, y) ∈ D(f )} = [0, 1] and G(f ) =
2 2
{(x, y, 1 − x − y ) : (x, y) ∈ D(f )}, which is displayed geometrically in the XYZ-space in the right
panel of Figure 2. So domain of f is the region within and on the circle x2 + y 2 = 1 in the XY-plane, and
geometrically its graph is the hemispherical surface.
Remark: Likewise, we can define a function of three variables. However, functions of more than two
variables can not be represented geometrically, and therefore are not p of much interest. Consider the
2 2 2
function f : {(x, y, z) : x + y + z ≤ 1} → R defined by f (x) = 1 − x2 − y 2 − z 2 . Then, we have
p
2 2 2
D(f ) = {(x, y, z)p: x + y + z ≤ 1}, R(f ) = { 1 − x2 − y 2 − z 2 : (x, y, z) ∈ D(f )} = [0, 1] and
G(f ) = {(x, y, z, 1 − x2 − y 2 − z 2 ) : (x, y, z) ∈ D(f )}. So domain of f is the region within and on
the sphere x2 + y 2 + z 2 = 1 in the XYZ-space, and of course its graph G(f ) can not be represented
geometrically.
Table 1 shows some two and three variable functions with their respective domains and ranges.
14.1.3 Neighbourhoods
Roughly speaking, neighbourhood of a point is any open region (the region carrying only its interior
points and hence without the boundary points) containing the point.
Formally, neighbourhood of a point x0 on X-axis is any open interval containing the point x0 . For
any positive real number δ, the interval (x0 − δ, x0 + δ) is a neighbourhood of x0 . We shall denote it by
Nδ (x0 ) and call it δ-neighbourhood of x0 . In fact, the interval (x0 − δ, x0 + δ) is neighbourhood of its
every point. A neighbourhood of x0 without x0 is called deleted neighbourhood of x0 .
Neighbourhood of a point (x0 , y0 ) in the XY-plane is any open area or region containing the point
(x0 , y0 ). For any positive real number δ, the area {(x, y) : |x − x0 | < δ, |yp − y0 | < δ} is a neighbourhood
of (x0 , y0 ) and is a square neighbourhood of (x0 , y0 ). Similarly, {(x, y) : (x − x0 )2 + (y − y0 )2 < δ} is
circular neighbourhood of (x0 , y0 ). We shall denote any δ-neighbourhood of (x0 , y0 ) by Nδ (x0 , y0 ).
Likewise, neighbourhood of a point (x0 , y0 , z0 ) in the XYZ-space is any open region containing the
point (x0 , y0 , z0 ). For instance, Nδ (x0 , y0 , z0 ) = {(x, y, z) : |x−x0 | < δ, |y −y0 | < δ, |z −z0 | < δ} is cubical
3
p
δ-neighbourhood of (x0 , y0 , z0 ) while Nδ (x0 , y0 , z0 ) = {(x, y, z) : (x − x0 )2 + (y − y0 )2 + (z − z0 )2 < δ}
is spherical δ-neighbourhood of (x0 , y0 , z0 ).
Figure 14.2: Left: Interior point (x0 , y0 ) of R. Right: Boundary point (x0 , y0 ) of R.
4
Figure 14.3:
Figure 14.4:
5
points make up its boundary. A region is open if it consists entirely of interior points. A region is closed
if it contains all its boundary points.
Figure 14.5: Left: Interior point (x0 , y0 , z0 ) of R. Right: Boundary point (x0 , y0 , z0 ) of R.
The set of points (x, y, z) in space where a function of three independent variables has a constant value
f (x, y, z) = c is called a level surface of f . For example, the level surfaces of the function f (x, y, z) =
p
x2 + y 2 + z 2 are concentric spheres as shown in Figure 6.
p
Figure 14.6: The level surfaces of f (x, y, z) = x2 + y 2 + z 2 are concentric spheres.
6
Ex. Suppose x takes values 1.1, 1.01, 1.001, ........ Then x ̸= 1 and takes infinitely many values in
each neighbourhood of 1. So x approaches 1. Also, all the values of x are greater than 1. So x → 1+ .
Ex. Suppose x takes values 1, 1/2, 1/3, ........ Then x ̸= 0 and takes infinitely many values in each
neighbourhood of 0. So x approaches 0. Also, all the values of x are greater than 0. So x → 0+ .
Limit of f (x) as x → x0
Let f be a function defined in some neighbourhood of x0 except possibly at x0 . Then a real number
l is said to be limit of f (x), symbolically written as lim f (x) = l, if given any positive real number ϵ
x→x0
(however small), there exists δ > 0 (depending on ϵ) such that
0 < |x − x0 | < δ =⇒ |f (x) − l| < ϵ.
or x ∈ (x0 − δ, x0 + δ) − {x0 } =⇒ f (x) ∈ (l − ϵ, l + ϵ).
Thus, lim f (x) = l if corresponding to each ϵ-neighbourhood of l, there exists a deleted δ-neighbourhood
x→x0
of x0 such that the values of f (x) corresponding to the deleted δ-neighbourhood of x0 lie in the ϵ-
neighbourhood of l. Geometrically, it implies that the portion of the curve f (x) corresponding to the
deleted δ-neighbourhood of x0 lies inside the horizontal strip created by the lines f (x) = l − ϵ and
f (x) = l + ϵ parallel to X-axis.
Further, lim f (x) = l (left hand limit of f (x)) if x ∈ (x0 − δ, x0 ) =⇒ f (x) ∈ (l − ϵ, l + ϵ), and
x→x−
0
lim f (x) = l (right hand limit of f (x)) if x ∈ (x0 , x0 + δ) =⇒ f (x) ∈ (l − ϵ, l + ϵ).
x→x+
0
Obviously, lim f (x) = l if and only if lim f (x) = l = lim f (x).
x→x0 x→x− x→x+
0 0
Ex. Show that lim (2x + 1) = 3.
x→1
Sol. Let ϵ > 0 be given. Then
|2x + 1 − 3| = |2x − 2| = 2|x − 1| < ϵ provided |x − 1| < ϵ/2.
Choosing δ = ϵ/2, we have
|x − 1| < δ =⇒ |2x + 1 − 3| < ϵ.
Thus, lim (2x + 1) = 3.
x→1
For the sake of illustration, we plot f (x) = 2x + 1 in left panel of Figure 7. We choose ϵ = 0.4
so that δ = ϵ/2 = 0.2. The horizontal dotted blue lines are f (x) = l − ϵ = 3 − 0.4 = 2.6 and
f (x) = l + ϵ = 3 + 0.4 = 3.4 while the vertical red lines are x = x0 − δ = 1 − 0.2 = 0.8 and
x = x0 + δ = 1 + 0.2 = 1.2. We can see that the part of the curve in the vertical red strip com-
pletely lies inside the horizontal blue strip, as expected.
Y Y
3 3
2 2
1 1
X X
- 0.5 O 0.5 1.0 1.5 O 0.5 1.0 1.5
Figure 14.7: Left panel: f (x) = 2x + 1. Right panel: f (x) = x for x ≤ 1 and f (x) = x + 1 for x > 1.
x if x ≤ 1
Ex. Show that lim f (x), where f (x) = , does not exist.
x→1 x+1 if x > 1
7
Sol. We shall show that lim f (x) = 1 and lim f (x) = 2.
x→1− x→1+
Let ϵ > 0 be given. Then for x < 1, we have |f (x) − 1| = |x − 1| < ϵ provided x ∈ (1 − ϵ, 1).
Choosing δ = ϵ, we have
x ∈ (1 − δ, 1) =⇒ |f (x) − 1| < ϵ.
Thus, lim f (x) = 1.
x→1−
Next for x > 1, we have
|f (x) − 2| = |x + 1 − 2| = |x − 1| < ϵ provided x ∈ (1, 1 + ϵ).
Choosing δ = ϵ, we have
x ∈ (1, 1 + δ) =⇒ |f (x) − 2| < ϵ.
Thus, lim f (x) = 2.
x→1+
Since lim f (x) ̸= lim f (x), so lim f (x) does not exist.
x→1− x→1+ x→1
x if x ≤ 1
For the sake of illustration, we plot f (x) = in right panel of Figure 7. We
x+1 if x > 1
choose ϵ = 0.2 so that δ = ϵ = 0.2. The horizontal dotted blue lines are f (x) = l − ϵ = 1 − 0.2 = 0.8
and f (x) = l + ϵ = 1 + 0.2 = 1.2 while the horizontal dotted blue lines are f (x) = l − ϵ = 2 − 0.2 = 1.8
and f (x) = 2 + ϵ = 2 + 0.2 = 2.2. The vertical red lines are x = x0 − δ = 1 − 0.2 = 0.8 and
x = x0 + δ = 1 + 0.2 = 1.2. We can see that the part of the curve corresponding to x ∈ (0.8, 1) in the
vertical Red strip completely lies inside the horizontal blue strip while the part of the curve corresponding
to x ∈ (1, 1.2) in the vertical Red p
strip completely lies inside the horizontal Green strip, as expected.
Ex. Discuss the geometry of lim 1 − x2 = 1.
√ x→0
Sol. We know that 1 − x2 is semicircular curve with the domain [−1, 1] on X-axis (See left panel of
Figure 1). It is easy to see that when x → 0 either from left or right in the neighbourhood √ of 0, the
2
corresponding part of the curve converges to the point (0, 1) p on Y-axis. It implies that 1 − x tends to
1 for both the paths along which x → 0. That is why, lim 1 − x2 = 1.
x→0
Remark: If lim f (x) exists, then the graph of the function f (x), geometrically, converges or strikes at
x→x0
the same point (whose y-coordinate is the limit of f (x)) from left as well as right in the neighbourhood of
x0 . For instance, see left panel of Figure 2. If lim f (x) does not exist but lim f (x) and lim f (x) exist
x→x0 x→x− x→x+
0 0
finitely, then the graph of the function f (x), geometrically, from the left strikes at a point different from
the point where it strikes from the right in the neighbourhood of x0 . For an example, see right panel of
Figure 2.
Remark: The limit of a function at a point may exist even if the function is not defined there at. For
example, consider the function f (x) = 2x + 1, x ̸= 1, which is not defined at x = 1 as per its given
definition. But lim (2x + 1) = 3.
x→1
For, let ϵ > 0 be given. Then
|2x + 1 − 3| = |2x − 2| = 2|x − 1| < ϵ provided 0 < |x − 1| < ϵ/2.
Choosing δ = ϵ/2, we have
0 < |x − 1| < δ =⇒ |2x + 1 − 3| < ϵ.
Thus, lim (2x + 1) = 3.
x→1
8
will carry infinitely many values of (x, y). Thus, (x, y) takes values arbitrarily close to (x0 , y0 ), and it
makes sense to say that (x, y) approaches (x0 , y0 ) or (x, y) → (x0 , y0 ).
Now, see the critical difference between x → x0 and (x, y) → (x0 , y0 ). In the case, x → x0 , the
variable x can take values only on the X-axis (one dimension), and thus can approach x0 from left or
right directions along X-axis. That is why, we talk about left and right hand limits when x → x0 . On
the other hand, in the case (x, y) → (x0 , y0 ), the ordered pair (x, y) can take values in the XY-plane (two
dimensions), and thus can approach (x0 , y0 ) along infinitely many paths in the XY-plane.
Ex. Suppose (x, y) takes values (0.9, 0.9), (0.99, 0.99), (0.999, 0.999), ........ Then (x, y) ̸= (1, 1) and takes
infinitely many values in each neighbourhood of (1, 1). So (x, y) approaches (1, 1). Also, all the values of
(x, y) lie on the line y = x. So (x, y) → (1, 1) by taking values on the straight line path y = x.
Ex. Suppose (x, y) takes values (1.1, (1.1)2 ), (1.01, (1.01)2 ), (1.001, (1.001)3 ), ........ Then (x, y) ̸= (1, 1)
and takes infinitely many values in each neighbourhood of (1, 1). So (x, y) approaches (1, 1). Also, all
the values of (x, y) lie on the parabola y = x2 . So (x, y) → (1, 1) by taking values on the parabolic path
y = x2 .
9
-2
-1
Y
0 1 2 3 0.5 0 X
Z 0.0
1
6 - 0.5
Z
2
4
0
Y
-2
xy
Figure 14.8: Left panel: f (x) = 2x + 2y + 1 (Red plane). Right panel: f (x, y) = x2 +y 2
, (x, y) ̸= (0, 0).
xy
Ex. Show that lim f (x, y), where f (x, y) = x2 +y 2
, (x, y) ̸= (0, 0) does not exist.
(x,y)→(0,0)
Sol. We shall show that the given limit is path dependent.
Suppose (x, y) → (0, 0) along the straight line path y = mx. Then
xy x.mx m
lim f (x, y) = lim = lim 2 = .
(x,y)→(0,0) (x,y)→(0,0) x2 + y 2 x→0 x + m2 x2 1 + m2
So the given limit depends on m, the slope of the path line. Hence, it get different values for different
values of m, and consequently it does not exist.
Geometry: The right panel of Figure 8 shows the the surface f (x, y) = x2xy +y 2
, (x, y) ̸= (0, 0) in Green
color. We can see that the part of the surface corresponding to a neighbourhood of the point (0, 0) is not
smooth rather it looks ruptured. Any path in the neighbourhood of (0, 0) in the XY-plane corresponds
to a curve on the Green surface. When (x, y) → (0, 0) along different paths y = mx, the corresponding
curves end up or converge to points near the ruptured part at different heights, which is evident from
the figure. That is why, the limit get different values along different paths, and consequently it does not
exist.
Remark: Note that the two path approach or path dependent test is used when the limit does not exist
uniquely. In case, the limit exists uniquely, you will get same answer for the limit along every chosen
path. However, this way you can not ensure the existence of limit because you can not exhaust infinitely
many paths. So in order to prove the existence of limit, use ϵ − δ approach. Also, in some problems,
changing the variables to polar coordinate system is quite useful for applying ϵ − δ approach.
4x2 y
Ex. Show that lim = 0.
(x,y)→(0,0) x2 + y 2
Sol. Let ϵ > 0 be given. Then, we have
4x2 y 4x2 y
− 0 = .
x2 + y 2 x2 + y 2
Using x = r cos θ and y = r sin θ, we get
4x2 y 2
p
2 + y 2 < ϵ for
p
− 0 = |4r cos θ sin θ| ≤ 4r = 4 x x2 + y 2 < ϵ/4.
x2 + y 2
Choosing ϵ/4 = δ, we have
4x2 y p
− 0 < ϵ for x2 + y 2 < δ.
x2 + y 2
4x2 y
Hence, lim = 0.
(x,y)→(0,0) x2 + y 2
p
Ex. Discuss the geometry of lim 1 − x2 − y 2 = 1.
(x,y)→(0,0)
10
p
Sol. We know that 1 − x2 − y 2 is hemispherical surface with the circular domain x2 + y 2 ≤ 1 centred
at (0, 0) in the XY-plane (See right panel of Figure 1). Any path in the neighbourhood of (0, 0) inside the
circular domain in the XY-plane corresponds to a curve on the hemispherical surface. It is easy to imag-
ine/visualize that when (x, y) → (0, 0) along any path in the neighbourhood of (0, 0), the corresponding
p
curve on the hemispherical surface converges to the point (0, 0, 1) on Z-axis. Itpimplies that 1 − x2 − y 2
tends to 1 for every path along which (x, y) → (0, 0). That is why, lim 1 − x2 − y 2 = 1.
(x,y)→(0,0)
Remark: If lim f (x, y) exists, then the graph (surface) of the function f (x, y), geometrically,
(x,y)→(x0 ,y0 )
converges or strikes at the same point (whose z-coordinate is the limit of f (x, y)) from all directions in
the neighbourhood of (x0 , y0 ).
Remark: The limit of a function at a pointp may exist even if the function is not defined there at. For
example, consider the function f (x, y) = 1 − x2 − y 2 , (x, y) ̸= (0, 0), which is not defined at x = (0, 0)
as per its given definition. Its graph is the hemispherical surface punctured at the pointp (0, 0, 1) with
2 2
the circular domain x + y ≤ 1 centered at (0, 0) in the XY-plane. We find lim 1 − x2 − y 2 = 1
(x,y)→(0,0)
since when (x, y) → (0, 0) along any path in the neighbourhood of (0, 0), the corresponding curve on the
hemispherical surface converges to the point (0, 0, 1) on Z-axis.
Sandwich Theorem: Suppose the functions g(x, y), f (x, y) and h(x, y) are defined in some neigh-
bourhood of (x0 , y0 ) except possibly at (x0 , y0 ) such that g(x, y) ≤ f (x, y) ≤ h(x, y) for all (x, y), and
lim g(x, y) = L = lim h(x, y). Then lim f (x, y) = L.
(x,y)→(x0 ,y0 ) (x,y)→(x0 ,y0 ) (x,y)→(x0 ,y0 )
11
14.4.1 Continuity of a function of two real variables
Let f be a function defined in some neighbourhood of (x0 , y0 ). Then f is said to be continuous at (x0 , y0 )
if lim f (x, y) = f (x0 , y0 ). We say that f is continuous in an open region in the XY-plane if it is
(x,y)→(x0 ,y0 )
continuous at each point of the region.
Geometry: The existence of limit at (x0 , y0 ) ensures that the graph (surface) of f (x, y) points towards
the same point from all directions in each neighbourhood of (x0 , y0 ). Also, the limit is equal to f (x0 , y0 ).
So the point of convergence of the surface is (x0 , y0 , f (x0 , y0 )). Therefore, continuity of f at (x0 , y0 ) implies
that there exists at least one neighbourhood of (x0 , y0 ) in which the f (x, y) has continuous surface, that
is, without any break point or point hole. Consequently, continuity over a region implies that the function
has continuous surface in thepregion.
Ex. The function f (x, y) = 1 − x2 − y 2 is continuous at (0, 0) since lim f (x, y) = 1 = f (0, 0). In fact,
p x→0
f (x, y) = 1 − x2 − yp 2 is continuous at every point in its domain circular region x2 + y 2 ≤ 1. That is
why, graph of f (x) = 1 − x2 − y 2 is the continuous hemispherical surface as shown in right panel of the
Figure 1.
Ex. The function f (x, y) = x2xy +y 2
, (x, y) ̸= (0, 0), is not continuous at (0, 0) since the function is not
defined at (0, 0). Even if the function is defined to have some value at (0, 0), it can not be continuous at
(0,0) since lim f (x, y) does not exist. The graph surface of f (x, y) breaks at the point (0, 0) as may
(x,y)→(0,0)
be seen in the right panel of Figure 8. Notice that the function is continuous everywhere in the XY-plane
except (0, 0).
Next, we define the expression f ′ (x)∆x to be the differential of f and denote it by df . So df = f ′ (x)∆x. If
we choose f (x) = x, then dx = 1.∆x = ∆x. Therefore, the differential of f can be written as df = f ′ (x)dx.
Remark1: Notice that by definition of differential, dx = ∆x. Likewise, we have df = ∆f . It means,
differentials can be replaced by the corresponding infinitesimals and vice versa. That is why, the expression
df
is interpreted in two ways. First, simply as the ratio of the differentials df and dx or the infinitesimals
dx
df ∆f
∆f and ∆x. Second, it is the limit of the ratio of the infinitesimals ∆f and ∆x, that is, = lim .
dx ∆x→0 ∆x
12
In this case, it is derivative of f with respect to x. Thus, the differential of f is df while the derivative of
df
f , denoted by f ′ (x) or , is the limit of the ratio of the infinitesimals ∆f and ∆x. In fact, the derivative
dx
measures a rate of change, while the differential measures the change itself. In practical situations,
sometimes we are interested in the rate of change of some quantity and sometimes we want to know the
change itself. Therefore, derivative and differential are not same but both are useful.
Remark2: If y is a differentiable function of x, and x is a differentiable function of t, then y is a
differentiable function of t, and
dy dy dx
= ,
dt dx dt
known as the chain rule of derivatives. In the chain rule, it appears like we have multiplied and divided
by dx. But it is not true because the expressions dy dy dx
dt , dx and dt are derivatives and not the ratios of the
differentials dy, dx and dt. For, let ∆t be an infinitesimal change in t, and ∆x and ∆y be corresponding
changes in x and y, respectively. Then we can write,
∆y ∆y ∆x
=
∆t ∆x ∆t
Here, we have really multiplied and divided by ∆x. In the limit ∆t → 0, we get
∆y ∆y ∆x
lim = lim lim .
∆t→0 ∆t ∆x→0 ∆x ∆t→0 ∆t
13
Ex. Show that the function f (x) = |x| is not differentiable at x = 0.
Sol. We have
1.2
1.0
0.8
0.6
0.4
0.2
14
where A and B both are independent of ∆x and ∆y; and the functions ϕ(∆x, ∆y) and ψ(∆x, ∆y) both
tend to 0 as (∆x, ∆y) → (0, 0).
Choosing ∆y = 0 and dividing (14.3) throughout by ∆x, and then taking limit as ∆x → 0, we get
∆f f (x + ∆x, y) − f (x, y)
lim = lim = A.
∆x→0 ∆x ∆x→0 ∆x
∆f f (x + ∆x, y) − f (x, y)
We define the limit lim or lim to be the partial derivative of f with
∆x→0 ∆x ∆x→0 ∆x
∂f ∂f
respect to x, and denote it by or fx (x, y). So = fx (x, y) = A.
∂x ∂x
Similarly, choosing ∆x = 0 and dividing (14.3) throughout by ∆y, and then taking limit as ∆y → 0,
we get
∂f ∂f
∆f = f (x + ∆x, y + ∆y) − f (x, y) = ∆x + ∆y + ∆x ϕ(∆x, ∆y) + ∆y ψ(∆x, ∆y), (14.4)
∂x ∂y
∂f ∂f
Next, we define the expression ∆x + ∆y to be the total differential of f and denote it by df . So
∂x ∂y
∂f ∂f
df = ∆x + ∆y. If we choose f (x, y) = x, then dx = 1.∆x = ∆x. Similarly, the choice f (x, y) = y
∂x ∂y
leads to dy = 1.∆y = ∆y. Therefore, the total differential of f can be written as
∂f ∂f
df = dx + dy.
∂x ∂y
15
Figure 14.10: Graph of f (x, y) shows that the surface is continuous but with a sharp edge at (0, 0, 0).
Sufficient condition for differentiability: If the partial derivatives fx and fy are continuous at (x0 , y0 ),
then f (x, y) is differentiable at (x0 , y0 ).
Geometry of partial derivative: Consider the partial derivative fx (x0 , y0 ) of f (x, y) at (x0 , y0 ). The
partial derivative fx (x0 , y0 ), in fact, is the ordinary derivative of f (x, y0 ), which is the curve in XYZ-space
determined by the section of the plane y = y0 with the surface z = f (x, y). Therefore, fx (x0 , y0 ) is the
slope of the tangent to the curve z = f (x, y0 ) at the point (x0 , y0 , f (x0 , y0 )) as shown in Figure 10.
16
Figure 14.11:
Ex. Find fx and fy given that f (x, y) = tan−1 xy .
Sol. We have
y2 1
1 ∂ x y
fx = 2 = 2 2
= 2 .
1+ 2 x ∂x y x +y y x + y2
y
y 2 −x
1 ∂ x −x
fy = x2
= 2 2 2
= 2 .
1+ ∂y y x +y y x + y2
y2
17
f
x y
t t
Figure 14.12: There are two paths in the tree diagram: f → x → t, and f → y → t.
x y
u v u v
Figure 14.13: There are four paths in the tree diagram: f → x → u, f → y → u, f → x → v and
f → y → v.
Likewise, we can write the chain rule for a composite function f with any number of variables.
Ex. Use the Chain Rule to find the derivative of f (x, y) = xy with respect to t along the path x = cos t,
y = sin t.
Sol. We have
df ∂f dx ∂f dy
= + = y(− sin t) + x cos t = − sin2 t + cos2 t = cos 2t.
dt ∂x dt ∂y dt
∂g ∂g
Ex. If g(x, y, z) = f (x − y, y − z, z − x), show that ∂x + ∂y + ∂g
∂z = 0.
Sol. Let p = x − y, q = y − z and r = z − x so that f is a function of p, q, r, and p, q, r are functions of
x, y, z. So
∂g ∂f ∂p ∂f ∂q ∂f ∂r ∂f ∂f ∂f ∂f ∂f
= + + = .1 + .0 + (−1) = − .
∂x ∂p ∂x ∂q ∂x ∂r ∂x ∂p ∂q ∂r ∂p ∂r
18
Similarly,
∂g ∂f ∂p ∂f ∂q ∂f ∂r ∂f ∂f
= + + = −
∂y ∂p ∂y ∂q ∂y ∂r ∂y ∂q ∂p
and
∂g ∂f ∂p ∂f ∂q ∂f ∂r ∂f ∂f
= + + = − .
∂z ∂p ∂z ∂q ∂z ∂r ∂z ∂r ∂q
∂g ∂g ∂g
Adding ∂x , ∂y and ∂z , we get the required result.
df ∂f dx ∂f dy ∂f ∂f dy
= + = + .
dx ∂x dx ∂y dx ∂x ∂y dx
df
Also, f (x, y) = 0 implies dx = 0. Thus, we obtain
∂f
dy ∂x fx
= − ∂f =− .
dx fy
∂y
dy
This formula expresses dx in terms of the partial derivatives of f (x, y). It eases the life when we calculate
dy
dx from an implicit relation in x and y.
dy
Ex. Find dx from the relation xy + y x = 1.
Sol. Assuming f (x, y) = xy + y x − 1, we get
dy fx yxy−1 + y x ln y
=− = y .
dx fy x ln x + xy x−1
∂z ∂z
Ex. Find ∂x and ∂y from the relation x3 + z 2 + yexz + z cos y = 0.
Sol. Assuming f (x, y, z) = x3 + z 2 + yexz + z cos y, we get
∂z fx 3x2 + yzexz
=− =− ,
∂x fz 2z + xyexz + cos y
∂z fy exz − z sin y
=− =− .
∂y fz 2z + xyexz + cos y
∂f f (x + h, y) − f (x, y)
fx (x, y) = = lim
∂x h→0 h
and
∂f f (x, y + k) − f (x, y)
fy (x, y) = = lim .
∂y k→0 k
19
The second order partial derivatives of f (x, y) are
∂2f
∂ ∂f fx (x + h, y) − fx (x, y)
fxx (x, y) = 2
= = lim ,
∂x ∂x ∂x h→0 h
∂2f
∂ ∂f fx (x, y + k) − fx (x, y)
fxy (x, y) = = = lim ,
∂y∂x ∂y ∂x k→0 k
∂2f
∂ ∂f fy (x + h, y) − fy (x, y)
fyx (x, y) = = = lim ,
∂x∂y ∂x ∂y h→0 h
∂2f
∂ ∂f fy (x, y + k) − fy (x, y)
fyy (x, y) = 2
= = lim .
∂y ∂y ∂y k→0 k
Likewise we can define third order and higher order partial derivatives.
Caution: While dealing with problems in your text book, you will notice that fxy = fyx . In other words,
it does not matter in which order we calculate the partial derivatives. However, you should keep in mind
that the equality fxy = fyx is ensured only when fxy and fyx are continuous functions, otherwise fxy and
fyx need not be equal.
For example,
( consider the function
xy(x2 −y 2 )
x2 +y 2
if (x, y) ̸= (0, 0)
f (x, y) =
0 if (x, y) = (0, 0)
Then it can be shown that fyx (0, 0) = 1 and fxy (0, 0) = −1. So fxy (0, 0) ̸= fyx (0, 0).
For,
fy (h, 0) − fy (0, 0)
fyx (0, 0) = lim .
h→0 h
Next,
hk(h2 −k2 )
f (h, k) − f (h, 0) h2 +k2
−0
fy (h, 0) = lim = lim =h
k→0 k k→0 k
and
f (0, k) − f (0, 0) 0−0
fy (0, 0) = lim = lim = 0.
k→0 k k→0 k
It follows that
fy (h, 0) − fy (0, 0) h−0
fyx (0, 0) = lim = lim = 1.
h→0 h h→0 h
Likewise, we get
fx (0, k) − fx (0, 0) −k − 0
fxy (0, 0) = lim = lim = −1.
k→0 k k→0 k
20
14.7 Directional Derivatives and Gradient Vectors
The directional derivative of a function
f (x, y, z) at a point (x, y, z) in the direction of a unit vector
df
n̂ = lî + mĵ + nk̂, denoted by dt or Dn̂ f , is defined as the limit
n̂
df f (x + lt, y + mt, z + nt) − f (x, y, z)
= lim ,
dt n̂ t→0 t
provided it exists. Note that (x + lt, y + mt, z + nt) is a neighbouring point of (x, y, z) on a line through
(x, y, z) in the direction of n̂ = lî + mĵ + nk̂. In particular, the directional derivative of f (x, y, z) in a
direction parallel to x-axis, where n̂ = î, reads as
df f (x + t, y, z) − f (x, y, z)
= lim ,
dt î t→0 t
which is nothing but the partial derivative of f with respect to x, that is, fx . Similarly, directional
derivatives of f (x, y, z) in directions parallel to y-axis and z-axis are fy and fz , respectively.
So we see
df
that the partial derivatives fx , fy and fz are special cases of the directional derivative dt n̂ of f (x, y, z).
The parametric equations of the line through (x, y, z) in the direction n̂ = lî+mĵ +nk̂ are x(t) = x+lt,
y(t) = y + mt and z(t) = z + nt. So by chain rule, we have
df ∂f ∂f ∂f ∂f ∂f ∂f
= l+ m+ n= î + ĵ + k̂ .(lî + mĵ + nk̂).
dt ∂x ∂y ∂z ∂x ∂y ∂z
So the directional derivative df
dt may be written as
n̂
df
= ∇f.n̂ ,
dt n̂
∂f ∂f ∂f
where ∇f = ∂x î + ∂y ĵ + ∂z k̂ is known as gradient vector of f (x, y, z).
df
This vector representation of directional derivative is useful in view of the fact that dt n̂ represents
the rate of change of f (x, y, z) in the direction n̂. If θ is angle between ∇f and n̂, then we have
df
= |∇f | cos θ.
dt n̂
Obviously, maximum value of directional derivative is |∇f |, and is obtained in the direction of ∇f .
Similarly, minimum value of directional derivative is −|∇f |, in the direction opposite to ∇f . It is 0 in
the direction perpendicular to ∇f .
Let P (x, y, z) be a point on a level surface f (x, y, z) = c, where c is a constant. Suppose → −r (t) =
x(t)î + y(t)ĵ + z(t)k̂ be any curve on the level surface f (x, y, z) = c passing through the point P (x, y, z)
so thatf (x(t), y(t), z(t)) = c. Therefore, we have
df ∂f dx ∂f dy ∂f dz d→
−r
0= = + + = ∇f. .
dt ∂x dt ∂y dt ∂z dt dt
−
→
Since ddtr is tangent vector to each curve → −r (t) = x(t)î + y(t)ĵ + z(t)k̂ on the level surface f (x, y, z) = c
passing through the point P (x, y, z), so it follows that the gradient vector ∇f is normal to the level surface
at P (x, y, z). Further, it implies that the directional derivative at any point of a level surface is maximum
along the direction of normal to the surface at that point.
21
Ex. Find ∇f at the point (1, −2, −1), given that f = 3x2 y − y 3 z 2 .
Ans. −12î − 9ĵ − 16k̂.
Ex. Find the directional derivative of f = x2 − y 2 + 2z 2 at the point P (1, 2, 3) in the direction of line P Q,
where Q is the point (5,0,4). Also find the direction and magnitude of maximum directional derivative at
P. √ √
Ans. 4 21/3, 2î − 4ĵ + 12k̂, 2 41.
Ex. If →
−
r = xî + y ĵ + z k̂, then show that ∇rn = nrn−2 →
−
r.
Ex. Find the angle between the normals to the surfaces x2 + y 2 + z 2 = 9 and z = x2 + y 2 − 3 at the
point (2, −1, 2). Also, find the equation of tangent line at (2, −1, 2) to the curve of intersection of the two
surfaces.
Sol. Angle between two surfaces at a point of intersection is equal to the angle between their normals at
that point. Let f = x2 +y 2 +z 2 −9 = 0 and g = x2 +y 2 −z−3 = 0. Then ∇f and ∇g at(2, −1, 2) are normal
∇f.∇g
√
vectors to the two given surfaces at (2, −1, 2). So required angle is cos−1 |∇f ||∇g| = cos−1 (8 21/63).
Next, the tangent line at (2, −1, 2) is the intersection of the tangent planes to the two surfaces at (2, −1, 2).
So the tangent line at (2, −1, 2) is perpendicular to the plane of the vectors ∇f and ∇g at (2, −1, 2). Thus,
the vector ∇f ×∇g at (2, −1, 2) is parallel to the tangent line at (2, −1, 2). So it gives direction ratios of the
tangent line, and it is easy to write the equation of tangent line at (2, −1, 2) by using x−x l
0
= y−y 0
m = n .
z−z0
Ex. Find the values of constants a and b so that the surfaces ax2 − byz = (a + 2)x and 4x2 y + z 3 = 4
may intersect orthogonally at the point (1, −1, 2).
Ans. a = 2.5, b = 1.
(
0, y = x
Ex. Let f (x, y) = .
1, y ̸= x
Show that the directional derivative exists at (0, 0) along the line y = x though ∇f does not exists at
(0, 0).
So. The unit vector along the line y = x is n̂ = √12 î + √12 ĵ. Also, f (x, y) = 0 along y = x. So the
directional derivative of f at (0, 0) along the line y = x is given by
df f (0 + √12 t, 0 + √12 t) − f (0, 0) 0−0
= lim = lim = 0.
dt n̂ t→0 t t→0 t
Now, ∇f at (0, 0) is given by
∇f = fx (0, 0)î + fy (0, 0)ĵ + fz (0, 0)k̂.
It does not exist since
f (h, 0) − f (0, 0) 1−0
fx (0, 0) = lim = lim
h→0 h h→0 h
does not exist.
Also, from ∇f = fx (0, 0)î + fy (0, 0)ĵ + fz (0, 0)k̂, it is clear that existence of ∇f at (0, 0) requires the
existence of all the three directional derivatives fx (0, 0), fy (0, 0) and fz (0, 0) of f at (0, 0) along x-axis,
y-axis and z-axis, respectively.
(
0, xy ̸= 0
Ex. Let f (x, y) = .
1, xy = 0
Show that the directional derivative does not exist at (0, 0) along the line y = x though ∇f exists at
(0, 0).
22
14.8 Tangent Planes and Differentials
The tangent plane at the point P0 (x0 , y0 , z0 ) on the level surface f (x, y, z) = c of a differentiable function
f is the plane through P0 normal to ∇f |P0 . So direction ratios of the normal to the tangent plane at
P0 (x0 , y0 , z0 ) are fx (P0 ), fy (P0 ) and fz (P0 ). So equation of the tangent plane is
Ex. Find the equation of tangent plane and normal to the sphere x2 + y 2 + z 2 = 1 at the point (0, 0, 1).
Sol. Normal vector to the given sphere is
→
−n = ∇(x2 + y 2 + z 2 ) = 2xî + 2y ĵ + 2z k̂ = 2k̂, at the point (0, 0, 1). So direction ratios of the normal to
the tangent plane at (0, 0, 1) are 0, 0 and 2. So equation of the tangent plane at (0, 0, 1) is
Ex. Estimate how much the value of f (x, y, z) = y sin x + 2yz will change if the point P (x, y, z) moves
0.1 unit from P0 (0, 1, 0) straight towards P1 (2, 2, −2).
−−−→
P0 P1
Sol. We find n̂ = −−−→ = 23 î + 13 ĵ − 23 k̂.
|P0 P1 |
Next, ∇f |P0 = y cos xî + (sin x + 2z)ĵ + 2y k̂|P0 (0,1,0) = î + 2k̂.
So with ds = 0.1, the resulting change in f is
1 n (n) (x )
It is called Taylor’s formula with the remainder term Rn+1 = n! (x − x0 ) f 0 after the n terms. The
values of x for which Rn+1 → 0 as n → ∞, we have
∞
X 1
f (x) = (x − x0 )k f (k) (x0 ).
k!
k=0
23
This is called the Taylor series expansion of f (x) about x = x0 .
is called linearization of f (x) about x = x0 . Since f (x) = f (x0 ) + f (x0 )(x − x0 ) is tangent to the curve
f (x) at x = x0 , the linearization is geometrically the tangent line approximation of the curve f (x) in the
close vicinity of (x0 , f (x0 )). Also, the absolute error in the linear approximation or linearization is given
by
1 ′′
E(x) = (x − x0 )2 |f (x0 + (x − x0 )θ)|.
2!
For example, sin x ≈ x is the linear approximation of sin x about x = 0, and the absolute error is
given by
1 2 1
E(x) = x | cos(θx)| ≤ x2 ≤ 0.5,
2! 2!
in case we choose x ∈ [−1, 1]. It means the maximum possible error is 0.5 in the linear approximation
sin x ≈ x for x ∈ [−1, 1].
where
∂ n
1 ∂
Rn+1 = (x − x0 ) + (y − y0 ) f (x0 + (x − x0 )θ, y0 + (y − y0 )θ)
n! ∂x ∂y
is the remainder term after the n terms. The values of (x, y) for which Rn+1 → 0 as n → ∞, we have
∞
∂ k
X 1 ∂
f (x, y) = (x − x0 ) + (y − y0 )(x − x0 ) f (x0 , y0 ).
k! ∂x ∂y
k=0
where
1
fxx (c, d)(x − x0 )2 + 2fxy (c, d)(x − x0 )(y − y0 ) + fyy (c, d)(y − y0 )2
R3 =
2!
with c = x0 + (x − x0 )θ) and d = y0 + (y − y0 )θ).
24
The approximation using the first two terms, that is,
is called linearization of f (x, y) about (x0 , y0 ). Since f (x, y) = f (x0 , y0 )+[fx (x0 , y0 )(x−x0 )+fy (x0 , y0 )(y−
y0 ) is tangent plane to the surface z = f (x, y) at (x0 , y0 , f (x0 , y0 )), the linearization is geometrically the
tangent plane approximation of the surface z = f (x, y) in the close vicinity of (x0 , y0 , f (x0 , y0 )). Also,
the absolute error in the linear approximation or linearization is given by E(x, y) = |R3 |.
If f has continuous first and second partial derivatives throughout an open set containing a rectangle
R centered at (x0 , y0 ) and if M is any upper bound for the values of |fxx |, |fxy | and |fyy | on R, then the
error E(x, y) occurred in replacing f (x, y) on R by its linearization
L(x, y) = f (x0 , y0 )+fx (x0 , y0 )(x−x0 )+fy (x0 , y0 )(y−y0 ) = f (3, 2)+fx (3, 2)(x−3)+fy (3, 2)(y−2) = 4x−y−2.
14.8.4 Differentials
Suppose the first order partial derivatives of a function f (x, y) exist at (x0 , y0 ). If x and y change from
x0 and y0 by small amounts dx and dy respectively, then the total differential
Ex. Suppose that a cylindrical can is designed to have a radius of 1 in. and a height of 5 in., but that
the radius and height are off by the amounts dr = +0.03 and dh = −0.1. Estimate the resulting absolute
change in the volume of the can.
Sol. To estimate the absolute change in V = πr2 h, we use
dV = 2πr0 h0 dr + πr02 (r0 , h0 )dh = 2π(1)(5)(0.03) + π(1)2 (−0.1) = 0.2π ≈ 0.63 in3 .
25
14.9 Extreme Values and Saddle Points
14.9.1 Maxima and minima of a function of one variable
Let f be a continuous function in some interval I, and x0 ∈ I. Then (x0 , f (x0 )) is said to be
(i) local maximum point of f (x) if f (x) − f (x0 ) remains negative in some deleted nbd of x0 , that is, there
exists some δ > 0 such that f (x) − f (x0 ) < 0 for all x ∈ (x0 − δ, x0 + δ) − {x0 },
(ii) local minimum point of f (x) if f (x) − f (x0 ) remains positive in some deleted nbd of x0 , that is, there
exists some δ > 0 such that f (x) − f (x0 ) > 0 for all x ∈ (x0 − δ, x0 + δ) − {x0 },
(iii) inflection point of f (x) if f (x) − f (x0 ) attains positive as well as negative values in every nbd of x0 ,
(iv) global maximum or absolute maximum point of f (x) if f (x) − f (x0 ) ≤ 0 for all x ∈ I,
(v) global minimum or absolute minimum point of f (x) if f (x) − f (x0 ) ≥ 0 for all x ∈ I.
Critical point
A critical point of a function is a point where its first derivative vanishes. So (x0 , f (x0 )) is a critical point
of f (x) if f ′ (x0 ) = 0.
It is easy to verify that the local maximum and minimum points of f (x) are its critical points.
For, if f (x) has local maximum value at x = x0 , then f (x) < f (x0 ) throughout some deleted nbd of
x0 . So we have
f (x) − f (x0 )
f ′ (x0 ) = lim ≥0 (∵ x − x0 < 0)
x→x−
0
x − x0
and
f (x) − f (x0 )
f ′ (x0 ) = lim ≤ 0. (∵ x − x0 > 0)
x→x+
0
x − x0
26
that (x0 , f (x0 )) is a local maximum point of f (x). Similarly, f ′′ (x0 ) < 0 implies that (x0 , f (x0 )) is a local
minimum point of f (x).
In case, f ′′ (x0 ) = 0, we can use the Taylor’s formula with remainder after three terms to obtain
1
f (x) − f (x0 ) = (x − x0 )3 f ′′′ (x0 + (x − x0 )θ).
3!
If f ′′′ (x0 ) ̸= 0, then continuity of f ′′′ implies that f ′′′ (x0 + (x − x0 )θ) ̸= 0 retains the same sign as of
f ′′′ (x0 ) in some nbd of x0 . It follows that f (x) − f (x0 ) attains positive as well as negative values in every
nbd of x0 . So (x0 , f (x0 )) is an inflection point of f (x).
In case, f ′′′ (x0 ) also vanishes, we can decide local maximum or minimum based on the sign of fourth
derivative of f at x0 as we did above in case of second derivative.
Ex. Show that (0, 0) is local minimum point of f (x) = x2 .
Sol. Here f ′ (x) = 2x. So f ′ (x) = 0 implies x = 0. We see that f ′ (x) = 2x changes its sign from negative
to positive in the nbd (−0.1, 0.1) of x = 0. So by the first derivative test, f (x) has local minimum at
x = 0, and the local minimum value is f (0) = 0.
Also, f ′′ (x) = 2 is positive at x = 0. So the second derivative test implies that f (x) has local minimum
at x = 0.
Ex. Find global maximum/minimum values of f (x) = x2 in the interval [−1, 2].
Sol. In [−1, 2], we find that f (x) has critical point at x = 0. Also, f (−1) = 1, f (0) = 0 and f (2) = 4 So
in [−1, 2], global minimum value of f (x) = x2 is 0, and global maximum value is 4. Global minimum and
maximum points are (0, 0) and (2, 4), respectively.
In case, we are given to find the global maximum/minimum values of f (x) = x2 in the open interval
(−1, 2), then the global minimum value f (0) = 0 is same as in the previous case of closed interval. But
we can not tell the global maximum value of f (x) = x2 in the open interval (−1, 2) because x = 2 does
not belong to this domain interval of f (x) = x2 .
Thus, if domain of a function is a closed interval, we can find its global maximum and minimum values
by considering the values of the function at the end points of the interval and the x-values of critical points
lying inside the interval. In case, of open interval domain, we consider only the x-values of critical points
lying inside the interval for determining global maximum/minimum values of the function.
27
14.10 Maxima and minima of a function of two variables
Let f be a continuous function defined in some region D of xy-plane, and (x0 , y0 ) ∈ D. Then (x0 , y0 , f (x0 , y0 ))
is said to be
(i) local maximum point of f (x, y) if f (x, y) − f (x0 , y0 ) remains negative in some deleted nbd of (x0 , y0 ),
(ii) local minimum point of f (x, y) if f (x, y) − f (x0 , y0 ) remains positive in some deleted nbd of (x0 , y0 ),
(iii) saddle point1 of f (x, y) if f (x, y) − f (x0 , y0 ) attains positive as well as negative values in every nbd
of (x0 , y0 ),
(iv) global maximum or absolute maximum point of f (x, y) if f (x, y) − f (x0 , y0 ) ≤ 0 for all (x, y) ∈ D,
(v) global minimum or absolute minimum point of f (x, y) if f (x, y) − f (x0 , y0 ) ≥ 0 for all (x, y) ∈ D.
Figure 14.14: Left: The person is standing at the saddle point of the hill surface. Is’nt it? Right: How
many saddle points you eat in a day:)
Critical point
A critical point of a function f (x, y) is a point where its first order partial derivatives vanishes. So
(x0 , y0 , f (x0 , y0 )) is a critical point of f (x, y) if fx (x0 , y0 ) = 0 and fx (x0 , y0 ) = 0.
It is easy to verify that the local maximum and minimum points of f (x, y) are its critical points.
1
Consider the point at the center of the surface of a saddle (seat used on a horse) placed on the back of a horse. It is easy
to imagine that the surface points rise as we move on the surface from the center along the horse while the surface points
fall as we move on the surface from the center perpendicular to the horse. Clearly, in every nbd of the center of the this
surface, there are points which are at more height than the center as well as there are points which are at less height than
the center. It implies that center of the surface of saddle is a saddle point. (That is why the name saddle point!)
28
For, if f has a local extremum (maximum or minimum) at (x0 , y0 ), then the function g(x) = f (x, y0 )
has a local extremum at x = x0 . Therefore, g ′ (x0 ) = 0. Now g ′ (x0 ) = fx (x0 , y0 ), so fx (x0 , y0 ) = 0. A
similar argument with the function h(y) = f (x0 , y) shows that fy (x0 , y0 ) = 0.
Ex. Show that (0, 0, 0) is the saddle point of the hyperbolic paraboloid z = x2 − y 2 .
Sol. Please try yourself.
29
Now, gx = −2x = 0 and gy = −2y = 0 imply (x, y) = (0, 0). Next at (0, 0), we have gxx = −2, gxy = 0
2 = 4 > 0 and g
and gyy = −2. Therefore, gxx gyy − gxy xx = −2 < 0. It implies that g(x, y), and hence
f (x, y) has local maximum value at (0, 0), and the maximum value reads as f (0, 0) = 1.
Note: In case, the domain of the function is a closed and bounded region R, its absolute maxi-
mum/minimum values are the maximum/minimum of the values of the function at the critical points
within R and at the boundary points of R as illustrated in the following example.
f (x, y) = 2 + 2x + 2y − x2 − y 2
Figure 14.16:
We find fx = 2 − 2x and fy = 2 − 2y. So fx = 0 and fy = 0 yield the only critical point (1, 1) in the
interior of R, and f (1, 1) = 4.
Now we consider the three sides of the triangular region OAB one by one.
Along OA, y = 0. So we have f (x, 0) = 2 + 2x − x2 , which may be treated a function of single variable
x on the interval [0, 9]. Its extreme values may occur at the end points x = 0, x = 9 and at the interior
points, where f ′ (x, 0) = 2 − 2x = 0 or x = 1. The values are f (0, 0) = 2, f (9, 0) = −61 and f (1, 0) = 3.
Along OB, x = 0. So we have f (0, y) = 2 + 2y − y 2 , which may be treated a function of single variable
y on the interval [0, 9]. Its extreme values may occur at the end points y = 0, y = 9 and at the interior
points, where f ′ (0, y) = 2 − 2y = 0 or y = 1. The values are f (0, 0) = 2, f (0, 9) = −61 and f (0, 1) = 3.
Along AB, y = 9 − x. So we have f (x, y) = −61 + 18x − 2x2 , which may be treated a function of
single variable x on the interval [0, 9]. Its extreme values may occur at the end points x = 0, x = 9 and
at the interior points, where f ′ (x, 9 − x) = 18 − 4x = 0 or x = 9/2, and therefore y = 9 − 9/2 = 9/2. The
values are f (0, 0) = 2, f (0, 9) = −61 and f (9/2, 9/2) = −41/2.
Finally we list all the values: 4, 2, −61, 3, −(41/2). So the absolute maximum value is 4, which f
assumes at (1, 1). The absolute minimum value is −61, which f assumes at (0, 9) and (9, 0).
30
Ex. Find the dimensions of a box open at the top with volume 32 cubic units requiring the least material
for its construction.
Sol. Let x, y, z be length, breadth and height of the box so that its volume is xyz = 32 and surface area
is xy + 2yz + 2zx. So we need to minimize xy + 2yz + 2zx subject to the constraint xyz = 32. Using
32
z = xy into the surface area function, we get the two variable function
64 64
f (x, y) = xy + + .
x y
Applying the maxima/minima test (please try yourself), we find minimum value of f (x, y) for x = 4,
y = 4. Thus, the required dimensions of the box are 4, 4, 2.
df df dx df dy df dz d→
−r
= + + = ∇f. .
dt dx dt dy dt dz dt dt
−
→
At a point P (x(t), y(t), z(t)) of local maxima or minima of f on the curve C, df dr
dt = 0. So ∇f. dt = 0,
which in turn implies that ∇f is orthogonal to C at the point of maxima or minima.
Now suppose a differentiable function f (x, y, z) takes a local maximum or minimum value at some
point P0 of a surface g(x, y, z) = 0, where g(x, y, z) is differentiable function and ∇g ̸= 0. Then f takes a
local maximum or minimum value at point P0 relative to its values on every differentiable curve passing
through P0 and lying on the surface g(x, y, z) = 0. So ∇f is orthogonal to every differentiable curve C
at the point P0 . It implies that ∇f is normal to the surface g(x, y, z) = 0 at the point P0 . Also, ∇g is
normal to the surface g(x, y, z) = 0 at every point. It implies that the normal vectors ∇f and ∇g at P0
are parallel vectors, and therefore ∇f = λ∇g for some non-zero constant λ, known as Lagrange multiplier.
Thus, to find the local maximum and minimum values of f subject to the constraint g(x, y, z) = 0, we
find the values of x, y, z and λ that simultaneously satisfy the equations
∇f = λ∇g, g(x, y, z) = 0.
Note: It must be noted that Lagrange multiplier method gives the points related to the extremum
(maximum or minimum) values of the function. It does not make distinction between points of maxima
and minima. In other words, it does not tells us the exact nature of the point. We have to judge the
nature of the point from the nature of the problem under consideration. It may be treated as a drawback
of the Lagrange multiplier method.
Ex. Find the dimensions of a box open at the top with volume 32 cubic units requiring the least material
for its construction.
Sol. Let x, y, z be length, breadth and height of the box so that its volume is xyz = 32 and surface area
is xy + 2yz + 2zx. So we need to minimize xy + 2yz + 2zx subject to the constraint xyz = 32. We model
the given problem as a Lagrange multiplier problem with
and look for the values of x, y, z and λ that satisfy the equations
∇f = λ∇g, g(x, y, z) = 0.
31
x + 2z − λxz = 0
2x + 2y − λxy = 0
xyz − 32 = 0
Multiplying the first three equations by x, y and z, respectively, we get
xy + 2xz − λxyz = 0
xy + 2yz − λxyz = 0
2xz + 2yz − λxyz = 0
Now subtracting the first two and last two of the above equations, we get x = y and y = 2z, respectively.
Using these in xyz − 32 = 0, we get x = 4, y = 4 and z = 2.
2
x2 2
Ex. Find the maximum volume of the cuboid that can be inscribed inside the ellipsoid a2
+ yb2 + zc2 = 1.
2
x2 2
Sol. Let 2x, 2y and 2z be dimensions of the cuboid that can be inscribed inside the ellipsoid a2
+ yb2 + zc2 =
1. So volume of the cuboid is 8xyz. Thus, we need to maximize
x2 y 2 z 2
g(x, y, z) = + 2 + 2 − 1 = 0. (14.6)
a2 b c
So as per the method of Lagrange multipliers, in order to find the local maximum and minimum values of
f subject to the constraint g(x, y, z) = 0, we find the values of x, y, z and λ that simultaneously satisfy
the equations
∇f = λ∇g, g(x, y, z) = 0.
12xyz + λ = 0.
Ex. The plane x + y + z = 1 cuts the cylinder x2 + y 2 = 1 in an ellipse (Figure 16). Find the points on
the ellipse that lie closest to and farthest from the origin.
Sol. We find the extreme values of
32
Figure 14.17:
Figure 14.18:
f (x, y, z) = x2 + y 2 + z 2
(the square of the distance from (x, y, z) to the origin) subject to the constraints
g1 (x, y, z) = x2 + y 2 − 1 = 0,
g2 (x, y, z) = x + y + z − 1 = 0.
The gradient equation ∇f = λ∇g1 + µ∇g2 leads to the following equations:
2x = 2λx + µ,
2y = 2λy + µ,
2z = µ.
Eliminating µ, we get the following equations:
(1 − λ)x = z,
(1 − λ)y = z.
Obviously, these two equations are satisfied simultaneously provided either λ = 1 and z = 0 or λ ̸= 1 and
x = y = z/(1 − λ).
33
In case z = 0, the solution of the equations x2 + y 2 − 1 = 0 and x + y + z − 1 = 0 yields the two points
(1, 0, 0) and (0, 1, 0).
In case x = y, the solution of the equations x2 + y 2 − 1 = 0 and x + y + z − 1 = 0 gives the two points
√ √ ! √ √ !
2 2 √ − 2 2 √
P1 = , , 1 − 2 , P2 = ,− ,1 + 2 .
2 2 2 2
The distance of the points√(1, 0, 0) and √ (0, 1, 0) from the origin is 1 unit while the distances of P1 and P2
from the origin are 4 − 2 2 and 4 + 2 2 units respectively. So the points on the ellipse closest to the
origin are (1, 0, 0) and (0, 1, 0), and the point farthest from the origin is P2 .
• To show that lim f (x, y) does not exist, prove that it is path dependent.
(x,y)→(x0 ,y0 )
• To prove the differentiability of f (x, y), either satisfy the definition of differentiability or show that
fx and fy are continuous.
• To prove the non-differentiability of f (x, y), show any of the following: (i) fx does not exist (ii) fy
does not exist (iii) f is not continuous.
• .....
34