Unit 1 Multivariate Calculus-I: 1.0 Objectives
Unit 1 Multivariate Calculus-I: 1.0 Objectives
Structure
1.0 Aims and Objectives
1.1 Introduction
1.2 Functions of Several Variables
1.2.1 Functions of Two Independent Variables
1.2.2 Level Curves
1.2.3 General Multivariate Functions
1.3 Partial Derivatives
1.3.1 First- Order Partial Derivatives
1.3.2 Second-Order Partial Derivatives
1.4 Total Differentials and Total Derivatives
1.4.1 Total Differentials
1.4.2 Total Derivatives
1.5 The Chain Rule for Multivariate Functions
1.6 Implicit functions
1.7 Homogeneous and Homothetic functions
1.8 Let Us Sum Up
1.9 Answers to Check Your Progress Exercises
1.0 OBJECTIVES
After reading this unit you will be able to:
• Describe functions where a dependent variable depends on more than
one independent variable;
• Explain the concept of a partial derivative;
• Discuss the techniques of total differentiation, and obtain total
derivatives;
• Explain what homogeneous and homothetic functions are; and
• Explain the chain rule with regard to functions of more than one variable.
1.1 INTRODUCTION
In the course on mathematical methods in economics (BECC 102) that you
studied in the previous semester, you learnt about differentiation. However,
there you studied about differentiation of univariate functions, that is,
functions where the dependent variable is dependent on one independent
variable. This Unit takes up the case of function of more than one
independent variable. It also considers differential calculus pertaining to
multivariate functions. The Unit first discusses the concept of function of
several variables in section 1.2 Section 1.3 deals with partial derivatives, 11
Functions of Several
Variables
which means differentiating the function with respect to one independent
variable, keeping others unchanged. Section 1.4 takes up the case of total
differentiation, which means the change in the dependent variable as a result
of changes in all the independent variables. This section also discusses total
derivatives. The next section, section 1.5 discusses the chain rule that you
studied in the previous unit, but also somewhat different and new. The
subsequent section, section 1.6, discusses the differentiation of functions that
are defined implicitly. Of course, the discussion of implicit function is with
regard to functions, which when expressed explicitly, would be multivariate.
This section also discusses the important implicit function theorem. Finally,
in section 1.7 you are introduced to an important type of multivariate
functions, called homogeneous functions. Other than the definition of a
homogeneous function, some important properties are discussed, including a
theorem called Euler’s theorem. Homothetic functions are defined as well,
and their relationship to homogeneous functions is mentioned.
Let us state explicitly that the present unit discusses the content of all the
topics and concepts only in mathematical terms. That is, the mathematics of
these concepts is discussed in the present unit. The next unit, unit 2 is entirely
devoted to the applications of the ideas in this unit to economics, like the
theory of the consumer, producer’s theory, markets, macroeconomics, and so
on. Just understand the content of this unit well, and you will appreciate
better the matter in the next unit, as well as what you learn in your
microeconomics and macroeconomics courses.
We can define the above function in the following way. A function f of two
real variables x and y with domain D is a rule that assigns a specified number
12 f(x, y) to each point (x,y) in domain D. In the above function, x and y are
Multivariate
called the independent variables, or arguments of the function f. The variable Calculus-I
z is called the dependent variable. The domain of the function f is the set of
all possible ordered pairs of the independent variables, while the range is the
set of corresponding values of the dependent variable. In some contexts, z is
called the endogenous variable while x and y are called exogenous variables.
Apart from simplicity, one reason for beginning with functions of only two
variables is that we are able to draw diagrams. We can depict f ( x, y )
diagrammatically in three dimensional space by drawing three mutually
perpendicular axes Ox, Oy and Oz. Figure 1.1 below shows such a diagram
depicting a surface in three-dimensional space, and a point
z0
y0 y
x0
Now let us suppose that this surface is traced out by the function z = f ( x, y ) ,
and that z is traced out as (x,y) varies over the xy-plane. Then z0 = f ( x0 , y0 ) .
At this point, think back to the unit on coordinate geometry that you studied
in course BECC 102. There you studied that the equation of a line parallel to
the y-axis is x = c , while the equation of a line parallel to the x-axis is y = c
The level curve thus connects points whose functional values are equal. In the
above, the level curve connects all the points for which the value of f is c. it is
the locus of all points, that is, the combination of x and y, for which the value
of z is equal to c.
1.2.3 General Multivariate Functions
In subsection 1.2.1 above, we had discussed about functions of two variables.
We can extend the discussion to functions in which the dependent variable is
a function of several independent variables. Let us denote a list of n variables
by ( x1 , x2 ,...xn ) We say that the variable x is indexed by i, where i =
1,2,…,n. This collection of n-variables (each xi is a real number) is called a
vector. A vector is an ordered n-tuple. This was mentioned in Units 1 and 2
of course BECC 102 that you studied in the last semester. You will study
about vectors in much greater detail in Block 3 of the present course.
Coming back to the vector x = ( x1 , x2 ,...xn ) , suppose a variable z is a function
of all these n variables. We denote this function as z = f (x1 , x2 ,..., xn ) .
∆z
If we take the limit of as Δx1 → 0, the limit we obtain is called the partial
∆x1
derivative of with respect to x1. The term partial derivative is used to
indicate that the other independent variables are held constant. We can derive
similar partial derivatives of the function with respect to each of the other
independent variables. The process of taking partial derivatives is known as
partial differentiation.
15
Functions of Several
Variables
We saw in course BECC 102 that derivatives are denoted by the symbol d.
Thus if we have y = f(x) then the derivative of y with respect to x is denoted
dy
. Partial derivatives are denoted by the symbol ∂. This is a variation of the
dx
lower-case Greek letter ‘delta’ δ. Hence we write the above partial derivative
∂z
as We call this “partial derivative of with respect to x1. For the generic
∂x1
∂z
variable xi, we write the partial derivative of with respect to xi as
∂xi
∂
We sometimes write the partial derivative as z . With such a symbol, the
∂x i
∂
part can be taken as a mathematical operator symbol showing “taking
∂x i
partial derivative (of some function)”. Since z is a function of xi (i =
∂f
1,2,…,n), we can also denote the partial derivative by . We have seen
∂xi
dy
earlier that when we have y = f(x), then is also sometimes denoted f ′( x ) .
dx
In the case of partial derivatives, we sometimes use subscripts to denote the
∂f
partial derivatives. Thus is sometimes denoted f i
∂ xi
If we denote a function as , say z =f(x, y, w, v), then we can denote the partial
derivatives as f x , f y , f w , f v rather than f1 , f 2 , f 3 , f 4
Then we can depict partial derivative of z with respect to, say x3, by
∂z ∂f ∂
or or f or f 3
∂x 3 ∂x 3 ∂x 3
Let us now take a few examples to see how partial derivatives are computed.
For simplicity let us consider functions of two independent variables. To
compute partial derivatives, remember two things: first, when you take partial
derivative with respect to one variable, treat the other variable(s) as constant;
and second, remember that the derivative of a constant equals zero. Also
remember that if you have a variable multiplied by a constant, then the
derivative of this product with respect to the variable is the product of the
constant and the derivative of this variable. For example, (d [cx]/dx =cdx/dx;
and d[cx2]/dx =2cx). Moreover, the usual rules of differentiation, like sum
and difference rules, product and quotient rules, and the composite function
rules hold in the case of partial derivatives also.
Example 1 Let f ( x, y ) = x 3 y + y 4
16
∂f ∂ 3 ∂ 4 Multivariate
= x y +
∂x ∂
( )
∂x
y ( ) Calculus-I
The first term on the right-hand side is 3 x 2 y and the second term is zero
(because y is a constant, and so is y raised to the power 4.
∂f
Thus = 3x 2 y
∂x
Similarly
∂f ∂ 3 ∂ 4
=
∂y ∂y
( )
x y +
∂y
( )
y = x3 y + 4 y3
Similarly for computing the partial derivative with respect to x2, we treat x1
as constant. We obtain
∂z
= f 2 = x1 + 6x 2
∂x 2
We can obtain the partial derivatives by using the product rule. For partial
derivative with respect to w, we hold v constant. We obtain
f w = (w + 4 )(3) + (3w + 2v )(1) = 2(3w + v + 6 )
Example 4 Let us see in this example how the quotient rule is used
(
Suppose z = (3w − 2v ) / w 2 + 3v )
We have
∂z
= fw =
( )
3 w 2 + 3v − 2 w(3w − 2v ) − 3w 2 + 4wv + 9v
=
∂w w 2 + 3v(2
) w 2 + 3v 2( )
∂z
= fv =
( )
− 2 w 2 + 3v − 3(w − 2v ) − w(2w + 9)
=
∂v (
w 2 + 3v
2
) w 2 + 3v
2
( )
17
Functions of Several
Variables
In the above we considered first-order partial derivatives. These are partial
derivatives of the given function with respect to the arguments. A basic point
about partial derivatives may be made. The first-order partial derivatives are
themselves functions of the arguments of the original function, that is the
first-order partial derivatives are themselves the functions of the independent
variables in the original function.
Suppose we have z = f (x1 , x 2 )
∂f
Then f1 = = g ( x1 , x 2 ) and
∂x1
∂f
f2 = = h( x1 , x2 )
∂x 2
We mentioned towards the end of the previous sub-section that the first-order
partial derivatives are themselves functions of x and y, that is,
f x = g ( x, y ) and
f y = h ( x, y ) .
18
Let us differentiate f x = g ( x, y ) Multivariate
Calculus-I
∂f x ∂ ∂z
We get =
∂x ∂x ∂x
∂2z
The right hand side is written
∂x 2
The second order partial derivative of z (or the function f) with respect to x is
thus written:
∂f x ∂ ∂z
f xx or f x . It’s also written or
x ∂x ∂x ∂x
This is the partial derivative of z first with respect to y and then with
respect to x.
� ��
2) ��� = �� ����
This is the partial derivative of z first with respect to x and then with
respect to y.
Young's Theorem
In the case of continuous functions with continuous partial derivatives, fxy
=fyx. This is called Young's Theorem and can be stated as under:
19
Functions of Several
Variables
The order of cross partial derivatives does not matter. That is, it does not
matter whether a function is differentiated first with respect to x and then
again differentiated with respect of y or vice versa. Symbolically:
��� = ���
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
2) Find the partial derivatives fx and fy of the functions
i) f(x, y) = 5x2 + 6y2
ii) f(x, y) = -10xy2
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
20
Multivariate
1.4 TOTAL DIFFERENTIAL AND TOTAL Calculus-I
DERIVATIVES
Let us consider again the function z = f ( x, y )
The partial derivatives gave us a small change in z when there was a small
change in x holding y constant, or a small change in y holding x constant. Let
us now study what happens to z if both x and y were to change. For that we
turn to the study of differentials
1.4.1 Total Differential
Let us begin by considering a function of a single variable, z =f(x). for a
change in x, Δx, the change in z will be Δz. In the course BECC 102, we have
seen dz/dx to be the change in z when there is a small unit change in x. so,
when there is change of Δx in x the change in z will be:
dz
∆z = ∆x
dx
Writing dz for Δz and dx for Δx, we get
dz
dz = dz
dx
This dz on the left-hand side is called the differential of z.
z z
dz fxdx f yd y dx d y approximately
x y
∂z
shows
Let us try to understand it in the following manner. In the above,
∂x
the incremental change in z when x changes by a small amount. So the
∂z
change in z due to change of dx in x is dx
∂x
∂z
Similarly, the change in z due to a change of dy in y is dy . Thus the total
∂y
change in z due to change in both x and y of dx and dy respectively is
z z
dz fxdx f yd y dx dy
x y
21
Functions of Several
Variables
if z x3 y3 , then total differential can be expressed as
2 2
dz f x dx fydy 3x d x 3 y d y
∂z ∂z ∂z
dz = dx1 + dx2 + ... + dxn , or
∂x1 ∂x 2 ∂x n
n
dz = f1dx1 + f 2 dx2 + ... + f n dxn = ∑ f i dxi (recall the rules and properties of
i =1
Rules of differentials
The following rules on total differential will be found useful. Let z and w
represent two functions of x and y , then
1) dk = 0 (constant-function rule)
2) d w z dw dz (Sum-difference rule)
= ( f x dx + f y dy ) ± (g x dx + g y dy )
3) d w z w.d z z dw
w(g x dx + g y dy ) + z ( f x dx + f y dy ) (Product rule)
w z.dw wdz
4) d
z z2
z ( f x dx + f y dy ) − w(g x dx + g y dy )
= (Quotient Rule)
z2
5) ( )
d kz n = knz n−1dz (power-function rule)
22
Multivariate
Example. Calculus-I
If z u n , where u f x, y , then
d n
dz u du nu n 1 du
dx
Let us now solve some problems on total differentials.
1) Find du when u 3x3 2 y2 y3
x2 y2 2
y2
a) u b) w ex c) u log x 2 y2
x2 y2
x2 y2
Answer: a) u , apply quotient rule
x2 y2
z ( f x dx + f y dy ) − w(g x dx + g y dy )
=
z2
(x 2
)( ) (
+ y2 d x2 − y2 − x2 − y2 d x2 + y2 )( )
(x 2
+y 2 2
)
x2 y 2 2 xdx 2 ydy x2 y 2 2 xdx 2 ydy
2
x2 y2
4 xy 2 dx 4 x 2 ydy
2
x2 y2
2
y2
b) w ex
Put u x2 y 2 so that w e u and dw eu du (1)
c) u log x 2 y2
Here we assume that x and y are independent variables, and also independent
of each other.
Consider the case where x and y are not independent variables but are
dependent variables of other functions, for example,
x = g (t ), y = h(t ) , where t is the independent variable. Thus, z is a function of
x and y; x and y are functions of t. We want to investigate as to what would
be the derivative of z with respect to t. What would be dz/dt?
∂z
We know is the change in z due to a small unit change in x holding y
∂x
∂z dx
constant hence will be the amount of change in u due to a small unit
∂x dt
∂z dx
change in t, that is transmitted through x. similarly is the amount of
∂y dt
change in z due to a small unit change in t transmitted through y.
Hence, the total change in z due to a small unit change in t will be the sum of
these two effects. We write
dz ∂z dx ∂z dx
= +
dt ∂x dt ∂y dt
, where x = g ( y )
24
Multivariate
We can combine the two functions f and g into a composite function: Calculus-I
z = f [g ( y ), y ]
In the above we see that z, x, and y are related as follows: the variable y
affects z through two channels. It can affect z directly through the function f,
and indirectly through the function g. thus the variable y is the ultimate
source of variation in z. The indirect effect of y on z can be represented as
∂z dx
. The direct effect is simply fy
∂x dy
dz ∂z dx ∂z
Thus = +
dy ∂x dy ∂y
……………………………………………………………………….
……………………………………………………………………….
……………………………………………………………………….
……………………………………………………………………….
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
25
Functions of Several
Variables 1.5 THE CHAIN RULE FOR MULTIVARIATE
FUNCTIONS
You are aware of the chain rule for function of one variable. It is based on the
function-of-a-function rule. Let us suppose z is a function of y, and y in turn
is a function of x. We can depict this as � = ���(�)�. we know in this case
how to take the derivative of z with respect x: we first take the derivative of z
with respect to y and then multiply this with the derivative of y with respect
to x. We can write this as follows:
�� �� ��
=
�� �� ��
As similar process works in the case of partial derivatives. There is a chain
rule applicable to the case of partial derivatives as well. We only have to take
care in following the rules of partial differentiation. We shall try to make this
clear with the help of some examples.
Just a matter of notation for the examples: unlike earlier, we denote one
argument in the function by z. Earlier, we denoted the dependent variable by
z. Please do not get confused.
Example 1
Suppose we have � = �(�, �, �)and here x, y, z are themselves each a
��
function of some variable, say t. then we can find �� as follows:
�� �� �� �� �� �� ��
= + +
�� �� �� �� �� �� ��
Notice that this is just the total derivative that we used in the previous
section. This gives us occasion to mention that the total derivative is used
when the variables inside the parentheses (the argument of the function) are
themselves functions of only one independent variable. In the example above,
x, y, z are functions of the single variable t. In the next example we consider
a case where variables that are arguments in the given function, are
themselves functions of more than one variable. This is where we use the
chain rule, and have to use partial derivatives for composite function. Just
study the examples below, and it will become clear.
Example 2
If we have � = �(�, �, �) and x, y, z are themselves each a function of
variable r and s then
�� �� �� �� �� �� ��
= + +
�� �� �� �� �� �� ��
and
�� �� �� �� �� �� ��
= + +
�� �� �� �� �� �� ��
26
Multivariate
The next example could also have been depicted using total derivatives as we Calculus-I
did in the previous section.
Example 3
Suppose � = �(�, �, �) and if y and z themselves depend on x, that is,
� = �(�), � = �(�). Then
�� �� �� �� �� ��
= + +
�� �� �� �� �� ��
Example 4
Let � = �(�, �, �) and if x depend on t, y depends on x, and z depends on
y, that is, � = �(�), � = �(�), � = �(�). Then
�� �� �� �� �� �� �� �� �� ��
= + +
�� �� �� �� �� �� �� �� �� ��
Notice that in the above example, z depends on y, which depends on x, which
depends on t. Thus z and y ultimately depend on t.
Example 5
Suppose � = �(�, �, �) where x = x (r,s), y = (r), z = z(y).Then
�� �� �� �� �� �� �� ��
= + +
�� �� �� �� �� �� �� ��
and
�� �� ��
=
�� �� ��
So now you should be able to grasp the technique of the chain rule in the case
of partial derivatives, and should be able to carefully observe which variable
depends on which other(s), and carry out the required differentiation. You
should be able to see from the way the variables are related to each other how
to use the chain rule, and when to take the total derivatives.
Let us take one final case and see how the chain rule and total derivatives
could be combined to obtain the relevant results.
Example 6
Suppose y = f (x1 , x 2 ) where
x1 = g (w1 , w2 ) and
x 2 = h(w1 , w2 )
∂x 2 ∂x
dx 2 = dw1 + 2 dw2
∂w1 ∂w2
Substituting for dx1 and dx2 in the equation for the total differential of y, and
collecting the terms for dw1 and dw2 ,we get
We can substitute this back into the implicit function and write
f ( x1 , x 2 ( x1 )) ≡ k
dx2
The slope of any level curve is the derivative . But this is conceptually
dx1
28 true only if in the implicit function we have defined x2 as a function of x1.
Multivariate
Here we have done so. Calculus-I
dx2
Here our function x 2 = x 2 ( x1 ) is well defined. We can get by
dx1
differentiating the identity f ( x1 , x 2 ( x1 )) ≡ k , with respect to x1, using the
chain rule. We get
∂f dx1 ∂f dx 2 ∂k
+ ≡ ≡0
∂x1 dx1 ∂x 2 dx1 ∂x1
dx 2
,or f 1 + f 2 ≡0
dx1
If we assume x 2 ≠ 0 ,
dx 2 − f 1
≡
dx1 f2
This shows that the slope of a level curve at any point is the ratio of the first-
order partial derivatives evaluated at some particular point on the given level
curve. Let us next consider a general multivariate function.
Given the implicit function f ( x1 , x2 ,..., xn ) = 0 , the partial derivative of the jth
argument of the function with respect to the ith argument of the function xi ,
∂x j
∂xi
is obtained by first finding the total differential f1dx1 + f 2 dx2 + ... + f n dxn = 0 .
dx1 dx dx dx
f1 + f 2 2 + ... + f j j + ... + f i + ... + f n n = 0
dxi dxi dxi dxi
We set all differentials other than dxi and dxj equal to zero. Then
dx j
fj + fi = 0
dxi
dx j fi
⇒ =−
dxi fj
29
Functions of Several
Variables
We can extend this definition to a function of n-variables. Suppose
z = f ( x1 , x2 ,..., x n ) . Then f is said to be homogeneous of degree r if
Given:
x
F ( x, y ) = yf ,we have
y
λx x
=F (=
λ x, λ y ) λ yf=
yf λ F ( x, y )
λ=
λy y
F ( λ x, λ y ) = λ F ( x, y ) , for any λ
1
Put λ = . The we have
y
x 1
F ,1 = F ( x, y )
y y
x
Thus yF ,1 = F ( x, y )
y
x x
But F ,1 = f .
y y
x
Hence, F ( x, y ) = yf
y
f x (λx, λy ) = λr −1 f x ( x, y )
If you notice carefully, the above equation shows that the function f x is
homogeneous of degree r – 1. The above holds for the function fy too, where
fy is the partial derivative with respect to y. the above holds in the case of any
multivariate homogeneous function.
This result says that if a function is homogeneous of degree r, then each of its
partial derivatives is homogeneous of degree r – 1.
Euler’s equation
This is a very important property displayed by homogeneous equations. Let
z f x, y
We can prove this using implicit differentiation and the chain rule:
Let us have the homogeneous function
f (λx, λy ) ≡ λr f (x, y ) (a)
Let us partially differentiate the left-hand side of equation (a) with respect to
λ. We get:
∂f (λx, λy ) ∂λx ∂f (λx, λy ) ∂λy
+
∂λx ∂λ ∂λy ∂λ
Now let us obtain the partial derivative of the right-hand side of equation (a)
with respect to l. We now get:
∂λr ∂f ( x, y )
f ( x, y ) + λr
∂λ ∂λ
rλr −1 f ( x, y ) + 0
= (c)
= rλr −1 f (x, y )
Since for equation (a), left-hand side = right hand side, therefore equation (b)
= equation (c). Hence we have
xfλx + yfλy = rλr −1 f ( x, y )
31
Functions of Several
Variables
Now λ can be any number. Let λ be 1. Then, we get Euler’s theorem:
∂z ∂z
xf x + yf y = rf ( x, y ) , i.e. x + y = rz `
∂x ∂y
The above can be generalised for functions of more than two variables.
Homothetic Functions
To understand homothetic function, recall the concept of function-of-
function, or composite function. Suppose we have y = f(x) and x = g(w), then
we can write y =f[g(w)] = h(w). We can extend the idea of composite
function to the case of multivariate function. This we indeed did in this unit,
specially when looking at partial derivatives of implicit functions, the chain
rule etc. Using this concept of composite functions in the case of multivariate
functions, let us try to explain what homothetic functions are.
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
2) Consider the following implicit functions. Find dy/dz for each of these
functions.
i) F(x,y) = y – 3x4 = 0
ii) F(x,y) = x2 +y2 -19 =0
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
32
Multivariate
3) Determine whether the following functions are homogeneous. If so, of Calculus-I
what degree?
i) f ( x, y ) = xy
ii) f ( x, y ) = x3 − xy + y 3
xy 2
iii) f ( x, y , =
w) + 2 xw
w
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
…………………………………………………………………………….
1) (i) See section 1.2 (ii) see section 1.3 and answer.
34