Professional Documents
Culture Documents
Multivariate Differential Calculus: X X F Y X and X X X Y and X X X X Y X X Y X X X X F Y
Multivariate Differential Calculus: X X F Y X and X X X Y and X X X X Y X X Y X X X X F Y
x1 and x 2,
If a variable y, called the dependent variable, depends on two independent variables, the
dependence is generally expressed in the form of explicit faction as Y f ( x1 , x 2 ) .This form of function
2 x1 x 2
Y 2 x1 x 2 , Y x12 3 x1 x 2 5 x 22 and Y
For instance, the functions x 22 are functions of
Generally, the dependence relationship between the dependent variable Y and n independent variables
x1 , x 2 , . . . x n
is expressed in an explicit form as
Y f ( x1 , x 2 , . . . x n ).
For example,
Y 3x12 x1 x 2 3 x 2 x3 4 x32 is a function with three independent variables
Notice that the value of a multivariate function at a given point in its domain can be obtained just by
substituting the given number for each independent variable in place of the corresponding variable as
given in the right side of the function. Now let us see how we do it in the following examples.
Example1: Find the value of each of the following function at a point (1, 2).
a ) Y 2 x1 x 2
2 x1 x 2
b) Y
x 22
Solutions: a) For Y 2 x1 x 2 , its value at (1,2) can be computed y by substituting 1 in place of x1 and 2
1
2 x1 x 2
Y
b) The value of x 22 at (1,2) can be computed as
2(1) 2 4
f (1,2) 1
22 4
Definition 6.1
Given Y f ( x1 , x 2 ), the rate at which Y changes ( Y ) with respect to change in x1 ( x1 ) as the ,
becomes arbitrarily small (x1 ) , while holding x 2 constant is said to be the partial derivative of f with
y
respect to x1 and it is denoted as x1 with Greek (delta) instead of Roman symbol d. Other common
y f
.
ways of denoting x1 is x1
y f ( x1 x1 , x 2 ) f ( x1 , x 2 )
lim
x1 x1 0 x1
Similarly, the partial derivative of f with respect to x 2 is the rate at which y changes ( y ) with respect
y
to change in x1 (x 2 ) as x 2 becomes arbitrarily small ( x 2 0) and it is denoted as
x 2 .
y f ( x1 , x 2 x 2 ) f ( x1 , x 2 )
lim
x 2 x2 0 x 2
2
The definition of partial derivative for n-variable case can be extended straight forwardly as follows:
Definitions 6.2
Given
Y f ( x1 , x 2 , . . . x n ), the partial derivative of f with respect to xi , is the rate of change in
Y (y ) as a result of change in xi (xi ) when xi becomes arbitrarily small ( xi 0), while keeping
y f
or or f i
all other variables constant and if is denoted as x i x i .
Mathematically, it is given by
y f ( x1 , x 2 , . . . , xi xi , . . . x n ) f ( x1 , x 2 , . . x n )
lim
xi xi 0 xi
Notes
Y f ( x1 , x2 , . . . x n ),
1. The implicit assumption of computing partial derivatives for is that the
x1 , x 2 , . . . x n
variables are all independent of one another which mean that each of the independent
variable can change by itself without affecting the other.
2. For a function having n-independent variables, we have n different partial derivatives corresponding
to each independent variable.
Notice that the process of taking or finding the partial derivative of a function is called partial
differentiation. The rules or the techniques of differentiation in univariate function can be directly
applied to get the partial derivatives but the only difference in the latter case is that we treat all the
independent variable constant except the one which assumed to vary or change.
3
Now, let us illustrate how we apply different rules of univariate differentiation to find partial derivatives
with the help of the following examples.
Example :
Solutions
2
a) Since Y f ( x1 , x 2 ) 4 x1 3 x 2 have two independent variables x1 and x 2 , we are required to find
y
x1 or
x1
two partial derivatives: the partial derivative with respect to and the partial derivative with
y
x 2 or .
respect to x 2
y
,
x
To get 1 we differentiate the function with respect to x1 by treating x 2 as constant.
2
The first term (4 x1 ) contains only x1 , so its derivative using the power rule is 8x1 . The second term
(3 x 2 ) is a constant since x 2 is treated as a constant, so its derivative is zero.
Then, putting the derivatives of the two terms together using the sum rule, we have,
y
f1 8 x1 0 8 x1
x1
y
,
Similarly, to get
x 2 we differentiate the function with respect to x 2 by treating x1 as constant.
2
Now the derivative of the first term (4 x1 ) is zero since x1 is treated as constant and the derivative of
the second term (3 x 2 ) with respect to x 2 is 3. Thus, putting these derivatives together, we have
y
f2 033
x 2
4
3
b) In Y f ( x1 , x 2 ) 3 x1 x 2 , there is only one term which contains the multiplicative form of
x1 and x 2 .
y
, 3
Now, to get x1 we consider the function as x1 times a constant (3 x 2 ) since x 2 is treated as a
3 2
constant. Then we differentiate x1 with respect to x1 to get 3 x1 . Finally, through applying the constant
y
2
multiple rules, we multiply 3x with
1 a constant (3x 2 ) to get x1 as
y
f1 (3 x12 ) (3 x 2 ) 9 x12 x 2 .
x1
y
, 2
we first consider the function as a constant (3 x1 ) times x 2 .Then we differentiate
Similarly, to get
x 2
x 2 with respect to x 2 which is equals to one. Finally, we multiply the constant (3 x12 ) with one to have
y y
f (3 x13 ) (1) 3 x13 .
x 2 as 2 x
2 2
c) In Y f ( x1 , x 2 ) x1 4 x1 x 2 x 2 , there are three terms so that we can find each partial
y
,
To get
x1 first we differentiate each term with respect to x1 while treating x 2 as a constant as
follows;
2
The derivative of the first term ( x1 ) with respect to x1 is 2 x1 . The derivative of the second term
( 4 x1 x 2 ) with respect to x1 is 4x 2 , which is the product of the derivative of x1 with respect to x1 and
2
the constant (4 x 2 ). The derivative of the third term ( x1 ) w.r.t x1 is zero since it is treated as a constant.
Then, putting the derivatives of each term together, we have
y
f1 ( x12 4 x1 x 2 x 22 ) 2 x1 4 x 2 0 2 x1 4 x 2.
x1 x1
5
y
x
Similarly, to get 2 , first we differentiate each term with respect to x 2 by treating x1 as constant so
that the derivative of the first term, the second term and the third term will be zero, 4 x1 and 2 x 2
y
respectively. Then, we collect these derivatives together to have
x 2 as
y
f2 ( x12 4 x1 x 2 x 22 ) 0 4 x1 2 x 2 4 x1 2 x 2 .
x 2 x 2
Notice that the functions given in the above example are all polynomial functions and to get the partial
derivatives of such type of functions we can use the following short-cut method.
y
x
For a function y f ( x1 , x 2 ), we can compute 1 as follows;
i) If the term contains only x1 , then apply the necessary rule to differentiate it with respect to x1 .
ii) If the term contains the multiplicative form of x1 and x 2 , then differentiate the expression in
x1 with respect to x1 and put the result together with the expression in x 2 and the number as it is given
iii) If the term contains an expression only in x 2 , then take its derivative as zero.
y
Then put the resulted derivative of each term together to get the final answer for x1 .
y
Similarly, we can compute x 2 in the same way just by substituting x 2 in place of x1 in all of the above
three cases.
Now, let us apply the above short-cut method to compute the partial derivatives of
Y x12 4 x13 x 2 x 22 .
y
,
To get
x1 first we compute the derivative of each term as follows;
6
2 3
The derivative of x1 is 2x1 since it contains only x1 . The derivative of 4 x1 x 2 is the derivative of
x13 which equal to 3 x12 times 4x 2 or (3 x12 )4 x 2 12 x12 x 2 .The derivative of x 22 is zero since it contains
y
only x 2 and treated as a constant. Then we collect the derivatives together to have
x1 as
y
f1 2 x1 12 x12 x 2 0 2 x1 12 x12 x 2 .
x1
y
,
Similarly, to get x 2 first do the followings;
2
i) Take the derivative of x1 as zero since it contains only x1 and we are differentiating the function with
respect to x 2
3 3 3 3
ii) Take the derivative of 4 x1 x 2 as 4 x1 times the derivative of x 2 which equal to (4 x1 ) (1) 4 x1 , and
2
ii) Take the derivative of x 2 as 2x 2 since it contains only x 2 .
y y
f2 0 4 x13 2 x 2 4 x13 2 x 2 .
Then, collect the derivatives together to get x 2 as x 2
note also that the functions may not always be in polynomial form as we have seen in the above
examples, they may be also in some other forms like in rational form, in radical form and so on, In fact,
finding the partial derivatives of such kind of functions is not a difficult task. The only thing we do that,
we first apply the necessary rules of differentiation like quotient rules, chain rule an so on and then we
use the above short-cut method as it is illustrated in the following example.
Example;
x12
a) y
x1 x 2 b)
Y 2 x1 3 x1 x 22
Solutions
7
x12
a) y
x1 x 2 is a rational form. So to get each of the partial derivatives, we use the quotient rule and
y
,
To compute
x1 first apply the quotient rule while differentiating w.r.t x1 to get
( x12 )( x1 x 2 ) ( x1 x 2 )( x12 )
y x x1
1
x1 ( x1 x 2 ) 2
( x12 ) 2 x1 ,
Then, use the short-cut method to have
x1 since it contains only x1 and we are taking
( x1 x 2 ) 1 0 1
the derivative with respect to x1 , and
x1 since the derivative of
( x12 )
x1 with respect to x1 is one and x 2 is treated as a constant. Finally, insert 2x1 in place of x1 and
y
( x1 x 2 ) ,
x
1 in place of 1
x
on the right side of the above expression for 1 to get
y 2 x1 ( x1 x 2 ) 1( x12 )
f1
x1 ( x1 x 2 ) 2
2 x12 2 x1 x 2 x12
( x1 x 2 ) 2
x12 2 x1 x 2
( x1 x 2 ) 2
y
,
Similarly, to compute x 2 first we apply the quotient rule but now differentiating with respect to x 2 to
get
( x12 )( x1 x 2 ) ( x1 x 2 )( x12 )
y x x 2
2
x 2 ( x1 x 2 ) 2
8
( x12 ) 0
Then, using the short cut Method, we have
x 2 since it contains only x1 and it is treated as a
( x1 x 2 ) 0 1 1,
constant and
x 2 since x1 is treated as a constant and the derivative of x 2
w.r.t x 2 is one. Finally, insert 0 and 1 in their respective place on the above expression to get.
y 0( x12 ) 1( x12 )
f2
x 2 ( x1 x 2 ) 2
x12
( x1 x 2 ) 2
b) Since
Y 2 x1 3x1 x 22 is in radical form, the corresponding rules of differentiation is the chain
rule. So to get the partial derivatives, we use the chain rule and the short-cut method as follows;
y
,
x
To compute 1 first we apply the chain rule while differentiating with respect to x1 to get
y 1
(2 x1 3 x1 x 22 ) 1 / 2 (2 x1 3 x1 x 22 )
x1 2 x1
1
. ( 2 x1 3 x1 x 22 )
2 2 x1 3x x 2
1 2
x1
(2 x1 3 x1 x 22 )
x
Then we compute 1 using the short cut method as
(2 x1 3 x1 x 22 ) (2 x1 ) (3 x1 x 22 ) 2 3 x 22 2
. Finally by inserting 2 3x 2 in place of
x1 x1 x1
( 2 x1 3 x1 x 22 )
x1 , we have
y 1 2 3 x 22
f1 .(2 3 x 22 )
x1 2 2 x1 3 x1 x 22 2 2 x1 3 x1 x 22
9
y
,
In the same way, to get
x 2 we apply the chain rule while differentiating with respect to x 2 so that
we have
y 1
(2 x1 3 x1 x 22 ) 1 / 2 (2 x1 3x1 x 22 )
x 2 2 x 2
1
. (2 x1 3 x1 x 22 )
2 2 x1 3 x1 x 22 x 2
(2 x1 3x1 x 22 ) (2 x1 ) (3 x1 x 22 ) 0 (3 x1 )( 2 x 2 ) 6 x1 x 2.
x 2 x 2 x 2 Finally, we substitute 6 x1 x 2 in
( 2 x1 3 x1 x 22 ),
place of
x 2 to get
y 1 6 x1 x 2
f2 .(6 x1 x2 )
x 2 2 2 x1 3x1 x 2
2
2 2 x1 3x1 x 22
All the techniques that we have been used to get the partial derivatives of y f ( x1 , x 2 ) can also be
used to get the partial derivatives of a function with more than two independent variables as illustrated
in the following example.
Solutions
a) Since
Y f ( x1 , x 2 , x3 ) 3 x12 x 2 x32 has three independent variables, it has three partial derivatives,
y y y
or f 1 , or f 2 and or f 3 .
namely,
x1 x 2 x 3
Again, since the function has only one term in multiplicative form, we compute each partial derivative by
differentiating the term with only one variable while treating the other two variables as constant as
follows:
10
y
f1 (3 x12 x 2 x32 ) ( x12 ).(3 x 2 x32 ) (2 x1 )(3x 2 x32 ) 6 x1 x 2 x32 ,
x1 x1 x1
y
f2 (3 x12 x 2 x32 ) ( x 2 )(3 x12 x32 ) (1)(3 x12 x32 3x12 x32 , and
x 2 x 2 x 2
y
f3 (3 x12 x 2 x32 ) ( x32 )(3 x12 x 2 ) (2 x3 )(3 x12 x 2 ) 6 x12 x 2 x3 .
x3 x3 x3
b) The function has five terms which separated by plus and minus sign. So, to get each of the partial
derivatives we differentiate each term partially with respect to the corresponding variables as follows;
y
f1 (2 x12 ) ( x1 x 2 ) (3 x 22 ) (4 x 2 x32 ) (5 x32 )
x1 x1 x1 x1 x1 x1
(2 x12 ) 4 x1 ( x1 x 2 ) ( x1 ). x 2 1.x 2 x 2
Moreover, x 1 x
since it contains only ;
1 x 1 x 1 since x 2 is
(3 x 22 ) 0,
treated as a constant; x1 since there is no x1 and x 2 is treated as a constant;
( 4 x 2 x32 ) 0, (5 x32 ) 0,
x1 x
since there is no x1 and x 2 and 3 are treated as a constant and x1 since
there is no
x1 and x3 is treated as a constant.
y
f1 4 x1 x 2 0 0 0 4 x1 x 2
x1
y
or f 2
Similarly, we get x 2 as
y
f2 (2 x12 ) ( x1 x 2 ) (3 x 22 ) (4 x 2 x32 ) (5 x32 )
x 2 x 2 x 2 x 2 x 2 x 2
0 x1 6 x 2 4 x32 0
x1 6 x 2 4 x32 .
y
or f 3
and x3 as
11
y
f3 (2 x12 ) ( x1 x 2 ) (3 x 22 ) (4 x 2 x32 ) (5 x32 )
x3 x3 x3 x3 x3 x3
0 0 0 ( x32 )( 4 x 2 ) 10 x3
x3
(2 x3 )( 4 x 2 ) 10 x3
8 x 2 x3 10 x3
y y
or f 1 and or f 2 .
x1 x 2 Moreover, each of these partial derivatives are a gain a function of x1 and x 2 .
So, we can continue differentiating each of them partially with respect to x1 and x 2 . In doing so, we can
get the following four possible second – order partial derivatives.
y
x
1. When 1 is differentiated with respect to x, while holding x 2 as a constant, we have
y z y
or f 11
x1 x1 x 2
and denoted as 1
y
x
2. When 1 is differentiated with respect to x 2 keeping x1 as a constant, we have
y 2 y
or f 12 ;
x 2 x1 and denoted as x 2 x1
y
x
3. When 2 is differentiated partially with respect to x1 we have,
y 2 y
or f 21
x1 x 2 and denoted as x1x 2 and
y
x
4. When 2 is differentiated with respect to x 2 , holding x1 as a constant, we have
12
y 2 y
or f 22
x 2 x 2
2
and denoted as x 2
i. To make the notations simple and clear, we will often use f 1 and f 2 to denote the first-
order partial derivatives with respect to x1 and x 2 respectively and f 11 , f 12 , f 22 and f 22 to denote the
second-order partial derivatives.
ii. f 11 and f 22 are called pure or own partial derivatives. f11 indicates that the given function
iii. f12 and f 21 are called cross or mixed partial derivatives f 12 indicates that y f ( x1 , x 2 ) is first
Now, before defining the second-order partial derivatives for n-variable case, let us see how we
a ) First , compute f 1 as
y
f1 (3 x1 2 x 23 x13 x 22 ) 3 0 3x12 x 22 3 3 x12 x 22
x1 x1
f 11 ( f1 ) (3 3 x12 x 22 ) (3) (3 x12 x 22 ) 0 6 x1 x 22 6 x1 x 22
x1 x 2 x1 x1
y
f 12 ( f1 ) (3 3 x12 x 22 ) (3) (3 x12 x 22 0 6 x12 x 2 6 x12 x 2 .
x 2 x 2 x 2 x 2
Second, compute f 2 as
13
y
f2 (3 x1 2 x 23 x13 x 22 ) 6 x 22 2 x13 x 2
x 2 x 2
f 21 ( f2 ) (6 x 22 2 x13 x 2 ) (6 x 22 ) (2 x13 x 2 ) 0 6 x12 x 2 6 x12 x 2
x1 x1 x1 x1 and also
f 22 ( f2 ) (6 x 22 2 x13 x 2 ) (6 x 22 ) (2 x13 x 2 ) 12 x 2 2 x13
x 2 x 2 x 2 x 2
f11 6 x1 x 22 f 21 6 x12 x 2
f12 6 x12 x 2 f 22 12 x 2 2 x13
Notice that the cross partial derivatives are equal. That is,
f 12 f 21 6 x12 x 2.
y
f1 ( x13 e 2 x 2 ) ( x13 )e 2 x2 3 x12 e 2 x2 , and
x1 x1 x1
y
f2 ( x13 e 2 x2 ) x13 (e 2 x2 ) x13 2e 2 x2 2 x13 e 2 x2
x 2 x 2 x 2
f 11 ( f1 ) (3 x12 e 2 x2 ) (3 x12 )e 2 x2 6 x1e 2 x2 ;
x1 x1 x1
f 12 ( f1 ) (3 x12 e 2 x2 ) (e 2 x2 )3 x12 2e 2 x2 (3 x12 ) 6 x12 e 2 x2 ;
x 2 x 2 x 2
f 21 ( f2 ) (2 x13 e 2 x2 ) (2 x13 )e 2 x2 6 x12 (e 2 x2 ) 6 x12 e 2 x2 ; and
x1 x1 x1
f 22 ( f2 ) (2 x13 e 2 x2 ) (e 2 x2 )2 x13 2e 2 x2 (2 x13 ) 4 x13 e 2 x2
x 2 x 2 x 2
14
f 11 6 x1e 2 x2 f 21 6 x12 e 2 x2
f 12 6 x12 e 2 x2 f 22 4 x13 e 2 x2
Again notice that the cross-partial derivatives are equal. That is,
f12 f 21 6 x12 e 2 x2
Just as we have seen for Y f ( x1 , x 2 ), the second-order partial derivatives for n-variable function can
be defined analogously as follows:
y f ( x1 , x 2 , . . . x n ), 2
Definition : The function has n first-order partial derivatives and n second-order
partial derivatives each denoted as
2 y
or f ij ( fi )
x j xi x j
f ij xi
Where is the second-order partial obtained first differentiating the function with respect to and
fi xj
then differentiating the result with respect to for i, j 1, 2, . . . n.
fi i fjj i j
The partial derivatives in form of or where are called pure or own partial derivatives
fi j i j
and those in a form of where are called cross or mixed partial derivatives.
Now let us see one very useful theorem regarding the cross partial derivatives.
Remember that in example 2.6, the cross-partial derivatives were equal (i.e, f12 f 21 ) for both
functions. This fact is also true for any multivariate function with continuous second derivatives due to
the following theorem known as Young’s Theorem.
y f ( x1 , x2 ,....xn ),
Young’s Theorem: For a function with continuous first and second-order partial
derivatives, the cross-partial derivatives are equal. That is,
fi j f ji i j.
for any pair i, j (i, j 1, 2, ...n), where
Notice that, unless otherwise it is stated, Young’s Theorem is always true as all multivariate functions in
this module are assumed to have continuous first and second order partial derivatives.
15
The theorem has various useful results as we will see in the latter parts of the module. Now let us see
one of its useful results with the help of the following example.
Solution: Since the function has three independent variables, it has 3 first order partials derivatives
f1 , f 2 and
f3 and each of them has three second-order partial derivatives. So, for this function we
2
have a group of 3 9 second-order partial derivatives which are given as follows:
f2 f 21 f 22 f 23
f3 f 31 f 32 f 33
f 12, f 21 , f 13 , f 31 , f 23 and f 32
Out of the nine second-order partials, are cross partial derivatives and
from Young’s Theorem the following are identical:
f 12 f 21 , f13 f 31 and f 23 f 32
Thus, we need only six distinct values of second-order partial derivatives: three of which are pure partial
f ,f
derivative (That is, 11 22
f
and 33 ) and three are cross partial derivatives (That is,
f12 , f13 and
f 23 ).
Moreover, the required six distinct second order partial derivatives can be obtained as follows:-
f 11 ( f1 ) 4 x1 3x 2 5 x3 4
x1 x1
f 12 ( f1 ) 4 x1 3x2 5 x3 3
x 2 x 2
f 13 ( f1 ) 4 x1 3x 2 5 x3 5
x3 x3
16
f 22 ( f2 ) 3x1 2 x2 2
x 2 x 2
f 23 ( f2 ) 3x1 2 x 2 0
x3 x3 and
f 33 ( f3 ) 5 x1 8 x3 8
x3 x3
y
Given y f ( x1 , x 2 ), we learned that x1 or f 1 gives the change in y when there is a small change in
x1 (x1 0) holding x 2 as constant. Now if we suppose that x1 changes by some amount, x1 , while
y
.x1
x 2 holding as constant, the change in y will be x1 .
y
or f 2
Similarly, since x 2 represents the change in y for a small change in x 2 (x 2 0), the change in
y
.x 2 .
y as a result of change in x 2 by some amount, x 2 will be x 2
Therefore, the total change in y (say , y ) due to small change in x1 and x 2 will be:
y y
y x1 x 2
x1 x 2
Notice that, when the change in x1 and x 2 becomes infinitesimal (That is, as x1 0 and x 2 0) ,
x1 becomes dx1 and x 2 becomes dx 2. Moreover, as x1 0 and x 2 0, the corresponding
So, using these notational relationships and from the above formula for y, the total change in y (dy )
as a result of an infinitesimal change in both x1 and x 2 (dx1 and dx 2 ) will be equal to:
y y
dy dx1 dx 2 or dy f 1 dx1 f 2 dx 2.
x1 x 2
17
In this formula,
y
. dx1
x1 is called the partial differential of y f ( x1 , x 2 ) w.r.t x1
y
. dx 2
x 2 is called the partial differential of y f ( x1 , x 2 ) w.r.t x 2 , and
y y
. dy or dx1 dx 2
x1 x 2 is called the total differential of y f ( x1 , x 2 ).
The definition and interpretation of differentials can be extended in an analogous manner to function of
several variables as follows:
dy f 1 dx1 f 2 dx 2 . . . f n dx n
x , x , . . .x n
And it measures the total change in y when all variables 1 2 are changes simultaneously.
Remarks
1. In the process of finding total differentials, we assumed that all independent variables are mutually
exclusive (That is, each independent variable can change with out affecting or changing the others).
2. There are two techniques or methods of finding the total differential of a function. These are:
In this method, first we compute partial derivatives of the function and then we use the formula given
In this method, we consider each independent variable as a function and we apply the appropriate rules
of differentiation for univariate function like power rule, sum or difference rule, product rule, quotient
rule, chain rule and so on based on the way the variable or the combination of variables are presented in
a given function. In other words, in this method, we use the following rules of total differential.
18
Rules of Total differential
dk 0
According to this rule, the total differential of a constant is zero. For instance,
d (3) 0 d (500) 0, and dA 0 if A is considered as a constant.
Rule 2: Power Rule
d cu n cnu n 1 du , where c is a constant.
Notice that we can apply this rule for x1 and x 2 if they are presented in power form. For example,
d x13 3x12 dx1 , d 4 x 25 4 5 x 24 dx 2 20 x 24 dx 2 , d x1 1dx1 dx1 and d x 2 dx 2 .
d u v du dv
For instance, d u 5v
2 3
d u d 5v 2udu 15v dv
2 3 2
x x
In terms of 1 and 2 , this rule for example, can be applied as
d 3 x12 4 x 23 d 3 x13 d 4 x 23 6 x1 dx1 12 x 22 dx 2 and d 5 x13 6 x 2 15 x12 dx1 6dx 2 .
For instance, d u v 3 2
d u v d v u 3u vdu 2uvdv .
3 2 3
Note that this rule will help us to find the total differential of a term which contains
x1 and x2 in their
multiplicative form.
19
2
For example, d 3x1 x 2 d x1 3 x 2 d x 2 3 x1
2 2
2
= 6 x1 x 2 dx1 3 x1 dx 2
u vdu udv
d ,
v v2 where v 0
u 3 d (u 3 )v 2 d (v 2 )u 3 3u 2 v 2 du 2u 3 vdv
d 2
For instance, v v 2 2
v4
Note also that we will use this rule when the function
y f x1 , x2 is given in quotient form or to get
x
the total differential of a term which contains 1 and
x2 in quotient form. For example,
d
3 x12 d 3 x12 x 2 d x 2 3x12 6 x1 x 2 dx1 3 x12 dx 2
x2 x 22 x 22
Again notice that the exponential, the logarithmic and the chain rules will also applied to get the total
differential of a function
y f ( x1 , x2 ) or any of its terms if its form entails the necessity of these
rules.
1 1 1 3
d e x1 e x1 dx1 , d nx 2
x2
dx 2 d nx13 3 d x13 3 3x12 dx1 dx1
x1 x1 x1
For example, and .
Remark
All the above rules of total differential can also applied in analogous manner to find total differential of a
function with more than two independent variables.
Now, let use see how we use the above two methods to find the total differential of multivariate
function with the help of the following example:
20
Example: Find the total differential of each of the following functions using the partial derivative method
and rules of total differential.
2 2
a) y 3 x1 x1 x 2 2 x 2
3
b) y n x1 3x 2 2
c)
y x12 x 2 x32 .
f1 6 x1 2 x1 x 2 and f 2 x12 6x 22
dy f1 dx1 f 2 dx2
= 6 x1 2 x1 x 2 dx1 x1 6 x 2 dx 2 .
2 2
The same result can be obtained using rules of total differential as follows:
dy d 3 x12 x12 x 2 2 x 23
2 2
= d 3 x1 d x1 x 2 d 2 x 2 3
= 6 x1 dx1 d x1 x 2 d x 2 x1 6 x 2 dx 2
2 2 2
2 2
= 6 x1 dx1 2 x1 x 2 dx1 x1 dx 2 6 x 2 dx 2
= 6 x1 2 x1 x 2 dx1 x1 6 x 2 dx 2 .
2 2
b) To apply the partial derivative method, first compute
f1 and f 2 as
y 1 2 1 2x
f1
x1
n x12 3 x 2 2
x1 3 x 2 x1
x1 3 x 2 2
x1 3 x 2
(2 x1 ) 2 1
x1 3x 2 , and
y 1 1 3
f2
x 2
nx1 3 x 2 2 x12 3x 2 2 (3) 2
x1 3 x 2 x 2 x1 3 x 2 x1 3 x 2 .
dy f1 dx1 f 2 dx2
21
2 x1 3
2
dx1 2 dx 2
= x 3x2
1 x1 3 x 2 .
dy d n x12 3x 2
1
2
d x12 3x 2
= x 3x2
1
1
2
d x12 d 3 x 2
= x 3x 2
1
1
2
2 x1 dx1 3dx 2
= x 3 x
1 2
2 x1 3
2
dx1 2 dx 2
= x 3x 2
1 x1 3 x 2 .
2 2
f1
x1
x1 x 2 x32 2 x1 x 2 x32 , f 2
x 2
x12 x 2 x32 x12 x32 , f3
x3
x1 x 2 x32 2 x12 x 2 x3
and
dy f1 dx1 f 2 dx 2 f 3 dx3
=
2 x x x dx x x dx 2 x
1 2
2
3 1
2
1
2
3 2
2
1 x 2 x3 dx3 .
We can also get the same result using rules of total differential as
dy d x12 x 2 x32
=
d x12 x 2 x32 d x 2 x12 x32 d x32 x12 x 2
=
2 x1 x 2 x32 dx1 x12 x32 dx 2 2 x12 x 2 x3 dx3 .
Remark
22
As we have seen in the above example, both methods give the same result for all functions. More over,
partial derivative method may be easier then using the rules of total differential in some function and
the latter may be easier in other types of functions. But since economic text books may use either or
both of the two methods to explain various concepts and to solve different problems, it will be worth
full to know the two methods.
There are two methods of finding the (first) derivatives of implicit functions. These are:
In the first method, we use a formula in terms of partial derivative which is stated as a rule for univariate
and multivariate cases as follows:
F ( x, y ) 0 and Fx and Fy Fy 0,
If both exist and then the explicit function y f (x) exists and the
dy
derivative dx is given by
dy Fx
dx Fy
F F
Fx and Fy
Where, x y .
F ( x1 , x 2 , . . . x n , y ) 0 Fi Fy 0,
If has continuous first order partial derivatives and then the explicit
y
y f ( x1 , x 2 , . . . x n ) xi is given by
function exists and the partial derivative
y Fi
xi Fy
F F
Fi and Fy
Where
xi y
23
In the implicit differentiation method, we apply the technique of total differential as follows:
In this case, first we take the total differential of both sides of the equation and then rearrange the
dy
.
terms to get dx
In this case, first we computer the total differential of both sides of the equation and then we rearrange
y
the terms to get
xi considering all other variable x j xi as constant (or making all dx j 0 for
x j xi
).
Now, let us see how we apply the above two methods to find derivatives of implicit functions with the
help of the following examples:
dy
Example: Find dx for 2 y xy 4 x 0 using both methods
F F
First let F ( x, y ) 2 y xy 4 x, and compute x and y as
F F
Fx y4 Fy 2 x
x and y
dy
Then use the formula given in rule 1 to get dx as
dy Fx ( y 4) 4 y
dx Fy 2 x 2 x
To apply the implicit differentiation method, take the total differential of both sides of the equation and
dy
rearrange the terms to get dx as follows
24
d 2 y xy 4 x d (0)
d (2 y ) d ( xy ) d (4 x) 0
( 2 x)dy ( y 4) dx 0
(2 x) dy ( y 4)dx
dy ( y 4) 4 y
dx 2 x 2 x
2
Example: Find the first order partial derivatives for 3 x1 x2 x2 y 10 0
Solution: Since the function is an implicit function with two independent variables x1 and x 2 , the
y y
x x
required derivatives are the partial derivatives 1 and 2 .
Now, to apply the partial derivative method, we first let F x1 , x2 , y 3 x1 x2 x2 y 10 , and compute
2
F1 , F2 and Fy as
F F F
F1 3 x 2 , F2 3 x1 y 2 Fy 2 x2 y
x1 x 2 and y .
y y
x x
And then, we use the formula give in rule 2 to get 1 and 2 as
y F1 3x 2 3 y F2 (3 x1 y 2 )
x1 Fy 2 x2 y 2 y x 2 Fy 2 x2 y
and .
y y
x x
The same results for both 1 and 2 is obtained using the implicit differentiation method as follows:
d 3 x1 x2 x2 y 2 10 d (0)
25
d 3x1 x 2 d x2 y 2 d (10) 0
y
,
Then to get
x1 in the last equation make dx2 0 since x 2 is considered as a constant and change d
2 x2 yy 3x2 y
y 3 x2 3
x1 2 x2 y 2 y
y
,
Similarly, to get
x2 set dx1 0 and change d by in the last equation obtained from taking total
3 x1 y 2 x2 2 x2 yy 0
2 x1 yy 3 x2 y 2 x2
y 3 x1 y 2
x2 2 x2 y
A. Marginal Analysis
Marginal analysis is central to microeconomics. In economics the term marginal refers to the change in
dependent variable caused by change in one independent variable while others are kept as a constant.
26
y
y f ( x1 , x 2 ........ x n ) xi x
In multivariate economic function, , shows the marginal effect of i on y. In
y
x x
other words, i measures the change y due to a one unit change in i .
Now, let us see how the marginal effect and other important concepts can be described using the partial
derivatives with the example of theoretical firm’s production function.
The production of certain commodities requires the use of two or more inputs or factors of production.
The amount of production of any commodity, therefore, uniquely depends on the amounts of different
inputs used under certain technical conditions.
Let Q f ( L, K ) be a production function, which relates the output Q to amount of labor input L and
capital input K. Then, the first and second partial derivatives of this function can have the following
meanings or interpretations.
Q
i) L is the marginal product of labor input L or MPL and it measures the change in output Q due to
a one unit increase in labor input L with fixed capital input K, and
Q
K is the marginal product of capital input K or MPK and it measures the change in output Q due to
a one unit increase in capital input K with fixed labor input L.
ii) If the first- order partial derivatives are positive and both of its pure second- order partial derivates
negative, then we can say that both inputs L and K satisfy the law of diminishing marginal productivity or
LDMP and the production is in its efficient or economic production range.
Mathematically,
2 Q ( Mp L ) 2Q ( Mp K )
f LL 0 f 0
If Mp L 0 and and Mp K 0 and
KK
L2 L K 2 K then it
possible to say that both inputs satisfy LDMP and the production is in its economic or efficient
production stage (That is, the production is inside the ridge lines)
27
iii) From the signs of mixed second-order partials derivatives we can determine the nature of inputs
(That is, whether the two inputs are complementary or substitutes).
If the signs of mixed second-order partials derivatives are both positive, then inputs are complementary.
Mathematically,
2Q ( Mp L ) 2Q ( Mp K )
f LK 0 f KL 0
If LK K and KL L , then L and K are complimentary.
If the signs of mixed second-order partials derivatives are both negative, then inputs are competitive or
substitutes.
Mathematically,
2Q ( Mp L ) 2Q ( Mp K )
f LK 0 f KL 0
If LK K and KL L , then L and K are substitutes.
Now let us use the partial derivatives to describe important natures of specific types of production
functions with the help of the following examples.
Example
1/ 2
Given the production function y 10 L K 1/ 2 , find the first and second order partial derivatives and
y 1 5 K 1/ 2
fL (10) L1/ 2 1 K 1/ 2 1/ 2 0
L 2 L since both L and K are positive, and
y 1 1/ 2 1 / 2 1 5 L1 / 2
fK (10) L K 1/ 2 0
K 2 K .
( f L ) 5 K 1 / 2 5 K 1/ 2
f LL 0
L L L1/ 2 2 L3 / 2
,and
28
( f K ) 5 L1/ 2 5 L1/ 2
f KK 0
K K K 1/ 2 2 K 3 / 2
Now since the first order partial for both inputs are positive and the pure second order partial are both
negative, we can say that both inputs L and K satisfy LDMP.
( f L ) 5 K 1/ 2 5
f LK 1/ 2 1/ 2 1/ 2 0
K K L 2 L K
. Notice here that it is enough to get the sign of one
mixed second order partial due to Young’s Theorem.
Thus, from the sign of f LK we can conclude that L and K are complementary.
Find and interpret (economically) the first and second order partial derivatives for the Cobb Douglas
production function with two in puts L and K:
Solution:
A K
f L MPL AL 1 K 0
L1 since 1 0 as 0 1
1 AL
f K Mp K AL K 1 0
K since 1 0 as 0 1 .
Moreover,
( f L ) ( 1)AK
f LL (AL 1 K ) ( 1) AL 2 K 0
L L L2 since 0 1 and
1 0 , and
29
( f K ) ( 1) AL
f KK ( AL K 1 ( 1) AL K 2 0
K K K 2 since 1 0 and
2 0.
( f L ) A
f LK (AL 1 K ) AL 1 K 1 1 1 0
Finally, since K K L as 1 α 0 and 1 β 0
, both L and K are complementary.
Notice that the partial derivatives have the similar interpretations in utility function as they have in
production function.
For example, in the utility function U f (q1 , q 2 ) , the first and second order partial derivatives have
the following interpretations:
U U
U1 MU1 U2 MU 2
i) q1 and q 2 , and
U U
U1 MU1 U1 MU 1 U11 U1
ii) if q1 and q1 are both positives and both q 1 and
U 22 U 2
q2 are negatives, then both good q1 and q 2 satisfy Law of Diminishing Marginal Utility
(LDMU).
How the implicit differentiation or total differential can help us in determining the slope of level sets or
level curves that are used in economic theory.
To do this, let us first see the definition and the method of finding the slope of the level set for a general
function y f ( x1 , x2 ) and we will apply the concept for different level sets that are used in economic
theory.
For a function y f(x 1 , x 2 ), the set of (x 1 , x 2 ) pairs that will generate some specific level or value for y,
y , is called a level set.
30
Mathematically, f ( x1 , x2 ) y , where y is some fixed constant is said to be a level set.
The graph of the equation of the level set (i.e, f ( x1 , x2 ) y ) is said to be a level curve.
Now, our basic concern is how we can determine the slope of such level set or or its corresponding level
curve?
The convenient way to do this is rewriting the equation f ( x1 , x2 ) y in the form that x 2 is expressed
dx 2
dx
explicitly in terms of x1 and y and determine its slope by 1 .
level sets can be described by the equation 2 x1 3 x2 y where y is a constant. Moreover, to get the
slope of this level set or level curve, first we rewrite the equation of the level set as
3x2 y 2 x1
y 2
x2 x1
3 3
dx 2 2
dx
And then we compute the slope as 1
3 .
But this convenient method does not always works, particularly, when it is difficult to express x 2
explicitly in terms of x1 and y from f ( x1 , x2 ) y . For instance, we cannot get the slope of
3 3 1
(0.3 x1 0 0.7 x2 x1 ) 3
y using the above method since it is not possible to write x 2 explicitly in
terms of x1 and y . So, the question which naturally arises is how we can determine the slope of such
level set?
Now since the level set f ( x1 , x2 ) y can be written implicitly as f ( x1 , x2 ) y 0 and its slope is given
dx 2
dx
by 1 , we can use any of the two implicit differentiation methods that we have used in section 2.6 to
dx 2
get
dx1 and thus the slope of the level set.
31
dx 2
Alternatively, we can apply total differential techniques to get
dx1 and thus the slope of the level set
as follows:
df(x 1 , x 2 ) dy
f1dx1 f 2 dx 2 0 since y is a constant
dx 2 f1
dx1 f2
dx2 f1 f f
f1 and f 2
dx1 f 2 , where x1 x 2
2
Example; Find the slope of the level curve for y f ( x1 , x2 ) x1 x2
Solution
f f
since f1 2x1x 2 and f 2 x12 , the Slope is given by
x1 x 2
dx 2 f1 2x1x 2 2x 2
dx1 f2 x12 x1
Notice that many of the curves in microeconomics are level curves; indifference curves and isoquants
are probably the most familiar. So, we can use either the implicit differentiation methods or the total
differential techniques to get the slopes of such curves.
Isoquants in the production function Q f ( L, K ) are level curves and represented by the equation
f ( L, K ) Q where Q is the level of fixed output which can be obtained from all combinations of L
dK f L
dL fk
32
In fact, this result can obtained using total differential techniques as follows:
dK
Take the total differential of both sides f ( L , K ) Q and solve for dL as
df(L, K) d Q
f L dL f K dK 0 since Q is a constant
dK f L
dL fK
The absolute value of the slope of an isoquant is called Marginal Rate of Technical Substitution (or
MRTS). So, MRTSL, K is given by
dK f
MRTS L , K L
dL f K
1 1
Solution
f 1 2 3 1 2 f 1 1 1
since f1 x1 x 2 and f 2 x1 3 x 2 2 ,
x1 3 x 2 2
1 2 3 12 1 1
x x
f1 3 1 2 2 x 22 2 2x 2
MRTS
f 2 1 x 13 x 12 3 2 3 13 3x1
1 2 x1
2
Find MRTSL,K for Q f ( L, K ) AL K , A 0,0 , 1 and show that
MPL
MRTSL,K =
MPk
Solution
f L MPL
MRTS L , K
f L MPL and f k MPk f K MPK .
Moreover, Since , We have
33
Notice that an indifference curve in the utility function U f (q1 , q 2 ) is also a level curve and it
represented by the equation f (q1 , q2 ) U as it shows all combinations of the two goods q1 and q 2
which gives the given level of utility U . Moreover, the slope of an indifference curve is given by
dq 2 f1 U U
where f1 and f 2
dq1 f2 q1 q 2 and the Marginal Rate of Substitution (or MRS) which is the
Chapter 7
Basics of Integration
7.1. Indefinite integral
Example1;
Example2;
34
Example;
35
Example;
Solution:
Example;
36
7.3. Integration by substitution and by parts
Example1;
Example2;
37
Example1;
Example2;
38
7.4. Definite integral
Example;
b)
Example
39
7.5. Economic applications of integral calculus
A) Developing a total function from marginal function
Example; If the marginal cost of a firm is given by an equation C’(Q) = -4Q3 + 6Q – 80
and the fixed cost of production is 90 birr, find the total cost of production.
❑
∫ MdQ=TC
❑
❑
TC =∫ (−4 Q3+ 6 Q – 80¿) dQ ¿ = = -Q4 + 3Q2 – 80Q and the fixed cost is given as = 90
❑
Then TC = -Q4 + 3Q2 – 80Q + 90
One of the applications of definite integration is finding the area of a region bounded by f(x), the
X axis and the vertical lines x = a and x=b.
Note: The area of the region R, denoted by A(R), bounded by f, the X-axis, the vertical lines X=a
and X=b is given by;
40
Example;
41
C) Consumer’s and Producer’s Surplus
D) Definition: -consumers’ surplus is the difference between the total amount that consumers
would be willing to spend and the actual consumer expenditure on a commodity.
E)
42
240 0.05 x 2 240
CS 20 0.05 x dx 8(240) 20 x
2 0
1920.
I) 0 =
0. 5(0 240)2
CS 0(2 240) 1920 bir 14 0.
J)
2 Thus , the total savings to consumers who are willing to pay a higher price
for the product is birr 1,440.
K) Definition: The additional money suppliers gain from the higher price is called the
producers’ surplus.
L) Example: Find the producers’ surplus at a price level of birr 20 for the price-supply
2
equation
p xs )( 2 0.0 02x .
M) The solution for this question can be answered in a similar way with consumers’ surplus.
x
ps px s ( x)dx
N)
px
0
P)
s(x)dx
0 are the total sales that the producers are willing to accept.
43
R) 20 2 0.0002 x 2 x 300
A matrix is defined as an array of numbers (or algebraic symbols) set out in rows and columns. The usual
notation system is to denote matrices by a capital letter in bold type and to enclose the elements of a
matrix in a set of squared brackets, i.e. [ ]. Matrices may also be specified with algebraic terms instead of
numbers. Each entry is usually known as an ‘element’. The elements in each matrix must form a complete
rectangle, without any blank spaces. For example, if there are 5 rows and 3 columns there must be 3
44
elements in each row and 5 elements in each column. An element may be zero though. The size of a
matrix is called its ‘order’. The order is specified as: (number of rows) × (number of columns)
For example, the matrix A below has 5 rows and 3 columns and so its order is 5 × 3.
Matrices with only one column or row are known as vectors. These are usually represented by lower case
letters, in bold. For example, vector p = [139 160 205 340 430] is 1 × 5 row vector
8.2. Matrix addition and subtraction: Matrices that have the same order can be added together, or
subtracted. The addition, or subtraction, is performed on each of the corresponding elements.
E.g. A retailer sells two products, Q and R, in two shops A and B. The number of items sold for the last 4
weeks in each shop are shown in the two matrices A and B below, where the columns represent weeks
and the rows correspond to products Q and R, respectively.
Solution:
8.3. Scalar multiplication and matrix multiplication: There are two forms of multiplication that can be
performed on matrices. A matrix can be multiplied by a specific value, such as a number (scalar
multiplication) or by another matrix (matrix multiplication). Scalar multiplication simply involves the
multiplication of each element in a matrix by the scalar value.
E.g. The number of units of a product sold by a retailer for the last 2 weeks are shown in matrix A below,
where the columns represent weeks and the rows correspond to the two different shop units that sold
them.
45
If each item sells for Birr4, derive a matrix for total sales revenue for this retailer for these two shop units
over this two-week period.
Solution:
The scalar value that a matrix is multiplied by may be an algebraic term rather than a specific number
value.
Scalar division works in the same way as scalar multiplication, but with each element divided by the
relevant scalar value.
The basic rule matrix multiplication is to multiply elements along the rows of the first matrix by the
corresponding elements down the columns of the second matrix.
Where the row vector p contains the prices of hire cars in each category and the column vector q contains
the quantities of cars in each category that your company wishes to hire for the week.
46
e.g.2.
The general m × n matrix with number of rows m and columns n can be written as;
47
Exercise:
Determinants: For a 2nd order matrix (i.e. order 2 × 2) the determinant is a number
calculated by multiplying the elements in opposite corners and subtracting. The usual
notation for a determinant is a set of vertical parallel lines either side of the array of
elements, instead of the squared brackets used for a matrix. The determinant of the general 2
× 2 matrix A, written as |A| and,
48
E.g.
Exercise:
To get the transpose of a matrix, usually written as AT, the rows and columns are swapped
around, i.e. row 1 becomes column 1 and column 1 becomes row 1, etc. If a matrix is not
square then the numbers of rows and columns will alter when it is transposed.
Minor matrix: The minor |Mij | of matrix A is the determinant of the matrix left when row i and
column j have been deleted. For example, if the first row and first column are deleted from matrix
49
e.g.
The cofactor matrix usually denoted by C same as a minor, except that its sign is
determined by the row and column that it corresponds to. The cofactor |Cij | of matrix A is the
determinant of the matrix remaining when row i and column j have been eliminated, with the
sign (−1)i+j .
Where
50
8.5. The matrix inverse
Asquare matrix A is said to be invertible or non-singular, if and only if threre is asquere matrix B
such that AB=BA=I, where I is the identity matrix that has the same order as A. only asquare
matrix can have an inverse. Invers of asquare matrix A is denoted by A -1.
e.g.
Solution:
The above method of finding an inverse matrix is tedious and time consuming. Threre is another
method of finding invers of invertible matrix, Using the adjoint of matrix the formula for A −1 (the
inverse of matrix A) can be stated as
e.g.
51
|A|=1(4-5)-(-2)(0+4)+3(0+8) = -1+8+24 = 31
Then A-1 =
Exercise
Amatrix inverse is used to solve a set of simultaneous equations specified in matrix format. For
example,
52
X1 =4, x2= 2 and x3 = 5.
53
8.7. Solving simultaneous equations using crumers rule
Cramer’s rule is another method of using matrices for solving sets of simultaneous equations, but it finds
the values of unknown variables one at a time. This means that it can be quicker and easier to use than the
matrix inversion method if you only wish to find the value of one unknown variable.
A set of n simultaneous equations involving n unknown variables x 1, x2,...,xn and n constants values can
be specified in matrix format as Ax = b where A is an n × n matrix of parameters, x is an n × 1 vector of
unknown variables and b is an n × 1 vector of constant values. Cramer’s rule says that the value of any
one of the unknown variables xi can be found by substituting the vector of constant values b for the i th
column of matrix A and then dividing the determinant of this new matrix by the determinant of the
original A matrix. Thus, if the term Ai is used to denote matrix A with column i replaced by the vector b
then Cramer’s rule gives
Example:
54
Chapter 9
Optimization functions
9.1. Unconstrained optimization
9.1.1. Optimization with one variable & its Economics application
Extreme points: Minima and Maxima
Now let us to discuss the use of derivatives to find the relative maximum or minimum value of a
function. A relative maximum of a function is a peak, point on the graph of the function that is
higher than any neighboring point on the graph. A relative minimum is the bottom of a valley,
point on the graph that is lower than any neighboring point. The relative maxima and minima of
a function are collectively termed as relative extrema.
The above graph shows the relative maxima and minima of a function. The function has a
relative maximum point A and C and a relative minimum at points B and D. Each of these points
represents an extremum in the immediate neighborhood of the point only.
First order and second order tests
First order tests
Given y=f(x) which is continuous and possess a continuous derivative, then the first derivative,
f (x) ' plays a major role in our search of its extreme values.
Definition: The values of x where the first derivative f (x) ' =0 are called critical points and the
corresponding values of f(x) are called critical values. The value of the function at the critical
value, f(x0), will be
55
A) a relative maximum if the derivative f (x) ' changes its sign from positive to negative from the
immediate left of the point x0 to its immediate right.
B) a relative minimum if f (x) ' changes its sign from negative to positive from the immediate left
of x0 to its immediate right.
C) neither a relative maximum nor a relative minimum if f (x) ' has the same sign on both the
immediate left and right of point x0.
In (a) the value of the function at x1, f(x1), is a relative minimum because the sign of the first
derivative changes f'(x) its sign from negative to positive. The value of the function at x2, f(x2), is
a relative maximum because the sign of the first derivative f '(x) changes its sign from positive to
negative. In (b) the value of the first derivative at x1, f '(x1) is zero, the derivative does not change
its sign from one side of x=x1, to the other side. Point A is called an inflection point.
The point (x1,f(x1)) and (x2,f(x2) in both (a) and (b) are called stationary points. The value of the
function f(x) at these stationary points (i,e, f(x1) and f(x 2) ) are called stationary values of y.
Note that a relative extremum must be a stationary value, but a stationary value may be
associated with either a relative extremum or an inflection point.
Example
Find the relative exterma for f(x)=x3-12x2+36x+8.
Solution
The first derivative of f(x) is f (x) ' =3x2-24x+36 equating this to zero,
3x2-24x+36=0
x1=2 and x2 = 6 are the critical values.
Next check the signs of f (x) ' by taking numbers from the left and right of x = 2 and x = 6
For x=2 take x=1.9 and x=2.1
At x = 1.9, f (x) ' = 3(1.9)2-24(1.9) +36 = 1.23
At x = 2.1, f (x) ' = 3(2.1)2-24(2.1) + 36 = -1.17
Since the sign of f (x) ' changes from positive to negative, f(2) = 40 is a relative maximum.
56
For x = 6 take x = 5.9 and x = 6.1
At x =5.9, f (x) ' =3(5.9)2-24(5.9)+36 = -1.17
At x=6.1, f (x) ' = 3(6.1)2-24(6.1) + 36 = 1.23
Since f (x) ' changes its sign from negative to positive, f(6)=8 is a relative minimum.
Second order tests
The second derivative can be used to find the relative maxima or relative minima of a function.
This test is more convenient than the first derivative test, because it does not require you to check
the derivative sign to both the left and the right of x0.
If the first derivative of a function f at x = x0 is f' (x) = 0, then the value of the function at x0,
f(x0), will be
i) a relative maximum if the second derivative value at x0 is negative. That is, if f (x) '' < 0.
ii) a relative minimum if the second derivative value at x0 is positive. That is, if f (x) '' > 0.
''
But if f (x0) =0, the stationary value f(x0) can be either a relative maximum, or a relative
minimum, or even an inflection value. In this case we must revert to the first derivative test, or to
another test.:
Example: Find the relative extremum of the function f(x) = 4x2-x by using the second derivative
test.
Solution: First find the first derivative of the function and then the critical value
f ' (x) = 8x-1and thus x=1/8 is the critical value.
Then find the second derivative
f '' (x) = 8 and check the sign of the second derivative at the critical value
f''( x) = 8 >0
Therefore, f (1/8) = -1/16 is the relative minimum.
Economic Application:
Eg1.
57
Eg2.
58
Eg3.
59
9.1.2. Optimization with more than one variable & its economics application
Optimization with Functions of Two Choice Variables
∂ f (x , y)
fy = = x2 - 1………………………(2)
∂y
From equation 2 x2 = 1, x= ±√ 1= 1 or -1. Then substitute these values for x in equasion 1 and
find the respective values of y. That is, when x=1, 3(1)2 + 2(1)y + 1 = 0
3+2y+1=0
60
e.g.
61
e.g.2.
62
9.2. Constrained optimization
Dear students, as you studied in the previous courses economics is the study of allocation of
scarce resources. Scarcity implies constraints, so much of economic theory has to do with agents
who are optimizing some objective function subject to a constraint, as in the case, for example,
when a consumer maximizes utility subject to a budget constraint. In similar manner, a producer
maximizes output subject to cost constraint.
Dear students, for further illustration let us consider together a consumer with the simple utility
function U = x1x2 + 2x1
∂u ∂u
Since the marginal utilities – the partial derivatives U 1 = and U2 = are positive for all
∂x 1 ∂x 2
positive levels of x1 and x2 , here, to have U maximized without any constraint, the consumer
should purchase infinite amount of both goods, a solution that obviously has little practical
relevance. To render the optimization problem meaningful, the purchasing power of the
consumer must also be taken into account; i.e., a budget constraint should be incorporated into
the problem. If the consumer intends to spend a given sum, say, Birr 60, on the two goods and if
the prices are P1 = 4 and P2 = 2, then the budget constraint can be expressed by the linear
equation as: 60 = 4x1 + 2x2
Such constraint renders the choice of x 1 and x2, mutually independent. The problem now is to
maximize utility, subject to this constraint. Mathematically, what the constraint (variously called
restraint, side relation, or subsidiary condition) does is to narrow the domain, and hence the
range of the objective function.
2 x2 =60−4 x 1
63
60−4 x 1
x 2= =30−2 x1
2
Then, you can combine the constraint with the objective function by substituting the above result for the
constraint into the objective function. The result is an objective function is one variable only:
2
U =x1 ( 30−2 x 1 ) +2 x 1=32 x 1−2 x1
Dear students, as you have learned in the previous unit, the stationary point of the above function can be
dU
found by setting the partial derivative equal to zero ( ¿=0=32−4 x1=0 ¿. Doing so will result in a
d x1
solution x 1 = 8 and x 2 = 30 – 2(8) = 14. We can then find the stationary value U = 128; and since the
2
d y
second derivative is 2
=−4< 0 ,that the stationary value constitutes a constrained maximum of U *.
d x1
The equality constraint was devised by a French mathematician J.L . Lagrange (1736 – 1813)
and is known as the method of Lagrange multipliers. Suppose that the interest is to find the
extreme value of the function f x, y, zsubject to the constraint g x, y, z0 which can be
formally written as
max f x, y, z
St
g x ,y, z0
64
Economic Applications of Lagrange Multiplier Method
Constrained optimization is very important for the following two applications.
I. Utility analysis
II. Profit analysis
I.Utility analysis
The main objective of rational consumer is maximization his utility given his limited income and
market prices of goods and services. Mathematically, the aim of the consumer can be written as
maxU = f(x,y,z)
s.t
I = Pxx + Pyy + Pzz, where U is a total utility, Consumer limited income, Px, Py and pz are per
unit price of good x, y and z respectively.
The main interest here is to find the consumer equilibrium (i,e, the consumption level that gives
maximum utility to the consumer). This is in fact means that maximizing the above constrained
utility function through the Lagrange Multiplier method. To do this the first step is to find the
critical points for the corresponding Lagrange Multiplier function through first order derivative
test as follows.
65
So the above equations can be re- arranged as
MUx . Px
MUy . Py
MUz . Pz
And from these equations you can derive the first condition for maximization as
e.g. Find the levels of x and y that can maximize the utility given by
66
II. Profit Analysis
Profit maximization of the firm is achieved in two ways:
A) Maximization of output subject to a cost constraint.
67
B). Minimization of cost for a given level of output.
Here also, you can use the method of Lagrange Multiplier to find the level of input combination
that can maximize the profit of the firm.
MP L w
MPK r
68
B) Minimization of cost for a given level of output
In this case output is given or remaining constant and the problem can written mathematically as
69
e.g.1. A firm can produce its product at two of its plants. The cost of manufacturing x units at the
first plant and y units at the second plant is given by the joint cost function
C( x, y) x2 y2 xy + 700. If the firm has a supply order of 500 units, how many units
should be produced at each plant to minimize the total cost?
Solution:
Cx2 y2 xy + 700
Subject to x+y=500
Again solve these equations simultaneously to get y 3x and substitute it in the supply order
constraint function as follows to get the critical point.
70