Notes PDF
Notes PDF
These lecture notes summarize, and at times complement the material in the textbook:
Advanced Engineering Economics by Park and Sharpe-Bette.
The course is about industrial projects and investments. In this course an industrial project is
any candidate project that involves the outlay of cash in exchange for an anticipated return flow of
future benefits. In contrast, an investment is a funded project. It involves the sacrifice of something
now for the prospect of something later.
We distinguish between real (physical) and financial investments (financial instruments). Some
caracteristics that distinguish real from financial investments are:
We will deal mostly with real investments, but at times we will need to talk about financial
investments as well.
The course is concerned with how to allocate capital to projects within a firm to maximize the
value of the firm to its shareholders.
Course objective: Taking into account the cost of capital, budgets, taxes, depreciation, inflation,
and uncertainty, decide which projects, if any, to undertake, reject, or postpone.
An investment project can be described by its cash flow profile, i.e., the amount and timing of
costs and benefits in the planning horizon.
We will start by ignoring taxes, inflation, depreciation, and uncertainty and later build these
features into the analysis.
Notes on Chapter 1: Accounting Income and Cash Flows.
The funds flow statement details how the company acquired funds (sources) and how the
funds were used (uses).
A company’s cash inflows are approximately equal to its net income plus depreciation plus any
taxes it has deferred, and all three of these quantities depend on the depreciation method the
company uses for tax and reporting purposes.
Many companies use one depreciation method for tax purposes (usually the method that min-
imizes the current tax liability) and another for financial reporting purposes (usually the method
that most accurately reflects the decline in value of its assets over time). It is common practice
to use an accelerated depreciation method to compute the actual tax liability and a straight-line
depreciation method for financing reporting. The difference between the computed taxes and the
actual tax liability is called deferred taxes.
Cash Flows are used in evaluating the economic value of a project. Projects usually start with
an investment, which is a cash flow from the company to the project. Cash outflows that move
from the company to the project are arbitrarily given a negative sign. Conversely, cash inflows that
move from the project to the company are given a positive sign. Each flow is usually assumed to
occur at the end of the year. The net cash flow is the difference between the cash inflows and the
cash outflows. It is also equal, see figure 1.2 in textbook, to the net income from operations, the
noncash depreciation expense, the net proceeds of equipment disposal, and new borrowing minus
the purchase of new equipment, the repayment of debt, and the net change in working capital. See
Example 1.8.
IEOR4003: Industrial Economics page 3 Professor Guillermo Gallego
An investment project can be described by its cash flow profile, i.e., the amount and timing of
expected costs (disbursements) and benefits (receipts) in the planning horizon. The term payment
is used to denote the receipts less the disbursements that occur at the same point in time.
We classify project cash flows in two categories:
(1) discrete-time cash flows, and
(2) continuous-time cash flows.
Notation:
Fn = discrete payment occurring at period n,
Ft = continuous payment occurring at time t.
Money has a time value because:
(i) it has a potential earning power,
(ii) users may have different utility of consumption of dollars at different times, and
(iii) the buying power of the dollar changes through time.
F1 = P (1 + i)
Even though financial institutions may use more than one interest period per year in compounding
the interest, they usually quote the interest on an annual basis. For example, a year’s rate at 1.5%
compounded monthly is typically quoted as “18% compounded monthly”. When stated in this
fashion, the 18% is called a nominal interest rate or annual percentage rate. The effective interest
rate represents the actual interest earned or charged for a specified time period. The effective interest
rate based on a year is referred to as the effective annual interest rate ia . The effective interest rate
based on the payment period is called the effective interest rate per payment period i.
Example: Suppose a bank charges a rate of 12% compounded quarterly (four times per year). This
means that 3% is charged every quarter. The interest per dollar accrued at the end of the year
is (1.03)4 − 1 = 0.1255. Thus the effective annual interest rate is ia = 12.55%. If the interest is
compounded monthly (12 times per year) then the effective annual interest rate is ia = (1.01)12 −1 =
0.1268 = 12.68%. In general r m
ia = 1 + −1
m
where ia is the effective annual interest rate, r is the nominal interest rate per year, m is the number
of interest periods per year, r/m is the effective interest rate per interest period.
As m goes to infinity we obtain:
r m
ia = lim 1 + − 1 = er − 1.
m→∞ m
The above formula can be generalized to compute the effective interest rate in any payment
period. If c is the number of interest periods per payment period, then the effective interest rate
per payment period is given by
r c
i = 1+ m −1
r c
i = 1 + ck − 1,
where k is the number of payment periods per year, and r/k is the nominal interest rate per payment
period.
Example: Let r = 12%, m = 12, c = 3, so k = 4. Then ia = 12.68%, and i = (1.01)3 − 1 = 3.03%.
Useful Sums.
Geometric Series:
N
1 − xN +1
xn = 1 + x + . . . + xN =
n=0
1−x
where x = 1. If −1 < x < 1, then
∞
1
xn = .
n=0
1−x
Arithmetic-Geometric Series:
N
x 1 − (N + 1)xN + N xN +1
n 2 N
nx = 0 + x + 2x + . . . + N x =
n=0
(1 − x)2
where x = 1.
If −1 < x < 1, then
∞
x
nxn = .
n=0
(1 − x)2
IEOR4003: Industrial Economics page 5 Professor Guillermo Gallego
Discrete Compounding.
Consider the cash flow F0 , F1 , . . . , FN where Fn represents the flow of cash at time n, with
the convention that positive values represent inflows and negative values represent outflows. What
amount now is equivalent to cash flow? We can answer this question very simply if we know the
current value of a dollar at time n = 0, 1, . . . , N. Let βn denote the value of a dollar at time n. Then
the current, or present value of the cash flow is
P = F0 + F1 β1 + F2 β2 + . . . + Fn βN ,
since by definition β0 = 1.
Suppose that the interest rate in period n is in . How are the β factors related to the interest
rates? At interest rate i1 a dollar now will be worth 1 + i1 dollars a year from now, so we need to
deposit β1 = 1/(1 + i1 ) dollars now to obtain a dollar a year from now. Thus, at interest rate i1 , β1
dollars now are equivalent to 1 dollar at the end of year 1. Similarly, it takes β2 = β1 /(1 + i2 ) dollars
now to obtain a dollar after two years by earning i1 % in year one and i2 % in year two. In general,
we have
1
βn = .
(1 + i1 ) . . . (1 + in )
Of special interest is the case when i1 = i2 = . . . = iN = i, i.e., the case where the interest rate
is constant over time. In this case we have
The book has formulas, as well as tables, for the present value of cash flows of the form Fn =
A, n = 1, . . . , N, Fn = (n−1)G, n = 1, . . . , N, and Fn = F1 (1+g)n−1 . The first case is known as the
uniform series, the second as the arithmetic gradient series, and the third as the geometric gradient
series. In what follows, we will develop formulas for the present and future value of these series.
We start by considering the case in which the payment periods are identical to the compounding
periods.
Single Sum. The value F at time N of a single payment P invested at interest rate i is:
F = P (1 + i)N
The factor (1 + i)N is called the single-payment compound amount factor, and is denoted by
(F/P, i, N ).
This factor represents the value of one dollar after N years when it is invested at interest rate i.
Values of (F/P, i, N ) are tabulated in tables at the back of the book.
Example: If P = $24, i = 6%, and N = 369, then
F = $24(F/P, 6%, 369) = $24(2, 177, 029, 343) = $52, 248, 704, 230.
2.
By inverting the above formula, we obtain
P = F (1 + i)−N .
The factor (1 + i)−N is called the single-payment present-worth factor, and is denoted by
(P/F, i, N ).
This factor represents the amount we need to invest now for N years at interest rate i to obtain one
dollar at time N.
IEOR4003: Industrial Economics page 6 Professor Guillermo Gallego
Uniform Series.
An amount A deposited at the end of each period, for N periods, in an account paying i% per
period would grow to:
N
N −n (1 + i)N − 1
F =A (1 + i) =A .
n=1
i
The term in brackets is called the uniform-series compound amount factor, or equal series
compound amount factor and is represented by
(1 + i)N − 1
(F/A, i, N ) = .
i
2.
The inverse relationship yields the uniform-series sinking-fund factor designated by
i
(A/F, i, N ) = .
(1 + i)N − 1
The bracketed term is the uniform-series present worth factor, designated by (P/A, i, N ).
Example: What is the present worth of an annuity that pays A = $12, 000 at the end of each
year for N = 10 years at i = 8%.
2.
The inverse factor is called the capital recovery factor, and is denoted by (A/P, i, N ).
i(1 + i)N
(A/P, i, N ) = .
(1 + i)N − 1
A = $1, 000, 000(A/P, 8%, 20) = $1, 000, 000(0.10185) = $101, 850.
2.
IEOR4003: Industrial Economics page 7 Professor Guillermo Gallego
2.
A uniform series equivalent to the gradient series is easily obtained by the relationship
between P and A. Resulting in the factor :
1 N
(A/G, i, N ) = − .
i (1 + i)N − 1
The future-worth equivalent of a gradient series can be easily obtained by multiplying the
uniform series equivalent to the gradient series by the (F/A, i, N ) factor, resulting in
G
(F/G, i, N ) = [(F/A, i, N ) − N ] .
i
Example: Let Fn = $100 + $6(n − 1), N = 8, i = 10%. Then,
2.
Geometric Series.
Consider the geometric series of payments:
Fn = F1 (1 + g)n−1 ,
for i = g.
1+i
Using g = 1+g − 1, we also obtain
F1
P = (P/A, g , N ).
1+g
Example: If F1 = $39, 600, g = 0.1, N = 5, i = 0.15, then
1 − (1.1)5 (1.15)−5
P = $39, 600 = $157, 839.18.
.05
IEOR4003: Industrial Economics page 8 Professor Guillermo Gallego
2.
Compounding period different from payment period
Whenever the payment period and the compounding period do not correspond, we approach the
problem by finding the effective interest rate based on the payment period and then use this rate in
the formulas with the appropriate accounting of the number of payments.
Example: What is the present worth of quarterly payments of A = $1, 000 that extends over a
period of N = 5 years if the interest rate is r = 8% compounded monthly? The effective interest
rate per payment period (quarter) is i = (1 + 0.08/12)3 − 1 = 2.0133%. There are N = 20 quarters
in 5 years, so
P = $1, 000(P/A, 2.0133%, 20) = $1, 000(16.33047) = $16, 330.47.
2.
Continuous Compounding.
Discrete Payments
Here again, we compute the effective interest rate i = er − 1, and use it as above. This gives rise
to a whole new set of interest formulas. For instance, if we are interested in the present value of a
uniform series of payments A over N years at interest rate r compounded continuously, we can
use the previously discussed factor (P/A, i, N ) with i = er − 1. Alternatively, we can directly use
the continuous factor (P/A, r, N ), where
rN
e −1
(P/A, r, N ) = rN r ,
e (e − 1)
and so on.
Example: What is the present worth of a uniform series of year-end payments of A = $500 each
for N = 10 years if the interest rate is r = 8% compounded continuously?
The effective rate is i = e0.08 − 1 = 8.33%. Then
P = $500(P/A, 8.33%, 10) = $3, 305.85.
2.
Example: A series of equal quarterly payments of A = $1, 000 each extends for N = 5 years.
What is the present worth at r = 8% compounded continuously?
The interest per quarter is i = e0.02 − 1 = 2.02%,
P = $1, 000(P/A, 2.02%, 20) = $16, 319.70.
2.
and
N
erN − 1
F = Ā ert dt = Ā .
0 r
These factors are called the funds flow present-worth factor, denoted by (P/Ā, r, N ), and the
funds flow compound amount factor, denoted by (F/Ā, r, N ).
Example: The end of year equivalent payment to 52 payments of $100,000 a year at 8% com-
pounded continuously is
2.
The dollar is an imperfect unit of measure because its value changes with time. Inflation (resp.,
deflation) is a measure of the decline (resp., increase) in purchasing power of the dollar.
Measures of Inflation:
CPI(Consumer Price Index) based on market basket,
GNPIPD(Gross National Product Implicit Price Deflator),
PPI(Producer Price Index),
Most inflation indexes are computed by monitoring the price changes of a basket of goods. This
tends to overstate inflation for the following reasons:
– Improvements in quality are not taken into account.
– Opportunities for substitution are not taken into account.
IEOR4003: Industrial Economics page 10 Professor Guillermo Gallego
Average Inflation.
Suppose that inflation in year one is 5%, and inflation in year 2 is 10%, then the overall inflation
is (1.05)(1.10) - 1 = 15.5%. The average inflation f satisfies (1 + f )2 = 1.155, so f = 7.47%. Notice
that the average inflation is less than the arithmetic average. This is due to the geometric-arithmetic
inequality.
1/N
N
N
1
xj ≤ xj .
j=1
N j=1
Actual Dollars. Represent the out of pocket dollars received or expended at any point in time.
(Current dollars, future dollars, inflated dollars, nominal dollars).
Constant Dollars. Hypothetical purchasing power of future receipts and disbursements in
terms of the purchasing dollars in some base year. (Real dollars, deflated dollars, todays dollars)
Market Interest rate i represents the opportunity to earn as reflected by the actual interest
rates available in financial markets.
Inflation-free interest rate i represents the earning power of money isolated from the effects
of inflation. (Real interest rate, true interest rate, constant dollar interest rate.)
Note that i can be negative.
General inflation rate f represents the average annual percentage of increases in prices of
good and services.
Relationship between i, i and f.
Amounts in constant dollars will be primed. The relationship between real and constant dollars
is through the inflation rate. That is F dollars at time n are equivalent to F = F (1 + f )n actual
dollars at time n.
Now assume that the market interest rate is i. Then the present value of F actual dollars n years
from now is given by
P = F (1 + i)−n .
On the other hand, the present value of F real dollars n years from now is obtained via the
inflation-free interest rate. Thus
P = F (1 + i )−n .
From the above we deduce that
F (1 + i)−n = F (1 + i )−n = F (1 + f )−n (1 + i )−n .
So
(1 + i) = (1 + i )(1 + f )
or
i = i + f + i f.
We can also write i in terms of i and f as
i−f
i = .
1+f
Note that f > 0 implies i < i − f.
Example: Suppose that i = 6% and f = 4%, then i = .02/1.04 = .0193 = 1.93%.
Some economists view inflation as a form of taxation on the holding of money. Indeed, if you
were to hold a dollar for a year its purchasing power at the end of the year would be (1 + f )−1 for
a loss of
f
.
1+f
IEOR4003: Industrial Economics page 11 Professor Guillermo Gallego
This is often called the “inflation” tax. For example, if f = 4%, the “inflation” tax is equal to 3.84%.
Most textbook examples assume that all cash flows respond equally to inflationary trends. This
is no so in real life. For instance depreciation schedules are not affected by inflation.
Home Ownership Analysis:
A question asked by many families is whether they should invest in buying a house. While there
may be many non economic reasons to own a house, we will analyze home ownership from a purely
economic view point.
Our objective is to find the present cost of owning a house over a reasonable time horizon, say
N years. This cost can be used for instance, to compare against the cost of renting.
Let us assume that a house of price H is purchased now by a conventional loan. That is, we
give a 20% down payment and take a 30 year fixed loan at nominal rate r compounded monthly.
In addition, let us assume that there are closing fees of the form A + 0.008Hj, where A includes
the cost of inspection, appraisal, title insurance and lawyers fees and j% represents the points
charged to secure the loan. Let Tn denote property taxes due at the end of year n, and assume
that Tn is of the form Tn = T (1 + fT )n−1 , n = 1, . . . , N, where T represents the amount due at
the end of the first year. Interest paid on the mortgage and property taxes are tax deductible; we
assume a tax rate t. Let Mn denote the maintenance cost in year n, n = 1, . . . , N. We assume that
utilities and insurance cost are included in Mn . Assume that this costs increase at rate fM so that
Mn = M (1 + fM )n−1 , n = 1, . . . , N, where M is the maintenance cost during the first year. Let us
assume that the house increases in value at the rate of fH . Thus, we estimate that after N years
the house will be sold at H(1 + fH )N . Assuming a sales commission of 6%, it is easy to write a
spreadsheet to compute the present value of the resulting cash flow at market interest rate i. This
present value can then be compared to the cost of renting.
To do the above calculations, we need to compute the present value of the sequence of interest
payments on an equal installment loan. This calculation is of importance since interest on a home
mortgage is tax deductible. We next show how to accomplish this task. Let
B amount borrowed,
ib interest rate per period,
N number of periods,
A = B(A/P, ib , N ) payment per period.
The unpaid balance Bn at the end of period n satisfies the difference equation
Bn = Bn−1 (1 + ib ) − A, n = 1, ..., N,
Now, suppose we want to find the present value of the cash flow In , n = 1, . . . , N at rate i. We see
that In is the difference of a uniform series of payments and a geometric series of payments. Thus,
N
P V (i) = In (1 + i)−n
n=1
(A − Bib )
= A(P/A, i, N ) − (P/A, j, N ),
1 + ib
i−ib
where j = 1+ib .
IEOR4003: Industrial Economics page 12 Professor Guillermo Gallego
Up to now we have been able to compute the present value of a limited number of cash flows:
constant, linear gradient and geometric gradient. Often we need to compute the present value of
more complex cash flows. A technique that comes in handy is the use of Z-transforms.
Consider an infinite cash flow {f (n)}. Its present value can be written as
∞
P V (i) = f (n)(1 + i)−n .
n=0
F (z) is called the Z-transform of f (n) and this relationship is denoted by writing F (z) = Z{f (n)}.
One of the most important and useful properties of Z−transforms is that there is a one to one
correspondence between the infinite cash flows f (n) and their Z−transforms F (z). Another property
of Z−transforms is that linear combinations of cash flows correspond to the same linear combinations
of their Z−transforms. For instance, if f (n) ↔ F (z) and g(n) ↔ G(z), then αf (n) + βg(n) ↔
αF (z) + βG(z).
These properties plus the fact that F (z) has been pre-computed for a large number of cash flows
f (n) is what makes the technique useful in practice. See Tables 3.1, 3.2, 3.3 and 3.4 in your book.
Example: If f (n) = 3n + 7(1 + j)n − 4e−jn + 8, n = 0, 1, . . . , then
z z z z
F (z) = 3 +7 −4 +8 ,
(z − 1)2 z − (1 + j) z − e−j z−1
Most realistic cash flows extend over finite time horizons, e.g {f (n)}, n = 0, 1, . . . , k. Here we
cannot directly apply Z−transforms, but we can try writing the present value as the difference of
two Z−transforms. Indeed, if
k
P V (i) = f (n)z −n .
n=0
Then ∞ ∞
−n −(k+1)
P V (i) = f (n)z −z f (n + k + 1)z −n ,
n=0 n=0
so that
P V (i) = F (z) − z −(k+1) G(z)
where F (z) = Z{f (n)} and G(z) = Z{f (n + k + 1)}.
More generally, if the cash flow is of the form {f (n)}, n = h, . . . , k, we can write
correspond to the Laplace transforms. The techniques used to compute present values of complex
continuous cash flows, using Laplace transforms, are similar to those used for complex discrete cash
flows using Z−transforms.
IEOR4003: Industrial Economics page 14 Professor Guillermo Gallego
This is a brief treatment of the major aspects of the U.S. corporate law. The emphasis is on the
depreciation and tax treatment of assets used for production and distribution in a trade or business.
f = tf (1 − s)
s = ts (1 − yf ),
or
f + tf s = tf
yts f + s = ts .
IEOR4003: Industrial Economics page 15 Professor Guillermo Gallego
This problem is very easy to solve. We find the period with largest discounted tax rate
k = argmax{n : tn (1 + i)−n }
BBn = BBn−1 − Dn ,
for n = 1, . . . , N.
Straight-Line Method.
Dn = (P − F )/N, n = 1, . . . , N,
BBn = P − n(P − F )/N, n = 1, . . . , N.
This method may be used to depreciate any property; the lifetime used for depreciation is taken
from government ADR (asset depreciation range) tables.
Sum of Year Digits (SOYD) Method.
Notice that
N
N (N + 1)
n= .
n=1
2
The idea is to depreciate the fraction
2(N + 1 − n)
N (N + 1)
in year n, n = 1, . . . , N. Thus
2(N + 1 − n)
Dn = (P − F ),
N (N + 1)
n = 1, . . . , N.
Declining Balance Method.
The declining balance method allocates each year a given fraction of the book balance at the end
of the previous year.
Dn = αBBn−1 = αP (1 − α)n−1 .
The DDB (double declining balance) sets α = 2/N, the 150% DB (one and a half declining balance)
sets α = 1.5/N. A problem with this method is that, if unchecked, it is unlikely that BBN = F.
If BBN > F, a switch to straight line is made as soon as the straight line allowance is larger than
the declining balance allowance. Otherwise depreciation terminates as soon as the BBn < F, by
appropriately adjusting the depreciation during year n.
The MACRS is DDB with optimal switching to straight-line and mid year convention for assets
in the 3,5,7 and 10 year classes; 150%DB with optimal switching and mid year convention for assets
in the 15 and 20 year classes and straight line with mid month convention for property in the 27.5
and 31.5 year classes. See table 4.8.
Example: To construct table 4.8 for N = 5, note that the double declining balance applies, so
α = 0.4. Because of the mid year convention, we allow
D1 = 0.2 so BB1 = 0.8,
D2 = αBB1 = 0.32 so BB2 = 0.48,
D3 = 0.192 etc.
The tables include the timing of the optimal switching to straight line.
Impact of Inflation. Assume that an asset worth P dollars is purchased at time 0 and that it is
100% financed with interest payments {IPn }N N
n=1 , and principal payments {P Pn }n=1 . Assume further
N
that the asset will generate the string of revenues {Rn }n=1 ; that it will be depreciated according to
the schedule {Dn }N n=1 . The tax rate is assumed to be t throughout the horizon, and the inflation
rate is assumed to be zero. According to the above formula for generalized cash flows, we have
Now, assume a similar scenario where the inflation rate f is positive and the revenues are re-
sponsive to inflation. That is, Rn = Rn (1 + f )n . The actual dollar cash flows are therefore
n = 1, . . . , N. Are we better or worst off? To answer this question we first convert the actual dollar
cash flows to constant dollar cash flows, and then obtain the present value of the cash flows. The
constant dollar cash flows are
Gn = Rn (1 − t) + [(IPn + Dn )t − T Pn ] (1 + f )−n ,
Comparing the present value with and without inflation, we see that the revenue term is not affected,
since revenues are responsive to inflation. Consequently, the answer depends on the second term
involving the interest payments, the principal payments and the depreciation schedule. If (IPn +
Dn )t − T Pn < 0 for all n = 1, . . . , N, as is usually the case when an asset is financed, then we are
better off with positive inflation. On the other hand, if the asset is paid in full at time zero, then
IPn = T Pn = 0, n = 1, . . . , N, so we are better off without inflation.
IEOR4003: Industrial Economics page 19 Professor Guillermo Gallego
The interest rate used for discounting cash flows has a direct effect on the outcome of project
evaluation and comparisons. A change in the interest rate can change the accept-reject decision for
an individual project, and it can alter the choice of the “best” from among several projects. The rate
is usually designated as the interest rate, or the discount rate; it is also known as the minimum
attractive rate of return, MARR.
Rearranging, we obtain
B
ie = k + (k − kb ).
S
We can think of k as the return on assets, of kb as the after tax return on bonds, and ie as the return
on equity. This equation suggests that if k > kb , the return on equity increases as we increase the
debt to equity ratio B/S. However, increasing, a highly leveraged company is perceived as more
risky by bondholders and after tax cost of debt capital is likely to rise reducing ie . Also, a higly
leverage firm magnifies losses to equity holders in years where k < kb .
Of more importance to us is the marginal cost of capital as a minimal attractive rate of return.
Suppose that engaging in a project that requires C units of capital changes the average cost of
capital from k0 on V to k1 on V + C. Let r be the return of the project. Then, we are better off by
engaging in the project if and only if
Cr ≥ (V + C)k1 − V k0
Cr < (V + C)k1 − V k0
or
$15, 000 > $165, 000 − $145, 000 = $20, 000.
This example illustrates that cautioned should be exercised on determining the minimum attrac-
tive rate of return.
Debt Capital
Recall that interest paid on loans is a tax-deductible expense. So the after-tax cost of loans and
bonds is less than the before-tax loan interest rate and bond yield rate.
Bank Loans: There are several cases to consider, depending on the number of interest payments
and tax payments per year. In the simplest case both are paid annually resulting in an after tax
rate
kb = (1 − tm )ib
where ib is the before-tax nominal interest rate per year and tm is the marginal corporate income
tax rate. See example 5.1
IEOR4003: Industrial Economics page 21 Professor Guillermo Gallego
Most corporations, however, pay income taxes more frequently, say quarterly, based on estimated
income statements. So the tax benefit of the interest expense is taken quarterly. To obtain the after
tax rate, we first compute the quarterly rate by solving
kb = (1 + kbp )4 − 1.
Here kbp represents the quarterly interest rate. See example 5.2
A third possibility is when the interest payments are actually made quarterly. Then the appro-
priate after-tax rate is
4
kb = [1 + ibp (1 − tm )] − 1.
Notice that the corresponding formula, 5.4, in the book is incorrect. See example 5.3
When there are discount points on a loan, the borrower does not receive the full amount of the
loan, but payments are calculated on the full amount. The effective loan rate is thereby increased.
For tax purposes the discount is prorated over the life of the loan. The exact method for finding
the after-tax rate requires setting up a loan schedule and computing the after tax cash flow.
The loan schedule method may be time consuming if an electronic spreadsheet that computes
the loan schedule and the after tax cash flow is not available. An alternative method, is to compute
the interest rate that makes the present value of the borrower’s before-tax cash flow equal to zero,
and then multiplying it by one minus the tax rate (1 − tm ). See example 5.5.
Bonds: Before discussing the after-tax cost of bonds, we will define and classify bonds. In
addition we will discuss how to compute the present value of a bond at a given interest rate, and
how to compute the yield of a bond at a given price.
A bond is a note issued by a corporation or government entity for the purpose of obtaining
needed capital for financing projects. The repayment conditions are specified at the time the bonds
are issued. These conditions include the bond face value, the bond interest rate, the bond interest
repayment period, and the bond maturity date.
The bond face values is usually a multiple of $1000. The face value:
(i) represents the sum paid at maturity date,
(ii) determines the interest payment.
Let
F bond face value,
r bond interest rate,
m number of payments per period.
Then, the interest paid per period, is I = F r/m. Often a bond is purchased at a discount (less
than face value) or at a premium (more than face value).
In addition to U.S. government bonds, bonds are usually classified as mortgage, debenture, or
municipal bonds. Mortgage bonds are backed by mortgages or specified assets. There are several
types of mortgage bonds including first and second mortgage bonds. In the event that of foreclosure
by the bondholders, the first-mortgage bonds take precedence during liquidation.
Debenture bonds are not backed by any form of collateral. The reputation of the company is
important for attracting investors to this type of bond. They can be convertible, non-convertible,
subordinate or junk bonds. Convertible bonds can be converted into common stock at a fixed rate
as long as the bonds are outstanding. Subordinate debentures represents debt that ranks behind
other debt in the event of liquidation or reorganization. Junk bonds are debenture bonds rated BBB
or lower by Standard and Poor’s bond rating.
IEOR4003: Industrial Economics page 22 Professor Guillermo Gallego
Finally, municipal bonds are income tax free bonds. They can be either general obligation or
revenue bonds. See handout for a more complete classification of bonds.
What is the present worth of a bond with face value F and coupon rate r, if it pays interest m
times per year, it matures in N years, and its cash flow is discounted at a nominal interest rate of
i% per year?
Fr
P = (P/A, i/m, mN ) + F (P/F, i/m, mN ).
m
Example: If F = $10, 000, r = 6%, m = 2, N = 15, and i = 8%, then
The rate of return, or yield, of a bond with face value F, that matures in N years, with coupon
rate r, that pays interest m times per year, and sells for P dollars, is the interest rate that makes
the present value of the cash flow equal to zero. In other words, the yield is the interest rate at
which the cash flow generated by the bond is equivalent to its current price. Thus, the yield, is the
interest rate that solves
Fr
P = (P/A, i/m, mN ) + F (P/F, i/m, mN ).
m
The solution to the above equation is the nominal yield. The effective yield is
(1 + i/m)m − 1.
Example: If F = $1, 000, r = 4%, m = 2, N = 20, and P = $800, then the nominal yield is 5.74%
and the effective yield is 5.86%.
The after-tax cost of bonds depends on the timing of interest and tax payments. In addition,
there are issuing expenses and discounts (or premiums) to be considered. The selling expense Se
must be prorated over the maturity period N for tax purposes. The annual expenses Ae are tax-
deductible each year. The discount or difference between the face value F and the selling price P,
must be spread over the maturity period by prorating (bonds issued before 7-01-82) and by the yield
method (bonds issued after 7-01-82). We will assume quarterly tax payments.
To find the after tax cost for bonds issued before 7-01-82 we must first find the after tax cost per
quarter by solving equation 5.5 in the book. The annual after tax rate is then obtained by using
equation 5.3. Alternatively, an approximate method can be used by solving equation 5.6 followed
by 5.3. See example 5.6.
To find the after tax cost for bonds issued after 7-01-82 we must find the yield to maturity of
the bond. We solve for ib using
P − (F r)(P/A, ib , N ) − F (P/F, ib , N ) = 0.
Then, for tax purposes, we treat the bond as a loan at interest rate ib , with negative amortization
and a balloon payment. The after tax cash flow is obtained by subtracting from the actual interest
payments, the interest tax benefit, the selling tax benefit, and the annual expense tax benefit. The
effective after tax cost of the bond is computed by finding the interest rate that makes the present
value of the cash flow equal to zero. See Table 5.1. In fact, this is only an approximate method. To
be more precise, we would need to consider quarterly payments of taxes, and semi-annual payments
of interest.
The above method, of finding the yield of the bond, developing a loan schedule, computing the
after tax cash flow, and finding the interest rate that makes the present value of the after tax cash
flow may be time consuming, if an electronic spreadsheet that computes the yield, the loan schedule,
and the after tax cash flow is not available. As for the case of a loan, an alternative method is to
compute the interest rate that makes the present value of the borrower’s before-tax cash flow equal
to zero, and then multiplying it by one minus the tax rate (1 − tm ). Notice that the selling expenses
IEOR4003: Industrial Economics page 23 Professor Guillermo Gallego
and the annual expenses should be taken into account in determining the before tax cash flow. See
page 178, for an example of this alternative method.
Equity Capital.
Issuance of preferred stock carries with it an obligation to pay a stated dividend annually. The
cost of capital is therefore the stated dividend divided by the price of the stock. However, the stock
is often sold at a slight discount, so the actual proceeds from the stock, rather than the stated price,
should be used as the denominator. For example, a 10% preferred stock with a value of $200, which
is sold for $190 per share (5% discount), has a cost of 10/0.95 = 10.53%. The dividends are paid
from after-tax corporate earnings.
Common Stock: The simplest valuation model for common stock considers only dividend
payments. The value of a share to a stockholder immediately after the year 0 dividend is
∞
P0 = DP Sn (1 + ie )−n ,
n=1
DP S1
P0 =
ie − g
resulting in
DP S1
ie = + g.
P0
The growth rate can be estimated by using the ratio of retained earnings to current share price
g = (EP S1 − DP S1 )/P0
so
EP S1
ie = .
P0
Example A corporation plans to raise capital for a new plant through the sale of $2,500,000
worth of common stocks valued at $20 each. If a $1 dividend is planned for the first year and an
appreciation of 9% per year is desired for future dividends. Estimate the cost of capital from this
stock issue.
1
ie = + 0.09 = 0.14.
20
2.
An alternative valuation model for common stocks, called the capital asset pricing model (CAPM),
consists of a regression analysis of the historical returns of a specific stock to the performance of the
market as a whole. The return required for a stock issue is the cost of equity capital ie . The CAPM
states that
ie = if + β(im − if )
where if is the return of a risk free asset such as US treasury bills, im is the return on stocks in a
defined market portfolio by a prescribed index, and β is the volatility factor of the company’s stock.
If β < 1.0, the stock is less volatile than the market. Conversely if β > 1.0, the stock is more volatile
than the market.
IEOR4003: Industrial Economics page 24 Professor Guillermo Gallego
Example: A computer manufacturer has an historic β value of 1.7, the market premium is 5%
and treasury bills are paying 7%. Estimate the cost of the common stock capital.
2.
Here we focus on evaluating single projects. Investment projects and securities (stocks, bonds,
etc. ) require initial outlays in order to provide a later sequence of cash receipts. Investments,
however, are not marketable securities.
We will study several different criteria to evaluate a single-project. For all the criteria we assume:
MARR is known to the decision maker,
stable, perfect capital market, can borrow or lend at MARR,
complete certainty about investment outcomes,
cash flows represent the net after tax cash flows.
Notation:
n index of time period,
i interest rate used for discounting, e.g., MARR or market rate,
Fn net cash flow at the end of period n,
N project life.
Present Value Criterion: The present value at interest rate i is
N
P V (i) = Fn (1 + i)−n .
n=0
The criterion is to accept the project if P V (i) > 0, remain indifferent if P V (i) = 0 and reject if
P V (i) < 0. A positive PV represents a surplus and the project should be accepted if sufficient funds
are available for it.
Future Value Criterion: Compute the future value F V (i) and accept the project if F V (i) > 0,
remain indifferent if F V (i) = 0 and reject if F V (i) < 0.
Annual Equivalence Criterion: Compute the annual equivalence AE(i) and accept the
project if AE(i) > 0, remain indifferent if AE(i) = 0 and reject if AE(i) < 0.
All of the above criteria are equivalent.
Project Balance Concept: Let P B0 = F0 then the project balance at the end of year n at
interest rate i is given by
P Bn = P Bn−1 (1 + i) + Fn , n = 1, . . . , N.
Sometimes we write P B(i)n to emphasize that dependence on i. We can think of P Bn as the value
at time n of the truncated cash flow {Fk : k = 0, . . . , n}. In particular P BN = F V. If P BN > 0,
we can say that the firm recovers the initial capital plus any interest owed, with and end of horizon
profit equal to P BN . If P BN = 0 the firm recovers exactly its investment plus interest owed. If
P BN < 0 the proceeds from the project are not enough to recover the investment plus the interest
owed, so the firm ends up with a loss.
If a project is accepted and the estimates of the cash flows are accurate, then P Bn indicates the
balance that we would observe at the end of period n. A negative value of P Bn indicates that the
investment plus the interest owed are yet unrecovered. If P Bn is positive at time n, then the firm
has recovered the initial investment plus interest owed and has an over-recovered balance. Notice
that the formula that computes P Bn+1 from P Bn assumes that P Bn can be invested at rate i.
Example: Consider the cash flow {−1600, 10000, −10000}. Assume i = 30%. Then P V (30%) =
175.15 > 0, so according to the present value criterion the project should be accepted. The project
IEOR4003: Industrial Economics page 27 Professor Guillermo Gallego
balances are P B0 = −1, 600, P B1 = −1, 600(1.3) + 10, 000 = 7, 920, P B3 = 7, 920(1.3) − 10, 000 =
296 > 0. Notice that F V = P B3 = P V (1.3)2 . Notice also that at the end of year one, the project
balance is positive and that there is a negative cash flow at the end of year 2. Since the project will
not generate more cash, we will need to invest P B1 outside the project. The computations leading
to P B2 assume that we can invest the money at 30%. If we cannot invest the money outside the
project at a sufficiently high interest rate, we may end up loosing money. 2.
This example indicates that one must use the PV criterion with care and must ask: What can
be done with over-recovered balances when they occur? Of course, if the company can actually
reinvest money outside the company at rate j = i, then the present value criterion will be correct.
Often however, the reinvestment assumption does not hold. When this happens, we must be very
careful in using traditional criteria such as present value, future value, annual equivalence. These
criteria assume that we can invest over-recovered balances at the same rate that we use to discount
the project’s cash flow. If the rate at which we can invest over-recovered balances is smaller than
the discount rate, these traditional criteria may lead us to incorrectly accept a project; a mistake
that may have dire financial consequences.
The safest way out, is to pre-specify a conservative rate, say j, at which we can invest over-
recovered balances. This rate may be the weighted average cost of capital, because we can always
use over-recovered balances to pay back debt, and to repurchase outstanding stock. Then, if a rate
i is to be charged to the project, we can compute the project balances as:
P B(i, j)0 = F0
and
P B(i, j)n−1 (1 + i) + Fn if P B(i, j)n−1 ≤ 0
P B(i, j)n =
P B(i, j)n−1 (1 + j) + Fn if P B(i, j)n−1 > 0.
Notice that P B(i, j)N is the amount of money (possibly negative) that the firm will have at the
end of the project after repaying the investment and the interest owed. The obvious, and correct
IEOR4003: Industrial Economics page 28 Professor Guillermo Gallego
criterion, is to accept the project only if P B(i, j)N > 0. This is the only criterion that I will
recommend. However, because most people use one or more of the traditional criteria that we have
discussed before, we need to make an effort to determine the conditions under which we can trust
these criteria.
It easy to show that P B(i, j)n is decreasing in i and increasing in j for all n (the terms increasing
and decreasing are to be interpreted in the non-strict sense). This is true, in particular, for P B(i, j)N ,
The intuition behind this assertion is that the final balance, at time N, after repaying the capital
and the interest owed will decrease as the interest rate charged to the project increases. Likewise,
the final balance increases as the interest earned on over-recovered balances increases.
Because F0 < 0, the initial balance is negative. If the interest rate charged to the project is very
large, then all the balances will be non-positive, and we will never have an occasion to invest at rate
j. Consequently, for large values of i, the balances P B(i, j)n are independent of j. Let imin be the
smallest value of i for which P B(i, j)N is independent of j. Thus for i < imin , there will be at least
one period with an over-recovered balance, whereas if i > imin all the balances, except perhaps the
last one, will be unrecovered. We divide the interval [0, ∞) into two regions: [0, imin ) and [imin , ∞).
The first region is called the mixed region, while the second region is called the pure region. If i (the
interest rate that is charged to the project) falls into the pure region, there are no over-recovered
balances, and so the decision to accept or to reject the project based on present value, future value,
or annual equivalence will be correct. On the other hand, if i falls into the mixed region, the present
value, the future value, and the annual equivalence method will yield an incorrect value unless j = i.
These incorrect values can lead us to accept a bad project. In summary, the present value, the future
value and the annual value criteria can be safely applied only if either j = i or if i > imin .
The Internal Rate Of Return Criterion:
The “internal” rate of return of a project with cash flow {Fn }N ∗
n=0 is “the” rate i such that
∗ ∗ ∗ ∗
P V (i ) = 0. Notice that P V (i ) = 0 implies AE(i ) = F V (i ) = 0. The criterion is to accept the
project if i∗ > M ARR, reject if i∗ < M ARR and to remain indifferent if i∗ = M ARR.
The reason we quote the word “the” in the definition is that a root satisfying P V (i) does not
always exists, and even when it does, it is not always unique. Later we will see why we also have the
word “internal” under quotes. Finding “the” internal rate of return of the cash flow is equivalent
to finding the roots of a polynomial in x = (1 + i)−1 . This can be done by a number of methods
including bisection and Newton’s method.
When is there a unique positive i∗ such that P V (i∗ ) = 0? To answer this question we need a
few definitions. An investment is a cash flow with F0 < 0; a conventional (or simple) investment is
an investment with only one change in the sign of the
Ncash flow. A potentially profitable investment
has a positive sum of net cash flows, i.e., P V (0) = n=1 Fn > 0.
Proof: Because the cash flow is potentially profitable, we have P V (0) > 0; because the cash flow
represents an investment we have P V (∞) = F0 < 0; consequently there is a sign change on P V (i).
Finally, because P V (i) is a continuous function of i, there must be a positive interest rate, say i∗ ,
at which the present value vanishes. To show that i∗ is unique note that because the investment
is conventional the cash flow has one sign change, and from Descartes’ rule of signs we know that
P V (i) can have at most one sign change. 2.
Because of the above proposition, we can confidently apply the IRR criterion to a potentially
profitable conventional investment. One way to interpret the internal rate of return is to think of
it as the largest interest rate at which you can borrow money to finance the project and still break
even. Alternatively, you can think of the internal rate of return as the interest rate earned on the
unrecovered project balance of an investment. You should not, however, think of the internal rate
of return as the interest earned on the initial investment.
Can we extend the class of investments for which the IRR criterion is meaningful? The answer lies
on whether or not the project has over-recovered balances at the largest root that makes the present
IEOR4003: Industrial Economics page 29 Professor Guillermo Gallego
value equal to zero. If there are no over-recovered balances, then we can meaningfully interpret that
rate as the IRR and apply the criterion. Otherwise, if there are over-recovered balances, we need to
know at what rate we can invest these balances before we can say something meaningful.
Let us call an investment pure, if it has no over-recovered balances at its largest root. Thus an
investment is pure if there exist an interest rate, say i∗ , such that P V (i∗ ) = 0, P V (i) = 0 ∀i ≥ i∗ ,
and P Bn (i∗ ) ≤ 0 n = 0, . . . , N − 1. An investment that is not pure is called mixed. It is easy to
show that all conventional investments are pure.
An alternative way of deciding whether a project is pure or mixed is to find
.
imin = inf{i : P B(i)n ≤ 0, n = 1, . . . , N − 1},
and to evaluate P B(imin )N . If P B(imin )N > 0 then i∗ > imin, and the investment is pure, if
P B(imin )N = 0 then i∗ = imin, and the investment is pure, if P B(imin )N < 0 then i∗ < imin , and
the investment is mixed.
As mentioned above, once a project is classified as mixed, we cannot say something meaningful
unless we specify a reinvestment rate. Let j be the rate at which over-recovered balances can be
invested, and let i(j) be the interest rate at which the ending balance is zero. That is, i(j) is defined
implicitly by the formula:
P B(i(j), j)N = 0.
We can think of i(j) as the largest rate at which the firm can borrow money to invest in the project
and still break even. Notice that if i is the interest rate charged to the project, then the project is
profitable only if i < i(j). This is because
Proposition 2 The rate i(j) exists and is unique. Furthermore i(j) is increasing in j.
Proof: Notice that P B(0, j)N > n Fn > 0, and that F0 < 0 implies that for sufficiently large inter-
est rate the ending balance is negative, i.e., limi→∞ P B(i, j) < 0. Because P B(i, j)N is continuous,
there must be a positive rate i(j) such that P B(i(j), j) = 0.
To see that this rate is unique, just note that for fixed j, P B(i, j)N is strictly decreasing in i, so
it cannot admit two different roots.
Finally, to see that i(j) is increasing in j note that
∂P B(i,j)N
∂i(j) ∂j
= − ∂P B(i,j) > 0.
∂j N
∂i
2.
We can obtain bounds on i(j) by observing P B(imin , j)N . If P B(imin , j)N ≥ 0, then, since
P B(i, j)N is decreasing in i and is negative in the limit, we must have i(j) > imin . Recall that
projects for which P B(imin , j)N > 0 are called pure. In this case we can identify i(j) with the
largest root of P V (i) = 0, and the internal rate of return criterion makes sense. However, if
P B(imin , j)N < 0, then the root i(j) must occur to the left of imin , and therefore the project has
over-recovered balances. These projects are called mixed, and for mixed projects the use of the
internal rate of return criterion can lead us to accept a bad project.
In addition to the Return on Investment Capital Criterion, there are a number of “modified rate
of return” methods that can be shown to be consistent with present value. These methods make
IEOR4003: Industrial Economics page 30 Professor Guillermo Gallego
consistent assumptions about the interest rate at which intermediate receipts from projects may be
invested. The reinvestment could be either in other projects or in the outside market.
Solomon’s Average Rate of Return (ARR)
This method assumes F0 = −C0 < 0, Fn ≥ 0, n = 1, . . . , N, and that the net revenues are
reinvested at rate i, letting them accumulate until time N. It then seeks the rate, say s, which C0
needs to earn to reach the same accumulated value by time N. Thus
N
C0 (1 + s)N = Fn (1 + i)N −n
n=1
so
1/N
N
n=1 Fn (1 + i)N −n
s = − 1.
C0
The criterion is
If s > i = MARR, accept,
if s = i, remain indifferent,
if s < i, reject.
It is easy to see that this criterion is consistent with present value. See example 6.8.
Modified Internal Rate of Return (MIRR)
This method allows the Fn s to be of any sign. For any number x, let x+ = max(x, 0), and
x− = max(−x, 0), note that x = x+ − x− and also that x+ ≥ 0, x− ≥ 0. Given a cash flow {Fn },
let Gn = (Fn )+ and Hn = (Fn )− , n = 0, . . . , N. Thus, Fn = Gn − Hn , ∀n so the cash flow {Fn } can
be written as the difference of {Gn } and {Hn }. Notice that the cash flow {Gn } corresponds to net
inflows, and the cash flow {Hn } correspond to net outflows.
The MIRR method first computes the single outflow at time zero, say H, that is equivalent to
the cash outflows {Hn } at rate i, and the single inflow at time N, say G, that is equivalent to the
N
N
cash inflows {Gn } at rate i. Thus H = n=0 Hn (P/F, i, n), and G = n=0 Gn (F/P, i, N − n). Once
H and G are computed, we ask for the rate, say j, at which H needs to be invested at time zero to
accumulate to G by time N. That is,
H(1 + j)N = G,
so 1/N
G
j= − 1.
H
The MIRR criterion is
If j > i = MARR, accept,
if j = i, remain indifferent,
if j < i, reject.
Again, it can be shown that MIRR is consistent with present value. The fact that ARR and MIRR
are consistent with present value does not preclude the possibility of, as we have seen, PV itself
being wrong when the re-investment assumption at i does not hold.
Benefit-Cost Ratios Another way to express the worthiness of a project is to compare the
“benefits” with the “costs” and accept the project if the “benefits” are larger than the “costs”.
There are at least three different benefit-cost ratio methods, and they differ on what is meant by
“benefit” and what is meant by “costs”.
IEOR4003: Industrial Economics page 31 Professor Guillermo Gallego
Assume that Fn = bn − cn is the net after tax cash flow, where bn (resp., cn ) represents the
revenue (resp., expense) at the end of period n. Now, let
N
B= bn (1 + i)−n
n=0
and
N
C= cn (1 + i)−n .
n=0
So, B (resp., C) represents the present value of the cash inflows (resp., outflows).
The Aggregate B/C Ratio is defined as
B
RA = .
C
An alternative expression for the benefit-cost ratio, is to separate the present value of the cash
outflows into two parts. The first part, say I, is considered to be the “initial” investment while the
second part, say O = C − I, consists of annual operating and maintenance costs. Notice that I may
consists of the present value of cash outflow during several periods.
The Netted B/C Ratio is defined as
B−O
RN = .
I
Finally, the Lorie-Savage Ratio is defined as
B−C
L−S = .
I
All of the above benefit-cost ratios are equivalent to the PV criterion for single project acceptance.
Payback Period
A popular rule-of-thumb method for evaluating projects is to determine the number of periods
needed to recover the original investment. The method favors projects with small payback periods.
It can be shown through examples, that a blind application of the payback period method may
lead us to reject good projects. This is, in part, because the payback period method ignores the
time value of money, but mainly because it ignores the cash stream after the investment is initially
recovered. However, the payback period may serve as a secondary criterion. For instance, it may be
better to select a project with present value equal to $1,000,000 with a 2 year payback period, than
a project with present value equal to $1,001,000 with a 30 years payback period.
Time-Dependent Measures of Investment Worth
Recall the definition of P B(i, j)n , the project balance at time n where an interest rate i is charged
to the project, and over-recovered balances are invested at rate j. Among projects that have the
same life N, and the same ending balance P B(i, j)N , we may prefer those for which the area of
negative balance
N
−1
ANB ≡ P B(i, j)−
n
n=0
Such projects tend to give the investor greater flexibility, in that they tend to tie-up the in-
vestment for a shorter period of time, or to generate positive cash balances for a longer period of
time.
Notice, that our definitions of ANP and APB generalizes the one given in the textbook when
j = i and P B(i, j)n has a unique change in sign as n increases.
IEOR4003: Industrial Economics page 33 Professor Guillermo Gallego
In the last few lectures, we have been concerned with different criteria to determine the worthiness
of a single-project. We now turn our attention to the problem of selecting one among a set of mutually
exclusive alternatives. Each alternative consists of a subset of the projects in consideration and, as
a rule, the do-nothing alternative is always included among the set of alternatives.
For example, if projects A, B and C are being considered, then the full set of alternatives consists
of the 8 subsets of {A, B, C}. Sometimes, however, we can reject some of the alternatives at the outset
because projects may exhibit some dependencies. For example, if B represents a die that can only
be used with machine A, then it does not make sense to consider subsets of {A, B, C} where project
B appears without project A. As a second example, if A is a machine, and B and C are projects that
require the full use of A, then it may not make sense to consider alternatives where both projects B
and C are present.
Once the set of feasible alternatives is considered, our problem is to select one and only one of
them. Before we go ahead and prescribe a way to select an alternative let us note that, in this
chapter, the decision is made without budgetary considerations. In addition, the decision is made
under the assumption that we can earn money at the rate we use to discount the after tax cash
flows. Under these conditions, if it is deemed appropriate to use the same discount rate for all the
alternatives, it is not difficult to show that, the best alternative is the one with the largest present
value.
As we have discussed earlier, the above assumptions do not always hold in practice. The results
of this section may still be valid, however, if the chosen alternative does not violate any existing
budget constraints, and if there are no over-recovered balances throughout its life.
Some of the criteria developed in Chapter 6 are harder to apply, e.g., rate of return methods and
benefit-cost ratio methods. The reason is that these methods are relative measures of investment
worth. To see this, suppose that your cost of money is 18% and you have to select between the
following two investment alternatives. Alternative 1 requires an initial investment of $1,000, and
pays back $1,500 at the end of one year. On the other hand, alternative 2 requires an investment of
$10,000 and pays back $12,000 at the end of one year. The internal rate of return of alternative 1
is 50%, while the internal rate of return of alternative 2 is only 20%. Clearly both alternatives are
desirable if the cost of money is 10%; moreover, alternative 1 seems to be more attractive from the
internal rate of return point of view. Yet, present value calculations yield $363.63 for alternative 1
and $909.09 for alternative 2. Thus, we have an example of opposite ranking. One reason for this
is that the alternatives cannot be scaled. If we could take alternative 1 10 times, then we would
invest $10,000 and would receive $15,000 at the end of one year. In that case, both the present
value and the rate of return methods would agree. However, as mentioned above, mutually exclusive
alternatives cannot be scaled. A way to obtain results that are consistent with present value is to
evaluate the alternatives in some specific order. For potentially profitable conventional investments,
we may sort the alternatives in non-decreasing order of the sum of their cash flows. For the above
example, alternative 1 comes before alternative 2. Alternative 1 is accepted because its internal rate
of return (50%), is higher than the cost of money (10%). Should we give up alternative 1 to take
alternative 2? That is equivalent to investing an additional $9,000 to earn $10,500 at the end of one
year. The internal rate of return of this marginal investment is 11.67%, and since this rate of return
is larger than the cost of money we should give up alternative 1 and take alternative 2 instead.
A similar incremental approach can be used with other rate of return methods, as well as with
the benefit-to-cost ratio methods. The ordering of the projects that should be used with the different
methods is given in page 246 of your book. Our recommendation, is to stay away from incremental
methods.
If the lifetime of each alternative is finite, and a reinvestment assumption is made, then the project
balance method can be used. The criterion is to select the alternative with largest P B(i, j)N where
N represents the last period in which an alternative has a non-zero cash flow.
Some projects may have a very long life, and may necessitate periodic replacement of an asset.
IEOR4003: Industrial Economics page 34 Professor Guillermo Gallego
It is common to assume that the asset will be replaced by an identical asset at the same cost. This
is the repeatability assumption. Implicit in this assumption, is that we are working with constant,
as opposed to real, dollars. Therefore, it is appropriate to use an inflation-free discount rate. It is
also possible to model technological changes. See the book of Canada & White for a few of these
models.
On the other hand, a project may have a shorter life than the asset useful life. In this case,
careful consideration of salvage values are important.
IEOR4003: Industrial Economics page 35 Professor Guillermo Gallego
The objective of the capital budgeting process is the selection of an optimal portfolio of invest-
ments from a set of alternative investment proposals. An optimal portfolio of investments is
defined as the set of investments that makes the greatest possible contribution to the achievement of
the organization’s goals, given its constraints. The constraints faced by a corporation in the capital
budgeting process can include limited supplies of capital or other resources (personnel, for example)
and dependencies between investment proposals. A dependency occurs if two projects are mutually
exclusive–acceptance of one requires the rejection of the other–or if one project can be accepted only
if another is accepted. Assuming that the organizational goals and constraints can be formulated
as linear functions and that the future cash flows associated with the investment proposals can be
forecast, the optimal portfolio of investments can be found using linear programming (LP).
An advantage of linear programming is that it leads to better solutions than alternative methods.
Ranking is the most common alternative method. Some of the limitations of ranking methods include
the inability to handle either multiple-resource constraints or interrelationships between the proposed
investment alternatives.
A limitation of LP models for capital budgeting is portfolios of investment proposals which may
contain fractional investment proposals. The optimal solution may, for example, include buying a
third of an oil tanker. If the optimal set of investments identified through linear programming is
infeasible because of fractional projects, integer programming should be used. Despite the “fractional
projects” limitations, solutions to LP problems have meaningful interpretations which we will discuss
later on.
As an example, consider Corporation ABC. ABC must determine the most profitable mix between
two competing investment projects subject to budgetary constraints. Let x1 and x2 be the number
of units of Investment 1 and 2 to be funded. Suppose that the present value per unit of Investment
1 is $6 and the net present value per unit of Investment 2 is $3. Suppose that Investment 1 requires
$2 of cash per unit in year 1, and $4 per unit in year 2, while Investment 2 requires $3 of cash per
unit in year 1, and $1 per unit in year 2. The total funds available in year 1 are $12, and the total
amount of cash available for investment in year 2 is $8. Allied wants to maximize net present value.
The linear program can be written as
The optimal solution is Z ∗ = 6(1.2) + 3(3.2) = $16.8. If fractional projects are not allowed then we
must require x1 and x2 to be integers, and solve the corresponding integer programming problem.
In this case the optimal solution is Z ∗ = 6(1) + 3(3) = $15.
To state the problem in more general terms, the objective of the linear programming in capital
budgeting is the allocation of limited resources among competitive activities (capital investments)
in such a way that some measure of effectiveness involving these activities is optimized. Because the
available resources are limited in quantity, the problem faced by the decision maker is the selection
of the most effective mix of activities.
The choice of the objective function depends on the type of data available regarding the invest-
ment proposals under consideration. For example, if quantifiable data are available regarding the
covariance between expected returns from the investment proposals, then minimization of the ex-
pected risk should be considered as the objective function. In that case, however, a related technique
called quadratic programming should be used. If such data are unavailable, as in most capital bud-
geting problems, the objective function should be maximization of either the total net present value
IEOR4003: Industrial Economics page 36 Professor Guillermo Gallego
or the total value of the investment proposals at a specific time in the future. If we can estimate
how long we will be involved with the investment proposals, we should maximize the total value of
the investment proposals at a future time (that is, at the end of our expected involvement with the
investment proposals). If we cannot reasonably estimate how long we are expected to be involved
with the investment proposals, we should choose to maximize total net present value.
The limits on both the availability of resources and on the possible relationships among some of
the decision variables are represented by equations or inequalities called constraints. The problem
is to find the values for the decision variables that result in maximization of the objective function
subject to the constraints.
To be an LP problem , the objective function and the constraints all must be linear functions.
Stated mathematically, the purpose of the LP model is to determine the value of the decision vector
x that maximizes
z = c x
subject to Ax ≤ b
x ≥ 0.
Linear programs can be solved by a variety of methods, including the so called interior point
methods. The most common method, however, is the simplex method; an algorithm that moves
from vertex to vertex of the polyhedron {x : Ax ≤ b, x ≥ 0}, while improving the objective value,
until no further improvement is possible.
Efficient versions of the simplex method capabale of solving moderate size problems are now
available in spreasheets such as Quattro Pro 6.0 for Windows and Excell 6.0 for Windows. Alter-
natively, commercial codes such as LINDO, and GAMS are also available. You should learn how to
use one of these codes to solve the homework problems that I will be assigning.
In a typical constrained capital budgeting problem, there are limits on the amounts of certain
resources available. Examples of such constraints are limits on trained manpower available, limits
on certain materials, and limits on the amount of capital available.
Capital rationing is a frequently encountered constrained capital budgeting problem in which the
amount of capital available for investment is limited. We will first study the pure capital rationing
with no lending or borrowing allowed. We then present a model that allows both lending and
borrowing. The intermediate models that allow borrowing but not lending or viceversa, can be
obtained from the later model, by eliminating the lending or borrowing opportunities.
Pure Capital Rationing, with No Lending or Borrowing Allowed
It should be noted that many managers actually operate in a pure capital rationing environment
once the budget has been determined during the planning stage. Therefore, pure capital rationing
is a realistic assumption in many cases.
The linear programming problem is
J
Z = max pj xj
j=1
J
subject to − anj xj ≤ Mn , n = 0, · · · , N
j=1
xj ≤ 1, j = 1, · · · , J
0 ≤ xj j = 1, · · · , J,
where
It can be shown that there exist an optimal solution to this LP with at most N + 1 fractional
projects, i.e., at most N + 1 of the x variables will be different form zero or one. Of course, if all the
variables are non-fractional, then the solution is optimal. Even if there are some fractional projects,
the LP solution may still be useful in a number of ways. First, the optimal value of the objective
function, say Z ∗ , is an upper bound on the value of the optimal integer solution. Second, valuable
information can be obtained showing the impact on the objective function of making small changes
to the budget values. Such information is obtained by looking at the solution to the “dual” linear
program. Such a solution can be readily obtained from the solution of the original, or “primal”
problem, and it is part of the standard output of most commercial codes. The dual information
may indicate, for example, that the budget for year one is larger than needed, and that there is
a high return associated with an increased budget in year 2. On the basis of the dual solution,
management may decide that it is in its best interest to revise the budget. Thus, the solution to the
linear program, and its associated dual, may be a very helpful device to management in deciding
the budget levels.
The dual of the LP can be written as:
N
J
v = min ρn M n + µj
n=0 j=1
N
subject to − anj ρn + µj ≥ pj , j = 1, · · · , J
n=0
ρn ≥ 0, n = 0, · · · , N
µj ≥ 0 j = 1, · · · , J,
J
Here the ρn is the dual variable corresponding to the budget constraint − j=1 anj xj ≤ Mn ,
and µj is the dual variable associated with the constraint xj ≤ 1.
At the optimal solution, say x∗ , ρ∗ , µ∗ , the complementary slackness conditions must hold. These
conditions state that
J
ρ∗n Mn + anj x∗j = 0, ∀n,
j=1
N
x∗j µ∗j − pj − anj ρ∗n , ∀j,
n=0
N
pj + anj ρ∗n ≥ 0.
n=0
IEOR4003: Industrial Economics page 38 Professor Guillermo Gallego
N
pj + anj ρn = 0.
n=0
we have
N
pj + anj ρn ≤ 0.
n=0
The economic interpretation is that the present value of a project plus the sum of the cash flows
over the horizon “discounted” by the dual variables is non-negative for a fully accepted project, is
exactly zero for a partially accepted project, and it is non-positive for a rejected project.
Notice that constraints on scare resources other than capital can easily be added to the above
LP formulation.
Capital Budgeting
The pure capital rationing problem was developed under the assumption that the firm had no
access to external markets, that is, it could neither borrow nor lend money. Including borrowing
opportunities, without including lending opportunities, while keeping present value as the objective
function may lead to situations where the firm borrows money at a rate that is higher than the
discount rate, see example 8.2. Borrowing at a rate higher than the discount rate may happen when
a highly profitable project is not fully taken because of tight budget constraints as shown in the
following example.
Example: Suppose that the discount rate is 20%, and that there is a single project which requires
an initial investment of $1000 and pays back $1300 at the end of one year. Suppose further that the
investment budget is $900, but that borrowing at 25% is possible. Should the firm borrow the $100
needed to take the full project? If the firm borrows $100 to take the project it will collect $1,175 at
the end of the year. This results in a present value of $1,175/1.2 = $979.16 which is larger than the
present value of taking 90% of the project and also larger than the present value of the do nothing
alternative. The explanation lies in the fact that the total cost of securing $1000 for one year is at
most $205 = ($180 = $900(.2)) + ($25 = $100(.25)), while the project returns $300. 2.
There are, however, some inconsistencies in present value models that involve borrowing and
lending that are not easy to overcome. See the discussion in section 8.3.2. Thus, when borrowing
and lending opportunities are present, it is best to look at the horizon models. The horizon, is
typically the number of periods for which the firm plans to be involved in the projects. In these
models the objective function is the value, at the horizon, of the cash flows occurring after the
horizon of the selected projects plus any cash that is available for lending minus any debt at that
horizon. The budget constraints, are now cash balance equations that take into account borrowing
and lending during the period as well as collection and payment of earlier lending and borrowing
activities.
The simplest horizon model has a borrowing rate equal to the lending rate. See Table 8.6 for the
detailed formulation, and Table 8.8 for the dual formulation. One peculiarity of this model is that
the set of accepted projects is independent of the budget levels. In fact, under this model we would
accept all projects with positive PV, and reject all projects with negative present value regardless
of the budget levels. This is not surprising given that we can borrow and lend at the same rate.
Notice however, that the value of the objective function is non-decreasing in the budget levels.
IEOR4003: Industrial Economics page 39 Professor Guillermo Gallego
One extension is to have borrowing rates higher than lending rates and to impose upper bounds
on borrowing during each budget period. This would involve adding a constraint of the form
wn ≤ Wn , ∀n,
where Wn is the limit on borrowing during year n, that is, the firm’s credit.
Another extension is to allow different borrowing rates during each period. The firm will first
borrow from the least expensive source, and will then borrow at a higher rate once this source is
exhausted. The cost of borrowing is assumed to have the shape of a step function; the larger the
amount borrowed, within limits, the higher the interest rate. Let ri be the interest rate that applies
to borrowing an amount greater than Wi−1 and less than or equal to Wi . A firm will borrow at
interest rate ri if it exhausts the limits placed on its borrowing at lower interest rates. Let vn be
the lending amount from time period n to n + 1. The LP formulation is thus
J
I
Z = max âj xj + vN − wiN
j=1 i=1
J
I
− a0j xj + v0 − wi0 ≤ M0
j=1 i=1
J
I
I
− anj xj − (1 + r)vn−1 + vn + (1 + ri )wi,n−1 − win ≤ Mn ∀n
j=1 i=1 i=1
win ≤ Win ∀i, ∀n,
0 ≤ xj ≤ 1.
Notice that the model with only one borrowing and one lending rate, see Section 8.5.2, corre-
sponds to the case r = rl < r1 = rb , with W1n = ∞ ∀n.
For the case with only one borrowing source with ulimited capacity we can obtain the following
results by studying the dual problem and the complementary slackness conditions. For a project
that is partially selected (0 < xj < 1) we must have
N
âj + anj ρn = 0.
n=0
Finally, if there is a limit Wn on the amount we can borrow at time n then the bound on ρn is
modified to
(1 + rl )ρn+1 ≤ ρn ≤ (1 + rb )ρn+1 + βn ,
where βn is the dual variable corresponding to the constriant wn ≤ Wn .
So far we have avoided the issue of integer solutions in capital budgeting problems. The reason
for this is that the LP solution, and its dual, is a useful tool to set budget limits. Once the budget
limits are firmly set, we can no longer accept solutions with fractionally accepted projects. Notice
that the LP formulation of the basic horizon model with equal borrowing and lending rates always
admits an optimal integer solution. This is because, under this formulation, all projects with positive
present value are accepted. Thus, the integrality constraints are of no concern when the borrowing
and lending rates are equal. This is not the case, however, when the borrowing and lending rates
are different, so more work is required to find an optimal integer solution. One technique that is
commonly used to solve integer programs is called Branch-and-Bound. The technique starts with an
optimal solution, say x∗ , z ∗ for the linear program. Here x∗ represents the project selection vector,
and z ∗ is the value of the objective function. If all the components of x∗ are either zero or one we
are done. Otherwise, we pick a component, say j, of x∗ such that x∗j is fractional and create two
subproblems, one with xj = 0 added as a constraint, and another with x∗j = 1 added as a constraint.
We then proceed to solve these subproblems by an LP algorithm. For each of the subproblems,
we repeat the procedure of identifying a fractional component, if such exists, and of creating new
subproblems by adding constraints setting the fractional variable to zero and to one. This is the
branching part of the algorithm. Conceivably we could follow the branches out of each subproblem
until all the components are zero or one, and then pick the integer solution with the largest objective
value. But this is working too hard! A better way is to record the solution of the subproblems and
the corresponding value of the objective function. Remember that the value of the objective function
of an LP solution is an upper bound on the value of the objective function of an optimal integer
solution. If at any point we find that the LP solution of a subproblem has an objective value that
is smaller than a previously obtained integer solution, we know immediately that we need not look
any further into this branch. This is sometimes called fathoming the branch. This is the bound part
of the algorithm. Different implementations of branch and bound differ in the way they look at
the branches. The most common is depth first search in which a branch is followed until an integer
solution is obtained, or until the branch can be fathomed.
IEOR4003: Industrial Economics page 41 Professor Guillermo Gallego
At the beginning of the course I asked the class to select between $10 for certain, and the
opportunity of earning $25 with probability 50%. Most of the class recognized that the gamble is
better in expectation, and the majority decided to take the risk. However, when I scaled the stakes
so that the choice was between $1,000,000 for certain, and an opportunity to earn $2,500,000 with
probability 50%, most of you took the $1,000,000 inspite of the fact that the gamble had a higher
expectation.
To further illustrate this behavior, consider the following game. A fair coin is tossed until the
first time a head occurs. If it takes n tosses to obtain the first head the payoff to the player is $2n
dollars. The probability that it will take n tosses to obtain the first head is 21n . Consequently, the
expected payoff is
∞ ∞
2n
= 1 = ∞.
n=1
2n n=1
We have shown that this game has an expected reward equal to plus infinity. What is the maximum
amount that you would be willing to pay to play this game?
Why is it, that rational people do not always act to maximize expected rewards? One explanation
is that many opportunities can only be taken once, so that we do not have a chance to observe the
expected value. This is true, but it does not account for the fact that scale effects may reverse
decisions as in the first example above.
Economist have tried to account for this behavior by postulating that decision makers are risk
averse. A risk averse individual will always prefer less risk, and he may even pay a premium to
reduce the risk. A homeowner that takes fire insurance is risk averse because he knows that he
is being charged more than the expected cost of repairing his home. Those of you who prefer a
$1,000,000 for certain than a gamble that pays $2,500,000 with probability one half are also risk
averse. One difficulty with this theory is that it does not explain why people who are risk averse are
willing to bet in a casino where the odds are against them. One explanation is that most risk averse
individuals will only bet relatively small amounts and they see the expected losses from playing as
a random cost for their entertainment.
To make things formal, two famous economist, Von Neumann and Morgenstern, came up with a
set of axioms of behavior that lead to the existence of a utility function. Once the utility function
has been found, they claim, the decision maker can make decisions based on maximizing expected
utility.
Risk√averse investors have an increasing concave utility function. For example, u(x) = ln(x) and
u(x) = x are two example of increasing concave utility functions. The increasing concavity reflects
a decreasing incremental value from each incremental dollar of wealth.
If X is a random variable, and u(·) is the utility function, then the expected utility is
Eu(X).
Eu(X) ≤ u(EX).
This means that an individual with a concave utility function u(·), would prefer to have EX, for
certain, than to face the random variable X. In other words, the individual will take less than EX
for certain, rather than facing X. The exact amount that they will take for certain is called the
certainty equivalent, CE. This quantity is defined by
u(CE) = Eu(X),
or by
CE = u−1 (Eu(X)) .
IEOR4003: Industrial Economics page 42 Professor Guillermo Gallego
Notice that CE is well defined if u(·) is increasing, since u(·) can be shown to be continuous. The
difference between EX and CE is called the risk premium. For example, if X is a random variable
taking value
√ $0 with probability 0.5, and $1,000,000 with probability 0.5 then EX = $500, 000. If
u(x) = x, then √
Eu(X) = $500 = CE,
so CE = $250, 000, and the risk premium is RP = EX − CE = $250, 000. In this example, the
investor would prefer to take any amount larger or equal to $250,000 than the random variable X
which has expectation $500,000.
Homework: What is the expected utility of the coin tossing game if the utility function is u(x) =
b ln(x/a)? What is the certainty equivalent? Does it depend on a, and b?
It is clear that the shape of the utility function depends on the initial wealth. Buying fire
insurance on a $250,000 house makes more sense for an individual with a net wealth of $250,000,
than for an individual with net wealth of $10,000,000. The latter can probably afford self insurance.
To measure the degree to which risk averseness depends on wealth, W, economist have come up
with the risk aversion function
−u (W )
r(W ) =
u (W )
which is defined for utility functions with first and second derivatives. It is clear that u (W ) > 0,
since we have postulated u(·) to be increasing. As W increases, we would expect u(·) to flatten
out, so r(W ) would decrease. If this is the case, then we say that the individual has decreasing risk
aversion. For example, if u(W ) = ln(W ), then r(W ) = 1/W, and r (W ) < 0, so an individual with
utility function u(W ) = ln(W ) is risk averse, with decreasing risk averseness. It can be shown that
an individual with such a utility function, pays a decreasing risk premium as his wealth increases.
An individual’s utility function, and his behavior can sometimes tell us something about his
subjective probability distribution of a random variable.√For example, suppose an individual with
initial wealth of $100,000 has a utility function u(x) = x and is willing to bet $10,000 on a ball
game. Let p be his probability of winning. If he wins his wealth will be $110,000, if he losses his
wealth will be $90,000. Since he prefers to bet, then
p 110, 000 + (1 − p) 90, 000 ≥ 100, 000,
so √ √
100, 000 − 90, 000
p≥ √ √ = 51.25, %
110, 000 − 90, 000
so his subjective probability of winning must be at least 51.25%.
Suppose that an individual has initial wealth W, and an increasing concave utility function u(·).
He is concerned about a catastrophic event that can occur with probability p, and would cause him
a loss of a dollars. What is the maximum amount that this individual would be willing to pay for
an insurance that will cover his losses if the catastrophic event occurs? Let us assume that cost of
insurance is b dollars. If he takes insurance, his wealth will be W − b, whether the catastrophic event
occurs or not, and the utility that associate with such an action is simply u(W − b). If he does not
take insurance, then he faces a random variable taking value W − a with probability p, and value
W with probability 1 − p. The expected utility of this action is pu(W − a) + (1 − p)u(W ). If the
individual acts to maximize his expected utility then he will take insurance as long as
or equivalently as long as
.
b ≤ W − u−1 (pu(W − a) + (1 − p)u(W )) = b̄.
We conclude that the maximum amount that the individual will pay is b̄.
IEOR4003: Industrial Economics page 43 Professor Guillermo Gallego
so r > p. Given r > p, the individual must decide how much coverage to buy. If he buys B dollars
of coverage, then his wealth will be equal to W − Br if the catastrophic event does not occur, and
will be W − Br − a + B if the catastrophic event occurs. His expected utility, as a function of B, is
.
g(B) = pu(W − Br − a + B) + (1 − p)u(W − Br).
The problem is to maximize g(B) over the region 0 ≤ B ≤ a. It can be shown that g(·) is concave.
If there is an interior point, say B ∗ , satisfying g (B ∗ ) = 0, then B ∗ is optimal. Taking the derivative
of g(·) and equating it to zero results in
u (W − Br − a + B) r(1 − p)
= > 1.
u (W − Br) p(1 − r)
u (W − B ∗ r − a + B ∗ ) > u (W − B ∗ r),
W − B ∗ r − a + B ∗ < W − B ∗ r.
Eu(X) > Eu(Y ). Notice, that this choice is not altered if his utility function is replaced by au(·) + b,
EX
with a > 0; hence utility functions are equivalent up to affine transformations. By taking a = u(EX) ,
.
and b = 0, if necessary, we can assume without loss of generality that u(EX) = EX = µ. Expanding
the utility function around µ, we obtain
Based on this second order approximation, the decision maker prefers the random portfolio X to Y,
if
0.5u (µ)V ar(X) > 0.5u (µ)V ar(Y ).
If the investor is risk averse, he will prefer X to Y, with EX = EY if V ar(X) < V ar(Y ). This result
is part of what is called mean-variance analysis. In words, risk averse investors prefer the portfolio
with the smallest variance among all portfolios with the same expected return. Hence the variance
and its square root, the standard deviation, are taken to be measures of risk.
The other part of the mean-variance analysis, states that all investors will prefer the portfolio
with the highest expected return among all portfolios with the same variance. This part of the mean
variance is less transparent than the first, yet it can be derived without making any assumption
on the sign of the second derivative of u(·). Let µx = EX, µy = EY, and assume that V ar(X) =
.
V ar(Y ) = σ 2 . Now, expanding the utility function around µ = 0.5(µx + µy ), we find that
Eu(X) u(µ) + (µx − µ)u (µ) + 0.5(σ 2 + (µx − µ)2 )u (µ),
notice, that by the choice of µ, the last term vanishes when we take the difference of the expected
utility, so
Eu(X) − Eu(Y ) u (µ)(µx − µy ).
Thus, since u (µ) > 0, this second order approximation indicates that Eu(X) > Eu(Y ) if µx > µy .
To summarize, a second order approximation of the utility function indicates that risk averse
investors will select the portfolio with the smallest variance among all portfolios with the same mean,
and that all investors will select the portfolio with highest return among all portfolios with the same
variance.
The above is helpful in comparing portfolios with the same variance, and portfolios with the same
expected return. More generally, we can ask for the set of portfolios that have the same expected
utility as measured by the second order approximation. Plotting the set we obtain iso-utility curves
in the mean-variance space, or in the mean-standard deviation space. This iso-utility curves have a
positive slope indicating that a higher return is necessary to compensate a higher risk.
IEOR4003: Industrial Economics page 45 Professor Guillermo Gallego
We have already discussed mean-variance analysis in the context of utility theory. The two
mean-variance rules are: (i) Select the project with the lowest variance among projects with the
same expected return, and (ii) Select the project with the largest return among projects with the
same variance. Recall that both rules were obtained by approximating the utility function by its
second degree Taylor expansion, under the assumption that individuals act to maximize expected
utility. Rule (i) was established under the additional assumption of risk averseness, while rule (ii)
did not require risk averseness.
If we have a set of projects under consideration, and we plot their means and their variances,
we will be able to use rules (i) and (ii) to eliminate some of the projects. Those, that we cannot
eliminate are said to lie in the efficient frontier. The ultimate choice between projects in the efficient
frontier depends on the tradeoff between risk and reward for the decision maker. A very conservative
decision maker will not be willing to take substantial additional risk to increase the expected return,
so he will tend to select low return, low variance projects in the efficient frontier. In contrast, a
more aggressive investor will be willing to incur substantial additional risk to increase the expected
return, so he will tend to select high return, high variance projects in the efficient frontier. If the
utility function of the investor is known, then the decision maker selects the project with the largest
utility function. Graphically, this is the project that intersects the largest iso-utility curve.
Recall that σ 2 = E(X − µ)2 , as such the variance is a measure of risk that penalizes deviations
from the mean value on both directions. The semi-variance is an alternative measure of risk that
penalizes deviations below a certain value h. The semi-variance Sh is defined to be
Sh = E[(X − h)− ]2 ,
where values below h are considered bad. Typical values of h are 0 and µ. Not surprisingly, the
knowledge of the semi-variance may help us reach a decision where the mean variance analysis does
not. For example selecting between Project 6 with mean 2 and variance 176, and Project 7 with
mean 28 and variance 356 in Chapter 11 is not possible by the rules of mean variance analysis.
However, once we realize that all of the variance of Project 7, is good variance, and measure risk
through the semi-variance we see that Project 7 dominates Project 6 under the mean semi-variance
rules. The mean semi-variance rules are similar to the mean variance rules except that the semi-
variance is used instead of the variance as a measure of risk. Notice that there is no theoretical
justification for these rules.
A third criterion, called the safety-first criterion favors the alternative with the smallest proba-
bility of loss, where loss is usually defined as a negative PV or as an IRR below MARR.
The above criteria often do not indicate a clear choice among competing alternatives. A concept
that is useful to decide among competing alternatives is called stochastic dominance. Several types
of stochastic dominance exist. The definition of first degree stochastic dominance is given next.
Given two random variables X and Y with probability functions F (·) and G(·), we say that
XDY (X dominates Y ) or F DG [F dominates G], if
Eu(X) ≥ Eu(Y )
for every non-decreasing function u(·) for which the expectation exists and if the inequality is strict
for at least one non-decreasing function u(·).
Suppose that X and Y represent the outcomes of two projects. Since decision makers act to
maximize expected utility it is clear X is preferred to Y, if Eu(X) ≥ Eu(Y ). Thus, to say that X
dominates Y is to say that all decision makers having non-decreasing utility functions will prefer X
to Y.
The above is a strong statement, because it essentially says that all decision makers (with nonde-
creasing utility functions) prefer X to Y. An example of first degree stochastic dominance is given by
Projects 2 and 3 in Chapter 11. Letting P V2 and P V3 denote the random present value of projects
IEOR4003: Industrial Economics page 47 Professor Guillermo Gallego
Notice that the above holds for all nondecreasing utility functions. So all investors with nondecreas-
ing utility functions will prefer Project 3 over Project 2. Notice that the mean-variance analysis was
not capable of deciding between Projects 2 and 3.
While the condition defining first degree stochastic dominance has an intuitive interpretation,
it is not always easy to verify it directly because it involves the set of all non-decreasing functions
u(·). Fortunately, checking XDY [F DG] can be done directly, just by looking at the cumulative
distribution functions. Indeed, a necessary and sufficient condition is that
Eu(X) ≥ Eu(Y )
for every non-decreasing concave function u(·) for which the expectations exists, and if the inequality
is strict for at least one non-decreasing concave function u(·). Again, while the definition of second
degree stochastic dominance is intuitively appealing–XD2Y if and only if all decision makers with
an increasing concave utility function prefer X to Y – it is difficult to check the condition directly.
However, an alternative definition states that we can establish XD2 Y or [F D2 G] by looking into
the areas under the cumulative distribution functions. Indeed, a necessary and sufficient condition
is that
x
x
F (s)ds ≤ G(s)ds ∀x,
−∞ −∞
with mean
J
E(R) = xj µj
j=1
and variance
J
V ar(R) = x2j σj2 + 2 xj xk ρjk σj σk .
j=1 j<k
where r is the minimal desired return, and B is the budget to be allocated. Most formulations take
advantage of the fact that both the objective and the constraints are homogeneous of degree 1, so
we can assume without loss of generality that B = 1. With B = 1 we are essentially asking how to
allocate $1 dollar among the securities. If we have B > 1 dollars to invest, then we just scale the
allocations.
2
As presented above the model requires J +3J 2 estimates of means, variances and covariances.
The amount of historical data that is required to accurately estimate these parameters is enormous.
Thus, the original formulation can only be used to allocate portfolios among a small set of securities
for which the parameters can be accurately estimated. More frequently, the portfolio model is used
to allocate money among classes of securities, rather than among securities themselves. For example,
the model may be used to decide how to allocate money among stocks, bonds, and money market
accounts.
A different version of the above model, called the index model, uses a market index to estimate
µj , σj2 , and ρjk . The index mode postulates that
Rj = Aj + Bj I + Cj , ∀j,
where Aj and Bj are constants, I is a random market index, and Cj is a random error. Thus, the
index model postulates that the return of security j is linearly related to the performance of the
market index I. Let AJ+1 , and QJ+1 denote the expectation and the variance of the random index
.
I. The random errors are assumed to have mean zero and variance Qj = V ar(Cj ) j = 1 . . . , J. Using
the above notation, we obtain
µj = E(Rj ) = Aj + Bj AJ+1 , j = 1, . . . , J,
IEOR4003: Industrial Economics page 49 Professor Guillermo Gallego
and variance 2
J
J
V ar(R) = x2j Qj + Bj xj QJ+1 .
j=1 j=1
Notice that the index model requires only 3J + 2 estimates. The index model is much easier to solve,
and it provides a reasonable approximation to the full covariance model. It is of course possible to
consider models with two rather than one index, e.g., gross national product, and the number of
house starts.
In selecting a portfolio of securities, one generally has the opportunity to invest part of one’s
funds in a risk free asset such as US Treasury securities, with the rest of the funds being invested in
some risky portfolio. Let Rf be the return of the risk free security. We assume that Rf is a known
constant. If we invest a fraction α of our wealth in the risk free security, and the fraction 1 − α in
portfolio with random return Rm , the combined security is:
Rc = αRf + (1 − α)Rm .
and variance
σc2 = V ar(Rc ) = (1 − α)2 σm
2
.
Thus, by selecting the fraction α, we can control the risk σc2 of the combined portfolio. For any
value of σc2 we find
σc
α=1− ,
σm
and for that α, we obtain the expected return
µm − Rf
µc = Rf + σc . (1)
σm
Given Rf , µm , and σm , the above equation for the expected return, µc , is linear function of the
associated with the risk level σc .
Since we want to obtain the largest possible return at any level of risk, it is clear that we would
select the risky portfolio m, that maximizes the slope
µm − Rf
.
σm
Such a portfolio is called the market portfolio, and the associated linear equation (1) is called the
Capital Market Line (CML).
Any risk averse decision maker will select the point where his iso-utility curve is tangent to the
CML. That means that all risk averse investors will allocate part of their money to the risk free
security and the other part to the market portfolio. If the iso-utility curve is tangent to the CML to
IEOR4003: Industrial Economics page 50 Professor Guillermo Gallego
the right of the market portfolio, the investor will be willing to borrow money at the risk free rate
Rf and invest it, together with his own money in the market portfolio. Thus portfolios to the right
of the market portfolio involve borrowing, while portfolios to the left of the market portfolio involve
lending.
From the capital market line, we see that
µc − Rf µm − Rf
=
σc σm ,
the above ratios are called reward to volatility ratios.
Virtually any individual can invest at the risk free rate. However, most investors cannot borrow
at the risk free rate. This requires a modification of the above to incorporate different rates for
borrowing and the separation of the risk return decision; the risky-investment decision does not hold
any longer over the entire mean-standard deviation space. However, it continues to hold over some
region of the mean-standard deviation space.
What happens if we combine a risky security, say Rj , with the market portfolio? We obtain a
new portfolio
Rp = αRj + (1 − α)Rm .
Then
µp = αµj + (1 − α)µm ,
and
σp2 = α2 σj2 + 2α(1 − α)σjm + (1 − α)2 σm
2
.
We see that
∂µp
= µj − µm ,
∂α
and that
2
∂σp σjm − σm
= ,
∂α σm
at α = 0. Consequently,
∂µp (µj − µm )σm µm − Rf
= 2
= .
∂σp σjm − σm σm
Solving for µj we obtain
µj = Rf + β(µm − Rf )
where
σjm
β= 2
.
σm
The above model, called the capital asset pricing model, can be used to price out securities by
comparing the actual to the predicted expected returns.
I like to write the CAPM as
µm − Rf µm − Rf
µj = Rf + ρj σj ≤ Rf + σj ,
σm σm
where ρj is the correlation between security j and Rm , i.e., ρj = σjm /(σj σm ).
IEOR4003: Industrial Economics page 51 Professor Guillermo Gallego
J
Ri = xj Rij .
j=1
The downside risk under scenario i is DRi = max(−Ri , 0) and the average downside risk is ADR =
n
i=1 pi DRi . The formulation calls for minimizing the ADR subject to a constraint on the return.
The down side risk, and the average down side risk, as currently defined, are piecewise linear func-
tions. To convert this into a linear function, we relax the above definition of DRi and make DRi
a decision variable satisfying the constraints DRi ≥ 0 and DRi + Ri ≥ 0. This relaxation results
in a linear program, and since we are minimizing the optimal solution will always have DRi = 0 if
Ri ≥ 0 and DRi = −Ri if Ri < 0.
IEOR4003: Industrial Economics page 52 Professor Guillermo Gallego
For many complex problems, for which analytic techniques are hard to apply, simulation provides
a convenient alternative. Simulation, provides an easy way to combine the necessary probabilistic
distributions -and in some cases the only possible way.
For example, if the Cash Flow in a given period is a function of a number of random variables such
as selling prices, size of the market, investment required, inflation rate, etc., so that it is difficult or
impossible to find the distribution of the Cash Flow, we may resort to simulation to answer questions
such as: What is the probability that the Cash Flow is greater than zero?
We will give only sufficient detail to present the major aspects of the method and how it can be
applied to investment analysis.
Monte Carlo Sampling. Suppose you are interested in the present value:
F1 F2
P V = F0 + + .
1 + i (1 + i)2
If we can express the random variables by probability distributions, we can simulate the actual
state of nature by sampling the state of each random variable from its specific distribution, and
then compute the value of PV. These computations are repeated until a sufficient number of PV
values is available to tabulate its distribution. Three questions arise: (i) How to sample values of
the random variables?, (ii) How many repetitions are required?, (iii) What happens if the variables
are correlated? We will address these questions later.
Using the Simulation Output. For a single project, we can construct estimates of
Pr(P V ≤ x),
Pr(P VA ≥ P VB ),
Example:
Suppose X has an exponential distribution with parameter λ. Then
P (X ≤ x) = 1 − e−λx ,
X = σΦ−1 (u) + µ,
where Φ(·) is the cumulative distribution of the standard normal random variable, and u is a value
drawn from a uniform (0, 1) random variable. Other ways of generating Normal, and other distrib-
utions, are discussed in pp 514.
Sampling Procedures for Dependent Random Variables Suppose that X and Y are two
random variables that are statistically dependent on each other. Dependency can be checked, for
example, by looking at a cross plot of (X, Y ), to observe if there are underlying trends. The problem
is to generate representative pairs of (X, Y ). This can be done, if we know the distribution of X, say
F (·), and for each realization X = x, we know the distribution of Y |X = x, say FY |X=x (·). Then,
(F −1 (U1 ), FY−1
|X=F −1 (U1 ) (U2 ))
is a pair of random variables with the appropriate distribution where U1 and U2 are uniform (0, 1)
random variables. Other methods of generating dependent random variables are discussed in section
12.5 of the book.
Output Data Analysis How close is an estimator to its true measure? Suppose Y is a random
variable with mean µ, then
n
1
µ = lim Yj w.p. 1
n→∞ n
j=1
where Yj , j = 1, . . . n, are n replications of Y. When to stop? Well, suppose that we repeat the
experiment m times, the k th replication yielding
n
1
Ȳk = Ykj .
n j=1
and
m
1
S2 = (Ȳk − Ȳ )2 ,
m−1
k=1
IEOR4003: Industrial Economics page 54 Professor Guillermo Gallego
and σ̂ = √S . Then a 100(1 − α)% confidence interval for µ based on the t− distribution is given by:
m
Homework: Airline companies typically open a flight t units of time before the schedule depar-
ture time; t is typically one year. Assume that the plane scheduled for the flight has n seats, and
that seats are initially offered at a promotional fare of p1 dollars. After some time the promotional
fare is phased out and the remaining seats are priced at the “regular” fare p2 > p1 .
Assume that at fare pi , the demand over s units of time, say Ni (s), is a Poisson random variable
with parameter λi s, i = 1, 2, with p1 λ1 > p2 λ2 , and λ1 t > n > λ2 t. Our objective is to maximize
expected revenue by selecting the best time to switch from the promotional to the regular fare. Two
policies, to be described below are being considered. Your job is to simulate the two policies and to
use the outcome of the simulation to decide which policy is better.
Policy 1. Is to keep the promotional fare until the first time s ∈ [0, t] such that
N1 (s) + λ2 (t − s) ≥ n.
In words policy 1 says to phase out the promotional fare as soon as we expect to fill the remaining
seats at the regular fare. To see this, note that at time s, there are n − N1 (s) remaining seats, and
over the remaining t − s units of time, the expected demand at the regular fare is λ2 (t − s).
Policy 2. After the sale, say at time s, of a seat at the promotional fare, we decide to keep the
promotional fare until the next seat is sold if
t := t + E, R := R + p, N1 := N1 + 1.
(iii) If the phase out condition for the low fare has not been met go to step(i),
(iv) Set
λ = λ2 , p = p2 .
(v) At time t generate an exponential random variable by drawing
1
E = − ln(W )
λ
where W is a uniform (0, 1) random variable.
(vi) If t + E > T, or N2 = n − N1 stop. Else update the variables by
t := t + E, R := R + p, N2 := N2 + 1,
and go to (v).
At the end of the simulation, N1 is the number of seats sold at the promotional fare, N2 is the
number of seats sold at the regular fare, and R = p1 N1 +p2 N2 is the total revenue. For both policies,
you should report R, N1 , and N2 .
IEOR4003: Industrial Economics page 55 Professor Guillermo Gallego
Of course, if our objective is to maximize the expected monetary value we simply select a to maximize
r(a). So
EMV = max r(a).
a∈A
If we had perfect information, that is, if we knew w ∈ Ω before selecting a ∈ A, we would clearly act
to select a to maximize r(a, w) for every w ∈ Ω. Thus, the expected monetary value under perfect
information, EMVPI, is
EMVPI = p(w) max r(a, w).
a∈A
w∈Ω
The EMVPI is an upper bound on EMV, and the difference is called the maximal value of additional
information MVAI. Thus,
MVAI = p(w) max r(a, w) − max r(a, w)p(w).
a∈A a∈A
w∈Ω w∈Ω
Typically decision makers collapse the states of nature into three broad states: poor (p), moderate
(m), and good (g). Thus r(a, p) is the reward obtain by selection action a if the state of nature is p.
Suppose that this is the case, and that a consultant is hired to do a study. At the end of the study, the
consultant will predict favorable (f ), inconclusive (i), or unfavorable (u) conditions. In order to asses
the value of the study, we need to know how good the consultant is in predicting the states of nature.
Suppose that from past history you know the conditional probabilities p(f |g), p(f |m), . . . , p(u|p).
That is for every state of nature you know the probability that the consultant will report f, i, or u.
By Bayes theorem, we can compute the conditional probabilities p(g|f ), p(g|i), p(g|u), . . . , p(p|u), as
well as the probabilities p(f ), p(i), and p(p). For example,
p(f |g)p(g)
p(g|f ) = .
p(f |g)p(g) + p(f |m)p(m) + p(f |p)p(p)
Given these conditional probabilities, we can compute the conditional rewards associated with
each action a ∈ A. Thus
The rewards r(a|i), and r(a|u) can be similarly computed. Of course, for each possible outcome of
the study, we select the action a ∈ A, that maximizes r(a|·). To obtain the expected monetary value
taking the study, EMVS, we compute
Where p(f ), p(i), and p(p) are the probabilities that the study predicts f, i, and p. The value of the
study is then
EMVS − EMV.
See the Retail Convenience Store Problem, page 552 of textbook.
IEOR4003: Industrial Economics page 56 Professor Guillermo Gallego
where x+ = max(x, 0). Of course, the optimal decision is to wait only when the expected value is
larger than investing now. Consequently, the optimal expected net present value is given by
ENPV∗ = max(ENPV0 , ENPVw ).
To illustrate, let us continue with our numerical example with Po = 200, q = 0.5, and i = 10%.
For values of I ∈ ($1, 100, $3, 300] we have
I
ENPVw = 0.5 3000 −
1.1
I
= 1500 − .
2.2
IEOR4003: Industrial Economics page 57 Professor Guillermo Gallego
Comparing to 2200 − I we see that ENPVw > ENPV for values of I ∈ ($1, 283.33, $3, 300] with the
opposite inequality holding for ($1, 000, $1, 283.33]. This results in ENPV∗ = ENPV0 = 2200 − I on
I ≤ $1, 283.33 and ENPV∗ = ENPVw = 1500 − I/2.2 on I ∈ ($1, 283.33, $3, 300].
Thus, for example, if I = $1, 600 we see that ENPV0 = $600 > 0, and yet it is better to wait,
since ENPVw = $772.73. Here, we are better off postponing the investment even though the present
value is positive. On the other hand, suppose that I = $2, 500. Then, ENPV0 = −$300 so the
present value is negative, yet ENPVw = $363.64 > 0. Here, if we were force to make the decision
now we would reject the project. Yet, the expected present value of the project is positive when we
have the option of waiting!
In summary, we would invest now if I < $1, 283.33 and never invest if I > $3, 300. For interme-
diate values it is best to wait one year and investment if the price goes up.
A similar analysis can be carried out to determine values of Po for which it is best to invest now,
wait, or not invest at all. Indeed, if q = 0.5 then
ENPV0 = −I + P0 (1 + i)/i,
so ENPV0 ≥ 0 whenver
i
P0 ≥ I .
1+i
Let us now consider the option of waiting. If
1 i i
I ≤ P0 ≤ 2I ,
1.5 1 + i 1+i
then we would only invest if the price goes up, so
1.5P0 I
ENPVw = 0.5 − .
i 1+i
Again,
ENPV∗ = max(ENPV0 , ENPVw ).
Here ENPV0 ≤ ENPVw on
1 + 2i i
P0 ≤ 2I .
1 + i 1 + 4i
Thus if i = 10% and I = 1600 waiting is better than investing now if
97 ≤ P0 ≤ 249.
P0 > 249.
P0 < 97.